KR102642718B1 - 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 - Google Patents
합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 Download PDFInfo
- Publication number
- KR102642718B1 KR102642718B1 KR1020217037656A KR20217037656A KR102642718B1 KR 102642718 B1 KR102642718 B1 KR 102642718B1 KR 1020217037656 A KR1020217037656 A KR 1020217037656A KR 20217037656 A KR20217037656 A KR 20217037656A KR 102642718 B1 KR102642718 B1 KR 102642718B1
- Authority
- KR
- South Korea
- Prior art keywords
- gly
- protein
- thr
- leu
- glu
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 390
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 377
- 238000000034 method Methods 0.000 title claims abstract description 208
- 230000001965 increasing effect Effects 0.000 title description 35
- 238000013528 artificial neural network Methods 0.000 claims abstract description 207
- 230000035772 mutation Effects 0.000 claims abstract description 113
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract description 74
- 125000000539 amino acid group Chemical group 0.000 claims abstract description 50
- 238000012549 training Methods 0.000 claims abstract description 48
- 108091005948 blue fluorescent proteins Proteins 0.000 claims abstract description 27
- 239000013078 crystal Substances 0.000 claims abstract description 13
- 150000001413 amino acids Chemical class 0.000 claims description 118
- 239000011159 matrix material Substances 0.000 claims description 29
- 102220477329 Protein XRP2_S28A_mutation Human genes 0.000 claims description 21
- 239000000126 substance Substances 0.000 claims description 20
- 102220244906 rs1554146456 Human genes 0.000 claims description 18
- 230000006872 improvement Effects 0.000 claims description 15
- 230000000087 stabilizing effect Effects 0.000 claims description 15
- 238000005070 sampling Methods 0.000 claims description 13
- 102220349284 c.287A>T Human genes 0.000 claims description 11
- 108091022912 Mannose-6-Phosphate Isomerase Proteins 0.000 claims description 8
- 102000048193 Mannose-6-phosphate isomerases Human genes 0.000 claims description 8
- 125000004435 hydrogen atom Chemical group [H]* 0.000 claims description 6
- 230000036961 partial effect Effects 0.000 claims description 5
- 229910052739 hydrogen Inorganic materials 0.000 claims description 2
- 239000001257 hydrogen Substances 0.000 claims description 2
- 238000013507 mapping Methods 0.000 claims 1
- 230000001976 improved effect Effects 0.000 abstract description 24
- 235000018102 proteins Nutrition 0.000 description 318
- 235000001014 amino acid Nutrition 0.000 description 121
- 229940024606 amino acid Drugs 0.000 description 117
- 230000006870 function Effects 0.000 description 85
- 108090000765 processed proteins & peptides Proteins 0.000 description 80
- 108091006047 fluorescent proteins Proteins 0.000 description 77
- 102000034287 fluorescent proteins Human genes 0.000 description 77
- 102000004196 processed proteins & peptides Human genes 0.000 description 65
- 229920001184 polypeptide Polymers 0.000 description 53
- 239000013598 vector Substances 0.000 description 53
- 210000004027 cell Anatomy 0.000 description 51
- 102000040430 polynucleotide Human genes 0.000 description 51
- 108091033319 polynucleotide Proteins 0.000 description 51
- 239000002157 polynucleotide Substances 0.000 description 51
- 239000012634 fragment Substances 0.000 description 43
- 238000011176 pooling Methods 0.000 description 43
- 230000014509 gene expression Effects 0.000 description 39
- 210000002569 neuron Anatomy 0.000 description 36
- 238000013527 convolutional neural network Methods 0.000 description 35
- 230000004913 activation Effects 0.000 description 32
- 150000007523 nucleic acids Chemical class 0.000 description 31
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 30
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 29
- 108010009298 lysylglutamic acid Proteins 0.000 description 29
- 239000002773 nucleotide Substances 0.000 description 29
- 125000003729 nucleotide group Chemical group 0.000 description 29
- 239000000523 sample Substances 0.000 description 29
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 28
- GJFYPBDMUGGLFR-NKWVEPMBSA-N Asn-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC(=O)N)N)C(=O)O GJFYPBDMUGGLFR-NKWVEPMBSA-N 0.000 description 28
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 28
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 28
- RKNIUWSZIAUEPK-PBCZWWQYSA-N Asp-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N)O RKNIUWSZIAUEPK-PBCZWWQYSA-N 0.000 description 28
- UICOTGULOUGGLC-NUMRIWBASA-N Gln-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N)O UICOTGULOUGGLC-NUMRIWBASA-N 0.000 description 28
- GNMQDOGFWYWPNM-LAEOZQHASA-N Gln-Gly-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)CNC(=O)[C@@H](N)CCC(N)=O)C(O)=O GNMQDOGFWYWPNM-LAEOZQHASA-N 0.000 description 28
- XFHMVFKCQSHLKW-HJGDQZAQSA-N Gln-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XFHMVFKCQSHLKW-HJGDQZAQSA-N 0.000 description 28
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 28
- LJLPOZGRPLORTF-CIUDSAMLSA-N Glu-Asn-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LJLPOZGRPLORTF-CIUDSAMLSA-N 0.000 description 28
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 28
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 28
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 28
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 28
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 28
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 28
- WNZOCXUOGVYYBJ-CDMKHQONSA-N Gly-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)CN)O WNZOCXUOGVYYBJ-CDMKHQONSA-N 0.000 description 28
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 28
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 28
- POJJAZJHBGXEGM-YUMQZZPRSA-N Gly-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN POJJAZJHBGXEGM-YUMQZZPRSA-N 0.000 description 28
- YYOCMTFVGKDNQP-IHRRRGAJSA-N His-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N YYOCMTFVGKDNQP-IHRRRGAJSA-N 0.000 description 28
- ZFDKSLBEWYCOCS-BZSNNMDCSA-N His-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CC=CC=C1 ZFDKSLBEWYCOCS-BZSNNMDCSA-N 0.000 description 28
- FXJLRZFMKGHYJP-CFMVVWHZSA-N Ile-Tyr-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N FXJLRZFMKGHYJP-CFMVVWHZSA-N 0.000 description 28
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 28
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 28
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 28
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 28
- UFPLDOKWDNTTRP-ULQDDVLXSA-N Leu-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CC=C(O)C=C1 UFPLDOKWDNTTRP-ULQDDVLXSA-N 0.000 description 28
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 28
- NNCDAORZCMPZPX-GUBZILKMSA-N Lys-Gln-Ser Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N NNCDAORZCMPZPX-GUBZILKMSA-N 0.000 description 28
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 28
- MIROMRNASYKZNL-ULQDDVLXSA-N Lys-Pro-Tyr Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MIROMRNASYKZNL-ULQDDVLXSA-N 0.000 description 28
- VSJAPSMRFYUOKS-IUCAKERBSA-N Met-Pro-Gly Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O VSJAPSMRFYUOKS-IUCAKERBSA-N 0.000 description 28
- IUVYJBMTHARMIP-PCBIJLKTSA-N Phe-Asp-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O IUVYJBMTHARMIP-PCBIJLKTSA-N 0.000 description 28
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 28
- FYQSMXKJYTZYRP-DCAQKATOSA-N Pro-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FYQSMXKJYTZYRP-DCAQKATOSA-N 0.000 description 28
- HXOLCSYHGRNXJJ-IHRRRGAJSA-N Pro-Asp-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HXOLCSYHGRNXJJ-IHRRRGAJSA-N 0.000 description 28
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 28
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 28
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 28
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 28
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 28
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 28
- HSQXHRIRJSFDOH-URLPEUOOSA-N Thr-Phe-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HSQXHRIRJSFDOH-URLPEUOOSA-N 0.000 description 28
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 28
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 28
- PKUJMYZNJMRHEZ-XIRDDKMYSA-N Trp-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PKUJMYZNJMRHEZ-XIRDDKMYSA-N 0.000 description 28
- GQVZBMROTPEPIF-SRVKXCTJSA-N Tyr-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GQVZBMROTPEPIF-SRVKXCTJSA-N 0.000 description 28
- UUJHRSTVQCFDPA-UFYCRDLUSA-N Tyr-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=C(O)C=C1 UUJHRSTVQCFDPA-UFYCRDLUSA-N 0.000 description 28
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 28
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 28
- OFQGGTGZTOTLGH-NHCYSSNCSA-N Val-Met-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N OFQGGTGZTOTLGH-NHCYSSNCSA-N 0.000 description 28
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 28
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 28
- 108010081404 acein-2 Proteins 0.000 description 28
- 108010011559 alanylphenylalanine Proteins 0.000 description 28
- 108010038633 aspartylglutamate Proteins 0.000 description 28
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 28
- 108010028295 histidylhistidine Proteins 0.000 description 28
- 108010057821 leucylproline Proteins 0.000 description 28
- 108010017391 lysylvaline Proteins 0.000 description 28
- 108010056582 methionylglutamic acid Proteins 0.000 description 28
- 102000039446 nucleic acids Human genes 0.000 description 28
- 108020004707 nucleic acids Proteins 0.000 description 28
- GFBLJMHGHAXGNY-ZLUOBGJFSA-N Ala-Asn-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GFBLJMHGHAXGNY-ZLUOBGJFSA-N 0.000 description 27
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 27
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 27
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 27
- NJNRBRKHOWSGMN-SRVKXCTJSA-N Lys-Leu-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O NJNRBRKHOWSGMN-SRVKXCTJSA-N 0.000 description 27
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 27
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 27
- 108010050848 glycylleucine Proteins 0.000 description 27
- 108010027345 wheylin-1 peptide Proteins 0.000 description 27
- WESHVRNMNFMVBE-FXQIFTODSA-N Arg-Asn-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N WESHVRNMNFMVBE-FXQIFTODSA-N 0.000 description 26
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 26
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 25
- HCOQNGIHSXICCB-IHRRRGAJSA-N Asp-Tyr-Arg Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)O HCOQNGIHSXICCB-IHRRRGAJSA-N 0.000 description 25
- 239000000370 acceptor Substances 0.000 description 25
- CUVBTVWFVIIDOC-YEPSODPASA-N Gly-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)CN CUVBTVWFVIIDOC-YEPSODPASA-N 0.000 description 24
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 23
- 230000000694 effects Effects 0.000 description 23
- 238000010586 diagram Methods 0.000 description 22
- WTXCNOPZMQRTNN-BWBBJGPYSA-N Cys-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)O WTXCNOPZMQRTNN-BWBBJGPYSA-N 0.000 description 21
- 238000006467 substitution reaction Methods 0.000 description 21
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 20
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 20
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 19
- 230000008859 change Effects 0.000 description 18
- 230000005284 excitation Effects 0.000 description 18
- 108020001507 fusion proteins Proteins 0.000 description 18
- 102000037865 fusion proteins Human genes 0.000 description 18
- 230000008569 process Effects 0.000 description 18
- 108091028043 Nucleic acid sequence Proteins 0.000 description 17
- 238000012545 processing Methods 0.000 description 17
- 238000002866 fluorescence resonance energy transfer Methods 0.000 description 16
- 239000013604 expression vector Substances 0.000 description 15
- 239000000203 mixture Substances 0.000 description 15
- 102000004190 Enzymes Human genes 0.000 description 13
- 108090000790 Enzymes Proteins 0.000 description 13
- 239000003623 enhancer Substances 0.000 description 13
- 230000014616 translation Effects 0.000 description 13
- 229940088598 enzyme Drugs 0.000 description 12
- 238000004458 analytical method Methods 0.000 description 11
- 238000012886 linear function Methods 0.000 description 11
- 238000013442 quality metrics Methods 0.000 description 11
- 230000001413 cellular effect Effects 0.000 description 10
- 239000003153 chemical reaction reagent Substances 0.000 description 10
- 238000012360 testing method Methods 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- 238000013519 translation Methods 0.000 description 10
- 108020004705 Codon Proteins 0.000 description 9
- 230000027455 binding Effects 0.000 description 9
- 241000588724 Escherichia coli Species 0.000 description 8
- 230000004048 modification Effects 0.000 description 8
- 238000012986 modification Methods 0.000 description 8
- -1 peptides Chemical class 0.000 description 8
- 239000000816 peptidomimetic Substances 0.000 description 8
- 230000001105 regulatory effect Effects 0.000 description 8
- 241000894007 species Species 0.000 description 8
- 238000013518 transcription Methods 0.000 description 8
- 230000035897 transcription Effects 0.000 description 8
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 7
- 235000005940 Centaurea cyanus Nutrition 0.000 description 7
- NRVQLLDIJJEIIZ-VZFHVOOUSA-N Cys-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N)O NRVQLLDIJJEIIZ-VZFHVOOUSA-N 0.000 description 7
- 241000333818 Northiella haematogaster Species 0.000 description 7
- 238000007792 addition Methods 0.000 description 7
- 108010005233 alanylglutamic acid Proteins 0.000 description 7
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 6
- 108020004566 Transfer RNA Proteins 0.000 description 6
- 238000003491 array Methods 0.000 description 6
- 125000004429 atom Chemical group 0.000 description 6
- 230000008901 benefit Effects 0.000 description 6
- 238000004422 calculation algorithm Methods 0.000 description 6
- 238000006243 chemical reaction Methods 0.000 description 6
- 230000000368 destabilizing effect Effects 0.000 description 6
- 238000009826 distribution Methods 0.000 description 6
- 230000001939 inductive effect Effects 0.000 description 6
- 230000003993 interaction Effects 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- VCDNHBNNPCDBKV-DLOVCJGASA-N His-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VCDNHBNNPCDBKV-DLOVCJGASA-N 0.000 description 5
- 108700008625 Reporter Genes Proteins 0.000 description 5
- 108010070944 alanylhistidine Proteins 0.000 description 5
- 238000013459 approach Methods 0.000 description 5
- 238000003776 cleavage reaction Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 5
- 235000018417 cysteine Nutrition 0.000 description 5
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 5
- 125000005647 linker group Chemical group 0.000 description 5
- 239000003550 marker Substances 0.000 description 5
- 239000012528 membrane Substances 0.000 description 5
- 238000000746 purification Methods 0.000 description 5
- 230000007017 scission Effects 0.000 description 5
- 239000000758 substrate Substances 0.000 description 5
- 102220553751 Cyclic GMP-AMP synthase_T18E_mutation Human genes 0.000 description 4
- 108020004414 DNA Proteins 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- 108020004511 Recombinant DNA Proteins 0.000 description 4
- 241000700605 Viruses Species 0.000 description 4
- 229960000723 ampicillin Drugs 0.000 description 4
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 4
- 239000000427 antigen Substances 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 238000003556 assay Methods 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- 108010002833 beta-lactamase TEM-1 Proteins 0.000 description 4
- 238000004364 calculation method Methods 0.000 description 4
- 238000000295 emission spectrum Methods 0.000 description 4
- 210000003527 eukaryotic cell Anatomy 0.000 description 4
- 238000000695 excitation spectrum Methods 0.000 description 4
- 230000004927 fusion Effects 0.000 description 4
- 230000002068 genetic effect Effects 0.000 description 4
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 4
- 238000003780 insertion Methods 0.000 description 4
- 230000037431 insertion Effects 0.000 description 4
- 230000004807 localization Effects 0.000 description 4
- 238000010801 machine learning Methods 0.000 description 4
- 239000000463 material Substances 0.000 description 4
- 229920000642 polymer Polymers 0.000 description 4
- 238000003752 polymerase chain reaction Methods 0.000 description 4
- 230000002829 reductive effect Effects 0.000 description 4
- 238000003860 storage Methods 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 239000013603 viral vector Substances 0.000 description 4
- 239000004475 Arginine Substances 0.000 description 3
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 3
- 108060003951 Immunoglobulin Proteins 0.000 description 3
- 108700005091 Immunoglobulin Genes Proteins 0.000 description 3
- QIVBCDIJIAJPQS-VIFPVBQESA-N L-tryptophane Chemical compound C1=CC=C2C(C[C@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-VIFPVBQESA-N 0.000 description 3
- 108091060545 Nonsense suppressor Proteins 0.000 description 3
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 3
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 3
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 3
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 3
- 108700029229 Transcriptional Regulatory Elements Proteins 0.000 description 3
- QIVBCDIJIAJPQS-UHFFFAOYSA-N Tryptophan Natural products C1=CC=C2C(CC(N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-UHFFFAOYSA-N 0.000 description 3
- 101710100170 Unknown protein Proteins 0.000 description 3
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 3
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 3
- 230000000052 comparative effect Effects 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 150000001875 compounds Chemical class 0.000 description 3
- 238000012217 deletion Methods 0.000 description 3
- 230000037430 deletion Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 102000018358 immunoglobulin Human genes 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 230000003834 intracellular effect Effects 0.000 description 3
- 238000012804 iterative process Methods 0.000 description 3
- 230000000670 limiting effect Effects 0.000 description 3
- 210000004962 mammalian cell Anatomy 0.000 description 3
- 230000035800 maturation Effects 0.000 description 3
- 238000005259 measurement Methods 0.000 description 3
- 108020004999 messenger RNA Proteins 0.000 description 3
- 230000004962 physiological condition Effects 0.000 description 3
- 230000004481 post-translational protein modification Effects 0.000 description 3
- 230000006337 proteolytic cleavage Effects 0.000 description 3
- 238000003259 recombinant expression Methods 0.000 description 3
- 108010054624 red fluorescent protein Proteins 0.000 description 3
- 238000002165 resonance energy transfer Methods 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 230000000717 retained effect Effects 0.000 description 3
- 230000035945 sensitivity Effects 0.000 description 3
- 238000004088 simulation Methods 0.000 description 3
- 239000002904 solvent Substances 0.000 description 3
- 230000003595 spectral effect Effects 0.000 description 3
- 238000003786 synthesis reaction Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 241000701161 unidentified adenovirus Species 0.000 description 3
- 238000011144 upstream manufacturing Methods 0.000 description 3
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 description 2
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 2
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 2
- PWAIZUBWHRHYKS-MELADBBJSA-N Asp-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC(=O)O)N)C(=O)O PWAIZUBWHRHYKS-MELADBBJSA-N 0.000 description 2
- IJGRMHOSHXDMSA-UHFFFAOYSA-N Atomic nitrogen Chemical compound N#N IJGRMHOSHXDMSA-UHFFFAOYSA-N 0.000 description 2
- 102100026189 Beta-galactosidase Human genes 0.000 description 2
- 108010035563 Chloramphenicol O-acetyltransferase Proteins 0.000 description 2
- 241000701022 Cytomegalovirus Species 0.000 description 2
- 108090000204 Dipeptidase 1 Proteins 0.000 description 2
- 108090000331 Firefly luciferases Proteins 0.000 description 2
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 2
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 2
- 102000005720 Glutathione transferase Human genes 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- MREVELMMFOLESM-HOCLYGCPSA-N Gly-Trp-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O MREVELMMFOLESM-HOCLYGCPSA-N 0.000 description 2
- 108010043121 Green Fluorescent Proteins Proteins 0.000 description 2
- ZRALSGWEFCBTJO-UHFFFAOYSA-N Guanidine Chemical compound NC(N)=N ZRALSGWEFCBTJO-UHFFFAOYSA-N 0.000 description 2
- 241000725303 Human immunodeficiency virus Species 0.000 description 2
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 2
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 2
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 2
- 241000713666 Lentivirus Species 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 241000829100 Macaca mulatta polyomavirus 1 Species 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- 102000035195 Peptidases Human genes 0.000 description 2
- RJKFOVLPORLFTN-LEKSSAKUSA-N Progesterone Chemical compound C1CC2=CC(=O)CC[C@]2(C)[C@@H]2[C@@H]1[C@@H]1CC[C@H](C(=O)C)[C@@]1(C)CC2 RJKFOVLPORLFTN-LEKSSAKUSA-N 0.000 description 2
- 108010076504 Protein Sorting Signals Proteins 0.000 description 2
- 102000018120 Recombinases Human genes 0.000 description 2
- 108010091086 Recombinases Proteins 0.000 description 2
- ZUXQFMVPAYGPFJ-JXUBOQSCSA-N Thr-Ala-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN ZUXQFMVPAYGPFJ-JXUBOQSCSA-N 0.000 description 2
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 2
- 239000004473 Threonine Substances 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 108091023040 Transcription factor Proteins 0.000 description 2
- 102000040945 Transcription factor Human genes 0.000 description 2
- DRTQHJPVMGBUCF-XVFCMESISA-N Uridine Chemical compound O[C@@H]1[C@H](O)[C@@H](CO)O[C@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-XVFCMESISA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 150000001408 amides Chemical class 0.000 description 2
- 239000012491 analyte Substances 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 2
- 230000004888 barrier function Effects 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- 102000006635 beta-lactamase Human genes 0.000 description 2
- 230000003115 biocidal effect Effects 0.000 description 2
- 230000008033 biological extinction Effects 0.000 description 2
- 230000015572 biosynthetic process Effects 0.000 description 2
- 239000000872 buffer Substances 0.000 description 2
- 229910052799 carbon Inorganic materials 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 210000003763 chloroplast Anatomy 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 238000000205 computational method Methods 0.000 description 2
- 238000007796 conventional method Methods 0.000 description 2
- CVSVTCORWBXHQV-UHFFFAOYSA-N creatine Chemical compound NC(=[NH2+])N(C)CC([O-])=O CVSVTCORWBXHQV-UHFFFAOYSA-N 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 239000000539 dimer Substances 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 230000005281 excited state Effects 0.000 description 2
- 230000005714 functional activity Effects 0.000 description 2
- 230000012010 growth Effects 0.000 description 2
- 230000002209 hydrophobic effect Effects 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 229910052747 lanthanoid Inorganic materials 0.000 description 2
- 150000002602 lanthanoids Chemical class 0.000 description 2
- 239000003446 ligand Substances 0.000 description 2
- 230000033001 locomotion Effects 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 230000004001 molecular interaction Effects 0.000 description 2
- 231100000350 mutagenesis Toxicity 0.000 description 2
- 238000002703 mutagenesis Methods 0.000 description 2
- 238000005457 optimization Methods 0.000 description 2
- 210000003463 organelle Anatomy 0.000 description 2
- 230000007030 peptide scission Effects 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 230000029983 protein stabilization Effects 0.000 description 2
- 230000004850 protein–protein interaction Effects 0.000 description 2
- 230000017854 proteolysis Effects 0.000 description 2
- 238000006862 quantum yield reaction Methods 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 102220188157 rs149825190 Human genes 0.000 description 2
- 102200065429 rs387906783 Human genes 0.000 description 2
- 102220283250 rs772349818 Human genes 0.000 description 2
- 102200109703 rs869312739 Human genes 0.000 description 2
- 102220172123 rs886048669 Human genes 0.000 description 2
- 230000009870 specific binding Effects 0.000 description 2
- 238000002945 steepest descent method Methods 0.000 description 2
- 230000002103 transcriptional effect Effects 0.000 description 2
- 238000001890 transfection Methods 0.000 description 2
- 238000000844 transformation Methods 0.000 description 2
- 230000001131 transforming effect Effects 0.000 description 2
- 229910052723 transition metal Inorganic materials 0.000 description 2
- 150000003624 transition metals Chemical class 0.000 description 2
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 description 2
- 241001430294 unidentified retrovirus Species 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 238000011179 visual inspection Methods 0.000 description 2
- 210000005253 yeast cell Anatomy 0.000 description 2
- SGKRLCUYIXIAHR-AKNGSSGZSA-N (4s,4ar,5s,5ar,6r,12ar)-4-(dimethylamino)-1,5,10,11,12a-pentahydroxy-6-methyl-3,12-dioxo-4a,5,5a,6-tetrahydro-4h-tetracene-2-carboxamide Chemical compound C1=CC=C2[C@H](C)[C@@H]([C@H](O)[C@@H]3[C@](C(O)=C(C(N)=O)C(=O)[C@H]3N(C)C)(O)C3=O)C3=C(O)C2=C1O SGKRLCUYIXIAHR-AKNGSSGZSA-N 0.000 description 1
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 1
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 1
- ZOOGRGPOEVQQDX-UUOKFMHZSA-N 3',5'-cyclic GMP Chemical compound C([C@H]1O2)OP(O)(=O)O[C@H]1[C@@H](O)[C@@H]2N1C(N=C(NC2=O)N)=C2N=C1 ZOOGRGPOEVQQDX-UUOKFMHZSA-N 0.000 description 1
- 102000007469 Actins Human genes 0.000 description 1
- 108010085238 Actins Proteins 0.000 description 1
- 108010000239 Aequorin Proteins 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 102000052866 Amino Acyl-tRNA Synthetases Human genes 0.000 description 1
- 108700028939 Amino Acyl-tRNA Synthetases Proteins 0.000 description 1
- VBFJESQBIWCWRL-DCAQKATOSA-N Arg-Ala-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCNC(N)=N VBFJESQBIWCWRL-DCAQKATOSA-N 0.000 description 1
- VRZDJJWOFXMFRO-ZFWWWQNUSA-N Arg-Gly-Trp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O VRZDJJWOFXMFRO-ZFWWWQNUSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- LIJXJYGRSRWLCJ-IHRRRGAJSA-N Asp-Phe-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LIJXJYGRSRWLCJ-IHRRRGAJSA-N 0.000 description 1
- WOPJVEMFXYHZEE-SRVKXCTJSA-N Asp-Phe-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O WOPJVEMFXYHZEE-SRVKXCTJSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 1
- 241000714230 Avian leukemia virus Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- 241000894006 Bacteria Species 0.000 description 1
- 108090000363 Bacterial Luciferases Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000222122 Candida albicans Species 0.000 description 1
- 241000282465 Canis Species 0.000 description 1
- 235000014653 Carica parviflora Nutrition 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 241000243321 Cnidaria Species 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- UDMBCSSLTHHNCD-UHFFFAOYSA-N Coenzym Q(11) Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(COP(O)(O)=O)C(O)C1O UDMBCSSLTHHNCD-UHFFFAOYSA-N 0.000 description 1
- 102000000634 Cytochrome c oxidase subunit IV Human genes 0.000 description 1
- 108090000365 Cytochrome-c oxidases Proteins 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 241000702421 Dependoparvovirus Species 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- SHIBSTMRCDJXLN-UHFFFAOYSA-N Digoxigenin Natural products C1CC(C2C(C3(C)CCC(O)CC3CC2)CC2O)(O)C2(C)C1C1=CC(=O)OC1 SHIBSTMRCDJXLN-UHFFFAOYSA-N 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 101000925646 Enterobacteria phage T4 Endolysin Proteins 0.000 description 1
- YQYJSBFKSSDGFO-UHFFFAOYSA-N Epihygromycin Natural products OC1C(O)C(C(=O)C)OC1OC(C(=C1)O)=CC=C1C=C(C)C(=O)NC1C(O)C(O)C2OCOC2C1O YQYJSBFKSSDGFO-UHFFFAOYSA-N 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108091006027 G proteins Proteins 0.000 description 1
- 102000030782 GTP binding Human genes 0.000 description 1
- 108091000058 GTP-Binding Proteins 0.000 description 1
- 102000030902 Galactosyltransferase Human genes 0.000 description 1
- 108060003306 Galactosyltransferase Proteins 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- JSNNHGHYGYMVCK-XVKPBYJWSA-N Gly-Glu-Val Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O JSNNHGHYGYMVCK-XVKPBYJWSA-N 0.000 description 1
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 1
- MVORZMQFXBLMHM-QWRGUYRKSA-N Gly-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 MVORZMQFXBLMHM-QWRGUYRKSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102000004144 Green Fluorescent Proteins Human genes 0.000 description 1
- 102000009465 Growth Factor Receptors Human genes 0.000 description 1
- 108010009202 Growth Factor Receptors Proteins 0.000 description 1
- 102000001554 Hemoglobins Human genes 0.000 description 1
- 108010054147 Hemoglobins Proteins 0.000 description 1
- 241000238631 Hexapoda Species 0.000 description 1
- 241000282412 Homo Species 0.000 description 1
- 101001012157 Homo sapiens Receptor tyrosine-protein kinase erbB-2 Proteins 0.000 description 1
- 241000701044 Human gammaherpesvirus 4 Species 0.000 description 1
- 108010067060 Immunoglobulin Variable Region Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 1
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 1
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 1
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 1
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 1
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 1
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 1
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- 239000005089 Luciferase Substances 0.000 description 1
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 108010063312 Metalloproteins Proteins 0.000 description 1
- 102000010750 Metalloproteins Human genes 0.000 description 1
- 241001465754 Metazoa Species 0.000 description 1
- 241000713333 Mouse mammary tumor virus Species 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 108060008487 Myosin Proteins 0.000 description 1
- CHJJGSNFBQVOTG-UHFFFAOYSA-N N-methyl-guanidine Natural products CNC(N)=N CHJJGSNFBQVOTG-UHFFFAOYSA-N 0.000 description 1
- 102000045595 Phosphoprotein Phosphatases Human genes 0.000 description 1
- 108700019535 Phosphoprotein Phosphatases Proteins 0.000 description 1
- OAICVXFJPJFONN-UHFFFAOYSA-N Phosphorus Chemical compound [P] OAICVXFJPJFONN-UHFFFAOYSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 102100030086 Receptor tyrosine-protein kinase erbB-2 Human genes 0.000 description 1
- 108010052090 Renilla Luciferases Proteins 0.000 description 1
- 108091028664 Ribonucleotide Proteins 0.000 description 1
- 241000714474 Rous sarcoma virus Species 0.000 description 1
- KJTLSVCANCCWHF-UHFFFAOYSA-N Ruthenium Chemical compound [Ru] KJTLSVCANCCWHF-UHFFFAOYSA-N 0.000 description 1
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102220593902 Serine/threonine-protein kinase PLK2_S14T_mutation Human genes 0.000 description 1
- 108091081024 Start codon Proteins 0.000 description 1
- NINIDFKCEFEMDL-UHFFFAOYSA-N Sulfur Chemical compound [S] NINIDFKCEFEMDL-UHFFFAOYSA-N 0.000 description 1
- 229910052771 Terbium Inorganic materials 0.000 description 1
- 239000004098 Tetracycline Substances 0.000 description 1
- 102000006601 Thymidine Kinase Human genes 0.000 description 1
- 108020004440 Thymidine kinase Proteins 0.000 description 1
- 230000010632 Transcription Factor Activity Effects 0.000 description 1
- 108090000848 Ubiquitin Proteins 0.000 description 1
- 102000044159 Ubiquitin Human genes 0.000 description 1
- 108010005705 Ubiquitinated Proteins Proteins 0.000 description 1
- HHSILIQTHXABKM-YDHLFZDLSA-N Val-Asp-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](Cc1ccccc1)C(O)=O HHSILIQTHXABKM-YDHLFZDLSA-N 0.000 description 1
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 1
- 241000251539 Vertebrata <Metazoa> Species 0.000 description 1
- 241000269370 Xenopus <genus> Species 0.000 description 1
- INULNSAIIZKOQE-YOSAUDMPSA-N [(3r,4ar,10ar)-6-methoxy-1-methyl-3,4,4a,5,10,10a-hexahydro-2h-benzo[g]quinolin-3-yl]-[4-(4-nitrophenyl)piperazin-1-yl]methanone Chemical compound O=C([C@@H]1C[C@H]2[C@H](N(C1)C)CC=1C=CC=C(C=1C2)OC)N(CC1)CCN1C1=CC=C([N+]([O-])=O)C=C1 INULNSAIIZKOQE-YOSAUDMPSA-N 0.000 description 1
- HMNZFMSWFCAGGW-XPWSMXQVSA-N [3-[hydroxy(2-hydroxyethoxy)phosphoryl]oxy-2-[(e)-octadec-9-enoyl]oxypropyl] (e)-octadec-9-enoate Chemical compound CCCCCCCC\C=C\CCCCCCCC(=O)OCC(COP(O)(=O)OCCO)OC(=O)CCCCCCC\C=C\CCCCCCCC HMNZFMSWFCAGGW-XPWSMXQVSA-N 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 230000002730 additional effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 230000000996 additive effect Effects 0.000 description 1
- UDMBCSSLTHHNCD-KQYNXXCUSA-N adenosine 5'-monophosphate Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@@H]1O[C@H](COP(O)(O)=O)[C@@H](O)[C@H]1O UDMBCSSLTHHNCD-KQYNXXCUSA-N 0.000 description 1
- LNQVTSROQXJCDD-UHFFFAOYSA-N adenosine monophosphate Natural products C1=NC=2C(N)=NC=NC=2N1C1OC(CO)C(OP(O)(O)=O)C1O LNQVTSROQXJCDD-UHFFFAOYSA-N 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- 150000001412 amines Chemical class 0.000 description 1
- 230000003321 amplification Effects 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 150000001450 anions Chemical class 0.000 description 1
- 108091006049 anthozoan fluorescent proteins Proteins 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 238000000149 argon plasma sintering Methods 0.000 description 1
- 235000009582 asparagine Nutrition 0.000 description 1
- 229960001230 asparagine Drugs 0.000 description 1
- 235000003704 aspartic acid Nutrition 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- QVGXLLKOCUKJST-UHFFFAOYSA-N atomic oxygen Chemical compound [O] QVGXLLKOCUKJST-UHFFFAOYSA-N 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229940049706 benzodiazepine Drugs 0.000 description 1
- 150000001557 benzodiazepines Chemical class 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- DRTQHJPVMGBUCF-PSQAKQOGSA-N beta-L-uridine Natural products O[C@H]1[C@@H](O)[C@H](CO)O[C@@H]1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-PSQAKQOGSA-N 0.000 description 1
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 239000011942 biocatalyst Substances 0.000 description 1
- 239000003124 biologic agent Substances 0.000 description 1
- 230000004071 biological effect Effects 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 238000005415 bioluminescence Methods 0.000 description 1
- 230000029918 bioluminescence Effects 0.000 description 1
- 238000001574 biopsy Methods 0.000 description 1
- 229960002685 biotin Drugs 0.000 description 1
- 235000020958 biotin Nutrition 0.000 description 1
- 239000011616 biotin Substances 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- 239000011575 calcium Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 229940095731 candida albicans Drugs 0.000 description 1
- 125000001314 canonical amino-acid group Chemical group 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 125000004432 carbon atom Chemical group C* 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 238000006555 catalytic reaction Methods 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 230000010307 cell transformation Effects 0.000 description 1
- 230000030570 cellular localization Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 239000013043 chemical agent Substances 0.000 description 1
- 238000009614 chemical analysis method Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 239000013068 control sample Substances 0.000 description 1
- 230000008878 coupling Effects 0.000 description 1
- 238000010168 coupling process Methods 0.000 description 1
- 238000005859 coupling reaction Methods 0.000 description 1
- 229960003624 creatine Drugs 0.000 description 1
- 239000006046 creatine Substances 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 210000000172 cytosol Anatomy 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 238000004925 denaturation Methods 0.000 description 1
- 230000036425 denaturation Effects 0.000 description 1
- 239000005547 deoxyribonucleotide Substances 0.000 description 1
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- QONQRTHLHBTMGP-UHFFFAOYSA-N digitoxigenin Natural products CC12CCC(C3(CCC(O)CC3CC3)C)C3C11OC1CC2C1=CC(=O)OC1 QONQRTHLHBTMGP-UHFFFAOYSA-N 0.000 description 1
- SHIBSTMRCDJXLN-KCZCNTNESA-N digoxigenin Chemical compound C1([C@@H]2[C@@]3([C@@](CC2)(O)[C@H]2[C@@H]([C@@]4(C)CC[C@H](O)C[C@H]4CC2)C[C@H]3O)C)=CC(=O)OC1 SHIBSTMRCDJXLN-KCZCNTNESA-N 0.000 description 1
- SWSQBOPZIKWTGO-UHFFFAOYSA-N dimethylaminoamidine Natural products CN(C)C(N)=N SWSQBOPZIKWTGO-UHFFFAOYSA-N 0.000 description 1
- 229960003722 doxycycline Drugs 0.000 description 1
- 229940079593 drug Drugs 0.000 description 1
- 239000003814 drug Substances 0.000 description 1
- 238000009510 drug design Methods 0.000 description 1
- 229940111782 egg extract Drugs 0.000 description 1
- 230000005670 electromagnetic radiation Effects 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 210000002472 endoplasmic reticulum Anatomy 0.000 description 1
- 210000001163 endosome Anatomy 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- ADRVRLIZHFZKFE-UHFFFAOYSA-N ethanediperoxoic acid Chemical compound OOC(=O)C(=O)OO ADRVRLIZHFZKFE-UHFFFAOYSA-N 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 239000000835 fiber Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000000198 fluorescence anisotropy Methods 0.000 description 1
- 238000000799 fluorescence microscopy Methods 0.000 description 1
- 238000001506 fluorescence spectroscopy Methods 0.000 description 1
- 239000007850 fluorescent dye Substances 0.000 description 1
- 125000001965 gamma-lactamyl group Chemical group 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 239000003862 glucocorticoid Substances 0.000 description 1
- 235000013922 glutamic acid Nutrition 0.000 description 1
- 239000004220 glutamic acid Substances 0.000 description 1
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 1
- 230000013595 glycosylation Effects 0.000 description 1
- 238000006206 glycosylation reaction Methods 0.000 description 1
- 108010038983 glycyl-histidyl-lysine Proteins 0.000 description 1
- 239000005090 green fluorescent protein Substances 0.000 description 1
- ZRALSGWEFCBTJO-UHFFFAOYSA-O guanidinium Chemical compound NC(N)=[NH2+] ZRALSGWEFCBTJO-UHFFFAOYSA-O 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000004128 high performance liquid chromatography Methods 0.000 description 1
- 238000013537 high throughput screening Methods 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 230000000984 immunochemical effect Effects 0.000 description 1
- 229940072221 immunoglobulins Drugs 0.000 description 1
- 238000000338 in vitro Methods 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 230000006662 intracellular pathway Effects 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- 229960000310 isoleucine Drugs 0.000 description 1
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 1
- 230000006122 isoprenylation Effects 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000011031 large-scale manufacturing process Methods 0.000 description 1
- HWYHZTIRURJOHG-UHFFFAOYSA-N luminol Chemical class O=C1NNC(=O)C2=C1C(N)=CC=C2 HWYHZTIRURJOHG-UHFFFAOYSA-N 0.000 description 1
- 210000003712 lysosome Anatomy 0.000 description 1
- 230000001868 lysosomic effect Effects 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 230000010198 maturation time Effects 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 229910021645 metal ion Inorganic materials 0.000 description 1
- 125000000325 methylidene group Chemical group [H]C([H])=* 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 230000003228 microsomal effect Effects 0.000 description 1
- 210000003470 mitochondria Anatomy 0.000 description 1
- 210000001700 mitochondrial membrane Anatomy 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 239000003607 modifier Substances 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 238000012900 molecular simulation Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 231100000219 mutagenic Toxicity 0.000 description 1
- 230000003505 mutagenic effect Effects 0.000 description 1
- 230000007498 myristoylation Effects 0.000 description 1
- 229910052757 nitrogen Inorganic materials 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000003199 nucleic acid amplification method Methods 0.000 description 1
- 108010089433 obelin Proteins 0.000 description 1
- 238000006384 oligomerization reaction Methods 0.000 description 1
- 230000003606 oligomerizing effect Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229910052762 osmium Inorganic materials 0.000 description 1
- SYQBFIAQOQZEGI-UHFFFAOYSA-N osmium atom Chemical compound [Os] SYQBFIAQOQZEGI-UHFFFAOYSA-N 0.000 description 1
- 229910052760 oxygen Inorganic materials 0.000 description 1
- 239000001301 oxygen Substances 0.000 description 1
- 238000012856 packing Methods 0.000 description 1
- 230000035699 permeability Effects 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 125000005541 phosphonamide group Chemical group 0.000 description 1
- 229910052698 phosphorus Inorganic materials 0.000 description 1
- 239000011574 phosphorus Substances 0.000 description 1
- 239000007856 photoaffinity label Substances 0.000 description 1
- 238000002264 polyacrylamide gel electrophoresis Methods 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 239000011148 porous material Substances 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 238000002953 preparative HPLC Methods 0.000 description 1
- 239000000186 progesterone Substances 0.000 description 1
- 229960003387 progesterone Drugs 0.000 description 1
- 210000001236 prokaryotic cell Anatomy 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000000644 propagated effect Effects 0.000 description 1
- 230000001902 propagating effect Effects 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 235000004252 protein component Nutrition 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000004853 protein function Effects 0.000 description 1
- 230000005664 protein glycosylation in endoplasmic reticulum Effects 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000009145 protein modification Effects 0.000 description 1
- 238000001243 protein synthesis Methods 0.000 description 1
- 230000002797 proteolythic effect Effects 0.000 description 1
- 230000005610 quantum mechanics Effects 0.000 description 1
- 238000010791 quenching Methods 0.000 description 1
- 230000000171 quenching effect Effects 0.000 description 1
- 230000002285 radioactive effect Effects 0.000 description 1
- 102000005962 receptors Human genes 0.000 description 1
- 108020003175 receptors Proteins 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 230000000306 recurrent effect Effects 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 239000011347 resin Substances 0.000 description 1
- 229920005989 resin Polymers 0.000 description 1
- 230000004043 responsiveness Effects 0.000 description 1
- 239000002336 ribonucleotide Substances 0.000 description 1
- 125000002652 ribonucleotide group Chemical group 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 229910052707 ruthenium Inorganic materials 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000019491 signal transduction Effects 0.000 description 1
- 230000037432 silent mutation Effects 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000007790 solid phase Substances 0.000 description 1
- 238000004611 spectroscopical analysis Methods 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
- 238000000859 sublimation Methods 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 229940124530 sulfonamide Drugs 0.000 description 1
- 150000003456 sulfonamides Chemical class 0.000 description 1
- 229910052717 sulfur Inorganic materials 0.000 description 1
- 239000011593 sulfur Substances 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- GZCRRIHWUXGPOV-UHFFFAOYSA-N terbium atom Chemical compound [Tb] GZCRRIHWUXGPOV-UHFFFAOYSA-N 0.000 description 1
- 229960002180 tetracycline Drugs 0.000 description 1
- 229930101283 tetracycline Natural products 0.000 description 1
- 235000019364 tetracycline Nutrition 0.000 description 1
- 150000003522 tetracyclines Chemical class 0.000 description 1
- 125000000341 threoninyl group Chemical group [H]OC([H])(C([H])([H])[H])C([H])(N([H])[H])C(*)=O 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001529453 unidentified herpesvirus Species 0.000 description 1
- DRTQHJPVMGBUCF-UHFFFAOYSA-N uracil arabinoside Natural products OC1C(O)C(CO)OC1N1C(=O)NC(=O)C=C1 DRTQHJPVMGBUCF-UHFFFAOYSA-N 0.000 description 1
- 229940045145 uridine Drugs 0.000 description 1
- 239000004474 valine Substances 0.000 description 1
- 230000003612 virological effect Effects 0.000 description 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B15/00—ICT specially adapted for analysing two-dimensional or three-dimensional molecular structures, e.g. structural or functional relations or structure alignment
- G16B15/20—Protein or domain folding
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/43504—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates
- C07K14/43595—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans from invertebrates from coelenteratae, e.g. medusae
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B20/00—ICT specially adapted for functional genomics or proteomics, e.g. genotype-phenotype associations
- G16B20/50—Mutagenesis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B30/00—ICT specially adapted for sequence analysis involving nucleotides or amino acids
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B40/00—ICT specially adapted for biostatistics; ICT specially adapted for bioinformatics-related machine learning or data mining, e.g. knowledge discovery or pattern finding
- G16B40/20—Supervised data analysis
-
- G—PHYSICS
- G16—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
- G16B—BIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
- G16B5/00—ICT specially adapted for modelling or simulations in systems biology, e.g. gene-regulatory networks, protein interaction networks or metabolic networks
Landscapes
- Health & Medical Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Medical Informatics (AREA)
- Biophysics (AREA)
- General Health & Medical Sciences (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Chemical & Material Sciences (AREA)
- Theoretical Computer Science (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Biotechnology (AREA)
- Molecular Biology (AREA)
- Data Mining & Analysis (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Artificial Intelligence (AREA)
- Software Systems (AREA)
- Evolutionary Computation (AREA)
- Analytical Chemistry (AREA)
- Epidemiology (AREA)
- Databases & Information Systems (AREA)
- Public Health (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioethics (AREA)
- Crystallography & Structural Chemistry (AREA)
- Biochemistry (AREA)
- Gastroenterology & Hepatology (AREA)
- Zoology (AREA)
- Toxicology (AREA)
- Medicinal Chemistry (AREA)
- Tropical Medicine & Parasitology (AREA)
- Physiology (AREA)
- Computational Linguistics (AREA)
- Biomedical Technology (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
Abstract
Description
전술한 목적 및 특징뿐만 아니라 다른 목적 및 특징들은, 본 발명의 이해를 제공하고 본 명세서의 일부를 구성하기 위해 포함되는 아래의 설명 및 첨부 도면을 참조하여 명백해질 것이며, 여기에서 유사한 번호는 유사한 요소를 나타내며, 여기에서:
도 1a는 합성된 단백질 특성을 증가시키기 위한 컴퓨터 구현 신경망의 구현의 다이어그램이고;
도 1b는 미세환경의 중심에서 아미노산 잔기를 결정하기 위한 방법의 구현의 흐름도이고;
도 1c는 시험 중 합성된 단백질 특성을 증가시키기 위한 방법의 구현의 흐름도이고;
도 1d는 학습 중 합성된 단백질 특성을 증가시키기 위한 신경망의 구현의 블록 다이어그램이고;
도 1e는 합성된 단백질 특성을 증가시키기 위한 컨볼루션 신경망의 구현의 블록 다이어그램이고;
도 2a는 합성된 단백질 특성을 증가시키기 위한 방법 및 시스템의 구현의 실험 결과의 그래프이고;
도 2b는 합성된 단백질 특성을 증가시키기 위한 방법 및 시스템의 구현의 실험 결과의 또 다른 그래프이고;
도 3a는 합성된 단백질 특성을 증가시키기 위한 방법 및 시스템의 구현의 실험 결과의 또 다른 그래프이고;
도 3b는 합성된 단백질 특성을 증가시키기 위한 시스템의 구현에 의해 제안된 변형을 사용하여 합성된 단백질의 사진이고;
도 4a는 합성된 단백질 특성을 증가시키기 위한 방법 및 시스템의 구현의 실험 결과의 또 다른 그래프이고;
도 4b는 합성된 단백질 특성을 증가시키기 위한 시스템의 구현에 의해 제안된 제안된 단백질 변형의 다이어그램이고;
도 5는 합성된 단백질 특성을 증가시키기 위한 시스템의 구현의 실험 결과의 일련의 사진이고;
도 6 및 도 7은 합성된 단백질 특성을 증가시키기 위한 시스템의 구현의 실험 결과의 그래프이고;
도 8은 야생형 단백질에 대한 17개의 청색 형광 단백질 변이체의 형광의 배수 변화를 나타내는 그래프이고;
도 9는 야생형 단백질에 대한 청색 형광 단백질 변이체의 형광의 배수 변화를 나타내는 그래프이고;
도 10은 모 단백질 및 다른 청색 형광 단백질과 비교하여, S28A, S114T, N173H 및 T127L 돌연변이를 포함하는 청색 형광 단백질 변이체 "블루본넷(bluebonnet)"의 형광의 예시적인 이미지를 제공하고;
도 11a 및 도 11b는 합성된 단백질 특성을 증가시키기 위한 시스템의 구현을 도시하는 블록 다이어그램이다.
Claims (48)
- 신경망을 학습시켜 단백질의 특성을 개선하는 컴퓨터 구현 방법으로서,
데이터베이스로부터 아미노산 서열 세트를 수집하는 단계;
상기 아미노산 서열 세트에 대한 화학적 환경을 갖는 3차원 구조의 세트를 컴파일링하는 단계;
상기 3차원 구조를 복셀화된 매트릭스로 번역하는 단계;
상기 복셀화된 매트릭스의 서브세트로 신경망을 학습시키는 단계;
상기 신경망을 사용하여, 표적 단백질 내에서 돌연변이시킬 후보 잔기를 식별하는 단계; 및
상기 신경망을 사용하여, 돌연변이된 단백질을 생성하도록 상기 후보 잔기를 치환하기 위한 예측 아미노산 잔기를 식별하는 단계를 포함하되,
여기에서, 상기 돌연변이된 단백질은 신규 안정화 돌연변이를 포함하고 상기 표적 단백질에 비해 특성에 있어서의 개선을 나타내는,
컴퓨터 구현 방법. - 제1항에 있어서, 수소 위치, 부분 전하, 베타 인자, 이차 구조, 방향족성, 전자 밀도 및 극성으로 이루어진 군으로부터 선택된 특징부의 공간 배열을 상기 3차원 구조 중 적어도 하나에 추가하는 단계를 추가로 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 아미노산 서열 세트를 조정하여 이들의 고유 빈도를 반영하는 단계를 추가로 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 서열 내의 무작위 위치로부터 상기 아미노산 서열 세트 내의 아미노산의 적어도 50%를 샘플링하는 단계를 추가로 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 제2 독립적인 신경망을 3차원 구조의 제2 서브세트로 학습시키는 단계, 및 두 신경망 모두의 결과에 기초하여 상기 후보 및 예측 아미노산 잔기를 식별하는 단계를 추가로 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 특성은 안정성, 성숙도, 또는 접힘인, 컴퓨터 구현 방법.
- 프로세서 및 명령어가 저장된 비일시적 컴퓨터 판독가능 매체를 포함하는, 단백질의 특성을 개선하기 위한 시스템으로서, 상기 프로세서에 의해 실행될 경우,
아미노산 서열을 포함하는 표적 단백질을 제공하는 단계;
아미노산을 둘러싸는 3차원 모델 세트 및 각각의 3차원 모델에 대한 단백질 특성 값 세트를 제공하는 단계;
상기 각각의 3차원 모델의 다양한 지점에서 파라미터 세트를 추정하는 단계;
상기 3차원 모델, 상기 파라미터, 및 상기 단백질 특성 값으로 신경망을 학습시키는 단계;
상기 신경망을 사용하여, 상기 표적 단백질 내에서 돌연변이시킬 후보 잔기를 식별하는 단계; 및
상기 신경망을 사용하여, 상기 후보 잔기를 치환하기 위한 예측 아미노산 잔기를 식별하여 돌연변이된 단백질을 생성하는 단계를 포함하되,
여기에서, 상기 돌연변이된 단백질은 신규 안정화 돌연변이를 포함하고 상기 표적 단백질에 비해 상기 특성에 있어서의 개선을 나타내는,
시스템. - 제7항에 있어서, 상기 단백질 특성은 안정성인, 시스템.
- 제7항에 있어서, 상기 단계는 업데이트된 3차원 모델을 생성하기 위해 적어도 하나의 아미노산 서열을 재컴파일링하는 단계를 포함하는, 시스템.
- 제9항에 있어서, 상기 단계는 재컴파일링 전에 적어도 하나의 아미노산 서열에 특징부의 공간적 배열을 추가하는 단계를 포함하는, 시스템.
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 삭제
- 제1항에 있어서, 상기 3차원 구조를 복셀화된 매트릭스로 번역하는 단계는 상기 3차원 구조와 연관된 좌표를 3차원 어레이로 매핑하는 단계를 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 일정 비율 이상 계통발생적으로 발산되는 단백질들로부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 단백질 구조에 첨가된 수소 원자를 갖는 단백질들로부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 첨가된 생물물리학적 채널을 갖는 단백질들로부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 컴퓨터 구현 방법.
- 제1항에 있어서, 상기 복셀화된 매트릭스의 서브세트로부터, 고해상도 임계값을 만족하는 단백질을 제거하는 단계
를 더 포함하고,
상기 고해상도 임계값은 상기 3차원 구조에 기반한 전자 밀도 맵이 상기 전자 밀도 맵의 지점들 사이에서 임계 거리로 분해 가능한 것을 나타내는, 컴퓨터 구현 방법. - 제1항에 있어서,
상기 신경망은,
입력 단백질의 이미지 내의 중앙 아미노산 주위에서 기준 프레임을 생성하고,
상기 중앙 아미노산 주위에서 특징부를 추출하도록 구성되는, 컴퓨터 구현 방법. - 단백질의 특성을 개선하는 시스템에 있어서, 프로세서 및 실행될 경우 상기 프로세서가 단계들을 수행하도록 하는 명령어가 저장된 비일시적 컴퓨터 판독가능 매체를 포함하고,
상기 단계들은,
데이터베이스로부터 아미노산 서열 세트를 수집하는 단계;
상기 아미노산 서열 세트에 대한 화학적 환경을 갖는 3차원 구조의 세트를 컴파일링하는 단계;
상기 3차원 구조를 복셀화된 매트릭스로 번역하는 단계;
상기 복셀화된 매트릭스의 서브세트로 신경망을 학습시키는 단계;
상기 신경망을 사용하여, 표적 단백질 내에서 돌연변이시킬 후보 잔기를 식별하는 단계; 및
상기 신경망을 사용하여, 돌연변이된 단백질을 생성하도록 상기 후보 잔기를 치환하기 위한 예측 아미노산 잔기를 식별하는 단계를 포함하되,
여기에서, 상기 돌연변이된 단백질은 신규 안정화 돌연변이를 포함하고 상기 표적 단백질에 비해 특성에 있어서의 개선을 나타내는,
시스템. - 제37항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 일정 비율 이상 계통발생적으로 발산되는 단백질들로부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 시스템.
- 제37항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 단백질 구조에 첨가된 수소 원자를 갖는 단백질들로부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 시스템.
- 제37항에 있어서, 상기 복셀화된 매트릭스의 서브세트는 첨가된 생물물리학적 채널을 갖는 단백질들부터 구축되는 학습 데이터 세트의 매트릭스를 포함하는, 시스템.
- 제1항에 있어서, 상기 돌연변이된 단백질은 포스포만노오스 이소머라아제를 포함하고, 상기 신규 안정화 돌연변이는 D229W, N272K, C295V, S368P, L335A, N388S, S425T 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 컴퓨터 구현 방법.
- 제7항에 있어서, 상기 돌연변이된 단백질은 포스포만노오스 이소머라아제를 포함하고, 상기 신규 안정화 돌연변이는 D229W, N272K, C295V, S368P, L335A, N388S, S425T 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 시스템.
- 제37항에 있어서, 상기 돌연변이된 단백질은 포스포만노오스 이소머라아제를 포함하고, 상기 신규 안정화 돌연변이는 D229W, N272K, C295V, S368P, L335A, N388S, S425T 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 시스템.
- 제1항에 있어서, 상기 돌연변이된 단백질은 청색 형광 단백질을 포함하고, 상기 신규 안정화 돌연변이는 S28A, Y96F, S114T, V124R, T127L, N173H 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 컴퓨터 구현 방법.
- 제7항에 있어서, 상기 돌연변이된 단백질은 청색 형광 단백질을 포함하고, 상기 신규 안정화 돌연변이는 S28A, Y96F, S114T, V124R, T127L, N173H 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 시스템.
- 제37항에 있어서, 상기 돌연변이된 단백질은 청색 형광 단백질을 포함하고, 상기 신규 안정화 돌연변이는 S28A, Y96F, S114T, V124R, T127L, N173H 및 이들의 조합으로 이루어진 군으로부터 선택된 것인, 시스템.
- 제1항에 있어서, 상기 3차원 구조의 세트의 적어도 일부는 3차원 결정 구조인, 컴퓨터 구현 방법.
- 제37항에 있어서, 상기 3차원 구조의 세트의 적어도 일부는 3차원 결정 구조인, 시스템.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020247006380A KR20240033101A (ko) | 2019-05-02 | 2020-05-01 | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 |
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US201962841906P | 2019-05-02 | 2019-05-02 | |
US62/841,906 | 2019-05-02 | ||
PCT/US2020/031084 WO2020247126A2 (en) | 2019-05-02 | 2020-05-01 | System and method for increasing synthesized protein stability |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020247006380A Division KR20240033101A (ko) | 2019-05-02 | 2020-05-01 | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 |
Publications (2)
Publication Number | Publication Date |
---|---|
KR20220002978A KR20220002978A (ko) | 2022-01-07 |
KR102642718B1 true KR102642718B1 (ko) | 2024-02-29 |
Family
ID=73652871
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020247006380A Pending KR20240033101A (ko) | 2019-05-02 | 2020-05-01 | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 |
KR1020217037656A Active KR102642718B1 (ko) | 2019-05-02 | 2020-05-01 | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
KR1020247006380A Pending KR20240033101A (ko) | 2019-05-02 | 2020-05-01 | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 |
Country Status (7)
Country | Link |
---|---|
US (2) | US11551786B2 (ko) |
EP (1) | EP3962932A4 (ko) |
JP (2) | JP7387760B2 (ko) |
KR (2) | KR20240033101A (ko) |
CN (2) | CN113727994B (ko) |
CA (1) | CA3138861A1 (ko) |
WO (1) | WO2020247126A2 (ko) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2022260171A1 (ja) * | 2021-06-11 | 2022-12-15 | 株式会社 Preferred Networks | 推定装置及びモデル生成方法 |
US20240363200A1 (en) * | 2021-08-30 | 2024-10-31 | Intellisafe Llc | Real-time virus and damaging agent detection |
EP4222749A4 (en) * | 2021-12-24 | 2023-08-30 | GeneSense Technology Inc. | DEEP LEARNING-BASED METHODS AND SYSTEMS FOR NUCLEIC ACID SEQUENCING |
WO2023132519A1 (ko) | 2022-01-07 | 2023-07-13 | 주식회사 엘지에너지솔루션 | 화염차단부재가 부가된 버스바를 포함하는 전지셀 어셈블리 및 이를 포함하는 전지셀 어셈블리 구조체 |
CN114550824B (zh) * | 2022-01-29 | 2022-11-22 | 河南大学 | 基于嵌入特征和不平衡分类损失的蛋白质折叠识别方法及系统 |
CN115171787A (zh) * | 2022-07-08 | 2022-10-11 | 腾讯科技(深圳)有限公司 | 抗原预测方法、装置、设备以及存储介质 |
WO2024122449A1 (ja) * | 2022-12-06 | 2024-06-13 | 株式会社レボルカ | 機械学習による抗体設計法 |
CN116486903B (zh) * | 2023-04-17 | 2023-12-29 | 深圳新锐基因科技有限公司 | 基于同源蛋白序列进化方向结合自由能变提高蛋白稳定性的方法及装置 |
CN116486906B (zh) * | 2023-04-17 | 2024-03-19 | 深圳新锐基因科技有限公司 | 基于氨基酸残基突变提高蛋白质分子稳定性的方法及装置 |
CN117594114B (zh) * | 2023-10-30 | 2024-12-03 | 康复大学(筹) | 一种基于蛋白质结构域预测与生物大分子修饰结合的类抗体的方法 |
CN117831625B (zh) * | 2023-12-19 | 2024-07-19 | 苏州沃时数字科技有限公司 | 基于图神经网络的酶定向突变序列预测方法、系统及介质 |
CN119851768A (zh) * | 2025-01-07 | 2025-04-18 | 广东工业大学 | 基于卷积神经网络和残差注意力机制的蛋白质表达预测方法 |
Family Cites Families (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0589074A (ja) * | 1991-09-30 | 1993-04-09 | Fujitsu Ltd | 二次構造予測装置 |
JP2551297B2 (ja) * | 1992-05-18 | 1996-11-06 | 日本電気株式会社 | タンパク質立体構造予測方法 |
CA2232727C (en) | 1995-09-22 | 2002-03-26 | Novo Nordisk A/S | Novel variants of green fluorescent protein, gfp |
JP2000229994A (ja) | 1999-02-15 | 2000-08-22 | Nec Corp | 蛋白質立体構造予測方法及び装置 |
US20030165988A1 (en) * | 2002-02-08 | 2003-09-04 | Shaobing Hua | High throughput generation of human monoclonal antibody against peptide fragments derived from membrane proteins |
GB0109858D0 (en) | 2001-04-23 | 2001-06-13 | Amersham Pharm Biotech Uk Ltd | Fluorscent proteins |
CA2462591A1 (en) | 2001-10-05 | 2003-05-01 | Riken | Method of presuming domain linker region of protein |
JP5319865B2 (ja) | 2002-03-01 | 2013-10-16 | コデクシス メイフラワー ホールディングス, エルエルシー | 機能的生体分子を同定する方法、システム、およびソフトウェア |
AU2003268546A1 (en) * | 2002-09-06 | 2004-03-29 | Massachusetts Institute Of Technology | Lentiviral vectors, related reagents, and methods of use thereof |
CA2646508A1 (en) * | 2006-03-17 | 2007-09-27 | Biogen Idec Ma Inc. | Stabilized polypeptide compositions |
US20080215301A1 (en) * | 2006-05-22 | 2008-09-04 | Yeda Research And Development Co. Ltd. | Method and apparatus for predicting protein structure |
PL2631241T3 (pl) * | 2006-10-10 | 2019-02-28 | The Australian National University | Sposób generowania białka i jego zastosowanie |
CN102421789B (zh) * | 2009-04-15 | 2015-02-25 | 浦项工科大学校产学协力团 | 靶特异性非抗体蛋白质及其制备方法 |
CN103179940B (zh) * | 2010-08-24 | 2016-01-13 | 安全白有限公司 | 提供牙齿洁白外观的方法和材料 |
DK2951754T3 (da) * | 2013-01-31 | 2024-04-15 | Codexis Inc | Fremgangsmåder, systemer og software til identifikation af biomolekyler med interagerende komponenter |
GB201310859D0 (en) * | 2013-06-18 | 2013-07-31 | Cambridge Entpr Ltd | Rational method for solubilising proteins |
US9354175B2 (en) | 2014-01-10 | 2016-05-31 | Lucigen Corporation | Lucigen yellow (LucY), a yellow fluorescent protein |
US9920102B2 (en) * | 2015-05-15 | 2018-03-20 | Albert Einstein College Of Medicine, Inc. | Fusion tags for protein expression |
JP6975140B2 (ja) * | 2015-10-04 | 2021-12-01 | アトムワイズ,インコーポレイテッド | 畳み込みネットワークを空間データに適用するためのシステム及び方法 |
JP7048065B2 (ja) | 2017-08-02 | 2022-04-05 | 学校法人立命館 | 結合性予測方法、装置、プログラム、記録媒体、および機械学習アルゴリズムの学習方法 |
CN108460742A (zh) * | 2018-03-14 | 2018-08-28 | 日照职业技术学院 | 一种基于bp神经网络的图像复原方法 |
-
2020
- 2020-05-01 WO PCT/US2020/031084 patent/WO2020247126A2/en unknown
- 2020-05-01 CN CN202080032395.7A patent/CN113727994B/zh active Active
- 2020-05-01 JP JP2021564714A patent/JP7387760B2/ja active Active
- 2020-05-01 EP EP20818752.6A patent/EP3962932A4/en active Pending
- 2020-05-01 CA CA3138861A patent/CA3138861A1/en active Pending
- 2020-05-01 KR KR1020247006380A patent/KR20240033101A/ko active Pending
- 2020-05-01 CN CN202410966146.5A patent/CN119080901A/zh active Pending
- 2020-05-01 KR KR1020217037656A patent/KR102642718B1/ko active Active
-
2021
- 2021-10-27 US US17/512,116 patent/US11551786B2/en active Active
-
2022
- 2022-11-28 US US18/059,180 patent/US20230162816A1/en active Pending
-
2023
- 2023-11-15 JP JP2023194581A patent/JP2024016257A/ja active Pending
Also Published As
Publication number | Publication date |
---|---|
US20230162816A1 (en) | 2023-05-25 |
CN113727994A (zh) | 2021-11-30 |
WO2020247126A3 (en) | 2021-03-11 |
JP2024016257A (ja) | 2024-02-06 |
KR20240033101A (ko) | 2024-03-12 |
EP3962932A2 (en) | 2022-03-09 |
US11551786B2 (en) | 2023-01-10 |
WO2020247126A9 (en) | 2021-01-21 |
JP7387760B2 (ja) | 2023-11-28 |
CN113727994B (zh) | 2025-03-25 |
CN119080901A (zh) | 2024-12-06 |
EP3962932A4 (en) | 2023-05-10 |
CA3138861A1 (en) | 2020-12-10 |
JP2022531295A (ja) | 2022-07-06 |
KR20220002978A (ko) | 2022-01-07 |
WO2020247126A2 (en) | 2020-12-10 |
US20220076787A1 (en) | 2022-03-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR102642718B1 (ko) | 합성 단백질 안정성을 증가시키기 위한 시스템 및 방법 | |
Kerppola | Visualization of molecular interactions using bimolecular fluorescence complementation analysis: characteristics of protein fragment complementation | |
Alberts et al. | Analyzing protein structure and function | |
Krauspe et al. | Discovery of a small protein factor involved in the coordinated degradation of phycobilisomes in cyanobacteria | |
JP4505439B2 (ja) | 糖濃度に対するシグナル強度の向上した蛍光標識蛋白質及びその用途 | |
Moore et al. | Recovery of red fluorescent protein chromophore maturation deficiency through rational design | |
Bhan et al. | Recording temporal signals with minutes resolution using enzymatic DNA synthesis | |
Yudenko et al. | Rational design of a split flavin-based fluorescent reporter | |
Crone et al. | GFP-based biosensors | |
Yu et al. | Highly sensitive and selective detection of inorganic phosphates in the water environment by biosensors based on bioluminescence resonance energy transfer | |
Liu et al. | Engineering pre-SUMO4 as efficient substrate of SENP2 | |
Booth et al. | A FRET-based method for monitoring septin polymerization and binding of septin-associated proteins | |
US20240177805A1 (en) | System and methods for increasing synthesized protein stability | |
Aubel et al. | High-throughput Selection of Human de novo-emerged sORFs with High Folding Potential | |
AU2014268797A1 (en) | Genetically encoded sensors for imaging proteins and their complexes | |
Eggan et al. | Ligand-coupled conformational changes in a cyclic nucleotide-gated ion channel revealed by time-resolved transition metal ion FRET | |
Clifton et al. | Ancestral protein reconstruction and circular permutation for improving the stability and dynamic range of FRET Sensors | |
Meiresonne et al. | Detection of protein interactions in the cytoplasm and periplasm of Escherichia coli by Förster resonance energy transfer | |
HK40062411A (en) | System and method for increasing synthesized protein stability | |
Busso et al. | Using an Escherichia coli cell-free extract to screen for soluble expression of recombinant proteins | |
US7670787B2 (en) | Protein forming complex with c-Fos protein, nucleic acid encoding the same and method of using the same | |
Mathy et al. | A complete allosteric map of a GTPase switch in its native network | |
Meiresonne et al. | Detection of in vivo protein interactions in all bacterial compartments by förster resonance energy transfer with the superfolder mTurquoise2 ox-mNeongreen FRET pair | |
Krauspe et al. | Discovery of a novel small protein factor involved in the coordinated degradation of phycobilisomes in cyanobacteria | |
Baucom | Single Molecule Fluorescence Studies of protein structure and dynamics underlying the chloroplast signal recognition particle targeting pathway |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PA0105 | International application |
Patent event date: 20211118 Patent event code: PA01051R01D Comment text: International Patent Application |
|
PG1501 | Laying open of application | ||
PA0201 | Request for examination |
Patent event code: PA02012R01D Patent event date: 20221130 Comment text: Request for Examination of Application |
|
PA0302 | Request for accelerated examination |
Patent event date: 20221130 Patent event code: PA03022R01D Comment text: Request for Accelerated Examination |
|
E902 | Notification of reason for refusal | ||
PE0902 | Notice of grounds for rejection |
Comment text: Notification of reason for refusal Patent event date: 20230424 Patent event code: PE09021S01D |
|
E701 | Decision to grant or registration of patent right | ||
PE0701 | Decision of registration |
Patent event code: PE07011S01D Comment text: Decision to Grant Registration Patent event date: 20231127 |
|
GRNT | Written decision to grant | ||
PR0701 | Registration of establishment |
Comment text: Registration of Establishment Patent event date: 20240227 Patent event code: PR07011E01D |
|
PR1002 | Payment of registration fee |
Payment date: 20240227 End annual number: 3 Start annual number: 1 |
|
PG1601 | Publication of registration |