JP2003135080A - 新規遺伝子及びそれにコードされる蛋白質 - Google Patents
新規遺伝子及びそれにコードされる蛋白質Info
- Publication number
- JP2003135080A JP2003135080A JP2002220624A JP2002220624A JP2003135080A JP 2003135080 A JP2003135080 A JP 2003135080A JP 2002220624 A JP2002220624 A JP 2002220624A JP 2002220624 A JP2002220624 A JP 2002220624A JP 2003135080 A JP2003135080 A JP 2003135080A
- Authority
- JP
- Japan
- Prior art keywords
- leu
- ser
- pro
- gly
- ala
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 115
- 102000004169 proteins and genes Human genes 0.000 title claims abstract description 63
- 108090000765 processed proteins & peptides Proteins 0.000 claims abstract description 104
- 229920001184 polypeptide Polymers 0.000 claims abstract description 87
- 102000004196 processed proteins & peptides Human genes 0.000 claims abstract description 87
- 241000282414 Homo sapiens Species 0.000 claims abstract description 69
- 150000001413 amino acids Chemical class 0.000 claims abstract description 53
- 230000004071 biological effect Effects 0.000 claims abstract description 13
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 10
- 108020004414 DNA Proteins 0.000 claims description 113
- 239000002773 nucleotide Substances 0.000 claims description 20
- 125000003729 nucleotide group Chemical group 0.000 claims description 20
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 claims description 11
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 claims description 11
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 7
- 244000068988 Glycine max Species 0.000 claims description 2
- 235000010469 Glycine max Nutrition 0.000 claims description 2
- 230000036961 partial effect Effects 0.000 abstract description 25
- 210000004556 brain Anatomy 0.000 abstract description 16
- 239000002299 complementary DNA Substances 0.000 abstract description 9
- 230000001605 fetal effect Effects 0.000 abstract description 6
- 210000001320 hippocampus Anatomy 0.000 abstract description 6
- 241000282326 Felis catus Species 0.000 description 171
- 108010050848 glycylleucine Proteins 0.000 description 56
- 235000018102 proteins Nutrition 0.000 description 56
- 238000000034 method Methods 0.000 description 55
- 241000880493 Leptailurus serval Species 0.000 description 50
- 108010026333 seryl-proline Proteins 0.000 description 41
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 36
- 108010005233 alanylglutamic acid Proteins 0.000 description 35
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 34
- 108010093581 aspartyl-proline Proteins 0.000 description 33
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 30
- 108010057821 leucylproline Proteins 0.000 description 30
- 150000003839 salts Chemical class 0.000 description 30
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 29
- 108010061238 threonyl-glycine Proteins 0.000 description 28
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 27
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 26
- 108010049041 glutamylalanine Proteins 0.000 description 25
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 24
- 108010047495 alanylglycine Proteins 0.000 description 24
- 108010078144 glutaminyl-glycine Proteins 0.000 description 23
- 108010087924 alanylproline Proteins 0.000 description 22
- 235000001014 amino acid Nutrition 0.000 description 21
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 20
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 18
- 210000004027 cell Anatomy 0.000 description 18
- 108010009298 lysylglutamic acid Proteins 0.000 description 18
- 108010013835 arginine glutamate Proteins 0.000 description 17
- 108010060035 arginylproline Proteins 0.000 description 17
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 17
- 108010051242 phenylalanylserine Proteins 0.000 description 17
- 108010008355 arginyl-glutamine Proteins 0.000 description 16
- 108010089804 glycyl-threonine Proteins 0.000 description 16
- 108010015792 glycyllysine Proteins 0.000 description 16
- 108010029020 prolylglycine Proteins 0.000 description 16
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 15
- 108010044940 alanylglutamine Proteins 0.000 description 15
- 108010053725 prolylvaline Proteins 0.000 description 15
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 14
- 108010079364 N-glycylalanine Proteins 0.000 description 14
- XKUKSGPZAADMRA-UHFFFAOYSA-N glycyl-glycyl-glycine Chemical compound NCC(=O)NCC(=O)NCC(O)=O XKUKSGPZAADMRA-UHFFFAOYSA-N 0.000 description 14
- 108010010147 glycylglutamine Proteins 0.000 description 14
- 108010034529 leucyl-lysine Proteins 0.000 description 14
- 108010077112 prolyl-proline Proteins 0.000 description 14
- 108010031719 prolyl-serine Proteins 0.000 description 14
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 13
- BQVUABVGYYSDCJ-UHFFFAOYSA-N Nalpha-L-Leucyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)CC(C)C)C(O)=O)=CNC2=C1 BQVUABVGYYSDCJ-UHFFFAOYSA-N 0.000 description 13
- 108010056582 methionylglutamic acid Proteins 0.000 description 13
- 108010062796 arginyllysine Proteins 0.000 description 12
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 12
- 108010025306 histidylleucine Proteins 0.000 description 12
- 108010070643 prolylglutamic acid Proteins 0.000 description 12
- 108010048818 seryl-histidine Proteins 0.000 description 12
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 11
- SBANPBVRHYIMRR-GARJFASQSA-N Leu-Ser-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N SBANPBVRHYIMRR-GARJFASQSA-N 0.000 description 11
- 108010070944 alanylhistidine Proteins 0.000 description 11
- 108010068380 arginylarginine Proteins 0.000 description 11
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 11
- 150000001875 compounds Chemical class 0.000 description 11
- 108010081551 glycylphenylalanine Proteins 0.000 description 11
- 108010036413 histidylglycine Proteins 0.000 description 11
- 108010000761 leucylarginine Proteins 0.000 description 11
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 10
- XOWMDXHFSBCAKQ-SRVKXCTJSA-N Leu-Ser-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C XOWMDXHFSBCAKQ-SRVKXCTJSA-N 0.000 description 10
- MKGIILKDUGDRRO-FXQIFTODSA-N Pro-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 MKGIILKDUGDRRO-FXQIFTODSA-N 0.000 description 10
- 108010038633 aspartylglutamate Proteins 0.000 description 10
- 108010016616 cysteinylglycine Proteins 0.000 description 10
- 230000014509 gene expression Effects 0.000 description 10
- 108010037850 glycylvaline Proteins 0.000 description 10
- 108010085325 histidylproline Proteins 0.000 description 10
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 10
- 108010064235 lysylglycine Proteins 0.000 description 10
- 108010054155 lysyllysine Proteins 0.000 description 10
- 125000006239 protecting group Chemical group 0.000 description 10
- 239000000126 substance Substances 0.000 description 10
- 108010020532 tyrosyl-proline Proteins 0.000 description 10
- 108010073969 valyllysine Proteins 0.000 description 10
- BRPMXFSTKXXNHF-IUCAKERBSA-N (2s)-1-[2-[[(2s)-pyrrolidine-2-carbonyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound OC(=O)[C@@H]1CCCN1C(=O)CNC(=O)[C@H]1NCCC1 BRPMXFSTKXXNHF-IUCAKERBSA-N 0.000 description 9
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 9
- 108010065920 Insulin Lispro Proteins 0.000 description 9
- XVZCXCTYGHPNEM-UHFFFAOYSA-N Leu-Leu-Pro Natural products CC(C)CC(N)C(=O)NC(CC(C)C)C(=O)N1CCCC1C(O)=O XVZCXCTYGHPNEM-UHFFFAOYSA-N 0.000 description 9
- AZWNCEBQZXELEZ-FXQIFTODSA-N Ser-Pro-Ser Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O AZWNCEBQZXELEZ-FXQIFTODSA-N 0.000 description 9
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 9
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 9
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 9
- 108010047857 aspartylglycine Proteins 0.000 description 9
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 9
- 238000006243 chemical reaction Methods 0.000 description 9
- 150000002148 esters Chemical class 0.000 description 9
- 108010077515 glycylproline Proteins 0.000 description 9
- 108010092114 histidylphenylalanine Proteins 0.000 description 9
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 9
- 108010017391 lysylvaline Proteins 0.000 description 9
- 108010004914 prolylarginine Proteins 0.000 description 9
- 238000012216 screening Methods 0.000 description 9
- WKOBSJOZRJJVRZ-FXQIFTODSA-N Ala-Glu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WKOBSJOZRJJVRZ-FXQIFTODSA-N 0.000 description 8
- VBRDBGCROKWTPV-XHNCKOQMSA-N Ala-Glu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N VBRDBGCROKWTPV-XHNCKOQMSA-N 0.000 description 8
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 8
- PTRKPHUGYULXPU-KKUMJFAQSA-N Leu-Phe-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O PTRKPHUGYULXPU-KKUMJFAQSA-N 0.000 description 8
- DPURXCQCHSQPAN-AVGNSLFASA-N Leu-Pro-Pro Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DPURXCQCHSQPAN-AVGNSLFASA-N 0.000 description 8
- DXTOOBDIIAJZBJ-BQBZGAKWSA-N Pro-Gly-Ser Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CO)C(O)=O DXTOOBDIIAJZBJ-BQBZGAKWSA-N 0.000 description 8
- BGWKULMLUIUPKY-BQBZGAKWSA-N Pro-Ser-Gly Chemical compound OC(=O)CNC(=O)[C@H](CO)NC(=O)[C@@H]1CCCN1 BGWKULMLUIUPKY-BQBZGAKWSA-N 0.000 description 8
- FIODMZKLZFLYQP-GUBZILKMSA-N Pro-Val-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FIODMZKLZFLYQP-GUBZILKMSA-N 0.000 description 8
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 8
- 108010092854 aspartyllysine Proteins 0.000 description 8
- 108010060199 cysteinylproline Proteins 0.000 description 8
- 108010087823 glycyltyrosine Proteins 0.000 description 8
- 108010040030 histidinoalanine Proteins 0.000 description 8
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 8
- 210000001519 tissue Anatomy 0.000 description 8
- 108010051110 tyrosyl-lysine Proteins 0.000 description 8
- 239000013598 vector Substances 0.000 description 8
- JBVSSSZFNTXJDX-YTLHQDLWSA-N Ala-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](C)N JBVSSSZFNTXJDX-YTLHQDLWSA-N 0.000 description 7
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 7
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 7
- ZYLJULGXQDNXDK-GUBZILKMSA-N Leu-Gln-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O ZYLJULGXQDNXDK-GUBZILKMSA-N 0.000 description 7
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 7
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 7
- XMBSYZWANAQXEV-UHFFFAOYSA-N N-alpha-L-glutamyl-L-phenylalanine Natural products OC(=O)CCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XMBSYZWANAQXEV-UHFFFAOYSA-N 0.000 description 7
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 7
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 7
- CUXJENOFJXOSOZ-BIIVOSGPSA-N Ser-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CO)N)C(=O)O CUXJENOFJXOSOZ-BIIVOSGPSA-N 0.000 description 7
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 7
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 7
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 7
- 238000009833 condensation Methods 0.000 description 7
- 230000005494 condensation Effects 0.000 description 7
- 239000012634 fragment Substances 0.000 description 7
- 108010013768 glutamyl-aspartyl-proline Proteins 0.000 description 7
- 108010079547 glutamylmethionine Proteins 0.000 description 7
- 238000009396 hybridization Methods 0.000 description 7
- 108010003700 lysyl aspartic acid Proteins 0.000 description 7
- 108010005942 methionylglycine Proteins 0.000 description 7
- 108010012581 phenylalanylglutamate Proteins 0.000 description 7
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 7
- 108010080629 tryptophan-leucine Proteins 0.000 description 7
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 6
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 6
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 6
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 6
- JEOCWTUOMKEEMF-RHYQMDGZSA-N Arg-Leu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEOCWTUOMKEEMF-RHYQMDGZSA-N 0.000 description 6
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 6
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 6
- FQCILXROGNOZON-YUMQZZPRSA-N Gln-Pro-Gly Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O FQCILXROGNOZON-YUMQZZPRSA-N 0.000 description 6
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 6
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 6
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 6
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 6
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 6
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 6
- KAFOIVJDVSZUMD-UHFFFAOYSA-N Leu-Gln-Gln Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)NC(CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-UHFFFAOYSA-N 0.000 description 6
- QVFGXCVIXXBFHO-AVGNSLFASA-N Leu-Glu-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O QVFGXCVIXXBFHO-AVGNSLFASA-N 0.000 description 6
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 6
- APFJUBGRZGMQFF-QWRGUYRKSA-N Leu-Gly-Lys Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN APFJUBGRZGMQFF-QWRGUYRKSA-N 0.000 description 6
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 6
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 6
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 6
- QMKFDEUJGYNFMC-AVGNSLFASA-N Leu-Pro-Arg Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QMKFDEUJGYNFMC-AVGNSLFASA-N 0.000 description 6
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 6
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 6
- ZXFRGTAIIZHNHG-AJNGGQMLSA-N Lys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N ZXFRGTAIIZHNHG-AJNGGQMLSA-N 0.000 description 6
- XQLBWXHVZVBNJM-FXQIFTODSA-N Pro-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 XQLBWXHVZVBNJM-FXQIFTODSA-N 0.000 description 6
- HFZNNDWPHBRNPV-KZVJFYERSA-N Pro-Ala-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HFZNNDWPHBRNPV-KZVJFYERSA-N 0.000 description 6
- NXEYSLRNNPWCRN-SRVKXCTJSA-N Pro-Glu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXEYSLRNNPWCRN-SRVKXCTJSA-N 0.000 description 6
- CLNJSLSHKJECME-BQBZGAKWSA-N Pro-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H]1CCCN1 CLNJSLSHKJECME-BQBZGAKWSA-N 0.000 description 6
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 6
- MCWHYUWXVNRXFV-RWMBFGLXSA-N Pro-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 MCWHYUWXVNRXFV-RWMBFGLXSA-N 0.000 description 6
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 6
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 6
- HQTKVSCNCDLXSX-BQBZGAKWSA-N Ser-Arg-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O HQTKVSCNCDLXSX-BQBZGAKWSA-N 0.000 description 6
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 6
- MUARUIBTKQJKFY-WHFBIAKZSA-N Ser-Gly-Asp Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MUARUIBTKQJKFY-WHFBIAKZSA-N 0.000 description 6
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 6
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 6
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 6
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 6
- IGROJMCBGRFRGI-YTLHQDLWSA-N Thr-Ala-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O IGROJMCBGRFRGI-YTLHQDLWSA-N 0.000 description 6
- SLLKXDSRVAOREO-KZVJFYERSA-N Val-Ala-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N)O SLLKXDSRVAOREO-KZVJFYERSA-N 0.000 description 6
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 6
- AEMPCGRFEZTWIF-IHRRRGAJSA-N Val-Leu-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O AEMPCGRFEZTWIF-IHRRRGAJSA-N 0.000 description 6
- 210000004899 c-terminal region Anatomy 0.000 description 6
- 201000010099 disease Diseases 0.000 description 6
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 6
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 6
- 108010080575 glutamyl-aspartyl-alanine Proteins 0.000 description 6
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 6
- 108010026364 glycyl-glycyl-leucine Proteins 0.000 description 6
- 108010079413 glycyl-prolyl-glutamic acid Proteins 0.000 description 6
- 230000000984 immunochemical effect Effects 0.000 description 6
- 108010091871 leucylmethionine Proteins 0.000 description 6
- 150000007523 nucleic acids Chemical class 0.000 description 6
- 239000013612 plasmid Substances 0.000 description 6
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 6
- 108010090894 prolylleucine Proteins 0.000 description 6
- 239000011347 resin Substances 0.000 description 6
- 229920005989 resin Polymers 0.000 description 6
- 108010071207 serylmethionine Proteins 0.000 description 6
- 230000014616 translation Effects 0.000 description 6
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 5
- SGYSTDWPNPKJPP-GUBZILKMSA-N Arg-Ala-Arg Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SGYSTDWPNPKJPP-GUBZILKMSA-N 0.000 description 5
- GIVATXIGCXFQQA-FXQIFTODSA-N Arg-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N GIVATXIGCXFQQA-FXQIFTODSA-N 0.000 description 5
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 5
- PBSOQGZLPFVXPU-YUMQZZPRSA-N Arg-Glu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PBSOQGZLPFVXPU-YUMQZZPRSA-N 0.000 description 5
- NVCIXQYNWYTLDO-IHRRRGAJSA-N Arg-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCCN=C(N)N)N NVCIXQYNWYTLDO-IHRRRGAJSA-N 0.000 description 5
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 5
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 5
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 5
- NOCCABSVTRONIN-CIUDSAMLSA-N Cys-Ala-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N NOCCABSVTRONIN-CIUDSAMLSA-N 0.000 description 5
- OYTPNWYZORARHL-XHNCKOQMSA-N Gln-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N OYTPNWYZORARHL-XHNCKOQMSA-N 0.000 description 5
- QKCZZAZNMMVICF-DCAQKATOSA-N Gln-Leu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O QKCZZAZNMMVICF-DCAQKATOSA-N 0.000 description 5
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 5
- AVZHGSCDKIQZPQ-CIUDSAMLSA-N Glu-Arg-Ala Chemical compound C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AVZHGSCDKIQZPQ-CIUDSAMLSA-N 0.000 description 5
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 5
- CKOFNWCLWRYUHK-XHNCKOQMSA-N Glu-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O CKOFNWCLWRYUHK-XHNCKOQMSA-N 0.000 description 5
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 5
- MUSGDMDGNGXULI-DCAQKATOSA-N Glu-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O MUSGDMDGNGXULI-DCAQKATOSA-N 0.000 description 5
- IRXNJYPKBVERCW-DCAQKATOSA-N Glu-Leu-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IRXNJYPKBVERCW-DCAQKATOSA-N 0.000 description 5
- CQAHWYDHKUWYIX-YUMQZZPRSA-N Glu-Pro-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O CQAHWYDHKUWYIX-YUMQZZPRSA-N 0.000 description 5
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 5
- CQZDZKRHFWJXDF-WDSKDSINSA-N Gly-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)CN CQZDZKRHFWJXDF-WDSKDSINSA-N 0.000 description 5
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 5
- OLPPXYMMIARYAL-QMMMGPOBSA-N Gly-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)CN OLPPXYMMIARYAL-QMMMGPOBSA-N 0.000 description 5
- GMTXWRIDLGTVFC-IUCAKERBSA-N Gly-Lys-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O GMTXWRIDLGTVFC-IUCAKERBSA-N 0.000 description 5
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 5
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 5
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 5
- VKOAHIRLIUESLU-ULQDDVLXSA-N Leu-Arg-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VKOAHIRLIUESLU-ULQDDVLXSA-N 0.000 description 5
- OGCQGUIWMSBHRZ-CIUDSAMLSA-N Leu-Asn-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O OGCQGUIWMSBHRZ-CIUDSAMLSA-N 0.000 description 5
- VQPPIMUZCZCOIL-GUBZILKMSA-N Leu-Gln-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VQPPIMUZCZCOIL-GUBZILKMSA-N 0.000 description 5
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 5
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 5
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 5
- UCBPDSYUVAAHCD-UWVGGRQHSA-N Leu-Pro-Gly Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UCBPDSYUVAAHCD-UWVGGRQHSA-N 0.000 description 5
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 5
- QLFAPXUXEBAWEK-NHCYSSNCSA-N Lys-Val-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QLFAPXUXEBAWEK-NHCYSSNCSA-N 0.000 description 5
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 5
- IFMDQWDAJUMMJC-DCAQKATOSA-N Pro-Ala-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O IFMDQWDAJUMMJC-DCAQKATOSA-N 0.000 description 5
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 5
- KPDRZQUWJKTMBP-DCAQKATOSA-N Pro-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 KPDRZQUWJKTMBP-DCAQKATOSA-N 0.000 description 5
- VYWNORHENYEQDW-YUMQZZPRSA-N Pro-Gly-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 VYWNORHENYEQDW-YUMQZZPRSA-N 0.000 description 5
- UIMCLYYSUCIUJM-UWVGGRQHSA-N Pro-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 UIMCLYYSUCIUJM-UWVGGRQHSA-N 0.000 description 5
- JLMZKEQFMVORMA-SRVKXCTJSA-N Pro-Pro-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 JLMZKEQFMVORMA-SRVKXCTJSA-N 0.000 description 5
- SNGZLPOXVRTNMB-LPEHRKFASA-N Pro-Ser-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N2CCC[C@@H]2C(=O)O SNGZLPOXVRTNMB-LPEHRKFASA-N 0.000 description 5
- ZUGXSSFMTXKHJS-ZLUOBGJFSA-N Ser-Ala-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O ZUGXSSFMTXKHJS-ZLUOBGJFSA-N 0.000 description 5
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 5
- YRBGKVIWMNEVCZ-WDSKDSINSA-N Ser-Glu-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YRBGKVIWMNEVCZ-WDSKDSINSA-N 0.000 description 5
- WSTIOCFMWXNOCX-YUMQZZPRSA-N Ser-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N WSTIOCFMWXNOCX-YUMQZZPRSA-N 0.000 description 5
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 5
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 5
- ZIFYDQAFEMIZII-GUBZILKMSA-N Ser-Leu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZIFYDQAFEMIZII-GUBZILKMSA-N 0.000 description 5
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 5
- UPLYXVPQLJVWMM-KKUMJFAQSA-N Ser-Phe-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O UPLYXVPQLJVWMM-KKUMJFAQSA-N 0.000 description 5
- GYDFRTRSSXOZCR-ACZMJKKPSA-N Ser-Ser-Glu Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GYDFRTRSSXOZCR-ACZMJKKPSA-N 0.000 description 5
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 5
- KCRQEJSKXAIULJ-FJXKBIBVSA-N Thr-Gly-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O KCRQEJSKXAIULJ-FJXKBIBVSA-N 0.000 description 5
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 5
- MNYNCKZAEIAONY-XGEHTFHBSA-N Thr-Val-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O MNYNCKZAEIAONY-XGEHTFHBSA-N 0.000 description 5
- XGJLNBNZNMVJRS-NRPADANISA-N Val-Glu-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O XGJLNBNZNMVJRS-NRPADANISA-N 0.000 description 5
- FEXILLGKGGTLRI-NHCYSSNCSA-N Val-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N FEXILLGKGGTLRI-NHCYSSNCSA-N 0.000 description 5
- SYSWVVCYSXBVJG-RHYQMDGZSA-N Val-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N)O SYSWVVCYSXBVJG-RHYQMDGZSA-N 0.000 description 5
- VHIZXDZMTDVFGX-DCAQKATOSA-N Val-Ser-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N VHIZXDZMTDVFGX-DCAQKATOSA-N 0.000 description 5
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 5
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 5
- 150000001408 amides Chemical class 0.000 description 5
- 239000000427 antigen Substances 0.000 description 5
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 5
- 108010018691 arginyl-threonyl-arginine Proteins 0.000 description 5
- 230000000295 complement effect Effects 0.000 description 5
- 108010069495 cysteinyltyrosine Proteins 0.000 description 5
- 108010054813 diprotin B Proteins 0.000 description 5
- 239000003814 drug Substances 0.000 description 5
- 108010042598 glutamyl-aspartyl-glycine Proteins 0.000 description 5
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 5
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 5
- 108010067216 glycyl-glycyl-glycine Proteins 0.000 description 5
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 5
- 108010018006 histidylserine Proteins 0.000 description 5
- 238000003018 immunoassay Methods 0.000 description 5
- 108010044374 isoleucyl-tyrosine Proteins 0.000 description 5
- 108010083708 leucyl-aspartyl-valine Proteins 0.000 description 5
- 108010012058 leucyltyrosine Proteins 0.000 description 5
- 108020004707 nucleic acids Proteins 0.000 description 5
- 102000039446 nucleic acids Human genes 0.000 description 5
- 108010087846 prolyl-prolyl-glycine Proteins 0.000 description 5
- 108010072986 threonyl-seryl-lysine Proteins 0.000 description 5
- 108010084932 tryptophyl-proline Proteins 0.000 description 5
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 4
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 4
- TTXMOJWKNRJWQJ-FXQIFTODSA-N Ala-Arg-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N TTXMOJWKNRJWQJ-FXQIFTODSA-N 0.000 description 4
- WXERCAHAIKMTKX-ZLUOBGJFSA-N Ala-Asp-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O WXERCAHAIKMTKX-ZLUOBGJFSA-N 0.000 description 4
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 4
- HXNNRBHASOSVPG-GUBZILKMSA-N Ala-Glu-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HXNNRBHASOSVPG-GUBZILKMSA-N 0.000 description 4
- PUBLUECXJRHTBK-ACZMJKKPSA-N Ala-Glu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O PUBLUECXJRHTBK-ACZMJKKPSA-N 0.000 description 4
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 4
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 4
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 4
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 4
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 4
- SOBIAADAMRHGKH-CIUDSAMLSA-N Ala-Leu-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SOBIAADAMRHGKH-CIUDSAMLSA-N 0.000 description 4
- YHBDGLZYNIARKJ-GUBZILKMSA-N Ala-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N YHBDGLZYNIARKJ-GUBZILKMSA-N 0.000 description 4
- DCVYRWFAMZFSDA-ZLUOBGJFSA-N Ala-Ser-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DCVYRWFAMZFSDA-ZLUOBGJFSA-N 0.000 description 4
- ARHJJAAWNWOACN-FXQIFTODSA-N Ala-Ser-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O ARHJJAAWNWOACN-FXQIFTODSA-N 0.000 description 4
- VHAQSYHSDKERBS-XPUUQOCRSA-N Ala-Val-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O VHAQSYHSDKERBS-XPUUQOCRSA-N 0.000 description 4
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 4
- 108020004491 Antisense DNA Proteins 0.000 description 4
- HJAICMSAKODKRF-GUBZILKMSA-N Arg-Cys-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O HJAICMSAKODKRF-GUBZILKMSA-N 0.000 description 4
- VDBKFYYIBLXEIF-GUBZILKMSA-N Arg-Gln-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VDBKFYYIBLXEIF-GUBZILKMSA-N 0.000 description 4
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 4
- OQCWXQJLCDPRHV-UWVGGRQHSA-N Arg-Gly-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O OQCWXQJLCDPRHV-UWVGGRQHSA-N 0.000 description 4
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- ATABBWFGOHKROJ-GUBZILKMSA-N Arg-Pro-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ATABBWFGOHKROJ-GUBZILKMSA-N 0.000 description 4
- KMFPQTITXUKJOV-DCAQKATOSA-N Arg-Ser-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O KMFPQTITXUKJOV-DCAQKATOSA-N 0.000 description 4
- ICRHGPYYXMWHIE-LPEHRKFASA-N Arg-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ICRHGPYYXMWHIE-LPEHRKFASA-N 0.000 description 4
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 4
- WOZDCBHUGJVJPL-AVGNSLFASA-N Arg-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WOZDCBHUGJVJPL-AVGNSLFASA-N 0.000 description 4
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 4
- IARGXWMWRFOQPG-GCJQMDKQSA-N Asn-Ala-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IARGXWMWRFOQPG-GCJQMDKQSA-N 0.000 description 4
- ZMUQQMGITUJQTI-CIUDSAMLSA-N Asn-Leu-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMUQQMGITUJQTI-CIUDSAMLSA-N 0.000 description 4
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 4
- FBODFHMLALOPHP-GUBZILKMSA-N Asn-Lys-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O FBODFHMLALOPHP-GUBZILKMSA-N 0.000 description 4
- XAJRHVUUVUPFQL-ACZMJKKPSA-N Asp-Glu-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XAJRHVUUVUPFQL-ACZMJKKPSA-N 0.000 description 4
- OMMIEVATLAGRCK-BYPYZUCNSA-N Asp-Gly-Gly Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)NCC(O)=O OMMIEVATLAGRCK-BYPYZUCNSA-N 0.000 description 4
- QCVXMEHGFUMKCO-YUMQZZPRSA-N Asp-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O QCVXMEHGFUMKCO-YUMQZZPRSA-N 0.000 description 4
- KPSHWSWFPUDEGF-FXQIFTODSA-N Asp-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(O)=O KPSHWSWFPUDEGF-FXQIFTODSA-N 0.000 description 4
- HICVMZCGVFKTPM-BQBZGAKWSA-N Asp-Pro-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HICVMZCGVFKTPM-BQBZGAKWSA-N 0.000 description 4
- FAUPLTGRUBTXNU-FXQIFTODSA-N Asp-Pro-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O FAUPLTGRUBTXNU-FXQIFTODSA-N 0.000 description 4
- RVMXMLSYBTXCAV-VEVYYDQMSA-N Asp-Pro-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMXMLSYBTXCAV-VEVYYDQMSA-N 0.000 description 4
- XWKBWZXGNXTDKY-ZKWXMUAHSA-N Asp-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O XWKBWZXGNXTDKY-ZKWXMUAHSA-N 0.000 description 4
- 244000063299 Bacillus subtilis Species 0.000 description 4
- 235000014469 Bacillus subtilis Nutrition 0.000 description 4
- LKHMGNHQULEPFY-ACZMJKKPSA-N Cys-Ser-Glu Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O LKHMGNHQULEPFY-ACZMJKKPSA-N 0.000 description 4
- LPBUBIHAVKXUOT-FXQIFTODSA-N Cys-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N LPBUBIHAVKXUOT-FXQIFTODSA-N 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 4
- 108090000790 Enzymes Proteins 0.000 description 4
- 241000588724 Escherichia coli Species 0.000 description 4
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 4
- WQWMZOIPXWSZNE-WDSKDSINSA-N Gln-Asp-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O WQWMZOIPXWSZNE-WDSKDSINSA-N 0.000 description 4
- XJKAKYXMFHUIHT-AUTRQRHGSA-N Gln-Glu-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N XJKAKYXMFHUIHT-AUTRQRHGSA-N 0.000 description 4
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 4
- LPIKVBWNNVFHCQ-GUBZILKMSA-N Gln-Ser-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LPIKVBWNNVFHCQ-GUBZILKMSA-N 0.000 description 4
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 4
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 4
- HUWSBFYAGXCXKC-CIUDSAMLSA-N Glu-Ala-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O HUWSBFYAGXCXKC-CIUDSAMLSA-N 0.000 description 4
- ATRHMOJQJWPVBQ-DRZSPHRISA-N Glu-Ala-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ATRHMOJQJWPVBQ-DRZSPHRISA-N 0.000 description 4
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 4
- NCWOMXABNYEPLY-NRPADANISA-N Glu-Ala-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O NCWOMXABNYEPLY-NRPADANISA-N 0.000 description 4
- CGOHAEBMDSEKFB-FXQIFTODSA-N Glu-Glu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O CGOHAEBMDSEKFB-FXQIFTODSA-N 0.000 description 4
- HNVFSTLPVJWIDV-CIUDSAMLSA-N Glu-Glu-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HNVFSTLPVJWIDV-CIUDSAMLSA-N 0.000 description 4
- BUZMZDDKFCSKOT-CIUDSAMLSA-N Glu-Glu-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O BUZMZDDKFCSKOT-CIUDSAMLSA-N 0.000 description 4
- PXXGVUVQWQGGIG-YUMQZZPRSA-N Glu-Gly-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N PXXGVUVQWQGGIG-YUMQZZPRSA-N 0.000 description 4
- LRPXYSGPOBVBEH-IUCAKERBSA-N Glu-Gly-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O LRPXYSGPOBVBEH-IUCAKERBSA-N 0.000 description 4
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 4
- SJJHXJDSNQJMMW-SRVKXCTJSA-N Glu-Lys-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SJJHXJDSNQJMMW-SRVKXCTJSA-N 0.000 description 4
- QJVZSVUYZFYLFQ-CIUDSAMLSA-N Glu-Pro-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O QJVZSVUYZFYLFQ-CIUDSAMLSA-N 0.000 description 4
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 4
- DCBSZJJHOTXMHY-DCAQKATOSA-N Glu-Pro-Pro Chemical compound OC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DCBSZJJHOTXMHY-DCAQKATOSA-N 0.000 description 4
- SWDNPSMMEWRNOH-HJGDQZAQSA-N Glu-Pro-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWDNPSMMEWRNOH-HJGDQZAQSA-N 0.000 description 4
- NNQDRRUXFJYCCJ-NHCYSSNCSA-N Glu-Pro-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O NNQDRRUXFJYCCJ-NHCYSSNCSA-N 0.000 description 4
- IDEODOAVGCMUQV-GUBZILKMSA-N Glu-Ser-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IDEODOAVGCMUQV-GUBZILKMSA-N 0.000 description 4
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 4
- JXYMPBCYRKWJEE-BQBZGAKWSA-N Gly-Arg-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O JXYMPBCYRKWJEE-BQBZGAKWSA-N 0.000 description 4
- LCNXZQROPKFGQK-WHFBIAKZSA-N Gly-Asp-Ser Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O LCNXZQROPKFGQK-WHFBIAKZSA-N 0.000 description 4
- VUUOMYFPWDYETE-WDSKDSINSA-N Gly-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN VUUOMYFPWDYETE-WDSKDSINSA-N 0.000 description 4
- MBOAPAXLTUSMQI-JHEQGTHGSA-N Gly-Glu-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MBOAPAXLTUSMQI-JHEQGTHGSA-N 0.000 description 4
- QPTNELDXWKRIFX-YFKPBYRVSA-N Gly-Gly-Gln Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O QPTNELDXWKRIFX-YFKPBYRVSA-N 0.000 description 4
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 4
- VBOBNHSVQKKTOT-YUMQZZPRSA-N Gly-Lys-Ala Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O VBOBNHSVQKKTOT-YUMQZZPRSA-N 0.000 description 4
- GGLIDLCEPDHEJO-BQBZGAKWSA-N Gly-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)CN GGLIDLCEPDHEJO-BQBZGAKWSA-N 0.000 description 4
- OOCFXNOVSLSHAB-IUCAKERBSA-N Gly-Pro-Pro Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OOCFXNOVSLSHAB-IUCAKERBSA-N 0.000 description 4
- GLACUWHUYFBSPJ-FJXKBIBVSA-N Gly-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GLACUWHUYFBSPJ-FJXKBIBVSA-N 0.000 description 4
- SOEGEPHNZOISMT-BYPYZUCNSA-N Gly-Ser-Gly Chemical compound NCC(=O)N[C@@H](CO)C(=O)NCC(O)=O SOEGEPHNZOISMT-BYPYZUCNSA-N 0.000 description 4
- ABPRMMYHROQBLY-NKWVEPMBSA-N Gly-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)CN)C(=O)O ABPRMMYHROQBLY-NKWVEPMBSA-N 0.000 description 4
- ZZWUYQXMIFTIIY-WEDXCCLWSA-N Gly-Thr-Leu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O ZZWUYQXMIFTIIY-WEDXCCLWSA-N 0.000 description 4
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 4
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 4
- FZKFYOXDVWDELO-KBPBESRZSA-N His-Gly-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O FZKFYOXDVWDELO-KBPBESRZSA-N 0.000 description 4
- TWROVBNEHJSXDG-IHRRRGAJSA-N His-Leu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O TWROVBNEHJSXDG-IHRRRGAJSA-N 0.000 description 4
- YSGBJIQXTIVBHZ-AJNGGQMLSA-N Ile-Lys-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O YSGBJIQXTIVBHZ-AJNGGQMLSA-N 0.000 description 4
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 4
- PWWVAXIEGOYWEE-UHFFFAOYSA-N Isophenergan Chemical compound C1=CC=C2N(CC(C)N(C)C)C3=CC=CC=C3SC2=C1 PWWVAXIEGOYWEE-UHFFFAOYSA-N 0.000 description 4
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 4
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 4
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 4
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 4
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 4
- PVMPDMIKUVNOBD-CIUDSAMLSA-N Leu-Asp-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O PVMPDMIKUVNOBD-CIUDSAMLSA-N 0.000 description 4
- RVVBWTWPNFDYBE-SRVKXCTJSA-N Leu-Glu-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVVBWTWPNFDYBE-SRVKXCTJSA-N 0.000 description 4
- YVKSMSDXKMSIRX-GUBZILKMSA-N Leu-Glu-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O YVKSMSDXKMSIRX-GUBZILKMSA-N 0.000 description 4
- HFBCHNRFRYLZNV-GUBZILKMSA-N Leu-Glu-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HFBCHNRFRYLZNV-GUBZILKMSA-N 0.000 description 4
- KVMULWOHPPMHHE-DCAQKATOSA-N Leu-Glu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KVMULWOHPPMHHE-DCAQKATOSA-N 0.000 description 4
- WQWSMEOYXJTFRU-GUBZILKMSA-N Leu-Glu-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O WQWSMEOYXJTFRU-GUBZILKMSA-N 0.000 description 4
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 4
- CCQLQKZTXZBXTN-NHCYSSNCSA-N Leu-Gly-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CCQLQKZTXZBXTN-NHCYSSNCSA-N 0.000 description 4
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 4
- VGPCJSXPPOQPBK-YUMQZZPRSA-N Leu-Gly-Ser Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O VGPCJSXPPOQPBK-YUMQZZPRSA-N 0.000 description 4
- POZULHZYLPGXMR-ONGXEEELSA-N Leu-Gly-Val Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O POZULHZYLPGXMR-ONGXEEELSA-N 0.000 description 4
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 4
- IFMPDNRWZZEZSL-SRVKXCTJSA-N Leu-Leu-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O IFMPDNRWZZEZSL-SRVKXCTJSA-N 0.000 description 4
- JNDYEOUZBLOVOF-AVGNSLFASA-N Leu-Leu-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JNDYEOUZBLOVOF-AVGNSLFASA-N 0.000 description 4
- LXKNSJLSGPNHSK-KKUMJFAQSA-N Leu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N LXKNSJLSGPNHSK-KKUMJFAQSA-N 0.000 description 4
- IEWBEPKLKUXQBU-VOAKCMCISA-N Leu-Leu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IEWBEPKLKUXQBU-VOAKCMCISA-N 0.000 description 4
- XWEVVRRSIOBJOO-SRVKXCTJSA-N Leu-Pro-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O XWEVVRRSIOBJOO-SRVKXCTJSA-N 0.000 description 4
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 4
- JIHDFWWRYHSAQB-GUBZILKMSA-N Leu-Ser-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JIHDFWWRYHSAQB-GUBZILKMSA-N 0.000 description 4
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 4
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 4
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 4
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 4
- KWUKZRFFKPLUPE-HJGDQZAQSA-N Lys-Asp-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWUKZRFFKPLUPE-HJGDQZAQSA-N 0.000 description 4
- LLSUNJYOSCOOEB-GUBZILKMSA-N Lys-Glu-Asp Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O LLSUNJYOSCOOEB-GUBZILKMSA-N 0.000 description 4
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 4
- RBEATVHTWHTHTJ-KKUMJFAQSA-N Lys-Leu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O RBEATVHTWHTHTJ-KKUMJFAQSA-N 0.000 description 4
- RPWTZTBIFGENIA-VOAKCMCISA-N Lys-Thr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RPWTZTBIFGENIA-VOAKCMCISA-N 0.000 description 4
- NHXXGBXJTLRGJI-GUBZILKMSA-N Met-Pro-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O NHXXGBXJTLRGJI-GUBZILKMSA-N 0.000 description 4
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 4
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 4
- KAGCQPSEVAETCA-JYJNAYRXSA-N Phe-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N KAGCQPSEVAETCA-JYJNAYRXSA-N 0.000 description 4
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 4
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 4
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 4
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 4
- ZSKJPKFTPQCPIH-RCWTZXSCSA-N Pro-Arg-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSKJPKFTPQCPIH-RCWTZXSCSA-N 0.000 description 4
- UTAUEDINXUMHLG-FXQIFTODSA-N Pro-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 UTAUEDINXUMHLG-FXQIFTODSA-N 0.000 description 4
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 4
- NMELOOXSGDRBRU-YUMQZZPRSA-N Pro-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)O)NC(=O)[C@@H]1CCCN1 NMELOOXSGDRBRU-YUMQZZPRSA-N 0.000 description 4
- LGSANCBHSMDFDY-GARJFASQSA-N Pro-Glu-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O LGSANCBHSMDFDY-GARJFASQSA-N 0.000 description 4
- JMVQDLDPDBXAAX-YUMQZZPRSA-N Pro-Gly-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 JMVQDLDPDBXAAX-YUMQZZPRSA-N 0.000 description 4
- WSRWHZRUOCACLJ-UWVGGRQHSA-N Pro-Gly-His Chemical compound C([C@@H](C(=O)O)NC(=O)CNC(=O)[C@H]1NCCC1)C1=CN=CN1 WSRWHZRUOCACLJ-UWVGGRQHSA-N 0.000 description 4
- QEWBZBLXDKIQPS-STQMWFEESA-N Pro-Gly-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QEWBZBLXDKIQPS-STQMWFEESA-N 0.000 description 4
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 4
- XYSXOCIWCPFOCG-IHRRRGAJSA-N Pro-Leu-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XYSXOCIWCPFOCG-IHRRRGAJSA-N 0.000 description 4
- SUENWIFTSTWUKD-AVGNSLFASA-N Pro-Leu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O SUENWIFTSTWUKD-AVGNSLFASA-N 0.000 description 4
- CGSOWZUPLOKYOR-AVGNSLFASA-N Pro-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 CGSOWZUPLOKYOR-AVGNSLFASA-N 0.000 description 4
- SBVPYBFMIGDIDX-SRVKXCTJSA-N Pro-Pro-Pro Chemical compound OC(=O)[C@@H]1CCCN1C(=O)[C@H]1N(C(=O)[C@H]2NCCC2)CCC1 SBVPYBFMIGDIDX-SRVKXCTJSA-N 0.000 description 4
- POQFNPILEQEODH-FXQIFTODSA-N Pro-Ser-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O POQFNPILEQEODH-FXQIFTODSA-N 0.000 description 4
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 4
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 4
- WTUJZHKANPDPIN-CIUDSAMLSA-N Ser-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N WTUJZHKANPDPIN-CIUDSAMLSA-N 0.000 description 4
- QWZIOCFPXMAXET-CIUDSAMLSA-N Ser-Arg-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QWZIOCFPXMAXET-CIUDSAMLSA-N 0.000 description 4
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 4
- QVOGDCQNGLBNCR-FXQIFTODSA-N Ser-Arg-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O QVOGDCQNGLBNCR-FXQIFTODSA-N 0.000 description 4
- UOLGINIHBRIECN-FXQIFTODSA-N Ser-Glu-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UOLGINIHBRIECN-FXQIFTODSA-N 0.000 description 4
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 4
- QYSFWUIXDFJUDW-DCAQKATOSA-N Ser-Leu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QYSFWUIXDFJUDW-DCAQKATOSA-N 0.000 description 4
- GJFYFGOEWLDQGW-GUBZILKMSA-N Ser-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GJFYFGOEWLDQGW-GUBZILKMSA-N 0.000 description 4
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 4
- NNFMANHDYSVNIO-DCAQKATOSA-N Ser-Lys-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NNFMANHDYSVNIO-DCAQKATOSA-N 0.000 description 4
- MQUZANJDFOQOBX-SRVKXCTJSA-N Ser-Phe-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O MQUZANJDFOQOBX-SRVKXCTJSA-N 0.000 description 4
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 4
- FLONGDPORFIVQW-XGEHTFHBSA-N Ser-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FLONGDPORFIVQW-XGEHTFHBSA-N 0.000 description 4
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 4
- NADLKBTYNKUJEP-KATARQTJSA-N Ser-Thr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NADLKBTYNKUJEP-KATARQTJSA-N 0.000 description 4
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 4
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 4
- RKDFEMGVMMYYNG-WDCWCFNPSA-N Thr-Gln-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O RKDFEMGVMMYYNG-WDCWCFNPSA-N 0.000 description 4
- DJDSEDOKJTZBAR-ZDLURKLDSA-N Thr-Gly-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O DJDSEDOKJTZBAR-ZDLURKLDSA-N 0.000 description 4
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 4
- RFKVQLIXNVEOMB-WEDXCCLWSA-N Thr-Leu-Gly Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N)O RFKVQLIXNVEOMB-WEDXCCLWSA-N 0.000 description 4
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 4
- SGAOHNPSEPVAFP-ZDLURKLDSA-N Thr-Ser-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SGAOHNPSEPVAFP-ZDLURKLDSA-N 0.000 description 4
- LECUEEHKUFYOOV-ZJDVBMNYSA-N Thr-Thr-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](N)[C@@H](C)O LECUEEHKUFYOOV-ZJDVBMNYSA-N 0.000 description 4
- YFOCMOVJBQDBCE-NRPADANISA-N Val-Ala-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N YFOCMOVJBQDBCE-NRPADANISA-N 0.000 description 4
- VVZDBPBZHLQPPB-XVKPBYJWSA-N Val-Glu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O VVZDBPBZHLQPPB-XVKPBYJWSA-N 0.000 description 4
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 4
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 4
- VPGCVZRRBYOGCD-AVGNSLFASA-N Val-Lys-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O VPGCVZRRBYOGCD-AVGNSLFASA-N 0.000 description 4
- XBJKAZATRJBDCU-GUBZILKMSA-N Val-Pro-Ala Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O XBJKAZATRJBDCU-GUBZILKMSA-N 0.000 description 4
- RYQUMYBMOJYYDK-NHCYSSNCSA-N Val-Pro-Glu Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N RYQUMYBMOJYYDK-NHCYSSNCSA-N 0.000 description 4
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 4
- AJNUKMZFHXUBMK-GUBZILKMSA-N Val-Ser-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N AJNUKMZFHXUBMK-GUBZILKMSA-N 0.000 description 4
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 4
- LCHZBEUVGAVMKS-RHYQMDGZSA-N Val-Thr-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)[C@@H](C)O)C(O)=O LCHZBEUVGAVMKS-RHYQMDGZSA-N 0.000 description 4
- OFTXTCGQJXTNQS-XGEHTFHBSA-N Val-Thr-Ser Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N)O OFTXTCGQJXTNQS-XGEHTFHBSA-N 0.000 description 4
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 4
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 4
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 4
- 238000001994 activation Methods 0.000 description 4
- 230000004913 activation Effects 0.000 description 4
- 238000005273 aeration Methods 0.000 description 4
- 210000004102 animal cell Anatomy 0.000 description 4
- 108091007433 antigens Proteins 0.000 description 4
- 102000036639 antigens Human genes 0.000 description 4
- 108010052670 arginyl-glutamyl-glutamic acid Proteins 0.000 description 4
- 108010091092 arginyl-glycyl-proline Proteins 0.000 description 4
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 4
- 108010077245 asparaginyl-proline Proteins 0.000 description 4
- 108010068265 aspartyltyrosine Proteins 0.000 description 4
- 230000000694 effects Effects 0.000 description 4
- 229940088598 enzyme Drugs 0.000 description 4
- 239000013604 expression vector Substances 0.000 description 4
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 4
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 4
- 108010020688 glycylhistidine Proteins 0.000 description 4
- 108010050343 histidyl-alanyl-glutamine Proteins 0.000 description 4
- 108010027338 isoleucylcysteine Proteins 0.000 description 4
- 108010087810 leucyl-seryl-glutamyl-leucine Proteins 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 4
- 238000010647 peptide synthesis reaction Methods 0.000 description 4
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 4
- 108010083476 phenylalanyltryptophan Proteins 0.000 description 4
- 238000000746 purification Methods 0.000 description 4
- 238000002864 sequence alignment Methods 0.000 description 4
- 108010005652 splenotritin Proteins 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 238000013519 translation Methods 0.000 description 4
- 108010003137 tyrosyltyrosine Proteins 0.000 description 4
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 4
- GJLXVWOMRRWCIB-MERZOTPQSA-N (2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-[[(2S)-2-acetamido-5-(diaminomethylideneamino)pentanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-3-(4-hydroxyphenyl)propanoyl]amino]-5-(diaminomethylideneamino)pentanoyl]amino]-3-(1H-indol-3-yl)propanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanoyl]amino]-6-aminohexanamide Chemical compound C([C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(N)=O)C1=CC=C(O)C=C1 GJLXVWOMRRWCIB-MERZOTPQSA-N 0.000 description 3
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 3
- QTBSBXVTEAMEQO-UHFFFAOYSA-N Acetic acid Chemical compound CC(O)=O QTBSBXVTEAMEQO-UHFFFAOYSA-N 0.000 description 3
- WFDIJRYMOXRFFG-UHFFFAOYSA-N Acetic anhydride Chemical compound CC(=O)OC(C)=O WFDIJRYMOXRFFG-UHFFFAOYSA-N 0.000 description 3
- HHGYNJRJIINWAK-FXQIFTODSA-N Ala-Ala-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N HHGYNJRJIINWAK-FXQIFTODSA-N 0.000 description 3
- AAQGRPOPTAUUBM-ZLUOBGJFSA-N Ala-Ala-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O AAQGRPOPTAUUBM-ZLUOBGJFSA-N 0.000 description 3
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 3
- LSLIRHLIUDVNBN-CIUDSAMLSA-N Ala-Asp-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN LSLIRHLIUDVNBN-CIUDSAMLSA-N 0.000 description 3
- YSMPVONNIWLJML-FXQIFTODSA-N Ala-Asp-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(O)=O YSMPVONNIWLJML-FXQIFTODSA-N 0.000 description 3
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 3
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 3
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 3
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 3
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 3
- NOGFDULFCFXBHB-CIUDSAMLSA-N Ala-Leu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N NOGFDULFCFXBHB-CIUDSAMLSA-N 0.000 description 3
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 3
- VHVVPYOJIIQCKS-QEJZJMRPSA-N Ala-Leu-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VHVVPYOJIIQCKS-QEJZJMRPSA-N 0.000 description 3
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 3
- OSRZOHXQCUFIQG-FPMFFAJLSA-N Ala-Phe-Pro Chemical compound C([C@H](NC(=O)[C@@H]([NH3+])C)C(=O)N1[C@H](CCC1)C([O-])=O)C1=CC=CC=C1 OSRZOHXQCUFIQG-FPMFFAJLSA-N 0.000 description 3
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 3
- FEGOCLZUJUFCHP-CIUDSAMLSA-N Ala-Pro-Gln Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FEGOCLZUJUFCHP-CIUDSAMLSA-N 0.000 description 3
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 3
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 3
- PEEYDECOOVQKRZ-DLOVCJGASA-N Ala-Ser-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PEEYDECOOVQKRZ-DLOVCJGASA-N 0.000 description 3
- LSMDIAAALJJLRO-XQXXSGGOSA-N Ala-Thr-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O LSMDIAAALJJLRO-XQXXSGGOSA-N 0.000 description 3
- WNHNMKOFKCHKKD-BFHQHQDPSA-N Ala-Thr-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O WNHNMKOFKCHKKD-BFHQHQDPSA-N 0.000 description 3
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 3
- JJHBEVZAZXZREW-LFSVMHDDSA-N Ala-Thr-Phe Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O JJHBEVZAZXZREW-LFSVMHDDSA-N 0.000 description 3
- ZJLORAAXDAJLDC-CQDKDKBSSA-N Ala-Tyr-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O ZJLORAAXDAJLDC-CQDKDKBSSA-N 0.000 description 3
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 3
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 3
- IASNWHAGGYTEKX-IUCAKERBSA-N Arg-Arg-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(O)=O IASNWHAGGYTEKX-IUCAKERBSA-N 0.000 description 3
- JTWOBPNAVBESFW-FXQIFTODSA-N Arg-Cys-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)CN=C(N)N JTWOBPNAVBESFW-FXQIFTODSA-N 0.000 description 3
- QIWYWCYNUMJBTC-CIUDSAMLSA-N Arg-Cys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIWYWCYNUMJBTC-CIUDSAMLSA-N 0.000 description 3
- HPKSHFSEXICTLI-CIUDSAMLSA-N Arg-Glu-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O HPKSHFSEXICTLI-CIUDSAMLSA-N 0.000 description 3
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 3
- HAVKMRGWNXMCDR-STQMWFEESA-N Arg-Gly-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HAVKMRGWNXMCDR-STQMWFEESA-N 0.000 description 3
- ZATRYQNPUHGXCU-DTWKUNHWSA-N Arg-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZATRYQNPUHGXCU-DTWKUNHWSA-N 0.000 description 3
- NKNILFJYKKHBKE-WPRPVWTQSA-N Arg-Gly-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NKNILFJYKKHBKE-WPRPVWTQSA-N 0.000 description 3
- OGSQONVYSTZIJB-WDSOQIARSA-N Arg-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CCCN=C(N)N)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OGSQONVYSTZIJB-WDSOQIARSA-N 0.000 description 3
- IGFJVXOATGZTHD-UHFFFAOYSA-N Arg-Phe-His Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccccc1)C(=O)NC(Cc2c[nH]cn2)C(=O)O IGFJVXOATGZTHD-UHFFFAOYSA-N 0.000 description 3
- BSYKSCBTTQKOJG-GUBZILKMSA-N Arg-Pro-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O BSYKSCBTTQKOJG-GUBZILKMSA-N 0.000 description 3
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 3
- HGKHPCFTRQDHCU-IUCAKERBSA-N Arg-Pro-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O HGKHPCFTRQDHCU-IUCAKERBSA-N 0.000 description 3
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 3
- DNLQVHBBMPZUGJ-BQBZGAKWSA-N Arg-Ser-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O DNLQVHBBMPZUGJ-BQBZGAKWSA-N 0.000 description 3
- LFAUVOXPCGJKTB-DCAQKATOSA-N Arg-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N LFAUVOXPCGJKTB-DCAQKATOSA-N 0.000 description 3
- RYQSYXFGFOTJDJ-RHYQMDGZSA-N Arg-Thr-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O RYQSYXFGFOTJDJ-RHYQMDGZSA-N 0.000 description 3
- POOCJCRBHHMAOS-FXQIFTODSA-N Asn-Arg-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O POOCJCRBHHMAOS-FXQIFTODSA-N 0.000 description 3
- JREOBWLIZLXRIS-GUBZILKMSA-N Asn-Glu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JREOBWLIZLXRIS-GUBZILKMSA-N 0.000 description 3
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 3
- WIDVAWAQBRAKTI-YUMQZZPRSA-N Asn-Leu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O WIDVAWAQBRAKTI-YUMQZZPRSA-N 0.000 description 3
- BYLSYQASFJJBCL-DCAQKATOSA-N Asn-Pro-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BYLSYQASFJJBCL-DCAQKATOSA-N 0.000 description 3
- PQKSVQSMTHPRIB-ZKWXMUAHSA-N Asn-Val-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O PQKSVQSMTHPRIB-ZKWXMUAHSA-N 0.000 description 3
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 3
- SOYOSFXLXYZNRG-CIUDSAMLSA-N Asp-Arg-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O SOYOSFXLXYZNRG-CIUDSAMLSA-N 0.000 description 3
- AXXCUABIFZPKPM-BQBZGAKWSA-N Asp-Arg-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O AXXCUABIFZPKPM-BQBZGAKWSA-N 0.000 description 3
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 3
- QXHVOUSPVAWEMX-ZLUOBGJFSA-N Asp-Asp-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O QXHVOUSPVAWEMX-ZLUOBGJFSA-N 0.000 description 3
- HRGGPWBIMIQANI-GUBZILKMSA-N Asp-Gln-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O HRGGPWBIMIQANI-GUBZILKMSA-N 0.000 description 3
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 3
- DTNUIAJCPRMNBT-WHFBIAKZSA-N Asp-Gly-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O DTNUIAJCPRMNBT-WHFBIAKZSA-N 0.000 description 3
- HAFCJCDJGIOYPW-WDSKDSINSA-N Asp-Gly-Gln Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O HAFCJCDJGIOYPW-WDSKDSINSA-N 0.000 description 3
- WSXDIZFNQYTUJB-SRVKXCTJSA-N Asp-His-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O WSXDIZFNQYTUJB-SRVKXCTJSA-N 0.000 description 3
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 3
- DWOGMPWRQQWPPF-GUBZILKMSA-N Asp-Leu-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O DWOGMPWRQQWPPF-GUBZILKMSA-N 0.000 description 3
- ORRJQLIATJDMQM-HJGDQZAQSA-N Asp-Leu-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O ORRJQLIATJDMQM-HJGDQZAQSA-N 0.000 description 3
- YRZIYQGXTSBRLT-AVGNSLFASA-N Asp-Phe-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O YRZIYQGXTSBRLT-AVGNSLFASA-N 0.000 description 3
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 3
- DINOVZWPTMGSRF-QXEWZRGKSA-N Asp-Pro-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O DINOVZWPTMGSRF-QXEWZRGKSA-N 0.000 description 3
- WMLFFCRUSPNENW-ZLUOBGJFSA-N Asp-Ser-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O WMLFFCRUSPNENW-ZLUOBGJFSA-N 0.000 description 3
- QSFHZPQUAAQHAQ-CIUDSAMLSA-N Asp-Ser-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O QSFHZPQUAAQHAQ-CIUDSAMLSA-N 0.000 description 3
- 241000193830 Bacillus <bacterium> Species 0.000 description 3
- CVOZXIPULQQFNY-ZLUOBGJFSA-N Cys-Ala-Cys Chemical compound C[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O CVOZXIPULQQFNY-ZLUOBGJFSA-N 0.000 description 3
- PKNIZMPLMSKROD-BIIVOSGPSA-N Cys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N PKNIZMPLMSKROD-BIIVOSGPSA-N 0.000 description 3
- QLCPDGRAEJSYQM-LPEHRKFASA-N Cys-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)C(=O)O QLCPDGRAEJSYQM-LPEHRKFASA-N 0.000 description 3
- BIVLWXQGXJLGKG-BIIVOSGPSA-N Cys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)C(=O)O BIVLWXQGXJLGKG-BIIVOSGPSA-N 0.000 description 3
- HBHMVBGGHDMPBF-GARJFASQSA-N Cys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N HBHMVBGGHDMPBF-GARJFASQSA-N 0.000 description 3
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 3
- KJJASVYBTKRYSN-FXQIFTODSA-N Cys-Pro-Asp Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC(=O)O)C(=O)O KJJASVYBTKRYSN-FXQIFTODSA-N 0.000 description 3
- 241000588722 Escherichia Species 0.000 description 3
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 3
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 3
- RBWKVOSARCFSQQ-FXQIFTODSA-N Gln-Gln-Ser Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O RBWKVOSARCFSQQ-FXQIFTODSA-N 0.000 description 3
- VGTDBGYFVWOQTI-RYUDHWBXSA-N Gln-Gly-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VGTDBGYFVWOQTI-RYUDHWBXSA-N 0.000 description 3
- NSORZJXKUQFEKL-JGVFFNPUSA-N Gln-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)N)N)C(=O)O NSORZJXKUQFEKL-JGVFFNPUSA-N 0.000 description 3
- HXOLDXKNWKLDMM-YVNDNENWSA-N Gln-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HXOLDXKNWKLDMM-YVNDNENWSA-N 0.000 description 3
- ITZWDGBYBPUZRG-KBIXCLLPSA-N Gln-Ile-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O ITZWDGBYBPUZRG-KBIXCLLPSA-N 0.000 description 3
- LGIKBBLQVSWUGK-DCAQKATOSA-N Gln-Leu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LGIKBBLQVSWUGK-DCAQKATOSA-N 0.000 description 3
- MLSKFHLRFVGNLL-WDCWCFNPSA-N Gln-Leu-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MLSKFHLRFVGNLL-WDCWCFNPSA-N 0.000 description 3
- IHSGESFHTMFHRB-GUBZILKMSA-N Gln-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O IHSGESFHTMFHRB-GUBZILKMSA-N 0.000 description 3
- WEAVZFWWIPIANL-SRVKXCTJSA-N Gln-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N WEAVZFWWIPIANL-SRVKXCTJSA-N 0.000 description 3
- JNVGVECJCOZHCN-DRZSPHRISA-N Gln-Phe-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O JNVGVECJCOZHCN-DRZSPHRISA-N 0.000 description 3
- WBYHRQBKJGEBQJ-CIUDSAMLSA-N Gln-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CS)C(=O)O WBYHRQBKJGEBQJ-CIUDSAMLSA-N 0.000 description 3
- XQDGOJPVMSWZSO-SRVKXCTJSA-N Gln-Pro-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N XQDGOJPVMSWZSO-SRVKXCTJSA-N 0.000 description 3
- WLRYGVYQFXRJDA-DCAQKATOSA-N Gln-Pro-Pro Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 WLRYGVYQFXRJDA-DCAQKATOSA-N 0.000 description 3
- KUBFPYIMAGXGBT-ACZMJKKPSA-N Gln-Ser-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O KUBFPYIMAGXGBT-ACZMJKKPSA-N 0.000 description 3
- FHPXTPQBODWBIY-CIUDSAMLSA-N Glu-Ala-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHPXTPQBODWBIY-CIUDSAMLSA-N 0.000 description 3
- IRDASPPCLZIERZ-XHNCKOQMSA-N Glu-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N IRDASPPCLZIERZ-XHNCKOQMSA-N 0.000 description 3
- RCCDHXSRMWCOOY-GUBZILKMSA-N Glu-Arg-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCCDHXSRMWCOOY-GUBZILKMSA-N 0.000 description 3
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 3
- KKCUFHUTMKQQCF-SRVKXCTJSA-N Glu-Arg-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O KKCUFHUTMKQQCF-SRVKXCTJSA-N 0.000 description 3
- OJGLIOXAKGFFDW-SRVKXCTJSA-N Glu-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N OJGLIOXAKGFFDW-SRVKXCTJSA-N 0.000 description 3
- LXAUHIRMWXQRKI-XHNCKOQMSA-N Glu-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O LXAUHIRMWXQRKI-XHNCKOQMSA-N 0.000 description 3
- RDPOETHPAQEGDP-ACZMJKKPSA-N Glu-Asp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RDPOETHPAQEGDP-ACZMJKKPSA-N 0.000 description 3
- ILGFBUGLBSAQQB-GUBZILKMSA-N Glu-Glu-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ILGFBUGLBSAQQB-GUBZILKMSA-N 0.000 description 3
- LGYZYFFDELZWRS-DCAQKATOSA-N Glu-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O LGYZYFFDELZWRS-DCAQKATOSA-N 0.000 description 3
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 3
- MTAOBYXRYJZRGQ-WDSKDSINSA-N Glu-Gly-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MTAOBYXRYJZRGQ-WDSKDSINSA-N 0.000 description 3
- WRNAXCVRSBBKGS-BQBZGAKWSA-N Glu-Gly-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O WRNAXCVRSBBKGS-BQBZGAKWSA-N 0.000 description 3
- RAUDKMVXNOWDLS-WDSKDSINSA-N Glu-Gly-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O RAUDKMVXNOWDLS-WDSKDSINSA-N 0.000 description 3
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 3
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 3
- INGJLBQKTRJLFO-UKJIMTQDSA-N Glu-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O INGJLBQKTRJLFO-UKJIMTQDSA-N 0.000 description 3
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 3
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 3
- IVGJYOOGJLFKQE-AVGNSLFASA-N Glu-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N IVGJYOOGJLFKQE-AVGNSLFASA-N 0.000 description 3
- BCYGDJXHAGZNPQ-DCAQKATOSA-N Glu-Lys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O BCYGDJXHAGZNPQ-DCAQKATOSA-N 0.000 description 3
- HRBYTAIBKPNZKQ-AVGNSLFASA-N Glu-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O HRBYTAIBKPNZKQ-AVGNSLFASA-N 0.000 description 3
- QDMVXRNLOPTPIE-WDCWCFNPSA-N Glu-Lys-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QDMVXRNLOPTPIE-WDCWCFNPSA-N 0.000 description 3
- YTRBQAQSUDSIQE-FHWLQOOXSA-N Glu-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 YTRBQAQSUDSIQE-FHWLQOOXSA-N 0.000 description 3
- DXVOKNVIKORTHQ-GUBZILKMSA-N Glu-Pro-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O DXVOKNVIKORTHQ-GUBZILKMSA-N 0.000 description 3
- QOXDAWODGSIDDI-GUBZILKMSA-N Glu-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N QOXDAWODGSIDDI-GUBZILKMSA-N 0.000 description 3
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 3
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 3
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 3
- ZQNCUVODKOBSSO-XEGUGMAKSA-N Glu-Trp-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O ZQNCUVODKOBSSO-XEGUGMAKSA-N 0.000 description 3
- MFYLRRCYBBJYPI-JYJNAYRXSA-N Glu-Tyr-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O MFYLRRCYBBJYPI-JYJNAYRXSA-N 0.000 description 3
- 108010070675 Glutathione transferase Proteins 0.000 description 3
- 102000005720 Glutathione transferase Human genes 0.000 description 3
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 3
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 3
- QXPRJQPCFXMCIY-NKWVEPMBSA-N Gly-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN QXPRJQPCFXMCIY-NKWVEPMBSA-N 0.000 description 3
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 3
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 3
- XQHSBNVACKQWAV-WHFBIAKZSA-N Gly-Asp-Asn Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XQHSBNVACKQWAV-WHFBIAKZSA-N 0.000 description 3
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 3
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 3
- SOEATRRYCIPEHA-BQBZGAKWSA-N Gly-Glu-Glu Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SOEATRRYCIPEHA-BQBZGAKWSA-N 0.000 description 3
- XTQFHTHIAKKCTM-YFKPBYRVSA-N Gly-Glu-Gly Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O XTQFHTHIAKKCTM-YFKPBYRVSA-N 0.000 description 3
- YYPFZVIXAVDHIK-IUCAKERBSA-N Gly-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN YYPFZVIXAVDHIK-IUCAKERBSA-N 0.000 description 3
- BEQGFMIBZFNROK-JGVFFNPUSA-N Gly-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)CN)C(=O)O BEQGFMIBZFNROK-JGVFFNPUSA-N 0.000 description 3
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 3
- GDOZQTNZPCUARW-YFKPBYRVSA-N Gly-Gly-Glu Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O GDOZQTNZPCUARW-YFKPBYRVSA-N 0.000 description 3
- FCKPEGOCSVZPNC-WHOFXGATSA-N Gly-Ile-Phe Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FCKPEGOCSVZPNC-WHOFXGATSA-N 0.000 description 3
- BHPQOIPBLYJNAW-NGZCFLSTSA-N Gly-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN BHPQOIPBLYJNAW-NGZCFLSTSA-N 0.000 description 3
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 3
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 3
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 3
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 3
- YSDLIYZLOTZZNP-UWVGGRQHSA-N Gly-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN YSDLIYZLOTZZNP-UWVGGRQHSA-N 0.000 description 3
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 3
- PDUHNKAFQXQNLH-ZETCQYMHSA-N Gly-Lys-Gly Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)NCC(O)=O PDUHNKAFQXQNLH-ZETCQYMHSA-N 0.000 description 3
- NTBOEZICHOSJEE-YUMQZZPRSA-N Gly-Lys-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NTBOEZICHOSJEE-YUMQZZPRSA-N 0.000 description 3
- FXGRXIATVXUAHO-WEDXCCLWSA-N Gly-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCCN FXGRXIATVXUAHO-WEDXCCLWSA-N 0.000 description 3
- OQQKUTVULYLCDG-ONGXEEELSA-N Gly-Lys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)CN)C(O)=O OQQKUTVULYLCDG-ONGXEEELSA-N 0.000 description 3
- GGAPHLIUUTVYMX-QWRGUYRKSA-N Gly-Phe-Ser Chemical compound OC[C@@H](C([O-])=O)NC(=O)[C@@H](NC(=O)C[NH3+])CC1=CC=CC=C1 GGAPHLIUUTVYMX-QWRGUYRKSA-N 0.000 description 3
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 3
- WDXLKVQATNEAJQ-BQBZGAKWSA-N Gly-Pro-Asp Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WDXLKVQATNEAJQ-BQBZGAKWSA-N 0.000 description 3
- NSVOVKWEKGEOQB-LURJTMIESA-N Gly-Pro-Gly Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(O)=O NSVOVKWEKGEOQB-LURJTMIESA-N 0.000 description 3
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 3
- WCORRBXVISTKQL-WHFBIAKZSA-N Gly-Ser-Ser Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WCORRBXVISTKQL-WHFBIAKZSA-N 0.000 description 3
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 3
- FKYQEVBRZSFAMJ-QWRGUYRKSA-N Gly-Ser-Tyr Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FKYQEVBRZSFAMJ-QWRGUYRKSA-N 0.000 description 3
- ZLCLYFGMKFCDCN-XPUUQOCRSA-N Gly-Ser-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CO)NC(=O)CN)C(O)=O ZLCLYFGMKFCDCN-XPUUQOCRSA-N 0.000 description 3
- CQMFNTVQVLQRLT-JHEQGTHGSA-N Gly-Thr-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CQMFNTVQVLQRLT-JHEQGTHGSA-N 0.000 description 3
- KBBFOULZCHWGJX-KBPBESRZSA-N Gly-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN)O KBBFOULZCHWGJX-KBPBESRZSA-N 0.000 description 3
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 3
- PMWSGVRIMIFXQH-KKUMJFAQSA-N His-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 PMWSGVRIMIFXQH-KKUMJFAQSA-N 0.000 description 3
- UROVZOUMHNXPLZ-AVGNSLFASA-N His-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 UROVZOUMHNXPLZ-AVGNSLFASA-N 0.000 description 3
- NKVZTQVGUNLLQW-JBDRJPRFSA-N Ile-Ala-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)O)N NKVZTQVGUNLLQW-JBDRJPRFSA-N 0.000 description 3
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 3
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 3
- JDAWAWXGAUZPNJ-ZPFDUUQYSA-N Ile-Glu-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N JDAWAWXGAUZPNJ-ZPFDUUQYSA-N 0.000 description 3
- TWYOYAKMLHWMOJ-ZPFDUUQYSA-N Ile-Leu-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O TWYOYAKMLHWMOJ-ZPFDUUQYSA-N 0.000 description 3
- YKZAMJXNJUWFIK-JBDRJPRFSA-N Ile-Ser-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(=O)O)N YKZAMJXNJUWFIK-JBDRJPRFSA-N 0.000 description 3
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 3
- RKQAYOWLSFLJEE-SVSWQMSJSA-N Ile-Thr-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N RKQAYOWLSFLJEE-SVSWQMSJSA-N 0.000 description 3
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 3
- CQQGCWPXDHTTNF-GUBZILKMSA-N Leu-Ala-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O CQQGCWPXDHTTNF-GUBZILKMSA-N 0.000 description 3
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 3
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 3
- CNNQBZRGQATKNY-DCAQKATOSA-N Leu-Arg-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N CNNQBZRGQATKNY-DCAQKATOSA-N 0.000 description 3
- FJUKMPUELVROGK-IHRRRGAJSA-N Leu-Arg-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N FJUKMPUELVROGK-IHRRRGAJSA-N 0.000 description 3
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 3
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 3
- KKXDHFKZWKLYGB-GUBZILKMSA-N Leu-Asn-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKXDHFKZWKLYGB-GUBZILKMSA-N 0.000 description 3
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 3
- WGNOPSQMIQERPK-UHFFFAOYSA-N Leu-Asn-Pro Natural products CC(C)CC(N)C(=O)NC(CC(=O)N)C(=O)N1CCCC1C(=O)O WGNOPSQMIQERPK-UHFFFAOYSA-N 0.000 description 3
- YKNBJXOJTURHCU-DCAQKATOSA-N Leu-Asp-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKNBJXOJTURHCU-DCAQKATOSA-N 0.000 description 3
- ZURHXHNAEJJRNU-CIUDSAMLSA-N Leu-Asp-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZURHXHNAEJJRNU-CIUDSAMLSA-N 0.000 description 3
- MYGQXVYRZMKRDB-SRVKXCTJSA-N Leu-Asp-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN MYGQXVYRZMKRDB-SRVKXCTJSA-N 0.000 description 3
- QCSFMCFHVGTLFF-NHCYSSNCSA-N Leu-Asp-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O QCSFMCFHVGTLFF-NHCYSSNCSA-N 0.000 description 3
- LOLUPZNNADDTAA-AVGNSLFASA-N Leu-Gln-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LOLUPZNNADDTAA-AVGNSLFASA-N 0.000 description 3
- CIVKXGPFXDIQBV-WDCWCFNPSA-N Leu-Gln-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CIVKXGPFXDIQBV-WDCWCFNPSA-N 0.000 description 3
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 3
- LAPSXOAUPNOINL-YUMQZZPRSA-N Leu-Gly-Asp Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O LAPSXOAUPNOINL-YUMQZZPRSA-N 0.000 description 3
- FIYMBBHGYNQFOP-IUCAKERBSA-N Leu-Gly-Gln Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N FIYMBBHGYNQFOP-IUCAKERBSA-N 0.000 description 3
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 3
- HMDDEJADNKQTBR-BZSNNMDCSA-N Leu-His-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O HMDDEJADNKQTBR-BZSNNMDCSA-N 0.000 description 3
- KYIIALJHAOIAHF-KKUMJFAQSA-N Leu-Leu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KYIIALJHAOIAHF-KKUMJFAQSA-N 0.000 description 3
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 3
- HVHRPWQEQHIQJF-AVGNSLFASA-N Leu-Lys-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HVHRPWQEQHIQJF-AVGNSLFASA-N 0.000 description 3
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 3
- VULJUQZPSOASBZ-SRVKXCTJSA-N Leu-Pro-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O VULJUQZPSOASBZ-SRVKXCTJSA-N 0.000 description 3
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 3
- KZZCOWMDDXDKSS-CIUDSAMLSA-N Leu-Ser-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KZZCOWMDDXDKSS-CIUDSAMLSA-N 0.000 description 3
- IZPVWNSAVUQBGP-CIUDSAMLSA-N Leu-Ser-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IZPVWNSAVUQBGP-CIUDSAMLSA-N 0.000 description 3
- IWMJFLJQHIDZQW-KKUMJFAQSA-N Leu-Ser-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IWMJFLJQHIDZQW-KKUMJFAQSA-N 0.000 description 3
- ODRREERHVHMIPT-OEAJRASXSA-N Leu-Thr-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ODRREERHVHMIPT-OEAJRASXSA-N 0.000 description 3
- AIQWYVFNBNNOLU-RHYQMDGZSA-N Leu-Thr-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O AIQWYVFNBNNOLU-RHYQMDGZSA-N 0.000 description 3
- JGKHAFUAPZCCDU-BZSNNMDCSA-N Leu-Tyr-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=C(O)C=C1 JGKHAFUAPZCCDU-BZSNNMDCSA-N 0.000 description 3
- BGGTYDNTOYRTTR-MEYUZBJRSA-N Leu-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC(C)C)N)O BGGTYDNTOYRTTR-MEYUZBJRSA-N 0.000 description 3
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 3
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 3
- PNPYKQFJGRFYJE-GUBZILKMSA-N Lys-Ala-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNPYKQFJGRFYJE-GUBZILKMSA-N 0.000 description 3
- LPAJOCKCPRZEAG-MNXVOIDGSA-N Lys-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCCN LPAJOCKCPRZEAG-MNXVOIDGSA-N 0.000 description 3
- IMAKMJCBYCSMHM-AVGNSLFASA-N Lys-Glu-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN IMAKMJCBYCSMHM-AVGNSLFASA-N 0.000 description 3
- KNKJPYAZQUFLQK-IHRRRGAJSA-N Lys-His-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCCCN)N KNKJPYAZQUFLQK-IHRRRGAJSA-N 0.000 description 3
- WRODMZBHNNPRLN-SRVKXCTJSA-N Lys-Leu-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O WRODMZBHNNPRLN-SRVKXCTJSA-N 0.000 description 3
- GZGWILAQHOVXTD-DCAQKATOSA-N Lys-Met-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O GZGWILAQHOVXTD-DCAQKATOSA-N 0.000 description 3
- UQJOKDAYFULYIX-AVGNSLFASA-N Lys-Pro-Pro Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 UQJOKDAYFULYIX-AVGNSLFASA-N 0.000 description 3
- YTJFXEDRUOQGSP-DCAQKATOSA-N Lys-Pro-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O YTJFXEDRUOQGSP-DCAQKATOSA-N 0.000 description 3
- IOQWIOPSKJOEKI-SRVKXCTJSA-N Lys-Ser-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O IOQWIOPSKJOEKI-SRVKXCTJSA-N 0.000 description 3
- DRRXXZBXDMLGFC-IHRRRGAJSA-N Lys-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN DRRXXZBXDMLGFC-IHRRRGAJSA-N 0.000 description 3
- XZFYRXDAULDNFX-UHFFFAOYSA-N N-L-cysteinyl-L-phenylalanine Natural products SCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 XZFYRXDAULDNFX-UHFFFAOYSA-N 0.000 description 3
- 108010047562 NGR peptide Proteins 0.000 description 3
- 108091034117 Oligonucleotide Proteins 0.000 description 3
- MUBZPKHOEPUJKR-UHFFFAOYSA-N Oxalic acid Chemical compound OC(=O)C(O)=O MUBZPKHOEPUJKR-UHFFFAOYSA-N 0.000 description 3
- LWPMGKSZPKFKJD-DZKIICNBSA-N Phe-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O LWPMGKSZPKFKJD-DZKIICNBSA-N 0.000 description 3
- AXIOGMQCDYVTNY-ACRUOGEOSA-N Phe-Phe-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 AXIOGMQCDYVTNY-ACRUOGEOSA-N 0.000 description 3
- WWPAHTZOWURIMR-ULQDDVLXSA-N Phe-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 WWPAHTZOWURIMR-ULQDDVLXSA-N 0.000 description 3
- WEDZFLRYSIDIRX-IHRRRGAJSA-N Phe-Ser-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=CC=C1 WEDZFLRYSIDIRX-IHRRRGAJSA-N 0.000 description 3
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 3
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 3
- VXCHGLYSIOOZIS-GUBZILKMSA-N Pro-Ala-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 VXCHGLYSIOOZIS-GUBZILKMSA-N 0.000 description 3
- KIZQGKLMXKGDIV-BQBZGAKWSA-N Pro-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 KIZQGKLMXKGDIV-BQBZGAKWSA-N 0.000 description 3
- LNLNHXIQPGKRJQ-SRVKXCTJSA-N Pro-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 LNLNHXIQPGKRJQ-SRVKXCTJSA-N 0.000 description 3
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 3
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 3
- CYQQWUPHIZVCNY-GUBZILKMSA-N Pro-Arg-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CYQQWUPHIZVCNY-GUBZILKMSA-N 0.000 description 3
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 3
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 3
- QCARZLHECSFOGG-CIUDSAMLSA-N Pro-Glu-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O QCARZLHECSFOGG-CIUDSAMLSA-N 0.000 description 3
- XQSREVQDGCPFRJ-STQMWFEESA-N Pro-Gly-Phe Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XQSREVQDGCPFRJ-STQMWFEESA-N 0.000 description 3
- GURGCNUWVSDYTP-SRVKXCTJSA-N Pro-Leu-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GURGCNUWVSDYTP-SRVKXCTJSA-N 0.000 description 3
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 3
- ZUZINZIJHJFJRN-UBHSHLNASA-N Pro-Phe-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 ZUZINZIJHJFJRN-UBHSHLNASA-N 0.000 description 3
- KDBHVPXBQADZKY-GUBZILKMSA-N Pro-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KDBHVPXBQADZKY-GUBZILKMSA-N 0.000 description 3
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 3
- FDMKYQQYJKYCLV-GUBZILKMSA-N Pro-Pro-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 FDMKYQQYJKYCLV-GUBZILKMSA-N 0.000 description 3
- FNGOXVQBBCMFKV-CIUDSAMLSA-N Pro-Ser-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O FNGOXVQBBCMFKV-CIUDSAMLSA-N 0.000 description 3
- MDAWMJUZHBQTBO-XGEHTFHBSA-N Pro-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1)O MDAWMJUZHBQTBO-XGEHTFHBSA-N 0.000 description 3
- GBUNEGKQPSAMNK-QTKMDUPCSA-N Pro-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2)O GBUNEGKQPSAMNK-QTKMDUPCSA-N 0.000 description 3
- FDMCIBSQRKFSTJ-RHYQMDGZSA-N Pro-Thr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O FDMCIBSQRKFSTJ-RHYQMDGZSA-N 0.000 description 3
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 3
- FYXCBXDAMPEHIQ-FHWLQOOXSA-N Pro-Trp-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CCCCN)C(=O)O FYXCBXDAMPEHIQ-FHWLQOOXSA-N 0.000 description 3
- YDTUEBLEAVANFH-RCWTZXSCSA-N Pro-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 YDTUEBLEAVANFH-RCWTZXSCSA-N 0.000 description 3
- 108010003201 RGH 0205 Proteins 0.000 description 3
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 3
- HBZBPFLJNDXRAY-FXQIFTODSA-N Ser-Ala-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O HBZBPFLJNDXRAY-FXQIFTODSA-N 0.000 description 3
- RZUOXAKGNHXZTB-GUBZILKMSA-N Ser-Arg-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O RZUOXAKGNHXZTB-GUBZILKMSA-N 0.000 description 3
- MMAPOBOTRUVNKJ-ZLUOBGJFSA-N Ser-Asp-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CO)N)C(=O)O MMAPOBOTRUVNKJ-ZLUOBGJFSA-N 0.000 description 3
- KCFKKAQKRZBWJB-ZLUOBGJFSA-N Ser-Cys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O KCFKKAQKRZBWJB-ZLUOBGJFSA-N 0.000 description 3
- SWIQQMYVHIXPEK-FXQIFTODSA-N Ser-Cys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O SWIQQMYVHIXPEK-FXQIFTODSA-N 0.000 description 3
- SQBLRDDJTUJDMV-ACZMJKKPSA-N Ser-Glu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQBLRDDJTUJDMV-ACZMJKKPSA-N 0.000 description 3
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 3
- IXCHOHLPHNGFTJ-YUMQZZPRSA-N Ser-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CO)N IXCHOHLPHNGFTJ-YUMQZZPRSA-N 0.000 description 3
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 3
- CICQXRWZNVXFCU-SRVKXCTJSA-N Ser-His-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O CICQXRWZNVXFCU-SRVKXCTJSA-N 0.000 description 3
- NLOAIFSWUUFQFR-CIUDSAMLSA-N Ser-Leu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O NLOAIFSWUUFQFR-CIUDSAMLSA-N 0.000 description 3
- IUXGJEIKJBYKOO-SRVKXCTJSA-N Ser-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N IUXGJEIKJBYKOO-SRVKXCTJSA-N 0.000 description 3
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 3
- IXZHZUGGKLRHJD-DCAQKATOSA-N Ser-Leu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O IXZHZUGGKLRHJD-DCAQKATOSA-N 0.000 description 3
- JAWGSPUJAXYXJA-IHRRRGAJSA-N Ser-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=CC=C1 JAWGSPUJAXYXJA-IHRRRGAJSA-N 0.000 description 3
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 3
- DINQYZRMXGWWTG-GUBZILKMSA-N Ser-Pro-Pro Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DINQYZRMXGWWTG-GUBZILKMSA-N 0.000 description 3
- HHJFMHQYEAAOBM-ZLUOBGJFSA-N Ser-Ser-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O HHJFMHQYEAAOBM-ZLUOBGJFSA-N 0.000 description 3
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 3
- NVNPWELENFJOHH-CIUDSAMLSA-N Ser-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CO)N NVNPWELENFJOHH-CIUDSAMLSA-N 0.000 description 3
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 3
- BDMWLJLPPUCLNV-XGEHTFHBSA-N Ser-Thr-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BDMWLJLPPUCLNV-XGEHTFHBSA-N 0.000 description 3
- PLQWGQUNUPMNOD-KKUMJFAQSA-N Ser-Tyr-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O PLQWGQUNUPMNOD-KKUMJFAQSA-N 0.000 description 3
- PMTWIUBUQRGCSB-FXQIFTODSA-N Ser-Val-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O PMTWIUBUQRGCSB-FXQIFTODSA-N 0.000 description 3
- BEBVVQPDSHHWQL-NRPADANISA-N Ser-Val-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O BEBVVQPDSHHWQL-NRPADANISA-N 0.000 description 3
- YEDSOSIKVUMIJE-DCAQKATOSA-N Ser-Val-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O YEDSOSIKVUMIJE-DCAQKATOSA-N 0.000 description 3
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 3
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 3
- FQPQPTHMHZKGFM-XQXXSGGOSA-N Thr-Ala-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O FQPQPTHMHZKGFM-XQXXSGGOSA-N 0.000 description 3
- KEGBFULVYKYJRD-LFSVMHDDSA-N Thr-Ala-Phe Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KEGBFULVYKYJRD-LFSVMHDDSA-N 0.000 description 3
- DWYAUVCQDTZIJI-VZFHVOOUSA-N Thr-Ala-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DWYAUVCQDTZIJI-VZFHVOOUSA-N 0.000 description 3
- YBXMGKCLOPDEKA-NUMRIWBASA-N Thr-Asp-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O YBXMGKCLOPDEKA-NUMRIWBASA-N 0.000 description 3
- OHAJHDJOCKKJLV-LKXGYXEUSA-N Thr-Asp-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OHAJHDJOCKKJLV-LKXGYXEUSA-N 0.000 description 3
- QILPDQCTQZDHFM-HJGDQZAQSA-N Thr-Gln-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QILPDQCTQZDHFM-HJGDQZAQSA-N 0.000 description 3
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 3
- UDQBCBUXAQIZAK-GLLZPBPUSA-N Thr-Glu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDQBCBUXAQIZAK-GLLZPBPUSA-N 0.000 description 3
- SLUWOCTZVGMURC-BFHQHQDPSA-N Thr-Gly-Ala Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O SLUWOCTZVGMURC-BFHQHQDPSA-N 0.000 description 3
- VYEHBMMAJFVTOI-JHEQGTHGSA-N Thr-Gly-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O VYEHBMMAJFVTOI-JHEQGTHGSA-N 0.000 description 3
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 3
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 3
- XOWKUMFHEZLKLT-CIQUZCHMSA-N Thr-Ile-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O XOWKUMFHEZLKLT-CIQUZCHMSA-N 0.000 description 3
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 3
- VRUFCJZQDACGLH-UVOCVTCTSA-N Thr-Leu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VRUFCJZQDACGLH-UVOCVTCTSA-N 0.000 description 3
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 3
- DOBIBIXIHJKVJF-XKBZYTNZSA-N Thr-Ser-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DOBIBIXIHJKVJF-XKBZYTNZSA-N 0.000 description 3
- XHWCDRUPDNSDAZ-XKBZYTNZSA-N Thr-Ser-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O XHWCDRUPDNSDAZ-XKBZYTNZSA-N 0.000 description 3
- AHERARIZBPOMNU-KATARQTJSA-N Thr-Ser-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O AHERARIZBPOMNU-KATARQTJSA-N 0.000 description 3
- IQPWNQRRAJHOKV-KATARQTJSA-N Thr-Ser-Lys Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN IQPWNQRRAJHOKV-KATARQTJSA-N 0.000 description 3
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 3
- NHQVWACSJZJCGJ-FLBSBUHZSA-N Thr-Thr-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NHQVWACSJZJCGJ-FLBSBUHZSA-N 0.000 description 3
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 3
- ZMYCLHFLHRVOEA-HEIBUPTGSA-N Thr-Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ZMYCLHFLHRVOEA-HEIBUPTGSA-N 0.000 description 3
- ZOCJFNXUVSGBQI-HSHDSVGOSA-N Thr-Trp-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O ZOCJFNXUVSGBQI-HSHDSVGOSA-N 0.000 description 3
- WSGPBCAGEGHKQJ-BBRMVZONSA-N Trp-Gly-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WSGPBCAGEGHKQJ-BBRMVZONSA-N 0.000 description 3
- UJGDFQRPYGJBEH-AAEUAGOBSA-N Trp-Ser-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N UJGDFQRPYGJBEH-AAEUAGOBSA-N 0.000 description 3
- QPOUERMDWKKZEG-HJPIBITLSA-N Tyr-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QPOUERMDWKKZEG-HJPIBITLSA-N 0.000 description 3
- GPLTZEMVOCZVAV-UFYCRDLUSA-N Tyr-Tyr-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=C(O)C=C1 GPLTZEMVOCZVAV-UFYCRDLUSA-N 0.000 description 3
- IZFVRRYRMQFVGX-NRPADANISA-N Val-Ala-Gln Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N IZFVRRYRMQFVGX-NRPADANISA-N 0.000 description 3
- ASQFIHTXXMFENG-XPUUQOCRSA-N Val-Ala-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O ASQFIHTXXMFENG-XPUUQOCRSA-N 0.000 description 3
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 3
- CVUDMNSZAIZFAE-UHFFFAOYSA-N Val-Arg-Pro Natural products NC(N)=NCCCC(NC(=O)C(N)C(C)C)C(=O)N1CCCC1C(O)=O CVUDMNSZAIZFAE-UHFFFAOYSA-N 0.000 description 3
- YCMXFKWYJFZFKS-LAEOZQHASA-N Val-Gln-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCMXFKWYJFZFKS-LAEOZQHASA-N 0.000 description 3
- QHFQQRKNGCXTHL-AUTRQRHGSA-N Val-Gln-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QHFQQRKNGCXTHL-AUTRQRHGSA-N 0.000 description 3
- VFOHXOLPLACADK-GVXVVHGQSA-N Val-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N VFOHXOLPLACADK-GVXVVHGQSA-N 0.000 description 3
- YTPLVNUZZOBFFC-SCZZXKLOSA-N Val-Gly-Pro Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N1CCC[C@@H]1C(O)=O YTPLVNUZZOBFFC-SCZZXKLOSA-N 0.000 description 3
- BTWMICVCQLKKNR-DCAQKATOSA-N Val-Leu-Ser Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C([O-])=O BTWMICVCQLKKNR-DCAQKATOSA-N 0.000 description 3
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 3
- CKTMJBPRVQWPHU-JSGCOSHPSA-N Val-Phe-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)O)N CKTMJBPRVQWPHU-JSGCOSHPSA-N 0.000 description 3
- UGFMVXRXULGLNO-XPUUQOCRSA-N Val-Ser-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O UGFMVXRXULGLNO-XPUUQOCRSA-N 0.000 description 3
- UQMPYVLTQCGRSK-IFFSRLJSSA-N Val-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N)O UQMPYVLTQCGRSK-IFFSRLJSSA-N 0.000 description 3
- PFMSJVIPEZMKSC-DZKIICNBSA-N Val-Tyr-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PFMSJVIPEZMKSC-DZKIICNBSA-N 0.000 description 3
- RTJPAGFXOWEBAI-SRVKXCTJSA-N Val-Val-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RTJPAGFXOWEBAI-SRVKXCTJSA-N 0.000 description 3
- 108010081404 acein-2 Proteins 0.000 description 3
- 108010078114 alanyl-tryptophyl-alanine Proteins 0.000 description 3
- 108010041407 alanylaspartic acid Proteins 0.000 description 3
- 108010050025 alpha-glutamyltryptophan Proteins 0.000 description 3
- 125000003277 amino group Chemical group 0.000 description 3
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 3
- 108010038850 arginyl-isoleucyl-tyrosine Proteins 0.000 description 3
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 3
- -1 aromatic amino acids Chemical class 0.000 description 3
- KRKNYBCHXYNGOX-UHFFFAOYSA-N citric acid Chemical compound OC(=O)CC(O)(C(O)=O)CC(O)=O KRKNYBCHXYNGOX-UHFFFAOYSA-N 0.000 description 3
- 238000010367 cloning Methods 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 238000003745 diagnosis Methods 0.000 description 3
- 229940079593 drug Drugs 0.000 description 3
- 125000002485 formyl group Chemical group [H]C(*)=O 0.000 description 3
- 125000000524 functional group Chemical group 0.000 description 3
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 3
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 3
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 3
- 108010025801 glycyl-prolyl-arginine Proteins 0.000 description 3
- 108010084389 glycyltryptophan Proteins 0.000 description 3
- 238000001727 in vivo Methods 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 3
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 3
- 108010044348 lysyl-glutamyl-aspartic acid Proteins 0.000 description 3
- 108010016686 methionyl-alanyl-serine Proteins 0.000 description 3
- 108010022588 methionyl-lysyl-proline Proteins 0.000 description 3
- 108010084572 phenylalanyl-valine Proteins 0.000 description 3
- 108010025488 pinealon Proteins 0.000 description 3
- 239000013615 primer Substances 0.000 description 3
- 239000000047 product Substances 0.000 description 3
- 108010093296 prolyl-prolyl-alanine Proteins 0.000 description 3
- 108010015796 prolylisoleucine Proteins 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 239000000523 sample Substances 0.000 description 3
- 108010007375 seryl-seryl-seryl-arginine Proteins 0.000 description 3
- 238000003756 stirring Methods 0.000 description 3
- 108010031491 threonyl-lysyl-glutamic acid Proteins 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 108010029384 tryptophyl-histidine Proteins 0.000 description 3
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 3
- 108010009962 valyltyrosine Proteins 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 2
- NTUPOKHATNSWCY-PMPSAXMXSA-N (2s)-2-[[(2s)-1-[(2r)-2-amino-3-phenylpropanoyl]pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoic acid Chemical compound C([C@@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)C1=CC=CC=C1 NTUPOKHATNSWCY-PMPSAXMXSA-N 0.000 description 2
- CUVSTAMIHSSVKL-UWVGGRQHSA-N (4s)-4-[(2-aminoacetyl)amino]-5-[[(2s)-6-amino-1-(carboxymethylamino)-1-oxohexan-2-yl]amino]-5-oxopentanoic acid Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN CUVSTAMIHSSVKL-UWVGGRQHSA-N 0.000 description 2
- 108010036211 5-HT-moduline Proteins 0.000 description 2
- UWQJHXKARZWDIJ-ZLUOBGJFSA-N Ala-Ala-Cys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O UWQJHXKARZWDIJ-ZLUOBGJFSA-N 0.000 description 2
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 2
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 2
- UCIYCBSJBQGDGM-LPEHRKFASA-N Ala-Arg-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N UCIYCBSJBQGDGM-LPEHRKFASA-N 0.000 description 2
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 2
- NXSFUECZFORGOG-CIUDSAMLSA-N Ala-Asn-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NXSFUECZFORGOG-CIUDSAMLSA-N 0.000 description 2
- KIUYPHAMDKDICO-WHFBIAKZSA-N Ala-Asp-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KIUYPHAMDKDICO-WHFBIAKZSA-N 0.000 description 2
- BUDNAJYVCUHLSV-ZLUOBGJFSA-N Ala-Asp-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O BUDNAJYVCUHLSV-ZLUOBGJFSA-N 0.000 description 2
- KRHRBKYBJXMYBB-WHFBIAKZSA-N Ala-Cys-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O KRHRBKYBJXMYBB-WHFBIAKZSA-N 0.000 description 2
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 2
- BLGHHPHXVJWCNK-GUBZILKMSA-N Ala-Gln-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BLGHHPHXVJWCNK-GUBZILKMSA-N 0.000 description 2
- IXTPACPAXIOCRG-ACZMJKKPSA-N Ala-Glu-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N IXTPACPAXIOCRG-ACZMJKKPSA-N 0.000 description 2
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 2
- HMRWQTHUDVXMGH-GUBZILKMSA-N Ala-Glu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HMRWQTHUDVXMGH-GUBZILKMSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 2
- BLIMFWGRQKRCGT-YUMQZZPRSA-N Ala-Gly-Lys Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCCN BLIMFWGRQKRCGT-YUMQZZPRSA-N 0.000 description 2
- SMCGQGDVTPFXKB-XPUUQOCRSA-N Ala-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N SMCGQGDVTPFXKB-XPUUQOCRSA-N 0.000 description 2
- HJGZVLLLBJLXFC-LSJOCFKGSA-N Ala-His-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(O)=O HJGZVLLLBJLXFC-LSJOCFKGSA-N 0.000 description 2
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 2
- GSHKMNKPMLXSQW-KBIXCLLPSA-N Ala-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C)N GSHKMNKPMLXSQW-KBIXCLLPSA-N 0.000 description 2
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 2
- HHRAXZAYZFFRAM-CIUDSAMLSA-N Ala-Leu-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O HHRAXZAYZFFRAM-CIUDSAMLSA-N 0.000 description 2
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 2
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 2
- VCSABYLVNWQYQE-SRVKXCTJSA-N Ala-Lys-Lys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCCN)C(O)=O VCSABYLVNWQYQE-SRVKXCTJSA-N 0.000 description 2
- VCSABYLVNWQYQE-UHFFFAOYSA-N Ala-Lys-Lys Natural products NCCCCC(NC(=O)C(N)C)C(=O)NC(CCCCN)C(O)=O VCSABYLVNWQYQE-UHFFFAOYSA-N 0.000 description 2
- KQESEZXHYOUIIM-CQDKDKBSSA-N Ala-Lys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KQESEZXHYOUIIM-CQDKDKBSSA-N 0.000 description 2
- MDNAVFBZPROEHO-UHFFFAOYSA-N Ala-Lys-Val Natural products CC(C)C(C(O)=O)NC(=O)C(NC(=O)C(C)N)CCCCN MDNAVFBZPROEHO-UHFFFAOYSA-N 0.000 description 2
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 2
- XRUJOVRWNMBAAA-NHCYSSNCSA-N Ala-Phe-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 XRUJOVRWNMBAAA-NHCYSSNCSA-N 0.000 description 2
- CNQAFFMNJIQYGX-DRZSPHRISA-N Ala-Phe-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 CNQAFFMNJIQYGX-DRZSPHRISA-N 0.000 description 2
- ZBLQIYPCUWZSRZ-QEJZJMRPSA-N Ala-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 ZBLQIYPCUWZSRZ-QEJZJMRPSA-N 0.000 description 2
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 2
- YCRAFFCYWOUEOF-DLOVCJGASA-N Ala-Phe-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 YCRAFFCYWOUEOF-DLOVCJGASA-N 0.000 description 2
- CYBJZLQSUJEMAS-LFSVMHDDSA-N Ala-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C)N)O CYBJZLQSUJEMAS-LFSVMHDDSA-N 0.000 description 2
- IHMCQESUJVZTKW-UBHSHLNASA-N Ala-Phe-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CC=CC=C1 IHMCQESUJVZTKW-UBHSHLNASA-N 0.000 description 2
- IPZQNYYAYVRKKK-FXQIFTODSA-N Ala-Pro-Ala Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IPZQNYYAYVRKKK-FXQIFTODSA-N 0.000 description 2
- FQNILRVJOJBFFC-FXQIFTODSA-N Ala-Pro-Asp Chemical compound C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N FQNILRVJOJBFFC-FXQIFTODSA-N 0.000 description 2
- ADSGHMXEAZJJNF-DCAQKATOSA-N Ala-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N ADSGHMXEAZJJNF-DCAQKATOSA-N 0.000 description 2
- BHTBAVZSZCQZPT-GUBZILKMSA-N Ala-Pro-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N BHTBAVZSZCQZPT-GUBZILKMSA-N 0.000 description 2
- OLVCTPPSXNRGKV-GUBZILKMSA-N Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 OLVCTPPSXNRGKV-GUBZILKMSA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 2
- AUFACLFHBAGZEN-ZLUOBGJFSA-N Ala-Ser-Cys Chemical compound N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O AUFACLFHBAGZEN-ZLUOBGJFSA-N 0.000 description 2
- NZGRHTKZFSVPAN-BIIVOSGPSA-N Ala-Ser-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N NZGRHTKZFSVPAN-BIIVOSGPSA-N 0.000 description 2
- HCBKAOZYACJUEF-XQXXSGGOSA-N Ala-Thr-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(N)=O)C(=O)O HCBKAOZYACJUEF-XQXXSGGOSA-N 0.000 description 2
- CREYEAPXISDKSB-FQPOAREZSA-N Ala-Thr-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CREYEAPXISDKSB-FQPOAREZSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- ZVWXMTTZJKBJCI-BHDSKKPTSA-N Ala-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 ZVWXMTTZJKBJCI-BHDSKKPTSA-N 0.000 description 2
- BOKLLPVAQDSLHC-FXQIFTODSA-N Ala-Val-Cys Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O)N BOKLLPVAQDSLHC-FXQIFTODSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- LYILPUNCKACNGF-NAKRPEOUSA-N Ala-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C)N LYILPUNCKACNGF-NAKRPEOUSA-N 0.000 description 2
- XCIGOVDXZULBBV-DCAQKATOSA-N Ala-Val-Lys Chemical compound CC(C)[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCCCN)C(O)=O XCIGOVDXZULBBV-DCAQKATOSA-N 0.000 description 2
- NLYYHIKRBRMAJV-AEJSXWLSSA-N Ala-Val-Pro Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N NLYYHIKRBRMAJV-AEJSXWLSSA-N 0.000 description 2
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 2
- DBKNLHKEVPZVQC-LPEHRKFASA-N Arg-Ala-Pro Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O DBKNLHKEVPZVQC-LPEHRKFASA-N 0.000 description 2
- OVVUNXXROOFSIM-SDDRHHMPSA-N Arg-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O OVVUNXXROOFSIM-SDDRHHMPSA-N 0.000 description 2
- JTKLCCFLSLCCST-SZMVWBNQSA-N Arg-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)=CNC2=C1 JTKLCCFLSLCCST-SZMVWBNQSA-N 0.000 description 2
- XVLLUZMFSAYKJV-GUBZILKMSA-N Arg-Asp-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XVLLUZMFSAYKJV-GUBZILKMSA-N 0.000 description 2
- JSHVMZANPXCDTL-GMOBBJLQSA-N Arg-Asp-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JSHVMZANPXCDTL-GMOBBJLQSA-N 0.000 description 2
- HKRXJBBCQBAGIM-FXQIFTODSA-N Arg-Asp-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CO)C(=O)O)N)CN=C(N)N HKRXJBBCQBAGIM-FXQIFTODSA-N 0.000 description 2
- IGULQRCJLQQPSM-DCAQKATOSA-N Arg-Cys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IGULQRCJLQQPSM-DCAQKATOSA-N 0.000 description 2
- SVHRPCMZTWZROG-DCAQKATOSA-N Arg-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N SVHRPCMZTWZROG-DCAQKATOSA-N 0.000 description 2
- RWDVGVPHEWOZMO-GUBZILKMSA-N Arg-Cys-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCCNC(N)=N)C(O)=O RWDVGVPHEWOZMO-GUBZILKMSA-N 0.000 description 2
- LMPKCSXZJSXBBL-NHCYSSNCSA-N Arg-Gln-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O LMPKCSXZJSXBBL-NHCYSSNCSA-N 0.000 description 2
- PNQWAUXQDBIJDY-GUBZILKMSA-N Arg-Glu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PNQWAUXQDBIJDY-GUBZILKMSA-N 0.000 description 2
- OGUPCHKBOKJFMA-SRVKXCTJSA-N Arg-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N OGUPCHKBOKJFMA-SRVKXCTJSA-N 0.000 description 2
- HPSVTWMFWCHKFN-GARJFASQSA-N Arg-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O HPSVTWMFWCHKFN-GARJFASQSA-N 0.000 description 2
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 2
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 2
- CYXCAHZVPFREJD-LURJTMIESA-N Arg-Gly-Gly Chemical compound NC(=N)NCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O CYXCAHZVPFREJD-LURJTMIESA-N 0.000 description 2
- MSILNNHVVMMTHZ-UWVGGRQHSA-N Arg-His-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 MSILNNHVVMMTHZ-UWVGGRQHSA-N 0.000 description 2
- CRCCTGPNZUCAHE-DCAQKATOSA-N Arg-His-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CN=CN1 CRCCTGPNZUCAHE-DCAQKATOSA-N 0.000 description 2
- UAOSDDXCTBIPCA-QXEWZRGKSA-N Arg-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N UAOSDDXCTBIPCA-QXEWZRGKSA-N 0.000 description 2
- OOIMKQRCPJBGPD-XUXIUFHCSA-N Arg-Ile-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O OOIMKQRCPJBGPD-XUXIUFHCSA-N 0.000 description 2
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 2
- WMEVEPXNCMKNGH-IHRRRGAJSA-N Arg-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N WMEVEPXNCMKNGH-IHRRRGAJSA-N 0.000 description 2
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 2
- DIIGDGJKTMLQQW-IHRRRGAJSA-N Arg-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N DIIGDGJKTMLQQW-IHRRRGAJSA-N 0.000 description 2
- BNYNOWJESJJIOI-XUXIUFHCSA-N Arg-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCN=C(N)N)N BNYNOWJESJJIOI-XUXIUFHCSA-N 0.000 description 2
- CLICCYPMVFGUOF-IHRRRGAJSA-N Arg-Lys-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O CLICCYPMVFGUOF-IHRRRGAJSA-N 0.000 description 2
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 2
- NPAVRDPEFVKELR-DCAQKATOSA-N Arg-Lys-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O NPAVRDPEFVKELR-DCAQKATOSA-N 0.000 description 2
- PAPSMOYMQDWIOR-AVGNSLFASA-N Arg-Lys-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PAPSMOYMQDWIOR-AVGNSLFASA-N 0.000 description 2
- VIINVRPKMUZYOI-DCAQKATOSA-N Arg-Met-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIINVRPKMUZYOI-DCAQKATOSA-N 0.000 description 2
- GSUFZRURORXYTM-STQMWFEESA-N Arg-Phe-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 GSUFZRURORXYTM-STQMWFEESA-N 0.000 description 2
- LXMKTIZAGIBQRX-HRCADAONSA-N Arg-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O LXMKTIZAGIBQRX-HRCADAONSA-N 0.000 description 2
- DNBMCNQKNOKOSD-DCAQKATOSA-N Arg-Pro-Gln Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O DNBMCNQKNOKOSD-DCAQKATOSA-N 0.000 description 2
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 2
- AWMAZIIEFPFHCP-RCWTZXSCSA-N Arg-Pro-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWMAZIIEFPFHCP-RCWTZXSCSA-N 0.000 description 2
- QHVRVUNEAIFTEK-SZMVWBNQSA-N Arg-Pro-Trp Chemical compound N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O QHVRVUNEAIFTEK-SZMVWBNQSA-N 0.000 description 2
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 2
- AUIJUTGLPVHIRT-FXQIFTODSA-N Arg-Ser-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N AUIJUTGLPVHIRT-FXQIFTODSA-N 0.000 description 2
- VRTWYUYCJGNFES-CIUDSAMLSA-N Arg-Ser-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O VRTWYUYCJGNFES-CIUDSAMLSA-N 0.000 description 2
- JOTRDIXZHNQYGP-DCAQKATOSA-N Arg-Ser-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JOTRDIXZHNQYGP-DCAQKATOSA-N 0.000 description 2
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 2
- LYJXHXGPWDTLKW-HJGDQZAQSA-N Arg-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O LYJXHXGPWDTLKW-HJGDQZAQSA-N 0.000 description 2
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 2
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 2
- PJOPLXOCKACMLK-KKUMJFAQSA-N Arg-Tyr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O PJOPLXOCKACMLK-KKUMJFAQSA-N 0.000 description 2
- QCTOLCVIGRLMQS-HRCADAONSA-N Arg-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O QCTOLCVIGRLMQS-HRCADAONSA-N 0.000 description 2
- ISVACHFCVRKIDG-SRVKXCTJSA-N Arg-Val-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O ISVACHFCVRKIDG-SRVKXCTJSA-N 0.000 description 2
- QTAIIXQCOPUNBQ-QXEWZRGKSA-N Arg-Val-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QTAIIXQCOPUNBQ-QXEWZRGKSA-N 0.000 description 2
- QLSRIZIDQXDQHK-RCWTZXSCSA-N Arg-Val-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QLSRIZIDQXDQHK-RCWTZXSCSA-N 0.000 description 2
- SLKLLQWZQHXYSV-CIUDSAMLSA-N Asn-Ala-Lys Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O SLKLLQWZQHXYSV-CIUDSAMLSA-N 0.000 description 2
- XWGJDUSDTRPQRK-ZLUOBGJFSA-N Asn-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O XWGJDUSDTRPQRK-ZLUOBGJFSA-N 0.000 description 2
- WQSCVMQDZYTFQU-FXQIFTODSA-N Asn-Cys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WQSCVMQDZYTFQU-FXQIFTODSA-N 0.000 description 2
- XWFPGQVLOVGSLU-CIUDSAMLSA-N Asn-Gln-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XWFPGQVLOVGSLU-CIUDSAMLSA-N 0.000 description 2
- GNKVBRYFXYWXAB-WDSKDSINSA-N Asn-Glu-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O GNKVBRYFXYWXAB-WDSKDSINSA-N 0.000 description 2
- RAQMSGVCGSJKCL-FOHZUACHSA-N Asn-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(N)=O RAQMSGVCGSJKCL-FOHZUACHSA-N 0.000 description 2
- GQRDIVQPSMPQME-ZPFDUUQYSA-N Asn-Ile-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O GQRDIVQPSMPQME-ZPFDUUQYSA-N 0.000 description 2
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 2
- MYCSPQIARXTUTP-SRVKXCTJSA-N Asn-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N MYCSPQIARXTUTP-SRVKXCTJSA-N 0.000 description 2
- YVXRYLVELQYAEQ-SRVKXCTJSA-N Asn-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N YVXRYLVELQYAEQ-SRVKXCTJSA-N 0.000 description 2
- JLNFZLNDHONLND-GARJFASQSA-N Asn-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N JLNFZLNDHONLND-GARJFASQSA-N 0.000 description 2
- OROMFUQQTSWUTI-IHRRRGAJSA-N Asn-Phe-Arg Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OROMFUQQTSWUTI-IHRRRGAJSA-N 0.000 description 2
- YRTOMUMWSTUQAX-FXQIFTODSA-N Asn-Pro-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O YRTOMUMWSTUQAX-FXQIFTODSA-N 0.000 description 2
- YUOXLJYVSZYPBJ-CIUDSAMLSA-N Asn-Pro-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O YUOXLJYVSZYPBJ-CIUDSAMLSA-N 0.000 description 2
- GKKUBLFXKRDMFC-BQBZGAKWSA-N Asn-Pro-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O GKKUBLFXKRDMFC-BQBZGAKWSA-N 0.000 description 2
- AWXDRZJQCVHCIT-DCAQKATOSA-N Asn-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O AWXDRZJQCVHCIT-DCAQKATOSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- YSYTWUMRHSFODC-QWRGUYRKSA-N Asn-Tyr-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O YSYTWUMRHSFODC-QWRGUYRKSA-N 0.000 description 2
- DXHINQUXBZNUCF-MELADBBJSA-N Asn-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N)C(=O)O DXHINQUXBZNUCF-MELADBBJSA-N 0.000 description 2
- VTYQAQFKMQTKQD-ACZMJKKPSA-N Asp-Ala-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O VTYQAQFKMQTKQD-ACZMJKKPSA-N 0.000 description 2
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 2
- NECWUSYTYSIFNC-DLOVCJGASA-N Asp-Ala-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 NECWUSYTYSIFNC-DLOVCJGASA-N 0.000 description 2
- NJIKKGUVGUBICV-ZLUOBGJFSA-N Asp-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O NJIKKGUVGUBICV-ZLUOBGJFSA-N 0.000 description 2
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 2
- WCFCYFDBMNFSPA-ACZMJKKPSA-N Asp-Asp-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O WCFCYFDBMNFSPA-ACZMJKKPSA-N 0.000 description 2
- RYKWOUUZJFSJOH-FXQIFTODSA-N Asp-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N RYKWOUUZJFSJOH-FXQIFTODSA-N 0.000 description 2
- SNAWMGHSCHKSDK-GUBZILKMSA-N Asp-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SNAWMGHSCHKSDK-GUBZILKMSA-N 0.000 description 2
- PDECQIHABNQRHN-GUBZILKMSA-N Asp-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC(O)=O PDECQIHABNQRHN-GUBZILKMSA-N 0.000 description 2
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 2
- PSLSTUMPZILTAH-BYULHYEWSA-N Asp-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PSLSTUMPZILTAH-BYULHYEWSA-N 0.000 description 2
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 2
- KHGPWGKPYHPOIK-QWRGUYRKSA-N Asp-Gly-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KHGPWGKPYHPOIK-QWRGUYRKSA-N 0.000 description 2
- POTCZYQVVNXUIG-BQBZGAKWSA-N Asp-Gly-Pro Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O POTCZYQVVNXUIG-BQBZGAKWSA-N 0.000 description 2
- SVABRQFIHCSNCI-FOHZUACHSA-N Asp-Gly-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O SVABRQFIHCSNCI-FOHZUACHSA-N 0.000 description 2
- KQBVNNAPIURMPD-PEFMBERDSA-N Asp-Ile-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O KQBVNNAPIURMPD-PEFMBERDSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- SCQIQCWLOMOEFP-DCAQKATOSA-N Asp-Leu-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O SCQIQCWLOMOEFP-DCAQKATOSA-N 0.000 description 2
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 2
- IVPNEDNYYYFAGI-GARJFASQSA-N Asp-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N IVPNEDNYYYFAGI-GARJFASQSA-N 0.000 description 2
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 2
- GKWFMNNNYZHJHV-SRVKXCTJSA-N Asp-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O GKWFMNNNYZHJHV-SRVKXCTJSA-N 0.000 description 2
- MYLZFUMPZCPJCJ-NHCYSSNCSA-N Asp-Lys-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O MYLZFUMPZCPJCJ-NHCYSSNCSA-N 0.000 description 2
- LTCKTLYKRMCFOC-KKUMJFAQSA-N Asp-Phe-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LTCKTLYKRMCFOC-KKUMJFAQSA-N 0.000 description 2
- JUWISGAGWSDGDH-KKUMJFAQSA-N Asp-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=CC=C1 JUWISGAGWSDGDH-KKUMJFAQSA-N 0.000 description 2
- USNJAPJZSGTTPX-XVSYOHENSA-N Asp-Phe-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O USNJAPJZSGTTPX-XVSYOHENSA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 2
- GGRSYTUJHAZTFN-IHRRRGAJSA-N Asp-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O GGRSYTUJHAZTFN-IHRRRGAJSA-N 0.000 description 2
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 2
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 2
- DRCOAZZDQRCGGP-GHCJXIJMSA-N Asp-Ser-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O DRCOAZZDQRCGGP-GHCJXIJMSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- ITGFVUYOLWBPQW-KKHAAJSZSA-N Asp-Thr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ITGFVUYOLWBPQW-KKHAAJSZSA-N 0.000 description 2
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 2
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 2
- GFYOIYJJMSHLSN-QXEWZRGKSA-N Asp-Val-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GFYOIYJJMSHLSN-QXEWZRGKSA-N 0.000 description 2
- OQMGSMNZVHYDTQ-ZKWXMUAHSA-N Asp-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N OQMGSMNZVHYDTQ-ZKWXMUAHSA-N 0.000 description 2
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 2
- XWKPSMRPIKKDDU-RCOVLWMOSA-N Asp-Val-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O XWKPSMRPIKKDDU-RCOVLWMOSA-N 0.000 description 2
- SFJUYBCDQBAYAJ-YDHLFZDLSA-N Asp-Val-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SFJUYBCDQBAYAJ-YDHLFZDLSA-N 0.000 description 2
- QPDUWAUSSWGJSB-NGZCFLSTSA-N Asp-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N QPDUWAUSSWGJSB-NGZCFLSTSA-N 0.000 description 2
- ZUNMTUPRQMWMHX-LSJOCFKGSA-N Asp-Val-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O ZUNMTUPRQMWMHX-LSJOCFKGSA-N 0.000 description 2
- 241000894006 Bacteria Species 0.000 description 2
- OYPRJOBELJOOCE-UHFFFAOYSA-N Calcium Chemical compound [Ca] OYPRJOBELJOOCE-UHFFFAOYSA-N 0.000 description 2
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 2
- OCEHKDFAWQIBHH-FXQIFTODSA-N Cys-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N OCEHKDFAWQIBHH-FXQIFTODSA-N 0.000 description 2
- HRJLVSQKBLZHSR-ZLUOBGJFSA-N Cys-Asn-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O HRJLVSQKBLZHSR-ZLUOBGJFSA-N 0.000 description 2
- WVJHEDOLHPZLRV-CIUDSAMLSA-N Cys-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N WVJHEDOLHPZLRV-CIUDSAMLSA-N 0.000 description 2
- SBMGKDLRJLYZCU-BIIVOSGPSA-N Cys-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N)C(=O)O SBMGKDLRJLYZCU-BIIVOSGPSA-N 0.000 description 2
- FWYBFUDWUUFLDN-FXQIFTODSA-N Cys-Asp-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N FWYBFUDWUUFLDN-FXQIFTODSA-N 0.000 description 2
- KIHRUISMQZVCNO-ZLUOBGJFSA-N Cys-Asp-Asp Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KIHRUISMQZVCNO-ZLUOBGJFSA-N 0.000 description 2
- MBILEVLLOHJZMG-FXQIFTODSA-N Cys-Gln-Glu Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N MBILEVLLOHJZMG-FXQIFTODSA-N 0.000 description 2
- VBPGTULCFGKGTF-ACZMJKKPSA-N Cys-Glu-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VBPGTULCFGKGTF-ACZMJKKPSA-N 0.000 description 2
- KABHAOSDMIYXTR-GUBZILKMSA-N Cys-Glu-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N KABHAOSDMIYXTR-GUBZILKMSA-N 0.000 description 2
- URDUGPGPLNXXES-WHFBIAKZSA-N Cys-Gly-Cys Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O URDUGPGPLNXXES-WHFBIAKZSA-N 0.000 description 2
- HKALUUKHYNEDRS-GUBZILKMSA-N Cys-Leu-Gln Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HKALUUKHYNEDRS-GUBZILKMSA-N 0.000 description 2
- IZUNQDRIAOLWCN-YUMQZZPRSA-N Cys-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N IZUNQDRIAOLWCN-YUMQZZPRSA-N 0.000 description 2
- WVLZTXGTNGHPBO-SRVKXCTJSA-N Cys-Leu-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O WVLZTXGTNGHPBO-SRVKXCTJSA-N 0.000 description 2
- MBRWOKXNHTUJMB-CIUDSAMLSA-N Cys-Pro-Glu Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O MBRWOKXNHTUJMB-CIUDSAMLSA-N 0.000 description 2
- BCFXQBXXDSEHRS-FXQIFTODSA-N Cys-Ser-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BCFXQBXXDSEHRS-FXQIFTODSA-N 0.000 description 2
- BCWIFCLVCRAIQK-ZLUOBGJFSA-N Cys-Ser-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O BCWIFCLVCRAIQK-ZLUOBGJFSA-N 0.000 description 2
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 2
- DGQJGBDBFVGLGL-ZKWXMUAHSA-N Cys-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N DGQJGBDBFVGLGL-ZKWXMUAHSA-N 0.000 description 2
- 108020001019 DNA Primers Proteins 0.000 description 2
- 239000003155 DNA primer Substances 0.000 description 2
- VZCYOOQTPOCHFL-OWOJBTEDSA-N Fumaric acid Chemical compound OC(=O)\C=C\C(O)=O VZCYOOQTPOCHFL-OWOJBTEDSA-N 0.000 description 2
- 241001200922 Gagata Species 0.000 description 2
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 2
- RZSLYUUFFVHFRQ-FXQIFTODSA-N Gln-Ala-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O RZSLYUUFFVHFRQ-FXQIFTODSA-N 0.000 description 2
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 2
- LZRMPXRYLLTAJX-GUBZILKMSA-N Gln-Arg-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZRMPXRYLLTAJX-GUBZILKMSA-N 0.000 description 2
- PRBLYKYHAJEABA-SRVKXCTJSA-N Gln-Arg-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O PRBLYKYHAJEABA-SRVKXCTJSA-N 0.000 description 2
- INFBPLSHYFALDE-ACZMJKKPSA-N Gln-Asn-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O INFBPLSHYFALDE-ACZMJKKPSA-N 0.000 description 2
- CKNUKHBRCSMKMO-XHNCKOQMSA-N Gln-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N)C(=O)O CKNUKHBRCSMKMO-XHNCKOQMSA-N 0.000 description 2
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 2
- BTSPOOHJBYJRKO-CIUDSAMLSA-N Gln-Asp-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BTSPOOHJBYJRKO-CIUDSAMLSA-N 0.000 description 2
- XEYMBRRKIFYQMF-GUBZILKMSA-N Gln-Asp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XEYMBRRKIFYQMF-GUBZILKMSA-N 0.000 description 2
- FJAYYNIXQNERSO-ACZMJKKPSA-N Gln-Cys-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N FJAYYNIXQNERSO-ACZMJKKPSA-N 0.000 description 2
- MFLMFRZBAJSGHK-ACZMJKKPSA-N Gln-Cys-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N MFLMFRZBAJSGHK-ACZMJKKPSA-N 0.000 description 2
- KVXVVDFOZNYYKZ-DCAQKATOSA-N Gln-Gln-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KVXVVDFOZNYYKZ-DCAQKATOSA-N 0.000 description 2
- MADFVRSKEIEZHZ-DCAQKATOSA-N Gln-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N MADFVRSKEIEZHZ-DCAQKATOSA-N 0.000 description 2
- MCAVASRGVBVPMX-FXQIFTODSA-N Gln-Glu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MCAVASRGVBVPMX-FXQIFTODSA-N 0.000 description 2
- BLOXULLYFRGYKZ-GUBZILKMSA-N Gln-Glu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BLOXULLYFRGYKZ-GUBZILKMSA-N 0.000 description 2
- PNENQZWRFMUZOM-DCAQKATOSA-N Gln-Glu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O PNENQZWRFMUZOM-DCAQKATOSA-N 0.000 description 2
- MAGNEQBFSBREJL-DCAQKATOSA-N Gln-Glu-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N MAGNEQBFSBREJL-DCAQKATOSA-N 0.000 description 2
- VOLVNCMGXWDDQY-LPEHRKFASA-N Gln-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N)C(=O)O VOLVNCMGXWDDQY-LPEHRKFASA-N 0.000 description 2
- ZNZPKVQURDQFFS-FXQIFTODSA-N Gln-Glu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZNZPKVQURDQFFS-FXQIFTODSA-N 0.000 description 2
- IKFZXRLDMYWNBU-YUMQZZPRSA-N Gln-Gly-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N IKFZXRLDMYWNBU-YUMQZZPRSA-N 0.000 description 2
- FGYPOQPQTUNESW-IUCAKERBSA-N Gln-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)N)N FGYPOQPQTUNESW-IUCAKERBSA-N 0.000 description 2
- GQZDDFRXSDGUNG-YVNDNENWSA-N Gln-Ile-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O GQZDDFRXSDGUNG-YVNDNENWSA-N 0.000 description 2
- MTCXQQINVAFZKW-MNXVOIDGSA-N Gln-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MTCXQQINVAFZKW-MNXVOIDGSA-N 0.000 description 2
- KKCJHBXMYYVWMX-KQXIARHKSA-N Gln-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N KKCJHBXMYYVWMX-KQXIARHKSA-N 0.000 description 2
- VZRAXPGTUNDIDK-GUBZILKMSA-N Gln-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VZRAXPGTUNDIDK-GUBZILKMSA-N 0.000 description 2
- VUVKKXPCKILIBD-AVGNSLFASA-N Gln-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N VUVKKXPCKILIBD-AVGNSLFASA-N 0.000 description 2
- PSERKXGRRADTKA-MNXVOIDGSA-N Gln-Leu-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PSERKXGRRADTKA-MNXVOIDGSA-N 0.000 description 2
- SHAUZYVSXAMYAZ-JYJNAYRXSA-N Gln-Leu-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N SHAUZYVSXAMYAZ-JYJNAYRXSA-N 0.000 description 2
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 2
- CELXWPDNIGWCJN-WDCWCFNPSA-N Gln-Lys-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CELXWPDNIGWCJN-WDCWCFNPSA-N 0.000 description 2
- XUMFMAVDHQDATI-DCAQKATOSA-N Gln-Pro-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O XUMFMAVDHQDATI-DCAQKATOSA-N 0.000 description 2
- DOQUICBEISTQHE-CIUDSAMLSA-N Gln-Pro-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O DOQUICBEISTQHE-CIUDSAMLSA-N 0.000 description 2
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 2
- YPFFHGRJCUBXPX-NHCYSSNCSA-N Gln-Pro-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O)C(O)=O YPFFHGRJCUBXPX-NHCYSSNCSA-N 0.000 description 2
- MFHVAWMMKZBSRQ-ACZMJKKPSA-N Gln-Ser-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N MFHVAWMMKZBSRQ-ACZMJKKPSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- RONJIBWTGKVKFY-HTUGSXCWSA-N Gln-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O RONJIBWTGKVKFY-HTUGSXCWSA-N 0.000 description 2
- VEYGCDYMOXHJLS-GVXVVHGQSA-N Gln-Val-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VEYGCDYMOXHJLS-GVXVVHGQSA-N 0.000 description 2
- UTKICHUQEQBDGC-ACZMJKKPSA-N Glu-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N UTKICHUQEQBDGC-ACZMJKKPSA-N 0.000 description 2
- RLZBLVSJDFHDBL-KBIXCLLPSA-N Glu-Ala-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RLZBLVSJDFHDBL-KBIXCLLPSA-N 0.000 description 2
- FYBSCGZLICNOBA-XQXXSGGOSA-N Glu-Ala-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FYBSCGZLICNOBA-XQXXSGGOSA-N 0.000 description 2
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 2
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 2
- IYAUFWMUCGBFMQ-CIUDSAMLSA-N Glu-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N IYAUFWMUCGBFMQ-CIUDSAMLSA-N 0.000 description 2
- VTTSANCGJWLPNC-ZPFDUUQYSA-N Glu-Arg-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VTTSANCGJWLPNC-ZPFDUUQYSA-N 0.000 description 2
- FLLRAEJOLZPSMN-CIUDSAMLSA-N Glu-Asn-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FLLRAEJOLZPSMN-CIUDSAMLSA-N 0.000 description 2
- PCBBLFVHTYNQGG-LAEOZQHASA-N Glu-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N PCBBLFVHTYNQGG-LAEOZQHASA-N 0.000 description 2
- NTBDVNJIWCKURJ-ACZMJKKPSA-N Glu-Asp-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NTBDVNJIWCKURJ-ACZMJKKPSA-N 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- PAQUJCSYVIBPLC-AVGNSLFASA-N Glu-Asp-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 PAQUJCSYVIBPLC-AVGNSLFASA-N 0.000 description 2
- JRCUFCXYZLPSDZ-ACZMJKKPSA-N Glu-Asp-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O JRCUFCXYZLPSDZ-ACZMJKKPSA-N 0.000 description 2
- SBCYJMOOHUDWDA-NUMRIWBASA-N Glu-Asp-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SBCYJMOOHUDWDA-NUMRIWBASA-N 0.000 description 2
- PKYAVRMYTBBRLS-FXQIFTODSA-N Glu-Cys-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O PKYAVRMYTBBRLS-FXQIFTODSA-N 0.000 description 2
- OWVURWCRZZMAOZ-XHNCKOQMSA-N Glu-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)C(=O)O OWVURWCRZZMAOZ-XHNCKOQMSA-N 0.000 description 2
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 2
- NUSWUSKZRCGFEX-FXQIFTODSA-N Glu-Glu-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O NUSWUSKZRCGFEX-FXQIFTODSA-N 0.000 description 2
- SJPMNHCEWPTRBR-BQBZGAKWSA-N Glu-Glu-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SJPMNHCEWPTRBR-BQBZGAKWSA-N 0.000 description 2
- AUTNXSQEVVHSJK-YVNDNENWSA-N Glu-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O AUTNXSQEVVHSJK-YVNDNENWSA-N 0.000 description 2
- YLJHCWNDBKKOEB-IHRRRGAJSA-N Glu-Glu-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YLJHCWNDBKKOEB-IHRRRGAJSA-N 0.000 description 2
- KUTPGXNAAOQSPD-LPEHRKFASA-N Glu-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)C(=O)O KUTPGXNAAOQSPD-LPEHRKFASA-N 0.000 description 2
- AIGROOHQXCACHL-WDSKDSINSA-N Glu-Gly-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(O)=O AIGROOHQXCACHL-WDSKDSINSA-N 0.000 description 2
- CAVMESABQIKFKT-IUCAKERBSA-N Glu-Gly-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N CAVMESABQIKFKT-IUCAKERBSA-N 0.000 description 2
- KRGZZKWSBGPLKL-IUCAKERBSA-N Glu-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCC(=O)O)N KRGZZKWSBGPLKL-IUCAKERBSA-N 0.000 description 2
- OPAINBJQDQTGJY-JGVFFNPUSA-N Glu-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CCC(=O)O)N)C(=O)O OPAINBJQDQTGJY-JGVFFNPUSA-N 0.000 description 2
- HPJLZFTUUJKWAJ-JHEQGTHGSA-N Glu-Gly-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HPJLZFTUUJKWAJ-JHEQGTHGSA-N 0.000 description 2
- GGJOGFJIPPGNRK-JSGCOSHPSA-N Glu-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 GGJOGFJIPPGNRK-JSGCOSHPSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- ZWABFSSWTSAMQN-KBIXCLLPSA-N Glu-Ile-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O ZWABFSSWTSAMQN-KBIXCLLPSA-N 0.000 description 2
- BKRQSECBKKCCKW-HVTMNAMFSA-N Glu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)O)N BKRQSECBKKCCKW-HVTMNAMFSA-N 0.000 description 2
- XTZDZAXYPDISRR-MNXVOIDGSA-N Glu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N XTZDZAXYPDISRR-MNXVOIDGSA-N 0.000 description 2
- ZSWGJYOZWBHROQ-RWRJDSDZSA-N Glu-Ile-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZSWGJYOZWBHROQ-RWRJDSDZSA-N 0.000 description 2
- VSRCAOIHMGCIJK-SRVKXCTJSA-N Glu-Leu-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VSRCAOIHMGCIJK-SRVKXCTJSA-N 0.000 description 2
- VMKCPNBBPGGQBJ-GUBZILKMSA-N Glu-Leu-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N VMKCPNBBPGGQBJ-GUBZILKMSA-N 0.000 description 2
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 2
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 2
- IOUQWHIEQYQVFD-JYJNAYRXSA-N Glu-Leu-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IOUQWHIEQYQVFD-JYJNAYRXSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- OQXDUSZKISQQSS-GUBZILKMSA-N Glu-Lys-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O OQXDUSZKISQQSS-GUBZILKMSA-N 0.000 description 2
- MFNUFCFRAZPJFW-JYJNAYRXSA-N Glu-Lys-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MFNUFCFRAZPJFW-JYJNAYRXSA-N 0.000 description 2
- RBXSZQRSEGYDFG-GUBZILKMSA-N Glu-Lys-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O RBXSZQRSEGYDFG-GUBZILKMSA-N 0.000 description 2
- SUIAHERNFYRBDZ-GVXVVHGQSA-N Glu-Lys-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O SUIAHERNFYRBDZ-GVXVVHGQSA-N 0.000 description 2
- JDUKCSSHWNIQQZ-IHRRRGAJSA-N Glu-Phe-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JDUKCSSHWNIQQZ-IHRRRGAJSA-N 0.000 description 2
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 2
- AAJHGGDRKHYSDH-GUBZILKMSA-N Glu-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O AAJHGGDRKHYSDH-GUBZILKMSA-N 0.000 description 2
- BFEZQZKEPRKKHV-SRVKXCTJSA-N Glu-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O BFEZQZKEPRKKHV-SRVKXCTJSA-N 0.000 description 2
- LPHGXOWFAXFCPX-KKUMJFAQSA-N Glu-Pro-Phe Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O LPHGXOWFAXFCPX-KKUMJFAQSA-N 0.000 description 2
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 2
- WIKMTDVSCUJIPJ-CIUDSAMLSA-N Glu-Ser-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WIKMTDVSCUJIPJ-CIUDSAMLSA-N 0.000 description 2
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 2
- GPSHCSTUYOQPAI-JHEQGTHGSA-N Glu-Thr-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O GPSHCSTUYOQPAI-JHEQGTHGSA-N 0.000 description 2
- UUTGYDAKPISJAO-JYJNAYRXSA-N Glu-Tyr-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 UUTGYDAKPISJAO-JYJNAYRXSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- YPHPEHMXOYTEQG-LAEOZQHASA-N Glu-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O YPHPEHMXOYTEQG-LAEOZQHASA-N 0.000 description 2
- VIPDPMHGICREIS-GVXVVHGQSA-N Glu-Val-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O VIPDPMHGICREIS-GVXVVHGQSA-N 0.000 description 2
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- QSDKBRMVXSWAQE-BFHQHQDPSA-N Gly-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN QSDKBRMVXSWAQE-BFHQHQDPSA-N 0.000 description 2
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 2
- PYUCNHJQQVSPGN-BQBZGAKWSA-N Gly-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)CN=C(N)N PYUCNHJQQVSPGN-BQBZGAKWSA-N 0.000 description 2
- RQZGFWKQLPJOEQ-YUMQZZPRSA-N Gly-Arg-Gln Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN)CN=C(N)N RQZGFWKQLPJOEQ-YUMQZZPRSA-N 0.000 description 2
- OVSKVOOUFAKODB-UWVGGRQHSA-N Gly-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OVSKVOOUFAKODB-UWVGGRQHSA-N 0.000 description 2
- MXXXVOYFNVJHMA-IUCAKERBSA-N Gly-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN MXXXVOYFNVJHMA-IUCAKERBSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- WJZLEENECIOOSA-WDSKDSINSA-N Gly-Asn-Gln Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)O WJZLEENECIOOSA-WDSKDSINSA-N 0.000 description 2
- XRTDOIOIBMAXCT-NKWVEPMBSA-N Gly-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)CN)C(=O)O XRTDOIOIBMAXCT-NKWVEPMBSA-N 0.000 description 2
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 2
- JPWIMMUNWUKOAD-STQMWFEESA-N Gly-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN JPWIMMUNWUKOAD-STQMWFEESA-N 0.000 description 2
- TZOVVRJYUDETQG-RCOVLWMOSA-N Gly-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CN TZOVVRJYUDETQG-RCOVLWMOSA-N 0.000 description 2
- GZBZACMXFIPIDX-WHFBIAKZSA-N Gly-Cys-Asp Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)C(=O)O GZBZACMXFIPIDX-WHFBIAKZSA-N 0.000 description 2
- IXKRSKPKSLXIHN-YUMQZZPRSA-N Gly-Cys-Leu Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O IXKRSKPKSLXIHN-YUMQZZPRSA-N 0.000 description 2
- QCTLGOYODITHPQ-WHFBIAKZSA-N Gly-Cys-Ser Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O QCTLGOYODITHPQ-WHFBIAKZSA-N 0.000 description 2
- VNBNZUAPOYGRDB-ZDLURKLDSA-N Gly-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN)O VNBNZUAPOYGRDB-ZDLURKLDSA-N 0.000 description 2
- PEZZSFLFXXFUQD-XPUUQOCRSA-N Gly-Cys-Val Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O PEZZSFLFXXFUQD-XPUUQOCRSA-N 0.000 description 2
- KTSZUNRRYXPZTK-BQBZGAKWSA-N Gly-Gln-Glu Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KTSZUNRRYXPZTK-BQBZGAKWSA-N 0.000 description 2
- BIRKKBCSAIHDDF-WDSKDSINSA-N Gly-Glu-Cys Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BIRKKBCSAIHDDF-WDSKDSINSA-N 0.000 description 2
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 2
- XPJBQTCXPJNIFE-ZETCQYMHSA-N Gly-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)CN XPJBQTCXPJNIFE-ZETCQYMHSA-N 0.000 description 2
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 2
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 2
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 2
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 2
- DGKBSGNCMCLDSL-BYULHYEWSA-N Gly-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN DGKBSGNCMCLDSL-BYULHYEWSA-N 0.000 description 2
- VIIBEIQMLJEUJG-LAEOZQHASA-N Gly-Ile-Gln Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O VIIBEIQMLJEUJG-LAEOZQHASA-N 0.000 description 2
- HAXARWKYFIIHKD-ZKWXMUAHSA-N Gly-Ile-Ser Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HAXARWKYFIIHKD-ZKWXMUAHSA-N 0.000 description 2
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 2
- YIFUFYZELCMPJP-YUMQZZPRSA-N Gly-Leu-Cys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(O)=O YIFUFYZELCMPJP-YUMQZZPRSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- LLZXNUUIBOALNY-QWRGUYRKSA-N Gly-Leu-Lys Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN LLZXNUUIBOALNY-QWRGUYRKSA-N 0.000 description 2
- NNCSJUBVFBDDLC-YUMQZZPRSA-N Gly-Leu-Ser Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O NNCSJUBVFBDDLC-YUMQZZPRSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- BXICSAQLIHFDDL-YUMQZZPRSA-N Gly-Lys-Asn Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O BXICSAQLIHFDDL-YUMQZZPRSA-N 0.000 description 2
- PTIIBFKSLCYQBO-NHCYSSNCSA-N Gly-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN PTIIBFKSLCYQBO-NHCYSSNCSA-N 0.000 description 2
- PCPOYRCAHPJXII-UWVGGRQHSA-N Gly-Lys-Met Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O PCPOYRCAHPJXII-UWVGGRQHSA-N 0.000 description 2
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 2
- IGOYNRWLWHWAQO-JTQLQIEISA-N Gly-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IGOYNRWLWHWAQO-JTQLQIEISA-N 0.000 description 2
- QAMMIGULQSIRCD-IRXDYDNUSA-N Gly-Phe-Tyr Chemical compound C([C@H](NC(=O)C[NH3+])C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C([O-])=O)C1=CC=CC=C1 QAMMIGULQSIRCD-IRXDYDNUSA-N 0.000 description 2
- HJARVELKOSZUEW-YUMQZZPRSA-N Gly-Pro-Gln Chemical compound [H]NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJARVELKOSZUEW-YUMQZZPRSA-N 0.000 description 2
- JJGBXTYGTKWGAT-YUMQZZPRSA-N Gly-Pro-Glu Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O JJGBXTYGTKWGAT-YUMQZZPRSA-N 0.000 description 2
- ZZJVYSAQQMDIRD-UWVGGRQHSA-N Gly-Pro-His Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ZZJVYSAQQMDIRD-UWVGGRQHSA-N 0.000 description 2
- HAOUOFNNJJLVNS-BQBZGAKWSA-N Gly-Pro-Ser Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O HAOUOFNNJJLVNS-BQBZGAKWSA-N 0.000 description 2
- BMWFDYIYBAFROD-WPRPVWTQSA-N Gly-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN BMWFDYIYBAFROD-WPRPVWTQSA-N 0.000 description 2
- YOBGUCWZPXJHTN-BQBZGAKWSA-N Gly-Ser-Arg Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YOBGUCWZPXJHTN-BQBZGAKWSA-N 0.000 description 2
- IALQAMYQJBZNSK-WHFBIAKZSA-N Gly-Ser-Asn Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O IALQAMYQJBZNSK-WHFBIAKZSA-N 0.000 description 2
- FGPLUIQCSKGLTI-WDSKDSINSA-N Gly-Ser-Glu Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O FGPLUIQCSKGLTI-WDSKDSINSA-N 0.000 description 2
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 2
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 2
- FFJQHWKSGAWSTJ-BFHQHQDPSA-N Gly-Thr-Ala Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O FFJQHWKSGAWSTJ-BFHQHQDPSA-N 0.000 description 2
- YXTFLTJYLIAZQG-FJXKBIBVSA-N Gly-Thr-Arg Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YXTFLTJYLIAZQG-FJXKBIBVSA-N 0.000 description 2
- ZKJZBRHRWKLVSJ-ZDLURKLDSA-N Gly-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O ZKJZBRHRWKLVSJ-ZDLURKLDSA-N 0.000 description 2
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 2
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- DKJWUIYLMLUBDX-XPUUQOCRSA-N Gly-Val-Cys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(=O)O DKJWUIYLMLUBDX-XPUUQOCRSA-N 0.000 description 2
- AFMOTCMSEBITOE-YEPSODPASA-N Gly-Val-Thr Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AFMOTCMSEBITOE-YEPSODPASA-N 0.000 description 2
- RVKIPWVMZANZLI-UHFFFAOYSA-N H-Lys-Trp-OH Natural products C1=CC=C2C(CC(NC(=O)C(N)CCCCN)C(O)=O)=CNC2=C1 RVKIPWVMZANZLI-UHFFFAOYSA-N 0.000 description 2
- 241000238631 Hexapoda Species 0.000 description 2
- BIAKMWKJMQLZOJ-ZKWXMUAHSA-N His-Ala-Ala Chemical compound C[C@H](NC(=O)[C@H](C)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O BIAKMWKJMQLZOJ-ZKWXMUAHSA-N 0.000 description 2
- QIVPRLJQQVXCIY-HGNGGELXSA-N His-Ala-Gln Chemical compound C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCC(N)=O)C(O)=O QIVPRLJQQVXCIY-HGNGGELXSA-N 0.000 description 2
- VSLXGYMEHVAJBH-DLOVCJGASA-N His-Ala-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O VSLXGYMEHVAJBH-DLOVCJGASA-N 0.000 description 2
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 2
- HDXNWVLQSQFJOX-SRVKXCTJSA-N His-Arg-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HDXNWVLQSQFJOX-SRVKXCTJSA-N 0.000 description 2
- ZIMTWPHIKZEHSE-UWVGGRQHSA-N His-Arg-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O ZIMTWPHIKZEHSE-UWVGGRQHSA-N 0.000 description 2
- PROLDOGUBQJNPG-RWMBFGLXSA-N His-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O PROLDOGUBQJNPG-RWMBFGLXSA-N 0.000 description 2
- SOFSRBYHDINIRG-QTKMDUPCSA-N His-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N)O SOFSRBYHDINIRG-QTKMDUPCSA-N 0.000 description 2
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 2
- SDTPKSOWFXBACN-GUBZILKMSA-N His-Glu-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O SDTPKSOWFXBACN-GUBZILKMSA-N 0.000 description 2
- STWGDDDFLUFCCA-GVXVVHGQSA-N His-Glu-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O STWGDDDFLUFCCA-GVXVVHGQSA-N 0.000 description 2
- OQDLKDUVMTUPPG-AVGNSLFASA-N His-Leu-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OQDLKDUVMTUPPG-AVGNSLFASA-N 0.000 description 2
- YAALVYQFVJNXIV-KKUMJFAQSA-N His-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CN=CN1 YAALVYQFVJNXIV-KKUMJFAQSA-N 0.000 description 2
- ZSKJIISDJXJQPV-BZSNNMDCSA-N His-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 ZSKJIISDJXJQPV-BZSNNMDCSA-N 0.000 description 2
- SKOKHBGDXGTDDP-MELADBBJSA-N His-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N SKOKHBGDXGTDDP-MELADBBJSA-N 0.000 description 2
- LVXFNTIIGOQBMD-SRVKXCTJSA-N His-Leu-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O LVXFNTIIGOQBMD-SRVKXCTJSA-N 0.000 description 2
- RNAYRCNHRYEBTH-IHRRRGAJSA-N His-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RNAYRCNHRYEBTH-IHRRRGAJSA-N 0.000 description 2
- OWYIDJCNRWRSJY-QTKMDUPCSA-N His-Pro-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O OWYIDJCNRWRSJY-QTKMDUPCSA-N 0.000 description 2
- STGQSBKUYSPPIG-CIUDSAMLSA-N His-Ser-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 STGQSBKUYSPPIG-CIUDSAMLSA-N 0.000 description 2
- PLCAEMGSYOYIPP-GUBZILKMSA-N His-Ser-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 PLCAEMGSYOYIPP-GUBZILKMSA-N 0.000 description 2
- GIRSNERMXCMDBO-GARJFASQSA-N His-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O GIRSNERMXCMDBO-GARJFASQSA-N 0.000 description 2
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 2
- ILUVWFTXAUYOBW-CUJWVEQBSA-N His-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC1=CN=CN1)N)O ILUVWFTXAUYOBW-CUJWVEQBSA-N 0.000 description 2
- DQZCEKQPSOBNMJ-NKIYYHGXSA-N His-Thr-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DQZCEKQPSOBNMJ-NKIYYHGXSA-N 0.000 description 2
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 2
- ISQOVWDWRUONJH-YESZJQIVSA-N His-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ISQOVWDWRUONJH-YESZJQIVSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- WSAILOWUJZEAGC-DCAQKATOSA-N His-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WSAILOWUJZEAGC-DCAQKATOSA-N 0.000 description 2
- SYPULFZAGBBIOM-GVXVVHGQSA-N His-Val-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N SYPULFZAGBBIOM-GVXVVHGQSA-N 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-N Hydrochloric acid Chemical compound Cl VEXZGXHMUGYJMC-UHFFFAOYSA-N 0.000 description 2
- LQSBBHNVAVNZSX-GHCJXIJMSA-N Ile-Ala-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N LQSBBHNVAVNZSX-GHCJXIJMSA-N 0.000 description 2
- YPWHUFAAMNHMGS-QSFUFRPTSA-N Ile-Ala-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YPWHUFAAMNHMGS-QSFUFRPTSA-N 0.000 description 2
- NPROWIBAWYMPAZ-GUDRVLHUSA-N Ile-Asp-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N NPROWIBAWYMPAZ-GUDRVLHUSA-N 0.000 description 2
- BSWLQVGEVFYGIM-ZPFDUUQYSA-N Ile-Gln-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BSWLQVGEVFYGIM-ZPFDUUQYSA-N 0.000 description 2
- BALLIXFZYSECCF-QEWYBTABSA-N Ile-Gln-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N BALLIXFZYSECCF-QEWYBTABSA-N 0.000 description 2
- PHIXPNQDGGILMP-YVNDNENWSA-N Ile-Glu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N PHIXPNQDGGILMP-YVNDNENWSA-N 0.000 description 2
- UBHUJPVCJHPSEU-GRLWGSQLSA-N Ile-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N UBHUJPVCJHPSEU-GRLWGSQLSA-N 0.000 description 2
- SPQWWEZBHXHUJN-KBIXCLLPSA-N Ile-Glu-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O SPQWWEZBHXHUJN-KBIXCLLPSA-N 0.000 description 2
- CDGLBYSAZFIIJO-RCOVLWMOSA-N Ile-Gly-Gly Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O CDGLBYSAZFIIJO-RCOVLWMOSA-N 0.000 description 2
- ODPKZZLRDNXTJZ-WHOFXGATSA-N Ile-Gly-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ODPKZZLRDNXTJZ-WHOFXGATSA-N 0.000 description 2
- LBRCLQMZAHRTLV-ZKWXMUAHSA-N Ile-Gly-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LBRCLQMZAHRTLV-ZKWXMUAHSA-N 0.000 description 2
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 2
- YGDWPQCLFJNMOL-MNXVOIDGSA-N Ile-Leu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YGDWPQCLFJNMOL-MNXVOIDGSA-N 0.000 description 2
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 2
- GAZGFPOZOLEYAJ-YTFOTSKYSA-N Ile-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N GAZGFPOZOLEYAJ-YTFOTSKYSA-N 0.000 description 2
- GVKKVHNRTUFCCE-BJDJZHNGSA-N Ile-Leu-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)O)N GVKKVHNRTUFCCE-BJDJZHNGSA-N 0.000 description 2
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 2
- UIEZQYNXCYHMQS-BJDJZHNGSA-N Ile-Lys-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)O)N UIEZQYNXCYHMQS-BJDJZHNGSA-N 0.000 description 2
- OVDKXUDMKXAZIV-ZPFDUUQYSA-N Ile-Lys-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OVDKXUDMKXAZIV-ZPFDUUQYSA-N 0.000 description 2
- ADDYYRVQQZFIMW-MNXVOIDGSA-N Ile-Lys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ADDYYRVQQZFIMW-MNXVOIDGSA-N 0.000 description 2
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- UDBPXJNOEWDBDF-XUXIUFHCSA-N Ile-Lys-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)O)N UDBPXJNOEWDBDF-XUXIUFHCSA-N 0.000 description 2
- FQYQMFCIJNWDQZ-CYDGBPFRSA-N Ile-Pro-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 FQYQMFCIJNWDQZ-CYDGBPFRSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- ZNOBVZFCHNHKHA-KBIXCLLPSA-N Ile-Ser-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZNOBVZFCHNHKHA-KBIXCLLPSA-N 0.000 description 2
- RQJUKVXWAKJDBW-SVSWQMSJSA-N Ile-Ser-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N RQJUKVXWAKJDBW-SVSWQMSJSA-N 0.000 description 2
- WCNWGAUZWWSYDG-SVSWQMSJSA-N Ile-Thr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)O)N WCNWGAUZWWSYDG-SVSWQMSJSA-N 0.000 description 2
- VBGCPJBKUXRYDA-DSYPUSFNSA-N Ile-Trp-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCCN)C(=O)O)N VBGCPJBKUXRYDA-DSYPUSFNSA-N 0.000 description 2
- ZUWSVOYKBCHLRR-MGHWNKPDSA-N Ile-Tyr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N ZUWSVOYKBCHLRR-MGHWNKPDSA-N 0.000 description 2
- ZGKVPOSSTGHJAF-HJPIBITLSA-N Ile-Tyr-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CO)C(=O)O)N ZGKVPOSSTGHJAF-HJPIBITLSA-N 0.000 description 2
- ZYVTXBXHIKGZMD-QSFUFRPTSA-N Ile-Val-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ZYVTXBXHIKGZMD-QSFUFRPTSA-N 0.000 description 2
- HGCNKOLVKRAVHD-UHFFFAOYSA-N L-Met-L-Phe Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=CC=C1 HGCNKOLVKRAVHD-UHFFFAOYSA-N 0.000 description 2
- LJHGALIOHLRRQN-DCAQKATOSA-N Leu-Ala-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LJHGALIOHLRRQN-DCAQKATOSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- DQPQTXMIRBUWKO-DCAQKATOSA-N Leu-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(C)C)N DQPQTXMIRBUWKO-DCAQKATOSA-N 0.000 description 2
- HXWALXSAVBLTPK-NUTKFTJISA-N Leu-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(C)C)N HXWALXSAVBLTPK-NUTKFTJISA-N 0.000 description 2
- SUPVSFFZWVOEOI-UHFFFAOYSA-N Leu-Ala-Tyr Natural products CC(C)CC(N)C(=O)NC(C)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-UHFFFAOYSA-N 0.000 description 2
- HBJZFCIVFIBNSV-DCAQKATOSA-N Leu-Arg-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O HBJZFCIVFIBNSV-DCAQKATOSA-N 0.000 description 2
- GRZSCTXVCDUIPO-SRVKXCTJSA-N Leu-Arg-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRZSCTXVCDUIPO-SRVKXCTJSA-N 0.000 description 2
- HASRFYOMVPJRPU-SRVKXCTJSA-N Leu-Arg-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(O)=O)C(O)=O HASRFYOMVPJRPU-SRVKXCTJSA-N 0.000 description 2
- IBMVEYRWAWIOTN-RWMBFGLXSA-N Leu-Arg-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(O)=O IBMVEYRWAWIOTN-RWMBFGLXSA-N 0.000 description 2
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 2
- TWQIYNGNYNJUFM-NHCYSSNCSA-N Leu-Asn-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TWQIYNGNYNJUFM-NHCYSSNCSA-N 0.000 description 2
- FGNQZXKVAZIMCI-CIUDSAMLSA-N Leu-Asp-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N FGNQZXKVAZIMCI-CIUDSAMLSA-N 0.000 description 2
- ILJREDZFPHTUIE-GUBZILKMSA-N Leu-Asp-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ILJREDZFPHTUIE-GUBZILKMSA-N 0.000 description 2
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 2
- MMEDVBWCMGRKKC-GARJFASQSA-N Leu-Asp-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N MMEDVBWCMGRKKC-GARJFASQSA-N 0.000 description 2
- CLVUXCBGKUECIT-HJGDQZAQSA-N Leu-Asp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CLVUXCBGKUECIT-HJGDQZAQSA-N 0.000 description 2
- NFHJQETXTSDZSI-DCAQKATOSA-N Leu-Cys-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NFHJQETXTSDZSI-DCAQKATOSA-N 0.000 description 2
- DKEZVKFLETVJFY-CIUDSAMLSA-N Leu-Cys-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DKEZVKFLETVJFY-CIUDSAMLSA-N 0.000 description 2
- QKIBIXAQKAFZGL-GUBZILKMSA-N Leu-Cys-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QKIBIXAQKAFZGL-GUBZILKMSA-N 0.000 description 2
- HUEBCHPSXSQUGN-GARJFASQSA-N Leu-Cys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N HUEBCHPSXSQUGN-GARJFASQSA-N 0.000 description 2
- WCTCIIAGNMFYAO-DCAQKATOSA-N Leu-Cys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O WCTCIIAGNMFYAO-DCAQKATOSA-N 0.000 description 2
- KAFOIVJDVSZUMD-DCAQKATOSA-N Leu-Gln-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O KAFOIVJDVSZUMD-DCAQKATOSA-N 0.000 description 2
- RSFGIMMPWAXNML-MNXVOIDGSA-N Leu-Gln-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RSFGIMMPWAXNML-MNXVOIDGSA-N 0.000 description 2
- FQZPTCNSNPWHLJ-AVGNSLFASA-N Leu-Gln-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(O)=O FQZPTCNSNPWHLJ-AVGNSLFASA-N 0.000 description 2
- GPICTNQYKHHHTH-GUBZILKMSA-N Leu-Gln-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O GPICTNQYKHHHTH-GUBZILKMSA-N 0.000 description 2
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 2
- LAGPXKYZCCTSGQ-JYJNAYRXSA-N Leu-Glu-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LAGPXKYZCCTSGQ-JYJNAYRXSA-N 0.000 description 2
- ZFNLIDNJUWNIJL-WDCWCFNPSA-N Leu-Glu-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZFNLIDNJUWNIJL-WDCWCFNPSA-N 0.000 description 2
- OXRLYTYUXAQTHP-YUMQZZPRSA-N Leu-Gly-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](C)C(O)=O OXRLYTYUXAQTHP-YUMQZZPRSA-N 0.000 description 2
- KGCLIYGPQXUNLO-IUCAKERBSA-N Leu-Gly-Glu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O KGCLIYGPQXUNLO-IUCAKERBSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- WRLPVDVHNWSSCL-MELADBBJSA-N Leu-His-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N WRLPVDVHNWSSCL-MELADBBJSA-N 0.000 description 2
- AVEGDIAXTDVBJS-XUXIUFHCSA-N Leu-Ile-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AVEGDIAXTDVBJS-XUXIUFHCSA-N 0.000 description 2
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 2
- PDQDCFBVYXEFSD-SRVKXCTJSA-N Leu-Leu-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PDQDCFBVYXEFSD-SRVKXCTJSA-N 0.000 description 2
- QNBVTHNJGCOVFA-AVGNSLFASA-N Leu-Leu-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O QNBVTHNJGCOVFA-AVGNSLFASA-N 0.000 description 2
- JLWZLIQRYCTYBD-IHRRRGAJSA-N Leu-Lys-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JLWZLIQRYCTYBD-IHRRRGAJSA-N 0.000 description 2
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 2
- VCHVSKNMTXWIIP-SRVKXCTJSA-N Leu-Lys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O VCHVSKNMTXWIIP-SRVKXCTJSA-N 0.000 description 2
- OVZLLFONXILPDZ-VOAKCMCISA-N Leu-Lys-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OVZLLFONXILPDZ-VOAKCMCISA-N 0.000 description 2
- LZHJZLHSRGWBBE-IHRRRGAJSA-N Leu-Lys-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O LZHJZLHSRGWBBE-IHRRRGAJSA-N 0.000 description 2
- WXZOHBVPVKABQN-DCAQKATOSA-N Leu-Met-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WXZOHBVPVKABQN-DCAQKATOSA-N 0.000 description 2
- KXCMQWMNYQOAKA-SRVKXCTJSA-N Leu-Met-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N KXCMQWMNYQOAKA-SRVKXCTJSA-N 0.000 description 2
- FLNPJLDPGMLWAU-UWVGGRQHSA-N Leu-Met-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(C)C FLNPJLDPGMLWAU-UWVGGRQHSA-N 0.000 description 2
- MJTOYIHCKVQICL-ULQDDVLXSA-N Leu-Met-Phe Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N MJTOYIHCKVQICL-ULQDDVLXSA-N 0.000 description 2
- GNRPTBRHRRZCMA-RWMBFGLXSA-N Leu-Met-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N GNRPTBRHRRZCMA-RWMBFGLXSA-N 0.000 description 2
- MJWVXZABPOKJJF-ACRUOGEOSA-N Leu-Phe-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MJWVXZABPOKJJF-ACRUOGEOSA-N 0.000 description 2
- RRVCZCNFXIFGRA-DCAQKATOSA-N Leu-Pro-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O RRVCZCNFXIFGRA-DCAQKATOSA-N 0.000 description 2
- HGUUMQWGYCVPKG-DCAQKATOSA-N Leu-Pro-Cys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HGUUMQWGYCVPKG-DCAQKATOSA-N 0.000 description 2
- PWPBLZXWFXJFHE-RHYQMDGZSA-N Leu-Pro-Thr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O PWPBLZXWFXJFHE-RHYQMDGZSA-N 0.000 description 2
- UCXQIIIFOOGYEM-ULQDDVLXSA-N Leu-Pro-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 UCXQIIIFOOGYEM-ULQDDVLXSA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 2
- ICYRCNICGBJLGM-HJGDQZAQSA-N Leu-Thr-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O ICYRCNICGBJLGM-HJGDQZAQSA-N 0.000 description 2
- LINKCQUOMUDLKN-KATARQTJSA-N Leu-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(C)C)N)O LINKCQUOMUDLKN-KATARQTJSA-N 0.000 description 2
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 2
- VJGQRELPQWNURN-JYJNAYRXSA-N Leu-Tyr-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJGQRELPQWNURN-JYJNAYRXSA-N 0.000 description 2
- OZTZJMUZVAVJGY-BZSNNMDCSA-N Leu-Tyr-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N OZTZJMUZVAVJGY-BZSNNMDCSA-N 0.000 description 2
- BTEMNFBEAAOGBR-BZSNNMDCSA-N Leu-Tyr-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BTEMNFBEAAOGBR-BZSNNMDCSA-N 0.000 description 2
- AXVIGSRGTMNSJU-YESZJQIVSA-N Leu-Tyr-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N AXVIGSRGTMNSJU-YESZJQIVSA-N 0.000 description 2
- VUBIPAHVHMZHCM-KKUMJFAQSA-N Leu-Tyr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 VUBIPAHVHMZHCM-KKUMJFAQSA-N 0.000 description 2
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 2
- MVJRBCJCRYGCKV-GVXVVHGQSA-N Leu-Val-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MVJRBCJCRYGCKV-GVXVVHGQSA-N 0.000 description 2
- AIMGJYMCTAABEN-GVXVVHGQSA-N Leu-Val-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AIMGJYMCTAABEN-GVXVVHGQSA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 2
- MPGHETGWWWUHPY-CIUDSAMLSA-N Lys-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN MPGHETGWWWUHPY-CIUDSAMLSA-N 0.000 description 2
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 2
- UWKNTTJNVSYXPC-CIUDSAMLSA-N Lys-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN UWKNTTJNVSYXPC-CIUDSAMLSA-N 0.000 description 2
- IRNSXVOWSXSULE-DCAQKATOSA-N Lys-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN IRNSXVOWSXSULE-DCAQKATOSA-N 0.000 description 2
- YNNPKXBBRZVIRX-IHRRRGAJSA-N Lys-Arg-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O YNNPKXBBRZVIRX-IHRRRGAJSA-N 0.000 description 2
- NTSPQIONFJUMJV-AVGNSLFASA-N Lys-Arg-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O NTSPQIONFJUMJV-AVGNSLFASA-N 0.000 description 2
- DNEJSAIMVANNPA-DCAQKATOSA-N Lys-Asn-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DNEJSAIMVANNPA-DCAQKATOSA-N 0.000 description 2
- 108010062166 Lys-Asn-Asp Proteins 0.000 description 2
- YKIRNDPUWONXQN-GUBZILKMSA-N Lys-Asn-Gln Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N YKIRNDPUWONXQN-GUBZILKMSA-N 0.000 description 2
- JBRWKVANRYPCAF-XIRDDKMYSA-N Lys-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N JBRWKVANRYPCAF-XIRDDKMYSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- IWWMPCPLFXFBAF-SRVKXCTJSA-N Lys-Asp-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O IWWMPCPLFXFBAF-SRVKXCTJSA-N 0.000 description 2
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 2
- YEIYAQQKADPIBJ-GARJFASQSA-N Lys-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O YEIYAQQKADPIBJ-GARJFASQSA-N 0.000 description 2
- GKFNXYMAMKJSKD-NHCYSSNCSA-N Lys-Asp-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GKFNXYMAMKJSKD-NHCYSSNCSA-N 0.000 description 2
- MWVUEPNEPWMFBD-SRVKXCTJSA-N Lys-Cys-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCCCN MWVUEPNEPWMFBD-SRVKXCTJSA-N 0.000 description 2
- KSFQPRLZAUXXPT-GARJFASQSA-N Lys-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CCCCN)N)C(=O)O KSFQPRLZAUXXPT-GARJFASQSA-N 0.000 description 2
- WTZUSCUIVPVCRH-SRVKXCTJSA-N Lys-Gln-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N WTZUSCUIVPVCRH-SRVKXCTJSA-N 0.000 description 2
- LXNPMPIQDNSMTA-AVGNSLFASA-N Lys-Gln-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 LXNPMPIQDNSMTA-AVGNSLFASA-N 0.000 description 2
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 2
- DCRWPTBMWMGADO-AVGNSLFASA-N Lys-Glu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DCRWPTBMWMGADO-AVGNSLFASA-N 0.000 description 2
- QZONCCHVHCOBSK-YUMQZZPRSA-N Lys-Gly-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O QZONCCHVHCOBSK-YUMQZZPRSA-N 0.000 description 2
- GQFDWEDHOQRNLC-QWRGUYRKSA-N Lys-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCCN GQFDWEDHOQRNLC-QWRGUYRKSA-N 0.000 description 2
- PGLGNCVOWIORQE-SRVKXCTJSA-N Lys-His-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O PGLGNCVOWIORQE-SRVKXCTJSA-N 0.000 description 2
- XREQQOATSMMAJP-MGHWNKPDSA-N Lys-Ile-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XREQQOATSMMAJP-MGHWNKPDSA-N 0.000 description 2
- WAIHHELKYSFIQN-XUXIUFHCSA-N Lys-Ile-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O WAIHHELKYSFIQN-XUXIUFHCSA-N 0.000 description 2
- OVAOHZIOUBEQCJ-IHRRRGAJSA-N Lys-Leu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OVAOHZIOUBEQCJ-IHRRRGAJSA-N 0.000 description 2
- ONPDTSFZAIWMDI-AVGNSLFASA-N Lys-Leu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ONPDTSFZAIWMDI-AVGNSLFASA-N 0.000 description 2
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 2
- ALGGDNMLQNFVIZ-SRVKXCTJSA-N Lys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ALGGDNMLQNFVIZ-SRVKXCTJSA-N 0.000 description 2
- YXPJCVNIDDKGOE-MELADBBJSA-N Lys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N)C(=O)O YXPJCVNIDDKGOE-MELADBBJSA-N 0.000 description 2
- JPYPRVHMKRFTAT-KKUMJFAQSA-N Lys-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N JPYPRVHMKRFTAT-KKUMJFAQSA-N 0.000 description 2
- YSPZCHGIWAQVKQ-AVGNSLFASA-N Lys-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN YSPZCHGIWAQVKQ-AVGNSLFASA-N 0.000 description 2
- WQDKIVRHTQYJSN-DCAQKATOSA-N Lys-Ser-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N WQDKIVRHTQYJSN-DCAQKATOSA-N 0.000 description 2
- DNWBUCHHMRQWCZ-GUBZILKMSA-N Lys-Ser-Gln Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O DNWBUCHHMRQWCZ-GUBZILKMSA-N 0.000 description 2
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 2
- WZVSHTFTCYOFPL-GARJFASQSA-N Lys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCCCN)N)C(=O)O WZVSHTFTCYOFPL-GARJFASQSA-N 0.000 description 2
- DIBZLYZXTSVGLN-CIUDSAMLSA-N Lys-Ser-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O DIBZLYZXTSVGLN-CIUDSAMLSA-N 0.000 description 2
- CAVRAQIDHUPECU-UVOCVTCTSA-N Lys-Thr-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAVRAQIDHUPECU-UVOCVTCTSA-N 0.000 description 2
- UGCIQUYEJIEHKX-GVXVVHGQSA-N Lys-Val-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O UGCIQUYEJIEHKX-GVXVVHGQSA-N 0.000 description 2
- 108010052285 Membrane Proteins Proteins 0.000 description 2
- YRAWWKUTNBILNT-FXQIFTODSA-N Met-Ala-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YRAWWKUTNBILNT-FXQIFTODSA-N 0.000 description 2
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- DTICLBJHRYSJLH-GUBZILKMSA-N Met-Ala-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O DTICLBJHRYSJLH-GUBZILKMSA-N 0.000 description 2
- OSOLWRWQADPDIQ-DCAQKATOSA-N Met-Asp-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OSOLWRWQADPDIQ-DCAQKATOSA-N 0.000 description 2
- YORIKIDJCPKBON-YUMQZZPRSA-N Met-Glu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O YORIKIDJCPKBON-YUMQZZPRSA-N 0.000 description 2
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 2
- FZUNSVYYPYJYAP-NAKRPEOUSA-N Met-Ile-Ala Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O FZUNSVYYPYJYAP-NAKRPEOUSA-N 0.000 description 2
- FWAHLGXNBLWIKB-NAKRPEOUSA-N Met-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCSC FWAHLGXNBLWIKB-NAKRPEOUSA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- WPTHAGXMYDRPFD-SRVKXCTJSA-N Met-Lys-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O WPTHAGXMYDRPFD-SRVKXCTJSA-N 0.000 description 2
- HSJIGJRZYUADSS-IHRRRGAJSA-N Met-Lys-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O HSJIGJRZYUADSS-IHRRRGAJSA-N 0.000 description 2
- LXCSZPUQKMTXNW-BQBZGAKWSA-N Met-Ser-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O LXCSZPUQKMTXNW-BQBZGAKWSA-N 0.000 description 2
- PHURAEXVWLDIGT-LPEHRKFASA-N Met-Ser-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N PHURAEXVWLDIGT-LPEHRKFASA-N 0.000 description 2
- GGXZOTSDJJTDGB-GUBZILKMSA-N Met-Ser-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O GGXZOTSDJJTDGB-GUBZILKMSA-N 0.000 description 2
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 2
- 241001465754 Metazoa Species 0.000 description 2
- AFVFQIVMOAPDHO-UHFFFAOYSA-N Methanesulfonic acid Chemical compound CS(O)(=O)=O AFVFQIVMOAPDHO-UHFFFAOYSA-N 0.000 description 2
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 2
- 108010087066 N2-tryptophyllysine Proteins 0.000 description 2
- 108091005804 Peptidases Proteins 0.000 description 2
- FPTXMUIBLMGTQH-ONGXEEELSA-N Phe-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 FPTXMUIBLMGTQH-ONGXEEELSA-N 0.000 description 2
- BBDSZDHUCPSYAC-QEJZJMRPSA-N Phe-Ala-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BBDSZDHUCPSYAC-QEJZJMRPSA-N 0.000 description 2
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 2
- CGOMLCQJEMWMCE-STQMWFEESA-N Phe-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CGOMLCQJEMWMCE-STQMWFEESA-N 0.000 description 2
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 2
- JIYJYFIXQTYDNF-YDHLFZDLSA-N Phe-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N JIYJYFIXQTYDNF-YDHLFZDLSA-N 0.000 description 2
- CSYVXYQDIVCQNU-QWRGUYRKSA-N Phe-Asp-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O CSYVXYQDIVCQNU-QWRGUYRKSA-N 0.000 description 2
- OJUMUUXGSXUZJZ-SRVKXCTJSA-N Phe-Asp-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O OJUMUUXGSXUZJZ-SRVKXCTJSA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- PDUVELWDJZOUEI-IHRRRGAJSA-N Phe-Cys-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PDUVELWDJZOUEI-IHRRRGAJSA-N 0.000 description 2
- LXUJDHOKVUYHRC-KKUMJFAQSA-N Phe-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N LXUJDHOKVUYHRC-KKUMJFAQSA-N 0.000 description 2
- FGXIJNMDRCZVDE-KKUMJFAQSA-N Phe-Cys-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N FGXIJNMDRCZVDE-KKUMJFAQSA-N 0.000 description 2
- VLZGUAUYZGQKPM-DRZSPHRISA-N Phe-Gln-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O VLZGUAUYZGQKPM-DRZSPHRISA-N 0.000 description 2
- MGBRZXXGQBAULP-DRZSPHRISA-N Phe-Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGBRZXXGQBAULP-DRZSPHRISA-N 0.000 description 2
- HOYQLNNGMHXZDW-KKUMJFAQSA-N Phe-Glu-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HOYQLNNGMHXZDW-KKUMJFAQSA-N 0.000 description 2
- MPFGIYLYWUCSJG-AVGNSLFASA-N Phe-Glu-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MPFGIYLYWUCSJG-AVGNSLFASA-N 0.000 description 2
- WPTYDQPGBMDUBI-QWRGUYRKSA-N Phe-Gly-Asn Chemical compound N[C@@H](Cc1ccccc1)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O WPTYDQPGBMDUBI-QWRGUYRKSA-N 0.000 description 2
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 2
- NHCKESBLOMHIIE-IRXDYDNUSA-N Phe-Gly-Phe Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 NHCKESBLOMHIIE-IRXDYDNUSA-N 0.000 description 2
- HNFUGJUZJRYUHN-JSGCOSHPSA-N Phe-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HNFUGJUZJRYUHN-JSGCOSHPSA-N 0.000 description 2
- SFKOEHXABNPLRT-KBPBESRZSA-N Phe-His-Gly Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)NCC(O)=O SFKOEHXABNPLRT-KBPBESRZSA-N 0.000 description 2
- CWFGECHCRMGPPT-MXAVVETBSA-N Phe-Ile-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O CWFGECHCRMGPPT-MXAVVETBSA-N 0.000 description 2
- KXUZHWXENMYOHC-QEJZJMRPSA-N Phe-Leu-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUZHWXENMYOHC-QEJZJMRPSA-N 0.000 description 2
- KDYPMIZMXDECSU-JYJNAYRXSA-N Phe-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 KDYPMIZMXDECSU-JYJNAYRXSA-N 0.000 description 2
- SMFGCTXUBWEPKM-KBPBESRZSA-N Phe-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 SMFGCTXUBWEPKM-KBPBESRZSA-N 0.000 description 2
- METZZBCMDXHFMK-BZSNNMDCSA-N Phe-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N METZZBCMDXHFMK-BZSNNMDCSA-N 0.000 description 2
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 2
- RBRNEFJTEHPDSL-ACRUOGEOSA-N Phe-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 RBRNEFJTEHPDSL-ACRUOGEOSA-N 0.000 description 2
- ZJPGOXWRFNKIQL-JYJNAYRXSA-N Phe-Pro-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=CC=C1 ZJPGOXWRFNKIQL-JYJNAYRXSA-N 0.000 description 2
- NJJBATPLUQHRBM-IHRRRGAJSA-N Phe-Pro-Ser Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CO)C(=O)O NJJBATPLUQHRBM-IHRRRGAJSA-N 0.000 description 2
- ZLAKUZDMKVKFAI-JYJNAYRXSA-N Phe-Pro-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O ZLAKUZDMKVKFAI-JYJNAYRXSA-N 0.000 description 2
- BPCLGWHVPVTTFM-QWRGUYRKSA-N Phe-Ser-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)NCC(O)=O BPCLGWHVPVTTFM-QWRGUYRKSA-N 0.000 description 2
- JXQVYPWVGUOIDV-MXAVVETBSA-N Phe-Ser-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JXQVYPWVGUOIDV-MXAVVETBSA-N 0.000 description 2
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 2
- GLJZDMZJHFXJQG-BZSNNMDCSA-N Phe-Ser-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O GLJZDMZJHFXJQG-BZSNNMDCSA-N 0.000 description 2
- MRWOVVNKSXXLRP-IHPCNDPISA-N Phe-Ser-Trp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MRWOVVNKSXXLRP-IHPCNDPISA-N 0.000 description 2
- XNMYNGDKJNOKHH-BZSNNMDCSA-N Phe-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O XNMYNGDKJNOKHH-BZSNNMDCSA-N 0.000 description 2
- LTAWNJXSRUCFAN-UNQGMJICSA-N Phe-Thr-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LTAWNJXSRUCFAN-UNQGMJICSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- PTDAGKJHZBGDKD-OEAJRASXSA-N Phe-Thr-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O PTDAGKJHZBGDKD-OEAJRASXSA-N 0.000 description 2
- GNRMAQSIROFNMI-IXOXFDKPSA-N Phe-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GNRMAQSIROFNMI-IXOXFDKPSA-N 0.000 description 2
- ZVJGAXNBBKPYOE-HKUYNNGSSA-N Phe-Trp-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)NCC(O)=O)C1=CC=CC=C1 ZVJGAXNBBKPYOE-HKUYNNGSSA-N 0.000 description 2
- XALFIVXGQUEGKV-JSGCOSHPSA-N Phe-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XALFIVXGQUEGKV-JSGCOSHPSA-N 0.000 description 2
- APZNYJFGVAGFCF-JYJNAYRXSA-N Phe-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccccc1)C(C)C)C(O)=O APZNYJFGVAGFCF-JYJNAYRXSA-N 0.000 description 2
- NBIIXXVUZAFLBC-UHFFFAOYSA-N Phosphoric acid Chemical compound OP(O)(O)=O NBIIXXVUZAFLBC-UHFFFAOYSA-N 0.000 description 2
- FCCBQBZXIAZNIG-LSJOCFKGSA-N Pro-Ala-His Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O FCCBQBZXIAZNIG-LSJOCFKGSA-N 0.000 description 2
- LCRSGSIRKLXZMZ-BPNCWPANSA-N Pro-Ala-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LCRSGSIRKLXZMZ-BPNCWPANSA-N 0.000 description 2
- OLHDPZMYUSBGDE-GUBZILKMSA-N Pro-Arg-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O OLHDPZMYUSBGDE-GUBZILKMSA-N 0.000 description 2
- BNBBNGZZKQUWCD-IUCAKERBSA-N Pro-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H]1CCCN1 BNBBNGZZKQUWCD-IUCAKERBSA-N 0.000 description 2
- OYEUSRAZOGIDBY-JYJNAYRXSA-N Pro-Arg-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OYEUSRAZOGIDBY-JYJNAYRXSA-N 0.000 description 2
- ORPZXBQTEHINPB-SRVKXCTJSA-N Pro-Arg-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H]1CCCN1)C(O)=O ORPZXBQTEHINPB-SRVKXCTJSA-N 0.000 description 2
- WWAQEUOYCYMGHB-FXQIFTODSA-N Pro-Asn-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 WWAQEUOYCYMGHB-FXQIFTODSA-N 0.000 description 2
- OBVCYFIHIIYIQF-CIUDSAMLSA-N Pro-Asn-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OBVCYFIHIIYIQF-CIUDSAMLSA-N 0.000 description 2
- XROLYVMNVIKVEM-BQBZGAKWSA-N Pro-Asn-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O XROLYVMNVIKVEM-BQBZGAKWSA-N 0.000 description 2
- SFECXGVELZFBFJ-VEVYYDQMSA-N Pro-Asp-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFECXGVELZFBFJ-VEVYYDQMSA-N 0.000 description 2
- QXNSKJLSLYCTMT-FXQIFTODSA-N Pro-Cys-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O QXNSKJLSLYCTMT-FXQIFTODSA-N 0.000 description 2
- OZAPWFHRPINHND-GUBZILKMSA-N Pro-Cys-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O OZAPWFHRPINHND-GUBZILKMSA-N 0.000 description 2
- UPJGUQPLYWTISV-GUBZILKMSA-N Pro-Gln-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UPJGUQPLYWTISV-GUBZILKMSA-N 0.000 description 2
- ZPPVJIJMIKTERM-YUMQZZPRSA-N Pro-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ZPPVJIJMIKTERM-YUMQZZPRSA-N 0.000 description 2
- HJSCRFZVGXAGNG-SRVKXCTJSA-N Pro-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H]1CCCN1 HJSCRFZVGXAGNG-SRVKXCTJSA-N 0.000 description 2
- LANQLYHLMYDWJP-SRVKXCTJSA-N Pro-Gln-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O LANQLYHLMYDWJP-SRVKXCTJSA-N 0.000 description 2
- XZONQWUEBAFQPO-HJGDQZAQSA-N Pro-Gln-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XZONQWUEBAFQPO-HJGDQZAQSA-N 0.000 description 2
- VPFGPKIWSDVTOY-SRVKXCTJSA-N Pro-Glu-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O VPFGPKIWSDVTOY-SRVKXCTJSA-N 0.000 description 2
- LXVLKXPFIDDHJG-CIUDSAMLSA-N Pro-Glu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O LXVLKXPFIDDHJG-CIUDSAMLSA-N 0.000 description 2
- VPEVBAUSTBWQHN-NHCYSSNCSA-N Pro-Glu-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O VPEVBAUSTBWQHN-NHCYSSNCSA-N 0.000 description 2
- ULIWFCCJIOEHMU-BQBZGAKWSA-N Pro-Gly-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H]1CCCN1 ULIWFCCJIOEHMU-BQBZGAKWSA-N 0.000 description 2
- QNZLIVROMORQFH-BQBZGAKWSA-N Pro-Gly-Cys Chemical compound C1C[C@H](NC1)C(=O)NCC(=O)N[C@@H](CS)C(=O)O QNZLIVROMORQFH-BQBZGAKWSA-N 0.000 description 2
- VZKBJNBZMZHKRC-XUXIUFHCSA-N Pro-Ile-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O VZKBJNBZMZHKRC-XUXIUFHCSA-N 0.000 description 2
- UREQLMJCKFLLHM-NAKRPEOUSA-N Pro-Ile-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O UREQLMJCKFLLHM-NAKRPEOUSA-N 0.000 description 2
- HFNPOYOKIPGAEI-SRVKXCTJSA-N Pro-Leu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 HFNPOYOKIPGAEI-SRVKXCTJSA-N 0.000 description 2
- BRJGUPWVFXKBQI-XUXIUFHCSA-N Pro-Leu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BRJGUPWVFXKBQI-XUXIUFHCSA-N 0.000 description 2
- MRYUJHGPZQNOAD-IHRRRGAJSA-N Pro-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 MRYUJHGPZQNOAD-IHRRRGAJSA-N 0.000 description 2
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 2
- VTFXTWDFPTWNJY-RHYQMDGZSA-N Pro-Leu-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VTFXTWDFPTWNJY-RHYQMDGZSA-N 0.000 description 2
- QGLFRQCECIWXFA-RCWTZXSCSA-N Pro-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1)O QGLFRQCECIWXFA-RCWTZXSCSA-N 0.000 description 2
- WHNJMTHJGCEKGA-ULQDDVLXSA-N Pro-Phe-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WHNJMTHJGCEKGA-ULQDDVLXSA-N 0.000 description 2
- GFHXZNVJIKMAGO-IHRRRGAJSA-N Pro-Phe-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GFHXZNVJIKMAGO-IHRRRGAJSA-N 0.000 description 2
- FYKUEXMZYFIZKA-DCAQKATOSA-N Pro-Pro-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O FYKUEXMZYFIZKA-DCAQKATOSA-N 0.000 description 2
- KBUAPZAZPWNYSW-SRVKXCTJSA-N Pro-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 KBUAPZAZPWNYSW-SRVKXCTJSA-N 0.000 description 2
- GOMUXSCOIWIJFP-GUBZILKMSA-N Pro-Ser-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GOMUXSCOIWIJFP-GUBZILKMSA-N 0.000 description 2
- GMJDSFYVTAMIBF-FXQIFTODSA-N Pro-Ser-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O GMJDSFYVTAMIBF-FXQIFTODSA-N 0.000 description 2
- CZCCVJUUWBMISW-FXQIFTODSA-N Pro-Ser-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O CZCCVJUUWBMISW-FXQIFTODSA-N 0.000 description 2
- SEZGGSHLMROBFX-CIUDSAMLSA-N Pro-Ser-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O SEZGGSHLMROBFX-CIUDSAMLSA-N 0.000 description 2
- LNICFEXCAHIJOR-DCAQKATOSA-N Pro-Ser-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O LNICFEXCAHIJOR-DCAQKATOSA-N 0.000 description 2
- SXJOPONICMGFCR-DCAQKATOSA-N Pro-Ser-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O SXJOPONICMGFCR-DCAQKATOSA-N 0.000 description 2
- XSXABUHLKPUVLX-JYJNAYRXSA-N Pro-Ser-Trp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O XSXABUHLKPUVLX-JYJNAYRXSA-N 0.000 description 2
- UGDMQJSXSSZUKL-IHRRRGAJSA-N Pro-Ser-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O UGDMQJSXSSZUKL-IHRRRGAJSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- DCHQYSOGURGJST-FJXKBIBVSA-N Pro-Thr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O DCHQYSOGURGJST-FJXKBIBVSA-N 0.000 description 2
- RMJZWERKFFNNNS-XGEHTFHBSA-N Pro-Thr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O RMJZWERKFFNNNS-XGEHTFHBSA-N 0.000 description 2
- GZNYIXWOIUFLGO-ZJDVBMNYSA-N Pro-Thr-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZNYIXWOIUFLGO-ZJDVBMNYSA-N 0.000 description 2
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 2
- QKWYXRPICJEQAJ-KJEVXHAQSA-N Pro-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@@H]2CCCN2)O QKWYXRPICJEQAJ-KJEVXHAQSA-N 0.000 description 2
- XDKKMRPRRCOELJ-GUBZILKMSA-N Pro-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 XDKKMRPRRCOELJ-GUBZILKMSA-N 0.000 description 2
- WWXNZNWZNZPDIF-SRVKXCTJSA-N Pro-Val-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 WWXNZNWZNZPDIF-SRVKXCTJSA-N 0.000 description 2
- KHRLUIPIMIQFGT-AVGNSLFASA-N Pro-Val-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHRLUIPIMIQFGT-AVGNSLFASA-N 0.000 description 2
- 108010079005 RDV peptide Proteins 0.000 description 2
- LVVBAKCGXXUHFO-ZLUOBGJFSA-N Ser-Ala-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O LVVBAKCGXXUHFO-ZLUOBGJFSA-N 0.000 description 2
- HRNQLKCLPVKZNE-CIUDSAMLSA-N Ser-Ala-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O HRNQLKCLPVKZNE-CIUDSAMLSA-N 0.000 description 2
- IYCBDVBJWDXQRR-FXQIFTODSA-N Ser-Ala-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IYCBDVBJWDXQRR-FXQIFTODSA-N 0.000 description 2
- IDQFQFVEWMWRQQ-DLOVCJGASA-N Ser-Ala-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IDQFQFVEWMWRQQ-DLOVCJGASA-N 0.000 description 2
- BRKHVZNDAOMAHX-BIIVOSGPSA-N Ser-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N BRKHVZNDAOMAHX-BIIVOSGPSA-N 0.000 description 2
- NLQUOHDCLSFABG-GUBZILKMSA-N Ser-Arg-Arg Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NLQUOHDCLSFABG-GUBZILKMSA-N 0.000 description 2
- QEDMOZUJTGEIBF-FXQIFTODSA-N Ser-Arg-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O QEDMOZUJTGEIBF-FXQIFTODSA-N 0.000 description 2
- QFBNNYNWKYKVJO-DCAQKATOSA-N Ser-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N QFBNNYNWKYKVJO-DCAQKATOSA-N 0.000 description 2
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 2
- BCKYYTVFBXHPOG-ACZMJKKPSA-N Ser-Asn-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N BCKYYTVFBXHPOG-ACZMJKKPSA-N 0.000 description 2
- ZXLUWXWISXIFIX-ACZMJKKPSA-N Ser-Asn-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZXLUWXWISXIFIX-ACZMJKKPSA-N 0.000 description 2
- YMEXHZTVKDAKIY-GHCJXIJMSA-N Ser-Asn-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO)C(O)=O YMEXHZTVKDAKIY-GHCJXIJMSA-N 0.000 description 2
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 2
- BTPAWKABYQMKKN-LKXGYXEUSA-N Ser-Asp-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BTPAWKABYQMKKN-LKXGYXEUSA-N 0.000 description 2
- HEQPKICPPDOSIN-SRVKXCTJSA-N Ser-Asp-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HEQPKICPPDOSIN-SRVKXCTJSA-N 0.000 description 2
- INCNPLPRPOYTJI-JBDRJPRFSA-N Ser-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N INCNPLPRPOYTJI-JBDRJPRFSA-N 0.000 description 2
- RFBKULCUBJAQFT-BIIVOSGPSA-N Ser-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CO)N)C(=O)O RFBKULCUBJAQFT-BIIVOSGPSA-N 0.000 description 2
- IXUGADGDCQDLSA-FXQIFTODSA-N Ser-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N IXUGADGDCQDLSA-FXQIFTODSA-N 0.000 description 2
- ZOHGLPQGEHSLPD-FXQIFTODSA-N Ser-Gln-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZOHGLPQGEHSLPD-FXQIFTODSA-N 0.000 description 2
- HVKMTOIAYDOJPL-NRPADANISA-N Ser-Gln-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVKMTOIAYDOJPL-NRPADANISA-N 0.000 description 2
- YQQKYAZABFEYAF-FXQIFTODSA-N Ser-Glu-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQQKYAZABFEYAF-FXQIFTODSA-N 0.000 description 2
- LALNXSXEYFUUDD-GUBZILKMSA-N Ser-Glu-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LALNXSXEYFUUDD-GUBZILKMSA-N 0.000 description 2
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 2
- WBINSDOPZHQPPM-AVGNSLFASA-N Ser-Glu-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)O WBINSDOPZHQPPM-AVGNSLFASA-N 0.000 description 2
- OHKFXGKHSJKKAL-NRPADANISA-N Ser-Glu-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OHKFXGKHSJKKAL-NRPADANISA-N 0.000 description 2
- SNVIOQXAHVORQM-WDSKDSINSA-N Ser-Gly-Gln Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O SNVIOQXAHVORQM-WDSKDSINSA-N 0.000 description 2
- IOVHBRCQOGWAQH-ZKWXMUAHSA-N Ser-Gly-Ile Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O IOVHBRCQOGWAQH-ZKWXMUAHSA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- QGAHMVHBORDHDC-YUMQZZPRSA-N Ser-His-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CN=CN1 QGAHMVHBORDHDC-YUMQZZPRSA-N 0.000 description 2
- UGHCUDLCCVVIJR-VGDYDELISA-N Ser-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N UGHCUDLCCVVIJR-VGDYDELISA-N 0.000 description 2
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 2
- DOSZISJPMCYEHT-NAKRPEOUSA-N Ser-Ile-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O DOSZISJPMCYEHT-NAKRPEOUSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- IAORETPTUDBBGV-CIUDSAMLSA-N Ser-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N IAORETPTUDBBGV-CIUDSAMLSA-N 0.000 description 2
- GVIGVIOEYBOTCB-XIRDDKMYSA-N Ser-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC(C)C)C(O)=O)=CNC2=C1 GVIGVIOEYBOTCB-XIRDDKMYSA-N 0.000 description 2
- PPNPDKGQRFSCAC-CIUDSAMLSA-N Ser-Lys-Asp Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CO)C(=O)N[C@@H](CC(O)=O)C(O)=O PPNPDKGQRFSCAC-CIUDSAMLSA-N 0.000 description 2
- WGDYNRCOQRERLZ-KKUMJFAQSA-N Ser-Lys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N WGDYNRCOQRERLZ-KKUMJFAQSA-N 0.000 description 2
- FPCGZYMRFFIYIH-CIUDSAMLSA-N Ser-Lys-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O FPCGZYMRFFIYIH-CIUDSAMLSA-N 0.000 description 2
- LRZLZIUXQBIWTB-KATARQTJSA-N Ser-Lys-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LRZLZIUXQBIWTB-KATARQTJSA-N 0.000 description 2
- PMCMLDNPAZUYGI-DCAQKATOSA-N Ser-Lys-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O PMCMLDNPAZUYGI-DCAQKATOSA-N 0.000 description 2
- FZEUTKVQGMVGHW-AVGNSLFASA-N Ser-Phe-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZEUTKVQGMVGHW-AVGNSLFASA-N 0.000 description 2
- QMCDMHWAKMUGJE-IHRRRGAJSA-N Ser-Phe-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O QMCDMHWAKMUGJE-IHRRRGAJSA-N 0.000 description 2
- NUEHQDHDLDXCRU-GUBZILKMSA-N Ser-Pro-Arg Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NUEHQDHDLDXCRU-GUBZILKMSA-N 0.000 description 2
- JLKWJWPDXPKKHI-FXQIFTODSA-N Ser-Pro-Asn Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CC(=O)N)C(=O)O JLKWJWPDXPKKHI-FXQIFTODSA-N 0.000 description 2
- WNDUPCKKKGSKIQ-CIUDSAMLSA-N Ser-Pro-Gln Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O WNDUPCKKKGSKIQ-CIUDSAMLSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- CKDXFSPMIDSMGV-GUBZILKMSA-N Ser-Pro-Val Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O CKDXFSPMIDSMGV-GUBZILKMSA-N 0.000 description 2
- FZXOPYUEQGDGMS-ACZMJKKPSA-N Ser-Ser-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O FZXOPYUEQGDGMS-ACZMJKKPSA-N 0.000 description 2
- JCLAFVNDBJMLBC-JBDRJPRFSA-N Ser-Ser-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JCLAFVNDBJMLBC-JBDRJPRFSA-N 0.000 description 2
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 2
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 2
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 2
- VLMIUSLQONKLDV-HEIBUPTGSA-N Ser-Thr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VLMIUSLQONKLDV-HEIBUPTGSA-N 0.000 description 2
- YXEYTHXDRDAIOJ-CWRNSKLLSA-N Ser-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CO)N)C(=O)O YXEYTHXDRDAIOJ-CWRNSKLLSA-N 0.000 description 2
- LLSLRQOEAFCZLW-NRPADANISA-N Ser-Val-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LLSLRQOEAFCZLW-NRPADANISA-N 0.000 description 2
- SIEBDTCABMZCLF-XGEHTFHBSA-N Ser-Val-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SIEBDTCABMZCLF-XGEHTFHBSA-N 0.000 description 2
- HSWXBJCBYSWBPT-GUBZILKMSA-N Ser-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CO)C(C)C)C(O)=O HSWXBJCBYSWBPT-GUBZILKMSA-N 0.000 description 2
- 108091081024 Start codon Proteins 0.000 description 2
- QAOWNCQODCNURD-UHFFFAOYSA-N Sulfuric acid Chemical compound OS(O)(=O)=O QAOWNCQODCNURD-UHFFFAOYSA-N 0.000 description 2
- 108020005038 Terminator Codon Proteins 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- LVHHEVGYAZGXDE-KDXUFGMBSA-N Thr-Ala-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(=O)O)N)O LVHHEVGYAZGXDE-KDXUFGMBSA-N 0.000 description 2
- CAJFZCICSVBOJK-SHGPDSBTSA-N Thr-Ala-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAJFZCICSVBOJK-SHGPDSBTSA-N 0.000 description 2
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 2
- UKBSDLHIKIXJKH-HJGDQZAQSA-N Thr-Arg-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O UKBSDLHIKIXJKH-HJGDQZAQSA-N 0.000 description 2
- NAXBBCLCEOTAIG-RHYQMDGZSA-N Thr-Arg-Lys Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O NAXBBCLCEOTAIG-RHYQMDGZSA-N 0.000 description 2
- GZYNMZQXFRWDFH-YTWAJWBKSA-N Thr-Arg-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O GZYNMZQXFRWDFH-YTWAJWBKSA-N 0.000 description 2
- NLJKZUGAIIRWJN-LKXGYXEUSA-N Thr-Asp-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O NLJKZUGAIIRWJN-LKXGYXEUSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- GNHRVXYZKWSJTF-HJGDQZAQSA-N Thr-Asp-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O GNHRVXYZKWSJTF-HJGDQZAQSA-N 0.000 description 2
- JXKMXEBNZCKSDY-JIOCBJNQSA-N Thr-Asp-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O JXKMXEBNZCKSDY-JIOCBJNQSA-N 0.000 description 2
- ODSAPYVQSLDRSR-LKXGYXEUSA-N Thr-Cys-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O ODSAPYVQSLDRSR-LKXGYXEUSA-N 0.000 description 2
- LYGKYFKSZTUXGZ-ZDLURKLDSA-N Thr-Cys-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)NCC(O)=O LYGKYFKSZTUXGZ-ZDLURKLDSA-N 0.000 description 2
- ZLNWJMRLHLGKFX-SVSWQMSJSA-N Thr-Cys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZLNWJMRLHLGKFX-SVSWQMSJSA-N 0.000 description 2
- MMTOHPRBJKEZHT-BWBBJGPYSA-N Thr-Cys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O MMTOHPRBJKEZHT-BWBBJGPYSA-N 0.000 description 2
- UZJDBCHMIQXLOQ-HEIBUPTGSA-N Thr-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O UZJDBCHMIQXLOQ-HEIBUPTGSA-N 0.000 description 2
- VUVCRYXYUUPGSB-GLLZPBPUSA-N Thr-Gln-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N)O VUVCRYXYUUPGSB-GLLZPBPUSA-N 0.000 description 2
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 2
- SHOMROOOQBDGRL-JHEQGTHGSA-N Thr-Glu-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O SHOMROOOQBDGRL-JHEQGTHGSA-N 0.000 description 2
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 2
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 2
- YZUWGFXVVZQJEI-PMVVWTBXSA-N Thr-Gly-His Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O YZUWGFXVVZQJEI-PMVVWTBXSA-N 0.000 description 2
- ZTPXSEUVYNNZRB-CDMKHQONSA-N Thr-Gly-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZTPXSEUVYNNZRB-CDMKHQONSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- KRGDDWVBBDLPSJ-CUJWVEQBSA-N Thr-His-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O KRGDDWVBBDLPSJ-CUJWVEQBSA-N 0.000 description 2
- SXAGUVRFGJSFKC-ZEILLAHLSA-N Thr-His-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SXAGUVRFGJSFKC-ZEILLAHLSA-N 0.000 description 2
- HOVLHEKTGVIKAP-WDCWCFNPSA-N Thr-Leu-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O HOVLHEKTGVIKAP-WDCWCFNPSA-N 0.000 description 2
- FLPZMPOZGYPBEN-PPCPHDFISA-N Thr-Leu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FLPZMPOZGYPBEN-PPCPHDFISA-N 0.000 description 2
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 2
- KZSYAEWQMJEGRZ-RHYQMDGZSA-N Thr-Leu-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O KZSYAEWQMJEGRZ-RHYQMDGZSA-N 0.000 description 2
- UUSQVWOVUYMLJA-PPCPHDFISA-N Thr-Lys-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UUSQVWOVUYMLJA-PPCPHDFISA-N 0.000 description 2
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 2
- GUHLYMZJVXUIPO-RCWTZXSCSA-N Thr-Met-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GUHLYMZJVXUIPO-RCWTZXSCSA-N 0.000 description 2
- WNQJTLATMXYSEL-OEAJRASXSA-N Thr-Phe-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O WNQJTLATMXYSEL-OEAJRASXSA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- DEGCBBCMYWNJNA-RHYQMDGZSA-N Thr-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O DEGCBBCMYWNJNA-RHYQMDGZSA-N 0.000 description 2
- YGCDFAJJCRVQKU-RCWTZXSCSA-N Thr-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)[C@@H](C)O YGCDFAJJCRVQKU-RCWTZXSCSA-N 0.000 description 2
- WKGAAMOJPMBBMC-IXOXFDKPSA-N Thr-Ser-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O WKGAAMOJPMBBMC-IXOXFDKPSA-N 0.000 description 2
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- NDZYTIMDOZMECO-SHGPDSBTSA-N Thr-Thr-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O NDZYTIMDOZMECO-SHGPDSBTSA-N 0.000 description 2
- YRJOLUDFVAUXLI-GSSVUCPTSA-N Thr-Thr-Asp Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O YRJOLUDFVAUXLI-GSSVUCPTSA-N 0.000 description 2
- CSNBWOJOEOPYIJ-UVOCVTCTSA-N Thr-Thr-Lys Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O CSNBWOJOEOPYIJ-UVOCVTCTSA-N 0.000 description 2
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 2
- KVEWWQRTAVMOFT-KJEVXHAQSA-N Thr-Tyr-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O KVEWWQRTAVMOFT-KJEVXHAQSA-N 0.000 description 2
- XGFYGMKZKFRGAI-RCWTZXSCSA-N Thr-Val-Arg Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N XGFYGMKZKFRGAI-RCWTZXSCSA-N 0.000 description 2
- AKHDFZHUPGVFEJ-YEPSODPASA-N Thr-Val-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AKHDFZHUPGVFEJ-YEPSODPASA-N 0.000 description 2
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 2
- KZTLZZQTJMCGIP-ZJDVBMNYSA-N Thr-Val-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KZTLZZQTJMCGIP-ZJDVBMNYSA-N 0.000 description 2
- VYVBSMCZNHOZGD-RCWTZXSCSA-N Thr-Val-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O VYVBSMCZNHOZGD-RCWTZXSCSA-N 0.000 description 2
- BDWDMRSGCXEDMR-WFBYXXMGSA-N Trp-Ala-Asn Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N BDWDMRSGCXEDMR-WFBYXXMGSA-N 0.000 description 2
- VKMOGXREKGVZAF-QEJZJMRPSA-N Trp-Asp-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VKMOGXREKGVZAF-QEJZJMRPSA-N 0.000 description 2
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 2
- RPVDDQYNBOVWLR-HOCLYGCPSA-N Trp-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RPVDDQYNBOVWLR-HOCLYGCPSA-N 0.000 description 2
- CCZXBOFIBYQLEV-IHPCNDPISA-N Trp-Leu-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O CCZXBOFIBYQLEV-IHPCNDPISA-N 0.000 description 2
- HJXOFWKCWLHYIJ-SZMVWBNQSA-N Trp-Lys-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HJXOFWKCWLHYIJ-SZMVWBNQSA-N 0.000 description 2
- VUMCLPHXCBIJJB-PMVMPFDFSA-N Trp-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N VUMCLPHXCBIJJB-PMVMPFDFSA-N 0.000 description 2
- GBEAUNVBIMLWIB-IHPCNDPISA-N Trp-Ser-Phe Chemical compound C([C@H](NC(=O)[C@H](CO)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 GBEAUNVBIMLWIB-IHPCNDPISA-N 0.000 description 2
- SSSDKJMQMZTMJP-BVSLBCMMSA-N Trp-Tyr-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=C(O)C=C1 SSSDKJMQMZTMJP-BVSLBCMMSA-N 0.000 description 2
- TVOGEPLDNYTAHD-CQDKDKBSSA-N Tyr-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 TVOGEPLDNYTAHD-CQDKDKBSSA-N 0.000 description 2
- OOEUVMFKKZYSRX-LEWSCRJBSA-N Tyr-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N OOEUVMFKKZYSRX-LEWSCRJBSA-N 0.000 description 2
- JWHOIHCOHMZSAR-QWRGUYRKSA-N Tyr-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JWHOIHCOHMZSAR-QWRGUYRKSA-N 0.000 description 2
- RYSNTWVRSLCAJZ-RYUDHWBXSA-N Tyr-Gln-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 RYSNTWVRSLCAJZ-RYUDHWBXSA-N 0.000 description 2
- NZFCWALTLNFHHC-JYJNAYRXSA-N Tyr-Glu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NZFCWALTLNFHHC-JYJNAYRXSA-N 0.000 description 2
- AKLNEFNQWLHIGY-QWRGUYRKSA-N Tyr-Gly-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N)O AKLNEFNQWLHIGY-QWRGUYRKSA-N 0.000 description 2
- KHCSOLAHNLOXJR-BZSNNMDCSA-N Tyr-Leu-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KHCSOLAHNLOXJR-BZSNNMDCSA-N 0.000 description 2
- CDKZJGMPZHPAJC-ULQDDVLXSA-N Tyr-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDKZJGMPZHPAJC-ULQDDVLXSA-N 0.000 description 2
- OGPKMBOPMDTEDM-IHRRRGAJSA-N Tyr-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N OGPKMBOPMDTEDM-IHRRRGAJSA-N 0.000 description 2
- FDKDGFGTHGJKNV-FHWLQOOXSA-N Tyr-Phe-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N FDKDGFGTHGJKNV-FHWLQOOXSA-N 0.000 description 2
- ARMNWLJYHCOSHE-KKUMJFAQSA-N Tyr-Pro-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O ARMNWLJYHCOSHE-KKUMJFAQSA-N 0.000 description 2
- QKXAEWMHAAVVGS-KKUMJFAQSA-N Tyr-Pro-Glu Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O QKXAEWMHAAVVGS-KKUMJFAQSA-N 0.000 description 2
- NHOVZGFNTGMYMI-KKUMJFAQSA-N Tyr-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NHOVZGFNTGMYMI-KKUMJFAQSA-N 0.000 description 2
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 2
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 2
- RUCNAYOMFXRIKJ-DCAQKATOSA-N Val-Ala-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN RUCNAYOMFXRIKJ-DCAQKATOSA-N 0.000 description 2
- HNWQUBBOBKSFQV-AVGNSLFASA-N Val-Arg-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HNWQUBBOBKSFQV-AVGNSLFASA-N 0.000 description 2
- IDKGBVZGNTYYCC-QXEWZRGKSA-N Val-Asn-Pro Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(O)=O IDKGBVZGNTYYCC-QXEWZRGKSA-N 0.000 description 2
- QHDXUYOYTPWCSK-RCOVLWMOSA-N Val-Asp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)NCC(=O)O)N QHDXUYOYTPWCSK-RCOVLWMOSA-N 0.000 description 2
- DDNIHOWRDOXXPF-NGZCFLSTSA-N Val-Asp-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N DDNIHOWRDOXXPF-NGZCFLSTSA-N 0.000 description 2
- VXCAZHCVDBQMTP-NRPADANISA-N Val-Cys-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VXCAZHCVDBQMTP-NRPADANISA-N 0.000 description 2
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 2
- CFSSLXZJEMERJY-NRPADANISA-N Val-Gln-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CFSSLXZJEMERJY-NRPADANISA-N 0.000 description 2
- PGBJAZDAEWPDAA-NHCYSSNCSA-N Val-Gln-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCSC)C(=O)O)N PGBJAZDAEWPDAA-NHCYSSNCSA-N 0.000 description 2
- JXGWQYWDUOWQHA-DZKIICNBSA-N Val-Gln-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N JXGWQYWDUOWQHA-DZKIICNBSA-N 0.000 description 2
- BRPKEERLGYNCNC-NHCYSSNCSA-N Val-Glu-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N BRPKEERLGYNCNC-NHCYSSNCSA-N 0.000 description 2
- CVIXTAITYJQMPE-LAEOZQHASA-N Val-Glu-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CVIXTAITYJQMPE-LAEOZQHASA-N 0.000 description 2
- XWYUBUYQMOUFRQ-IFFSRLJSSA-N Val-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N)O XWYUBUYQMOUFRQ-IFFSRLJSSA-N 0.000 description 2
- PIFJAFRUVWZRKR-QMMMGPOBSA-N Val-Gly-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)NCC(=O)NCC([O-])=O PIFJAFRUVWZRKR-QMMMGPOBSA-N 0.000 description 2
- URIRWLJVWHYLET-ONGXEEELSA-N Val-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C URIRWLJVWHYLET-ONGXEEELSA-N 0.000 description 2
- FXVDGDZRYLFQKY-WPRPVWTQSA-N Val-Gly-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C FXVDGDZRYLFQKY-WPRPVWTQSA-N 0.000 description 2
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 2
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 2
- CPGJELLYDQEDRK-NAKRPEOUSA-N Val-Ile-Ala Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O CPGJELLYDQEDRK-NAKRPEOUSA-N 0.000 description 2
- VXDSPJJQUQDCKH-UKJIMTQDSA-N Val-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N VXDSPJJQUQDCKH-UKJIMTQDSA-N 0.000 description 2
- UKEVLVBHRKWECS-LSJOCFKGSA-N Val-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](C(C)C)N UKEVLVBHRKWECS-LSJOCFKGSA-N 0.000 description 2
- FTKXYXACXYOHND-XUXIUFHCSA-N Val-Ile-Leu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O FTKXYXACXYOHND-XUXIUFHCSA-N 0.000 description 2
- SDUBQHUJJWQTEU-XUXIUFHCSA-N Val-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C(C)C)N SDUBQHUJJWQTEU-XUXIUFHCSA-N 0.000 description 2
- DJQIUOKSNRBTSV-CYDGBPFRSA-N Val-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](C(C)C)N DJQIUOKSNRBTSV-CYDGBPFRSA-N 0.000 description 2
- AGXGCFSECFQMKB-NHCYSSNCSA-N Val-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N AGXGCFSECFQMKB-NHCYSSNCSA-N 0.000 description 2
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 2
- KTEZUXISLQTDDQ-NHCYSSNCSA-N Val-Lys-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)O)C(=O)O)N KTEZUXISLQTDDQ-NHCYSSNCSA-N 0.000 description 2
- ZRSZTKTVPNSUNA-IHRRRGAJSA-N Val-Lys-Leu Chemical compound CC(C)C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)C(C)C)C(O)=O ZRSZTKTVPNSUNA-IHRRRGAJSA-N 0.000 description 2
- YMTOEGGOCHVGEH-IHRRRGAJSA-N Val-Lys-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O YMTOEGGOCHVGEH-IHRRRGAJSA-N 0.000 description 2
- CXWJFWAZIVWBOS-XQQFMLRXSA-N Val-Lys-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CXWJFWAZIVWBOS-XQQFMLRXSA-N 0.000 description 2
- FMQGYTMERWBMSI-HJWJTTGWSA-N Val-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N FMQGYTMERWBMSI-HJWJTTGWSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- YTNGABPUXFEOGU-SRVKXCTJSA-N Val-Pro-Arg Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O YTNGABPUXFEOGU-SRVKXCTJSA-N 0.000 description 2
- ZXYPHBKIZLAQTL-QXEWZRGKSA-N Val-Pro-Asp Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZXYPHBKIZLAQTL-QXEWZRGKSA-N 0.000 description 2
- GQMNEJMFMCJJTD-NHCYSSNCSA-N Val-Pro-Gln Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O GQMNEJMFMCJJTD-NHCYSSNCSA-N 0.000 description 2
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 2
- WANVRBAZGSICCP-SRVKXCTJSA-N Val-Pro-Met Chemical compound CSCC[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C)C(O)=O WANVRBAZGSICCP-SRVKXCTJSA-N 0.000 description 2
- QIVPZSWBBHRNBA-JYJNAYRXSA-N Val-Pro-Phe Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1ccccc1)C(O)=O QIVPZSWBBHRNBA-JYJNAYRXSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- PGQUDQYHWICSAB-NAKRPEOUSA-N Val-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N PGQUDQYHWICSAB-NAKRPEOUSA-N 0.000 description 2
- DLLRRUDLMSJTMB-GUBZILKMSA-N Val-Ser-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)O)N DLLRRUDLMSJTMB-GUBZILKMSA-N 0.000 description 2
- PZTZYZUTCPZWJH-FXQIFTODSA-N Val-Ser-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PZTZYZUTCPZWJH-FXQIFTODSA-N 0.000 description 2
- NZYNRRGJJVSSTJ-GUBZILKMSA-N Val-Ser-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O NZYNRRGJJVSSTJ-GUBZILKMSA-N 0.000 description 2
- CEKSLIVSNNGOKH-KZVJFYERSA-N Val-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](C(C)C)N)O CEKSLIVSNNGOKH-KZVJFYERSA-N 0.000 description 2
- UVHFONIHVHLDDQ-IFFSRLJSSA-N Val-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O UVHFONIHVHLDDQ-IFFSRLJSSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- NGXQOQNXSGOYOI-BQFCYCMXSA-N Val-Trp-Gln Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 NGXQOQNXSGOYOI-BQFCYCMXSA-N 0.000 description 2
- ZLMFVXMJFIWIRE-FHWLQOOXSA-N Val-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](C(C)C)N ZLMFVXMJFIWIRE-FHWLQOOXSA-N 0.000 description 2
- JXWGBRRVTRAZQA-ULQDDVLXSA-N Val-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](C(C)C)N JXWGBRRVTRAZQA-ULQDDVLXSA-N 0.000 description 2
- VVIZITNVZUAEMI-DLOVCJGASA-N Val-Val-Gln Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(N)=O VVIZITNVZUAEMI-DLOVCJGASA-N 0.000 description 2
- LLJLBRRXKZTTRD-GUBZILKMSA-N Val-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N LLJLBRRXKZTTRD-GUBZILKMSA-N 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 125000002777 acetyl group Chemical group [H]C([H])([H])C(*)=O 0.000 description 2
- 125000002252 acyl group Chemical group 0.000 description 2
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 2
- 108010069490 alanyl-glycyl-seryl-glutamic acid Proteins 0.000 description 2
- 108010045023 alanyl-prolyl-tyrosine Proteins 0.000 description 2
- 238000004458 analytical method Methods 0.000 description 2
- 230000000692 anti-sense effect Effects 0.000 description 2
- 239000003816 antisense DNA Substances 0.000 description 2
- 239000007864 aqueous solution Substances 0.000 description 2
- 108010069926 arginyl-glycyl-serine Proteins 0.000 description 2
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 2
- 108010010430 asparagine-proline-alanine Proteins 0.000 description 2
- WPYMKLBDIGXBTP-UHFFFAOYSA-N benzoic acid Chemical compound OC(=O)C1=CC=CC=C1 WPYMKLBDIGXBTP-UHFFFAOYSA-N 0.000 description 2
- 210000004369 blood Anatomy 0.000 description 2
- 239000008280 blood Substances 0.000 description 2
- 238000006664 bond formation reaction Methods 0.000 description 2
- 239000011575 calcium Substances 0.000 description 2
- 229910052791 calcium Inorganic materials 0.000 description 2
- 150000001718 carbodiimides Chemical class 0.000 description 2
- 150000007942 carboxylates Chemical class 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 238000003776 cleavage reaction Methods 0.000 description 2
- 238000006482 condensation reaction Methods 0.000 description 2
- 238000012258 culturing Methods 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- XBDQKXXYIPTUBI-UHFFFAOYSA-N dimethylselenoniopropionate Natural products CCC(O)=O XBDQKXXYIPTUBI-UHFFFAOYSA-N 0.000 description 2
- FSXRLASFHBWESK-UHFFFAOYSA-N dipeptide phenylalanyl-tyrosine Natural products C=1C=C(O)C=CC=1CC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FSXRLASFHBWESK-UHFFFAOYSA-N 0.000 description 2
- BNIILDVGGAEEIG-UHFFFAOYSA-L disodium hydrogen phosphate Chemical compound [Na+].[Na+].OP([O-])([O-])=O BNIILDVGGAEEIG-UHFFFAOYSA-L 0.000 description 2
- 229910000397 disodium phosphate Inorganic materials 0.000 description 2
- 235000019800 disodium phosphate Nutrition 0.000 description 2
- 238000002474 experimental method Methods 0.000 description 2
- 102000037865 fusion proteins Human genes 0.000 description 2
- 108020001507 fusion proteins Proteins 0.000 description 2
- 108010057083 glutamyl-aspartyl-leucine Proteins 0.000 description 2
- HPAIKDPJURGQLN-UHFFFAOYSA-N glycyl-L-histidyl-L-phenylalanine Natural products C=1C=CC=CC=1CC(C(O)=O)NC(=O)C(NC(=O)CN)CC1=CN=CN1 HPAIKDPJURGQLN-UHFFFAOYSA-N 0.000 description 2
- 108010019832 glycyl-asparaginyl-glycine Proteins 0.000 description 2
- 108010081985 glycyl-cystinyl-aspartic acid Proteins 0.000 description 2
- 108010023364 glycyl-histidyl-arginine Proteins 0.000 description 2
- 108010050475 glycyl-leucyl-tyrosine Proteins 0.000 description 2
- 108010048994 glycyl-tyrosyl-alanine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- NPZTUJOABDZTLV-UHFFFAOYSA-N hydroxybenzotriazole Substances O=C1C=CC=C2NNN=C12 NPZTUJOABDZTLV-UHFFFAOYSA-N 0.000 description 2
- 238000000338 in vitro Methods 0.000 description 2
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 2
- 108010078274 isoleucylvaline Proteins 0.000 description 2
- 238000002372 labelling Methods 0.000 description 2
- 108010044056 leucyl-phenylalanine Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 238000004519 manufacturing process Methods 0.000 description 2
- 238000000691 measurement method Methods 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- BDAGIHXWWSANSR-UHFFFAOYSA-N methanoic acid Natural products OC=O BDAGIHXWWSANSR-UHFFFAOYSA-N 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010068488 methionylphenylalanine Proteins 0.000 description 2
- 150000007522 mineralic acids Chemical class 0.000 description 2
- 230000035772 mutation Effects 0.000 description 2
- 150000007524 organic acids Chemical class 0.000 description 2
- 235000005985 organic acids Nutrition 0.000 description 2
- 108010084525 phenylalanyl-phenylalanyl-glycine Proteins 0.000 description 2
- 108010089198 phenylalanyl-prolyl-arginine Proteins 0.000 description 2
- 108010073101 phenylalanylleucine Proteins 0.000 description 2
- 108091033319 polynucleotide Proteins 0.000 description 2
- 102000040430 polynucleotide Human genes 0.000 description 2
- 239000002157 polynucleotide Substances 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- 239000002994 raw material Substances 0.000 description 2
- 108091008146 restriction endonucleases Proteins 0.000 description 2
- 238000003757 reverse transcription PCR Methods 0.000 description 2
- 230000007017 scission Effects 0.000 description 2
- 108010069117 seryl-lysyl-aspartic acid Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 239000002904 solvent Substances 0.000 description 2
- 241000894007 species Species 0.000 description 2
- 239000006228 supernatant Substances 0.000 description 2
- 108091035539 telomere Proteins 0.000 description 2
- 102000055501 telomere Human genes 0.000 description 2
- UEUXEKPTXMALOB-UHFFFAOYSA-J tetrasodium;2-[2-[bis(carboxylatomethyl)amino]ethyl-(carboxylatomethyl)amino]acetate Chemical compound [Na+].[Na+].[Na+].[Na+].[O-]C(=O)CN(CC([O-])=O)CCN(CC([O-])=O)CC([O-])=O UEUXEKPTXMALOB-UHFFFAOYSA-J 0.000 description 2
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 2
- VZCYOOQTPOCHFL-UHFFFAOYSA-N trans-butenedioic acid Natural products OC(=O)C=CC(O)=O VZCYOOQTPOCHFL-UHFFFAOYSA-N 0.000 description 2
- 230000014621 translational initiation Effects 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 241000701161 unidentified adenovirus Species 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- DQJCDTNMLBYVAY-ZXXIYAEKSA-N (2S,5R,10R,13R)-16-{[(2R,3S,4R,5R)-3-{[(2S,3R,4R,5S,6R)-3-acetamido-4,5-dihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy}-5-(ethylamino)-6-hydroxy-2-(hydroxymethyl)oxan-4-yl]oxy}-5-(4-aminobutyl)-10-carbamoyl-2,13-dimethyl-4,7,12,15-tetraoxo-3,6,11,14-tetraazaheptadecan-1-oic acid Chemical compound NCCCC[C@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@@H](C)NC(=O)C(C)O[C@@H]1[C@@H](NCC)C(O)O[C@H](CO)[C@H]1O[C@H]1[C@H](NC(C)=O)[C@@H](O)[C@H](O)[C@@H](CO)O1 DQJCDTNMLBYVAY-ZXXIYAEKSA-N 0.000 description 1
- WXPZDDCNKXMOMC-AVGNSLFASA-N (2s)-1-[(2s)-2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]-5-(diaminomethylideneamino)pentanoyl]pyrrolidine-2-carboxylic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N1[C@H](C(O)=O)CCC1 WXPZDDCNKXMOMC-AVGNSLFASA-N 0.000 description 1
- PKOHVHWNGUHYRE-ZFWWWQNUSA-N (2s)-1-[2-[[(2s)-2-amino-3-(1h-indol-3-yl)propanoyl]amino]acetyl]pyrrolidine-2-carboxylic acid Chemical compound O=C([C@H](CC=1C2=CC=CC=C2NC=1)N)NCC(=O)N1CCC[C@H]1C(O)=O PKOHVHWNGUHYRE-ZFWWWQNUSA-N 0.000 description 1
- AXFMEGAFCUULFV-BLFANLJRSA-N (2s)-2-[[(2s)-1-[(2s,3r)-2-amino-3-methylpentanoyl]pyrrolidine-2-carbonyl]amino]pentanedioic acid Chemical compound CC[C@@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AXFMEGAFCUULFV-BLFANLJRSA-N 0.000 description 1
- VLAFRQCSFRYCLC-FXQIFTODSA-N (2s)-2-[[(2s)-2-[[2-[[(2s)-2-aminopropanoyl]amino]acetyl]amino]-3-hydroxypropanoyl]amino]pentanedioic acid Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O VLAFRQCSFRYCLC-FXQIFTODSA-N 0.000 description 1
- NNRFRJQMBSBXGO-CIUDSAMLSA-N (3s)-3-[[2-[[(2s)-2-amino-5-(diaminomethylideneamino)pentanoyl]amino]acetyl]amino]-4-[[(1s)-1-carboxy-2-hydroxyethyl]amino]-4-oxobutanoic acid Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NNRFRJQMBSBXGO-CIUDSAMLSA-N 0.000 description 1
- NPWMTBZSRRLQNJ-VKHMYHEASA-N (3s)-3-aminopiperidine-2,6-dione Chemical compound N[C@H]1CCC(=O)NC1=O NPWMTBZSRRLQNJ-VKHMYHEASA-N 0.000 description 1
- HGHOBRRUMWJWCU-FXQIFTODSA-N (4s)-4-[[(2s)-2-aminopropanoyl]amino]-5-[[(2s)-3-carboxy-1-(carboxymethylamino)-1-oxopropan-2-yl]amino]-5-oxopentanoic acid Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O HGHOBRRUMWJWCU-FXQIFTODSA-N 0.000 description 1
- 125000006526 (C1-C2) alkyl group Chemical group 0.000 description 1
- BJEPYKJPYRNKOW-REOHCLBHSA-N (S)-malic acid Chemical compound OC(=O)[C@@H](O)CC(O)=O BJEPYKJPYRNKOW-REOHCLBHSA-N 0.000 description 1
- BDNKZNFMNDZQMI-UHFFFAOYSA-N 1,3-diisopropylcarbodiimide Chemical compound CC(C)N=C=NC(C)C BDNKZNFMNDZQMI-UHFFFAOYSA-N 0.000 description 1
- YQTCQNIPQMJNTI-UHFFFAOYSA-N 2,2-dimethylpropan-1-one Chemical group CC(C)(C)[C]=O YQTCQNIPQMJNTI-UHFFFAOYSA-N 0.000 description 1
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- SCPRYBYMKVYVND-UHFFFAOYSA-N 2-[[2-[[1-(2-amino-4-methylpentanoyl)pyrrolidine-2-carbonyl]amino]-4-methylpentanoyl]amino]-4-methylpentanoic acid Chemical compound CC(C)CC(N)C(=O)N1CCCC1C(=O)NC(CC(C)C)C(=O)NC(CC(C)C)C(O)=O SCPRYBYMKVYVND-UHFFFAOYSA-N 0.000 description 1
- QMOQBVOBWVNSNO-UHFFFAOYSA-N 2-[[2-[[2-[(2-azaniumylacetyl)amino]acetyl]amino]acetyl]amino]acetate Chemical compound NCC(=O)NCC(=O)NCC(=O)NCC(O)=O QMOQBVOBWVNSNO-UHFFFAOYSA-N 0.000 description 1
- XWTNPSHCJMZAHQ-QMMMGPOBSA-N 2-[[2-[[2-[[(2s)-2-amino-4-methylpentanoyl]amino]acetyl]amino]acetyl]amino]acetic acid Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(=O)NCC(O)=O XWTNPSHCJMZAHQ-QMMMGPOBSA-N 0.000 description 1
- 125000000094 2-phenylethyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])([H])* 0.000 description 1
- BMYNFMYTOJXKLE-UHFFFAOYSA-N 3-azaniumyl-2-hydroxypropanoate Chemical compound NCC(O)C(O)=O BMYNFMYTOJXKLE-UHFFFAOYSA-N 0.000 description 1
- OSWFIVFLDKOXQC-UHFFFAOYSA-N 4-(3-methoxyphenyl)aniline Chemical compound COC1=CC=CC(C=2C=CC(N)=CC=2)=C1 OSWFIVFLDKOXQC-UHFFFAOYSA-N 0.000 description 1
- IMIZPWSVYADSCN-UHFFFAOYSA-N 4-methyl-2-[[4-methyl-2-[[4-methyl-2-(pyrrolidine-2-carbonylamino)pentanoyl]amino]pentanoyl]amino]pentanoic acid Chemical compound CC(C)CC(C(O)=O)NC(=O)C(CC(C)C)NC(=O)C(CC(C)C)NC(=O)C1CCCN1 IMIZPWSVYADSCN-UHFFFAOYSA-N 0.000 description 1
- 229920000936 Agarose Polymers 0.000 description 1
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 1
- ZFXQNADNEBRERM-BJDJZHNGSA-N Ala-Ala-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ZFXQNADNEBRERM-BJDJZHNGSA-N 0.000 description 1
- ODWSTKXGQGYHSH-FXQIFTODSA-N Ala-Arg-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O ODWSTKXGQGYHSH-FXQIFTODSA-N 0.000 description 1
- SDMAQFGBPOJFOM-GUBZILKMSA-N Ala-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SDMAQFGBPOJFOM-GUBZILKMSA-N 0.000 description 1
- DVWVZSJAYIJZFI-FXQIFTODSA-N Ala-Arg-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O DVWVZSJAYIJZFI-FXQIFTODSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- SVBXIUDNTRTKHE-CIUDSAMLSA-N Ala-Arg-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O SVBXIUDNTRTKHE-CIUDSAMLSA-N 0.000 description 1
- SKHCUBQVZJHOFM-NAKRPEOUSA-N Ala-Arg-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SKHCUBQVZJHOFM-NAKRPEOUSA-N 0.000 description 1
- YWWATNIVMOCSAV-UBHSHLNASA-N Ala-Arg-Phe Chemical compound NC(=N)NCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YWWATNIVMOCSAV-UBHSHLNASA-N 0.000 description 1
- XEXJJJRVTFGWIC-FXQIFTODSA-N Ala-Asn-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XEXJJJRVTFGWIC-FXQIFTODSA-N 0.000 description 1
- LBJYAILUMSUTAM-ZLUOBGJFSA-N Ala-Asn-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LBJYAILUMSUTAM-ZLUOBGJFSA-N 0.000 description 1
- YBPLKDWJFYCZSV-ZLUOBGJFSA-N Ala-Asn-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N YBPLKDWJFYCZSV-ZLUOBGJFSA-N 0.000 description 1
- PXKLCFFSVLKOJM-ACZMJKKPSA-N Ala-Asn-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXKLCFFSVLKOJM-ACZMJKKPSA-N 0.000 description 1
- SHYYAQLDNVHPFT-DLOVCJGASA-N Ala-Asn-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SHYYAQLDNVHPFT-DLOVCJGASA-N 0.000 description 1
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- GSCLWXDNIMNIJE-ZLUOBGJFSA-N Ala-Asp-Asn Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GSCLWXDNIMNIJE-ZLUOBGJFSA-N 0.000 description 1
- WQVYAWIMAWTGMW-ZLUOBGJFSA-N Ala-Asp-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N WQVYAWIMAWTGMW-ZLUOBGJFSA-N 0.000 description 1
- KUDREHRZRIVKHS-UWJYBYFXSA-N Ala-Asp-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KUDREHRZRIVKHS-UWJYBYFXSA-N 0.000 description 1
- IKKVASZHTMKJIR-ZKWXMUAHSA-N Ala-Asp-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IKKVASZHTMKJIR-ZKWXMUAHSA-N 0.000 description 1
- NJIFPLAJSVUQOZ-JBDRJPRFSA-N Ala-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C)N NJIFPLAJSVUQOZ-JBDRJPRFSA-N 0.000 description 1
- CXZFXHGJJPVUJE-CIUDSAMLSA-N Ala-Cys-Leu Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O)N CXZFXHGJJPVUJE-CIUDSAMLSA-N 0.000 description 1
- IYCZBJXFSZSHPN-DLOVCJGASA-N Ala-Cys-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IYCZBJXFSZSHPN-DLOVCJGASA-N 0.000 description 1
- OILNWMNBLIHXQK-ZLUOBGJFSA-N Ala-Cys-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O OILNWMNBLIHXQK-ZLUOBGJFSA-N 0.000 description 1
- YEELWQSXYBJVSV-UWJYBYFXSA-N Ala-Cys-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YEELWQSXYBJVSV-UWJYBYFXSA-N 0.000 description 1
- CXQODNIBUNQWAS-CIUDSAMLSA-N Ala-Gln-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CXQODNIBUNQWAS-CIUDSAMLSA-N 0.000 description 1
- NKJBKNVQHBZUIX-ACZMJKKPSA-N Ala-Gln-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKJBKNVQHBZUIX-ACZMJKKPSA-N 0.000 description 1
- FVSOUJZKYWEFOB-KBIXCLLPSA-N Ala-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](C)N FVSOUJZKYWEFOB-KBIXCLLPSA-N 0.000 description 1
- CRWFEKLFPVRPBV-CIUDSAMLSA-N Ala-Gln-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O CRWFEKLFPVRPBV-CIUDSAMLSA-N 0.000 description 1
- JPGBXANAQYHTLA-DRZSPHRISA-N Ala-Gln-Phe Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JPGBXANAQYHTLA-DRZSPHRISA-N 0.000 description 1
- MVBWLRJESQOQTM-ACZMJKKPSA-N Ala-Gln-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O MVBWLRJESQOQTM-ACZMJKKPSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- ZDYNWWQXFRUOEO-XDTLVQLUSA-N Ala-Gln-Tyr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZDYNWWQXFRUOEO-XDTLVQLUSA-N 0.000 description 1
- YIGLXQRFQVWFEY-NRPADANISA-N Ala-Gln-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O YIGLXQRFQVWFEY-NRPADANISA-N 0.000 description 1
- KXEVYGKATAMXJJ-ACZMJKKPSA-N Ala-Glu-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KXEVYGKATAMXJJ-ACZMJKKPSA-N 0.000 description 1
- BVSGPHDECMJBDE-HGNGGELXSA-N Ala-Glu-His Chemical compound C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BVSGPHDECMJBDE-HGNGGELXSA-N 0.000 description 1
- NHLAEBFGWPXFGI-WHFBIAKZSA-N Ala-Gly-Asn Chemical compound C[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)N)C(=O)O)N NHLAEBFGWPXFGI-WHFBIAKZSA-N 0.000 description 1
- MPLOSMWGDNJSEV-WHFBIAKZSA-N Ala-Gly-Asp Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O MPLOSMWGDNJSEV-WHFBIAKZSA-N 0.000 description 1
- LJFNNUBZSZCZFN-WHFBIAKZSA-N Ala-Gly-Cys Chemical compound N[C@@H](C)C(=O)NCC(=O)N[C@@H](CS)C(=O)O LJFNNUBZSZCZFN-WHFBIAKZSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 1
- CWEAKSWWKHGTRJ-BQBZGAKWSA-N Ala-Gly-Met Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O CWEAKSWWKHGTRJ-BQBZGAKWSA-N 0.000 description 1
- NIZKGBJVCMRDKO-KWQFWETISA-N Ala-Gly-Tyr Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NIZKGBJVCMRDKO-KWQFWETISA-N 0.000 description 1
- ZPXCNXMJEZKRLU-LSJOCFKGSA-N Ala-His-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CN=CN1 ZPXCNXMJEZKRLU-LSJOCFKGSA-N 0.000 description 1
- KMGOBAQSCKTBGD-DLOVCJGASA-N Ala-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CC1=CN=CN1 KMGOBAQSCKTBGD-DLOVCJGASA-N 0.000 description 1
- CBCCCLMNOBLBSC-XVYDVKMFSA-N Ala-His-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CBCCCLMNOBLBSC-XVYDVKMFSA-N 0.000 description 1
- NJWJSLCQEDMGNC-MBLNEYKQSA-N Ala-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C)N)O NJWJSLCQEDMGNC-MBLNEYKQSA-N 0.000 description 1
- HQJKCXHQNUCKMY-GHCJXIJMSA-N Ala-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C)N HQJKCXHQNUCKMY-GHCJXIJMSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- OKIKVSXTXVVFDV-MMWGEVLESA-N Ala-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N OKIKVSXTXVVFDV-MMWGEVLESA-N 0.000 description 1
- VNYMOTCMNHJGTG-JBDRJPRFSA-N Ala-Ile-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O VNYMOTCMNHJGTG-JBDRJPRFSA-N 0.000 description 1
- LXAARTARZJJCMB-CIQUZCHMSA-N Ala-Ile-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LXAARTARZJJCMB-CIQUZCHMSA-N 0.000 description 1
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 1
- GHBSKQGCIYSCNS-NAKRPEOUSA-N Ala-Leu-Asp-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O GHBSKQGCIYSCNS-NAKRPEOUSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- WUHJHHGYVVJMQE-BJDJZHNGSA-N Ala-Leu-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WUHJHHGYVVJMQE-BJDJZHNGSA-N 0.000 description 1
- ZKEHTYWGPMMGBC-XUXIUFHCSA-N Ala-Leu-Leu-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O ZKEHTYWGPMMGBC-XUXIUFHCSA-N 0.000 description 1
- LDLSENBXQNDTPB-DCAQKATOSA-N Ala-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N LDLSENBXQNDTPB-DCAQKATOSA-N 0.000 description 1
- SDZRIBWEVVRDQI-CIUDSAMLSA-N Ala-Lys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O SDZRIBWEVVRDQI-CIUDSAMLSA-N 0.000 description 1
- MFMDKJIPHSWSBM-GUBZILKMSA-N Ala-Lys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFMDKJIPHSWSBM-GUBZILKMSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- CHFFHQUVXHEGBY-GARJFASQSA-N Ala-Lys-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N CHFFHQUVXHEGBY-GARJFASQSA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- OINVDEKBKBCPLX-JXUBOQSCSA-N Ala-Lys-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OINVDEKBKBCPLX-JXUBOQSCSA-N 0.000 description 1
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 1
- RAAWHFXHAACDFT-FXQIFTODSA-N Ala-Met-Asn Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CC(N)=O)C(O)=O RAAWHFXHAACDFT-FXQIFTODSA-N 0.000 description 1
- PVQLRJRPUTXFFX-CIUDSAMLSA-N Ala-Met-Gln Chemical compound CSCC[C@H](NC(=O)[C@H](C)N)C(=O)N[C@@H](CCC(N)=O)C(O)=O PVQLRJRPUTXFFX-CIUDSAMLSA-N 0.000 description 1
- DWYROCSXOOMOEU-CIUDSAMLSA-N Ala-Met-Glu Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N DWYROCSXOOMOEU-CIUDSAMLSA-N 0.000 description 1
- XSTZMVAYYCJTNR-DCAQKATOSA-N Ala-Met-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XSTZMVAYYCJTNR-DCAQKATOSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- BDQNLQSWRAPHGU-DLOVCJGASA-N Ala-Phe-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N BDQNLQSWRAPHGU-DLOVCJGASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- DXTYEWAQOXYRHZ-KKXDTOCCSA-N Ala-Phe-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N DXTYEWAQOXYRHZ-KKXDTOCCSA-N 0.000 description 1
- BTRULDJUUVGRNE-DCAQKATOSA-N Ala-Pro-Lys Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(O)=O BTRULDJUUVGRNE-DCAQKATOSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- FFZJHQODAYHGPO-KZVJFYERSA-N Ala-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C)N FFZJHQODAYHGPO-KZVJFYERSA-N 0.000 description 1
- JNLDTVRGXMSYJC-UVBJJODRSA-N Ala-Pro-Trp Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O JNLDTVRGXMSYJC-UVBJJODRSA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- YYAVDNKUWLAFCV-ACZMJKKPSA-N Ala-Ser-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O YYAVDNKUWLAFCV-ACZMJKKPSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- MMLHRUJLOUSRJX-CIUDSAMLSA-N Ala-Ser-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN MMLHRUJLOUSRJX-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- SYIFFFHSXBNPMC-UWJYBYFXSA-N Ala-Ser-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N SYIFFFHSXBNPMC-UWJYBYFXSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- YNOCMHZSWJMGBB-GCJQMDKQSA-N Ala-Thr-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O YNOCMHZSWJMGBB-GCJQMDKQSA-N 0.000 description 1
- QKHWNPQNOHEFST-VZFHVOOUSA-N Ala-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N)O QKHWNPQNOHEFST-VZFHVOOUSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- SAHQGRZIQVEJPF-JXUBOQSCSA-N Ala-Thr-Lys Chemical compound C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN SAHQGRZIQVEJPF-JXUBOQSCSA-N 0.000 description 1
- QOIGKCBMXUCDQU-KDXUFGMBSA-N Ala-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C)N)O QOIGKCBMXUCDQU-KDXUFGMBSA-N 0.000 description 1
- KTXKIYXZQFWJKB-VZFHVOOUSA-N Ala-Thr-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O KTXKIYXZQFWJKB-VZFHVOOUSA-N 0.000 description 1
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 1
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 1
- PXAFZDXYEIIUTF-LKTVYLICSA-N Ala-Trp-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O PXAFZDXYEIIUTF-LKTVYLICSA-N 0.000 description 1
- VQBULXOHAZSTQY-GKCIPKSASA-N Ala-Trp-Phe Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VQBULXOHAZSTQY-GKCIPKSASA-N 0.000 description 1
- PGNNQOJOEGFAOR-KWQFWETISA-N Ala-Tyr-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 PGNNQOJOEGFAOR-KWQFWETISA-N 0.000 description 1
- QRIYOHQJRDHFKF-UWJYBYFXSA-N Ala-Tyr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=C(O)C=C1 QRIYOHQJRDHFKF-UWJYBYFXSA-N 0.000 description 1
- MUGAESARFRGOTQ-IGNZVWTISA-N Ala-Tyr-Tyr Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N MUGAESARFRGOTQ-IGNZVWTISA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- IYKVSFNGSWTTNZ-GUBZILKMSA-N Ala-Val-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IYKVSFNGSWTTNZ-GUBZILKMSA-N 0.000 description 1
- CLOMBHBBUKAUBP-LSJOCFKGSA-N Ala-Val-His Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N CLOMBHBBUKAUBP-LSJOCFKGSA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- YFWTXMRJJDNTLM-LSJOCFKGSA-N Arg-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YFWTXMRJJDNTLM-LSJOCFKGSA-N 0.000 description 1
- VWVPYNGMOCSSGK-GUBZILKMSA-N Arg-Arg-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O VWVPYNGMOCSSGK-GUBZILKMSA-N 0.000 description 1
- KJGNDQCYBNBXDA-GUBZILKMSA-N Arg-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N KJGNDQCYBNBXDA-GUBZILKMSA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- HJVGMOYJDDXLMI-AVGNSLFASA-N Arg-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@@H](N)CCCNC(N)=N HJVGMOYJDDXLMI-AVGNSLFASA-N 0.000 description 1
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 1
- YUIGJDNAGKJLDO-JYJNAYRXSA-N Arg-Arg-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YUIGJDNAGKJLDO-JYJNAYRXSA-N 0.000 description 1
- DPXDVGDLWJYZBH-GUBZILKMSA-N Arg-Asn-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DPXDVGDLWJYZBH-GUBZILKMSA-N 0.000 description 1
- USNSOPDIZILSJP-FXQIFTODSA-N Arg-Asn-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O USNSOPDIZILSJP-FXQIFTODSA-N 0.000 description 1
- RVDVDRUZWZIBJQ-CIUDSAMLSA-N Arg-Asn-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O RVDVDRUZWZIBJQ-CIUDSAMLSA-N 0.000 description 1
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 1
- RWCLSUOSKWTXLA-FXQIFTODSA-N Arg-Asp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O RWCLSUOSKWTXLA-FXQIFTODSA-N 0.000 description 1
- PQWTZSNVWSOFFK-FXQIFTODSA-N Arg-Asp-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)CN=C(N)N PQWTZSNVWSOFFK-FXQIFTODSA-N 0.000 description 1
- NTAZNGWBXRVEDJ-FXQIFTODSA-N Arg-Asp-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NTAZNGWBXRVEDJ-FXQIFTODSA-N 0.000 description 1
- OZNSCVPYWZRQPY-CIUDSAMLSA-N Arg-Asp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O OZNSCVPYWZRQPY-CIUDSAMLSA-N 0.000 description 1
- KMSHNDWHPWXPEC-BQBZGAKWSA-N Arg-Asp-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O KMSHNDWHPWXPEC-BQBZGAKWSA-N 0.000 description 1
- ASQYTJJWAMDISW-BPUTZDHNSA-N Arg-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N ASQYTJJWAMDISW-BPUTZDHNSA-N 0.000 description 1
- FBLMOFHNVQBKRR-IHRRRGAJSA-N Arg-Asp-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FBLMOFHNVQBKRR-IHRRRGAJSA-N 0.000 description 1
- VXXHDZKEQNGXNU-QXEWZRGKSA-N Arg-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N VXXHDZKEQNGXNU-QXEWZRGKSA-N 0.000 description 1
- DQNLFLGFZAUIOW-FXQIFTODSA-N Arg-Cys-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O DQNLFLGFZAUIOW-FXQIFTODSA-N 0.000 description 1
- GDVDRMUYICMNFJ-CIUDSAMLSA-N Arg-Cys-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O GDVDRMUYICMNFJ-CIUDSAMLSA-N 0.000 description 1
- BBYTXXRNSFUOOX-IHRRRGAJSA-N Arg-Cys-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BBYTXXRNSFUOOX-IHRRRGAJSA-N 0.000 description 1
- YHSNASXGBPAHRL-BPUTZDHNSA-N Arg-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N YHSNASXGBPAHRL-BPUTZDHNSA-N 0.000 description 1
- QQJSJIBESHAJPM-IHRRRGAJSA-N Arg-Cys-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QQJSJIBESHAJPM-IHRRRGAJSA-N 0.000 description 1
- PTVGLOCPAVYPFG-CIUDSAMLSA-N Arg-Gln-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O PTVGLOCPAVYPFG-CIUDSAMLSA-N 0.000 description 1
- BJNUAWGXPSHQMJ-DCAQKATOSA-N Arg-Gln-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O BJNUAWGXPSHQMJ-DCAQKATOSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- QAODJPUKWNNNRP-DCAQKATOSA-N Arg-Glu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QAODJPUKWNNNRP-DCAQKATOSA-N 0.000 description 1
- XLWSGICNBZGYTA-CIUDSAMLSA-N Arg-Glu-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O XLWSGICNBZGYTA-CIUDSAMLSA-N 0.000 description 1
- NYZGVTGOMPHSJW-CIUDSAMLSA-N Arg-Glu-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N NYZGVTGOMPHSJW-CIUDSAMLSA-N 0.000 description 1
- MZRBYBIQTIKERR-GUBZILKMSA-N Arg-Glu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MZRBYBIQTIKERR-GUBZILKMSA-N 0.000 description 1
- QAXCZGMLVICQKS-SRVKXCTJSA-N Arg-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QAXCZGMLVICQKS-SRVKXCTJSA-N 0.000 description 1
- NKBQZKVMKJJDLX-SRVKXCTJSA-N Arg-Glu-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NKBQZKVMKJJDLX-SRVKXCTJSA-N 0.000 description 1
- SKTGPBFTMNLIHQ-KKUMJFAQSA-N Arg-Glu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SKTGPBFTMNLIHQ-KKUMJFAQSA-N 0.000 description 1
- UFBURHXMKFQVLM-CIUDSAMLSA-N Arg-Glu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UFBURHXMKFQVLM-CIUDSAMLSA-N 0.000 description 1
- PNIGSVZJNVUVJA-BQBZGAKWSA-N Arg-Gly-Asn Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O PNIGSVZJNVUVJA-BQBZGAKWSA-N 0.000 description 1
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 1
- PPPXVIBMLFWNSK-BQBZGAKWSA-N Arg-Gly-Cys Chemical compound C(C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)CN=C(N)N PPPXVIBMLFWNSK-BQBZGAKWSA-N 0.000 description 1
- WVNFNPGXYADPPO-BQBZGAKWSA-N Arg-Gly-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O WVNFNPGXYADPPO-BQBZGAKWSA-N 0.000 description 1
- KRQSPVKUISQQFS-FJXKBIBVSA-N Arg-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N KRQSPVKUISQQFS-FJXKBIBVSA-N 0.000 description 1
- ZZZWQALDSQQBEW-STQMWFEESA-N Arg-Gly-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ZZZWQALDSQQBEW-STQMWFEESA-N 0.000 description 1
- SLNCSSWAIDUUGF-LSJOCFKGSA-N Arg-His-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O SLNCSSWAIDUUGF-LSJOCFKGSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- MMGCRPZQZWTZTA-IHRRRGAJSA-N Arg-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N MMGCRPZQZWTZTA-IHRRRGAJSA-N 0.000 description 1
- ZJEDSBGPBXVBMP-PYJNHQTQSA-N Arg-His-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJEDSBGPBXVBMP-PYJNHQTQSA-N 0.000 description 1
- RKQRHMKFNBYOTN-IHRRRGAJSA-N Arg-His-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N RKQRHMKFNBYOTN-IHRRRGAJSA-N 0.000 description 1
- GFMWTFHOZGLTLC-AVGNSLFASA-N Arg-His-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O GFMWTFHOZGLTLC-AVGNSLFASA-N 0.000 description 1
- IRRMIGDCPOPZJW-ULQDDVLXSA-N Arg-His-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IRRMIGDCPOPZJW-ULQDDVLXSA-N 0.000 description 1
- FRMQITGHXMUNDF-GMOBBJLQSA-N Arg-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FRMQITGHXMUNDF-GMOBBJLQSA-N 0.000 description 1
- HCIUUZGFTDTEGM-NAKRPEOUSA-N Arg-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HCIUUZGFTDTEGM-NAKRPEOUSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- AGVNTAUPLWIQEN-ZPFDUUQYSA-N Arg-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AGVNTAUPLWIQEN-ZPFDUUQYSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- OFIYLHVAAJYRBC-HJWJTTGWSA-N Arg-Ile-Phe Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](Cc1ccccc1)C(O)=O OFIYLHVAAJYRBC-HJWJTTGWSA-N 0.000 description 1
- GNYUVVJYGJFKHN-RVMXOQNASA-N Arg-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N GNYUVVJYGJFKHN-RVMXOQNASA-N 0.000 description 1
- HJDNZFIYILEIKR-OSUNSFLBSA-N Arg-Ile-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HJDNZFIYILEIKR-OSUNSFLBSA-N 0.000 description 1
- FNXCAFKDGBROCU-STECZYCISA-N Arg-Ile-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 FNXCAFKDGBROCU-STECZYCISA-N 0.000 description 1
- LLUGJARLJCGLAR-CYDGBPFRSA-N Arg-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LLUGJARLJCGLAR-CYDGBPFRSA-N 0.000 description 1
- NIUDXSFNLBIWOB-DCAQKATOSA-N Arg-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NIUDXSFNLBIWOB-DCAQKATOSA-N 0.000 description 1
- NOZYDJOPOGKUSR-AVGNSLFASA-N Arg-Leu-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O NOZYDJOPOGKUSR-AVGNSLFASA-N 0.000 description 1
- IIAXFBUTKIDDIP-ULQDDVLXSA-N Arg-Leu-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O IIAXFBUTKIDDIP-ULQDDVLXSA-N 0.000 description 1
- MJINRRBEMOLJAK-DCAQKATOSA-N Arg-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N MJINRRBEMOLJAK-DCAQKATOSA-N 0.000 description 1
- RIIVUOJDDQXHRV-SRVKXCTJSA-N Arg-Lys-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O RIIVUOJDDQXHRV-SRVKXCTJSA-N 0.000 description 1
- CVXXSWQORBZAAA-SRVKXCTJSA-N Arg-Lys-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N CVXXSWQORBZAAA-SRVKXCTJSA-N 0.000 description 1
- MTYLORHAQXVQOW-AVGNSLFASA-N Arg-Lys-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O MTYLORHAQXVQOW-AVGNSLFASA-N 0.000 description 1
- XUGATJVGQUGQKY-ULQDDVLXSA-N Arg-Lys-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XUGATJVGQUGQKY-ULQDDVLXSA-N 0.000 description 1
- HIMXTOIXVXWHTB-DCAQKATOSA-N Arg-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N HIMXTOIXVXWHTB-DCAQKATOSA-N 0.000 description 1
- VVJTWSRNMJNDPN-IUCAKERBSA-N Arg-Met-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O VVJTWSRNMJNDPN-IUCAKERBSA-N 0.000 description 1
- ZEBDYGZVMMKZNB-SRVKXCTJSA-N Arg-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCCN=C(N)N)N ZEBDYGZVMMKZNB-SRVKXCTJSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- FKQITMVNILRUCQ-IHRRRGAJSA-N Arg-Phe-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O FKQITMVNILRUCQ-IHRRRGAJSA-N 0.000 description 1
- VEAIMHJZTIDCIH-KKUMJFAQSA-N Arg-Phe-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEAIMHJZTIDCIH-KKUMJFAQSA-N 0.000 description 1
- DPLFNLDACGGBAK-KKUMJFAQSA-N Arg-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N DPLFNLDACGGBAK-KKUMJFAQSA-N 0.000 description 1
- AOHKLEBWKMKITA-IHRRRGAJSA-N Arg-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N AOHKLEBWKMKITA-IHRRRGAJSA-N 0.000 description 1
- UIUXXFIKWQVMEX-UFYCRDLUSA-N Arg-Phe-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O UIUXXFIKWQVMEX-UFYCRDLUSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- OVQJAKFLFTZDNC-GUBZILKMSA-N Arg-Pro-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O OVQJAKFLFTZDNC-GUBZILKMSA-N 0.000 description 1
- FOQFHANLUJDQEE-GUBZILKMSA-N Arg-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CS)C(=O)O FOQFHANLUJDQEE-GUBZILKMSA-N 0.000 description 1
- STHNZYKCJHWULY-AVGNSLFASA-N Arg-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCCN=C(N)N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O STHNZYKCJHWULY-AVGNSLFASA-N 0.000 description 1
- FVBZXNSRIDVYJS-AVGNSLFASA-N Arg-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N FVBZXNSRIDVYJS-AVGNSLFASA-N 0.000 description 1
- ISJWBVIYRBAXEB-CIUDSAMLSA-N Arg-Ser-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O ISJWBVIYRBAXEB-CIUDSAMLSA-N 0.000 description 1
- JQHASVQBAKRJKD-GUBZILKMSA-N Arg-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N JQHASVQBAKRJKD-GUBZILKMSA-N 0.000 description 1
- JPAWCMXVNZPJLO-IHRRRGAJSA-N Arg-Ser-Phe Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JPAWCMXVNZPJLO-IHRRRGAJSA-N 0.000 description 1
- LRPZJPMQGKGHSG-XGEHTFHBSA-N Arg-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCCN=C(N)N)N)O LRPZJPMQGKGHSG-XGEHTFHBSA-N 0.000 description 1
- ASQKVGRCKOFKIU-KZVJFYERSA-N Arg-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O ASQKVGRCKOFKIU-KZVJFYERSA-N 0.000 description 1
- WCZXPVPHUMYLMS-VEVYYDQMSA-N Arg-Thr-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O WCZXPVPHUMYLMS-VEVYYDQMSA-N 0.000 description 1
- HRCIIMCTUIAKQB-XGEHTFHBSA-N Arg-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O HRCIIMCTUIAKQB-XGEHTFHBSA-N 0.000 description 1
- AUZAXCPWMDBWEE-HJGDQZAQSA-N Arg-Thr-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O AUZAXCPWMDBWEE-HJGDQZAQSA-N 0.000 description 1
- KSHJMDSNSKDJPU-QTKMDUPCSA-N Arg-Thr-His Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 KSHJMDSNSKDJPU-QTKMDUPCSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- MOGMYRUNTKYZFB-UNQGMJICSA-N Arg-Thr-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MOGMYRUNTKYZFB-UNQGMJICSA-N 0.000 description 1
- WTFIFQWLQXZLIZ-UMPQAUOISA-N Arg-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O WTFIFQWLQXZLIZ-UMPQAUOISA-N 0.000 description 1
- OGZBJJLRKQZRHL-KJEVXHAQSA-N Arg-Thr-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OGZBJJLRKQZRHL-KJEVXHAQSA-N 0.000 description 1
- DRDWXKWUSIKKOB-PJODQICGSA-N Arg-Trp-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C)C(O)=O DRDWXKWUSIKKOB-PJODQICGSA-N 0.000 description 1
- FSPQNLYOFCXUCE-BPUTZDHNSA-N Arg-Trp-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N FSPQNLYOFCXUCE-BPUTZDHNSA-N 0.000 description 1
- XOZYYXMHMIEJET-XIRDDKMYSA-N Arg-Trp-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(O)=O)C(O)=O XOZYYXMHMIEJET-XIRDDKMYSA-N 0.000 description 1
- YHZQOSXDTFRZKU-WDSOQIARSA-N Arg-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 YHZQOSXDTFRZKU-WDSOQIARSA-N 0.000 description 1
- LOVIQNMIPQVIGT-BVSLBCMMSA-N Arg-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCCN=C(N)N)N)C(O)=O)C1=CC=CC=C1 LOVIQNMIPQVIGT-BVSLBCMMSA-N 0.000 description 1
- ZCSHHTFOZULVLN-SZMVWBNQSA-N Arg-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CCCN=C(N)N)=CNC2=C1 ZCSHHTFOZULVLN-SZMVWBNQSA-N 0.000 description 1
- QMQZYILAWUOLPV-JYJNAYRXSA-N Arg-Tyr-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)CC1=CC=C(O)C=C1 QMQZYILAWUOLPV-JYJNAYRXSA-N 0.000 description 1
- NMTANZXPDAHUKU-ULQDDVLXSA-N Arg-Tyr-Lys Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCCCN)C(O)=O)CC1=CC=C(O)C=C1 NMTANZXPDAHUKU-ULQDDVLXSA-N 0.000 description 1
- IZSMEUDYADKZTJ-KJEVXHAQSA-N Arg-Tyr-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IZSMEUDYADKZTJ-KJEVXHAQSA-N 0.000 description 1
- CNBIWSCSSCAINS-UFYCRDLUSA-N Arg-Tyr-Tyr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O CNBIWSCSSCAINS-UFYCRDLUSA-N 0.000 description 1
- XMZZGVGKGXRIGJ-JYJNAYRXSA-N Arg-Tyr-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O XMZZGVGKGXRIGJ-JYJNAYRXSA-N 0.000 description 1
- ULBHWNVWSCJLCO-NHCYSSNCSA-N Arg-Val-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N ULBHWNVWSCJLCO-NHCYSSNCSA-N 0.000 description 1
- XEOXPCNONWHHSW-AVGNSLFASA-N Arg-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N XEOXPCNONWHHSW-AVGNSLFASA-N 0.000 description 1
- FMYQECOAIFGQGU-CYDGBPFRSA-N Arg-Val-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FMYQECOAIFGQGU-CYDGBPFRSA-N 0.000 description 1
- JWCCFNZJIRZUCL-AVGNSLFASA-N Arg-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N JWCCFNZJIRZUCL-AVGNSLFASA-N 0.000 description 1
- WHLDJYNHXOMGMU-JYJNAYRXSA-N Arg-Val-Tyr Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 WHLDJYNHXOMGMU-JYJNAYRXSA-N 0.000 description 1
- ANAHQDPQQBDOBM-UHFFFAOYSA-N Arg-Val-Tyr Natural products CC(C)C(NC(=O)C(N)CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O ANAHQDPQQBDOBM-UHFFFAOYSA-N 0.000 description 1
- YNDLOUMBVDVALC-ZLUOBGJFSA-N Asn-Ala-Ala Chemical compound C[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC(=O)N)N YNDLOUMBVDVALC-ZLUOBGJFSA-N 0.000 description 1
- PFOYSEIHFVKHNF-FXQIFTODSA-N Asn-Ala-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PFOYSEIHFVKHNF-FXQIFTODSA-N 0.000 description 1
- BRCVLJZIIFBSPF-ZLUOBGJFSA-N Asn-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N BRCVLJZIIFBSPF-ZLUOBGJFSA-N 0.000 description 1
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 1
- QQEWINYJRFBLNN-DLOVCJGASA-N Asn-Ala-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QQEWINYJRFBLNN-DLOVCJGASA-N 0.000 description 1
- VDCIPFYVCICPEC-FXQIFTODSA-N Asn-Arg-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O VDCIPFYVCICPEC-FXQIFTODSA-N 0.000 description 1
- YJRORCOAFUZVKA-FXQIFTODSA-N Asn-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N YJRORCOAFUZVKA-FXQIFTODSA-N 0.000 description 1
- BDMIFVIWCNLDCT-CIUDSAMLSA-N Asn-Arg-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O BDMIFVIWCNLDCT-CIUDSAMLSA-N 0.000 description 1
- GXMSVVBIAMWMKO-BQBZGAKWSA-N Asn-Arg-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N GXMSVVBIAMWMKO-BQBZGAKWSA-N 0.000 description 1
- HAJWYALLJIATCX-FXQIFTODSA-N Asn-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)CN=C(N)N HAJWYALLJIATCX-FXQIFTODSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- KSBHCUSPLWRVEK-ZLUOBGJFSA-N Asn-Asn-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O KSBHCUSPLWRVEK-ZLUOBGJFSA-N 0.000 description 1
- YNSCBOUZTAGIGO-ZLUOBGJFSA-N Asn-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)C(=O)N YNSCBOUZTAGIGO-ZLUOBGJFSA-N 0.000 description 1
- RCENDENBBJFJHZ-ACZMJKKPSA-N Asn-Asn-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RCENDENBBJFJHZ-ACZMJKKPSA-N 0.000 description 1
- LJUOLNXOWSWGKF-ACZMJKKPSA-N Asn-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N LJUOLNXOWSWGKF-ACZMJKKPSA-N 0.000 description 1
- PCKRJVZAQZWNKM-WHFBIAKZSA-N Asn-Asn-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O PCKRJVZAQZWNKM-WHFBIAKZSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- DXZNJWFECGJCQR-FXQIFTODSA-N Asn-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N DXZNJWFECGJCQR-FXQIFTODSA-N 0.000 description 1
- NLCDVZJDEXIDDL-BIIVOSGPSA-N Asn-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O NLCDVZJDEXIDDL-BIIVOSGPSA-N 0.000 description 1
- ABMMIOIRQJNRHG-XKNYDFJKSA-N Asn-Asn-Pro-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O ABMMIOIRQJNRHG-XKNYDFJKSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- CUQUEHYSSFETRD-ACZMJKKPSA-N Asn-Asp-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N CUQUEHYSSFETRD-ACZMJKKPSA-N 0.000 description 1
- UGXVKHRDGLYFKR-CIUDSAMLSA-N Asn-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(N)=O UGXVKHRDGLYFKR-CIUDSAMLSA-N 0.000 description 1
- BGINHSZTXRJIPP-FXQIFTODSA-N Asn-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N BGINHSZTXRJIPP-FXQIFTODSA-N 0.000 description 1
- VYLVOMUVLMGCRF-ZLUOBGJFSA-N Asn-Asp-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O VYLVOMUVLMGCRF-ZLUOBGJFSA-N 0.000 description 1
- XQQVCUIBGYFKDC-OLHMAJIHSA-N Asn-Asp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XQQVCUIBGYFKDC-OLHMAJIHSA-N 0.000 description 1
- PAXHINASXXXILC-SRVKXCTJSA-N Asn-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)N)N)O PAXHINASXXXILC-SRVKXCTJSA-N 0.000 description 1
- IYVSIZAXNLOKFQ-BYULHYEWSA-N Asn-Asp-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IYVSIZAXNLOKFQ-BYULHYEWSA-N 0.000 description 1
- LUVODTFFSXVOAG-ACZMJKKPSA-N Asn-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N LUVODTFFSXVOAG-ACZMJKKPSA-N 0.000 description 1
- ZPMNECSEJXXNBE-CIUDSAMLSA-N Asn-Cys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O ZPMNECSEJXXNBE-CIUDSAMLSA-N 0.000 description 1
- CZIXHXIJJZLYRJ-SRVKXCTJSA-N Asn-Cys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 CZIXHXIJJZLYRJ-SRVKXCTJSA-N 0.000 description 1
- SQZIAWGBBUSSPJ-ZKWXMUAHSA-N Asn-Cys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)N)N SQZIAWGBBUSSPJ-ZKWXMUAHSA-N 0.000 description 1
- QGNXYDHVERJIAY-ACZMJKKPSA-N Asn-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N QGNXYDHVERJIAY-ACZMJKKPSA-N 0.000 description 1
- PQAIOUVVZCOLJK-FXQIFTODSA-N Asn-Gln-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N PQAIOUVVZCOLJK-FXQIFTODSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- FUHFYEKSGWOWGZ-XHNCKOQMSA-N Asn-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N)C(=O)O FUHFYEKSGWOWGZ-XHNCKOQMSA-N 0.000 description 1
- KWQPAXYXVMHJJR-AVGNSLFASA-N Asn-Gln-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KWQPAXYXVMHJJR-AVGNSLFASA-N 0.000 description 1
- SRUUBQBAVNQZGJ-LAEOZQHASA-N Asn-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N SRUUBQBAVNQZGJ-LAEOZQHASA-N 0.000 description 1
- ULRPXVNMIIYDDJ-ACZMJKKPSA-N Asn-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N ULRPXVNMIIYDDJ-ACZMJKKPSA-N 0.000 description 1
- SNAKIVFVLVUCKB-UHFFFAOYSA-N Asn-Glu-Ala-Lys Natural products NCCCCC(C(O)=O)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(N)CC(N)=O SNAKIVFVLVUCKB-UHFFFAOYSA-N 0.000 description 1
- COUZKSSMBFADSB-AVGNSLFASA-N Asn-Glu-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)N)N COUZKSSMBFADSB-AVGNSLFASA-N 0.000 description 1
- UBKOVSLDWIHYSY-ACZMJKKPSA-N Asn-Glu-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O UBKOVSLDWIHYSY-ACZMJKKPSA-N 0.000 description 1
- DMLSCRJBWUEALP-LAEOZQHASA-N Asn-Glu-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O DMLSCRJBWUEALP-LAEOZQHASA-N 0.000 description 1
- DDPXDCKYWDGZAL-BQBZGAKWSA-N Asn-Gly-Arg Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N DDPXDCKYWDGZAL-BQBZGAKWSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- IICZCLFBILYRCU-WHFBIAKZSA-N Asn-Gly-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IICZCLFBILYRCU-WHFBIAKZSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- OPEPUCYIGFEGSW-WDSKDSINSA-N Asn-Gly-Glu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OPEPUCYIGFEGSW-WDSKDSINSA-N 0.000 description 1
- PBSQFBAJKPLRJY-BYULHYEWSA-N Asn-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N PBSQFBAJKPLRJY-BYULHYEWSA-N 0.000 description 1
- JQSWHKKUZMTOIH-QWRGUYRKSA-N Asn-Gly-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC(=O)N)N JQSWHKKUZMTOIH-QWRGUYRKSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- OOWSBIOUKIUWLO-RCOVLWMOSA-N Asn-Gly-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O OOWSBIOUKIUWLO-RCOVLWMOSA-N 0.000 description 1
- MOHUTCNYQLMARY-GUBZILKMSA-N Asn-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MOHUTCNYQLMARY-GUBZILKMSA-N 0.000 description 1
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 1
- UYXXMIZGHYKYAT-NHCYSSNCSA-N Asn-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)N)N UYXXMIZGHYKYAT-NHCYSSNCSA-N 0.000 description 1
- XVBDDUPJVQXDSI-PEFMBERDSA-N Asn-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N XVBDDUPJVQXDSI-PEFMBERDSA-N 0.000 description 1
- NVWJMQNYLYWVNQ-BYULHYEWSA-N Asn-Ile-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O NVWJMQNYLYWVNQ-BYULHYEWSA-N 0.000 description 1
- KMCRKVOLRCOMBG-DJFWLOJKSA-N Asn-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N KMCRKVOLRCOMBG-DJFWLOJKSA-N 0.000 description 1
- ACKNRKFVYUVWAC-ZPFDUUQYSA-N Asn-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ACKNRKFVYUVWAC-ZPFDUUQYSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 1
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 1
- BXUHCIXDSWRSBS-CIUDSAMLSA-N Asn-Leu-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O BXUHCIXDSWRSBS-CIUDSAMLSA-N 0.000 description 1
- BZWRLDPIWKOVKB-ZPFDUUQYSA-N Asn-Leu-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BZWRLDPIWKOVKB-ZPFDUUQYSA-N 0.000 description 1
- GLWFAWNYGWBMOC-SRVKXCTJSA-N Asn-Leu-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GLWFAWNYGWBMOC-SRVKXCTJSA-N 0.000 description 1
- UBGGJTMETLEXJD-DCAQKATOSA-N Asn-Leu-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O UBGGJTMETLEXJD-DCAQKATOSA-N 0.000 description 1
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 1
- NUCUBYIUPVYGPP-XIRDDKMYSA-N Asn-Leu-Trp Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)CC(N)=O)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O NUCUBYIUPVYGPP-XIRDDKMYSA-N 0.000 description 1
- TZFQICWZWFNIKU-KKUMJFAQSA-N Asn-Leu-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 TZFQICWZWFNIKU-KKUMJFAQSA-N 0.000 description 1
- RCFGLXMZDYNRSC-CIUDSAMLSA-N Asn-Lys-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O RCFGLXMZDYNRSC-CIUDSAMLSA-N 0.000 description 1
- ALHMNHZJBYBYHS-DCAQKATOSA-N Asn-Lys-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ALHMNHZJBYBYHS-DCAQKATOSA-N 0.000 description 1
- FTSAJSADJCMDHH-CIUDSAMLSA-N Asn-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)N)N FTSAJSADJCMDHH-CIUDSAMLSA-N 0.000 description 1
- RZNAMKZJPBQWDJ-SRVKXCTJSA-N Asn-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N RZNAMKZJPBQWDJ-SRVKXCTJSA-N 0.000 description 1
- JWKDQOORUCYUIW-ZPFDUUQYSA-N Asn-Lys-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JWKDQOORUCYUIW-ZPFDUUQYSA-N 0.000 description 1
- ZYPWIUFLYMQZBS-SRVKXCTJSA-N Asn-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N ZYPWIUFLYMQZBS-SRVKXCTJSA-N 0.000 description 1
- GIQCDTKOIPUDSG-GARJFASQSA-N Asn-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)N)N)C(=O)O GIQCDTKOIPUDSG-GARJFASQSA-N 0.000 description 1
- COWITDLVHMZSIW-CIUDSAMLSA-N Asn-Lys-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O COWITDLVHMZSIW-CIUDSAMLSA-N 0.000 description 1
- NTWOPSIUJBMNRI-KKUMJFAQSA-N Asn-Lys-Tyr Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTWOPSIUJBMNRI-KKUMJFAQSA-N 0.000 description 1
- NKFGQWVYETUWGU-UHFFFAOYSA-N Asn-Met-Asn-His Chemical compound NC(=O)CC(N)C(=O)NC(CCSC)C(=O)NC(CC(N)=O)C(=O)NC(C(O)=O)CC1=CN=CN1 NKFGQWVYETUWGU-UHFFFAOYSA-N 0.000 description 1
- MYVBTYXSWILFCG-BQBZGAKWSA-N Asn-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)N)N MYVBTYXSWILFCG-BQBZGAKWSA-N 0.000 description 1
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 1
- ZVUMKOMKQCANOM-AVGNSLFASA-N Asn-Phe-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZVUMKOMKQCANOM-AVGNSLFASA-N 0.000 description 1
- HZZIFFOVHLWGCS-KKUMJFAQSA-N Asn-Phe-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O HZZIFFOVHLWGCS-KKUMJFAQSA-N 0.000 description 1
- RVHGJNGNKGDCPX-KKUMJFAQSA-N Asn-Phe-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)N)N RVHGJNGNKGDCPX-KKUMJFAQSA-N 0.000 description 1
- FTNRWCPWDWRPAV-BZSNNMDCSA-N Asn-Phe-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(N)=O)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FTNRWCPWDWRPAV-BZSNNMDCSA-N 0.000 description 1
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 1
- PLTGTJAZQRGMPP-FXQIFTODSA-N Asn-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC(N)=O PLTGTJAZQRGMPP-FXQIFTODSA-N 0.000 description 1
- UWFOMGUWGPRVBW-GUBZILKMSA-N Asn-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)N)N UWFOMGUWGPRVBW-GUBZILKMSA-N 0.000 description 1
- SZNGQSBRHFMZLT-IHRRRGAJSA-N Asn-Pro-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SZNGQSBRHFMZLT-IHRRRGAJSA-N 0.000 description 1
- GMUOCGCDOYYWPD-FXQIFTODSA-N Asn-Pro-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O GMUOCGCDOYYWPD-FXQIFTODSA-N 0.000 description 1
- IDUUACUJKUXKKD-VEVYYDQMSA-N Asn-Pro-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O IDUUACUJKUXKKD-VEVYYDQMSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- GZXOUBTUAUAVHD-ACZMJKKPSA-N Asn-Ser-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O GZXOUBTUAUAVHD-ACZMJKKPSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 1
- NCXTYSVDWLAQGZ-ZKWXMUAHSA-N Asn-Ser-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC(N)=O NCXTYSVDWLAQGZ-ZKWXMUAHSA-N 0.000 description 1
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 1
- JXMREEPBRANWBY-VEVYYDQMSA-N Asn-Thr-Arg Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JXMREEPBRANWBY-VEVYYDQMSA-N 0.000 description 1
- QYRMBFWDSFGSFC-OLHMAJIHSA-N Asn-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QYRMBFWDSFGSFC-OLHMAJIHSA-N 0.000 description 1
- HPASIOLTWSNMFB-OLHMAJIHSA-N Asn-Thr-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O HPASIOLTWSNMFB-OLHMAJIHSA-N 0.000 description 1
- QUMKPKWYDVMGNT-NUMRIWBASA-N Asn-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O QUMKPKWYDVMGNT-NUMRIWBASA-N 0.000 description 1
- ZUFPUBYQYWCMDB-NUMRIWBASA-N Asn-Thr-Glu Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ZUFPUBYQYWCMDB-NUMRIWBASA-N 0.000 description 1
- HCZQKHSRYHCPSD-IUKAMOBKSA-N Asn-Thr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HCZQKHSRYHCPSD-IUKAMOBKSA-N 0.000 description 1
- JBDLMLZNDRLDIX-HJGDQZAQSA-N Asn-Thr-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O JBDLMLZNDRLDIX-HJGDQZAQSA-N 0.000 description 1
- PIABYSIYPGLLDQ-XVSYOHENSA-N Asn-Thr-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PIABYSIYPGLLDQ-XVSYOHENSA-N 0.000 description 1
- XCBKBPRFACFFOO-AQZXSJQPSA-N Asn-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)N)N)O XCBKBPRFACFFOO-AQZXSJQPSA-N 0.000 description 1
- KZYSHAMXEBPJBD-JRQIVUDYSA-N Asn-Thr-Tyr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KZYSHAMXEBPJBD-JRQIVUDYSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- ANRZCQXIXGDXLR-CWRNSKLLSA-N Asn-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)N)N)C(=O)O ANRZCQXIXGDXLR-CWRNSKLLSA-N 0.000 description 1
- RTFXPCYMDYBZNQ-SRVKXCTJSA-N Asn-Tyr-Asn Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O RTFXPCYMDYBZNQ-SRVKXCTJSA-N 0.000 description 1
- BEHQTVDBCLSCBY-CFMVVWHZSA-N Asn-Tyr-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BEHQTVDBCLSCBY-CFMVVWHZSA-N 0.000 description 1
- QUCCLIXMVPIVOB-BZSNNMDCSA-N Asn-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC(=O)N)N QUCCLIXMVPIVOB-BZSNNMDCSA-N 0.000 description 1
- XEGZSHSPQNDNRH-JRQIVUDYSA-N Asn-Tyr-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XEGZSHSPQNDNRH-JRQIVUDYSA-N 0.000 description 1
- MJIJBEYEHBKTIM-BYULHYEWSA-N Asn-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N MJIJBEYEHBKTIM-BYULHYEWSA-N 0.000 description 1
- JZLFYAAGGYMRIK-BYULHYEWSA-N Asn-Val-Asp Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O JZLFYAAGGYMRIK-BYULHYEWSA-N 0.000 description 1
- LTDGPJKGJDIBQD-LAEOZQHASA-N Asn-Val-Gln Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O LTDGPJKGJDIBQD-LAEOZQHASA-N 0.000 description 1
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 1
- PBVLJOIPOGUQQP-CIUDSAMLSA-N Asp-Ala-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O PBVLJOIPOGUQQP-CIUDSAMLSA-N 0.000 description 1
- KVMPVNGOKHTUHZ-GCJQMDKQSA-N Asp-Ala-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KVMPVNGOKHTUHZ-GCJQMDKQSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- GVPSCJQLUGIKAM-GUBZILKMSA-N Asp-Arg-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GVPSCJQLUGIKAM-GUBZILKMSA-N 0.000 description 1
- ZLGKHJHFYSRUBH-FXQIFTODSA-N Asp-Arg-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLGKHJHFYSRUBH-FXQIFTODSA-N 0.000 description 1
- ICAYWNTWHRRAQP-FXQIFTODSA-N Asp-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N ICAYWNTWHRRAQP-FXQIFTODSA-N 0.000 description 1
- WSOKZUVWBXVJHX-CIUDSAMLSA-N Asp-Arg-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O WSOKZUVWBXVJHX-CIUDSAMLSA-N 0.000 description 1
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 1
- SDHFVYLZFBDSQT-DCAQKATOSA-N Asp-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N SDHFVYLZFBDSQT-DCAQKATOSA-N 0.000 description 1
- MRQQMVZUHXUPEV-IHRRRGAJSA-N Asp-Arg-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MRQQMVZUHXUPEV-IHRRRGAJSA-N 0.000 description 1
- FAEIQWHBRBWUBN-FXQIFTODSA-N Asp-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N)CN=C(N)N FAEIQWHBRBWUBN-FXQIFTODSA-N 0.000 description 1
- UQBGYPFHWFZMCD-ZLUOBGJFSA-N Asp-Asn-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O UQBGYPFHWFZMCD-ZLUOBGJFSA-N 0.000 description 1
- QRULNKJGYQQZMW-ZLUOBGJFSA-N Asp-Asn-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QRULNKJGYQQZMW-ZLUOBGJFSA-N 0.000 description 1
- VBVKSAFJPVXMFJ-CIUDSAMLSA-N Asp-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N VBVKSAFJPVXMFJ-CIUDSAMLSA-N 0.000 description 1
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 1
- JDHOJQJMWBKHDB-CIUDSAMLSA-N Asp-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N JDHOJQJMWBKHDB-CIUDSAMLSA-N 0.000 description 1
- WKGJGVGTEZGFSW-FXQIFTODSA-N Asp-Asn-Met Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O WKGJGVGTEZGFSW-FXQIFTODSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- RDRMWJBLOSRRAW-BYULHYEWSA-N Asp-Asn-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O RDRMWJBLOSRRAW-BYULHYEWSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- FANQWNCPNFEPGZ-WHFBIAKZSA-N Asp-Asp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FANQWNCPNFEPGZ-WHFBIAKZSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- MJKBOVWWADWLHV-ZLUOBGJFSA-N Asp-Cys-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)C(=O)O MJKBOVWWADWLHV-ZLUOBGJFSA-N 0.000 description 1
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 1
- QQXOYLWJQUPXJU-WHFBIAKZSA-N Asp-Cys-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O QQXOYLWJQUPXJU-WHFBIAKZSA-N 0.000 description 1
- FTNVLGCFIJEMQT-CIUDSAMLSA-N Asp-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N FTNVLGCFIJEMQT-CIUDSAMLSA-N 0.000 description 1
- WEDGJJRCJNHYSF-SRVKXCTJSA-N Asp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N WEDGJJRCJNHYSF-SRVKXCTJSA-N 0.000 description 1
- NURJSGZGBVJFAD-ZLUOBGJFSA-N Asp-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O NURJSGZGBVJFAD-ZLUOBGJFSA-N 0.000 description 1
- DZQKLNLLWFQONU-LKXGYXEUSA-N Asp-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N)O DZQKLNLLWFQONU-LKXGYXEUSA-N 0.000 description 1
- NZJDBCYBYCUEDC-UBHSHLNASA-N Asp-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N NZJDBCYBYCUEDC-UBHSHLNASA-N 0.000 description 1
- SMZCLQGDQMGESY-ACZMJKKPSA-N Asp-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)O)N SMZCLQGDQMGESY-ACZMJKKPSA-N 0.000 description 1
- SPKRHJOVRVDJGG-CIUDSAMLSA-N Asp-Gln-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N SPKRHJOVRVDJGG-CIUDSAMLSA-N 0.000 description 1
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 1
- XJQRWGXKUSDEFI-ACZMJKKPSA-N Asp-Glu-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O XJQRWGXKUSDEFI-ACZMJKKPSA-N 0.000 description 1
- KHBLRHKVXICFMY-GUBZILKMSA-N Asp-Glu-Lys Chemical compound N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O KHBLRHKVXICFMY-GUBZILKMSA-N 0.000 description 1
- OVPHVTCDVYYTHN-AVGNSLFASA-N Asp-Glu-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OVPHVTCDVYYTHN-AVGNSLFASA-N 0.000 description 1
- DGKCOYGQLNWNCJ-ACZMJKKPSA-N Asp-Glu-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O DGKCOYGQLNWNCJ-ACZMJKKPSA-N 0.000 description 1
- YDJVIBMKAMQPPP-LAEOZQHASA-N Asp-Glu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O YDJVIBMKAMQPPP-LAEOZQHASA-N 0.000 description 1
- JUWZKMBALYLZCK-WHFBIAKZSA-N Asp-Gly-Asn Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O JUWZKMBALYLZCK-WHFBIAKZSA-N 0.000 description 1
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 1
- LDGUZSIPGSPBJP-XVYDVKMFSA-N Asp-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N LDGUZSIPGSPBJP-XVYDVKMFSA-N 0.000 description 1
- OGTCOKZFOJIZFG-CIUDSAMLSA-N Asp-His-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O OGTCOKZFOJIZFG-CIUDSAMLSA-N 0.000 description 1
- QHHVSXGWLYEAGX-GUBZILKMSA-N Asp-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QHHVSXGWLYEAGX-GUBZILKMSA-N 0.000 description 1
- TVIZQBFURPLQDV-DJFWLOJKSA-N Asp-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC(=O)O)N TVIZQBFURPLQDV-DJFWLOJKSA-N 0.000 description 1
- SWTQDYFZVOJVLL-KKUMJFAQSA-N Asp-His-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)O)N)O SWTQDYFZVOJVLL-KKUMJFAQSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- CYCKJEFVFNRWEZ-UGYAYLCHSA-N Asp-Ile-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O CYCKJEFVFNRWEZ-UGYAYLCHSA-N 0.000 description 1
- QNFRBNZGVVKBNJ-PEFMBERDSA-N Asp-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)O)N QNFRBNZGVVKBNJ-PEFMBERDSA-N 0.000 description 1
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 1
- YFSLJHLQOALGSY-ZPFDUUQYSA-N Asp-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N YFSLJHLQOALGSY-ZPFDUUQYSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- RTXQQDVBACBSCW-CFMVVWHZSA-N Asp-Ile-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RTXQQDVBACBSCW-CFMVVWHZSA-N 0.000 description 1
- KYQNAIMCTRZLNP-QSFUFRPTSA-N Asp-Ile-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O KYQNAIMCTRZLNP-QSFUFRPTSA-N 0.000 description 1
- CLUMZOKVGUWUFD-CIUDSAMLSA-N Asp-Leu-Asn Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O CLUMZOKVGUWUFD-CIUDSAMLSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- CJUKAWUWBZCTDQ-SRVKXCTJSA-N Asp-Leu-Lys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O CJUKAWUWBZCTDQ-SRVKXCTJSA-N 0.000 description 1
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 1
- MYOHQBFRJQFIDZ-KKUMJFAQSA-N Asp-Leu-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MYOHQBFRJQFIDZ-KKUMJFAQSA-N 0.000 description 1
- LIVXPXUVXFRWNY-CIUDSAMLSA-N Asp-Lys-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O LIVXPXUVXFRWNY-CIUDSAMLSA-N 0.000 description 1
- UZFHNLYQWMGUHU-DCAQKATOSA-N Asp-Lys-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UZFHNLYQWMGUHU-DCAQKATOSA-N 0.000 description 1
- CTWCFPWFIGRAEP-CIUDSAMLSA-N Asp-Lys-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O CTWCFPWFIGRAEP-CIUDSAMLSA-N 0.000 description 1
- QNIACYURSSCLRP-GUBZILKMSA-N Asp-Lys-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O QNIACYURSSCLRP-GUBZILKMSA-N 0.000 description 1
- VSMYBNPOHYAXSD-GUBZILKMSA-N Asp-Lys-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O VSMYBNPOHYAXSD-GUBZILKMSA-N 0.000 description 1
- AKKUDRZKFZWPBH-SRVKXCTJSA-N Asp-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N AKKUDRZKFZWPBH-SRVKXCTJSA-N 0.000 description 1
- HJCGDIGVVWETRO-ZPFDUUQYSA-N Asp-Lys-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC(O)=O)C(O)=O HJCGDIGVVWETRO-ZPFDUUQYSA-N 0.000 description 1
- NVFSJIXJZCDICF-SRVKXCTJSA-N Asp-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(=O)O)N NVFSJIXJZCDICF-SRVKXCTJSA-N 0.000 description 1
- FQHBAQLBIXLWAG-DCAQKATOSA-N Asp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N FQHBAQLBIXLWAG-DCAQKATOSA-N 0.000 description 1
- DONWIPDSZZJHHK-HJGDQZAQSA-N Asp-Lys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC(=O)O)N)O DONWIPDSZZJHHK-HJGDQZAQSA-N 0.000 description 1
- RXBGWGRSWXOBGK-KKUMJFAQSA-N Asp-Lys-Tyr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O RXBGWGRSWXOBGK-KKUMJFAQSA-N 0.000 description 1
- DYDKXJWQCIVTMR-WDSKDSINSA-N Asp-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC(O)=O DYDKXJWQCIVTMR-WDSKDSINSA-N 0.000 description 1
- SARSTIZOZFBDOM-FXQIFTODSA-N Asp-Met-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O SARSTIZOZFBDOM-FXQIFTODSA-N 0.000 description 1
- WWOYXVBGHAHQBG-FXQIFTODSA-N Asp-Met-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O WWOYXVBGHAHQBG-FXQIFTODSA-N 0.000 description 1
- JXGJJQJHXHXJQF-CIUDSAMLSA-N Asp-Met-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O JXGJJQJHXHXJQF-CIUDSAMLSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- IOXWDLNHXZOXQP-FXQIFTODSA-N Asp-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N IOXWDLNHXZOXQP-FXQIFTODSA-N 0.000 description 1
- DJCAHYVLMSRBFR-QXEWZRGKSA-N Asp-Met-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC(O)=O DJCAHYVLMSRBFR-QXEWZRGKSA-N 0.000 description 1
- GYWQGGUCMDCUJE-DLOVCJGASA-N Asp-Phe-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O GYWQGGUCMDCUJE-DLOVCJGASA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- UCHSVZYJKJLPHF-BZSNNMDCSA-N Asp-Phe-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O UCHSVZYJKJLPHF-BZSNNMDCSA-N 0.000 description 1
- XOGKTPDSLHEBEA-UHFFFAOYSA-N Asp-Phe-Pro-His Chemical compound C1CCC(C(=O)NC(CC=2NC=NC=2)C(O)=O)N1C(=O)C(NC(=O)C(CC(O)=O)N)CC1=CC=CC=C1 XOGKTPDSLHEBEA-UHFFFAOYSA-N 0.000 description 1
- GPPIDDWYKJPRES-YDHLFZDLSA-N Asp-Phe-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O GPPIDDWYKJPRES-YDHLFZDLSA-N 0.000 description 1
- KESWRFKUZRUTAH-FXQIFTODSA-N Asp-Pro-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O KESWRFKUZRUTAH-FXQIFTODSA-N 0.000 description 1
- HJZLUGQGJWXJCJ-CIUDSAMLSA-N Asp-Pro-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O HJZLUGQGJWXJCJ-CIUDSAMLSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- YFGUZQQCSDZRBN-DCAQKATOSA-N Asp-Pro-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O YFGUZQQCSDZRBN-DCAQKATOSA-N 0.000 description 1
- UAXIKORUDGGIGA-DCAQKATOSA-N Asp-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CCCCN)C(=O)O UAXIKORUDGGIGA-DCAQKATOSA-N 0.000 description 1
- LGGHQRZIJSYRHA-GUBZILKMSA-N Asp-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N LGGHQRZIJSYRHA-GUBZILKMSA-N 0.000 description 1
- BKOIIURTQAJHAT-GUBZILKMSA-N Asp-Pro-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 BKOIIURTQAJHAT-GUBZILKMSA-N 0.000 description 1
- YXALGQBWVHQVLC-UHFFFAOYSA-N Asp-Pro-Ser-Leu-Lys Natural products CC(C)CC(NC(=O)C(CO)NC(=O)C1CCCN1C(=O)C(N)CC(=O)O)C(=O)NC(CCCCN)C(=O)O YXALGQBWVHQVLC-UHFFFAOYSA-N 0.000 description 1
- FOXXZZGDIAQPQI-XKNYDFJKSA-N Asp-Pro-Ser-Ser Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FOXXZZGDIAQPQI-XKNYDFJKSA-N 0.000 description 1
- ZBYLEBZCVKLPCY-FXQIFTODSA-N Asp-Ser-Arg Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZBYLEBZCVKLPCY-FXQIFTODSA-N 0.000 description 1
- XXAMCEGRCZQGEM-ZLUOBGJFSA-N Asp-Ser-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O XXAMCEGRCZQGEM-ZLUOBGJFSA-N 0.000 description 1
- FIAKNCXQFFKSSI-ZLUOBGJFSA-N Asp-Ser-Cys Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O FIAKNCXQFFKSSI-ZLUOBGJFSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- MJJIHRWNWSQTOI-VEVYYDQMSA-N Asp-Thr-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MJJIHRWNWSQTOI-VEVYYDQMSA-N 0.000 description 1
- IWLZBRTUIVXZJD-OLHMAJIHSA-N Asp-Thr-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O IWLZBRTUIVXZJD-OLHMAJIHSA-N 0.000 description 1
- QOCFFCUFZGDHTP-NUMRIWBASA-N Asp-Thr-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOCFFCUFZGDHTP-NUMRIWBASA-N 0.000 description 1
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 1
- KBJVTFWQWXCYCQ-IUKAMOBKSA-N Asp-Thr-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KBJVTFWQWXCYCQ-IUKAMOBKSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- GCACQYDBDHRVGE-LKXGYXEUSA-N Asp-Thr-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC(O)=O GCACQYDBDHRVGE-LKXGYXEUSA-N 0.000 description 1
- RSMZEHCMIOKNMW-GSSVUCPTSA-N Asp-Thr-Thr Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RSMZEHCMIOKNMW-GSSVUCPTSA-N 0.000 description 1
- NVXLFIPTHPKSKL-UBHSHLNASA-N Asp-Trp-Asn Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(O)=O)N)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 NVXLFIPTHPKSKL-UBHSHLNASA-N 0.000 description 1
- DKQCWCQRAMAFLN-UBHSHLNASA-N Asp-Trp-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(O)=O)C(O)=O DKQCWCQRAMAFLN-UBHSHLNASA-N 0.000 description 1
- LTARLVHGOGBRHN-AAEUAGOBSA-N Asp-Trp-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)NCC(O)=O LTARLVHGOGBRHN-AAEUAGOBSA-N 0.000 description 1
- ZVYYMCXVPZEAPU-CWRNSKLLSA-N Asp-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZVYYMCXVPZEAPU-CWRNSKLLSA-N 0.000 description 1
- PLNJUJGNLDSFOP-UWJYBYFXSA-N Asp-Tyr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O PLNJUJGNLDSFOP-UWJYBYFXSA-N 0.000 description 1
- KNDCWFXCFKSEBM-AVGNSLFASA-N Asp-Tyr-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KNDCWFXCFKSEBM-AVGNSLFASA-N 0.000 description 1
- ZQFZEBRNAMXXJV-KKUMJFAQSA-N Asp-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O ZQFZEBRNAMXXJV-KKUMJFAQSA-N 0.000 description 1
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 1
- WOKXEQLPBLLWHC-IHRRRGAJSA-N Asp-Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 WOKXEQLPBLLWHC-IHRRRGAJSA-N 0.000 description 1
- CZIVKMOEXPILDK-SRVKXCTJSA-N Asp-Tyr-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O CZIVKMOEXPILDK-SRVKXCTJSA-N 0.000 description 1
- PLOKOIJSGCISHE-BYULHYEWSA-N Asp-Val-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLOKOIJSGCISHE-BYULHYEWSA-N 0.000 description 1
- WAEDSQFVZJUHLI-BYULHYEWSA-N Asp-Val-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O WAEDSQFVZJUHLI-BYULHYEWSA-N 0.000 description 1
- MFDPBZAFCRKYEY-LAEOZQHASA-N Asp-Val-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MFDPBZAFCRKYEY-LAEOZQHASA-N 0.000 description 1
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 1
- GIKOVDMXBAFXDF-NHCYSSNCSA-N Asp-Val-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O GIKOVDMXBAFXDF-NHCYSSNCSA-N 0.000 description 1
- QOJJMJKTMKNFEF-ZKWXMUAHSA-N Asp-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O QOJJMJKTMKNFEF-ZKWXMUAHSA-N 0.000 description 1
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 1
- GYNUXDMCDILYIQ-QRTARXTBSA-N Asp-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC(=O)O)N GYNUXDMCDILYIQ-QRTARXTBSA-N 0.000 description 1
- JGLWFWXGOINXEA-YDHLFZDLSA-N Asp-Val-Tyr Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JGLWFWXGOINXEA-YDHLFZDLSA-N 0.000 description 1
- 241000020089 Atacta Species 0.000 description 1
- 239000005711 Benzoic acid Substances 0.000 description 1
- 102100021277 Beta-secretase 2 Human genes 0.000 description 1
- 101100512078 Caenorhabditis elegans lys-1 gene Proteins 0.000 description 1
- 102000014914 Carrier Proteins Human genes 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 241000282693 Cercopithecidae Species 0.000 description 1
- 101100002679 Chlorobium chlorochromatii (strain CaD3) aroE gene Proteins 0.000 description 1
- 101100350092 Chlorobium chlorochromatii (strain CaD3) obg gene Proteins 0.000 description 1
- 101100303857 Chlorobium chlorochromatii (strain CaD3) rpmB gene Proteins 0.000 description 1
- 108090000317 Chymotrypsin Proteins 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 235000019750 Crude protein Nutrition 0.000 description 1
- AMRLSQGGERHDHJ-FXQIFTODSA-N Cys-Ala-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMRLSQGGERHDHJ-FXQIFTODSA-N 0.000 description 1
- PLBJMUUEGBBHRH-ZLUOBGJFSA-N Cys-Ala-Asn Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O PLBJMUUEGBBHRH-ZLUOBGJFSA-N 0.000 description 1
- QFMCHXSGIZPBKG-ZLUOBGJFSA-N Cys-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N QFMCHXSGIZPBKG-ZLUOBGJFSA-N 0.000 description 1
- PRXCTTWKGJAPMT-ZLUOBGJFSA-N Cys-Ala-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O PRXCTTWKGJAPMT-ZLUOBGJFSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- GMXSSZUVDNPRMA-FXQIFTODSA-N Cys-Arg-Asp Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GMXSSZUVDNPRMA-FXQIFTODSA-N 0.000 description 1
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 1
- SURTWIXUHQNUGN-GUBZILKMSA-N Cys-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N SURTWIXUHQNUGN-GUBZILKMSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- BYALSSDCQYHKMY-XGEHTFHBSA-N Cys-Arg-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N)O BYALSSDCQYHKMY-XGEHTFHBSA-N 0.000 description 1
- JIVJXVJMOBVCJF-ZLUOBGJFSA-N Cys-Asn-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)N JIVJXVJMOBVCJF-ZLUOBGJFSA-N 0.000 description 1
- UISYPAHPLXGLNH-ACZMJKKPSA-N Cys-Asn-Gln Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O UISYPAHPLXGLNH-ACZMJKKPSA-N 0.000 description 1
- KLLFLHBKSJAUMZ-ACZMJKKPSA-N Cys-Asn-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N KLLFLHBKSJAUMZ-ACZMJKKPSA-N 0.000 description 1
- YRJICXCOIBUCRP-CIUDSAMLSA-N Cys-Asn-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CS)N YRJICXCOIBUCRP-CIUDSAMLSA-N 0.000 description 1
- SFUUYRSAJPWTGO-SRVKXCTJSA-N Cys-Asn-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SFUUYRSAJPWTGO-SRVKXCTJSA-N 0.000 description 1
- VNLYIYOYUNGURO-ZLUOBGJFSA-N Cys-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N VNLYIYOYUNGURO-ZLUOBGJFSA-N 0.000 description 1
- OLIYIKRCOZBFCW-ZLUOBGJFSA-N Cys-Asp-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)C(=O)O OLIYIKRCOZBFCW-ZLUOBGJFSA-N 0.000 description 1
- GSNRZJNHMVMOFV-ACZMJKKPSA-N Cys-Asp-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N GSNRZJNHMVMOFV-ACZMJKKPSA-N 0.000 description 1
- VZKXOWRNJDEGLZ-WHFBIAKZSA-N Cys-Asp-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O VZKXOWRNJDEGLZ-WHFBIAKZSA-N 0.000 description 1
- YZFCGHIBLBDZDA-ZLUOBGJFSA-N Cys-Asp-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YZFCGHIBLBDZDA-ZLUOBGJFSA-N 0.000 description 1
- WKELHWMCIXSVDT-UBHSHLNASA-N Cys-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CS)N WKELHWMCIXSVDT-UBHSHLNASA-N 0.000 description 1
- DVKQPQKQDHHFTE-ZLUOBGJFSA-N Cys-Cys-Asn Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N)C(=O)N DVKQPQKQDHHFTE-ZLUOBGJFSA-N 0.000 description 1
- SMYXEYRYCLIPIL-ZLUOBGJFSA-N Cys-Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CS)C(O)=O SMYXEYRYCLIPIL-ZLUOBGJFSA-N 0.000 description 1
- ZJBWJHQDOIMVLM-WHFBIAKZSA-N Cys-Cys-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZJBWJHQDOIMVLM-WHFBIAKZSA-N 0.000 description 1
- QJUDRFBUWAGUSG-SRVKXCTJSA-N Cys-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CS)N QJUDRFBUWAGUSG-SRVKXCTJSA-N 0.000 description 1
- BVFQOPGFOQVZTE-ACZMJKKPSA-N Cys-Gln-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O BVFQOPGFOQVZTE-ACZMJKKPSA-N 0.000 description 1
- ZVNFONSZVUBRAV-CIUDSAMLSA-N Cys-Gln-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)CN=C(N)N ZVNFONSZVUBRAV-CIUDSAMLSA-N 0.000 description 1
- YRKJQKATZOTUEN-ACZMJKKPSA-N Cys-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N YRKJQKATZOTUEN-ACZMJKKPSA-N 0.000 description 1
- MWZSCEAYQCMROW-GUBZILKMSA-N Cys-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N MWZSCEAYQCMROW-GUBZILKMSA-N 0.000 description 1
- YZKOXEJTLWZOQL-GUBZILKMSA-N Cys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N YZKOXEJTLWZOQL-GUBZILKMSA-N 0.000 description 1
- PORWNQWEEIOIRH-XHNCKOQMSA-N Cys-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)C(=O)O PORWNQWEEIOIRH-XHNCKOQMSA-N 0.000 description 1
- HHABWQIFXZPZCK-ACZMJKKPSA-N Cys-Gln-Ser Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N HHABWQIFXZPZCK-ACZMJKKPSA-N 0.000 description 1
- UCMIKRLLIOVDRJ-XKBZYTNZSA-N Cys-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CS)N)O UCMIKRLLIOVDRJ-XKBZYTNZSA-N 0.000 description 1
- SFRQEQGPRTVDPO-NRPADANISA-N Cys-Gln-Val Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O SFRQEQGPRTVDPO-NRPADANISA-N 0.000 description 1
- YUZPQIQWXLRFBW-ACZMJKKPSA-N Cys-Glu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O YUZPQIQWXLRFBW-ACZMJKKPSA-N 0.000 description 1
- FIADUEYFRSCCIK-CIUDSAMLSA-N Cys-Glu-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIADUEYFRSCCIK-CIUDSAMLSA-N 0.000 description 1
- RWGDABDXVXRLLH-ACZMJKKPSA-N Cys-Glu-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CS)N RWGDABDXVXRLLH-ACZMJKKPSA-N 0.000 description 1
- BCSYBBMFGLHCOA-ACZMJKKPSA-N Cys-Glu-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(O)=O BCSYBBMFGLHCOA-ACZMJKKPSA-N 0.000 description 1
- UDPSLLFHOLGXBY-FXQIFTODSA-N Cys-Glu-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O UDPSLLFHOLGXBY-FXQIFTODSA-N 0.000 description 1
- SBORMUFGKSCGEN-XHNCKOQMSA-N Cys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)C(=O)O SBORMUFGKSCGEN-XHNCKOQMSA-N 0.000 description 1
- BDWIZLQVVWQMTB-XKBZYTNZSA-N Cys-Glu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CS)N)O BDWIZLQVVWQMTB-XKBZYTNZSA-N 0.000 description 1
- AOZBJZBKFHOYHL-AVGNSLFASA-N Cys-Glu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O AOZBJZBKFHOYHL-AVGNSLFASA-N 0.000 description 1
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 1
- GUKYYUFHWYRMEU-WHFBIAKZSA-N Cys-Gly-Asp Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O GUKYYUFHWYRMEU-WHFBIAKZSA-N 0.000 description 1
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 1
- LBOLGUYQEPZSKM-YUMQZZPRSA-N Cys-Gly-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N LBOLGUYQEPZSKM-YUMQZZPRSA-N 0.000 description 1
- XTHUKRLJRUVVBF-WHFBIAKZSA-N Cys-Gly-Ser Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O XTHUKRLJRUVVBF-WHFBIAKZSA-N 0.000 description 1
- XVLMKWWVBNESPX-XVYDVKMFSA-N Cys-His-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N XVLMKWWVBNESPX-XVYDVKMFSA-N 0.000 description 1
- ANPADMNVVOOYKW-DCAQKATOSA-N Cys-His-Arg Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ANPADMNVVOOYKW-DCAQKATOSA-N 0.000 description 1
- XELISBQUZZAPQK-CIUDSAMLSA-N Cys-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N XELISBQUZZAPQK-CIUDSAMLSA-N 0.000 description 1
- VTJLJQGUMBWHBP-GUBZILKMSA-N Cys-His-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CS)N VTJLJQGUMBWHBP-GUBZILKMSA-N 0.000 description 1
- KPENUVBHAKRDQR-GUBZILKMSA-N Cys-His-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPENUVBHAKRDQR-GUBZILKMSA-N 0.000 description 1
- SBDVXRYCOIEYNV-YUMQZZPRSA-N Cys-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N SBDVXRYCOIEYNV-YUMQZZPRSA-N 0.000 description 1
- RRJOQIBQVZDVCW-SRVKXCTJSA-N Cys-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CS)N RRJOQIBQVZDVCW-SRVKXCTJSA-N 0.000 description 1
- WTNLLMQAFPOCTJ-GARJFASQSA-N Cys-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N)C(=O)O WTNLLMQAFPOCTJ-GARJFASQSA-N 0.000 description 1
- DIUBVGXMXONJCF-KKUMJFAQSA-N Cys-His-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DIUBVGXMXONJCF-KKUMJFAQSA-N 0.000 description 1
- LYSHSHHDBVKJRN-JBDRJPRFSA-N Cys-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CS)N LYSHSHHDBVKJRN-JBDRJPRFSA-N 0.000 description 1
- CUXIOFHFFXNUGG-HTFCKZLJSA-N Cys-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CS)N CUXIOFHFFXNUGG-HTFCKZLJSA-N 0.000 description 1
- OXFOKRAFNYSREH-BJDJZHNGSA-N Cys-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CS)N OXFOKRAFNYSREH-BJDJZHNGSA-N 0.000 description 1
- KCSDYJSCUWLILX-BJDJZHNGSA-N Cys-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N KCSDYJSCUWLILX-BJDJZHNGSA-N 0.000 description 1
- ODDOYXKAHLKKQY-MMWGEVLESA-N Cys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N ODDOYXKAHLKKQY-MMWGEVLESA-N 0.000 description 1
- MRVSLWQRNWEROS-SVSWQMSJSA-N Cys-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CS)N MRVSLWQRNWEROS-SVSWQMSJSA-N 0.000 description 1
- MTNJRNQDDSWQQA-GQGQLFGLSA-N Cys-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N MTNJRNQDDSWQQA-GQGQLFGLSA-N 0.000 description 1
- KXUKWRVYDYIPSQ-CIUDSAMLSA-N Cys-Leu-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O KXUKWRVYDYIPSQ-CIUDSAMLSA-N 0.000 description 1
- ABLJDBFJPUWQQB-DCAQKATOSA-N Cys-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N ABLJDBFJPUWQQB-DCAQKATOSA-N 0.000 description 1
- BLGNLNRBABWDST-CIUDSAMLSA-N Cys-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BLGNLNRBABWDST-CIUDSAMLSA-N 0.000 description 1
- DIHCYBRLTVEPBW-SRVKXCTJSA-N Cys-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CS)N DIHCYBRLTVEPBW-SRVKXCTJSA-N 0.000 description 1
- UCSXXFRXHGUXCQ-SRVKXCTJSA-N Cys-Leu-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N UCSXXFRXHGUXCQ-SRVKXCTJSA-N 0.000 description 1
- VXLXATVURDNDCG-CIUDSAMLSA-N Cys-Lys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N VXLXATVURDNDCG-CIUDSAMLSA-N 0.000 description 1
- VDUPGIDTWNQAJD-CIUDSAMLSA-N Cys-Lys-Cys Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)CS)C(=O)N[C@@H](CS)C(O)=O VDUPGIDTWNQAJD-CIUDSAMLSA-N 0.000 description 1
- BNCKELUXXUYRNY-GUBZILKMSA-N Cys-Lys-Glu Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N BNCKELUXXUYRNY-GUBZILKMSA-N 0.000 description 1
- CIVXDCMSSFGWAL-YUMQZZPRSA-N Cys-Lys-Gly Chemical compound C(CCN)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N CIVXDCMSSFGWAL-YUMQZZPRSA-N 0.000 description 1
- VOBMMKMWSIVIOA-SRVKXCTJSA-N Cys-Lys-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N VOBMMKMWSIVIOA-SRVKXCTJSA-N 0.000 description 1
- XMVZMBGFIOQONW-GARJFASQSA-N Cys-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N)C(=O)O XMVZMBGFIOQONW-GARJFASQSA-N 0.000 description 1
- WTEJFWOJHCJDML-FXQIFTODSA-N Cys-Met-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CS)C(O)=O WTEJFWOJHCJDML-FXQIFTODSA-N 0.000 description 1
- POSRGGKLRWCUBE-CIUDSAMLSA-N Cys-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CS)N POSRGGKLRWCUBE-CIUDSAMLSA-N 0.000 description 1
- ORYFTECKJZTNQP-DCAQKATOSA-N Cys-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CS)N ORYFTECKJZTNQP-DCAQKATOSA-N 0.000 description 1
- HEPLXMBVMCXTBP-QWRGUYRKSA-N Cys-Phe-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O HEPLXMBVMCXTBP-QWRGUYRKSA-N 0.000 description 1
- MFMDKTLJCUBQIC-MXAVVETBSA-N Cys-Phe-Ile Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MFMDKTLJCUBQIC-MXAVVETBSA-N 0.000 description 1
- CHRCKSPMGYDLIA-SRVKXCTJSA-N Cys-Phe-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O CHRCKSPMGYDLIA-SRVKXCTJSA-N 0.000 description 1
- RAGIABZNLPZBGS-FXQIFTODSA-N Cys-Pro-Cys Chemical compound N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(O)=O RAGIABZNLPZBGS-FXQIFTODSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- NITLUESFANGEIW-BQBZGAKWSA-N Cys-Pro-Gly Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O NITLUESFANGEIW-BQBZGAKWSA-N 0.000 description 1
- DQUWSUWXPWGTQT-DCAQKATOSA-N Cys-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CS DQUWSUWXPWGTQT-DCAQKATOSA-N 0.000 description 1
- HMWBPUDETPKSSS-DCAQKATOSA-N Cys-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CCCCN)C(=O)O HMWBPUDETPKSSS-DCAQKATOSA-N 0.000 description 1
- KSMSFCBQBQPFAD-GUBZILKMSA-N Cys-Pro-Pro Chemical compound SC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 KSMSFCBQBQPFAD-GUBZILKMSA-N 0.000 description 1
- TXGDWPBLUFQODU-XGEHTFHBSA-N Cys-Pro-Thr Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O TXGDWPBLUFQODU-XGEHTFHBSA-N 0.000 description 1
- XBELMDARIGXDKY-GUBZILKMSA-N Cys-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CS)N XBELMDARIGXDKY-GUBZILKMSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- UEHCDNYDBBCQEL-CIUDSAMLSA-N Cys-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N UEHCDNYDBBCQEL-CIUDSAMLSA-N 0.000 description 1
- VCPHQVQGVSKDHY-FXQIFTODSA-N Cys-Ser-Met Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O VCPHQVQGVSKDHY-FXQIFTODSA-N 0.000 description 1
- HJXSYJVCMUOUNY-SRVKXCTJSA-N Cys-Ser-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N HJXSYJVCMUOUNY-SRVKXCTJSA-N 0.000 description 1
- YNJBLTDKTMKEET-ZLUOBGJFSA-N Cys-Ser-Ser Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O YNJBLTDKTMKEET-ZLUOBGJFSA-N 0.000 description 1
- XWTGTTNUCCEFJI-UBHSHLNASA-N Cys-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N XWTGTTNUCCEFJI-UBHSHLNASA-N 0.000 description 1
- IXPSSIBVVKSOIE-SRVKXCTJSA-N Cys-Ser-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N)O IXPSSIBVVKSOIE-SRVKXCTJSA-N 0.000 description 1
- YWEHYKGJWHPGPY-XGEHTFHBSA-N Cys-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CS)N)O YWEHYKGJWHPGPY-XGEHTFHBSA-N 0.000 description 1
- FTTZLFIEUQHLHH-BWBBJGPYSA-N Cys-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N)O FTTZLFIEUQHLHH-BWBBJGPYSA-N 0.000 description 1
- JLZCAZJGWNRXCI-XKBZYTNZSA-N Cys-Thr-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O JLZCAZJGWNRXCI-XKBZYTNZSA-N 0.000 description 1
- JAHCWGSVNZXHRR-SVSWQMSJSA-N Cys-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](CS)N JAHCWGSVNZXHRR-SVSWQMSJSA-N 0.000 description 1
- GFAPBMCRSMSGDZ-XGEHTFHBSA-N Cys-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CS)N)O GFAPBMCRSMSGDZ-XGEHTFHBSA-N 0.000 description 1
- IQXSTXKVEMRMMB-XAVMHZPKSA-N Cys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N)O IQXSTXKVEMRMMB-XAVMHZPKSA-N 0.000 description 1
- FANFRJOFTYCNRG-JYBASQMISA-N Cys-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CS)N)O FANFRJOFTYCNRG-JYBASQMISA-N 0.000 description 1
- XSELZJJGSKZZDO-UBHSHLNASA-N Cys-Trp-Asp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CS)N XSELZJJGSKZZDO-UBHSHLNASA-N 0.000 description 1
- PNEAWXSKCKCHDK-XIRDDKMYSA-N Cys-Trp-His Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CS)N)C(O)=O)C1=CN=CN1 PNEAWXSKCKCHDK-XIRDDKMYSA-N 0.000 description 1
- PXEGEYISOXISDV-XIRDDKMYSA-N Cys-Trp-Lys Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CS)=CNC2=C1 PXEGEYISOXISDV-XIRDDKMYSA-N 0.000 description 1
- FCXJJTRGVAZDER-FXQIFTODSA-N Cys-Val-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O FCXJJTRGVAZDER-FXQIFTODSA-N 0.000 description 1
- MQQLYEHXSBJTRK-FXQIFTODSA-N Cys-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CS)N MQQLYEHXSBJTRK-FXQIFTODSA-N 0.000 description 1
- AZDQAZRURQMSQD-XPUUQOCRSA-N Cys-Val-Gly Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AZDQAZRURQMSQD-XPUUQOCRSA-N 0.000 description 1
- KZZYVYWSXMFYEC-DCAQKATOSA-N Cys-Val-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O KZZYVYWSXMFYEC-DCAQKATOSA-N 0.000 description 1
- ZXGDAZLSOSYSBA-IHRRRGAJSA-N Cys-Val-Phe Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZXGDAZLSOSYSBA-IHRRRGAJSA-N 0.000 description 1
- QQAYIVHVRFJICE-AEJSXWLSSA-N Cys-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CS)N QQAYIVHVRFJICE-AEJSXWLSSA-N 0.000 description 1
- YTMBNLHIDIKJIU-HCXYKTFWSA-N D-Arginyl-L-arginyl-D-glutaminyl-L-phenylalanine Chemical compound NC(=N)NCCC[C@@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](CCC(O)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YTMBNLHIDIKJIU-HCXYKTFWSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 101150074155 DHFR gene Proteins 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 238000000018 DNA microarray Methods 0.000 description 1
- 108010008286 DNA nucleotidylexotransferase Proteins 0.000 description 1
- 239000003298 DNA probe Substances 0.000 description 1
- 102100029764 DNA-directed DNA/RNA polymerase mu Human genes 0.000 description 1
- FEWJPZIEWOKRBE-JCYAYHJZSA-N Dextrotartaric acid Chemical compound OC(=O)[C@H](O)[C@@H](O)C(O)=O FEWJPZIEWOKRBE-JCYAYHJZSA-N 0.000 description 1
- 238000002965 ELISA Methods 0.000 description 1
- 241001646716 Escherichia coli K-12 Species 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108010072062 GEKG peptide Proteins 0.000 description 1
- INKFLNZBTSNFON-CIUDSAMLSA-N Gln-Ala-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O INKFLNZBTSNFON-CIUDSAMLSA-N 0.000 description 1
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 1
- REJJNXODKSHOKA-ACZMJKKPSA-N Gln-Ala-Asp Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N REJJNXODKSHOKA-ACZMJKKPSA-N 0.000 description 1
- DTCCMDYODDPHBG-ACZMJKKPSA-N Gln-Ala-Cys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O DTCCMDYODDPHBG-ACZMJKKPSA-N 0.000 description 1
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 1
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 1
- IGNGBUVODQLMRJ-CIUDSAMLSA-N Gln-Ala-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(O)=O IGNGBUVODQLMRJ-CIUDSAMLSA-N 0.000 description 1
- OVQXQLWWJSNYFV-XEGUGMAKSA-N Gln-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(N)=O)C)C(O)=O)=CNC2=C1 OVQXQLWWJSNYFV-XEGUGMAKSA-N 0.000 description 1
- YNNXQZDEOCYJJL-CIUDSAMLSA-N Gln-Arg-Asp Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N YNNXQZDEOCYJJL-CIUDSAMLSA-N 0.000 description 1
- LTLXPHKSQQILNF-CIUDSAMLSA-N Gln-Arg-Cys Chemical compound C(C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)CN=C(N)N LTLXPHKSQQILNF-CIUDSAMLSA-N 0.000 description 1
- WOACHWLUOFZLGJ-GUBZILKMSA-N Gln-Arg-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O WOACHWLUOFZLGJ-GUBZILKMSA-N 0.000 description 1
- RGRMOYQUIJVQQD-SRVKXCTJSA-N Gln-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)N)N RGRMOYQUIJVQQD-SRVKXCTJSA-N 0.000 description 1
- KJRXLVZYJJLUCV-DCAQKATOSA-N Gln-Arg-Met Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O KJRXLVZYJJLUCV-DCAQKATOSA-N 0.000 description 1
- DTMLKCYOQKZXKZ-HJGDQZAQSA-N Gln-Arg-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DTMLKCYOQKZXKZ-HJGDQZAQSA-N 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- SOBBAYVQSNXYPQ-ACZMJKKPSA-N Gln-Asn-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O SOBBAYVQSNXYPQ-ACZMJKKPSA-N 0.000 description 1
- PHZYLYASFWHLHJ-FXQIFTODSA-N Gln-Asn-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PHZYLYASFWHLHJ-FXQIFTODSA-N 0.000 description 1
- ZPDVKYLJTOFQJV-WDSKDSINSA-N Gln-Asn-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ZPDVKYLJTOFQJV-WDSKDSINSA-N 0.000 description 1
- RMOCFPBLHAOTDU-ACZMJKKPSA-N Gln-Asn-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RMOCFPBLHAOTDU-ACZMJKKPSA-N 0.000 description 1
- DXMPMSWUZVNBSG-QEJZJMRPSA-N Gln-Asn-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N DXMPMSWUZVNBSG-QEJZJMRPSA-N 0.000 description 1
- MGJMFSBEMSNYJL-AVGNSLFASA-N Gln-Asn-Tyr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MGJMFSBEMSNYJL-AVGNSLFASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- CYTSBCIIEHUPDU-ACZMJKKPSA-N Gln-Asp-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O CYTSBCIIEHUPDU-ACZMJKKPSA-N 0.000 description 1
- QYTKAVBFRUGYAU-ACZMJKKPSA-N Gln-Asp-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QYTKAVBFRUGYAU-ACZMJKKPSA-N 0.000 description 1
- KYFSMWLWHYZRNW-ACZMJKKPSA-N Gln-Asp-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N KYFSMWLWHYZRNW-ACZMJKKPSA-N 0.000 description 1
- IKDOHQHEFPPGJG-FXQIFTODSA-N Gln-Asp-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O IKDOHQHEFPPGJG-FXQIFTODSA-N 0.000 description 1
- JKPGHIQCHIIRMS-AVGNSLFASA-N Gln-Asp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)N)N JKPGHIQCHIIRMS-AVGNSLFASA-N 0.000 description 1
- UZMWDBOHAOSCCH-ACZMJKKPSA-N Gln-Cys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O UZMWDBOHAOSCCH-ACZMJKKPSA-N 0.000 description 1
- UVAOVENCIONMJP-GUBZILKMSA-N Gln-Cys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O UVAOVENCIONMJP-GUBZILKMSA-N 0.000 description 1
- LPJVZYMINRLCQA-AVGNSLFASA-N Gln-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N LPJVZYMINRLCQA-AVGNSLFASA-N 0.000 description 1
- COYGBRTZEVWZBW-XKBZYTNZSA-N Gln-Cys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(N)=O COYGBRTZEVWZBW-XKBZYTNZSA-N 0.000 description 1
- PZVJDMJHKUWSIV-AVGNSLFASA-N Gln-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)N)N)O PZVJDMJHKUWSIV-AVGNSLFASA-N 0.000 description 1
- LPYPANUXJGFMGV-FXQIFTODSA-N Gln-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)N)N LPYPANUXJGFMGV-FXQIFTODSA-N 0.000 description 1
- QYKBTDOAMKORGL-FXQIFTODSA-N Gln-Gln-Asp Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N QYKBTDOAMKORGL-FXQIFTODSA-N 0.000 description 1
- RRBLZNIIMHSHQF-FXQIFTODSA-N Gln-Gln-Cys Chemical compound C(CC(=O)N)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N RRBLZNIIMHSHQF-FXQIFTODSA-N 0.000 description 1
- PKVWNYGXMNWJSI-CIUDSAMLSA-N Gln-Gln-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O PKVWNYGXMNWJSI-CIUDSAMLSA-N 0.000 description 1
- XFKUFUJECJUQTQ-CIUDSAMLSA-N Gln-Gln-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XFKUFUJECJUQTQ-CIUDSAMLSA-N 0.000 description 1
- NVEASDQHBRZPSU-BQBZGAKWSA-N Gln-Gln-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O NVEASDQHBRZPSU-BQBZGAKWSA-N 0.000 description 1
- LWDGZZGWDMHBOF-FXQIFTODSA-N Gln-Glu-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O LWDGZZGWDMHBOF-FXQIFTODSA-N 0.000 description 1
- CGVWDTRDPLOMHZ-FXQIFTODSA-N Gln-Glu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CGVWDTRDPLOMHZ-FXQIFTODSA-N 0.000 description 1
- ZQPOVSJFBBETHQ-CIUDSAMLSA-N Gln-Glu-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZQPOVSJFBBETHQ-CIUDSAMLSA-N 0.000 description 1
- SNLOOPZHAQDMJG-CIUDSAMLSA-N Gln-Glu-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SNLOOPZHAQDMJG-CIUDSAMLSA-N 0.000 description 1
- PXAFHUATEHLECW-GUBZILKMSA-N Gln-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCC(=O)N)N PXAFHUATEHLECW-GUBZILKMSA-N 0.000 description 1
- MFJAPSYJQJCQDN-BQBZGAKWSA-N Gln-Gly-Glu Chemical compound NC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O MFJAPSYJQJCQDN-BQBZGAKWSA-N 0.000 description 1
- XSBGUANSZDGULP-IUCAKERBSA-N Gln-Gly-Lys Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O XSBGUANSZDGULP-IUCAKERBSA-N 0.000 description 1
- SMLDOQHTOAAFJQ-WDSKDSINSA-N Gln-Gly-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SMLDOQHTOAAFJQ-WDSKDSINSA-N 0.000 description 1
- ORYMMTRPKVTGSJ-XVKPBYJWSA-N Gln-Gly-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCC(N)=O ORYMMTRPKVTGSJ-XVKPBYJWSA-N 0.000 description 1
- BVELAHPZLYLZDJ-HGNGGELXSA-N Gln-His-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O BVELAHPZLYLZDJ-HGNGGELXSA-N 0.000 description 1
- DQPOBSRQNWOBNA-GUBZILKMSA-N Gln-His-Asn Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O DQPOBSRQNWOBNA-GUBZILKMSA-N 0.000 description 1
- JNEITCMDYWKPIW-GUBZILKMSA-N Gln-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JNEITCMDYWKPIW-GUBZILKMSA-N 0.000 description 1
- IWUFOVSLWADEJC-AVGNSLFASA-N Gln-His-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O IWUFOVSLWADEJC-AVGNSLFASA-N 0.000 description 1
- LKVCNGLNTAPMSZ-JYJNAYRXSA-N Gln-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)N)N LKVCNGLNTAPMSZ-JYJNAYRXSA-N 0.000 description 1
- ICDIMQAMJGDHSE-GUBZILKMSA-N Gln-His-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O ICDIMQAMJGDHSE-GUBZILKMSA-N 0.000 description 1
- XWIBVSAEUCAAKF-GVXVVHGQSA-N Gln-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)N)N XWIBVSAEUCAAKF-GVXVVHGQSA-N 0.000 description 1
- HDUDGCZEOZEFOA-KBIXCLLPSA-N Gln-Ile-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HDUDGCZEOZEFOA-KBIXCLLPSA-N 0.000 description 1
- DAAUVRPSZRDMBV-KBIXCLLPSA-N Gln-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N DAAUVRPSZRDMBV-KBIXCLLPSA-N 0.000 description 1
- FTIJVMLAGRAYMJ-MNXVOIDGSA-N Gln-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(N)=O FTIJVMLAGRAYMJ-MNXVOIDGSA-N 0.000 description 1
- MWERYIXRDZDXOA-QEWYBTABSA-N Gln-Ile-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O MWERYIXRDZDXOA-QEWYBTABSA-N 0.000 description 1
- ZNTDJIMJKNNSLR-RWRJDSDZSA-N Gln-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N ZNTDJIMJKNNSLR-RWRJDSDZSA-N 0.000 description 1
- HYPVLWGNBIYTNA-GUBZILKMSA-N Gln-Leu-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HYPVLWGNBIYTNA-GUBZILKMSA-N 0.000 description 1
- QBLMTCRYYTVUQY-GUBZILKMSA-N Gln-Leu-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O QBLMTCRYYTVUQY-GUBZILKMSA-N 0.000 description 1
- HHQCBFGKQDMWSP-GUBZILKMSA-N Gln-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HHQCBFGKQDMWSP-GUBZILKMSA-N 0.000 description 1
- IULKWYSYZSURJK-AVGNSLFASA-N Gln-Leu-Lys Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O IULKWYSYZSURJK-AVGNSLFASA-N 0.000 description 1
- KHNJVFYHIKLUPD-SRVKXCTJSA-N Gln-Leu-Met Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N KHNJVFYHIKLUPD-SRVKXCTJSA-N 0.000 description 1
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 1
- TWIAMTNJOMRDAK-GUBZILKMSA-N Gln-Lys-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O TWIAMTNJOMRDAK-GUBZILKMSA-N 0.000 description 1
- UWKPRVKWEKEMSY-DCAQKATOSA-N Gln-Lys-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UWKPRVKWEKEMSY-DCAQKATOSA-N 0.000 description 1
- HPCOBEHVEHWREJ-DCAQKATOSA-N Gln-Lys-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O HPCOBEHVEHWREJ-DCAQKATOSA-N 0.000 description 1
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 1
- JRHPEMVLTRADLJ-AVGNSLFASA-N Gln-Lys-Lys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N JRHPEMVLTRADLJ-AVGNSLFASA-N 0.000 description 1
- XZLLTYBONVKGLO-SDDRHHMPSA-N Gln-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N)C(=O)O XZLLTYBONVKGLO-SDDRHHMPSA-N 0.000 description 1
- QKWBEMCLYTYBNI-GVXVVHGQSA-N Gln-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(N)=O QKWBEMCLYTYBNI-GVXVVHGQSA-N 0.000 description 1
- WHVLABLIJYGVEK-QEWYBTABSA-N Gln-Phe-Ile Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WHVLABLIJYGVEK-QEWYBTABSA-N 0.000 description 1
- OZEQPCDLCDRCGY-SOUVJXGZSA-N Gln-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CCC(=O)N)N)C(=O)O OZEQPCDLCDRCGY-SOUVJXGZSA-N 0.000 description 1
- ZVQZXPADLZIQFF-FHWLQOOXSA-N Gln-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 ZVQZXPADLZIQFF-FHWLQOOXSA-N 0.000 description 1
- QFXNFFZTMFHPST-DZKIICNBSA-N Gln-Phe-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCC(=O)N)N QFXNFFZTMFHPST-DZKIICNBSA-N 0.000 description 1
- FNAJNWPDTIXYJN-CIUDSAMLSA-N Gln-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O FNAJNWPDTIXYJN-CIUDSAMLSA-N 0.000 description 1
- PBYFVIQRFLNQCO-GUBZILKMSA-N Gln-Pro-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O PBYFVIQRFLNQCO-GUBZILKMSA-N 0.000 description 1
- OREPWMPAUWIIAM-ZPFDUUQYSA-N Gln-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCC(=O)N)N OREPWMPAUWIIAM-ZPFDUUQYSA-N 0.000 description 1
- MFORDNZDKAVNSR-SRVKXCTJSA-N Gln-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCC(N)=O MFORDNZDKAVNSR-SRVKXCTJSA-N 0.000 description 1
- UWMDGPFFTKDUIY-HJGDQZAQSA-N Gln-Pro-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWMDGPFFTKDUIY-HJGDQZAQSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- UTOQQOMEJDPDMX-ACZMJKKPSA-N Gln-Ser-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O UTOQQOMEJDPDMX-ACZMJKKPSA-N 0.000 description 1
- RWQCWSGOOOEGPB-FXQIFTODSA-N Gln-Ser-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O RWQCWSGOOOEGPB-FXQIFTODSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- KVQOVQVGVKDZNW-GUBZILKMSA-N Gln-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)N)N KVQOVQVGVKDZNW-GUBZILKMSA-N 0.000 description 1
- ZGHMRONFHDVXEF-AVGNSLFASA-N Gln-Ser-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZGHMRONFHDVXEF-AVGNSLFASA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- DYVMTEWCGAVKSE-HJGDQZAQSA-N Gln-Thr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O DYVMTEWCGAVKSE-HJGDQZAQSA-N 0.000 description 1
- XIYWAJQIWLXXAF-XKBZYTNZSA-N Gln-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O XIYWAJQIWLXXAF-XKBZYTNZSA-N 0.000 description 1
- DUGYCMAIAKAQPB-GLLZPBPUSA-N Gln-Thr-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DUGYCMAIAKAQPB-GLLZPBPUSA-N 0.000 description 1
- SYTFJIQPBRJSOK-NKIYYHGXSA-N Gln-Thr-His Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 SYTFJIQPBRJSOK-NKIYYHGXSA-N 0.000 description 1
- VLOLPWWCNKWRNB-LOKLDPHHSA-N Gln-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O VLOLPWWCNKWRNB-LOKLDPHHSA-N 0.000 description 1
- ARYKRXHBIPLULY-XKBZYTNZSA-N Gln-Thr-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ARYKRXHBIPLULY-XKBZYTNZSA-N 0.000 description 1
- STHSGOZLFLFGSS-SUSMZKCASA-N Gln-Thr-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STHSGOZLFLFGSS-SUSMZKCASA-N 0.000 description 1
- HLRLXVPRJJITSK-IFFSRLJSSA-N Gln-Thr-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HLRLXVPRJJITSK-IFFSRLJSSA-N 0.000 description 1
- YMCPEHDGTRUOHO-SXNHZJKMSA-N Gln-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CCC(=O)N)N YMCPEHDGTRUOHO-SXNHZJKMSA-N 0.000 description 1
- YJCZUTXLPXBNIO-BHYGNILZSA-N Gln-Trp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)N)N)C(=O)O YJCZUTXLPXBNIO-BHYGNILZSA-N 0.000 description 1
- BJVBMSTUUWGZKX-JYJNAYRXSA-N Gln-Tyr-His Chemical compound N[C@@H](CCC(N)=O)C(=O)N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BJVBMSTUUWGZKX-JYJNAYRXSA-N 0.000 description 1
- JKDBRTNMYXYLHO-JYJNAYRXSA-N Gln-Tyr-Leu Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 JKDBRTNMYXYLHO-JYJNAYRXSA-N 0.000 description 1
- UGEZSPWLJABDAR-KKUMJFAQSA-N Gln-Tyr-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)N)N UGEZSPWLJABDAR-KKUMJFAQSA-N 0.000 description 1
- HPBKQFJXDUVNQV-FHWLQOOXSA-N Gln-Tyr-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N)O HPBKQFJXDUVNQV-FHWLQOOXSA-N 0.000 description 1
- ZZLDMBMFKZFQMU-NRPADANISA-N Gln-Val-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O ZZLDMBMFKZFQMU-NRPADANISA-N 0.000 description 1
- OACPJRQRAHMQEQ-NHCYSSNCSA-N Gln-Val-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OACPJRQRAHMQEQ-NHCYSSNCSA-N 0.000 description 1
- QGWXAMDECCKGRU-XVKPBYJWSA-N Gln-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(N)=O)C(=O)NCC(O)=O QGWXAMDECCKGRU-XVKPBYJWSA-N 0.000 description 1
- ZMXZGYLINVNTKH-DZKIICNBSA-N Gln-Val-Phe Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ZMXZGYLINVNTKH-DZKIICNBSA-N 0.000 description 1
- VYOILACOFPPNQH-UMNHJUIQSA-N Gln-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)N)N VYOILACOFPPNQH-UMNHJUIQSA-N 0.000 description 1
- FITIQFSXXBKFFM-NRPADANISA-N Gln-Val-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O FITIQFSXXBKFFM-NRPADANISA-N 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- SZXSSXUNOALWCH-ACZMJKKPSA-N Glu-Ala-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O SZXSSXUNOALWCH-ACZMJKKPSA-N 0.000 description 1
- WZZSKAJIHTUUSG-ACZMJKKPSA-N Glu-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O WZZSKAJIHTUUSG-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- JJKKWYQVHRUSDG-GUBZILKMSA-N Glu-Ala-Lys Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O JJKKWYQVHRUSDG-GUBZILKMSA-N 0.000 description 1
- RSUVOPBMWMTVDI-XEGUGMAKSA-N Glu-Ala-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CCC(O)=O)C)C(O)=O)=CNC2=C1 RSUVOPBMWMTVDI-XEGUGMAKSA-N 0.000 description 1
- CVPXINNKRTZBMO-CIUDSAMLSA-N Glu-Arg-Asn Chemical compound C(C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)CN=C(N)N CVPXINNKRTZBMO-CIUDSAMLSA-N 0.000 description 1
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 1
- LTUVYLVIZHJCOQ-KKUMJFAQSA-N Glu-Arg-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LTUVYLVIZHJCOQ-KKUMJFAQSA-N 0.000 description 1
- DYFJZDDQPNIPAB-NHCYSSNCSA-N Glu-Arg-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O DYFJZDDQPNIPAB-NHCYSSNCSA-N 0.000 description 1
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 1
- RJONUNZIMUXUOI-GUBZILKMSA-N Glu-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)O)N RJONUNZIMUXUOI-GUBZILKMSA-N 0.000 description 1
- BUVMZWZNWMKASN-QEJZJMRPSA-N Glu-Asn-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)=CNC2=C1 BUVMZWZNWMKASN-QEJZJMRPSA-N 0.000 description 1
- JPHYJQHPILOKHC-ACZMJKKPSA-N Glu-Asp-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O JPHYJQHPILOKHC-ACZMJKKPSA-N 0.000 description 1
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 1
- DSPQRJXOIXHOHK-WDSKDSINSA-N Glu-Asp-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O DSPQRJXOIXHOHK-WDSKDSINSA-N 0.000 description 1
- RTOOAKXIJADOLL-GUBZILKMSA-N Glu-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N RTOOAKXIJADOLL-GUBZILKMSA-N 0.000 description 1
- HJIFPJUEOGZWRI-GUBZILKMSA-N Glu-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCC(=O)O)N HJIFPJUEOGZWRI-GUBZILKMSA-N 0.000 description 1
- WATXSTJXNBOHKD-LAEOZQHASA-N Glu-Asp-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O WATXSTJXNBOHKD-LAEOZQHASA-N 0.000 description 1
- KLJMRPIBBLTDGE-ACZMJKKPSA-N Glu-Cys-Asn Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(N)=O)C(O)=O KLJMRPIBBLTDGE-ACZMJKKPSA-N 0.000 description 1
- LSTFYPOGBGFIPP-FXQIFTODSA-N Glu-Cys-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O LSTFYPOGBGFIPP-FXQIFTODSA-N 0.000 description 1
- ZZIFPJZQHRJERU-WDSKDSINSA-N Glu-Cys-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O ZZIFPJZQHRJERU-WDSKDSINSA-N 0.000 description 1
- RQNYYRHRKSVKAB-GUBZILKMSA-N Glu-Cys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O RQNYYRHRKSVKAB-GUBZILKMSA-N 0.000 description 1
- ZXQPJYWZSFGWJB-AVGNSLFASA-N Glu-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N ZXQPJYWZSFGWJB-AVGNSLFASA-N 0.000 description 1
- UENPHLAAKDPZQY-XKBZYTNZSA-N Glu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O UENPHLAAKDPZQY-XKBZYTNZSA-N 0.000 description 1
- VSMQDIVEBXPKRT-QEJZJMRPSA-N Glu-Cys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N VSMQDIVEBXPKRT-QEJZJMRPSA-N 0.000 description 1
- KIMXNQXJJWWVIN-AVGNSLFASA-N Glu-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCC(=O)O)N)O KIMXNQXJJWWVIN-AVGNSLFASA-N 0.000 description 1
- FKGNJUCQKXQNRA-NRPADANISA-N Glu-Cys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H](N)CCC(O)=O FKGNJUCQKXQNRA-NRPADANISA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- CLROYXHHUZELFX-FXQIFTODSA-N Glu-Gln-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O CLROYXHHUZELFX-FXQIFTODSA-N 0.000 description 1
- XHWLNISLUFEWNS-CIUDSAMLSA-N Glu-Gln-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O XHWLNISLUFEWNS-CIUDSAMLSA-N 0.000 description 1
- XHUCVVHRLNPZSZ-CIUDSAMLSA-N Glu-Gln-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O XHUCVVHRLNPZSZ-CIUDSAMLSA-N 0.000 description 1
- CJWANNXUTOATSJ-DCAQKATOSA-N Glu-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N CJWANNXUTOATSJ-DCAQKATOSA-N 0.000 description 1
- UMIRPYLZFKOEOH-YVNDNENWSA-N Glu-Gln-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UMIRPYLZFKOEOH-YVNDNENWSA-N 0.000 description 1
- WLIPTFCZLHCNFD-LPEHRKFASA-N Glu-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O WLIPTFCZLHCNFD-LPEHRKFASA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- QQLBPVKLJBAXBS-FXQIFTODSA-N Glu-Glu-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QQLBPVKLJBAXBS-FXQIFTODSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- KASDBWKLWJKTLJ-GUBZILKMSA-N Glu-Glu-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O KASDBWKLWJKTLJ-GUBZILKMSA-N 0.000 description 1
- IQACOVZVOMVILH-FXQIFTODSA-N Glu-Glu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O IQACOVZVOMVILH-FXQIFTODSA-N 0.000 description 1
- PHONAZGUEGIOEM-GLLZPBPUSA-N Glu-Glu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PHONAZGUEGIOEM-GLLZPBPUSA-N 0.000 description 1
- BUAKRRKDHSSIKK-IHRRRGAJSA-N Glu-Glu-Tyr Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BUAKRRKDHSSIKK-IHRRRGAJSA-N 0.000 description 1
- UHVIQGKBMXEVGN-WDSKDSINSA-N Glu-Gly-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O UHVIQGKBMXEVGN-WDSKDSINSA-N 0.000 description 1
- LYCDZGLXQBPNQU-WDSKDSINSA-N Glu-Gly-Cys Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O LYCDZGLXQBPNQU-WDSKDSINSA-N 0.000 description 1
- OGNJZUXUTPQVBR-BQBZGAKWSA-N Glu-Gly-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O OGNJZUXUTPQVBR-BQBZGAKWSA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- CUXJIASLBRJOFV-LAEOZQHASA-N Glu-Gly-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H]([C@@H](C)CC)C(O)=O CUXJIASLBRJOFV-LAEOZQHASA-N 0.000 description 1
- BRKUZSLQMPNVFN-SRVKXCTJSA-N Glu-His-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BRKUZSLQMPNVFN-SRVKXCTJSA-N 0.000 description 1
- COSBSYQVPSODFX-GUBZILKMSA-N Glu-His-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N COSBSYQVPSODFX-GUBZILKMSA-N 0.000 description 1
- BIHMNDPWRUROFZ-JYJNAYRXSA-N Glu-His-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BIHMNDPWRUROFZ-JYJNAYRXSA-N 0.000 description 1
- WVTIBGWZUMJBFY-GUBZILKMSA-N Glu-His-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O WVTIBGWZUMJBFY-GUBZILKMSA-N 0.000 description 1
- QIQABBIDHGQXGA-ZPFDUUQYSA-N Glu-Ile-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O QIQABBIDHGQXGA-ZPFDUUQYSA-N 0.000 description 1
- LGYCLOCORAEQSZ-PEFMBERDSA-N Glu-Ile-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O LGYCLOCORAEQSZ-PEFMBERDSA-N 0.000 description 1
- YVYVMJNUENBOOL-KBIXCLLPSA-N Glu-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N YVYVMJNUENBOOL-KBIXCLLPSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- GRHXUHCFENOCOS-ZPFDUUQYSA-N Glu-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)O)N GRHXUHCFENOCOS-ZPFDUUQYSA-N 0.000 description 1
- WTMZXOPHTIVFCP-QEWYBTABSA-N Glu-Ile-Phe Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 WTMZXOPHTIVFCP-QEWYBTABSA-N 0.000 description 1
- KRRFFAHEAOCBCQ-SIUGBPQLSA-N Glu-Ile-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KRRFFAHEAOCBCQ-SIUGBPQLSA-N 0.000 description 1
- PJBVXVBTTFZPHJ-GUBZILKMSA-N Glu-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N PJBVXVBTTFZPHJ-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- VGBSZQSKQRMLHD-MNXVOIDGSA-N Glu-Leu-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VGBSZQSKQRMLHD-MNXVOIDGSA-N 0.000 description 1
- DWBBKNPKDHXIAC-SRVKXCTJSA-N Glu-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCC(O)=O DWBBKNPKDHXIAC-SRVKXCTJSA-N 0.000 description 1
- CUPSDFQZTVVTSK-GUBZILKMSA-N Glu-Lys-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCC(O)=O CUPSDFQZTVVTSK-GUBZILKMSA-N 0.000 description 1
- UJMNFCAHLYKWOZ-DCAQKATOSA-N Glu-Lys-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O UJMNFCAHLYKWOZ-DCAQKATOSA-N 0.000 description 1
- OCJRHJZKGGSPRW-IUCAKERBSA-N Glu-Lys-Gly Chemical compound NCCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O OCJRHJZKGGSPRW-IUCAKERBSA-N 0.000 description 1
- YGLCLCMAYUYZSG-AVGNSLFASA-N Glu-Lys-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 YGLCLCMAYUYZSG-AVGNSLFASA-N 0.000 description 1
- YKBUCXNNBYZYAY-MNXVOIDGSA-N Glu-Lys-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YKBUCXNNBYZYAY-MNXVOIDGSA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- JHSRJMUJOGLIHK-GUBZILKMSA-N Glu-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)O)N JHSRJMUJOGLIHK-GUBZILKMSA-N 0.000 description 1
- CBEUFCJRFNZMCU-SRVKXCTJSA-N Glu-Met-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O CBEUFCJRFNZMCU-SRVKXCTJSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 1
- FQFWFZWOHOEVMZ-IHRRRGAJSA-N Glu-Phe-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O FQFWFZWOHOEVMZ-IHRRRGAJSA-N 0.000 description 1
- YRMZCZIRHYCNHX-RYUDHWBXSA-N Glu-Phe-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)NCC(O)=O YRMZCZIRHYCNHX-RYUDHWBXSA-N 0.000 description 1
- QNJNPKSWAHPYGI-JYJNAYRXSA-N Glu-Phe-Leu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=CC=C1 QNJNPKSWAHPYGI-JYJNAYRXSA-N 0.000 description 1
- CHDWDBPJOZVZSE-KKUMJFAQSA-N Glu-Phe-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CHDWDBPJOZVZSE-KKUMJFAQSA-N 0.000 description 1
- FGSGPLRPQCZBSQ-AVGNSLFASA-N Glu-Phe-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O FGSGPLRPQCZBSQ-AVGNSLFASA-N 0.000 description 1
- ITVBKCZZLJUUHI-HTUGSXCWSA-N Glu-Phe-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ITVBKCZZLJUUHI-HTUGSXCWSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- HLYCMRDRWGSTPZ-CIUDSAMLSA-N Glu-Pro-Cys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)O)N)C(=O)N[C@@H](CS)C(=O)O HLYCMRDRWGSTPZ-CIUDSAMLSA-N 0.000 description 1
- PAZQYODKOZHXGA-SRVKXCTJSA-N Glu-Pro-His Chemical compound N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O PAZQYODKOZHXGA-SRVKXCTJSA-N 0.000 description 1
- ARIORLIIMJACKZ-KKUMJFAQSA-N Glu-Pro-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ARIORLIIMJACKZ-KKUMJFAQSA-N 0.000 description 1
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 1
- MRWYPDWDZSLWJM-ACZMJKKPSA-N Glu-Ser-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O MRWYPDWDZSLWJM-ACZMJKKPSA-N 0.000 description 1
- RFTVTKBHDXCEEX-WDSKDSINSA-N Glu-Ser-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RFTVTKBHDXCEEX-WDSKDSINSA-N 0.000 description 1
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 1
- BXSZPACYCMNKLS-AVGNSLFASA-N Glu-Ser-Phe Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O BXSZPACYCMNKLS-AVGNSLFASA-N 0.000 description 1
- JWNZHMSRZXXGTM-XKBZYTNZSA-N Glu-Ser-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JWNZHMSRZXXGTM-XKBZYTNZSA-N 0.000 description 1
- MWTGQXBHVRTCOR-GLLZPBPUSA-N Glu-Thr-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MWTGQXBHVRTCOR-GLLZPBPUSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- QVXWAFZDWRLXTI-NWLDYVSISA-N Glu-Thr-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QVXWAFZDWRLXTI-NWLDYVSISA-N 0.000 description 1
- DLISPGXMKZTWQG-IFFSRLJSSA-N Glu-Thr-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O DLISPGXMKZTWQG-IFFSRLJSSA-N 0.000 description 1
- JDAYMLXPUJRSDJ-XIRDDKMYSA-N Glu-Trp-Arg Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O)=CNC2=C1 JDAYMLXPUJRSDJ-XIRDDKMYSA-N 0.000 description 1
- ZTNHPMZHAILHRB-JSGCOSHPSA-N Glu-Trp-Gly Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)NCC(O)=O)=CNC2=C1 ZTNHPMZHAILHRB-JSGCOSHPSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- JLCYOCDGIUZMKQ-JBACZVJFSA-N Glu-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CCC(=O)O)N JLCYOCDGIUZMKQ-JBACZVJFSA-N 0.000 description 1
- CGWHAXBNGYQBBK-JBACZVJFSA-N Glu-Trp-Tyr Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@H](CCC(O)=O)N)C(O)=O)C1=CC=C(O)C=C1 CGWHAXBNGYQBBK-JBACZVJFSA-N 0.000 description 1
- QGAJQIGFFIQJJK-IHRRRGAJSA-N Glu-Tyr-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O QGAJQIGFFIQJJK-IHRRRGAJSA-N 0.000 description 1
- RXJFSLQVMGYQEL-IHRRRGAJSA-N Glu-Tyr-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 RXJFSLQVMGYQEL-IHRRRGAJSA-N 0.000 description 1
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 1
- HJTSRYLPAYGEEC-SIUGBPQLSA-N Glu-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCC(=O)O)N HJTSRYLPAYGEEC-SIUGBPQLSA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- YQPFCZVKMUVZIN-AUTRQRHGSA-N Glu-Val-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O YQPFCZVKMUVZIN-AUTRQRHGSA-N 0.000 description 1
- LZEUDRYSAZAJIO-AUTRQRHGSA-N Glu-Val-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LZEUDRYSAZAJIO-AUTRQRHGSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- ZYRXTRTUCAVNBQ-GVXVVHGQSA-N Glu-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)O)N ZYRXTRTUCAVNBQ-GVXVVHGQSA-N 0.000 description 1
- RMWAOBGCZZSJHE-UMNHJUIQSA-N Glu-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N RMWAOBGCZZSJHE-UMNHJUIQSA-N 0.000 description 1
- WGYHAAXZWPEBDQ-IFFSRLJSSA-N Glu-Val-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGYHAAXZWPEBDQ-IFFSRLJSSA-N 0.000 description 1
- QRWPTXLWHHTOCO-DZKIICNBSA-N Glu-Val-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QRWPTXLWHHTOCO-DZKIICNBSA-N 0.000 description 1
- FKJQNJCQTKUBCD-XPUUQOCRSA-N Gly-Ala-His Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O FKJQNJCQTKUBCD-XPUUQOCRSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- OGCIHJPYKVSMTE-YUMQZZPRSA-N Gly-Arg-Glu Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O OGCIHJPYKVSMTE-YUMQZZPRSA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KFMBRBPXHVMDFN-UWVGGRQHSA-N Gly-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCNC(N)=N KFMBRBPXHVMDFN-UWVGGRQHSA-N 0.000 description 1
- GWCRIHNSVMOBEQ-BQBZGAKWSA-N Gly-Arg-Ser Chemical compound [H]NCC(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O GWCRIHNSVMOBEQ-BQBZGAKWSA-N 0.000 description 1
- XZRZILPOZBVTDB-GJZGRUSLSA-N Gly-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)CN)C(O)=O)=CNC2=C1 XZRZILPOZBVTDB-GJZGRUSLSA-N 0.000 description 1
- XUORRGAFUQIMLC-STQMWFEESA-N Gly-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)CN)O XUORRGAFUQIMLC-STQMWFEESA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- CIMULJZTTOBOPN-WHFBIAKZSA-N Gly-Asn-Asn Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CIMULJZTTOBOPN-WHFBIAKZSA-N 0.000 description 1
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 1
- FMVLWTYYODVFRG-BQBZGAKWSA-N Gly-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)CN FMVLWTYYODVFRG-BQBZGAKWSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FMNHBTKMRFVGRO-FOHZUACHSA-N Gly-Asn-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)CN FMNHBTKMRFVGRO-FOHZUACHSA-N 0.000 description 1
- GRIRDMVMJJDZKV-RCOVLWMOSA-N Gly-Asn-Val Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O GRIRDMVMJJDZKV-RCOVLWMOSA-N 0.000 description 1
- KQDMENMTYNBWMR-WHFBIAKZSA-N Gly-Asp-Ala Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KQDMENMTYNBWMR-WHFBIAKZSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- SUDUYJOBLHQAMI-WHFBIAKZSA-N Gly-Asp-Cys Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CS)C(O)=O SUDUYJOBLHQAMI-WHFBIAKZSA-N 0.000 description 1
- QSTLUOIOYLYLLF-WDSKDSINSA-N Gly-Asp-Glu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QSTLUOIOYLYLLF-WDSKDSINSA-N 0.000 description 1
- 108010050006 Gly-Asp-Gly-Arg Proteins 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- QGZSAHIZRQHCEQ-QWRGUYRKSA-N Gly-Asp-Tyr Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QGZSAHIZRQHCEQ-QWRGUYRKSA-N 0.000 description 1
- YZACQYVWLCQWBT-BQBZGAKWSA-N Gly-Cys-Arg Chemical compound [H]NCC(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YZACQYVWLCQWBT-BQBZGAKWSA-N 0.000 description 1
- YDWZGVCXMVLDQH-WHFBIAKZSA-N Gly-Cys-Asn Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(N)=O YDWZGVCXMVLDQH-WHFBIAKZSA-N 0.000 description 1
- MQVNVZUEPUIAFA-WDSKDSINSA-N Gly-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)CN MQVNVZUEPUIAFA-WDSKDSINSA-N 0.000 description 1
- CEXINUGNTZFNRY-BYPYZUCNSA-N Gly-Cys-Gly Chemical compound [NH3+]CC(=O)N[C@@H](CS)C(=O)NCC([O-])=O CEXINUGNTZFNRY-BYPYZUCNSA-N 0.000 description 1
- YYQGVXNKAXUTJU-YUMQZZPRSA-N Gly-Cys-His Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O YYQGVXNKAXUTJU-YUMQZZPRSA-N 0.000 description 1
- UEGIPZAXNBYCCP-NKWVEPMBSA-N Gly-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)CN)C(=O)O UEGIPZAXNBYCCP-NKWVEPMBSA-N 0.000 description 1
- GYAUWXXORNTCHU-QWRGUYRKSA-N Gly-Cys-Tyr Chemical compound NCC(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 GYAUWXXORNTCHU-QWRGUYRKSA-N 0.000 description 1
- DTRUBYPMMVPQPD-YUMQZZPRSA-N Gly-Gln-Arg Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DTRUBYPMMVPQPD-YUMQZZPRSA-N 0.000 description 1
- VOCMRCVMAPSSAL-IUCAKERBSA-N Gly-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN VOCMRCVMAPSSAL-IUCAKERBSA-N 0.000 description 1
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- PABFFPWEJMEVEC-JGVFFNPUSA-N Gly-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)CN)C(=O)O PABFFPWEJMEVEC-JGVFFNPUSA-N 0.000 description 1
- NPSWCZIRBAYNSB-JHEQGTHGSA-N Gly-Gln-Thr Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NPSWCZIRBAYNSB-JHEQGTHGSA-N 0.000 description 1
- JLJLBWDKDRYOPA-RYUDHWBXSA-N Gly-Gln-Tyr Chemical compound NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 JLJLBWDKDRYOPA-RYUDHWBXSA-N 0.000 description 1
- HDNXXTBKOJKWNN-WDSKDSINSA-N Gly-Glu-Asn Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O HDNXXTBKOJKWNN-WDSKDSINSA-N 0.000 description 1
- FIQQRCFQXGLOSZ-WDSKDSINSA-N Gly-Glu-Asp Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FIQQRCFQXGLOSZ-WDSKDSINSA-N 0.000 description 1
- STVHDEHTKFXBJQ-LAEOZQHASA-N Gly-Glu-Ile Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O STVHDEHTKFXBJQ-LAEOZQHASA-N 0.000 description 1
- ZQIMMEYPEXIYBB-IUCAKERBSA-N Gly-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)CN ZQIMMEYPEXIYBB-IUCAKERBSA-N 0.000 description 1
- NTOWAXLMQFKJPT-YUMQZZPRSA-N Gly-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)CN NTOWAXLMQFKJPT-YUMQZZPRSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- PDAWDNVHMUKWJR-ZETCQYMHSA-N Gly-Gly-His Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 PDAWDNVHMUKWJR-ZETCQYMHSA-N 0.000 description 1
- KAJAOGBVWCYGHZ-JTQLQIEISA-N Gly-Gly-Phe Chemical compound [NH3+]CC(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=CC=C1 KAJAOGBVWCYGHZ-JTQLQIEISA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- YWAQATDNEKZFFK-BYPYZUCNSA-N Gly-Gly-Ser Chemical compound NCC(=O)NCC(=O)N[C@@H](CO)C(O)=O YWAQATDNEKZFFK-BYPYZUCNSA-N 0.000 description 1
- UQJNXZSSGQIPIQ-FBCQKBJTSA-N Gly-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)CN UQJNXZSSGQIPIQ-FBCQKBJTSA-N 0.000 description 1
- UPADCCSMVOQAGF-LBPRGKRZSA-N Gly-Gly-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)CNC(=O)CN)C(O)=O)=CNC2=C1 UPADCCSMVOQAGF-LBPRGKRZSA-N 0.000 description 1
- VAXIVIPMCTYSHI-YUMQZZPRSA-N Gly-His-Asp Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN VAXIVIPMCTYSHI-YUMQZZPRSA-N 0.000 description 1
- CQIIXEHDSZUSAG-QWRGUYRKSA-N Gly-His-His Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 CQIIXEHDSZUSAG-QWRGUYRKSA-N 0.000 description 1
- HPAIKDPJURGQLN-KBPBESRZSA-N Gly-His-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CNC=N1 HPAIKDPJURGQLN-KBPBESRZSA-N 0.000 description 1
- LPCKHUXOGVNZRS-YUMQZZPRSA-N Gly-His-Ser Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O LPCKHUXOGVNZRS-YUMQZZPRSA-N 0.000 description 1
- ALOBJFDJTMQQPW-ONGXEEELSA-N Gly-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)CN ALOBJFDJTMQQPW-ONGXEEELSA-N 0.000 description 1
- SXJHOPPTOJACOA-QXEWZRGKSA-N Gly-Ile-Arg Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N SXJHOPPTOJACOA-QXEWZRGKSA-N 0.000 description 1
- DENRBIYENOKSEX-PEXQALLHSA-N Gly-Ile-His Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DENRBIYENOKSEX-PEXQALLHSA-N 0.000 description 1
- AAHSHTLISQUZJL-QSFUFRPTSA-N Gly-Ile-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O AAHSHTLISQUZJL-QSFUFRPTSA-N 0.000 description 1
- UESJMAMHDLEHGM-NHCYSSNCSA-N Gly-Ile-Leu Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O UESJMAMHDLEHGM-NHCYSSNCSA-N 0.000 description 1
- ITZOBNKQDZEOCE-NHCYSSNCSA-N Gly-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)CN ITZOBNKQDZEOCE-NHCYSSNCSA-N 0.000 description 1
- SCWYHUQOOFRVHP-MBLNEYKQSA-N Gly-Ile-Thr Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SCWYHUQOOFRVHP-MBLNEYKQSA-N 0.000 description 1
- NSTUFLGQJCOCDL-UWVGGRQHSA-N Gly-Leu-Arg Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NSTUFLGQJCOCDL-UWVGGRQHSA-N 0.000 description 1
- LIXWIUAORXJNBH-QWRGUYRKSA-N Gly-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)CN LIXWIUAORXJNBH-QWRGUYRKSA-N 0.000 description 1
- JPAACTMBBBGAAR-HOTGVXAUSA-N Gly-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)CN)CC(C)C)C(O)=O)=CNC2=C1 JPAACTMBBBGAAR-HOTGVXAUSA-N 0.000 description 1
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- YKJUITHASJAGHO-HOTGVXAUSA-N Gly-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)CN YKJUITHASJAGHO-HOTGVXAUSA-N 0.000 description 1
- DBJYVKDPGIFXFO-BQBZGAKWSA-N Gly-Met-Ala Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O DBJYVKDPGIFXFO-BQBZGAKWSA-N 0.000 description 1
- SJLKKOZFHSJJAW-YUMQZZPRSA-N Gly-Met-Glu Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)CN SJLKKOZFHSJJAW-YUMQZZPRSA-N 0.000 description 1
- UWQDKRIZSROAKS-FJXKBIBVSA-N Gly-Met-Thr Chemical compound [H]NCC(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UWQDKRIZSROAKS-FJXKBIBVSA-N 0.000 description 1
- MTBIKIMYHUWBRX-QWRGUYRKSA-N Gly-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)CN MTBIKIMYHUWBRX-QWRGUYRKSA-N 0.000 description 1
- WZSHYFGOLPXPLL-RYUDHWBXSA-N Gly-Phe-Glu Chemical compound NCC(=O)N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CCC(O)=O)C(O)=O WZSHYFGOLPXPLL-RYUDHWBXSA-N 0.000 description 1
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- FEUPVVCGQLNXNP-IRXDYDNUSA-N Gly-Phe-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FEUPVVCGQLNXNP-IRXDYDNUSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- SCJJPCQUJYPHRZ-BQBZGAKWSA-N Gly-Pro-Asn Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O SCJJPCQUJYPHRZ-BQBZGAKWSA-N 0.000 description 1
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- GAAHQHNCMIAYEX-UWVGGRQHSA-N Gly-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN GAAHQHNCMIAYEX-UWVGGRQHSA-N 0.000 description 1
- IXHQLZIWBCQBLQ-STQMWFEESA-N Gly-Pro-Phe Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 IXHQLZIWBCQBLQ-STQMWFEESA-N 0.000 description 1
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- FKESCSGWBPUTPN-FOHZUACHSA-N Gly-Thr-Asn Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O FKESCSGWBPUTPN-FOHZUACHSA-N 0.000 description 1
- NVTPVQLIZCOJFK-FOHZUACHSA-N Gly-Thr-Asp Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O NVTPVQLIZCOJFK-FOHZUACHSA-N 0.000 description 1
- DBUNZBWUWCIELX-JHEQGTHGSA-N Gly-Thr-Glu Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O DBUNZBWUWCIELX-JHEQGTHGSA-N 0.000 description 1
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 1
- RHRLHXQWHCNJKR-PMVVWTBXSA-N Gly-Thr-His Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 RHRLHXQWHCNJKR-PMVVWTBXSA-N 0.000 description 1
- XHVONGZZVUUORG-WEDXCCLWSA-N Gly-Thr-Lys Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCCN XHVONGZZVUUORG-WEDXCCLWSA-N 0.000 description 1
- LLWQVJNHMYBLLK-CDMKHQONSA-N Gly-Thr-Phe Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O LLWQVJNHMYBLLK-CDMKHQONSA-N 0.000 description 1
- WSWWTQYHFCBKBT-DVJZZOLTSA-N Gly-Thr-Trp Chemical compound C[C@@H](O)[C@H](NC(=O)CN)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O WSWWTQYHFCBKBT-DVJZZOLTSA-N 0.000 description 1
- RCHFYMASWAZQQZ-ZANVPECISA-N Gly-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)CN)=CNC2=C1 RCHFYMASWAZQQZ-ZANVPECISA-N 0.000 description 1
- FXTUGWXZTFMTIV-GJZGRUSLSA-N Gly-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)CN FXTUGWXZTFMTIV-GJZGRUSLSA-N 0.000 description 1
- UMRIXLHPZZIOML-OALUTQOASA-N Gly-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)CN UMRIXLHPZZIOML-OALUTQOASA-N 0.000 description 1
- UIQGJYUEQDOODF-KWQFWETISA-N Gly-Tyr-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 UIQGJYUEQDOODF-KWQFWETISA-N 0.000 description 1
- GNNJKUYDWFIBTK-QWRGUYRKSA-N Gly-Tyr-Asp Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O GNNJKUYDWFIBTK-QWRGUYRKSA-N 0.000 description 1
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 1
- OCRQUYDOYKCOQG-IRXDYDNUSA-N Gly-Tyr-Phe Chemical compound C([C@H](NC(=O)CN)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 OCRQUYDOYKCOQG-IRXDYDNUSA-N 0.000 description 1
- LYZYGGWCBLBDMC-QWHCGFSZSA-N Gly-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)CN)C(=O)O LYZYGGWCBLBDMC-QWHCGFSZSA-N 0.000 description 1
- GBYYQVBXFVDJPJ-WLTAIBSBSA-N Gly-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)CN)O GBYYQVBXFVDJPJ-WLTAIBSBSA-N 0.000 description 1
- NGRPGJGKJMUGDM-XVKPBYJWSA-N Gly-Val-Gln Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NGRPGJGKJMUGDM-XVKPBYJWSA-N 0.000 description 1
- SYOJVRNQCXYEOV-XVKPBYJWSA-N Gly-Val-Glu Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SYOJVRNQCXYEOV-XVKPBYJWSA-N 0.000 description 1
- ZVXMEWXHFBYJPI-LSJOCFKGSA-N Gly-Val-Ile Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZVXMEWXHFBYJPI-LSJOCFKGSA-N 0.000 description 1
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- SBVMXEZQJVUARN-XPUUQOCRSA-N Gly-Val-Ser Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O SBVMXEZQJVUARN-XPUUQOCRSA-N 0.000 description 1
- IZVICCORZOSGPT-JSGCOSHPSA-N Gly-Val-Tyr Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O IZVICCORZOSGPT-JSGCOSHPSA-N 0.000 description 1
- KSOBNUBCYHGUKH-UWVGGRQHSA-N Gly-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN KSOBNUBCYHGUKH-UWVGGRQHSA-N 0.000 description 1
- 102000002068 Glycopeptides Human genes 0.000 description 1
- 108010015899 Glycopeptides Proteins 0.000 description 1
- 108090000288 Glycoproteins Proteins 0.000 description 1
- 102000003886 Glycoproteins Human genes 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- PDSUIXMZYNURGI-AVGNSLFASA-N His-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 PDSUIXMZYNURGI-AVGNSLFASA-N 0.000 description 1
- YPLYIXGKCRQZGW-SRVKXCTJSA-N His-Arg-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YPLYIXGKCRQZGW-SRVKXCTJSA-N 0.000 description 1
- SYMSVYVUSPSAAO-IHRRRGAJSA-N His-Arg-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O SYMSVYVUSPSAAO-IHRRRGAJSA-N 0.000 description 1
- MWAJSVTZZOUOBU-IHRRRGAJSA-N His-Arg-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CN=CN1 MWAJSVTZZOUOBU-IHRRRGAJSA-N 0.000 description 1
- UCDWNBFOZCZSNV-AVGNSLFASA-N His-Arg-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(O)=O UCDWNBFOZCZSNV-AVGNSLFASA-N 0.000 description 1
- ZPVJJPAIUZLSNE-DCAQKATOSA-N His-Arg-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O ZPVJJPAIUZLSNE-DCAQKATOSA-N 0.000 description 1
- TTZAWSKKNCEINZ-AVGNSLFASA-N His-Arg-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O TTZAWSKKNCEINZ-AVGNSLFASA-N 0.000 description 1
- AVQOSMRPITVTRB-CIUDSAMLSA-N His-Asn-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N AVQOSMRPITVTRB-CIUDSAMLSA-N 0.000 description 1
- OBTMRGFRLJBSFI-GARJFASQSA-N His-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OBTMRGFRLJBSFI-GARJFASQSA-N 0.000 description 1
- DFHVLUKTTVTCKY-PBCZWWQYSA-N His-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N)O DFHVLUKTTVTCKY-PBCZWWQYSA-N 0.000 description 1
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 1
- WZOGEMJIZBNFBK-CIUDSAMLSA-N His-Asp-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O WZOGEMJIZBNFBK-CIUDSAMLSA-N 0.000 description 1
- MVADCDSCFTXCBT-CIUDSAMLSA-N His-Asp-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MVADCDSCFTXCBT-CIUDSAMLSA-N 0.000 description 1
- ZJSMFRTVYSLKQU-DJFWLOJKSA-N His-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N ZJSMFRTVYSLKQU-DJFWLOJKSA-N 0.000 description 1
- WCNXUTNLSRWWQN-DCAQKATOSA-N His-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N WCNXUTNLSRWWQN-DCAQKATOSA-N 0.000 description 1
- FAQYEASGXHQQAA-XIRDDKMYSA-N His-Asp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC3=CN=CN3)N FAQYEASGXHQQAA-XIRDDKMYSA-N 0.000 description 1
- LBHOVGUGOBINDL-KKUMJFAQSA-N His-Asp-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O LBHOVGUGOBINDL-KKUMJFAQSA-N 0.000 description 1
- QNILDNVBIARMRK-XVYDVKMFSA-N His-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CN=CN1)N QNILDNVBIARMRK-XVYDVKMFSA-N 0.000 description 1
- CYHWWHKRCKHYGQ-GUBZILKMSA-N His-Cys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N CYHWWHKRCKHYGQ-GUBZILKMSA-N 0.000 description 1
- BQYZXYCEKYJKAM-VGDYDELISA-N His-Cys-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQYZXYCEKYJKAM-VGDYDELISA-N 0.000 description 1
- VLPMGIJPAWENQB-SRVKXCTJSA-N His-Cys-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(O)=O VLPMGIJPAWENQB-SRVKXCTJSA-N 0.000 description 1
- QSLKWWDKIXMWJV-SRVKXCTJSA-N His-Cys-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N QSLKWWDKIXMWJV-SRVKXCTJSA-N 0.000 description 1
- WOAMZMXCLBBQKW-KKUMJFAQSA-N His-Cys-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CN=CN2)N)O WOAMZMXCLBBQKW-KKUMJFAQSA-N 0.000 description 1
- HVCRQRQPIIRNLY-IUCAKERBSA-N His-Gln-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N HVCRQRQPIIRNLY-IUCAKERBSA-N 0.000 description 1
- IMCHNUANCIGUKS-SRVKXCTJSA-N His-Glu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IMCHNUANCIGUKS-SRVKXCTJSA-N 0.000 description 1
- YTKOTXRIWQHSAZ-GUBZILKMSA-N His-Glu-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N YTKOTXRIWQHSAZ-GUBZILKMSA-N 0.000 description 1
- JCOSMKPAOYDKRO-AVGNSLFASA-N His-Glu-Lys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N JCOSMKPAOYDKRO-AVGNSLFASA-N 0.000 description 1
- PYNUBZSXKQKAHL-UWVGGRQHSA-N His-Gly-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O PYNUBZSXKQKAHL-UWVGGRQHSA-N 0.000 description 1
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 1
- RGPWUJOMKFYFSR-QWRGUYRKSA-N His-Gly-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O RGPWUJOMKFYFSR-QWRGUYRKSA-N 0.000 description 1
- ZUPVLBAXUUGKKN-VHSXEESVSA-N His-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CC2=CN=CN2)N)C(=O)O ZUPVLBAXUUGKKN-VHSXEESVSA-N 0.000 description 1
- FYTCLUIYTYFGPT-YUMQZZPRSA-N His-Gly-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FYTCLUIYTYFGPT-YUMQZZPRSA-N 0.000 description 1
- BDFCIKANUNMFGB-PMVVWTBXSA-N His-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CN=CN1 BDFCIKANUNMFGB-PMVVWTBXSA-N 0.000 description 1
- NTXIJPDAHXSHNL-ONGXEEELSA-N His-Gly-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O NTXIJPDAHXSHNL-ONGXEEELSA-N 0.000 description 1
- CTGZVVQVIBSOBB-AVGNSLFASA-N His-His-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O CTGZVVQVIBSOBB-AVGNSLFASA-N 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- OZBDSFBWIDPVDA-BZSNNMDCSA-N His-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N OZBDSFBWIDPVDA-BZSNNMDCSA-N 0.000 description 1
- JBSLJUPMTYLLFH-MELADBBJSA-N His-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O JBSLJUPMTYLLFH-MELADBBJSA-N 0.000 description 1
- BZKDJRSZWLPJNI-SRVKXCTJSA-N His-His-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O BZKDJRSZWLPJNI-SRVKXCTJSA-N 0.000 description 1
- CNHSMSFYVARZLI-YJRXYDGGSA-N His-His-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CNHSMSFYVARZLI-YJRXYDGGSA-N 0.000 description 1
- QPSCMXDWVKWVOW-BZSNNMDCSA-N His-His-Tyr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QPSCMXDWVKWVOW-BZSNNMDCSA-N 0.000 description 1
- QMUHTRISZMFKAY-MXAVVETBSA-N His-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N QMUHTRISZMFKAY-MXAVVETBSA-N 0.000 description 1
- DYKZGTLPSNOFHU-DEQVHRJGSA-N His-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N DYKZGTLPSNOFHU-DEQVHRJGSA-N 0.000 description 1
- WZBLRQQCDYYRTD-SIXJUCDHSA-N His-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N WZBLRQQCDYYRTD-SIXJUCDHSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- SKYULSWNBYAQMG-IHRRRGAJSA-N His-Leu-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SKYULSWNBYAQMG-IHRRRGAJSA-N 0.000 description 1
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 1
- BPOHQCZZSFBSON-KKUMJFAQSA-N His-Leu-His Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BPOHQCZZSFBSON-KKUMJFAQSA-N 0.000 description 1
- BXOLYFJYQQRQDJ-MXAVVETBSA-N His-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CN=CN1)N BXOLYFJYQQRQDJ-MXAVVETBSA-N 0.000 description 1
- LVWIJITYHRZHBO-IXOXFDKPSA-N His-Leu-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LVWIJITYHRZHBO-IXOXFDKPSA-N 0.000 description 1
- VGYOLSOFODKLSP-IHPCNDPISA-N His-Leu-Trp Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CN=CN1 VGYOLSOFODKLSP-IHPCNDPISA-N 0.000 description 1
- GUXQAPACZVVOKX-AVGNSLFASA-N His-Lys-Gln Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GUXQAPACZVVOKX-AVGNSLFASA-N 0.000 description 1
- JUIOPCXACJLRJK-AVGNSLFASA-N His-Lys-Glu Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N JUIOPCXACJLRJK-AVGNSLFASA-N 0.000 description 1
- DPQIPEAHIYMUEJ-IHRRRGAJSA-N His-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CN=CN1)N DPQIPEAHIYMUEJ-IHRRRGAJSA-N 0.000 description 1
- TTYKEFZRLKQTHH-MELADBBJSA-N His-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O TTYKEFZRLKQTHH-MELADBBJSA-N 0.000 description 1
- BKOVCRUIXDIWFV-IXOXFDKPSA-N His-Lys-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CN=CN1 BKOVCRUIXDIWFV-IXOXFDKPSA-N 0.000 description 1
- TVMNTHXFRSXZGR-IHRRRGAJSA-N His-Lys-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O TVMNTHXFRSXZGR-IHRRRGAJSA-N 0.000 description 1
- XDIVYNSPYBLSME-DCAQKATOSA-N His-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N XDIVYNSPYBLSME-DCAQKATOSA-N 0.000 description 1
- WYSJPCTWSBJFCO-AVGNSLFASA-N His-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CC1=CN=CN1)N WYSJPCTWSBJFCO-AVGNSLFASA-N 0.000 description 1
- SVVULKPWDBIPCO-BZSNNMDCSA-N His-Phe-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O SVVULKPWDBIPCO-BZSNNMDCSA-N 0.000 description 1
- ULRFSEJGSHYLQI-YESZJQIVSA-N His-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CC3=CN=CN3)N)C(=O)O ULRFSEJGSHYLQI-YESZJQIVSA-N 0.000 description 1
- ZVKDCQVQTGYBQT-LSJOCFKGSA-N His-Pro-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O ZVKDCQVQTGYBQT-LSJOCFKGSA-N 0.000 description 1
- GNBHSMFBUNEWCJ-DCAQKATOSA-N His-Pro-Asn Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GNBHSMFBUNEWCJ-DCAQKATOSA-N 0.000 description 1
- ABCCKUZDWMERKT-AVGNSLFASA-N His-Pro-Met Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O ABCCKUZDWMERKT-AVGNSLFASA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- CWSZWFILCNSNEX-CIUDSAMLSA-N His-Ser-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CWSZWFILCNSNEX-CIUDSAMLSA-N 0.000 description 1
- WKEABZIITNXXQZ-CIUDSAMLSA-N His-Ser-Cys Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N WKEABZIITNXXQZ-CIUDSAMLSA-N 0.000 description 1
- JMSONHOUHFDOJH-GUBZILKMSA-N His-Ser-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 JMSONHOUHFDOJH-GUBZILKMSA-N 0.000 description 1
- ZHHLTWUOWXHVQJ-YUMQZZPRSA-N His-Ser-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZHHLTWUOWXHVQJ-YUMQZZPRSA-N 0.000 description 1
- DGLAHESNTJWGDO-SRVKXCTJSA-N His-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N DGLAHESNTJWGDO-SRVKXCTJSA-N 0.000 description 1
- JGFWUKYIQAEYAH-DCAQKATOSA-N His-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JGFWUKYIQAEYAH-DCAQKATOSA-N 0.000 description 1
- XHQYFGPIRUHQIB-PBCZWWQYSA-N His-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CN=CN1 XHQYFGPIRUHQIB-PBCZWWQYSA-N 0.000 description 1
- HZWWOGWOBQBETJ-CUJWVEQBSA-N His-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N)O HZWWOGWOBQBETJ-CUJWVEQBSA-N 0.000 description 1
- UPJODPVSKKWGDQ-KLHWPWHYSA-N His-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)O UPJODPVSKKWGDQ-KLHWPWHYSA-N 0.000 description 1
- FWWJVUFXUQOEDM-WDSOQIARSA-N His-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N FWWJVUFXUQOEDM-WDSOQIARSA-N 0.000 description 1
- DGVYSZUCRYXKOJ-XIRDDKMYSA-N His-Trp-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N DGVYSZUCRYXKOJ-XIRDDKMYSA-N 0.000 description 1
- KECFCPNPPYCGBL-PMVMPFDFSA-N His-Trp-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)NC(=O)[C@H](CC4=CN=CN4)N KECFCPNPPYCGBL-PMVMPFDFSA-N 0.000 description 1
- PZUZIHRPOVVHOT-KBPBESRZSA-N His-Tyr-Gly Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)NCC(O)=O)C1=CN=CN1 PZUZIHRPOVVHOT-KBPBESRZSA-N 0.000 description 1
- QTMKFZAYZKBFRC-BZSNNMDCSA-N His-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CN=CN3)N)O QTMKFZAYZKBFRC-BZSNNMDCSA-N 0.000 description 1
- JATYGDHMDRAISQ-KKUMJFAQSA-N His-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O JATYGDHMDRAISQ-KKUMJFAQSA-N 0.000 description 1
- KDDKJKKQODQQBR-NHCYSSNCSA-N His-Val-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N KDDKJKKQODQQBR-NHCYSSNCSA-N 0.000 description 1
- CGAMSLMBYJHMDY-ONGXEEELSA-N His-Val-Gly Chemical compound CC(C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CN=CN1)N CGAMSLMBYJHMDY-ONGXEEELSA-N 0.000 description 1
- FFYYUUWROYYKFY-IHRRRGAJSA-N His-Val-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O FFYYUUWROYYKFY-IHRRRGAJSA-N 0.000 description 1
- GGXUJBKENKVYNV-ULQDDVLXSA-N His-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CN=CN2)N GGXUJBKENKVYNV-ULQDDVLXSA-N 0.000 description 1
- 101000604463 Homo sapiens Netrin-G1 Proteins 0.000 description 1
- 108700039609 IRW peptide Proteins 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- JRHFQUPIZOYKQP-KBIXCLLPSA-N Ile-Ala-Glu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O JRHFQUPIZOYKQP-KBIXCLLPSA-N 0.000 description 1
- WUEIUSDAECDLQO-NAKRPEOUSA-N Ile-Ala-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)O)N WUEIUSDAECDLQO-NAKRPEOUSA-N 0.000 description 1
- HERITAGIPLEJMT-GVARAGBVSA-N Ile-Ala-Tyr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 HERITAGIPLEJMT-GVARAGBVSA-N 0.000 description 1
- TZCGZYWNIDZZMR-UHFFFAOYSA-N Ile-Arg-Ala Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(C)C(O)=O)CCCN=C(N)N TZCGZYWNIDZZMR-UHFFFAOYSA-N 0.000 description 1
- FVEWRQXNISSYFO-ZPFDUUQYSA-N Ile-Arg-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N FVEWRQXNISSYFO-ZPFDUUQYSA-N 0.000 description 1
- QLRMMMQNCWBNPQ-QXEWZRGKSA-N Ile-Arg-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)NCC(=O)O)N QLRMMMQNCWBNPQ-QXEWZRGKSA-N 0.000 description 1
- YOTNPRLPIPHQSB-XUXIUFHCSA-N Ile-Arg-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOTNPRLPIPHQSB-XUXIUFHCSA-N 0.000 description 1
- VZIFYHYNQDIPLI-HJWJTTGWSA-N Ile-Arg-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N VZIFYHYNQDIPLI-HJWJTTGWSA-N 0.000 description 1
- DMHGKBGOUAJRHU-RVMXOQNASA-N Ile-Arg-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N DMHGKBGOUAJRHU-RVMXOQNASA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- QTUSJASXLGLJSR-OSUNSFLBSA-N Ile-Arg-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N QTUSJASXLGLJSR-OSUNSFLBSA-N 0.000 description 1
- YKRIXHPEIZUDDY-GMOBBJLQSA-N Ile-Asn-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YKRIXHPEIZUDDY-GMOBBJLQSA-N 0.000 description 1
- SCHZQZPYHBWYEQ-PEFMBERDSA-N Ile-Asn-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SCHZQZPYHBWYEQ-PEFMBERDSA-N 0.000 description 1
- UAVQIQOOBXFKRC-BYULHYEWSA-N Ile-Asn-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O UAVQIQOOBXFKRC-BYULHYEWSA-N 0.000 description 1
- XENGULNPUDGALZ-ZPFDUUQYSA-N Ile-Asn-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N XENGULNPUDGALZ-ZPFDUUQYSA-N 0.000 description 1
- HDODQNPMSHDXJT-GHCJXIJMSA-N Ile-Asn-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O HDODQNPMSHDXJT-GHCJXIJMSA-N 0.000 description 1
- RPZFUIQVAPZLRH-GHCJXIJMSA-N Ile-Asp-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](C)C(=O)O)N RPZFUIQVAPZLRH-GHCJXIJMSA-N 0.000 description 1
- UMYZBHKAVTXWIW-GMOBBJLQSA-N Ile-Asp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UMYZBHKAVTXWIW-GMOBBJLQSA-N 0.000 description 1
- NBJAAWYRLGCJOF-UGYAYLCHSA-N Ile-Asp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NBJAAWYRLGCJOF-UGYAYLCHSA-N 0.000 description 1
- HVWXAQVMRBKKFE-UGYAYLCHSA-N Ile-Asp-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HVWXAQVMRBKKFE-UGYAYLCHSA-N 0.000 description 1
- IDAHFEPYTJJZFD-PEFMBERDSA-N Ile-Asp-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N IDAHFEPYTJJZFD-PEFMBERDSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- BGZIJZJBXRVBGJ-SXTJYALSSA-N Ile-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N BGZIJZJBXRVBGJ-SXTJYALSSA-N 0.000 description 1
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 1
- REJKOQYVFDEZHA-SLBDDTMCSA-N Ile-Asp-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N REJKOQYVFDEZHA-SLBDDTMCSA-N 0.000 description 1
- FHCNLXMTQJNJNH-KBIXCLLPSA-N Ile-Cys-Gln Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(=O)O FHCNLXMTQJNJNH-KBIXCLLPSA-N 0.000 description 1
- LDRALPZEVHVXEK-KBIXCLLPSA-N Ile-Cys-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N LDRALPZEVHVXEK-KBIXCLLPSA-N 0.000 description 1
- PFTFEWHJSAXGED-ZKWXMUAHSA-N Ile-Cys-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N PFTFEWHJSAXGED-ZKWXMUAHSA-N 0.000 description 1
- PPSQSIDMOVPKPI-BJDJZHNGSA-N Ile-Cys-Leu Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)O PPSQSIDMOVPKPI-BJDJZHNGSA-N 0.000 description 1
- AWTDTFXPVCTHAK-BJDJZHNGSA-N Ile-Cys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCCN)C(=O)O)N AWTDTFXPVCTHAK-BJDJZHNGSA-N 0.000 description 1
- WEWCEPOYKANMGZ-MMWGEVLESA-N Ile-Cys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N1CCC[C@@H]1C(=O)O)N WEWCEPOYKANMGZ-MMWGEVLESA-N 0.000 description 1
- VQUCKIAECLVLAD-SVSWQMSJSA-N Ile-Cys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VQUCKIAECLVLAD-SVSWQMSJSA-N 0.000 description 1
- ZIPOVLBRVPXWJQ-SPOWBLRKSA-N Ile-Cys-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N ZIPOVLBRVPXWJQ-SPOWBLRKSA-N 0.000 description 1
- ZDNORQNHCJUVOV-KBIXCLLPSA-N Ile-Gln-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O ZDNORQNHCJUVOV-KBIXCLLPSA-N 0.000 description 1
- GECLQMBTZCPAFY-PEFMBERDSA-N Ile-Gln-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(=O)O)C(=O)O)N GECLQMBTZCPAFY-PEFMBERDSA-N 0.000 description 1
- LJKDGRWXYUTRSH-YVNDNENWSA-N Ile-Gln-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LJKDGRWXYUTRSH-YVNDNENWSA-N 0.000 description 1
- CYHJCEKUMCNDFG-LAEOZQHASA-N Ile-Gln-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)NCC(=O)O)N CYHJCEKUMCNDFG-LAEOZQHASA-N 0.000 description 1
- KUHFPGIVBOCRMV-MNXVOIDGSA-N Ile-Gln-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC(C)C)C(=O)O)N KUHFPGIVBOCRMV-MNXVOIDGSA-N 0.000 description 1
- LKACSKJPTFSBHR-MNXVOIDGSA-N Ile-Gln-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N LKACSKJPTFSBHR-MNXVOIDGSA-N 0.000 description 1
- JRYQSFOFUFXPTB-RWRJDSDZSA-N Ile-Gln-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N JRYQSFOFUFXPTB-RWRJDSDZSA-N 0.000 description 1
- MVLDERGQICFFLL-ZQINRCPSSA-N Ile-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 MVLDERGQICFFLL-ZQINRCPSSA-N 0.000 description 1
- LGMUPVWZEYYUMU-YVNDNENWSA-N Ile-Glu-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N LGMUPVWZEYYUMU-YVNDNENWSA-N 0.000 description 1
- LPXHYGGZJOCAFR-MNXVOIDGSA-N Ile-Glu-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N LPXHYGGZJOCAFR-MNXVOIDGSA-N 0.000 description 1
- DFJJAVZIHDFOGQ-MNXVOIDGSA-N Ile-Glu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N DFJJAVZIHDFOGQ-MNXVOIDGSA-N 0.000 description 1
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 1
- XLCZWMJPVGRWHJ-KQXIARHKSA-N Ile-Glu-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N XLCZWMJPVGRWHJ-KQXIARHKSA-N 0.000 description 1
- PNDMHTTXXPUQJH-RWRJDSDZSA-N Ile-Glu-Thr Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H]([C@H](O)C)C(=O)O PNDMHTTXXPUQJH-RWRJDSDZSA-N 0.000 description 1
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 1
- KFVUBLZRFSVDGO-BYULHYEWSA-N Ile-Gly-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O KFVUBLZRFSVDGO-BYULHYEWSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- NYEYYMLUABXDMC-NHCYSSNCSA-N Ile-Gly-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)O)N NYEYYMLUABXDMC-NHCYSSNCSA-N 0.000 description 1
- LWWILHPVAKKLQS-QXEWZRGKSA-N Ile-Gly-Met Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N LWWILHPVAKKLQS-QXEWZRGKSA-N 0.000 description 1
- GQKSJYINYYWPMR-NGZCFLSTSA-N Ile-Gly-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N GQKSJYINYYWPMR-NGZCFLSTSA-N 0.000 description 1
- UAQSZXGJGLHMNV-XEGUGMAKSA-N Ile-Gly-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N UAQSZXGJGLHMNV-XEGUGMAKSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- JNDYZNJRRNFYIR-VGDYDELISA-N Ile-His-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N JNDYZNJRRNFYIR-VGDYDELISA-N 0.000 description 1
- UASTVUQJMLZWGG-PEXQALLHSA-N Ile-His-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N UASTVUQJMLZWGG-PEXQALLHSA-N 0.000 description 1
- VUEXLJFLDONGKQ-PYJNHQTQSA-N Ile-His-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCSC)C(=O)O)N VUEXLJFLDONGKQ-PYJNHQTQSA-N 0.000 description 1
- LNJLOZYNZFGJMM-DEQVHRJGSA-N Ile-His-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N LNJLOZYNZFGJMM-DEQVHRJGSA-N 0.000 description 1
- SJLVSMMIFYTSGY-GRLWGSQLSA-N Ile-Ile-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N SJLVSMMIFYTSGY-GRLWGSQLSA-N 0.000 description 1
- YNMQUIVKEFRCPH-QSFUFRPTSA-N Ile-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O)N YNMQUIVKEFRCPH-QSFUFRPTSA-N 0.000 description 1
- TWPSALMCEHCIOY-YTFOTSKYSA-N Ile-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(=O)O)N TWPSALMCEHCIOY-YTFOTSKYSA-N 0.000 description 1
- AXNGDPAKKCEKGY-QPHKQPEJSA-N Ile-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N AXNGDPAKKCEKGY-QPHKQPEJSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- PKGGWLOLRLOPGK-XUXIUFHCSA-N Ile-Leu-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PKGGWLOLRLOPGK-XUXIUFHCSA-N 0.000 description 1
- HUORUFRRJHELPD-MNXVOIDGSA-N Ile-Leu-Glu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N HUORUFRRJHELPD-MNXVOIDGSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- GVNNAHIRSDRIII-AJNGGQMLSA-N Ile-Lys-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N GVNNAHIRSDRIII-AJNGGQMLSA-N 0.000 description 1
- CEPIAEUVRKGPGP-DSYPUSFNSA-N Ile-Lys-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)[C@@H](C)CC)C(O)=O)=CNC2=C1 CEPIAEUVRKGPGP-DSYPUSFNSA-N 0.000 description 1
- UFRXVQGGPNSJRY-CYDGBPFRSA-N Ile-Met-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N UFRXVQGGPNSJRY-CYDGBPFRSA-N 0.000 description 1
- DNKDIDZHXZAGRY-HJWJTTGWSA-N Ile-Met-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DNKDIDZHXZAGRY-HJWJTTGWSA-N 0.000 description 1
- NPAYJTAXWXJKLO-NAKRPEOUSA-N Ile-Met-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)O)N NPAYJTAXWXJKLO-NAKRPEOUSA-N 0.000 description 1
- BKPPWVSPSIUXHZ-OSUNSFLBSA-N Ile-Met-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N BKPPWVSPSIUXHZ-OSUNSFLBSA-N 0.000 description 1
- SNHYFFQZRFIRHO-CYDGBPFRSA-N Ile-Met-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(=O)O)N SNHYFFQZRFIRHO-CYDGBPFRSA-N 0.000 description 1
- OTSVBELRDMSPKY-PCBIJLKTSA-N Ile-Phe-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OTSVBELRDMSPKY-PCBIJLKTSA-N 0.000 description 1
- UAELWXJFLZBKQS-WHOFXGATSA-N Ile-Phe-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O UAELWXJFLZBKQS-WHOFXGATSA-N 0.000 description 1
- USXAYNCLFSUSBA-MGHWNKPDSA-N Ile-Phe-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N USXAYNCLFSUSBA-MGHWNKPDSA-N 0.000 description 1
- WYUHAXJAMDTOAU-IAVJCBSLSA-N Ile-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N WYUHAXJAMDTOAU-IAVJCBSLSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- CIDLJWVDMNDKPT-FIRPJDEBSA-N Ile-Phe-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N CIDLJWVDMNDKPT-FIRPJDEBSA-N 0.000 description 1
- FGBRXCZYVRFNKQ-MXAVVETBSA-N Ile-Phe-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N FGBRXCZYVRFNKQ-MXAVVETBSA-N 0.000 description 1
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 1
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 1
- BJECXJHLUJXPJQ-PYJNHQTQSA-N Ile-Pro-His Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N BJECXJHLUJXPJQ-PYJNHQTQSA-N 0.000 description 1
- IVXJIMGDOYRLQU-XUXIUFHCSA-N Ile-Pro-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O IVXJIMGDOYRLQU-XUXIUFHCSA-N 0.000 description 1
- MLSUZXHSNRBDCI-CYDGBPFRSA-N Ile-Pro-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)O)N MLSUZXHSNRBDCI-CYDGBPFRSA-N 0.000 description 1
- JHNJNTMTZHEDLJ-NAKRPEOUSA-N Ile-Ser-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JHNJNTMTZHEDLJ-NAKRPEOUSA-N 0.000 description 1
- JODPUDMBQBIWCK-GHCJXIJMSA-N Ile-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O JODPUDMBQBIWCK-GHCJXIJMSA-N 0.000 description 1
- FBGXMKUWQFPHFB-JBDRJPRFSA-N Ile-Ser-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N FBGXMKUWQFPHFB-JBDRJPRFSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 1
- ZDNNDIJTUHQCAM-MXAVVETBSA-N Ile-Ser-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N ZDNNDIJTUHQCAM-MXAVVETBSA-N 0.000 description 1
- WLRJHVNFGAOYPS-HJPIBITLSA-N Ile-Ser-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N WLRJHVNFGAOYPS-HJPIBITLSA-N 0.000 description 1
- HXIDVIFHRYRXLZ-NAKRPEOUSA-N Ile-Ser-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)O)N HXIDVIFHRYRXLZ-NAKRPEOUSA-N 0.000 description 1
- PZWBBXHHUSIGKH-OSUNSFLBSA-N Ile-Thr-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PZWBBXHHUSIGKH-OSUNSFLBSA-N 0.000 description 1
- YCKPUHHMCFSUMD-IUKAMOBKSA-N Ile-Thr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N YCKPUHHMCFSUMD-IUKAMOBKSA-N 0.000 description 1
- YBKKLDBBPFIXBQ-MBLNEYKQSA-N Ile-Thr-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)O)N YBKKLDBBPFIXBQ-MBLNEYKQSA-N 0.000 description 1
- HJDZMPFEXINXLO-QPHKQPEJSA-N Ile-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N HJDZMPFEXINXLO-QPHKQPEJSA-N 0.000 description 1
- KBDIBHQICWDGDL-PPCPHDFISA-N Ile-Thr-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N KBDIBHQICWDGDL-PPCPHDFISA-N 0.000 description 1
- QGXQHJQPAPMACW-PPCPHDFISA-N Ile-Thr-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)O)N QGXQHJQPAPMACW-PPCPHDFISA-N 0.000 description 1
- WXLYNEHOGRYNFU-URLPEUOOSA-N Ile-Thr-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N WXLYNEHOGRYNFU-URLPEUOOSA-N 0.000 description 1
- KWHFUMYCSPJCFQ-NGTWOADLSA-N Ile-Thr-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N KWHFUMYCSPJCFQ-NGTWOADLSA-N 0.000 description 1
- QHUREMVLLMNUAX-OSUNSFLBSA-N Ile-Thr-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)O)N QHUREMVLLMNUAX-OSUNSFLBSA-N 0.000 description 1
- BLFXHAFTNYZEQE-VKOGCVSHSA-N Ile-Trp-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N BLFXHAFTNYZEQE-VKOGCVSHSA-N 0.000 description 1
- BZUOLKFQVVBTJY-SLBDDTMCSA-N Ile-Trp-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)N)C(=O)O)N BZUOLKFQVVBTJY-SLBDDTMCSA-N 0.000 description 1
- OMDWJWGZGMCQND-CFMVVWHZSA-N Ile-Tyr-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OMDWJWGZGMCQND-CFMVVWHZSA-N 0.000 description 1
- GNXGAVNTVNOCLL-SIUGBPQLSA-N Ile-Tyr-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GNXGAVNTVNOCLL-SIUGBPQLSA-N 0.000 description 1
- REXAUQBGSGDEJY-IGISWZIWSA-N Ile-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)N REXAUQBGSGDEJY-IGISWZIWSA-N 0.000 description 1
- GVEODXUBBFDBPW-MGHWNKPDSA-N Ile-Tyr-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)CC1=CC=C(O)C=C1 GVEODXUBBFDBPW-MGHWNKPDSA-N 0.000 description 1
- NXRNRBOKDBIVKQ-CXTHYWKRSA-N Ile-Tyr-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N NXRNRBOKDBIVKQ-CXTHYWKRSA-N 0.000 description 1
- AUIYHFRUOOKTGX-UKJIMTQDSA-N Ile-Val-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N AUIYHFRUOOKTGX-UKJIMTQDSA-N 0.000 description 1
- UYODHPPSCXBNCS-XUXIUFHCSA-N Ile-Val-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C UYODHPPSCXBNCS-XUXIUFHCSA-N 0.000 description 1
- RQZFWBLDTBDEOF-RNJOBUHISA-N Ile-Val-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N RQZFWBLDTBDEOF-RNJOBUHISA-N 0.000 description 1
- 102100034343 Integrase Human genes 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- KFKWRHQBZQICHA-STQMWFEESA-N L-leucyl-L-phenylalanine Natural products CC(C)C[C@H](N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 KFKWRHQBZQICHA-STQMWFEESA-N 0.000 description 1
- TYYLDKGBCJGJGW-UHFFFAOYSA-N L-tryptophan-L-tyrosine Natural products C=1NC2=CC=CC=C2C=1CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 TYYLDKGBCJGJGW-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- CZCSUZMIRKFFFA-CIUDSAMLSA-N Leu-Ala-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O CZCSUZMIRKFFFA-CIUDSAMLSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- QPRQGENIBFLVEB-BJDJZHNGSA-N Leu-Ala-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O QPRQGENIBFLVEB-BJDJZHNGSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- SUPVSFFZWVOEOI-CQDKDKBSSA-N Leu-Ala-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 SUPVSFFZWVOEOI-CQDKDKBSSA-N 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- REPPKAMYTOJTFC-DCAQKATOSA-N Leu-Arg-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O REPPKAMYTOJTFC-DCAQKATOSA-N 0.000 description 1
- UILIPCLTHRPCRB-XUXIUFHCSA-N Leu-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(C)C)N UILIPCLTHRPCRB-XUXIUFHCSA-N 0.000 description 1
- QUAAUWNLWMLERT-IHRRRGAJSA-N Leu-Arg-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O QUAAUWNLWMLERT-IHRRRGAJSA-N 0.000 description 1
- YOZCKMXHBYKOMQ-IHRRRGAJSA-N Leu-Arg-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N YOZCKMXHBYKOMQ-IHRRRGAJSA-N 0.000 description 1
- XYUBOFCTGPZFSA-WDSOQIARSA-N Leu-Arg-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 XYUBOFCTGPZFSA-WDSOQIARSA-N 0.000 description 1
- DUBAVOVZNZKEQQ-AVGNSLFASA-N Leu-Arg-Val Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CCCN=C(N)N DUBAVOVZNZKEQQ-AVGNSLFASA-N 0.000 description 1
- WUFYAPWIHCUMLL-CIUDSAMLSA-N Leu-Asn-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O WUFYAPWIHCUMLL-CIUDSAMLSA-N 0.000 description 1
- DBVWMYGBVFCRBE-CIUDSAMLSA-N Leu-Asn-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DBVWMYGBVFCRBE-CIUDSAMLSA-N 0.000 description 1
- RFUBXQQFJFGJFV-GUBZILKMSA-N Leu-Asn-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O RFUBXQQFJFGJFV-GUBZILKMSA-N 0.000 description 1
- OIARJGNVARWKFP-YUMQZZPRSA-N Leu-Asn-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O OIARJGNVARWKFP-YUMQZZPRSA-N 0.000 description 1
- OXKYZSRZKBTVEY-ZPFDUUQYSA-N Leu-Asn-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OXKYZSRZKBTVEY-ZPFDUUQYSA-N 0.000 description 1
- WGNOPSQMIQERPK-GARJFASQSA-N Leu-Asn-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N WGNOPSQMIQERPK-GARJFASQSA-N 0.000 description 1
- BPANDPNDMJHFEV-CIUDSAMLSA-N Leu-Asp-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O BPANDPNDMJHFEV-CIUDSAMLSA-N 0.000 description 1
- DLFAACQHIRSQGG-CIUDSAMLSA-N Leu-Asp-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O DLFAACQHIRSQGG-CIUDSAMLSA-N 0.000 description 1
- KTFHTMHHKXUYPW-ZPFDUUQYSA-N Leu-Asp-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KTFHTMHHKXUYPW-ZPFDUUQYSA-N 0.000 description 1
- XVSJMWYYLHPDKY-DCAQKATOSA-N Leu-Asp-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O XVSJMWYYLHPDKY-DCAQKATOSA-N 0.000 description 1
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 1
- RRSLQOLASISYTB-CIUDSAMLSA-N Leu-Cys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O RRSLQOLASISYTB-CIUDSAMLSA-N 0.000 description 1
- PPTAQBNUFKTJKA-BJDJZHNGSA-N Leu-Cys-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PPTAQBNUFKTJKA-BJDJZHNGSA-N 0.000 description 1
- PPBKJAQJAUHZKX-SRVKXCTJSA-N Leu-Cys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC(C)C PPBKJAQJAUHZKX-SRVKXCTJSA-N 0.000 description 1
- PNUCWVAGVNLUMW-CIUDSAMLSA-N Leu-Cys-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O PNUCWVAGVNLUMW-CIUDSAMLSA-N 0.000 description 1
- FOEHRHOBWFQSNW-KATARQTJSA-N Leu-Cys-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(C)C)N)O FOEHRHOBWFQSNW-KATARQTJSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- DLCXCECTCPKKCD-GUBZILKMSA-N Leu-Gln-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O DLCXCECTCPKKCD-GUBZILKMSA-N 0.000 description 1
- BOFAFKVZQUMTID-AVGNSLFASA-N Leu-Gln-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N BOFAFKVZQUMTID-AVGNSLFASA-N 0.000 description 1
- YSKSXVKQLLBVEX-SZMVWBNQSA-N Leu-Gln-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC(C)C)C(O)=O)=CNC2=C1 YSKSXVKQLLBVEX-SZMVWBNQSA-N 0.000 description 1
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 1
- PRZVBIAOPFGAQF-SRVKXCTJSA-N Leu-Glu-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O PRZVBIAOPFGAQF-SRVKXCTJSA-N 0.000 description 1
- HVJVUYQWFYMGJS-GVXVVHGQSA-N Leu-Glu-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O HVJVUYQWFYMGJS-GVXVVHGQSA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- FMEICTQWUKNAGC-YUMQZZPRSA-N Leu-Gly-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O FMEICTQWUKNAGC-YUMQZZPRSA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- VBZOAGIPCULURB-QWRGUYRKSA-N Leu-Gly-His Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N VBZOAGIPCULURB-QWRGUYRKSA-N 0.000 description 1
- JRJLGNFWYFSJHB-HOCLYGCPSA-N Leu-Gly-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JRJLGNFWYFSJHB-HOCLYGCPSA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- YWYQSLOTVIRCFE-SRVKXCTJSA-N Leu-His-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(O)=O YWYQSLOTVIRCFE-SRVKXCTJSA-N 0.000 description 1
- DDEMUMVXNFPDKC-SRVKXCTJSA-N Leu-His-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CS)C(=O)O)N DDEMUMVXNFPDKC-SRVKXCTJSA-N 0.000 description 1
- CFZZDVMBRYFFNU-QWRGUYRKSA-N Leu-His-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)NCC(O)=O CFZZDVMBRYFFNU-QWRGUYRKSA-N 0.000 description 1
- XQXGNBFMAXWIGI-MXAVVETBSA-N Leu-His-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(C)C)CC1=CN=CN1 XQXGNBFMAXWIGI-MXAVVETBSA-N 0.000 description 1
- CSFVADKICPDRRF-KKUMJFAQSA-N Leu-His-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CN=CN1 CSFVADKICPDRRF-KKUMJFAQSA-N 0.000 description 1
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 1
- USLNHQZCDQJBOV-ZPFDUUQYSA-N Leu-Ile-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O USLNHQZCDQJBOV-ZPFDUUQYSA-N 0.000 description 1
- AUBMZAMQCOYSIC-MNXVOIDGSA-N Leu-Ile-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O AUBMZAMQCOYSIC-MNXVOIDGSA-N 0.000 description 1
- JFSGIJSCJFQGSZ-MXAVVETBSA-N Leu-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N JFSGIJSCJFQGSZ-MXAVVETBSA-N 0.000 description 1
- SEMUSFOBZGKBGW-YTFOTSKYSA-N Leu-Ile-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SEMUSFOBZGKBGW-YTFOTSKYSA-N 0.000 description 1
- KUIDCYNIEJBZBU-AJNGGQMLSA-N Leu-Ile-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O KUIDCYNIEJBZBU-AJNGGQMLSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- OMHLATXVNQSALM-FQUUOJAGSA-N Leu-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(C)C)N OMHLATXVNQSALM-FQUUOJAGSA-N 0.000 description 1
- LIINDKYIGYTDLG-PPCPHDFISA-N Leu-Ile-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LIINDKYIGYTDLG-PPCPHDFISA-N 0.000 description 1
- NRFGTHFONZYFNY-MGHWNKPDSA-N Leu-Ile-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NRFGTHFONZYFNY-MGHWNKPDSA-N 0.000 description 1
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- WXUOJXIGOPMDJM-SRVKXCTJSA-N Leu-Lys-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O WXUOJXIGOPMDJM-SRVKXCTJSA-N 0.000 description 1
- DCGXHWINSHEPIR-SRVKXCTJSA-N Leu-Lys-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N DCGXHWINSHEPIR-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- BGZCJDGBBUUBHA-KKUMJFAQSA-N Leu-Lys-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O BGZCJDGBBUUBHA-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- FIICHHJDINDXKG-IHPCNDPISA-N Leu-Lys-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O FIICHHJDINDXKG-IHPCNDPISA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- ARRIJPQRBWRNLT-DCAQKATOSA-N Leu-Met-Asn Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ARRIJPQRBWRNLT-DCAQKATOSA-N 0.000 description 1
- LQUIENKUVKPNIC-ULQDDVLXSA-N Leu-Met-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O LQUIENKUVKPNIC-ULQDDVLXSA-N 0.000 description 1
- NJMXCOOEFLMZSR-AVGNSLFASA-N Leu-Met-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O NJMXCOOEFLMZSR-AVGNSLFASA-N 0.000 description 1
- GCXGCIYIHXSKAY-ULQDDVLXSA-N Leu-Phe-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GCXGCIYIHXSKAY-ULQDDVLXSA-N 0.000 description 1
- BIZNDKMFQHDOIE-KKUMJFAQSA-N Leu-Phe-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(N)=O)C(O)=O)CC1=CC=CC=C1 BIZNDKMFQHDOIE-KKUMJFAQSA-N 0.000 description 1
- ZAVCJRJOQKIOJW-KKUMJFAQSA-N Leu-Phe-Asp Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=CC=C1 ZAVCJRJOQKIOJW-KKUMJFAQSA-N 0.000 description 1
- YESNGRDJQWDYLH-KKUMJFAQSA-N Leu-Phe-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)O)N YESNGRDJQWDYLH-KKUMJFAQSA-N 0.000 description 1
- SYRTUBLKWNDSDK-DKIMLUQUSA-N Leu-Phe-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYRTUBLKWNDSDK-DKIMLUQUSA-N 0.000 description 1
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 1
- PJWOOBTYQNNRBF-BZSNNMDCSA-N Leu-Phe-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)O)N PJWOOBTYQNNRBF-BZSNNMDCSA-N 0.000 description 1
- WXDRGWBQZIMJDE-ULQDDVLXSA-N Leu-Phe-Met Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O WXDRGWBQZIMJDE-ULQDDVLXSA-N 0.000 description 1
- UHNQRAFSEBGZFZ-YESZJQIVSA-N Leu-Phe-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N UHNQRAFSEBGZFZ-YESZJQIVSA-N 0.000 description 1
- YWKNKRAKOCLOLH-OEAJRASXSA-N Leu-Phe-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YWKNKRAKOCLOLH-OEAJRASXSA-N 0.000 description 1
- MAXILRZVORNXBE-PMVMPFDFSA-N Leu-Phe-Trp Chemical compound C([C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MAXILRZVORNXBE-PMVMPFDFSA-N 0.000 description 1
- MVVSHHJKJRZVNY-ACRUOGEOSA-N Leu-Phe-Tyr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MVVSHHJKJRZVNY-ACRUOGEOSA-N 0.000 description 1
- BMVFXOQHDQZAQU-DCAQKATOSA-N Leu-Pro-Asp Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N BMVFXOQHDQZAQU-DCAQKATOSA-N 0.000 description 1
- MUCIDQMDOYQYBR-IHRRRGAJSA-N Leu-Pro-His Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N MUCIDQMDOYQYBR-IHRRRGAJSA-N 0.000 description 1
- YRRCOJOXAJNSAX-IHRRRGAJSA-N Leu-Pro-Lys Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)O)N YRRCOJOXAJNSAX-IHRRRGAJSA-N 0.000 description 1
- XXXXOVFBXRERQL-ULQDDVLXSA-N Leu-Pro-Phe Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XXXXOVFBXRERQL-ULQDDVLXSA-N 0.000 description 1
- JLYUZRKPDKHUTC-WDSOQIARSA-N Leu-Pro-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O JLYUZRKPDKHUTC-WDSOQIARSA-N 0.000 description 1
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 1
- AKVBOOKXVAMKSS-GUBZILKMSA-N Leu-Ser-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O AKVBOOKXVAMKSS-GUBZILKMSA-N 0.000 description 1
- ADJWHHZETYAAAX-SRVKXCTJSA-N Leu-Ser-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ADJWHHZETYAAAX-SRVKXCTJSA-N 0.000 description 1
- SVBJIZVVYJYGLA-DCAQKATOSA-N Leu-Ser-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O SVBJIZVVYJYGLA-DCAQKATOSA-N 0.000 description 1
- ZDJQVSIPFLMNOX-RHYQMDGZSA-N Leu-Thr-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZDJQVSIPFLMNOX-RHYQMDGZSA-N 0.000 description 1
- LCNASHSOFMRYFO-WDCWCFNPSA-N Leu-Thr-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(N)=O LCNASHSOFMRYFO-WDCWCFNPSA-N 0.000 description 1
- LFSQWRSVPNKJGP-WDCWCFNPSA-N Leu-Thr-Glu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCC(O)=O LFSQWRSVPNKJGP-WDCWCFNPSA-N 0.000 description 1
- FGZVGOAAROXFAB-IXOXFDKPSA-N Leu-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(C)C)N)O FGZVGOAAROXFAB-IXOXFDKPSA-N 0.000 description 1
- QWWPYKKLXWOITQ-VOAKCMCISA-N Leu-Thr-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(C)C QWWPYKKLXWOITQ-VOAKCMCISA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- ILDSIMPXNFWKLH-KATARQTJSA-N Leu-Thr-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O ILDSIMPXNFWKLH-KATARQTJSA-N 0.000 description 1
- RNYLNYTYMXACRI-VFAJRCTISA-N Leu-Thr-Trp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O RNYLNYTYMXACRI-VFAJRCTISA-N 0.000 description 1
- CNWDWAMPKVYJJB-NUTKFTJISA-N Leu-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 CNWDWAMPKVYJJB-NUTKFTJISA-N 0.000 description 1
- WGAZVKFCPHXZLO-SZMVWBNQSA-N Leu-Trp-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N WGAZVKFCPHXZLO-SZMVWBNQSA-N 0.000 description 1
- URJUVJDTPXCQFL-IHPCNDPISA-N Leu-Trp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)N URJUVJDTPXCQFL-IHPCNDPISA-N 0.000 description 1
- FPFOYSCDUWTZBF-IHPCNDPISA-N Leu-Trp-Leu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]([NH3+])CC(C)C)C(=O)N[C@@H](CC(C)C)C([O-])=O)=CNC2=C1 FPFOYSCDUWTZBF-IHPCNDPISA-N 0.000 description 1
- SUYRAPCRSCCPAK-VFAJRCTISA-N Leu-Trp-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SUYRAPCRSCCPAK-VFAJRCTISA-N 0.000 description 1
- RIHIGSWBLHSGLV-CQDKDKBSSA-N Leu-Tyr-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O RIHIGSWBLHSGLV-CQDKDKBSSA-N 0.000 description 1
- UCRJTSIIAYHOHE-ULQDDVLXSA-N Leu-Tyr-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UCRJTSIIAYHOHE-ULQDDVLXSA-N 0.000 description 1
- ISSAURVGLGAPDK-KKUMJFAQSA-N Leu-Tyr-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O ISSAURVGLGAPDK-KKUMJFAQSA-N 0.000 description 1
- VHTIZYYHIUHMCA-JYJNAYRXSA-N Leu-Tyr-Gln Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VHTIZYYHIUHMCA-JYJNAYRXSA-N 0.000 description 1
- WFCKERTZVCQXKH-KBPBESRZSA-N Leu-Tyr-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O WFCKERTZVCQXKH-KBPBESRZSA-N 0.000 description 1
- ARNIBBOXIAWUOP-MGHWNKPDSA-N Leu-Tyr-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ARNIBBOXIAWUOP-MGHWNKPDSA-N 0.000 description 1
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 1
- TUIOUEWKFFVNLH-DCAQKATOSA-N Leu-Val-Cys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CS)C(O)=O TUIOUEWKFFVNLH-DCAQKATOSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- VKVDRTGWLVZJOM-DCAQKATOSA-N Leu-Val-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O VKVDRTGWLVZJOM-DCAQKATOSA-N 0.000 description 1
- MSFITIBEMPWCBD-ULQDDVLXSA-N Leu-Val-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 MSFITIBEMPWCBD-ULQDDVLXSA-N 0.000 description 1
- FZIJIFCXUCZHOL-CIUDSAMLSA-N Lys-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCCN FZIJIFCXUCZHOL-CIUDSAMLSA-N 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- JCFYLFOCALSNLQ-GUBZILKMSA-N Lys-Ala-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JCFYLFOCALSNLQ-GUBZILKMSA-N 0.000 description 1
- WSXTWLJHTLRFLW-SRVKXCTJSA-N Lys-Ala-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(O)=O WSXTWLJHTLRFLW-SRVKXCTJSA-N 0.000 description 1
- IXHKPDJKKCUKHS-GARJFASQSA-N Lys-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IXHKPDJKKCUKHS-GARJFASQSA-N 0.000 description 1
- YRWCPXOFBKTCFY-NUTKFTJISA-N Lys-Ala-Trp Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N YRWCPXOFBKTCFY-NUTKFTJISA-N 0.000 description 1
- CLBGMWIYPYAZPR-AVGNSLFASA-N Lys-Arg-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O CLBGMWIYPYAZPR-AVGNSLFASA-N 0.000 description 1
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 1
- GQUDMNDPQTXZRV-DCAQKATOSA-N Lys-Arg-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GQUDMNDPQTXZRV-DCAQKATOSA-N 0.000 description 1
- JGAMUXDWYSXYLM-SRVKXCTJSA-N Lys-Arg-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGAMUXDWYSXYLM-SRVKXCTJSA-N 0.000 description 1
- BRSGXFITDXFMFF-IHRRRGAJSA-N Lys-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N BRSGXFITDXFMFF-IHRRRGAJSA-N 0.000 description 1
- VHNOAIFVYUQOOY-XUXIUFHCSA-N Lys-Arg-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VHNOAIFVYUQOOY-XUXIUFHCSA-N 0.000 description 1
- SJNZALDHDUYDBU-IHRRRGAJSA-N Lys-Arg-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(O)=O SJNZALDHDUYDBU-IHRRRGAJSA-N 0.000 description 1
- NQCJGQHHYZNUDK-DCAQKATOSA-N Lys-Arg-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CCCN=C(N)N NQCJGQHHYZNUDK-DCAQKATOSA-N 0.000 description 1
- SWWCDAGDQHTKIE-RHYQMDGZSA-N Lys-Arg-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWWCDAGDQHTKIE-RHYQMDGZSA-N 0.000 description 1
- GGAPIOORBXHMNY-ULQDDVLXSA-N Lys-Arg-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCCN)N)O GGAPIOORBXHMNY-ULQDDVLXSA-N 0.000 description 1
- WLCYCADOWRMSAJ-CIUDSAMLSA-N Lys-Asn-Cys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CS)C(O)=O WLCYCADOWRMSAJ-CIUDSAMLSA-N 0.000 description 1
- QYOXSYXPHUHOJR-GUBZILKMSA-N Lys-Asn-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYOXSYXPHUHOJR-GUBZILKMSA-N 0.000 description 1
- HQVDJTYKCMIWJP-YUMQZZPRSA-N Lys-Asn-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O HQVDJTYKCMIWJP-YUMQZZPRSA-N 0.000 description 1
- FACUGMGEFUEBTI-SRVKXCTJSA-N Lys-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCCCN FACUGMGEFUEBTI-SRVKXCTJSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- HGZHSNBZDOLMLH-DCAQKATOSA-N Lys-Asn-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N HGZHSNBZDOLMLH-DCAQKATOSA-N 0.000 description 1
- DGWXCIORNLWGGG-CIUDSAMLSA-N Lys-Asn-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O DGWXCIORNLWGGG-CIUDSAMLSA-N 0.000 description 1
- PXHCFKXNSBJSTQ-KKUMJFAQSA-N Lys-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N)O PXHCFKXNSBJSTQ-KKUMJFAQSA-N 0.000 description 1
- LZWNAOIMTLNMDW-NHCYSSNCSA-N Lys-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCCN)N LZWNAOIMTLNMDW-NHCYSSNCSA-N 0.000 description 1
- HKCCVDWHHTVVPN-CIUDSAMLSA-N Lys-Asp-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O HKCCVDWHHTVVPN-CIUDSAMLSA-N 0.000 description 1
- QUYCUALODHJQLK-CIUDSAMLSA-N Lys-Asp-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUYCUALODHJQLK-CIUDSAMLSA-N 0.000 description 1
- SQXUUGUCGJSWCK-CIUDSAMLSA-N Lys-Asp-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N SQXUUGUCGJSWCK-CIUDSAMLSA-N 0.000 description 1
- OVIVOCSURJYCTM-GUBZILKMSA-N Lys-Asp-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(O)=O OVIVOCSURJYCTM-GUBZILKMSA-N 0.000 description 1
- IBQMEXQYZMVIFU-SRVKXCTJSA-N Lys-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CCCCN)N IBQMEXQYZMVIFU-SRVKXCTJSA-N 0.000 description 1
- WGCKDDHUFPQSMZ-ZPFDUUQYSA-N Lys-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCCCN WGCKDDHUFPQSMZ-ZPFDUUQYSA-N 0.000 description 1
- NRQRKMYZONPCTM-CIUDSAMLSA-N Lys-Asp-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O NRQRKMYZONPCTM-CIUDSAMLSA-N 0.000 description 1
- NTBFKPBULZGXQL-KKUMJFAQSA-N Lys-Asp-Tyr Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 NTBFKPBULZGXQL-KKUMJFAQSA-N 0.000 description 1
- RDIILCRAWOSDOQ-CIUDSAMLSA-N Lys-Cys-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N RDIILCRAWOSDOQ-CIUDSAMLSA-N 0.000 description 1
- RLZDUFRBMQNYIJ-YUMQZZPRSA-N Lys-Cys-Gly Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N RLZDUFRBMQNYIJ-YUMQZZPRSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- GUYHHBZCBQZLFW-GUBZILKMSA-N Lys-Gln-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N GUYHHBZCBQZLFW-GUBZILKMSA-N 0.000 description 1
- MRWXLRGAFDOILG-DCAQKATOSA-N Lys-Gln-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MRWXLRGAFDOILG-DCAQKATOSA-N 0.000 description 1
- HWMZUBUEOYAQSC-DCAQKATOSA-N Lys-Gln-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O HWMZUBUEOYAQSC-DCAQKATOSA-N 0.000 description 1
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 1
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 1
- QQUJSUFWEDZQQY-AVGNSLFASA-N Lys-Gln-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN QQUJSUFWEDZQQY-AVGNSLFASA-N 0.000 description 1
- NDORZBUHCOJQDO-GVXVVHGQSA-N Lys-Gln-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O NDORZBUHCOJQDO-GVXVVHGQSA-N 0.000 description 1
- DRCILAJNUJKAHC-SRVKXCTJSA-N Lys-Glu-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O DRCILAJNUJKAHC-SRVKXCTJSA-N 0.000 description 1
- GRADYHMSAUIKPS-DCAQKATOSA-N Lys-Glu-Gln Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O GRADYHMSAUIKPS-DCAQKATOSA-N 0.000 description 1
- GCMWRRQAKQXDED-IUCAKERBSA-N Lys-Glu-Gly Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)N[C@@H](CCC([O-])=O)C(=O)NCC([O-])=O GCMWRRQAKQXDED-IUCAKERBSA-N 0.000 description 1
- DUTMKEAPLLUGNO-JYJNAYRXSA-N Lys-Glu-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DUTMKEAPLLUGNO-JYJNAYRXSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- WGLAORUKDGRINI-WDCWCFNPSA-N Lys-Glu-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WGLAORUKDGRINI-WDCWCFNPSA-N 0.000 description 1
- GPJGFSFYBJGYRX-YUMQZZPRSA-N Lys-Gly-Asp Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O GPJGFSFYBJGYRX-YUMQZZPRSA-N 0.000 description 1
- XNKDCYABMBBEKN-IUCAKERBSA-N Lys-Gly-Gln Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O XNKDCYABMBBEKN-IUCAKERBSA-N 0.000 description 1
- LCMWVZLBCUVDAZ-IUCAKERBSA-N Lys-Gly-Glu Chemical compound [NH3+]CCCC[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CCC([O-])=O LCMWVZLBCUVDAZ-IUCAKERBSA-N 0.000 description 1
- ISHNZELVUVPCHY-ZETCQYMHSA-N Lys-Gly-Gly Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)NCC(O)=O ISHNZELVUVPCHY-ZETCQYMHSA-N 0.000 description 1
- UETQMSASAVBGJY-QWRGUYRKSA-N Lys-Gly-His Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CNC=N1 UETQMSASAVBGJY-QWRGUYRKSA-N 0.000 description 1
- FHIAJWBDZVHLAH-YUMQZZPRSA-N Lys-Gly-Ser Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FHIAJWBDZVHLAH-YUMQZZPRSA-N 0.000 description 1
- NNKLKUUGESXCBS-KBPBESRZSA-N Lys-Gly-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NNKLKUUGESXCBS-KBPBESRZSA-N 0.000 description 1
- OWRUUFUVXFREBD-KKUMJFAQSA-N Lys-His-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(O)=O OWRUUFUVXFREBD-KKUMJFAQSA-N 0.000 description 1
- MXMDJEJWERYPMO-XUXIUFHCSA-N Lys-Ile-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MXMDJEJWERYPMO-XUXIUFHCSA-N 0.000 description 1
- QBEPTBMRQALPEV-MNXVOIDGSA-N Lys-Ile-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN QBEPTBMRQALPEV-MNXVOIDGSA-N 0.000 description 1
- KEPWSUPUFAPBRF-DKIMLUQUSA-N Lys-Ile-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KEPWSUPUFAPBRF-DKIMLUQUSA-N 0.000 description 1
- IZJGPPIGYTVXLB-FQUUOJAGSA-N Lys-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N IZJGPPIGYTVXLB-FQUUOJAGSA-N 0.000 description 1
- PRSBSVAVOQOAMI-BJDJZHNGSA-N Lys-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCCCN PRSBSVAVOQOAMI-BJDJZHNGSA-N 0.000 description 1
- NCZIQZYZPUPMKY-PPCPHDFISA-N Lys-Ile-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NCZIQZYZPUPMKY-PPCPHDFISA-N 0.000 description 1
- MYZMQWHPDAYKIE-SRVKXCTJSA-N Lys-Leu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O MYZMQWHPDAYKIE-SRVKXCTJSA-N 0.000 description 1
- MUXNCRWTWBMNHX-SRVKXCTJSA-N Lys-Leu-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O MUXNCRWTWBMNHX-SRVKXCTJSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- VMTYLUGCXIEDMV-QWRGUYRKSA-N Lys-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN VMTYLUGCXIEDMV-QWRGUYRKSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- YPLVCBKEPJPBDQ-MELADBBJSA-N Lys-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N YPLVCBKEPJPBDQ-MELADBBJSA-N 0.000 description 1
- XOQMURBBIXRRCR-SRVKXCTJSA-N Lys-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN XOQMURBBIXRRCR-SRVKXCTJSA-N 0.000 description 1
- ZJWIXBZTAAJERF-IHRRRGAJSA-N Lys-Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N ZJWIXBZTAAJERF-IHRRRGAJSA-N 0.000 description 1
- RIJCHEVHFWMDKD-SRVKXCTJSA-N Lys-Lys-Asn Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RIJCHEVHFWMDKD-SRVKXCTJSA-N 0.000 description 1
- AHFOKDZWPPGJAZ-SRVKXCTJSA-N Lys-Lys-Cys Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O)N AHFOKDZWPPGJAZ-SRVKXCTJSA-N 0.000 description 1
- UQRZFMQQXXJTTF-AVGNSLFASA-N Lys-Lys-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O UQRZFMQQXXJTTF-AVGNSLFASA-N 0.000 description 1
- YUAXTFMFMOIMAM-QWRGUYRKSA-N Lys-Lys-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O YUAXTFMFMOIMAM-QWRGUYRKSA-N 0.000 description 1
- GAHJXEMYXKLZRQ-AJNGGQMLSA-N Lys-Lys-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O GAHJXEMYXKLZRQ-AJNGGQMLSA-N 0.000 description 1
- KJIXWRWPOCKYLD-IHRRRGAJSA-N Lys-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCCCN)N KJIXWRWPOCKYLD-IHRRRGAJSA-N 0.000 description 1
- PLDJDCJLRCYPJB-VOAKCMCISA-N Lys-Lys-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PLDJDCJLRCYPJB-VOAKCMCISA-N 0.000 description 1
- QQPSCXKFDSORFT-IHRRRGAJSA-N Lys-Lys-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCCN QQPSCXKFDSORFT-IHRRRGAJSA-N 0.000 description 1
- WKUXWMWQTOYTFI-SRVKXCTJSA-N Lys-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N WKUXWMWQTOYTFI-SRVKXCTJSA-N 0.000 description 1
- VSTNAUBHKQPVJX-IHRRRGAJSA-N Lys-Met-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O VSTNAUBHKQPVJX-IHRRRGAJSA-N 0.000 description 1
- MTBLFIQZECOEBY-IHRRRGAJSA-N Lys-Met-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(O)=O MTBLFIQZECOEBY-IHRRRGAJSA-N 0.000 description 1
- INMBONMDMGPADT-AVGNSLFASA-N Lys-Met-Met Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCCN)N INMBONMDMGPADT-AVGNSLFASA-N 0.000 description 1
- ALEVUGKHINJNIF-QEJZJMRPSA-N Lys-Phe-Ala Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 ALEVUGKHINJNIF-QEJZJMRPSA-N 0.000 description 1
- XFOAWKDQMRMCDN-ULQDDVLXSA-N Lys-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CCCCN)CC1=CC=CC=C1 XFOAWKDQMRMCDN-ULQDDVLXSA-N 0.000 description 1
- MSSJJDVQTFTLIF-KBPBESRZSA-N Lys-Phe-Gly Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](Cc1ccccc1)C(=O)NCC(O)=O MSSJJDVQTFTLIF-KBPBESRZSA-N 0.000 description 1
- LNMKRJJLEFASGA-BZSNNMDCSA-N Lys-Phe-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O LNMKRJJLEFASGA-BZSNNMDCSA-N 0.000 description 1
- CENKQZWVYMLRAX-ULQDDVLXSA-N Lys-Phe-Met Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(O)=O CENKQZWVYMLRAX-ULQDDVLXSA-N 0.000 description 1
- UDXSLGLHFUBRRM-OEAJRASXSA-N Lys-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CCCCN)N)O UDXSLGLHFUBRRM-OEAJRASXSA-N 0.000 description 1
- OBZHNHBAAVEWKI-DCAQKATOSA-N Lys-Pro-Asn Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O OBZHNHBAAVEWKI-DCAQKATOSA-N 0.000 description 1
- PDIDTSZKKFEDMB-UWVGGRQHSA-N Lys-Pro-Gly Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O PDIDTSZKKFEDMB-UWVGGRQHSA-N 0.000 description 1
- QBHGXFQJFPWJIH-XUXIUFHCSA-N Lys-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN QBHGXFQJFPWJIH-XUXIUFHCSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- LECIJRIRMVOFMH-ULQDDVLXSA-N Lys-Pro-Phe Chemical compound NCCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 LECIJRIRMVOFMH-ULQDDVLXSA-N 0.000 description 1
- LOGFVTREOLYCPF-RHYQMDGZSA-N Lys-Pro-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCCN LOGFVTREOLYCPF-RHYQMDGZSA-N 0.000 description 1
- JMNRXRPBHFGXQX-GUBZILKMSA-N Lys-Ser-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O JMNRXRPBHFGXQX-GUBZILKMSA-N 0.000 description 1
- JOSAKOKSPXROGQ-BJDJZHNGSA-N Lys-Ser-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JOSAKOKSPXROGQ-BJDJZHNGSA-N 0.000 description 1
- MIFFFXHMAHFACR-KATARQTJSA-N Lys-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CCCCN MIFFFXHMAHFACR-KATARQTJSA-N 0.000 description 1
- PLOUVAYOMTYJRG-JXUBOQSCSA-N Lys-Thr-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PLOUVAYOMTYJRG-JXUBOQSCSA-N 0.000 description 1
- QVTDVTONTRSQMF-WDCWCFNPSA-N Lys-Thr-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CCCCN QVTDVTONTRSQMF-WDCWCFNPSA-N 0.000 description 1
- YKBSXQFZWFXFIB-VOAKCMCISA-N Lys-Thr-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CCCCN)C(O)=O YKBSXQFZWFXFIB-VOAKCMCISA-N 0.000 description 1
- YCJCEMKOZOYBEF-OEAJRASXSA-N Lys-Thr-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YCJCEMKOZOYBEF-OEAJRASXSA-N 0.000 description 1
- YFQSSOAGMZGXFT-MEYUZBJRSA-N Lys-Thr-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O YFQSSOAGMZGXFT-MEYUZBJRSA-N 0.000 description 1
- VHTOGMKQXXJOHG-RHYQMDGZSA-N Lys-Thr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O VHTOGMKQXXJOHG-RHYQMDGZSA-N 0.000 description 1
- XGZDDOKIHSYHTO-SZMVWBNQSA-N Lys-Trp-Glu Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 XGZDDOKIHSYHTO-SZMVWBNQSA-N 0.000 description 1
- ZJSXCIMWLPSTMG-HSCHXYMDSA-N Lys-Trp-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ZJSXCIMWLPSTMG-HSCHXYMDSA-N 0.000 description 1
- NROQVSYLPRLJIP-PMVMPFDFSA-N Lys-Trp-Tyr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O NROQVSYLPRLJIP-PMVMPFDFSA-N 0.000 description 1
- RMKJOQSYLQQRFN-KKUMJFAQSA-N Lys-Tyr-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O RMKJOQSYLQQRFN-KKUMJFAQSA-N 0.000 description 1
- XYLSGAWRCZECIQ-JYJNAYRXSA-N Lys-Tyr-Glu Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CCC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 XYLSGAWRCZECIQ-JYJNAYRXSA-N 0.000 description 1
- IMDJSVBFQKDDEQ-MGHWNKPDSA-N Lys-Tyr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCCCN)N IMDJSVBFQKDDEQ-MGHWNKPDSA-N 0.000 description 1
- MIMXMVDLMDMOJD-BZSNNMDCSA-N Lys-Tyr-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O MIMXMVDLMDMOJD-BZSNNMDCSA-N 0.000 description 1
- SQRLLZAQNOQCEG-KKUMJFAQSA-N Lys-Tyr-Ser Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=C(O)C=C1 SQRLLZAQNOQCEG-KKUMJFAQSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- XABXVVSWUVCZST-GVXVVHGQSA-N Lys-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN XABXVVSWUVCZST-GVXVVHGQSA-N 0.000 description 1
- VKCPHIOZDWUFSW-ONGXEEELSA-N Lys-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN VKCPHIOZDWUFSW-ONGXEEELSA-N 0.000 description 1
- NYTDJEZBAAFLLG-IHRRRGAJSA-N Lys-Val-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(O)=O NYTDJEZBAAFLLG-IHRRRGAJSA-N 0.000 description 1
- GILLQRYAWOMHED-DCAQKATOSA-N Lys-Val-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN GILLQRYAWOMHED-DCAQKATOSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- XBAJINCXDBTJRH-WDSOQIARSA-N Lys-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCCN)N XBAJINCXDBTJRH-WDSOQIARSA-N 0.000 description 1
- IKXQOBUBZSOWDY-AVGNSLFASA-N Lys-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CCCCN)N IKXQOBUBZSOWDY-AVGNSLFASA-N 0.000 description 1
- 102000018697 Membrane Proteins Human genes 0.000 description 1
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 1
- QRHWTCJBCLGYRB-FXQIFTODSA-N Met-Ala-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(O)=O QRHWTCJBCLGYRB-FXQIFTODSA-N 0.000 description 1
- KUQWVNFMZLHAPA-CIUDSAMLSA-N Met-Ala-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O KUQWVNFMZLHAPA-CIUDSAMLSA-N 0.000 description 1
- QAHFGYLFLVGBNW-DCAQKATOSA-N Met-Ala-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN QAHFGYLFLVGBNW-DCAQKATOSA-N 0.000 description 1
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 1
- WYEXWKAWMNJKPN-UBHSHLNASA-N Met-Ala-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CCSC)N WYEXWKAWMNJKPN-UBHSHLNASA-N 0.000 description 1
- WXHHTBVYQOSYSL-FXQIFTODSA-N Met-Ala-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O WXHHTBVYQOSYSL-FXQIFTODSA-N 0.000 description 1
- IIPHCNKHEZYSNE-DCAQKATOSA-N Met-Arg-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O IIPHCNKHEZYSNE-DCAQKATOSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- OBVHKUFUDCPZDW-JYJNAYRXSA-N Met-Arg-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OBVHKUFUDCPZDW-JYJNAYRXSA-N 0.000 description 1
- SBSIKVMCCJUCBZ-GUBZILKMSA-N Met-Asn-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N SBSIKVMCCJUCBZ-GUBZILKMSA-N 0.000 description 1
- ACYHZNZHIZWLQF-BQBZGAKWSA-N Met-Asn-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O ACYHZNZHIZWLQF-BQBZGAKWSA-N 0.000 description 1
- CAODKDAPYGUMLK-FXQIFTODSA-N Met-Asn-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CAODKDAPYGUMLK-FXQIFTODSA-N 0.000 description 1
- HDNOQCZWJGGHSS-VEVYYDQMSA-N Met-Asn-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HDNOQCZWJGGHSS-VEVYYDQMSA-N 0.000 description 1
- UZVWDRPUTHXQAM-FXQIFTODSA-N Met-Asp-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O UZVWDRPUTHXQAM-FXQIFTODSA-N 0.000 description 1
- SQUTUWHAAWJYES-GUBZILKMSA-N Met-Asp-Arg Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O SQUTUWHAAWJYES-GUBZILKMSA-N 0.000 description 1
- ZMYHJISLFYTQGK-FXQIFTODSA-N Met-Asp-Asn Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ZMYHJISLFYTQGK-FXQIFTODSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- WGBMNLCRYKSWAR-DCAQKATOSA-N Met-Asp-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN WGBMNLCRYKSWAR-DCAQKATOSA-N 0.000 description 1
- TZLYIHDABYBOCJ-FXQIFTODSA-N Met-Asp-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O TZLYIHDABYBOCJ-FXQIFTODSA-N 0.000 description 1
- AWOMRHGUWFBDNU-ZPFDUUQYSA-N Met-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N AWOMRHGUWFBDNU-ZPFDUUQYSA-N 0.000 description 1
- HHCOOFPGNXKFGR-HJGDQZAQSA-N Met-Gln-Thr Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O HHCOOFPGNXKFGR-HJGDQZAQSA-N 0.000 description 1
- PHWSCIFNNLLUFJ-NHCYSSNCSA-N Met-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCSC)N PHWSCIFNNLLUFJ-NHCYSSNCSA-N 0.000 description 1
- UYAKZHGIPRCGPF-CIUDSAMLSA-N Met-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCSC)N UYAKZHGIPRCGPF-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- SJDQOYTYNGZZJX-SRVKXCTJSA-N Met-Glu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SJDQOYTYNGZZJX-SRVKXCTJSA-N 0.000 description 1
- OGAZPKJHHZPYFK-GARJFASQSA-N Met-Glu-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N OGAZPKJHHZPYFK-GARJFASQSA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- STTRPDDKDVKIDF-KKUMJFAQSA-N Met-Glu-Tyr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 STTRPDDKDVKIDF-KKUMJFAQSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- DGNZGCQSVGGYJS-BQBZGAKWSA-N Met-Gly-Asp Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O DGNZGCQSVGGYJS-BQBZGAKWSA-N 0.000 description 1
- SLQDSYZHHOKQSR-QXEWZRGKSA-N Met-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CCSC SLQDSYZHHOKQSR-QXEWZRGKSA-N 0.000 description 1
- MYAPQOBHGWJZOM-UWVGGRQHSA-N Met-Gly-Leu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C MYAPQOBHGWJZOM-UWVGGRQHSA-N 0.000 description 1
- LRALLISKBZNSKN-BQBZGAKWSA-N Met-Gly-Ser Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LRALLISKBZNSKN-BQBZGAKWSA-N 0.000 description 1
- WXJXYMFUTRXRGO-UWVGGRQHSA-N Met-His-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CNC=N1 WXJXYMFUTRXRGO-UWVGGRQHSA-N 0.000 description 1
- MVMNUCOHQGYYKB-PEDHHIEDSA-N Met-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H](CCSC)N MVMNUCOHQGYYKB-PEDHHIEDSA-N 0.000 description 1
- QZPXMHVKPHJNTR-DCAQKATOSA-N Met-Leu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O QZPXMHVKPHJNTR-DCAQKATOSA-N 0.000 description 1
- UROWNMBTQGGTHB-DCAQKATOSA-N Met-Leu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O UROWNMBTQGGTHB-DCAQKATOSA-N 0.000 description 1
- ZIIMORLEZLVRIP-SRVKXCTJSA-N Met-Leu-Gln Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZIIMORLEZLVRIP-SRVKXCTJSA-N 0.000 description 1
- SODXFJOPSCXOHE-IHRRRGAJSA-N Met-Leu-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O SODXFJOPSCXOHE-IHRRRGAJSA-N 0.000 description 1
- AWGBEIYZPAXXSX-RWMBFGLXSA-N Met-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N AWGBEIYZPAXXSX-RWMBFGLXSA-N 0.000 description 1
- OCRSGGIJBDUXHU-WDSOQIARSA-N Met-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCSC)C(O)=O)=CNC2=C1 OCRSGGIJBDUXHU-WDSOQIARSA-N 0.000 description 1
- YYEIFXZOBZVDPH-DCAQKATOSA-N Met-Lys-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O YYEIFXZOBZVDPH-DCAQKATOSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- CGUYGMFQZCYJSG-DCAQKATOSA-N Met-Lys-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O CGUYGMFQZCYJSG-DCAQKATOSA-N 0.000 description 1
- LCPUWQLULVXROY-RHYQMDGZSA-N Met-Lys-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LCPUWQLULVXROY-RHYQMDGZSA-N 0.000 description 1
- DJJBHQHOZLUBCN-WDSOQIARSA-N Met-Lys-Trp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DJJBHQHOZLUBCN-WDSOQIARSA-N 0.000 description 1
- QTMIXEQWGNIPBL-JYJNAYRXSA-N Met-Met-Tyr Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N QTMIXEQWGNIPBL-JYJNAYRXSA-N 0.000 description 1
- XGIQKEAKUSPCBU-SRVKXCTJSA-N Met-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCSC)N XGIQKEAKUSPCBU-SRVKXCTJSA-N 0.000 description 1
- KRLKICLNEICJGV-STQMWFEESA-N Met-Phe-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 KRLKICLNEICJGV-STQMWFEESA-N 0.000 description 1
- JQHYVIKEFYETEW-IHRRRGAJSA-N Met-Phe-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CO)C(O)=O)CC1=CC=CC=C1 JQHYVIKEFYETEW-IHRRRGAJSA-N 0.000 description 1
- MPCKIRSXNKACRF-GUBZILKMSA-N Met-Pro-Asn Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O MPCKIRSXNKACRF-GUBZILKMSA-N 0.000 description 1
- CAEZLMGDJMEBKP-AVGNSLFASA-N Met-Pro-His Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CNC=N1 CAEZLMGDJMEBKP-AVGNSLFASA-N 0.000 description 1
- MQASRXPTQJJNFM-JYJNAYRXSA-N Met-Pro-Phe Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MQASRXPTQJJNFM-JYJNAYRXSA-N 0.000 description 1
- LUYURUYVNYGKGM-RCWTZXSCSA-N Met-Pro-Thr Chemical compound CSCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUYURUYVNYGKGM-RCWTZXSCSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- SMVTWPOATVIXTN-NAKRPEOUSA-N Met-Ser-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SMVTWPOATVIXTN-NAKRPEOUSA-N 0.000 description 1
- HLZORBMOISUNIV-DCAQKATOSA-N Met-Ser-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(C)C HLZORBMOISUNIV-DCAQKATOSA-N 0.000 description 1
- DSZFTPCSFVWMKP-DCAQKATOSA-N Met-Ser-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN DSZFTPCSFVWMKP-DCAQKATOSA-N 0.000 description 1
- FIZZULTXMVEIAA-IHRRRGAJSA-N Met-Ser-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FIZZULTXMVEIAA-IHRRRGAJSA-N 0.000 description 1
- MIXPUVSPPOWTCR-FXQIFTODSA-N Met-Ser-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MIXPUVSPPOWTCR-FXQIFTODSA-N 0.000 description 1
- CIIJWIAORKTXAH-FJXKBIBVSA-N Met-Thr-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O CIIJWIAORKTXAH-FJXKBIBVSA-N 0.000 description 1
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 1
- YIGCDRZMZNDENK-UNQGMJICSA-N Met-Thr-Phe Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O YIGCDRZMZNDENK-UNQGMJICSA-N 0.000 description 1
- NDJSSFWDYDUQID-YTWAJWBKSA-N Met-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N)O NDJSSFWDYDUQID-YTWAJWBKSA-N 0.000 description 1
- ZBLSZPYQQRIHQU-RCWTZXSCSA-N Met-Thr-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O ZBLSZPYQQRIHQU-RCWTZXSCSA-N 0.000 description 1
- SQPZCTBSLIIMBL-BPUTZDHNSA-N Met-Trp-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CO)C(=O)O)N SQPZCTBSLIIMBL-BPUTZDHNSA-N 0.000 description 1
- OOLVTRHJJBCJKB-IHRRRGAJSA-N Met-Tyr-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N OOLVTRHJJBCJKB-IHRRRGAJSA-N 0.000 description 1
- NBEFNGUZUOUGFG-KKUMJFAQSA-N Met-Tyr-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N NBEFNGUZUOUGFG-KKUMJFAQSA-N 0.000 description 1
- TWEWRDAAIYBJTO-ULQDDVLXSA-N Met-Tyr-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N TWEWRDAAIYBJTO-ULQDDVLXSA-N 0.000 description 1
- JHVNNUIQXOGAHI-KJEVXHAQSA-N Met-Tyr-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CCSC)N)O JHVNNUIQXOGAHI-KJEVXHAQSA-N 0.000 description 1
- ALTHVGNGGZZSAC-SRVKXCTJSA-N Met-Val-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCNC(N)=N ALTHVGNGGZZSAC-SRVKXCTJSA-N 0.000 description 1
- 102000016943 Muramidase Human genes 0.000 description 1
- 108010014251 Muramidase Proteins 0.000 description 1
- 101000604464 Mus musculus Netrin-G1 Proteins 0.000 description 1
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 1
- PESQCPHRXOFIPX-UHFFFAOYSA-N N-L-methionyl-L-tyrosine Natural products CSCCC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 PESQCPHRXOFIPX-UHFFFAOYSA-N 0.000 description 1
- VIHYIVKEECZGOU-UHFFFAOYSA-N N-acetylimidazole Chemical compound CC(=O)N1C=CN=C1 VIHYIVKEECZGOU-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010065395 Neuropep-1 Proteins 0.000 description 1
- 101100068676 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) gln-1 gene Proteins 0.000 description 1
- 101100342977 Neurospora crassa (strain ATCC 24698 / 74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) leu-1 gene Proteins 0.000 description 1
- 108700026244 Open Reading Frames Proteins 0.000 description 1
- 101150012394 PHO5 gene Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- DFEVBOYEUQJGER-JURCDPSOSA-N Phe-Ala-Ile Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)O DFEVBOYEUQJGER-JURCDPSOSA-N 0.000 description 1
- BKWJQWJPZMUWEG-LFSVMHDDSA-N Phe-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BKWJQWJPZMUWEG-LFSVMHDDSA-N 0.000 description 1
- LZDIENNKWVXJMX-JYJNAYRXSA-N Phe-Arg-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC1=CC=CC=C1 LZDIENNKWVXJMX-JYJNAYRXSA-N 0.000 description 1
- AYPMIIKUMNADSU-IHRRRGAJSA-N Phe-Arg-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O AYPMIIKUMNADSU-IHRRRGAJSA-N 0.000 description 1
- YMORXCKTSSGYIG-IHRRRGAJSA-N Phe-Arg-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N YMORXCKTSSGYIG-IHRRRGAJSA-N 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- AGYXCMYVTBYGCT-ULQDDVLXSA-N Phe-Arg-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O AGYXCMYVTBYGCT-ULQDDVLXSA-N 0.000 description 1
- GNUCSNWOCQFMMC-UFYCRDLUSA-N Phe-Arg-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 GNUCSNWOCQFMMC-UFYCRDLUSA-N 0.000 description 1
- LNIIRLODKOWQIY-IHRRRGAJSA-N Phe-Asn-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O LNIIRLODKOWQIY-IHRRRGAJSA-N 0.000 description 1
- AWAYOWOUGVZXOB-BZSNNMDCSA-N Phe-Asn-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 AWAYOWOUGVZXOB-BZSNNMDCSA-N 0.000 description 1
- CDNPIRSCAFMMBE-SRVKXCTJSA-N Phe-Asn-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O CDNPIRSCAFMMBE-SRVKXCTJSA-N 0.000 description 1
- HTKNPQZCMLBOTQ-XVSYOHENSA-N Phe-Asn-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N)O HTKNPQZCMLBOTQ-XVSYOHENSA-N 0.000 description 1
- SWZKMTDPQXLQRD-XVSYOHENSA-N Phe-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWZKMTDPQXLQRD-XVSYOHENSA-N 0.000 description 1
- OMHMIXFFRPMYHB-SRVKXCTJSA-N Phe-Cys-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)N)C(=O)O)N OMHMIXFFRPMYHB-SRVKXCTJSA-N 0.000 description 1
- ZBYHVSHBZYHQBW-SRVKXCTJSA-N Phe-Cys-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(=O)O)C(=O)O)N ZBYHVSHBZYHQBW-SRVKXCTJSA-N 0.000 description 1
- QEPZQAPZKIPVDV-KKUMJFAQSA-N Phe-Cys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N QEPZQAPZKIPVDV-KKUMJFAQSA-N 0.000 description 1
- VJEZWOSKRCLHRP-MELADBBJSA-N Phe-Cys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CS)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O VJEZWOSKRCLHRP-MELADBBJSA-N 0.000 description 1
- WFDAEEUZPZSMOG-SRVKXCTJSA-N Phe-Cys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O WFDAEEUZPZSMOG-SRVKXCTJSA-N 0.000 description 1
- ZFVWWUILVLLVFA-AVGNSLFASA-N Phe-Gln-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N ZFVWWUILVLLVFA-AVGNSLFASA-N 0.000 description 1
- LLGTYVHITPVGKR-RYUDHWBXSA-N Phe-Gln-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O LLGTYVHITPVGKR-RYUDHWBXSA-N 0.000 description 1
- WYPVCIACUMJRIB-JYJNAYRXSA-N Phe-Gln-Lys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N WYPVCIACUMJRIB-JYJNAYRXSA-N 0.000 description 1
- MFQXSDWKUXTOPZ-DZKIICNBSA-N Phe-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=CC=C1)N MFQXSDWKUXTOPZ-DZKIICNBSA-N 0.000 description 1
- FMMIYCMOVGXZIP-AVGNSLFASA-N Phe-Glu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O FMMIYCMOVGXZIP-AVGNSLFASA-N 0.000 description 1
- UEADQPLTYBWWTG-AVGNSLFASA-N Phe-Glu-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 UEADQPLTYBWWTG-AVGNSLFASA-N 0.000 description 1
- KYYMILWEGJYPQZ-IHRRRGAJSA-N Phe-Glu-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 KYYMILWEGJYPQZ-IHRRRGAJSA-N 0.000 description 1
- AKJAKCBHLJGRBU-JYJNAYRXSA-N Phe-Glu-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N AKJAKCBHLJGRBU-JYJNAYRXSA-N 0.000 description 1
- MGECUMGTSHYHEJ-QEWYBTABSA-N Phe-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 MGECUMGTSHYHEJ-QEWYBTABSA-N 0.000 description 1
- RFEXGCASCQGGHZ-STQMWFEESA-N Phe-Gly-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O RFEXGCASCQGGHZ-STQMWFEESA-N 0.000 description 1
- MMYUOSCXBJFUNV-QWRGUYRKSA-N Phe-Gly-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N MMYUOSCXBJFUNV-QWRGUYRKSA-N 0.000 description 1
- YYKZDTVQHTUKDW-RYUDHWBXSA-N Phe-Gly-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)NCC(=O)N[C@@H](CCC(=O)N)C(=O)O)N YYKZDTVQHTUKDW-RYUDHWBXSA-N 0.000 description 1
- NAXPHWZXEXNDIW-JTQLQIEISA-N Phe-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 NAXPHWZXEXNDIW-JTQLQIEISA-N 0.000 description 1
- APJPXSFJBMMOLW-KBPBESRZSA-N Phe-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 APJPXSFJBMMOLW-KBPBESRZSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- RVRRHFPCEOVRKQ-KKUMJFAQSA-N Phe-His-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CC(=O)N)C(=O)O)N RVRRHFPCEOVRKQ-KKUMJFAQSA-N 0.000 description 1
- PPHFTNABKQRAJV-JYJNAYRXSA-N Phe-His-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PPHFTNABKQRAJV-JYJNAYRXSA-N 0.000 description 1
- VADLTGVIOIOKGM-BZSNNMDCSA-N Phe-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC=CC=1)C1=CN=CN1 VADLTGVIOIOKGM-BZSNNMDCSA-N 0.000 description 1
- FINLZXKJWTYYLC-ACRUOGEOSA-N Phe-His-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1N=CNC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 FINLZXKJWTYYLC-ACRUOGEOSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- BVHFFNYBKRTSIU-MEYUZBJRSA-N Phe-His-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BVHFFNYBKRTSIU-MEYUZBJRSA-N 0.000 description 1
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 1
- MJQFZGOIVBDIMZ-WHOFXGATSA-N Phe-Ile-Gly Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(=O)O MJQFZGOIVBDIMZ-WHOFXGATSA-N 0.000 description 1
- DVOCGBNHAUHKHJ-DKIMLUQUSA-N Phe-Ile-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O DVOCGBNHAUHKHJ-DKIMLUQUSA-N 0.000 description 1
- BYAIIACBWBOJCU-URLPEUOOSA-N Phe-Ile-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BYAIIACBWBOJCU-URLPEUOOSA-N 0.000 description 1
- JQLQUPIYYJXZLJ-ZEWNOJEFSA-N Phe-Ile-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 JQLQUPIYYJXZLJ-ZEWNOJEFSA-N 0.000 description 1
- RORUIHAWOLADSH-HJWJTTGWSA-N Phe-Ile-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 RORUIHAWOLADSH-HJWJTTGWSA-N 0.000 description 1
- XMQSOOJRRVEHRO-ULQDDVLXSA-N Phe-Leu-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMQSOOJRRVEHRO-ULQDDVLXSA-N 0.000 description 1
- KBVJZCVLQWCJQN-KKUMJFAQSA-N Phe-Leu-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KBVJZCVLQWCJQN-KKUMJFAQSA-N 0.000 description 1
- YKUGPVXSDOOANW-KKUMJFAQSA-N Phe-Leu-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YKUGPVXSDOOANW-KKUMJFAQSA-N 0.000 description 1
- TXKWKTWYTIAZSV-KKUMJFAQSA-N Phe-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N TXKWKTWYTIAZSV-KKUMJFAQSA-N 0.000 description 1
- YTILBRIUASDGBL-BZSNNMDCSA-N Phe-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 YTILBRIUASDGBL-BZSNNMDCSA-N 0.000 description 1
- MSHZERMPZKCODG-ACRUOGEOSA-N Phe-Leu-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 MSHZERMPZKCODG-ACRUOGEOSA-N 0.000 description 1
- INHMISZWLJZQGH-ULQDDVLXSA-N Phe-Leu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 INHMISZWLJZQGH-ULQDDVLXSA-N 0.000 description 1
- DNAXXTQSTKOHFO-QEJZJMRPSA-N Phe-Lys-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CC1=CC=CC=C1 DNAXXTQSTKOHFO-QEJZJMRPSA-N 0.000 description 1
- RMKGXGPQIPLTFC-KKUMJFAQSA-N Phe-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O RMKGXGPQIPLTFC-KKUMJFAQSA-N 0.000 description 1
- CJAHQEZWDZNSJO-KKUMJFAQSA-N Phe-Lys-Cys Chemical compound NCCCC[C@@H](C(=O)N[C@@H](CS)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 CJAHQEZWDZNSJO-KKUMJFAQSA-N 0.000 description 1
- DOXQMJCSSYZSNM-BZSNNMDCSA-N Phe-Lys-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O DOXQMJCSSYZSNM-BZSNNMDCSA-N 0.000 description 1
- BNRFQGLWLQESBG-YESZJQIVSA-N Phe-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O BNRFQGLWLQESBG-YESZJQIVSA-N 0.000 description 1
- BSHMIVKDJQGLNT-ACRUOGEOSA-N Phe-Lys-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 BSHMIVKDJQGLNT-ACRUOGEOSA-N 0.000 description 1
- FQUUYTNBMIBOHS-IHRRRGAJSA-N Phe-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N FQUUYTNBMIBOHS-IHRRRGAJSA-N 0.000 description 1
- RYQWALWYQWBUKN-FHWLQOOXSA-N Phe-Phe-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O RYQWALWYQWBUKN-FHWLQOOXSA-N 0.000 description 1
- GPLWGAYGROGDEN-BZSNNMDCSA-N Phe-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O GPLWGAYGROGDEN-BZSNNMDCSA-N 0.000 description 1
- MGLBSROLWAWCKN-FCLVOEFKSA-N Phe-Phe-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MGLBSROLWAWCKN-FCLVOEFKSA-N 0.000 description 1
- YMTMNYNEZDAGMW-RNXOBYDBSA-N Phe-Phe-Trp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N YMTMNYNEZDAGMW-RNXOBYDBSA-N 0.000 description 1
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 1
- AAERWTUHZKLDLC-IHRRRGAJSA-N Phe-Pro-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O AAERWTUHZKLDLC-IHRRRGAJSA-N 0.000 description 1
- CZQZSMJXFGGBHM-KKUMJFAQSA-N Phe-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O CZQZSMJXFGGBHM-KKUMJFAQSA-N 0.000 description 1
- QARPMYDMYVLFMW-KKUMJFAQSA-N Phe-Pro-Glu Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(O)=O)C1=CC=CC=C1 QARPMYDMYVLFMW-KKUMJFAQSA-N 0.000 description 1
- ZVRJWDUPIDMHDN-ULQDDVLXSA-N Phe-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 ZVRJWDUPIDMHDN-ULQDDVLXSA-N 0.000 description 1
- CKJACGQPCPMWIT-UFYCRDLUSA-N Phe-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 CKJACGQPCPMWIT-UFYCRDLUSA-N 0.000 description 1
- ODGNUUUDJONJSC-UFYCRDLUSA-N Phe-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)N)C(=O)N[C@@H](CC3=CC=C(C=C3)O)C(=O)O ODGNUUUDJONJSC-UFYCRDLUSA-N 0.000 description 1
- IIEOLPMQYRBZCN-SRVKXCTJSA-N Phe-Ser-Cys Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O IIEOLPMQYRBZCN-SRVKXCTJSA-N 0.000 description 1
- BONHGTUEEPIMPM-AVGNSLFASA-N Phe-Ser-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O BONHGTUEEPIMPM-AVGNSLFASA-N 0.000 description 1
- ILGCZYGFYQLSDZ-KKUMJFAQSA-N Phe-Ser-His Chemical compound N[C@@H](Cc1ccccc1)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O ILGCZYGFYQLSDZ-KKUMJFAQSA-N 0.000 description 1
- UNBFGVQVQGXXCK-KKUMJFAQSA-N Phe-Ser-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O UNBFGVQVQGXXCK-KKUMJFAQSA-N 0.000 description 1
- QSWKNJAPHQDAAS-MELADBBJSA-N Phe-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=CC=C2)N)C(=O)O QSWKNJAPHQDAAS-MELADBBJSA-N 0.000 description 1
- IAOZOFPONWDXNT-IXOXFDKPSA-N Phe-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IAOZOFPONWDXNT-IXOXFDKPSA-N 0.000 description 1
- GMWNQSGWWGKTSF-LFSVMHDDSA-N Phe-Thr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMWNQSGWWGKTSF-LFSVMHDDSA-N 0.000 description 1
- RAGOJJCBGXARPO-XVSYOHENSA-N Phe-Thr-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H]([C@H](O)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 RAGOJJCBGXARPO-XVSYOHENSA-N 0.000 description 1
- BSTPNLNKHKBONJ-HTUGSXCWSA-N Phe-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O BSTPNLNKHKBONJ-HTUGSXCWSA-N 0.000 description 1
- KLYYKKGCPOGDPE-OEAJRASXSA-N Phe-Thr-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O KLYYKKGCPOGDPE-OEAJRASXSA-N 0.000 description 1
- OLZVAVSJEUAOHI-UNQGMJICSA-N Phe-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O OLZVAVSJEUAOHI-UNQGMJICSA-N 0.000 description 1
- MSSXKZBDKZAHCX-UNQGMJICSA-N Phe-Thr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O MSSXKZBDKZAHCX-UNQGMJICSA-N 0.000 description 1
- YCEWAVIRWNGGSS-NQCBNZPSSA-N Phe-Trp-Ile Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)C1=CC=CC=C1 YCEWAVIRWNGGSS-NQCBNZPSSA-N 0.000 description 1
- WDOCBGZHAQQIBL-IHPCNDPISA-N Phe-Trp-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 WDOCBGZHAQQIBL-IHPCNDPISA-N 0.000 description 1
- QTDBZORPVYTRJU-KKXDTOCCSA-N Phe-Tyr-Ala Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C)C(O)=O QTDBZORPVYTRJU-KKXDTOCCSA-N 0.000 description 1
- CVAUVSOFHJKCHN-BZSNNMDCSA-N Phe-Tyr-Cys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CS)C(O)=O)C1=CC=CC=C1 CVAUVSOFHJKCHN-BZSNNMDCSA-N 0.000 description 1
- GCFNFKNPCMBHNT-IRXDYDNUSA-N Phe-Tyr-Gly Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)NCC(=O)O)N GCFNFKNPCMBHNT-IRXDYDNUSA-N 0.000 description 1
- ZYNBEWGJFXTBDU-ACRUOGEOSA-N Phe-Tyr-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)NC(=O)[C@H](CC2=CC=CC=C2)N ZYNBEWGJFXTBDU-ACRUOGEOSA-N 0.000 description 1
- FRMKIPSIZSFTTE-HJOGWXRNSA-N Phe-Tyr-Phe Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O FRMKIPSIZSFTTE-HJOGWXRNSA-N 0.000 description 1
- SJRQWEDYTKYHHL-SLFFLAALSA-N Phe-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CC=CC=C3)N)C(=O)O SJRQWEDYTKYHHL-SLFFLAALSA-N 0.000 description 1
- APMXLWHMIVWLLR-BZSNNMDCSA-N Phe-Tyr-Ser Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CO)C(O)=O)C1=CC=CC=C1 APMXLWHMIVWLLR-BZSNNMDCSA-N 0.000 description 1
- CDHURCQGUDNBMA-UBHSHLNASA-N Phe-Val-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 CDHURCQGUDNBMA-UBHSHLNASA-N 0.000 description 1
- BQMFWUKNOCJDNV-HJWJTTGWSA-N Phe-Val-Ile Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BQMFWUKNOCJDNV-HJWJTTGWSA-N 0.000 description 1
- IEIFEYBAYFSRBQ-IHRRRGAJSA-N Phe-Val-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N IEIFEYBAYFSRBQ-IHRRRGAJSA-N 0.000 description 1
- GAMLAXHLYGLQBJ-UFYCRDLUSA-N Phe-Val-Tyr Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC1=CC=C(C=C1)O)C(C)C)CC1=CC=CC=C1 GAMLAXHLYGLQBJ-UFYCRDLUSA-N 0.000 description 1
- 206010035226 Plasma cell myeloma Diseases 0.000 description 1
- FELJDCNGZFDUNR-WDSKDSINSA-N Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 FELJDCNGZFDUNR-WDSKDSINSA-N 0.000 description 1
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 1
- ALJGSKMBIUEJOB-FXQIFTODSA-N Pro-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 ALJGSKMBIUEJOB-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- DRVIASBABBMZTF-GUBZILKMSA-N Pro-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1 DRVIASBABBMZTF-GUBZILKMSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- OOLOTUZJUBOMAX-GUBZILKMSA-N Pro-Ala-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O OOLOTUZJUBOMAX-GUBZILKMSA-N 0.000 description 1
- NHDVNAKDACFHPX-GUBZILKMSA-N Pro-Arg-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O NHDVNAKDACFHPX-GUBZILKMSA-N 0.000 description 1
- SSSFPISOZOLQNP-GUBZILKMSA-N Pro-Arg-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SSSFPISOZOLQNP-GUBZILKMSA-N 0.000 description 1
- HPXVFFIIGOAQRV-DCAQKATOSA-N Pro-Arg-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O HPXVFFIIGOAQRV-DCAQKATOSA-N 0.000 description 1
- XZGWNSIRZIUHHP-SRVKXCTJSA-N Pro-Arg-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H]1CCCN1 XZGWNSIRZIUHHP-SRVKXCTJSA-N 0.000 description 1
- KDIIENQUNVNWHR-JYJNAYRXSA-N Pro-Arg-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O KDIIENQUNVNWHR-JYJNAYRXSA-N 0.000 description 1
- NUZHSNLQJDYSRW-BZSNNMDCSA-N Pro-Arg-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O NUZHSNLQJDYSRW-BZSNNMDCSA-N 0.000 description 1
- UVKNEILZSJMKSR-FXQIFTODSA-N Pro-Asn-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 UVKNEILZSJMKSR-FXQIFTODSA-N 0.000 description 1
- INXAPZFIOVGHSV-CIUDSAMLSA-N Pro-Asn-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1 INXAPZFIOVGHSV-CIUDSAMLSA-N 0.000 description 1
- AMBLXEMWFARNNQ-DCAQKATOSA-N Pro-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@@H]1CCCN1 AMBLXEMWFARNNQ-DCAQKATOSA-N 0.000 description 1
- MTHRMUXESFIAMS-DCAQKATOSA-N Pro-Asn-Lys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O MTHRMUXESFIAMS-DCAQKATOSA-N 0.000 description 1
- VOHFZDSRPZLXLH-IHRRRGAJSA-N Pro-Asn-Phe Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VOHFZDSRPZLXLH-IHRRRGAJSA-N 0.000 description 1
- MLQVJYMFASXBGZ-IHRRRGAJSA-N Pro-Asn-Tyr Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O MLQVJYMFASXBGZ-IHRRRGAJSA-N 0.000 description 1
- AHXPYZRZRMQOAU-QXEWZRGKSA-N Pro-Asn-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H]1CCCN1)C(O)=O AHXPYZRZRMQOAU-QXEWZRGKSA-N 0.000 description 1
- VJLJGKQAOQJXJG-CIUDSAMLSA-N Pro-Asp-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VJLJGKQAOQJXJG-CIUDSAMLSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- QVIZLAUEAMQKGS-GUBZILKMSA-N Pro-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@@H]1CCCN1 QVIZLAUEAMQKGS-GUBZILKMSA-N 0.000 description 1
- YFNOUBWUIIJQHF-LPEHRKFASA-N Pro-Asp-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N2CCC[C@@H]2C(=O)O YFNOUBWUIIJQHF-LPEHRKFASA-N 0.000 description 1
- FKKHDBFNOLCYQM-FXQIFTODSA-N Pro-Cys-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)N[C@@H](C)C(O)=O FKKHDBFNOLCYQM-FXQIFTODSA-N 0.000 description 1
- SZZBUDVXWZZPDH-BQBZGAKWSA-N Pro-Cys-Gly Chemical compound OC(=O)CNC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 SZZBUDVXWZZPDH-BQBZGAKWSA-N 0.000 description 1
- NOXSEHJOXCWRHK-DCAQKATOSA-N Pro-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 NOXSEHJOXCWRHK-DCAQKATOSA-N 0.000 description 1
- TUYWCHPXKQTISF-LPEHRKFASA-N Pro-Cys-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N2CCC[C@@H]2C(=O)O TUYWCHPXKQTISF-LPEHRKFASA-N 0.000 description 1
- JFNPBBOGGNMSRX-CIUDSAMLSA-N Pro-Gln-Ala Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O JFNPBBOGGNMSRX-CIUDSAMLSA-N 0.000 description 1
- ODPIUQVTULPQEP-CIUDSAMLSA-N Pro-Gln-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@@H]1CCCN1 ODPIUQVTULPQEP-CIUDSAMLSA-N 0.000 description 1
- WGAQWMRJUFQXMF-ZPFDUUQYSA-N Pro-Gln-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WGAQWMRJUFQXMF-ZPFDUUQYSA-N 0.000 description 1
- DRIJZWBRGMJCDD-DCAQKATOSA-N Pro-Gln-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O DRIJZWBRGMJCDD-DCAQKATOSA-N 0.000 description 1
- SKICPQLTOXGWGO-GARJFASQSA-N Pro-Gln-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCC(=O)N)C(=O)N2CCC[C@@H]2C(=O)O SKICPQLTOXGWGO-GARJFASQSA-N 0.000 description 1
- DIFXZGPHVCIVSQ-CIUDSAMLSA-N Pro-Gln-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DIFXZGPHVCIVSQ-CIUDSAMLSA-N 0.000 description 1
- UAYHMOIGIQZLFR-NHCYSSNCSA-N Pro-Gln-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UAYHMOIGIQZLFR-NHCYSSNCSA-N 0.000 description 1
- LHALYDBUDCWMDY-CIUDSAMLSA-N Pro-Glu-Ala Chemical compound C[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O LHALYDBUDCWMDY-CIUDSAMLSA-N 0.000 description 1
- KIPIKSXPPLABPN-CIUDSAMLSA-N Pro-Glu-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1 KIPIKSXPPLABPN-CIUDSAMLSA-N 0.000 description 1
- WVOXLKUUVCCCSU-ZPFDUUQYSA-N Pro-Glu-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVOXLKUUVCCCSU-ZPFDUUQYSA-N 0.000 description 1
- VOZIBWWZSBIXQN-SRVKXCTJSA-N Pro-Glu-Lys Chemical compound NCCCC[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H]1CCCN1)C(O)=O VOZIBWWZSBIXQN-SRVKXCTJSA-N 0.000 description 1
- WFHYFCWBLSKEMS-KKUMJFAQSA-N Pro-Glu-Phe Chemical compound N([C@@H](CCC(=O)O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C(=O)[C@@H]1CCCN1 WFHYFCWBLSKEMS-KKUMJFAQSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- DMKWYMWNEKIPFC-IUCAKERBSA-N Pro-Gly-Arg Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](CCCNC(N)=N)C(O)=O DMKWYMWNEKIPFC-IUCAKERBSA-N 0.000 description 1
- HAAQQNHQZBOWFO-LURJTMIESA-N Pro-Gly-Gly Chemical compound OC(=O)CNC(=O)CNC(=O)[C@@H]1CCCN1 HAAQQNHQZBOWFO-LURJTMIESA-N 0.000 description 1
- HAEGAELAYWSUNC-WPRPVWTQSA-N Pro-Gly-Val Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HAEGAELAYWSUNC-WPRPVWTQSA-N 0.000 description 1
- PEYNRYREGPAOAK-LSJOCFKGSA-N Pro-His-Ala Chemical compound C([C@@H](C(=O)N[C@@H](C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 PEYNRYREGPAOAK-LSJOCFKGSA-N 0.000 description 1
- LCUOTSLIVGSGAU-AVGNSLFASA-N Pro-His-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LCUOTSLIVGSGAU-AVGNSLFASA-N 0.000 description 1
- JUJGNDZIKKQMDJ-IHRRRGAJSA-N Pro-His-His Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CNC=N1)C(O)=O JUJGNDZIKKQMDJ-IHRRRGAJSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- JRQCDSNPRNGWRG-AVGNSLFASA-N Pro-His-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@@H]2CCCN2 JRQCDSNPRNGWRG-AVGNSLFASA-N 0.000 description 1
- AQSMZTIEJMZQEC-DCAQKATOSA-N Pro-His-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O AQSMZTIEJMZQEC-DCAQKATOSA-N 0.000 description 1
- LPGSNRSLPHRNBW-AVGNSLFASA-N Pro-His-Val Chemical compound C([C@@H](C(=O)N[C@@H](C(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 LPGSNRSLPHRNBW-AVGNSLFASA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- BBFRBZYKHIKFBX-GMOBBJLQSA-N Pro-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BBFRBZYKHIKFBX-GMOBBJLQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- KLSOMAFWRISSNI-OSUNSFLBSA-N Pro-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 KLSOMAFWRISSNI-OSUNSFLBSA-N 0.000 description 1
- CLJLVCYFABNTHP-DCAQKATOSA-N Pro-Leu-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O CLJLVCYFABNTHP-DCAQKATOSA-N 0.000 description 1
- FYPGHGXAOZTOBO-IHRRRGAJSA-N Pro-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@@H]2CCCN2 FYPGHGXAOZTOBO-IHRRRGAJSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- YAZNFQUKPUASKB-DCAQKATOSA-N Pro-Lys-Cys Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)O YAZNFQUKPUASKB-DCAQKATOSA-N 0.000 description 1
- XQPHBAKJJJZOBX-SRVKXCTJSA-N Pro-Lys-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O XQPHBAKJJJZOBX-SRVKXCTJSA-N 0.000 description 1
- MHHQQZIFLWFZGR-DCAQKATOSA-N Pro-Lys-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O MHHQQZIFLWFZGR-DCAQKATOSA-N 0.000 description 1
- FRVUYKWGPCQRBL-GUBZILKMSA-N Pro-Met-Cys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@@H]1CCCN1 FRVUYKWGPCQRBL-GUBZILKMSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- ZJXXCGZFYQQETF-CYDGBPFRSA-N Pro-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 ZJXXCGZFYQQETF-CYDGBPFRSA-N 0.000 description 1
- QCMYJBKTMIWZAP-AVGNSLFASA-N Pro-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 QCMYJBKTMIWZAP-AVGNSLFASA-N 0.000 description 1
- APIAILHCTSBGLU-JYJNAYRXSA-N Pro-Met-Phe Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@@H]2CCCN2 APIAILHCTSBGLU-JYJNAYRXSA-N 0.000 description 1
- WLJYLAQSUSIQNH-GUBZILKMSA-N Pro-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@@H]1CCCN1 WLJYLAQSUSIQNH-GUBZILKMSA-N 0.000 description 1
- AUYKOPJPKUCYHE-SRVKXCTJSA-N Pro-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@@H]1CCCN1 AUYKOPJPKUCYHE-SRVKXCTJSA-N 0.000 description 1
- LGMBKOAPPTYKLC-JYJNAYRXSA-N Pro-Phe-Arg Chemical compound C([C@@H](C(=O)N[C@@H](CCCNC(=N)N)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 LGMBKOAPPTYKLC-JYJNAYRXSA-N 0.000 description 1
- MLKVIVZCFYRTIR-KKUMJFAQSA-N Pro-Phe-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O MLKVIVZCFYRTIR-KKUMJFAQSA-N 0.000 description 1
- JIWJRKNYLSHONY-KKUMJFAQSA-N Pro-Phe-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JIWJRKNYLSHONY-KKUMJFAQSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- SWRNSCMUXRLHCR-ULQDDVLXSA-N Pro-Phe-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 SWRNSCMUXRLHCR-ULQDDVLXSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- SPLBRAKYXGOFSO-UNQGMJICSA-N Pro-Phe-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@@H]2CCCN2)O SPLBRAKYXGOFSO-UNQGMJICSA-N 0.000 description 1
- GFHOSBYCLACKEK-GUBZILKMSA-N Pro-Pro-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O GFHOSBYCLACKEK-GUBZILKMSA-N 0.000 description 1
- FHZJRBVMLGOHBX-GUBZILKMSA-N Pro-Pro-Asp Chemical compound OC(=O)C[C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@@H]1CCCN1)C(O)=O FHZJRBVMLGOHBX-GUBZILKMSA-N 0.000 description 1
- RFWXYTJSVDUBBZ-DCAQKATOSA-N Pro-Pro-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 RFWXYTJSVDUBBZ-DCAQKATOSA-N 0.000 description 1
- PCWLNNZTBJTZRN-AVGNSLFASA-N Pro-Pro-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 PCWLNNZTBJTZRN-AVGNSLFASA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- OWQXAJQZLWHPBH-FXQIFTODSA-N Pro-Ser-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O OWQXAJQZLWHPBH-FXQIFTODSA-N 0.000 description 1
- ITUDDXVFGFEKPD-NAKRPEOUSA-N Pro-Ser-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O ITUDDXVFGFEKPD-NAKRPEOUSA-N 0.000 description 1
- KWMZPPWYBVZIER-XGEHTFHBSA-N Pro-Ser-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KWMZPPWYBVZIER-XGEHTFHBSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- KIDXAAQVMNLJFQ-KZVJFYERSA-N Pro-Thr-Ala Chemical compound C[C@@H](O)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](C)C(O)=O KIDXAAQVMNLJFQ-KZVJFYERSA-N 0.000 description 1
- WVXQQUWOKUZIEG-VEVYYDQMSA-N Pro-Thr-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O WVXQQUWOKUZIEG-VEVYYDQMSA-N 0.000 description 1
- HRIXMVRZRGFKNQ-HJGDQZAQSA-N Pro-Thr-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(O)=O HRIXMVRZRGFKNQ-HJGDQZAQSA-N 0.000 description 1
- GXWRTSIVLSQACD-RCWTZXSCSA-N Pro-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@@H]1CCCN1)O GXWRTSIVLSQACD-RCWTZXSCSA-N 0.000 description 1
- NBDHWLZEMKSVHH-UVBJJODRSA-N Pro-Trp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 NBDHWLZEMKSVHH-UVBJJODRSA-N 0.000 description 1
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 1
- LEBTWGWVUVJNTA-FKBYEOEOSA-N Pro-Trp-Phe Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)N[C@@H](CC4=CC=CC=C4)C(=O)O LEBTWGWVUVJNTA-FKBYEOEOSA-N 0.000 description 1
- VBZXFFYOBDLLFE-HSHDSVGOSA-N Pro-Trp-Thr Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H]([C@H](O)C)C(O)=O)C(=O)[C@@H]1CCCN1 VBZXFFYOBDLLFE-HSHDSVGOSA-N 0.000 description 1
- MCPXQHVVCPTRIM-HJOGWXRNSA-N Pro-Trp-Trp Chemical compound N([C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)O)C(=O)[C@@H]1CCCN1 MCPXQHVVCPTRIM-HJOGWXRNSA-N 0.000 description 1
- CWZUFLWPEFHWEI-IHRRRGAJSA-N Pro-Tyr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O CWZUFLWPEFHWEI-IHRRRGAJSA-N 0.000 description 1
- BVTYXOFTHDXSNI-IHRRRGAJSA-N Pro-Tyr-Cys Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 BVTYXOFTHDXSNI-IHRRRGAJSA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- IMNVAOPEMFDAQD-NHCYSSNCSA-N Pro-Val-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IMNVAOPEMFDAQD-NHCYSSNCSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- OQSGBXGNAFQGGS-CYDGBPFRSA-N Pro-Val-Ile Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O OQSGBXGNAFQGGS-CYDGBPFRSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- VDHGTOHMHHQSKG-JYJNAYRXSA-N Pro-Val-Phe Chemical compound CC(C)[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O VDHGTOHMHHQSKG-JYJNAYRXSA-N 0.000 description 1
- FHJQROWZEJFZPO-SRVKXCTJSA-N Pro-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FHJQROWZEJFZPO-SRVKXCTJSA-N 0.000 description 1
- OFOBLEOULBTSOW-UHFFFAOYSA-N Propanedioic acid Natural products OC(=O)CC(O)=O OFOBLEOULBTSOW-UHFFFAOYSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- 108010026552 Proteome Proteins 0.000 description 1
- 108010054530 RGDN peptide Proteins 0.000 description 1
- 108010092799 RNA-directed DNA polymerase Proteins 0.000 description 1
- 108010025216 RVF peptide Proteins 0.000 description 1
- 102100037486 Reverse transcriptase/ribonuclease H Human genes 0.000 description 1
- 229910003797 SPO1 Inorganic materials 0.000 description 1
- 229910003798 SPO2 Inorganic materials 0.000 description 1
- 101100150136 Saccharomyces cerevisiae (strain ATCC 204508 / S288c) SPO1 gene Proteins 0.000 description 1
- 101100478210 Schizosaccharomyces pombe (strain 972 / ATCC 24843) spo2 gene Proteins 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- BKOKTRCZXRIQPX-ZLUOBGJFSA-N Ser-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N BKOKTRCZXRIQPX-ZLUOBGJFSA-N 0.000 description 1
- MMGJPDWSIOAGTH-ACZMJKKPSA-N Ser-Ala-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O MMGJPDWSIOAGTH-ACZMJKKPSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- DWUIECHTAMYEFL-XVYDVKMFSA-N Ser-Ala-His Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 DWUIECHTAMYEFL-XVYDVKMFSA-N 0.000 description 1
- WTWGOQRNRFHFQD-JBDRJPRFSA-N Ser-Ala-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WTWGOQRNRFHFQD-JBDRJPRFSA-N 0.000 description 1
- YUSRGTQIPCJNHQ-CIUDSAMLSA-N Ser-Arg-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O YUSRGTQIPCJNHQ-CIUDSAMLSA-N 0.000 description 1
- VQBLHWSPVYYZTB-DCAQKATOSA-N Ser-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N VQBLHWSPVYYZTB-DCAQKATOSA-N 0.000 description 1
- QGMLKFGTGXWAHF-IHRRRGAJSA-N Ser-Arg-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QGMLKFGTGXWAHF-IHRRRGAJSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HZWAHWQZPSXNCB-BPUTZDHNSA-N Ser-Arg-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O HZWAHWQZPSXNCB-BPUTZDHNSA-N 0.000 description 1
- XVAUJOAYHWWNQF-ZLUOBGJFSA-N Ser-Asn-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O XVAUJOAYHWWNQF-ZLUOBGJFSA-N 0.000 description 1
- OOKCGAYXSNJBGQ-ZLUOBGJFSA-N Ser-Asn-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OOKCGAYXSNJBGQ-ZLUOBGJFSA-N 0.000 description 1
- UBRXAVQWXOWRSJ-ZLUOBGJFSA-N Ser-Asn-Asp Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)C(=O)N UBRXAVQWXOWRSJ-ZLUOBGJFSA-N 0.000 description 1
- FIDMVVBUOCMMJG-CIUDSAMLSA-N Ser-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CO FIDMVVBUOCMMJG-CIUDSAMLSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- UGJRQLURDVGULT-LKXGYXEUSA-N Ser-Asn-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O UGJRQLURDVGULT-LKXGYXEUSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- CNIIKZQXBBQHCX-FXQIFTODSA-N Ser-Asp-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O CNIIKZQXBBQHCX-FXQIFTODSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- MESDJCNHLZBMEP-ZLUOBGJFSA-N Ser-Asp-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MESDJCNHLZBMEP-ZLUOBGJFSA-N 0.000 description 1
- FTVRVZNYIYWJGB-ACZMJKKPSA-N Ser-Asp-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FTVRVZNYIYWJGB-ACZMJKKPSA-N 0.000 description 1
- BNFVPSRLHHPQKS-WHFBIAKZSA-N Ser-Asp-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O BNFVPSRLHHPQKS-WHFBIAKZSA-N 0.000 description 1
- SFZKGGOGCNQPJY-CIUDSAMLSA-N Ser-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N SFZKGGOGCNQPJY-CIUDSAMLSA-N 0.000 description 1
- OLIJLNWFEQEFDM-SRVKXCTJSA-N Ser-Asp-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 OLIJLNWFEQEFDM-SRVKXCTJSA-N 0.000 description 1
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 1
- BLPYXIXXCFVIIF-FXQIFTODSA-N Ser-Cys-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N)CN=C(N)N BLPYXIXXCFVIIF-FXQIFTODSA-N 0.000 description 1
- TUYBIWUZWJUZDD-ACZMJKKPSA-N Ser-Cys-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CCC(N)=O TUYBIWUZWJUZDD-ACZMJKKPSA-N 0.000 description 1
- WTPKKLMBNBCCNL-ACZMJKKPSA-N Ser-Cys-Glu Chemical compound C(CC(=O)O)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N WTPKKLMBNBCCNL-ACZMJKKPSA-N 0.000 description 1
- RNFKSBPHLTZHLU-WHFBIAKZSA-N Ser-Cys-Gly Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)NCC(=O)O)N)O RNFKSBPHLTZHLU-WHFBIAKZSA-N 0.000 description 1
- KMWFXJCGRXBQAC-CIUDSAMLSA-N Ser-Cys-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CO)N KMWFXJCGRXBQAC-CIUDSAMLSA-N 0.000 description 1
- XSYJDGIDKRNWFX-SRVKXCTJSA-N Ser-Cys-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XSYJDGIDKRNWFX-SRVKXCTJSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- CRZRTKAVUUGKEQ-ACZMJKKPSA-N Ser-Gln-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CRZRTKAVUUGKEQ-ACZMJKKPSA-N 0.000 description 1
- CDVFZMOFNJPUDD-ACZMJKKPSA-N Ser-Gln-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O CDVFZMOFNJPUDD-ACZMJKKPSA-N 0.000 description 1
- VMVNCJDKFOQOHM-GUBZILKMSA-N Ser-Gln-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N VMVNCJDKFOQOHM-GUBZILKMSA-N 0.000 description 1
- BQWCDDAISCPDQV-XHNCKOQMSA-N Ser-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N)C(=O)O BQWCDDAISCPDQV-XHNCKOQMSA-N 0.000 description 1
- VDVYTKZBMFADQH-AVGNSLFASA-N Ser-Gln-Tyr Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 VDVYTKZBMFADQH-AVGNSLFASA-N 0.000 description 1
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 1
- PVDTYLHUWAEYGY-CIUDSAMLSA-N Ser-Glu-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O PVDTYLHUWAEYGY-CIUDSAMLSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- QKQDTEYDEIJPNK-GUBZILKMSA-N Ser-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CO QKQDTEYDEIJPNK-GUBZILKMSA-N 0.000 description 1
- DSGYZICNAMEJOC-AVGNSLFASA-N Ser-Glu-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O DSGYZICNAMEJOC-AVGNSLFASA-N 0.000 description 1
- UFKPDBLKLOBMRH-XHNCKOQMSA-N Ser-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N)C(=O)O UFKPDBLKLOBMRH-XHNCKOQMSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- BPMRXBZYPGYPJN-WHFBIAKZSA-N Ser-Gly-Asn Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O BPMRXBZYPGYPJN-WHFBIAKZSA-N 0.000 description 1
- MIJWOJAXARLEHA-WDSKDSINSA-N Ser-Gly-Glu Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O MIJWOJAXARLEHA-WDSKDSINSA-N 0.000 description 1
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 1
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 1
- LOKXAXAESFYFAX-CIUDSAMLSA-N Ser-His-Cys Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CS)C(O)=O)CC1=CN=CN1 LOKXAXAESFYFAX-CIUDSAMLSA-N 0.000 description 1
- CLKKNZQUQMZDGD-SRVKXCTJSA-N Ser-His-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CN=CN1 CLKKNZQUQMZDGD-SRVKXCTJSA-N 0.000 description 1
- CAOYHZOWXFFAIR-CIUDSAMLSA-N Ser-His-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O CAOYHZOWXFFAIR-CIUDSAMLSA-N 0.000 description 1
- JEHPKECJCALLRW-CUJWVEQBSA-N Ser-His-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JEHPKECJCALLRW-CUJWVEQBSA-N 0.000 description 1
- ZUDXUJSYCCNZQJ-DCAQKATOSA-N Ser-His-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CO)N ZUDXUJSYCCNZQJ-DCAQKATOSA-N 0.000 description 1
- LQESNKGTTNHZPZ-GHCJXIJMSA-N Ser-Ile-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O LQESNKGTTNHZPZ-GHCJXIJMSA-N 0.000 description 1
- CJINPXGSKSZQNE-KBIXCLLPSA-N Ser-Ile-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(N)=O)C(O)=O CJINPXGSKSZQNE-KBIXCLLPSA-N 0.000 description 1
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 1
- HBTCFCHYALPXME-HTFCKZLJSA-N Ser-Ile-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HBTCFCHYALPXME-HTFCKZLJSA-N 0.000 description 1
- RIAKPZVSNBBNRE-BJDJZHNGSA-N Ser-Ile-Leu Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O RIAKPZVSNBBNRE-BJDJZHNGSA-N 0.000 description 1
- YMDNFPNTIPQMJP-NAKRPEOUSA-N Ser-Ile-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCSC)C(O)=O YMDNFPNTIPQMJP-NAKRPEOUSA-N 0.000 description 1
- MOINZPRHJGTCHZ-MMWGEVLESA-N Ser-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N MOINZPRHJGTCHZ-MMWGEVLESA-N 0.000 description 1
- ZOPISOXXPQNOCO-SVSWQMSJSA-N Ser-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CO)N ZOPISOXXPQNOCO-SVSWQMSJSA-N 0.000 description 1
- XXNYYSXNXCJYKX-DCAQKATOSA-N Ser-Leu-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(O)=O XXNYYSXNXCJYKX-DCAQKATOSA-N 0.000 description 1
- JWOBLHJRDADHLN-KKUMJFAQSA-N Ser-Leu-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JWOBLHJRDADHLN-KKUMJFAQSA-N 0.000 description 1
- GZSZPKSBVAOGIE-CIUDSAMLSA-N Ser-Lys-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O GZSZPKSBVAOGIE-CIUDSAMLSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- CRJZZXMAADSBBQ-SRVKXCTJSA-N Ser-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO CRJZZXMAADSBBQ-SRVKXCTJSA-N 0.000 description 1
- OCWWJBZQXGYQCA-DCAQKATOSA-N Ser-Lys-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O OCWWJBZQXGYQCA-DCAQKATOSA-N 0.000 description 1
- PTWIYDNFWPXQSD-GARJFASQSA-N Ser-Lys-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N)C(=O)O PTWIYDNFWPXQSD-GARJFASQSA-N 0.000 description 1
- LPSKHZWBQONOQJ-XIRDDKMYSA-N Ser-Lys-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CO)N LPSKHZWBQONOQJ-XIRDDKMYSA-N 0.000 description 1
- KJKQUQXDEKMPDK-FXQIFTODSA-N Ser-Met-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O KJKQUQXDEKMPDK-FXQIFTODSA-N 0.000 description 1
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 1
- IFLVBVIYADZIQO-DCAQKATOSA-N Ser-Met-Lys Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N IFLVBVIYADZIQO-DCAQKATOSA-N 0.000 description 1
- VIIJCAQMJBHSJH-FXQIFTODSA-N Ser-Met-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O VIIJCAQMJBHSJH-FXQIFTODSA-N 0.000 description 1
- RXSWQCATLWVDLI-XGEHTFHBSA-N Ser-Met-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RXSWQCATLWVDLI-XGEHTFHBSA-N 0.000 description 1
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 1
- HJAXVYLCKDPPDF-SRVKXCTJSA-N Ser-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N HJAXVYLCKDPPDF-SRVKXCTJSA-N 0.000 description 1
- BUYHXYIUQUBEQP-AVGNSLFASA-N Ser-Phe-Glu Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CO)N BUYHXYIUQUBEQP-AVGNSLFASA-N 0.000 description 1
- KZPRPBLHYMZIMH-MXAVVETBSA-N Ser-Phe-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KZPRPBLHYMZIMH-MXAVVETBSA-N 0.000 description 1
- RRVFEDGUXSYWOW-BZSNNMDCSA-N Ser-Phe-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RRVFEDGUXSYWOW-BZSNNMDCSA-N 0.000 description 1
- FBLNYDYPCLFTSP-IXOXFDKPSA-N Ser-Phe-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FBLNYDYPCLFTSP-IXOXFDKPSA-N 0.000 description 1
- MHVXPTAMDHLTHB-IHPCNDPISA-N Ser-Phe-Trp Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(O)=O)C1=CC=CC=C1 MHVXPTAMDHLTHB-IHPCNDPISA-N 0.000 description 1
- PJIQEIFXZPCWOJ-FXQIFTODSA-N Ser-Pro-Asp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O PJIQEIFXZPCWOJ-FXQIFTODSA-N 0.000 description 1
- GZGFSPWOMUKKCV-NAKRPEOUSA-N Ser-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO GZGFSPWOMUKKCV-NAKRPEOUSA-N 0.000 description 1
- OVQZAFXWIWNYKA-GUBZILKMSA-N Ser-Pro-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CO)N OVQZAFXWIWNYKA-GUBZILKMSA-N 0.000 description 1
- BVLGVLWFIZFEAH-BPUTZDHNSA-N Ser-Pro-Trp Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O BVLGVLWFIZFEAH-BPUTZDHNSA-N 0.000 description 1
- OZPDGESCTGGNAD-CIUDSAMLSA-N Ser-Ser-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CO OZPDGESCTGGNAD-CIUDSAMLSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- PYTKULIABVRXSC-BWBBJGPYSA-N Ser-Ser-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PYTKULIABVRXSC-BWBBJGPYSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 1
- SQHKXWODKJDZRC-LKXGYXEUSA-N Ser-Thr-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(O)=O SQHKXWODKJDZRC-LKXGYXEUSA-N 0.000 description 1
- SZRNDHWMVSFPSP-XKBZYTNZSA-N Ser-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CO)N)O SZRNDHWMVSFPSP-XKBZYTNZSA-N 0.000 description 1
- KKKVOZNCLALMPV-XKBZYTNZSA-N Ser-Thr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KKKVOZNCLALMPV-XKBZYTNZSA-N 0.000 description 1
- DYEGLQRVMBWQLD-IXOXFDKPSA-N Ser-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CO)N)O DYEGLQRVMBWQLD-IXOXFDKPSA-N 0.000 description 1
- STIAINRLUUKYKM-WFBYXXMGSA-N Ser-Trp-Ala Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 STIAINRLUUKYKM-WFBYXXMGSA-N 0.000 description 1
- AXKJPUBALUNJEO-UBHSHLNASA-N Ser-Trp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(N)=O)C(O)=O AXKJPUBALUNJEO-UBHSHLNASA-N 0.000 description 1
- ATEQEHCGZKBEMU-GQGQLFGLSA-N Ser-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N ATEQEHCGZKBEMU-GQGQLFGLSA-N 0.000 description 1
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 1
- ZWSZBWAFDZRBNM-UBHSHLNASA-N Ser-Trp-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(O)=O ZWSZBWAFDZRBNM-UBHSHLNASA-N 0.000 description 1
- HAUVENOGHPECML-BPUTZDHNSA-N Ser-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CO)=CNC2=C1 HAUVENOGHPECML-BPUTZDHNSA-N 0.000 description 1
- PQEQXWRVHQAAKS-SRVKXCTJSA-N Ser-Tyr-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CO)N)CC1=CC=C(O)C=C1 PQEQXWRVHQAAKS-SRVKXCTJSA-N 0.000 description 1
- GSCVDSBEYVGMJQ-SRVKXCTJSA-N Ser-Tyr-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CO)N)O GSCVDSBEYVGMJQ-SRVKXCTJSA-N 0.000 description 1
- VEVYMLNYMULSMS-AVGNSLFASA-N Ser-Tyr-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(O)=O VEVYMLNYMULSMS-AVGNSLFASA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- FHXGMDRKJHKLKW-QWRGUYRKSA-N Ser-Tyr-Gly Chemical compound OC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 FHXGMDRKJHKLKW-QWRGUYRKSA-N 0.000 description 1
- UBTNVMGPMYDYIU-HJPIBITLSA-N Ser-Tyr-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UBTNVMGPMYDYIU-HJPIBITLSA-N 0.000 description 1
- HKHCTNFKZXAMIF-KKUMJFAQSA-N Ser-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CC1=CC=C(O)C=C1 HKHCTNFKZXAMIF-KKUMJFAQSA-N 0.000 description 1
- VVKVHAOOUGNDPJ-SRVKXCTJSA-N Ser-Tyr-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VVKVHAOOUGNDPJ-SRVKXCTJSA-N 0.000 description 1
- PCMZJFMUYWIERL-ZKWXMUAHSA-N Ser-Val-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O PCMZJFMUYWIERL-ZKWXMUAHSA-N 0.000 description 1
- SGZVZUCRAVSPKQ-FXQIFTODSA-N Ser-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N SGZVZUCRAVSPKQ-FXQIFTODSA-N 0.000 description 1
- JZRYFUGREMECBH-XPUUQOCRSA-N Ser-Val-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O JZRYFUGREMECBH-XPUUQOCRSA-N 0.000 description 1
- SYCFMSYTIFXWAJ-DCAQKATOSA-N Ser-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N SYCFMSYTIFXWAJ-DCAQKATOSA-N 0.000 description 1
- HNDMFDBQXYZSRM-IHRRRGAJSA-N Ser-Val-Phe Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O HNDMFDBQXYZSRM-IHRRRGAJSA-N 0.000 description 1
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 1
- 102000004589 Solute Carrier Proteins Human genes 0.000 description 1
- 108010042650 Solute Carrier Proteins Proteins 0.000 description 1
- 238000002105 Southern blotting Methods 0.000 description 1
- 238000006826 Stephen synthesis reaction Methods 0.000 description 1
- KDYFGRWQOYBRFD-UHFFFAOYSA-N Succinic acid Natural products OC(=O)CCC(O)=O KDYFGRWQOYBRFD-UHFFFAOYSA-N 0.000 description 1
- 101150006914 TRP1 gene Proteins 0.000 description 1
- FEWJPZIEWOKRBE-UHFFFAOYSA-N Tartaric acid Natural products [H+].[H+].[O-]C(=O)C(O)C(O)C([O-])=O FEWJPZIEWOKRBE-UHFFFAOYSA-N 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DDPVJPIGACCMEH-XQXXSGGOSA-N Thr-Ala-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DDPVJPIGACCMEH-XQXXSGGOSA-N 0.000 description 1
- PXQUBKWZENPDGE-CIQUZCHMSA-N Thr-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)O)N PXQUBKWZENPDGE-CIQUZCHMSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- JMZKMSTYXHFYAK-VEVYYDQMSA-N Thr-Arg-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)O JMZKMSTYXHFYAK-VEVYYDQMSA-N 0.000 description 1
- GLQFKOVWXPPFTP-VEVYYDQMSA-N Thr-Arg-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O GLQFKOVWXPPFTP-VEVYYDQMSA-N 0.000 description 1
- JMQUAZXYFAEOIH-XGEHTFHBSA-N Thr-Arg-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N)O JMQUAZXYFAEOIH-XGEHTFHBSA-N 0.000 description 1
- UTSWGQNAQRIHAI-UNQGMJICSA-N Thr-Arg-Phe Chemical compound NC(N)=NCCC[C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UTSWGQNAQRIHAI-UNQGMJICSA-N 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- JNQZPAWOPBZGIX-RCWTZXSCSA-N Thr-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)CCCN=C(N)N JNQZPAWOPBZGIX-RCWTZXSCSA-N 0.000 description 1
- JHBHMCMKSPXRHV-NUMRIWBASA-N Thr-Asn-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O JHBHMCMKSPXRHV-NUMRIWBASA-N 0.000 description 1
- TZKPNGDGUVREEB-FOHZUACHSA-N Thr-Asn-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O TZKPNGDGUVREEB-FOHZUACHSA-N 0.000 description 1
- JTEICXDKGWKRRV-HJGDQZAQSA-N Thr-Asn-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O JTEICXDKGWKRRV-HJGDQZAQSA-N 0.000 description 1
- JBHMLZSKIXMVFS-XVSYOHENSA-N Thr-Asn-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JBHMLZSKIXMVFS-XVSYOHENSA-N 0.000 description 1
- LXWZOMSOUAMOIA-JIOCBJNQSA-N Thr-Asn-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N)O LXWZOMSOUAMOIA-JIOCBJNQSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- PZVGOVRNGKEFCB-KKHAAJSZSA-N Thr-Asn-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N)O PZVGOVRNGKEFCB-KKHAAJSZSA-N 0.000 description 1
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 1
- DCCGCVLVVSAJFK-NUMRIWBASA-N Thr-Asp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O DCCGCVLVVSAJFK-NUMRIWBASA-N 0.000 description 1
- DCLBXIWHLVEPMQ-JRQIVUDYSA-N Thr-Asp-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 DCLBXIWHLVEPMQ-JRQIVUDYSA-N 0.000 description 1
- DSLHSTIUAPKERR-XGEHTFHBSA-N Thr-Cys-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(O)=O DSLHSTIUAPKERR-XGEHTFHBSA-N 0.000 description 1
- RJBFAHKSFNNHAI-XKBZYTNZSA-N Thr-Gln-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CS)C(=O)O)N)O RJBFAHKSFNNHAI-XKBZYTNZSA-N 0.000 description 1
- UHBPFYOQQPFKQR-JHEQGTHGSA-N Thr-Gln-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O UHBPFYOQQPFKQR-JHEQGTHGSA-N 0.000 description 1
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 1
- MQUZMZBFKCHVOB-HJGDQZAQSA-N Thr-Gln-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O MQUZMZBFKCHVOB-HJGDQZAQSA-N 0.000 description 1
- KGKWKSSSQGGYAU-SUSMZKCASA-N Thr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KGKWKSSSQGGYAU-SUSMZKCASA-N 0.000 description 1
- RCEHMXVEMNXRIW-IRIUXVKKSA-N Thr-Gln-Tyr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N)O RCEHMXVEMNXRIW-IRIUXVKKSA-N 0.000 description 1
- LGNBRHZANHMZHK-NUMRIWBASA-N Thr-Glu-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O LGNBRHZANHMZHK-NUMRIWBASA-N 0.000 description 1
- GKWNLDNXMMLRMC-GLLZPBPUSA-N Thr-Glu-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O GKWNLDNXMMLRMC-GLLZPBPUSA-N 0.000 description 1
- LHEZGZQRLDBSRR-WDCWCFNPSA-N Thr-Glu-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LHEZGZQRLDBSRR-WDCWCFNPSA-N 0.000 description 1
- HJOSVGCWOTYJFG-WDCWCFNPSA-N Thr-Glu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N)O HJOSVGCWOTYJFG-WDCWCFNPSA-N 0.000 description 1
- OQCXTUQTKQFDCX-HTUGSXCWSA-N Thr-Glu-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O OQCXTUQTKQFDCX-HTUGSXCWSA-N 0.000 description 1
- KBLYJPQSNGTDIU-LOKLDPHHSA-N Thr-Glu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O KBLYJPQSNGTDIU-LOKLDPHHSA-N 0.000 description 1
- ONNSECRQFSTMCC-XKBZYTNZSA-N Thr-Glu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ONNSECRQFSTMCC-XKBZYTNZSA-N 0.000 description 1
- WYKJENSCCRJLRC-ZDLURKLDSA-N Thr-Gly-Cys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)O WYKJENSCCRJLRC-ZDLURKLDSA-N 0.000 description 1
- AQAMPXBRJJWPNI-JHEQGTHGSA-N Thr-Gly-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AQAMPXBRJJWPNI-JHEQGTHGSA-N 0.000 description 1
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 1
- MPUMPERGHHJGRP-WEDXCCLWSA-N Thr-Gly-Lys Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N)O MPUMPERGHHJGRP-WEDXCCLWSA-N 0.000 description 1
- UBDDORVPVLEECX-FJXKBIBVSA-N Thr-Gly-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O UBDDORVPVLEECX-FJXKBIBVSA-N 0.000 description 1
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 1
- JQAWYCUUFIMTHE-WLTAIBSBSA-N Thr-Gly-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JQAWYCUUFIMTHE-WLTAIBSBSA-N 0.000 description 1
- YUOCMLNTUZAGNF-KLHWPWHYSA-N Thr-His-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N2CCC[C@@H]2C(=O)O)N)O YUOCMLNTUZAGNF-KLHWPWHYSA-N 0.000 description 1
- YUPVPKZBKCLFLT-QTKMDUPCSA-N Thr-His-Val Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N)O YUPVPKZBKCLFLT-QTKMDUPCSA-N 0.000 description 1
- PAXANSWUSVPFNK-IUKAMOBKSA-N Thr-Ile-Asn Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N PAXANSWUSVPFNK-IUKAMOBKSA-N 0.000 description 1
- AHOLTQCAVBSUDP-PPCPHDFISA-N Thr-Ile-Lys Chemical compound CC[C@H](C)[C@H](NC(=O)[C@@H](N)[C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O AHOLTQCAVBSUDP-PPCPHDFISA-N 0.000 description 1
- YJCVECXVYHZOBK-KNZXXDILSA-N Thr-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H]([C@@H](C)O)N YJCVECXVYHZOBK-KNZXXDILSA-N 0.000 description 1
- GXUWHVZYDAHFSV-FLBSBUHZSA-N Thr-Ile-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GXUWHVZYDAHFSV-FLBSBUHZSA-N 0.000 description 1
- XYFISNXATOERFZ-OSUNSFLBSA-N Thr-Ile-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N XYFISNXATOERFZ-OSUNSFLBSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- ODXKUIGEPAGKKV-KATARQTJSA-N Thr-Leu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CS)C(=O)O)N)O ODXKUIGEPAGKKV-KATARQTJSA-N 0.000 description 1
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 1
- PRNGXSILMXSWQQ-OEAJRASXSA-N Thr-Leu-Phe Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O PRNGXSILMXSWQQ-OEAJRASXSA-N 0.000 description 1
- KRDSCBLRHORMRK-JXUBOQSCSA-N Thr-Lys-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O KRDSCBLRHORMRK-JXUBOQSCSA-N 0.000 description 1
- SCSVNSNWUTYSFO-WDCWCFNPSA-N Thr-Lys-Glu Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O SCSVNSNWUTYSFO-WDCWCFNPSA-N 0.000 description 1
- ZXIHABSKUITPTN-IXOXFDKPSA-N Thr-Lys-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O ZXIHABSKUITPTN-IXOXFDKPSA-N 0.000 description 1
- DCRHJDRLCFMEBI-RHYQMDGZSA-N Thr-Lys-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(O)=O DCRHJDRLCFMEBI-RHYQMDGZSA-N 0.000 description 1
- XSEPSRUDSPHMPX-KATARQTJSA-N Thr-Lys-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O XSEPSRUDSPHMPX-KATARQTJSA-N 0.000 description 1
- MCDVZTRGHNXTGK-HJGDQZAQSA-N Thr-Met-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O MCDVZTRGHNXTGK-HJGDQZAQSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- SIEZEMFJLYRUMK-YTWAJWBKSA-N Thr-Met-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N)O SIEZEMFJLYRUMK-YTWAJWBKSA-N 0.000 description 1
- KPNSNVTUVKSBFL-ZJDVBMNYSA-N Thr-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KPNSNVTUVKSBFL-ZJDVBMNYSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- WRQLCVIALDUQEQ-UNQGMJICSA-N Thr-Phe-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WRQLCVIALDUQEQ-UNQGMJICSA-N 0.000 description 1
- GYUUYCIXELGTJS-MEYUZBJRSA-N Thr-Phe-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O GYUUYCIXELGTJS-MEYUZBJRSA-N 0.000 description 1
- BDYBHQWMHYDRKJ-UNQGMJICSA-N Thr-Phe-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N)O BDYBHQWMHYDRKJ-UNQGMJICSA-N 0.000 description 1
- VEIKMWOMUYMMMK-FCLVOEFKSA-N Thr-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)[C@H](O)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 VEIKMWOMUYMMMK-FCLVOEFKSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- MXNAOGFNFNKUPD-JHYOHUSXSA-N Thr-Phe-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MXNAOGFNFNKUPD-JHYOHUSXSA-N 0.000 description 1
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 1
- LKJCABTUFGTPPY-HJGDQZAQSA-N Thr-Pro-Gln Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O LKJCABTUFGTPPY-HJGDQZAQSA-N 0.000 description 1
- JAJOFWABAUKAEJ-QTKMDUPCSA-N Thr-Pro-His Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O JAJOFWABAUKAEJ-QTKMDUPCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- FWTFAZKJORVTIR-VZFHVOOUSA-N Thr-Ser-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O FWTFAZKJORVTIR-VZFHVOOUSA-N 0.000 description 1
- STUAPCLEDMKXKL-LKXGYXEUSA-N Thr-Ser-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O STUAPCLEDMKXKL-LKXGYXEUSA-N 0.000 description 1
- IVDFVBVIVLJJHR-LKXGYXEUSA-N Thr-Ser-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O IVDFVBVIVLJJHR-LKXGYXEUSA-N 0.000 description 1
- BCYUHPXBHCUYBA-CUJWVEQBSA-N Thr-Ser-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O BCYUHPXBHCUYBA-CUJWVEQBSA-N 0.000 description 1
- NQQMWWVVGIXUOX-SVSWQMSJSA-N Thr-Ser-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NQQMWWVVGIXUOX-SVSWQMSJSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- AAZOYLQUEQRUMZ-GSSVUCPTSA-N Thr-Thr-Asn Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O AAZOYLQUEQRUMZ-GSSVUCPTSA-N 0.000 description 1
- VBMOVTMNHWPZJR-SUSMZKCASA-N Thr-Thr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(O)=O VBMOVTMNHWPZJR-SUSMZKCASA-N 0.000 description 1
- VGNLMPBYWWNQFS-ZEILLAHLSA-N Thr-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O VGNLMPBYWWNQFS-ZEILLAHLSA-N 0.000 description 1
- PJCYRZVSACOYSN-ZJDVBMNYSA-N Thr-Thr-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(O)=O PJCYRZVSACOYSN-ZJDVBMNYSA-N 0.000 description 1
- QJIODPFLAASXJC-JHYOHUSXSA-N Thr-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N)O QJIODPFLAASXJC-JHYOHUSXSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- SOUPNXUJAJENFU-SWRJLBSHSA-N Thr-Trp-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O SOUPNXUJAJENFU-SWRJLBSHSA-N 0.000 description 1
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 1
- ZEJBJDHSQPOVJV-UAXMHLISSA-N Thr-Trp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ZEJBJDHSQPOVJV-UAXMHLISSA-N 0.000 description 1
- JNKAYADBODLPMQ-HSHDSVGOSA-N Thr-Trp-Val Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)[C@@H](C)O)=CNC2=C1 JNKAYADBODLPMQ-HSHDSVGOSA-N 0.000 description 1
- BEZTUFWTPVOROW-KJEVXHAQSA-N Thr-Tyr-Arg Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N)O BEZTUFWTPVOROW-KJEVXHAQSA-N 0.000 description 1
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 1
- CJEHCEOXPLASCK-MEYUZBJRSA-N Thr-Tyr-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)[C@H](O)C)CC1=CC=C(O)C=C1 CJEHCEOXPLASCK-MEYUZBJRSA-N 0.000 description 1
- LVRFMARKDGGZMX-IZPVPAKOSA-N Thr-Tyr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=C(O)C=C1 LVRFMARKDGGZMX-IZPVPAKOSA-N 0.000 description 1
- QGVBFDIREUUSHX-IFFSRLJSSA-N Thr-Val-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O QGVBFDIREUUSHX-IFFSRLJSSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- ILUOMMDDGREELW-OSUNSFLBSA-N Thr-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O ILUOMMDDGREELW-OSUNSFLBSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- QNXZCKMXHPULME-ZNSHCXBVSA-N Thr-Val-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O QNXZCKMXHPULME-ZNSHCXBVSA-N 0.000 description 1
- BTAJAOWZCWOHBU-HSHDSVGOSA-N Thr-Val-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)[C@@H](C)O)C(C)C)C(O)=O)=CNC2=C1 BTAJAOWZCWOHBU-HSHDSVGOSA-N 0.000 description 1
- BPGDJSUFQKWUBK-KJEVXHAQSA-N Thr-Val-Tyr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BPGDJSUFQKWUBK-KJEVXHAQSA-N 0.000 description 1
- BRBCKMMXKONBAA-KWBADKCTSA-N Trp-Ala-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 BRBCKMMXKONBAA-KWBADKCTSA-N 0.000 description 1
- CXUFDWZBHKUGKK-CABZTGNLSA-N Trp-Ala-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O)=CNC2=C1 CXUFDWZBHKUGKK-CABZTGNLSA-N 0.000 description 1
- FOAJSVIXYCLTSC-PJODQICGSA-N Trp-Ala-Met Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FOAJSVIXYCLTSC-PJODQICGSA-N 0.000 description 1
- VZBWRZGNEPBRDE-HZUKXOBISA-N Trp-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N VZBWRZGNEPBRDE-HZUKXOBISA-N 0.000 description 1
- HOJPPPKZWFRTHJ-PJODQICGSA-N Trp-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N HOJPPPKZWFRTHJ-PJODQICGSA-N 0.000 description 1
- PNKDNKGMEHJTJQ-BPUTZDHNSA-N Trp-Arg-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(=O)N)C(=O)O)N PNKDNKGMEHJTJQ-BPUTZDHNSA-N 0.000 description 1
- NXAPHBHZCMQORW-FDARSICLSA-N Trp-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NXAPHBHZCMQORW-FDARSICLSA-N 0.000 description 1
- VIWQOOBRKCGSDK-RYQLBKOJSA-N Trp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O VIWQOOBRKCGSDK-RYQLBKOJSA-N 0.000 description 1
- ICNFHVUVCNWUAB-SZMVWBNQSA-N Trp-Arg-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ICNFHVUVCNWUAB-SZMVWBNQSA-N 0.000 description 1
- DXDMNBJJEXYMLA-UBHSHLNASA-N Trp-Asn-Asp Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O)=CNC2=C1 DXDMNBJJEXYMLA-UBHSHLNASA-N 0.000 description 1
- ADBFWLXCCKIXBQ-XIRDDKMYSA-N Trp-Asn-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N ADBFWLXCCKIXBQ-XIRDDKMYSA-N 0.000 description 1
- QNTBGBCOEYNAPV-CWRNSKLLSA-N Trp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)O QNTBGBCOEYNAPV-CWRNSKLLSA-N 0.000 description 1
- DVAAUUVLDFKTAQ-VHWLVUOQSA-N Trp-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N DVAAUUVLDFKTAQ-VHWLVUOQSA-N 0.000 description 1
- ZCPCXVJOMUPIDD-IHPCNDPISA-N Trp-Asp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 ZCPCXVJOMUPIDD-IHPCNDPISA-N 0.000 description 1
- DTPARJBMONKGGC-IHPCNDPISA-N Trp-Cys-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N DTPARJBMONKGGC-IHPCNDPISA-N 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- XZLHHHYSWIYXHD-XIRDDKMYSA-N Trp-Gln-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O XZLHHHYSWIYXHD-XIRDDKMYSA-N 0.000 description 1
- PTAWAMWPRFTACW-SZMVWBNQSA-N Trp-Gln-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PTAWAMWPRFTACW-SZMVWBNQSA-N 0.000 description 1
- AFSYEUHJBVCPEL-JBACZVJFSA-N Trp-Gln-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 AFSYEUHJBVCPEL-JBACZVJFSA-N 0.000 description 1
- AWYXDHQQFPZJNE-QEJZJMRPSA-N Trp-Gln-Ser Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N AWYXDHQQFPZJNE-QEJZJMRPSA-N 0.000 description 1
- CZWIHKFGHICAJX-BPUTZDHNSA-N Trp-Glu-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 CZWIHKFGHICAJX-BPUTZDHNSA-N 0.000 description 1
- OKAMOYTUQMIFJO-JBACZVJFSA-N Trp-Glu-Phe Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=CC=C1 OKAMOYTUQMIFJO-JBACZVJFSA-N 0.000 description 1
- OZUJUVFWMHTWCZ-HOCLYGCPSA-N Trp-Gly-His Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)NCC(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O OZUJUVFWMHTWCZ-HOCLYGCPSA-N 0.000 description 1
- PVRRBEROBJQPJX-SZMVWBNQSA-N Trp-His-Gln Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N PVRRBEROBJQPJX-SZMVWBNQSA-N 0.000 description 1
- OJCSQAWRJKPKFM-TUSQITKMSA-N Trp-His-Trp Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OJCSQAWRJKPKFM-TUSQITKMSA-N 0.000 description 1
- KULBQAVOXHQLIY-HSCHXYMDSA-N Trp-Ile-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O)=CNC2=C1 KULBQAVOXHQLIY-HSCHXYMDSA-N 0.000 description 1
- CMXACOZDEJYZSK-XIRDDKMYSA-N Trp-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N CMXACOZDEJYZSK-XIRDDKMYSA-N 0.000 description 1
- YVXIAOOYAKBAAI-SZMVWBNQSA-N Trp-Leu-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 YVXIAOOYAKBAAI-SZMVWBNQSA-N 0.000 description 1
- UJRIVCPPPMYCNA-HOCLYGCPSA-N Trp-Leu-Gly Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N UJRIVCPPPMYCNA-HOCLYGCPSA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- YTZYHKOSHOXTHA-TUSQITKMSA-N Trp-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=3C4=CC=CC=C4NC=3)CC(C)C)C(O)=O)=CNC2=C1 YTZYHKOSHOXTHA-TUSQITKMSA-N 0.000 description 1
- YPBYQWFZAAQMGW-XIRDDKMYSA-N Trp-Lys-Asn Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(=O)N)C(=O)O)N YPBYQWFZAAQMGW-XIRDDKMYSA-N 0.000 description 1
- NWQCKAPDGQMZQN-IHPCNDPISA-N Trp-Lys-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O NWQCKAPDGQMZQN-IHPCNDPISA-N 0.000 description 1
- UUIYFDAWNBSWPG-IHPCNDPISA-N Trp-Lys-Lys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N UUIYFDAWNBSWPG-IHPCNDPISA-N 0.000 description 1
- FXHOCONKLLUOCF-WDSOQIARSA-N Trp-Lys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N FXHOCONKLLUOCF-WDSOQIARSA-N 0.000 description 1
- KWTRGSQOQHZKIA-PMVMPFDFSA-N Trp-Lys-Tyr Chemical compound C([C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)CCCCN)C(O)=O)C1=CC=C(O)C=C1 KWTRGSQOQHZKIA-PMVMPFDFSA-N 0.000 description 1
- RQLNEFOBQAVGSY-WDSOQIARSA-N Trp-Met-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O RQLNEFOBQAVGSY-WDSOQIARSA-N 0.000 description 1
- LVTKHGUGBGNBPL-UHFFFAOYSA-N Trp-P-1 Chemical compound N1C2=CC=CC=C2C2=C1C(C)=C(N)N=C2C LVTKHGUGBGNBPL-UHFFFAOYSA-N 0.000 description 1
- GIAMKIPJSRZVJB-IHPCNDPISA-N Trp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GIAMKIPJSRZVJB-IHPCNDPISA-N 0.000 description 1
- BIBZRFIKOLGWFQ-XIRDDKMYSA-N Trp-Pro-Gln Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC2=CNC3=CC=CC=C32)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O BIBZRFIKOLGWFQ-XIRDDKMYSA-N 0.000 description 1
- VCGOTJGGBXEBFO-FDARSICLSA-N Trp-Pro-Ile Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VCGOTJGGBXEBFO-FDARSICLSA-N 0.000 description 1
- JGLXHHQUSIULAK-OYDLWJJNSA-N Trp-Pro-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H]3CCCN3C(=O)[C@H](CC=3C4=CC=CC=C4NC=3)N)C(O)=O)=CNC2=C1 JGLXHHQUSIULAK-OYDLWJJNSA-N 0.000 description 1
- LORJKYIPJIRIRT-BVSLBCMMSA-N Trp-Pro-Tyr Chemical compound C([C@H](NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC=1C2=CC=CC=C2NC=1)N)C(O)=O)C1=CC=C(O)C=C1 LORJKYIPJIRIRT-BVSLBCMMSA-N 0.000 description 1
- SUEGAFMNTXXNLR-WFBYXXMGSA-N Trp-Ser-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O SUEGAFMNTXXNLR-WFBYXXMGSA-N 0.000 description 1
- IQIRAJGHFRVFEL-UBHSHLNASA-N Trp-Ser-Cys Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N IQIRAJGHFRVFEL-UBHSHLNASA-N 0.000 description 1
- HWCBFXAWVTXXHZ-NYVOZVTQSA-N Trp-Ser-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)N HWCBFXAWVTXXHZ-NYVOZVTQSA-N 0.000 description 1
- HIZDHWHVOLUGOX-BPUTZDHNSA-N Trp-Ser-Val Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O HIZDHWHVOLUGOX-BPUTZDHNSA-N 0.000 description 1
- YCQXZDHDSUHUSG-FJHTZYQYSA-N Trp-Thr-Ala Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 YCQXZDHDSUHUSG-FJHTZYQYSA-N 0.000 description 1
- WBZOZLNLXVBCNW-LTHWPDAASA-N Trp-Thr-Ile Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)CC)C(O)=O)[C@@H](C)O)=CNC2=C1 WBZOZLNLXVBCNW-LTHWPDAASA-N 0.000 description 1
- QJIOKZXDGFZQJP-OYDLWJJNSA-N Trp-Trp-Arg Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QJIOKZXDGFZQJP-OYDLWJJNSA-N 0.000 description 1
- RPTAWXPQXXCUGL-OYDLWJJNSA-N Trp-Trp-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](Cc1c[nH]c2ccccc12)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O RPTAWXPQXXCUGL-OYDLWJJNSA-N 0.000 description 1
- DVLHKUWLNKDINO-PMVMPFDFSA-N Trp-Tyr-Leu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O DVLHKUWLNKDINO-PMVMPFDFSA-N 0.000 description 1
- UIRVSEPRMWDVEW-RNXOBYDBSA-N Trp-Tyr-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N UIRVSEPRMWDVEW-RNXOBYDBSA-N 0.000 description 1
- ICPRIGUXAFULPH-ILWGZMRPSA-N Trp-Tyr-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=C(C=C2)O)NC(=O)[C@H](CC3=CNC4=CC=CC=C43)N)C(=O)O ICPRIGUXAFULPH-ILWGZMRPSA-N 0.000 description 1
- XKTWZYNTLXITCY-QRTARXTBSA-N Trp-Val-Asn Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O)=CNC2=C1 XKTWZYNTLXITCY-QRTARXTBSA-N 0.000 description 1
- UUZYQOUJTORBQO-ZVZYQTTQSA-N Trp-Val-Gln Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)=CNC2=C1 UUZYQOUJTORBQO-ZVZYQTTQSA-N 0.000 description 1
- XQMGDVVKFRLQKH-BBRMVZONSA-N Trp-Val-Gly Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O)=CNC2=C1 XQMGDVVKFRLQKH-BBRMVZONSA-N 0.000 description 1
- NMOIRIIIUVELLY-WDSOQIARSA-N Trp-Val-Leu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C(O)=O)C(C)C)=CNC2=C1 NMOIRIIIUVELLY-WDSOQIARSA-N 0.000 description 1
- 108090000631 Trypsin Proteins 0.000 description 1
- 102000004142 Trypsin Human genes 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NIHNMOSRSAYZIT-BPNCWPANSA-N Tyr-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NIHNMOSRSAYZIT-BPNCWPANSA-N 0.000 description 1
- JONPRIHUYSPIMA-UWJYBYFXSA-N Tyr-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 JONPRIHUYSPIMA-UWJYBYFXSA-N 0.000 description 1
- BURPTJBFWIOHEY-UWJYBYFXSA-N Tyr-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 BURPTJBFWIOHEY-UWJYBYFXSA-N 0.000 description 1
- QJBWZNTWJSZUOY-UWJYBYFXSA-N Tyr-Ala-Cys Chemical compound C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QJBWZNTWJSZUOY-UWJYBYFXSA-N 0.000 description 1
- XGEUYEOEZYFHRL-KKXDTOCCSA-N Tyr-Ala-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 XGEUYEOEZYFHRL-KKXDTOCCSA-N 0.000 description 1
- LGEYOIQBBIPHQN-UWJYBYFXSA-N Tyr-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 LGEYOIQBBIPHQN-UWJYBYFXSA-N 0.000 description 1
- HSVPZJLMPLMPOX-BPNCWPANSA-N Tyr-Arg-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O HSVPZJLMPLMPOX-BPNCWPANSA-N 0.000 description 1
- HKIUVWMZYFBIHG-KKUMJFAQSA-N Tyr-Arg-Gln Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N)O HKIUVWMZYFBIHG-KKUMJFAQSA-N 0.000 description 1
- AKXBNSZMYAOGLS-STQMWFEESA-N Tyr-Arg-Gly Chemical compound NC(N)=NCCC[C@@H](C(=O)NCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AKXBNSZMYAOGLS-STQMWFEESA-N 0.000 description 1
- KDGFPPHLXCEQRN-STECZYCISA-N Tyr-Arg-Ile Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O KDGFPPHLXCEQRN-STECZYCISA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- IIJWXEUNETVJPV-IHRRRGAJSA-N Tyr-Arg-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N)O IIJWXEUNETVJPV-IHRRRGAJSA-N 0.000 description 1
- CKKFTIQYURNSEI-IHRRRGAJSA-N Tyr-Asn-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CKKFTIQYURNSEI-IHRRRGAJSA-N 0.000 description 1
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 1
- XMNDQSYABVWZRK-BZSNNMDCSA-N Tyr-Asn-Phe Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O XMNDQSYABVWZRK-BZSNNMDCSA-N 0.000 description 1
- VTFWAGGJDRSQFG-MELADBBJSA-N Tyr-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O VTFWAGGJDRSQFG-MELADBBJSA-N 0.000 description 1
- SCCKSNREWHMKOJ-SRVKXCTJSA-N Tyr-Asn-Ser Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O SCCKSNREWHMKOJ-SRVKXCTJSA-N 0.000 description 1
- BARBHMSSVWPKPZ-IHRRRGAJSA-N Tyr-Asp-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O BARBHMSSVWPKPZ-IHRRRGAJSA-N 0.000 description 1
- BEIGSKUPTIFYRZ-SRVKXCTJSA-N Tyr-Asp-Asp Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O BEIGSKUPTIFYRZ-SRVKXCTJSA-N 0.000 description 1
- YRBHLWWGSSQICE-IHRRRGAJSA-N Tyr-Asp-Met Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O YRBHLWWGSSQICE-IHRRRGAJSA-N 0.000 description 1
- WPVGRKLNHJJCEN-BZSNNMDCSA-N Tyr-Asp-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 WPVGRKLNHJJCEN-BZSNNMDCSA-N 0.000 description 1
- MNMYOSZWCKYEDI-JRQIVUDYSA-N Tyr-Asp-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MNMYOSZWCKYEDI-JRQIVUDYSA-N 0.000 description 1
- QOIKZODVIPOPDD-AVGNSLFASA-N Tyr-Cys-Gln Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(N)=O)C(O)=O QOIKZODVIPOPDD-AVGNSLFASA-N 0.000 description 1
- FQNUWOHNGJWNLM-QWRGUYRKSA-N Tyr-Cys-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FQNUWOHNGJWNLM-QWRGUYRKSA-N 0.000 description 1
- HZZKQZDUIKVFDZ-AVGNSLFASA-N Tyr-Gln-Ser Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)O HZZKQZDUIKVFDZ-AVGNSLFASA-N 0.000 description 1
- MPKPIWFFDWVJGC-IRIUXVKKSA-N Tyr-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O MPKPIWFFDWVJGC-IRIUXVKKSA-N 0.000 description 1
- KEHKBBUYZWAMHL-DZKIICNBSA-N Tyr-Gln-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O KEHKBBUYZWAMHL-DZKIICNBSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- IWRMTNJCCMEBEX-AVGNSLFASA-N Tyr-Glu-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O IWRMTNJCCMEBEX-AVGNSLFASA-N 0.000 description 1
- SLCSPPCQWUHPPO-JYJNAYRXSA-N Tyr-Glu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 SLCSPPCQWUHPPO-JYJNAYRXSA-N 0.000 description 1
- ZRPLVTZTKPPSBT-AVGNSLFASA-N Tyr-Glu-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZRPLVTZTKPPSBT-AVGNSLFASA-N 0.000 description 1
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 1
- PMDWYLVWHRTJIW-STQMWFEESA-N Tyr-Gly-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PMDWYLVWHRTJIW-STQMWFEESA-N 0.000 description 1
- CDHQEOXPWBDFPL-QWRGUYRKSA-N Tyr-Gly-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDHQEOXPWBDFPL-QWRGUYRKSA-N 0.000 description 1
- IJUTXXAXQODRMW-KBPBESRZSA-N Tyr-Gly-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N)O IJUTXXAXQODRMW-KBPBESRZSA-N 0.000 description 1
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 1
- KCPFDGNYAMKZQP-KBPBESRZSA-N Tyr-Gly-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O KCPFDGNYAMKZQP-KBPBESRZSA-N 0.000 description 1
- OHNXAUCZVWGTLL-KKUMJFAQSA-N Tyr-His-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CS)C(=O)O)N)O OHNXAUCZVWGTLL-KKUMJFAQSA-N 0.000 description 1
- WPXKRJVHBXYLDT-JUKXBJQTSA-N Tyr-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=C(C=C2)O)N WPXKRJVHBXYLDT-JUKXBJQTSA-N 0.000 description 1
- FBHBVXUBTYVCRU-BZSNNMDCSA-N Tyr-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CN=CN1 FBHBVXUBTYVCRU-BZSNNMDCSA-N 0.000 description 1
- NXRGXTBPMOGFID-CFMVVWHZSA-N Tyr-Ile-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(N)=O)C(O)=O NXRGXTBPMOGFID-CFMVVWHZSA-N 0.000 description 1
- JJNXZIPLIXIGBX-HJPIBITLSA-N Tyr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JJNXZIPLIXIGBX-HJPIBITLSA-N 0.000 description 1
- GGXUDPQWAWRINY-XEGUGMAKSA-N Tyr-Ile-Gly Chemical compound OC(=O)CNC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 GGXUDPQWAWRINY-XEGUGMAKSA-N 0.000 description 1
- AVIQBBOOTZENLH-KKUMJFAQSA-N Tyr-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N AVIQBBOOTZENLH-KKUMJFAQSA-N 0.000 description 1
- NKUGCYDFQKFVOJ-JYJNAYRXSA-N Tyr-Leu-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NKUGCYDFQKFVOJ-JYJNAYRXSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- QHLIUFUEUDFAOT-MGHWNKPDSA-N Tyr-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=C(C=C1)O)N QHLIUFUEUDFAOT-MGHWNKPDSA-N 0.000 description 1
- PRONOHBTMLNXCZ-BZSNNMDCSA-N Tyr-Leu-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 PRONOHBTMLNXCZ-BZSNNMDCSA-N 0.000 description 1
- DMWNPLOERDAHSY-MEYUZBJRSA-N Tyr-Leu-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DMWNPLOERDAHSY-MEYUZBJRSA-N 0.000 description 1
- OLYXUGBVBGSZDN-ACRUOGEOSA-N Tyr-Leu-Tyr Chemical compound C([C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=C(O)C=C1 OLYXUGBVBGSZDN-ACRUOGEOSA-N 0.000 description 1
- JAGGEZACYAAMIL-CQDKDKBSSA-N Tyr-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC1=CC=C(C=C1)O)N JAGGEZACYAAMIL-CQDKDKBSSA-N 0.000 description 1
- GITNQBVCEQBDQC-KKUMJFAQSA-N Tyr-Lys-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(N)=O)C(O)=O GITNQBVCEQBDQC-KKUMJFAQSA-N 0.000 description 1
- ZOBLBMGJKVJVEV-BZSNNMDCSA-N Tyr-Lys-Lys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(=O)O)N)O ZOBLBMGJKVJVEV-BZSNNMDCSA-N 0.000 description 1
- PGEFRHBWGOJPJT-KKUMJFAQSA-N Tyr-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O PGEFRHBWGOJPJT-KKUMJFAQSA-N 0.000 description 1
- KYPMKDGKAYQCHO-RYUDHWBXSA-N Tyr-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 KYPMKDGKAYQCHO-RYUDHWBXSA-N 0.000 description 1
- YSGAPESOXHFTQY-IHRRRGAJSA-N Tyr-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N YSGAPESOXHFTQY-IHRRRGAJSA-N 0.000 description 1
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 1
- XDGPTBVOSHKDFT-KKUMJFAQSA-N Tyr-Met-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(O)=O XDGPTBVOSHKDFT-KKUMJFAQSA-N 0.000 description 1
- OFHKXNKJXURPSY-ULQDDVLXSA-N Tyr-Met-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O OFHKXNKJXURPSY-ULQDDVLXSA-N 0.000 description 1
- KHUVIWRRFMPVHD-JYJNAYRXSA-N Tyr-Met-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O KHUVIWRRFMPVHD-JYJNAYRXSA-N 0.000 description 1
- ZMKDQRJLMRZHRI-ACRUOGEOSA-N Tyr-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N ZMKDQRJLMRZHRI-ACRUOGEOSA-N 0.000 description 1
- JXGUUJMPCRXMSO-HJOGWXRNSA-N Tyr-Phe-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 JXGUUJMPCRXMSO-HJOGWXRNSA-N 0.000 description 1
- PHKQVWWHRYUCJL-HJOGWXRNSA-N Tyr-Phe-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O PHKQVWWHRYUCJL-HJOGWXRNSA-N 0.000 description 1
- CDBXVDXSLPLFMD-BPNCWPANSA-N Tyr-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=C(O)C=C1 CDBXVDXSLPLFMD-BPNCWPANSA-N 0.000 description 1
- PYJKETPLFITNKS-IHRRRGAJSA-N Tyr-Pro-Asn Chemical compound N[C@@H](Cc1ccc(O)cc1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O PYJKETPLFITNKS-IHRRRGAJSA-N 0.000 description 1
- SZEIFUXUTBBQFQ-STQMWFEESA-N Tyr-Pro-Gly Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SZEIFUXUTBBQFQ-STQMWFEESA-N 0.000 description 1
- BIWVVOHTKDLRMP-ULQDDVLXSA-N Tyr-Pro-Leu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O BIWVVOHTKDLRMP-ULQDDVLXSA-N 0.000 description 1
- XGZBEGGGAUQBMB-KJEVXHAQSA-N Tyr-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC2=CC=C(C=C2)O)N)O XGZBEGGGAUQBMB-KJEVXHAQSA-N 0.000 description 1
- RWOKVQUCENPXGE-IHRRRGAJSA-N Tyr-Ser-Arg Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RWOKVQUCENPXGE-IHRRRGAJSA-N 0.000 description 1
- KWKJGBHDYJOVCR-SRVKXCTJSA-N Tyr-Ser-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N)O KWKJGBHDYJOVCR-SRVKXCTJSA-N 0.000 description 1
- HRHYJNLMIJWGLF-BZSNNMDCSA-N Tyr-Ser-Phe Chemical compound C([C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=C(O)C=C1 HRHYJNLMIJWGLF-BZSNNMDCSA-N 0.000 description 1
- UMSZZGTXGKHTFJ-SRVKXCTJSA-N Tyr-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 UMSZZGTXGKHTFJ-SRVKXCTJSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- XUIOBCQESNDTDE-FQPOAREZSA-N Tyr-Thr-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XUIOBCQESNDTDE-FQPOAREZSA-N 0.000 description 1
- QFHRUCJIRVILCK-YJRXYDGGSA-N Tyr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O QFHRUCJIRVILCK-YJRXYDGGSA-N 0.000 description 1
- RIVVDNTUSRVTQT-IRIUXVKKSA-N Tyr-Thr-Gln Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O RIVVDNTUSRVTQT-IRIUXVKKSA-N 0.000 description 1
- XFEMMSGONWQACR-KJEVXHAQSA-N Tyr-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O XFEMMSGONWQACR-KJEVXHAQSA-N 0.000 description 1
- WQOHKVRQDLNDIL-YJRXYDGGSA-N Tyr-Thr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O WQOHKVRQDLNDIL-YJRXYDGGSA-N 0.000 description 1
- KLQPIEVIKOQRAW-IZPVPAKOSA-N Tyr-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N)O KLQPIEVIKOQRAW-IZPVPAKOSA-N 0.000 description 1
- AKRHKDCELJLTMD-BVSLBCMMSA-N Tyr-Trp-Arg Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N AKRHKDCELJLTMD-BVSLBCMMSA-N 0.000 description 1
- NUQZCPSZHGIYTA-HKUYNNGSSA-N Tyr-Trp-Gly Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC3=CC=C(C=C3)O)N NUQZCPSZHGIYTA-HKUYNNGSSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- OJCISMMNNUNNJA-BZSNNMDCSA-N Tyr-Tyr-Asp Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N[C@@H](CC(O)=O)C(O)=O)C1=CC=C(O)C=C1 OJCISMMNNUNNJA-BZSNNMDCSA-N 0.000 description 1
- AGDDLOQMXUQPDY-BZSNNMDCSA-N Tyr-Tyr-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O AGDDLOQMXUQPDY-BZSNNMDCSA-N 0.000 description 1
- NVJCMGGZHOJNBU-UFYCRDLUSA-N Tyr-Val-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N NVJCMGGZHOJNBU-UFYCRDLUSA-N 0.000 description 1
- RVGVIWNHABGIFH-IHRRRGAJSA-N Tyr-Val-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O RVGVIWNHABGIFH-IHRRRGAJSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108010064997 VPY tripeptide Proteins 0.000 description 1
- 241000700618 Vaccinia virus Species 0.000 description 1
- LTFLDDDGWOVIHY-NAKRPEOUSA-N Val-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C)NC(=O)[C@H](C(C)C)N LTFLDDDGWOVIHY-NAKRPEOUSA-N 0.000 description 1
- UUYCNAXCCDNULB-QXEWZRGKSA-N Val-Arg-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(N)=O)C(O)=O UUYCNAXCCDNULB-QXEWZRGKSA-N 0.000 description 1
- JOQSQZFKFYJKKJ-GUBZILKMSA-N Val-Arg-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CS)C(=O)O)N JOQSQZFKFYJKKJ-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- COYSIHFOCOMGCF-WPRPVWTQSA-N Val-Arg-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-WPRPVWTQSA-N 0.000 description 1
- COYSIHFOCOMGCF-UHFFFAOYSA-N Val-Arg-Gly Natural products CC(C)C(N)C(=O)NC(C(=O)NCC(O)=O)CCCN=C(N)N COYSIHFOCOMGCF-UHFFFAOYSA-N 0.000 description 1
- PAPWZOJOLKZEFR-AVGNSLFASA-N Val-Arg-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCCN)C(=O)O)N PAPWZOJOLKZEFR-AVGNSLFASA-N 0.000 description 1
- IVXJODPZRWHCCR-JYJNAYRXSA-N Val-Arg-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N IVXJODPZRWHCCR-JYJNAYRXSA-N 0.000 description 1
- VMRFIKXKOFNMHW-GUBZILKMSA-N Val-Arg-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N VMRFIKXKOFNMHW-GUBZILKMSA-N 0.000 description 1
- CWOSXNKDOACNJN-BZSNNMDCSA-N Val-Arg-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N CWOSXNKDOACNJN-BZSNNMDCSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- GXAZTLJYINLMJL-LAEOZQHASA-N Val-Asn-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N GXAZTLJYINLMJL-LAEOZQHASA-N 0.000 description 1
- ZMDCGGKHRKNWKD-LAEOZQHASA-N Val-Asn-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N ZMDCGGKHRKNWKD-LAEOZQHASA-N 0.000 description 1
- UDNYEPLJTRDMEJ-RCOVLWMOSA-N Val-Asn-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)NCC(=O)O)N UDNYEPLJTRDMEJ-RCOVLWMOSA-N 0.000 description 1
- LIQJSDDOULTANC-QSFUFRPTSA-N Val-Asn-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N LIQJSDDOULTANC-QSFUFRPTSA-N 0.000 description 1
- JLFKWDAZBRYCGX-ZKWXMUAHSA-N Val-Asn-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N JLFKWDAZBRYCGX-ZKWXMUAHSA-N 0.000 description 1
- CGGVNFJRZJUVAE-BYULHYEWSA-N Val-Asp-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N CGGVNFJRZJUVAE-BYULHYEWSA-N 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- VUTHNLMCXKLLFI-LAEOZQHASA-N Val-Asp-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VUTHNLMCXKLLFI-LAEOZQHASA-N 0.000 description 1
- ZQGPWORGSNRQLN-NHCYSSNCSA-N Val-Asp-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZQGPWORGSNRQLN-NHCYSSNCSA-N 0.000 description 1
- XLDYBRXERHITNH-QSFUFRPTSA-N Val-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)C(C)C XLDYBRXERHITNH-QSFUFRPTSA-N 0.000 description 1
- TZVUSFMQWPWHON-NHCYSSNCSA-N Val-Asp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C(C)C)N TZVUSFMQWPWHON-NHCYSSNCSA-N 0.000 description 1
- BMGOFDMKDVVGJG-NHCYSSNCSA-N Val-Asp-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCCN)C(=O)O)N BMGOFDMKDVVGJG-NHCYSSNCSA-N 0.000 description 1
- YODDULVCGFQRFZ-ZKWXMUAHSA-N Val-Asp-Ser Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O YODDULVCGFQRFZ-ZKWXMUAHSA-N 0.000 description 1
- CWSIBTLMMQLPPZ-FXQIFTODSA-N Val-Cys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N CWSIBTLMMQLPPZ-FXQIFTODSA-N 0.000 description 1
- FRUYSSRPJXNRRB-GUBZILKMSA-N Val-Cys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FRUYSSRPJXNRRB-GUBZILKMSA-N 0.000 description 1
- KOPBYUSPXBQIHD-NRPADANISA-N Val-Cys-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KOPBYUSPXBQIHD-NRPADANISA-N 0.000 description 1
- FPCIBLUVDNXPJO-XPUUQOCRSA-N Val-Cys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CS)C(=O)NCC(O)=O FPCIBLUVDNXPJO-XPUUQOCRSA-N 0.000 description 1
- LHADRQBREKTRLR-DCAQKATOSA-N Val-Cys-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N LHADRQBREKTRLR-DCAQKATOSA-N 0.000 description 1
- HIZMLPKDJAXDRG-FXQIFTODSA-N Val-Cys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N HIZMLPKDJAXDRG-FXQIFTODSA-N 0.000 description 1
- XJFXZQKJQGYFMM-GUBZILKMSA-N Val-Cys-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)O)N XJFXZQKJQGYFMM-GUBZILKMSA-N 0.000 description 1
- XTAUQCGQFJQGEJ-NHCYSSNCSA-N Val-Gln-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N XTAUQCGQFJQGEJ-NHCYSSNCSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- CPTQYHDSVGVGDZ-UKJIMTQDSA-N Val-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C(C)C)N CPTQYHDSVGVGDZ-UKJIMTQDSA-N 0.000 description 1
- ZEVNVXYRZRIRCH-GVXVVHGQSA-N Val-Gln-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N ZEVNVXYRZRIRCH-GVXVVHGQSA-N 0.000 description 1
- AGKDVLSDNSTLFA-UMNHJUIQSA-N Val-Gln-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N AGKDVLSDNSTLFA-UMNHJUIQSA-N 0.000 description 1
- PWRITNSESKQTPW-NRPADANISA-N Val-Gln-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N PWRITNSESKQTPW-NRPADANISA-N 0.000 description 1
- UZDHNIJRRTUKKC-DLOVCJGASA-N Val-Gln-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N UZDHNIJRRTUKKC-DLOVCJGASA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- SZTTYWIUCGSURQ-AUTRQRHGSA-N Val-Glu-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O SZTTYWIUCGSURQ-AUTRQRHGSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- MHAHQDBEIDPFQS-NHCYSSNCSA-N Val-Glu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)C(C)C MHAHQDBEIDPFQS-NHCYSSNCSA-N 0.000 description 1
- JTWIMNMUYLQNPI-WPRPVWTQSA-N Val-Gly-Arg Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N JTWIMNMUYLQNPI-WPRPVWTQSA-N 0.000 description 1
- NXRAUQGGHPCJIB-RCOVLWMOSA-N Val-Gly-Asn Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O NXRAUQGGHPCJIB-RCOVLWMOSA-N 0.000 description 1
- DJEVQCWNMQOABE-RCOVLWMOSA-N Val-Gly-Asp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC(=O)O)C(=O)O)N DJEVQCWNMQOABE-RCOVLWMOSA-N 0.000 description 1
- BEGDZYNDCNEGJZ-XVKPBYJWSA-N Val-Gly-Gln Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O BEGDZYNDCNEGJZ-XVKPBYJWSA-N 0.000 description 1
- PMDOQZFYGWZSTK-LSJOCFKGSA-N Val-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)C(C)C PMDOQZFYGWZSTK-LSJOCFKGSA-N 0.000 description 1
- SYOMXKPPFZRELL-ONGXEEELSA-N Val-Gly-Lys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)O)N SYOMXKPPFZRELL-ONGXEEELSA-N 0.000 description 1
- LAYSXAOGWHKNED-XPUUQOCRSA-N Val-Gly-Ser Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O LAYSXAOGWHKNED-XPUUQOCRSA-N 0.000 description 1
- KZKMBGXCNLPYKD-YEPSODPASA-N Val-Gly-Thr Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O KZKMBGXCNLPYKD-YEPSODPASA-N 0.000 description 1
- JVYIGCARISMLMV-HOCLYGCPSA-N Val-Gly-Trp Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N JVYIGCARISMLMV-HOCLYGCPSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- WJVLTYSHNXRCLT-NHCYSSNCSA-N Val-His-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC(=O)O)C(=O)O)N WJVLTYSHNXRCLT-NHCYSSNCSA-N 0.000 description 1
- SDSCOOZQQGUQFC-GVXVVHGQSA-N Val-His-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N SDSCOOZQQGUQFC-GVXVVHGQSA-N 0.000 description 1
- PTFPUAXGIKTVNN-ONGXEEELSA-N Val-His-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)NCC(=O)O)N PTFPUAXGIKTVNN-ONGXEEELSA-N 0.000 description 1
- ZIGZPYJXIWLQFC-QTKMDUPCSA-N Val-His-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](C(C)C)N)O ZIGZPYJXIWLQFC-QTKMDUPCSA-N 0.000 description 1
- JPPXDMBGXJBTIB-ULQDDVLXSA-N Val-His-Tyr Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O)N JPPXDMBGXJBTIB-ULQDDVLXSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- LKUDRJSNRWVGMS-QSFUFRPTSA-N Val-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LKUDRJSNRWVGMS-QSFUFRPTSA-N 0.000 description 1
- PYXQBKJPHNCTNW-CYDGBPFRSA-N Val-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](C(C)C)N PYXQBKJPHNCTNW-CYDGBPFRSA-N 0.000 description 1
- APQIVBCUIUDSMB-OSUNSFLBSA-N Val-Ile-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N APQIVBCUIUDSMB-OSUNSFLBSA-N 0.000 description 1
- MYLNLEIZWHVENT-VKOGCVSHSA-N Val-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](C(C)C)N MYLNLEIZWHVENT-VKOGCVSHSA-N 0.000 description 1
- BZWUSZGQOILYEU-STECZYCISA-N Val-Ile-Tyr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 BZWUSZGQOILYEU-STECZYCISA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- DAVNYIUELQBTAP-XUXIUFHCSA-N Val-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)N DAVNYIUELQBTAP-XUXIUFHCSA-N 0.000 description 1
- RFKJNTRMXGCKFE-FHWLQOOXSA-N Val-Leu-Trp Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC(C)C)C(O)=O)=CNC2=C1 RFKJNTRMXGCKFE-FHWLQOOXSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- IJGPOONOTBNTFS-GVXVVHGQSA-N Val-Lys-Glu Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(O)=O IJGPOONOTBNTFS-GVXVVHGQSA-N 0.000 description 1
- QRVPEKJBBRYISE-XUXIUFHCSA-N Val-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N QRVPEKJBBRYISE-XUXIUFHCSA-N 0.000 description 1
- HPANGHISDXDUQY-ULQDDVLXSA-N Val-Lys-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N HPANGHISDXDUQY-ULQDDVLXSA-N 0.000 description 1
- JAKHAONCJJZVHT-DCAQKATOSA-N Val-Lys-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N JAKHAONCJJZVHT-DCAQKATOSA-N 0.000 description 1
- MBGFDZDWMDLXHQ-GUBZILKMSA-N Val-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MBGFDZDWMDLXHQ-GUBZILKMSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- VENKIVFKIPGEJN-NHCYSSNCSA-N Val-Met-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N VENKIVFKIPGEJN-NHCYSSNCSA-N 0.000 description 1
- MGVYZTPLGXPVQB-CYDGBPFRSA-N Val-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](C(C)C)N MGVYZTPLGXPVQB-CYDGBPFRSA-N 0.000 description 1
- RSGHLMMKXJGCMK-JYJNAYRXSA-N Val-Met-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N RSGHLMMKXJGCMK-JYJNAYRXSA-N 0.000 description 1
- VNGKMNPAENRGDC-JYJNAYRXSA-N Val-Phe-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 VNGKMNPAENRGDC-JYJNAYRXSA-N 0.000 description 1
- UZFNHAXYMICTBU-DZKIICNBSA-N Val-Phe-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N UZFNHAXYMICTBU-DZKIICNBSA-N 0.000 description 1
- YLRAFVVWZRSZQC-DZKIICNBSA-N Val-Phe-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N YLRAFVVWZRSZQC-DZKIICNBSA-N 0.000 description 1
- UXODSMTVPWXHBT-ULQDDVLXSA-N Val-Phe-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N UXODSMTVPWXHBT-ULQDDVLXSA-N 0.000 description 1
- HJSLDXZAZGFPDK-ULQDDVLXSA-N Val-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](C(C)C)N HJSLDXZAZGFPDK-ULQDDVLXSA-N 0.000 description 1
- MHHAWNPHDLCPLF-ULQDDVLXSA-N Val-Phe-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=CC=C1 MHHAWNPHDLCPLF-ULQDDVLXSA-N 0.000 description 1
- ZEBRMWPTJNHXAJ-JYJNAYRXSA-N Val-Phe-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)O)N ZEBRMWPTJNHXAJ-JYJNAYRXSA-N 0.000 description 1
- BCBFMJYTNKDALA-UFYCRDLUSA-N Val-Phe-Phe Chemical compound N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O BCBFMJYTNKDALA-UFYCRDLUSA-N 0.000 description 1
- VCIYTVOBLZHFSC-XHSDSOJGSA-N Val-Phe-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N2CCC[C@@H]2C(=O)O)N VCIYTVOBLZHFSC-XHSDSOJGSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- JMCOXFSCTGKLLB-FKBYEOEOSA-N Val-Phe-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC2=CNC3=CC=CC=C32)C(=O)O)N JMCOXFSCTGKLLB-FKBYEOEOSA-N 0.000 description 1
- LGXUZJIQCGXKGZ-QXEWZRGKSA-N Val-Pro-Asn Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)N)C(=O)O)N LGXUZJIQCGXKGZ-QXEWZRGKSA-N 0.000 description 1
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 1
- VSCIANXXVZOYOC-AVGNSLFASA-N Val-Pro-His Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N VSCIANXXVZOYOC-AVGNSLFASA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- MIKHIIQMRFYVOR-RCWTZXSCSA-N Val-Pro-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](C(C)C)N)O MIKHIIQMRFYVOR-RCWTZXSCSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- LTTQCQRTSHJPPL-ZKWXMUAHSA-N Val-Ser-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N LTTQCQRTSHJPPL-ZKWXMUAHSA-N 0.000 description 1
- QTPQHINADBYBNA-DCAQKATOSA-N Val-Ser-Lys Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCCN QTPQHINADBYBNA-DCAQKATOSA-N 0.000 description 1
- QZKVWWIUSQGWMY-IHRRRGAJSA-N Val-Ser-Phe Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QZKVWWIUSQGWMY-IHRRRGAJSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- PQSNETRGCRUOGP-KKHAAJSZSA-N Val-Thr-Asn Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(N)=O PQSNETRGCRUOGP-KKHAAJSZSA-N 0.000 description 1
- BZDGLJPROOOUOZ-XGEHTFHBSA-N Val-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C(C)C)N)O BZDGLJPROOOUOZ-XGEHTFHBSA-N 0.000 description 1
- WUFHZIRMAZZWRS-OSUNSFLBSA-N Val-Thr-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C(C)C)N WUFHZIRMAZZWRS-OSUNSFLBSA-N 0.000 description 1
- PDDJTOSAVNRJRH-UNQGMJICSA-N Val-Thr-Phe Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)NC(=O)[C@H](C(C)C)N)O PDDJTOSAVNRJRH-UNQGMJICSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- HTONZBWRYUKUKC-RCWTZXSCSA-N Val-Thr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O HTONZBWRYUKUKC-RCWTZXSCSA-N 0.000 description 1
- IRAUYEAFPFPVND-UVBJJODRSA-N Val-Trp-Ala Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C)C(O)=O)=CNC2=C1 IRAUYEAFPFPVND-UVBJJODRSA-N 0.000 description 1
- SUGRIIAOLCDLBD-ZOBUZTSGSA-N Val-Trp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(=O)O)C(=O)O)N SUGRIIAOLCDLBD-ZOBUZTSGSA-N 0.000 description 1
- QHSSPPHOHJSTML-HOCLYGCPSA-N Val-Trp-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)NCC(=O)O)N QHSSPPHOHJSTML-HOCLYGCPSA-N 0.000 description 1
- KJFBXCFOPAKPTM-BZSNNMDCSA-N Val-Trp-Val Chemical compound C1=CC=C2C(C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O)=CNC2=C1 KJFBXCFOPAKPTM-BZSNNMDCSA-N 0.000 description 1
- MIAZWUMFUURQNP-YDHLFZDLSA-N Val-Tyr-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N MIAZWUMFUURQNP-YDHLFZDLSA-N 0.000 description 1
- DOBHJKVVACOQTN-DZKIICNBSA-N Val-Tyr-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)CC1=CC=C(O)C=C1 DOBHJKVVACOQTN-DZKIICNBSA-N 0.000 description 1
- ZNGPROMGGGFOAA-JYJNAYRXSA-N Val-Tyr-Val Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=C(O)C=C1 ZNGPROMGGGFOAA-JYJNAYRXSA-N 0.000 description 1
- DFQZDQPLWBSFEJ-LSJOCFKGSA-N Val-Val-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(=O)N)C(=O)O)N DFQZDQPLWBSFEJ-LSJOCFKGSA-N 0.000 description 1
- SSKKGOWRPNIVDW-AVGNSLFASA-N Val-Val-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SSKKGOWRPNIVDW-AVGNSLFASA-N 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 230000005856 abnormality Effects 0.000 description 1
- 235000011054 acetic acid Nutrition 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 150000008065 acid anhydrides Chemical class 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 239000007825 activation reagent Substances 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 238000013019 agitation Methods 0.000 description 1
- 108010028939 alanyl-alanyl-lysyl-alanine Proteins 0.000 description 1
- 108010084217 alanyl-glutamyl-aspartyl-glycine Proteins 0.000 description 1
- 108010045350 alanyl-tyrosyl-alanine Proteins 0.000 description 1
- 108010011559 alanylphenylalanine Proteins 0.000 description 1
- 108010070783 alanyltyrosine Proteins 0.000 description 1
- 150000001298 alcohols Chemical class 0.000 description 1
- 125000000217 alkyl group Chemical group 0.000 description 1
- BJEPYKJPYRNKOW-UHFFFAOYSA-N alpha-hydroxysuccinic acid Natural products OC(=O)C(O)CC(O)=O BJEPYKJPYRNKOW-UHFFFAOYSA-N 0.000 description 1
- 229910000147 aluminium phosphate Inorganic materials 0.000 description 1
- 150000003862 amino acid derivatives Chemical class 0.000 description 1
- 125000000539 amino acid group Chemical group 0.000 description 1
- 239000000074 antisense oligonucleotide Substances 0.000 description 1
- 238000012230 antisense oligonucleotides Methods 0.000 description 1
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 1
- 108010089975 arginyl-glycyl-aspartyl-serine Proteins 0.000 description 1
- 108010091818 arginyl-glycyl-aspartyl-valine Proteins 0.000 description 1
- 108010036533 arginylvaline Proteins 0.000 description 1
- 125000003710 aryl alkyl group Chemical group 0.000 description 1
- 125000003118 aryl group Chemical group 0.000 description 1
- 108010027234 aspartyl-glycyl-glutamyl-alanine Proteins 0.000 description 1
- 239000012752 auxiliary agent Substances 0.000 description 1
- 230000001580 bacterial effect Effects 0.000 description 1
- SRSXLGNVWSONIS-UHFFFAOYSA-N benzenesulfonic acid Chemical compound OS(=O)(=O)C1=CC=CC=C1 SRSXLGNVWSONIS-UHFFFAOYSA-N 0.000 description 1
- 229940092714 benzenesulfonic acid Drugs 0.000 description 1
- 235000010233 benzoic acid Nutrition 0.000 description 1
- 125000001797 benzyl group Chemical group [H]C1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])* 0.000 description 1
- 210000001124 body fluid Anatomy 0.000 description 1
- 239000010839 body fluid Substances 0.000 description 1
- 239000000872 buffer Substances 0.000 description 1
- 239000007853 buffer solution Substances 0.000 description 1
- KDYFGRWQOYBRFD-NUQCWPJISA-N butanedioic acid Chemical compound O[14C](=O)CC[14C](O)=O KDYFGRWQOYBRFD-NUQCWPJISA-N 0.000 description 1
- 125000000484 butyl group Chemical group [H]C([*])([H])C([H])([H])C([H])([H])C([H])([H])[H] 0.000 description 1
- 239000004202 carbamide Substances 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000013611 chromosomal DNA Substances 0.000 description 1
- 230000002759 chromosomal effect Effects 0.000 description 1
- 229960002376 chymotrypsin Drugs 0.000 description 1
- 235000015165 citric acid Nutrition 0.000 description 1
- 238000004440 column chromatography Methods 0.000 description 1
- 230000002860 competitive effect Effects 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 239000013256 coordination polymer Substances 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 125000000113 cyclohexyl group Chemical group [H]C1([H])C([H])([H])C([H])([H])C([H])(*)C([H])([H])C1([H])[H] 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 230000006378 damage Effects 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 239000003398 denaturant Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 239000000032 diagnostic agent Substances 0.000 description 1
- 229940039227 diagnostic agent Drugs 0.000 description 1
- 230000029087 digestion Effects 0.000 description 1
- 238000004821 distillation Methods 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 238000004520 electroporation Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 150000002170 ethers Chemical class 0.000 description 1
- 125000001495 ethyl group Chemical group [H]C([H])([H])C([H])([H])* 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 235000019253 formic acid Nutrition 0.000 description 1
- 239000001530 fumaric acid Substances 0.000 description 1
- 235000011087 fumaric acid Nutrition 0.000 description 1
- 230000005861 gene abnormality Effects 0.000 description 1
- 238000001415 gene therapy Methods 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010037389 glutamyl-cysteinyl-lysine Proteins 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010084264 glycyl-glycyl-cysteine Proteins 0.000 description 1
- 108010001064 glycyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 108010028188 glycyl-histidyl-serine Proteins 0.000 description 1
- 108010077435 glycyl-phenylalanyl-glycine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010059898 glycyl-tyrosyl-lysine Proteins 0.000 description 1
- 229960000789 guanidine hydrochloride Drugs 0.000 description 1
- PJJJBBJSCAKJQF-UHFFFAOYSA-N guanidinium chloride Chemical compound [Cl-].NC(N)=[NH2+] PJJJBBJSCAKJQF-UHFFFAOYSA-N 0.000 description 1
- 150000008282 halocarbons Chemical class 0.000 description 1
- 210000002216 heart Anatomy 0.000 description 1
- 238000002744 homologous recombination Methods 0.000 description 1
- 230000006801 homologous recombination Effects 0.000 description 1
- 102000055334 human NTNG1 Human genes 0.000 description 1
- 210000004408 hybridoma Anatomy 0.000 description 1
- 239000000017 hydrogel Substances 0.000 description 1
- 230000002209 hydrophobic effect Effects 0.000 description 1
- 230000002163 immunogen Effects 0.000 description 1
- 230000002401 inhibitory effect Effects 0.000 description 1
- 150000007529 inorganic bases Chemical class 0.000 description 1
- 238000002955 isolation Methods 0.000 description 1
- 125000001449 isopropyl group Chemical group [H]C([H])([H])C([H])(*)C([H])([H])[H] 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000012177 large-scale sequencing Methods 0.000 description 1
- 108010077158 leucinyl-arginyl-tryptophan Proteins 0.000 description 1
- 108010073093 leucyl-glycyl-glycyl-glycine Proteins 0.000 description 1
- 125000005647 linker group Chemical group 0.000 description 1
- 238000004811 liquid chromatography Methods 0.000 description 1
- 239000007791 liquid phase Substances 0.000 description 1
- 210000004185 liver Anatomy 0.000 description 1
- 210000004072 lung Anatomy 0.000 description 1
- 239000004325 lysozyme Substances 0.000 description 1
- 235000010335 lysozyme Nutrition 0.000 description 1
- 229960000274 lysozyme Drugs 0.000 description 1
- 108010076718 lysyl-glutamyl-tryptophan Proteins 0.000 description 1
- 108010045397 lysyl-tyrosyl-lysine Proteins 0.000 description 1
- 108010075702 lysyl-valyl-aspartyl-leucine Proteins 0.000 description 1
- VZCYOOQTPOCHFL-UPHRSURJSA-N maleic acid Chemical compound OC(=O)\C=C/C(O)=O VZCYOOQTPOCHFL-UPHRSURJSA-N 0.000 description 1
- 239000011976 maleic acid Substances 0.000 description 1
- 239000001630 malic acid Substances 0.000 description 1
- 235000011090 malic acid Nutrition 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 238000002844 melting Methods 0.000 description 1
- 230000008018 melting Effects 0.000 description 1
- 229940098779 methanesulfonic acid Drugs 0.000 description 1
- 108010034507 methionyltryptophan Proteins 0.000 description 1
- 125000002496 methyl group Chemical group [H]C([H])([H])* 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 201000000050 myeloid neoplasm Diseases 0.000 description 1
- 125000004123 n-propyl group Chemical group [H]C([H])([H])C([H])([H])C([H])([H])* 0.000 description 1
- 230000007935 neutral effect Effects 0.000 description 1
- FEMOMIGRRWSMCU-UHFFFAOYSA-N ninhydrin Chemical compound C1=CC=C2C(=O)C(O)(O)C(=O)C2=C1 FEMOMIGRRWSMCU-UHFFFAOYSA-N 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 150000007530 organic bases Chemical class 0.000 description 1
- 235000006408 oxalic acid Nutrition 0.000 description 1
- 230000008506 pathogenesis Effects 0.000 description 1
- 101150019841 penP gene Proteins 0.000 description 1
- 125000001997 phenyl group Chemical group [H]C1=C([H])C([H])=C(*)C([H])=C1[H] 0.000 description 1
- 108010082795 phenylalanyl-arginyl-arginine Proteins 0.000 description 1
- 108010072637 phenylalanyl-arginyl-phenylalanine Proteins 0.000 description 1
- 108010070409 phenylalanyl-glycyl-glycine Proteins 0.000 description 1
- 108010064486 phenylalanyl-leucyl-valine Proteins 0.000 description 1
- 108010065135 phenylalanyl-phenylalanyl-phenylalanine Proteins 0.000 description 1
- 108010024607 phenylalanylalanine Proteins 0.000 description 1
- 238000003752 polymerase chain reaction Methods 0.000 description 1
- 238000002360 preparation method Methods 0.000 description 1
- 108010020432 prolyl-prolylisoleucine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 230000001737 promoting effect Effects 0.000 description 1
- 230000000069 prophylactic effect Effects 0.000 description 1
- 235000019260 propionic acid Nutrition 0.000 description 1
- 235000019833 protease Nutrition 0.000 description 1
- 235000019419 proteases Nutrition 0.000 description 1
- 238000000159 protein binding assay Methods 0.000 description 1
- 208000020016 psychiatric disease Diseases 0.000 description 1
- IUVKMZGDUIUOCP-BTNSXGMBSA-N quinbolone Chemical compound O([C@H]1CC[C@H]2[C@H]3[C@@H]([C@]4(C=CC(=O)C=C4CC3)C)CC[C@@]21C)C1=CCCC1 IUVKMZGDUIUOCP-BTNSXGMBSA-N 0.000 description 1
- 230000006340 racemization Effects 0.000 description 1
- 238000003127 radioimmunoassay Methods 0.000 description 1
- 101150079601 recA gene Proteins 0.000 description 1
- 238000001953 recrystallisation Methods 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000010076 replication Effects 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 108010015840 seryl-prolyl-lysyl-lysine Proteins 0.000 description 1
- 238000002741 site-directed mutagenesis Methods 0.000 description 1
- 238000010532 solid phase synthesis reaction Methods 0.000 description 1
- 238000000638 solvent extraction Methods 0.000 description 1
- 210000000952 spleen Anatomy 0.000 description 1
- 238000010186 staining Methods 0.000 description 1
- 239000012086 standard solution Substances 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 125000001424 substituent group Chemical group 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
- 150000003462 sulfoxides Chemical class 0.000 description 1
- 239000004094 surface-active agent Substances 0.000 description 1
- 238000001308 synthesis method Methods 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 239000011975 tartaric acid Substances 0.000 description 1
- 235000002906 tartaric acid Nutrition 0.000 description 1
- 210000001550 testis Anatomy 0.000 description 1
- 238000010257 thawing Methods 0.000 description 1
- 230000001225 therapeutic effect Effects 0.000 description 1
- 108010033670 threonyl-aspartyl-tyrosine Proteins 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical compound [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 239000012588 trypsin Substances 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010015666 tryptophyl-leucyl-glutamic acid Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 108010044292 tryptophyltyrosine Proteins 0.000 description 1
- 108010079202 tyrosyl-alanyl-cysteine Proteins 0.000 description 1
- 108010017949 tyrosyl-glycyl-glycine Proteins 0.000 description 1
- 108010078580 tyrosylleucine Proteins 0.000 description 1
- 241000701447 unidentified baculovirus Species 0.000 description 1
- 241001430294 unidentified retrovirus Species 0.000 description 1
- 239000013603 viral vector Substances 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
- 108010027345 wheylin-1 peptide Proteins 0.000 description 1
Landscapes
- Enzymes And Modification Thereof (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Medicines Containing Material From Animals Or Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
Abstract
(57)【要約】 (修正有)
【課題】ヒト成人全脳、ヒト成人海馬及びヒト胎児全脳
由来のcDNAライブラリーから、蛋白質をコードして
いる領域を含む新規なDNAを直接クローニングし、そ
れらの塩基配列を決定し、更にそれらの機能を同定する
こと。 【解決手段】以下の(a)又は(b)のポリペプチドを
コードする塩基配列を含むDNA:(a)特定の配列
(但し、一部の配列は除く)のいずれか一つで示される
アミノ酸配列と同一又は実質的に同一のアミノ酸配列か
ら成るポリペプチド、(b)特定の配列(但し、一部の
特定の配列は除く)のいずれか一つで示されるアミノ酸
配列において、一部のアミノ酸が欠失、置換又は付加さ
れたアミノ酸配列から成り、(a)のポリペプチドの機
能と実質的に同質の生物学的活性を有するポリペプチ
ド、上記DNAにコードされる組換えポリペプチド、及
び該ポリペプチドを含む蛋白質。
由来のcDNAライブラリーから、蛋白質をコードして
いる領域を含む新規なDNAを直接クローニングし、そ
れらの塩基配列を決定し、更にそれらの機能を同定する
こと。 【解決手段】以下の(a)又は(b)のポリペプチドを
コードする塩基配列を含むDNA:(a)特定の配列
(但し、一部の配列は除く)のいずれか一つで示される
アミノ酸配列と同一又は実質的に同一のアミノ酸配列か
ら成るポリペプチド、(b)特定の配列(但し、一部の
特定の配列は除く)のいずれか一つで示されるアミノ酸
配列において、一部のアミノ酸が欠失、置換又は付加さ
れたアミノ酸配列から成り、(a)のポリペプチドの機
能と実質的に同質の生物学的活性を有するポリペプチ
ド、上記DNAにコードされる組換えポリペプチド、及
び該ポリペプチドを含む蛋白質。
Description
【0001】
【発明の属する技術分野】本発明は、DNA及び該DN
Aを含む遺伝子、並びに該DNAにコードされる組換え
ポリペプチド及び該ポリペプチドを含む新規組換え蛋白
質に関する。
Aを含む遺伝子、並びに該DNAにコードされる組換え
ポリペプチド及び該ポリペプチドを含む新規組換え蛋白
質に関する。
【0002】
【従来の技術】ヒトゲノム計画における大規模シークエ
ンシングによって、ヒトゲノムの塩基配列に関する情報
が日々産出されている。ヒトゲノム計画の最終目的は単
にゲノム全塩基配列を決定することではなく、その構造
情報、即ち、DNAの塩基配列情報からヒトのさまざま
な生命現象を読み解くことにあろう。ヒトゲノム配列中
で蛋白質をコードしている領域はその極一部であり、現
在は、ニュートラルネットワークや隠れマルコフモデル
と呼ばれる情報科学の手法を用いて、そのコード領域の
予測が行われている。しかしながら、それらの予測精度
はまだ充分なものではない。
ンシングによって、ヒトゲノムの塩基配列に関する情報
が日々産出されている。ヒトゲノム計画の最終目的は単
にゲノム全塩基配列を決定することではなく、その構造
情報、即ち、DNAの塩基配列情報からヒトのさまざま
な生命現象を読み解くことにあろう。ヒトゲノム配列中
で蛋白質をコードしている領域はその極一部であり、現
在は、ニュートラルネットワークや隠れマルコフモデル
と呼ばれる情報科学の手法を用いて、そのコード領域の
予測が行われている。しかしながら、それらの予測精度
はまだ充分なものではない。
【0003】今回、本発明者は新規な遺伝子を見出すべ
く、ヒト成人全脳、ヒト成人海馬及びヒト胎児全脳由来
のcDNAライブラリーから、蛋白質をコードしている
領域を含む新規なDNAを直接クローニングすることに
成功し、それらの塩基配列を決定して本発明を完成させ
た。
く、ヒト成人全脳、ヒト成人海馬及びヒト胎児全脳由来
のcDNAライブラリーから、蛋白質をコードしている
領域を含む新規なDNAを直接クローニングすることに
成功し、それらの塩基配列を決定して本発明を完成させ
た。
【0004】
【課題を解決するための手段】即ち、本発明は第一の態
様として、以下の(a)又は(b)のポリペプチドをコ
ードする塩基配列を含むDNAに係る: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列と同一又は実質的に同一のアミノ酸配列から成るポリ
ペプチド、(b)配列番号:1乃至44(但し、配列番
号7、11及び25は除く)のいずれか一つで示される
アミノ酸配列において、一部のアミノ酸が欠失、置換又
は付加されたアミノ酸配列から成り、(a)のポリペプ
チドの機能と実質的に同質の生物学的活性を有するポリ
ペプチド。本発明の第二の態様として、以下の(a)又
は(b)のDNAに係る: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示される塩基配列に
おいて、夫々の配列で示されるアミノ酸配列をコードす
る塩基配列を含むDNA、(b)(a)のDNAとスト
リンジェントな条件下でハイブリダイズし、(a)のポ
リペプチドの機能と実質的に同質の生物学的活性を有す
る蛋白質をコードするDNA。以上の本発明の第一及び
第二の態様であるDNAをまとめて、以下、「本発明D
NA」ともいう。又、本発明はこれらDNAを含む遺伝
子にも係る。更に、本発明は上記DNA又は遺伝子にコ
ードされる組換えポリペプチド(以下、「本発明ポリペ
プチド」ともいう。)、及び該ポリペプチドを含む組換
え蛋白質に係る。
様として、以下の(a)又は(b)のポリペプチドをコ
ードする塩基配列を含むDNAに係る: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列と同一又は実質的に同一のアミノ酸配列から成るポリ
ペプチド、(b)配列番号:1乃至44(但し、配列番
号7、11及び25は除く)のいずれか一つで示される
アミノ酸配列において、一部のアミノ酸が欠失、置換又
は付加されたアミノ酸配列から成り、(a)のポリペプ
チドの機能と実質的に同質の生物学的活性を有するポリ
ペプチド。本発明の第二の態様として、以下の(a)又
は(b)のDNAに係る: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示される塩基配列に
おいて、夫々の配列で示されるアミノ酸配列をコードす
る塩基配列を含むDNA、(b)(a)のDNAとスト
リンジェントな条件下でハイブリダイズし、(a)のポ
リペプチドの機能と実質的に同質の生物学的活性を有す
る蛋白質をコードするDNA。以上の本発明の第一及び
第二の態様であるDNAをまとめて、以下、「本発明D
NA」ともいう。又、本発明はこれらDNAを含む遺伝
子にも係る。更に、本発明は上記DNA又は遺伝子にコ
ードされる組換えポリペプチド(以下、「本発明ポリペ
プチド」ともいう。)、及び該ポリペプチドを含む組換
え蛋白質に係る。
【0005】本発明DNAを有するクローンの名称、本
発明ポリペプチド又は蛋白質の長さ、その機能について
は、表1乃至表3に示されている。
発明ポリペプチド又は蛋白質の長さ、その機能について
は、表1乃至表3に示されている。
【0006】本発明DNAは、市販されている(クロン
テック社)ヒト成人全脳、ヒト成人海馬及びヒト胎児全
脳のmRNAを出発材料として、本発明者が調製したc
DNAライブラリーから、cDNA断片として単離した
後に、塩基配列を決定し同定したものである。即ち、具
体的には、小原他の方法(DNA Research Vol.4,53−59
(1997))に従って調製したヒト成人全脳、ヒト成人海
馬及びヒト胎児全脳由来のcDNAライブラリーからク
ローンをランダムに単離する。次に、ハイブリダイゼー
ションにより、重複クローン(繰り返し出てくるクロー
ン)を除き、その後インビトロでの転写翻訳を行い50
kDa以上の産物が認められるクローンについてその両
末端の塩基配列を決定する。更に、こうして得られた末
端塩基配列をクエリーとして既知遺伝子のデータベース
に相同性検索を行い、その結果、新規であることが判明
したクローンについて全塩基配列を決定する。このよう
にして既知の遺伝子に依存した従来のクローニング方法
では得られなかった未知の遺伝子も、システマチックに
クローニングを行なうことができる。又、短い断片や得
られた配列に人工的な間違いが起こらないように十分な
注意を払いながら、RACE等のPCR法を使用するこ
とによっても、本発明DNAを含むヒト由来遺伝子の全
領域を調製することも可能である。
テック社)ヒト成人全脳、ヒト成人海馬及びヒト胎児全
脳のmRNAを出発材料として、本発明者が調製したc
DNAライブラリーから、cDNA断片として単離した
後に、塩基配列を決定し同定したものである。即ち、具
体的には、小原他の方法(DNA Research Vol.4,53−59
(1997))に従って調製したヒト成人全脳、ヒト成人海
馬及びヒト胎児全脳由来のcDNAライブラリーからク
ローンをランダムに単離する。次に、ハイブリダイゼー
ションにより、重複クローン(繰り返し出てくるクロー
ン)を除き、その後インビトロでの転写翻訳を行い50
kDa以上の産物が認められるクローンについてその両
末端の塩基配列を決定する。更に、こうして得られた末
端塩基配列をクエリーとして既知遺伝子のデータベース
に相同性検索を行い、その結果、新規であることが判明
したクローンについて全塩基配列を決定する。このよう
にして既知の遺伝子に依存した従来のクローニング方法
では得られなかった未知の遺伝子も、システマチックに
クローニングを行なうことができる。又、短い断片や得
られた配列に人工的な間違いが起こらないように十分な
注意を払いながら、RACE等のPCR法を使用するこ
とによっても、本発明DNAを含むヒト由来遺伝子の全
領域を調製することも可能である。
【0007】更に、本発明は、本発明DNA又は本発明
DNAを含む遺伝子を含有する組換えベクター、該組換
えベクターを保持する形質転換体、該形質転換体を培養
し、本発明ポリペプチド若しくは該ポリペプチドを含む
組換え蛋白質を生成、蓄積せしめ、これを採取すること
を特徴とする、本発明ポリペプチド若しくは該ポリペプ
チドを含む組換え蛋白質、又はその塩の製造方法、及
び、こうして得られる本発明ポリペプチド若しくは該ポ
リペプチドを含む組換え蛋白質又はその塩を提供する。
DNAを含む遺伝子を含有する組換えベクター、該組換
えベクターを保持する形質転換体、該形質転換体を培養
し、本発明ポリペプチド若しくは該ポリペプチドを含む
組換え蛋白質を生成、蓄積せしめ、これを採取すること
を特徴とする、本発明ポリペプチド若しくは該ポリペプ
チドを含む組換え蛋白質、又はその塩の製造方法、及
び、こうして得られる本発明ポリペプチド若しくは該ポ
リペプチドを含む組換え蛋白質又はその塩を提供する。
【0008】又、本発明は、本発明DNA又は遺伝子を
含有してなる医薬、本発明ポリペプチド若しくはその部
分ポリペプチド又は該ポリペプチドを含む組換え蛋白質
をコードする塩基配列を含むポリヌクレオチド(DN
A)、それら塩基配列に実質的に相補的な塩基配列を有
するアンチセンスヌクレオチド、該ポリヌクレオチド又
はアンチセンスヌクレオチドを含有してなる医薬、本発
明ポリペプチド若しくはその部分ポリペプチド、及び、
該ポリペプチド又はそれらを含む組換え蛋白質を含有し
てなる医薬に係る。更に、本発明は、本発明ポリペプチ
ド若しくはその部分ポリペプチド又は該ポリペプチドを
含む組換え蛋白質又はそれらの塩に対する抗体、及び、
本発明ポリペプチド、その部分ポリペプチド若しくは該
ポリペプチドを含む組換え蛋白質又はそれらの塩、又は
それらに対する抗体を用いることを特徴とする、それら
物質と特異的に相互作用する物質のスクリーニング方
法、スクリーニング用キット、並びに、該スクリーニン
グ方法によって同定される物質(化合物)自体等にも係
る。
含有してなる医薬、本発明ポリペプチド若しくはその部
分ポリペプチド又は該ポリペプチドを含む組換え蛋白質
をコードする塩基配列を含むポリヌクレオチド(DN
A)、それら塩基配列に実質的に相補的な塩基配列を有
するアンチセンスヌクレオチド、該ポリヌクレオチド又
はアンチセンスヌクレオチドを含有してなる医薬、本発
明ポリペプチド若しくはその部分ポリペプチド、及び、
該ポリペプチド又はそれらを含む組換え蛋白質を含有し
てなる医薬に係る。更に、本発明は、本発明ポリペプチ
ド若しくはその部分ポリペプチド又は該ポリペプチドを
含む組換え蛋白質又はそれらの塩に対する抗体、及び、
本発明ポリペプチド、その部分ポリペプチド若しくは該
ポリペプチドを含む組換え蛋白質又はそれらの塩、又は
それらに対する抗体を用いることを特徴とする、それら
物質と特異的に相互作用する物質のスクリーニング方
法、スクリーニング用キット、並びに、該スクリーニン
グ方法によって同定される物質(化合物)自体等にも係
る。
【0009】
【発明の実施の形態】本発明DNAとしては、前述した
本発明ポリペプチドをコードする塩基配列から成るもの
であればいかなるものであってもよい。また、ヒトの
脳、又は、それ以外の組織、例えば、心臓、肺、肝臓、
脾臓、腎臓、精巣、等の細胞・組織に由来するcDNA
ライブラリー等から同定・単離されたcDNA、又は、
合成DNAのいずれでもよい。ライブラリー作成に使用
するベクターは、バクテリオファージ、プラスミド、コ
スミド、ファージミドなどいずれであってもよい。ま
た、前記した細胞・組織よりtotalRNA画分またはm
RNA画分を調製したものを用いて、直接ReverseTrans
cription coupled Polymerase Chain Reaction(以下、
「RT-PCR法」と略称する)によって増幅すること
もできる。
本発明ポリペプチドをコードする塩基配列から成るもの
であればいかなるものであってもよい。また、ヒトの
脳、又は、それ以外の組織、例えば、心臓、肺、肝臓、
脾臓、腎臓、精巣、等の細胞・組織に由来するcDNA
ライブラリー等から同定・単離されたcDNA、又は、
合成DNAのいずれでもよい。ライブラリー作成に使用
するベクターは、バクテリオファージ、プラスミド、コ
スミド、ファージミドなどいずれであってもよい。ま
た、前記した細胞・組織よりtotalRNA画分またはm
RNA画分を調製したものを用いて、直接ReverseTrans
cription coupled Polymerase Chain Reaction(以下、
「RT-PCR法」と略称する)によって増幅すること
もできる。
【0010】配列番号:1乃至44(但し、配列番号
7、11及び25は除く)のいずれか一つで示されるア
ミノ酸配列と実質的に同一のアミノ酸配列とは、配列番
号:1乃至44(但し、配列番号7、11及び25は除
く)のいずれか一つで示される全アミノ酸配列との相同
性の程度が、全体の平均で約70%以上、好ましくは約
80%以上、更に好ましくは約90%以上、特に好まし
くは約95%以上であるアミノ酸配列を意味する。従っ
て、本発明の配列番号:1乃至44(但し、配列番号
7、11及び25は除く)のいずれか一つで示されるア
ミノ酸配列と実質的に同一のアミノ酸配列から成るポリ
ペプチドとしては、例えば、前記の各配列番号で示され
るアミノ酸配列に対して上記の相同性を有し、各配列番
号で示されるアミノ酸配列から成るポリペプチドの機能
と実質的に同質の生物学的活性(機能)を有するポリペ
プチドを挙げることが出来る。ここで、実質的に同質と
は、それらの活性(機能)が性質的に同質であることを
示す。又、本発明ポリペプチドには、例えば、配列番
号:1乃至44(但し、配列番号7、11及び25は除
く)のいずれか一つで示されるアミノ酸配列中の一部
(好ましくは、1〜20個程度、より好ましくは1〜1
0個程度、さらに好ましくは数個)のアミノ酸が欠失、
置換又は付加したアミノ酸配列、或いはそれらを組み合
わせたアミノ酸配列から成り、配列番号:1乃至44
(但し、配列番号7、11及び25は除く)のいずれか
一つで示されるアミノ酸配列から成るポリペプチドの機
能と実質的に同質の生物学的活性(機能)を有するポリ
ペプチドも含まれる。
7、11及び25は除く)のいずれか一つで示されるア
ミノ酸配列と実質的に同一のアミノ酸配列とは、配列番
号:1乃至44(但し、配列番号7、11及び25は除
く)のいずれか一つで示される全アミノ酸配列との相同
性の程度が、全体の平均で約70%以上、好ましくは約
80%以上、更に好ましくは約90%以上、特に好まし
くは約95%以上であるアミノ酸配列を意味する。従っ
て、本発明の配列番号:1乃至44(但し、配列番号
7、11及び25は除く)のいずれか一つで示されるア
ミノ酸配列と実質的に同一のアミノ酸配列から成るポリ
ペプチドとしては、例えば、前記の各配列番号で示され
るアミノ酸配列に対して上記の相同性を有し、各配列番
号で示されるアミノ酸配列から成るポリペプチドの機能
と実質的に同質の生物学的活性(機能)を有するポリペ
プチドを挙げることが出来る。ここで、実質的に同質と
は、それらの活性(機能)が性質的に同質であることを
示す。又、本発明ポリペプチドには、例えば、配列番
号:1乃至44(但し、配列番号7、11及び25は除
く)のいずれか一つで示されるアミノ酸配列中の一部
(好ましくは、1〜20個程度、より好ましくは1〜1
0個程度、さらに好ましくは数個)のアミノ酸が欠失、
置換又は付加したアミノ酸配列、或いはそれらを組み合
わせたアミノ酸配列から成り、配列番号:1乃至44
(但し、配列番号7、11及び25は除く)のいずれか
一つで示されるアミノ酸配列から成るポリペプチドの機
能と実質的に同質の生物学的活性(機能)を有するポリ
ペプチドも含まれる。
【0011】上記の配列番号:1乃至44(但し、配列
番号7、11及び25は除く)のいずれか一つで示され
るアミノ酸配列と実質的に同一のアミノ酸配列から成る
ポリペプチド、又はその一部のアミノ酸が欠失、置換又
は付加したアミノ酸配列から成るポリペプチドは、例え
ば、部位特異的変異導入法、遺伝子相同組換え法、プラ
イマー伸長法、及びPCR法等の当業者に周知の方法を
適宜組み合わせて、容易に作成することが可能である。
尚、その際に、実質的に同質の生物学的活性を有するた
めには、当該ポリペプチドを構成するアミノ酸のうち、
同族アミノ酸(極性・非極性アミノ酸、疎水性・親水性
アミノ酸、陽性・陰性荷電アミノ酸、芳香族アミノ酸な
ど)同士の置換が可能性として考えられる。又、実質的
に同質の生物学的活性の維持のためには、本発明の各ポ
リペプチドに含まれる機能ドメイン内のアミノ酸は保持
されることが望ましい。
番号7、11及び25は除く)のいずれか一つで示され
るアミノ酸配列と実質的に同一のアミノ酸配列から成る
ポリペプチド、又はその一部のアミノ酸が欠失、置換又
は付加したアミノ酸配列から成るポリペプチドは、例え
ば、部位特異的変異導入法、遺伝子相同組換え法、プラ
イマー伸長法、及びPCR法等の当業者に周知の方法を
適宜組み合わせて、容易に作成することが可能である。
尚、その際に、実質的に同質の生物学的活性を有するた
めには、当該ポリペプチドを構成するアミノ酸のうち、
同族アミノ酸(極性・非極性アミノ酸、疎水性・親水性
アミノ酸、陽性・陰性荷電アミノ酸、芳香族アミノ酸な
ど)同士の置換が可能性として考えられる。又、実質的
に同質の生物学的活性の維持のためには、本発明の各ポ
リペプチドに含まれる機能ドメイン内のアミノ酸は保持
されることが望ましい。
【0012】更に、本発明DNAは、配列番号:1乃至
44(但し、配列番号7、11及び25は除く)のいず
れか一つで示される塩基配列において、夫々の配列で示
されるアミノ酸配列をコードする塩基配列を含むDN
A、及び、該DNAとストリンジェントな条件下でハイ
ブリダイズし、各配列で示されるアミノ酸配列から成る
ポリペプチドの機能と同質の生物学的活性(機能)を有
するポリペプチド(蛋白質)をコードするDNAを包含
する。かかる条件下で、配列番号:1乃至44(但し、
配列番号7、11及び25は除く)のいずれか一つで示
される塩基配列において、夫々の配列で示されるアミノ
酸配列をコードする塩基配列を含むDNAとハイブリダ
イズできるDNAとしては、例えば、該DNAの全塩基
配列との相同性の程度が、全体の平均で約80%以上、
好ましくは約90%以上、より好ましくは約95%以上
である塩基配列を含有するDNA等を挙げることが出来
る。ハイブリダイゼーションは、カレント・プロトコー
ルズ・イン・モレキュラー・バイオロジー(Current pr
otocols in molecular biology(edited by Frederick
M. Ausubel et al., 1987))に記載の方法等、当業界
で公知の方法あるいはそれに準じる方法に従って行なう
ことができる。また、市販のライブラリーを使用する場
合、添付の使用説明書に記載の方法に従って行なうこと
ができる。ここで、「ストリンジェントな条件」とは、
例えば、65℃の1mM EDTA ナトリウム、0.5M リン酸水
素ナトリウム(pH7.2)、7%SDS 水溶液中でハイブリ
ダイズさせ、65℃の1mM EDTA ナトリウム、40mM リ
ン酸水素ナトリウム(pH7.2)、1%SDS 水溶液中でメ
ンブレンを洗浄する条件でのサザンブロットハイブリダ
イゼーションで本発明DNAプローブにハイブリダイズ
する程度の条件である。
44(但し、配列番号7、11及び25は除く)のいず
れか一つで示される塩基配列において、夫々の配列で示
されるアミノ酸配列をコードする塩基配列を含むDN
A、及び、該DNAとストリンジェントな条件下でハイ
ブリダイズし、各配列で示されるアミノ酸配列から成る
ポリペプチドの機能と同質の生物学的活性(機能)を有
するポリペプチド(蛋白質)をコードするDNAを包含
する。かかる条件下で、配列番号:1乃至44(但し、
配列番号7、11及び25は除く)のいずれか一つで示
される塩基配列において、夫々の配列で示されるアミノ
酸配列をコードする塩基配列を含むDNAとハイブリダ
イズできるDNAとしては、例えば、該DNAの全塩基
配列との相同性の程度が、全体の平均で約80%以上、
好ましくは約90%以上、より好ましくは約95%以上
である塩基配列を含有するDNA等を挙げることが出来
る。ハイブリダイゼーションは、カレント・プロトコー
ルズ・イン・モレキュラー・バイオロジー(Current pr
otocols in molecular biology(edited by Frederick
M. Ausubel et al., 1987))に記載の方法等、当業界
で公知の方法あるいはそれに準じる方法に従って行なう
ことができる。また、市販のライブラリーを使用する場
合、添付の使用説明書に記載の方法に従って行なうこと
ができる。ここで、「ストリンジェントな条件」とは、
例えば、65℃の1mM EDTA ナトリウム、0.5M リン酸水
素ナトリウム(pH7.2)、7%SDS 水溶液中でハイブリ
ダイズさせ、65℃の1mM EDTA ナトリウム、40mM リ
ン酸水素ナトリウム(pH7.2)、1%SDS 水溶液中でメ
ンブレンを洗浄する条件でのサザンブロットハイブリダ
イゼーションで本発明DNAプローブにハイブリダイズ
する程度の条件である。
【0013】本発明DNAのクローニングの手段として
は、本発明ポリペプチドの部分等の適当な塩基配列を有
する合成DNAプライマーを用いてPCR法によって増
幅するか、または適当なベクターに組み込んだDNAを
本発明ポリペプチドの一部あるいは全領域をコードする
DNA断片もしくは合成DNAを用いて標識したものと
のハイブリダイゼーションによって選別することができ
る。ハイブリダイゼーションの方法は、例えば、上記の
Current protocols in molecular biology(edited by
Frederick M. Ausubel et al., 1987)に記載の方法な
どに従って行なうことができる。また、市販のライブラ
リーを使用する場合、添付の使用説明書に記載の方法に
従って行なうことができる。クローン化されたポリペプ
チドをコードするDNAは目的によりそのまま、または
所望により制限酵素で消化したり、リンカーを付加した
りして使用することができる。該DNAはその5’末端
側に翻訳開始コドンとしてのATGを有し、また3’末
端側には翻訳終止コドンとしてのTAA、TGAまたは
TAGを有していてもよい。これらの翻訳開始コドンや
翻訳終止コドンは、適当な合成DNAアダプターを用い
て付加することもできる。
は、本発明ポリペプチドの部分等の適当な塩基配列を有
する合成DNAプライマーを用いてPCR法によって増
幅するか、または適当なベクターに組み込んだDNAを
本発明ポリペプチドの一部あるいは全領域をコードする
DNA断片もしくは合成DNAを用いて標識したものと
のハイブリダイゼーションによって選別することができ
る。ハイブリダイゼーションの方法は、例えば、上記の
Current protocols in molecular biology(edited by
Frederick M. Ausubel et al., 1987)に記載の方法な
どに従って行なうことができる。また、市販のライブラ
リーを使用する場合、添付の使用説明書に記載の方法に
従って行なうことができる。クローン化されたポリペプ
チドをコードするDNAは目的によりそのまま、または
所望により制限酵素で消化したり、リンカーを付加した
りして使用することができる。該DNAはその5’末端
側に翻訳開始コドンとしてのATGを有し、また3’末
端側には翻訳終止コドンとしてのTAA、TGAまたは
TAGを有していてもよい。これらの翻訳開始コドンや
翻訳終止コドンは、適当な合成DNAアダプターを用い
て付加することもできる。
【0014】本発明の蛋白質の発現ベクターは、当該技
術分野で公知の方法に従って作成することが出来る。例
えば、(1)本発明DNA又は本発明DNAを含む遺伝
子を含有するDNA断片を切り出し、(2)該DNA断
片を適当な発現ベクター中のプロモーターの下流に連結
することにより製造することができる。ベクターとして
は、大腸菌由来のプラスミド(例、pBR322,pB
R325,pUC18,pUC118)、枯草菌由来の
プラスミド(例、pUB110,pTP5,pC19
4)、酵母由来プラスミド(例、pSH19,pSH1
5)、λファージなどのバクテリオファージ、レトロウ
イルス,ワクシニアウイルス,バキュロウイルスなどの
動物ウイルス等を利用することが出来る。本発明で用い
られるプロモーターとしては、遺伝子の発現に用いる宿
主に対応した適切なプロモーターであればいかなるもの
でもよい。例えば、宿主が大腸菌である場合は、trp
プロモーター、lacプロモーター、recAプロモー
ター、λPLプロモーター、lppプロモーターなど
が、宿主が枯草菌である場合は、SPO1プロモータ
ー、SPO2プロモーター、penPプロモーターな
ど、宿主が酵母である場合は、PHO5プロモーター、
PGKプロモーター、GAPプロモーター、ADHプロ
モーターなどが好ましい。動物細胞を宿主として用いる
場合は、SRαプロモーター、SV40プロモーター、
LTRプロモーター、CMVプロモーター、HSV-T
Kプロモーターなどが挙げられる。
術分野で公知の方法に従って作成することが出来る。例
えば、(1)本発明DNA又は本発明DNAを含む遺伝
子を含有するDNA断片を切り出し、(2)該DNA断
片を適当な発現ベクター中のプロモーターの下流に連結
することにより製造することができる。ベクターとして
は、大腸菌由来のプラスミド(例、pBR322,pB
R325,pUC18,pUC118)、枯草菌由来の
プラスミド(例、pUB110,pTP5,pC19
4)、酵母由来プラスミド(例、pSH19,pSH1
5)、λファージなどのバクテリオファージ、レトロウ
イルス,ワクシニアウイルス,バキュロウイルスなどの
動物ウイルス等を利用することが出来る。本発明で用い
られるプロモーターとしては、遺伝子の発現に用いる宿
主に対応した適切なプロモーターであればいかなるもの
でもよい。例えば、宿主が大腸菌である場合は、trp
プロモーター、lacプロモーター、recAプロモー
ター、λPLプロモーター、lppプロモーターなど
が、宿主が枯草菌である場合は、SPO1プロモータ
ー、SPO2プロモーター、penPプロモーターな
ど、宿主が酵母である場合は、PHO5プロモーター、
PGKプロモーター、GAPプロモーター、ADHプロ
モーターなどが好ましい。動物細胞を宿主として用いる
場合は、SRαプロモーター、SV40プロモーター、
LTRプロモーター、CMVプロモーター、HSV-T
Kプロモーターなどが挙げられる。
【0015】発現ベクターには、以上の他に、所望によ
り当該技術分野で公知の、エンハンサー、スプライシン
グシグナル、ポリA付加シグナル、選択マーカー、SV
40複製オリジン等を付加することができる。また、必
要に応じて、本発明のDNAにコードされた蛋白質を他
の蛋白質(例えば、グルタチオンSトランスフェラーゼ
及びプロテインA)との融合蛋白質として発現させるこ
とも可能である。このような融合蛋白質は、適当なプロ
テアーゼを使用して切断し、それぞれの蛋白質に分離す
ることが出来る。
り当該技術分野で公知の、エンハンサー、スプライシン
グシグナル、ポリA付加シグナル、選択マーカー、SV
40複製オリジン等を付加することができる。また、必
要に応じて、本発明のDNAにコードされた蛋白質を他
の蛋白質(例えば、グルタチオンSトランスフェラーゼ
及びプロテインA)との融合蛋白質として発現させるこ
とも可能である。このような融合蛋白質は、適当なプロ
テアーゼを使用して切断し、それぞれの蛋白質に分離す
ることが出来る。
【0016】宿主細胞としては、例えば、エシェリヒア
属菌、バチルス属菌、酵母、昆虫細胞、昆虫、動物細胞
などが用いられる。エシェリヒア属菌の具体例として
は、エシェリヒア・コリ(Escherichia coli)K12・
DH1(Proc. Natl. Acad. Sci. USA,60巻,1
60(1968)),JM103(Nucleic Acids Resear
ch,9巻,309(1981)),JA221(Journal
of Molecular Biology,120巻,517(197
8)),及びHB101(Journal of Molecular Biolog
y,41巻,459(1969))等が用いられる。バチ
ルス属菌としては、例えば、バチルス・サチルス(Baci
llus subtilis)MI114(Gene,24巻,255(1
983)),207−21〔Journal of Biochemistry,
95巻,87(1984)〕等が用いられる。酵母として
は、例えば、サッカロマイセス セレビシエ(Saccaromy
ces cerevisiae)AH22,AH22R-,NA87−
11A,DKD−5D,20B−12、シゾサッカロマ
イセス ポンベ(Schizosaccaromyces pombe)NCYC
1913,NCYC2036、サッカロマイセス ピキ
ア パストリス(Saccaromycespicjia pastoris)等が用
いられる。動物細胞としては、例えば、サル細胞COS
−7,Vero,チャイニーズハムスター細胞CHO(以
下、CHO細胞と略記),dhfr遺伝子欠損CHO細
胞,マウスL細胞,マウスAtT−20,マウスミエロ
ーマ細胞,ラットGH3,ヒトFL細胞などが用いられ
る。
属菌、バチルス属菌、酵母、昆虫細胞、昆虫、動物細胞
などが用いられる。エシェリヒア属菌の具体例として
は、エシェリヒア・コリ(Escherichia coli)K12・
DH1(Proc. Natl. Acad. Sci. USA,60巻,1
60(1968)),JM103(Nucleic Acids Resear
ch,9巻,309(1981)),JA221(Journal
of Molecular Biology,120巻,517(197
8)),及びHB101(Journal of Molecular Biolog
y,41巻,459(1969))等が用いられる。バチ
ルス属菌としては、例えば、バチルス・サチルス(Baci
llus subtilis)MI114(Gene,24巻,255(1
983)),207−21〔Journal of Biochemistry,
95巻,87(1984)〕等が用いられる。酵母として
は、例えば、サッカロマイセス セレビシエ(Saccaromy
ces cerevisiae)AH22,AH22R-,NA87−
11A,DKD−5D,20B−12、シゾサッカロマ
イセス ポンベ(Schizosaccaromyces pombe)NCYC
1913,NCYC2036、サッカロマイセス ピキ
ア パストリス(Saccaromycespicjia pastoris)等が用
いられる。動物細胞としては、例えば、サル細胞COS
−7,Vero,チャイニーズハムスター細胞CHO(以
下、CHO細胞と略記),dhfr遺伝子欠損CHO細
胞,マウスL細胞,マウスAtT−20,マウスミエロ
ーマ細胞,ラットGH3,ヒトFL細胞などが用いられ
る。
【0017】これら宿主細胞の形質転換は、当該技術分
野で公知の方法に従って行うことが出来る。例えば、以
下に記載の文献を参照することが出来る。Proc. Natl.
Acad. Sci. USA,69巻,2110(1972); Ge
ne,17巻,107(1982);Molecular & General
Genetics,168巻,111(1979);Methods in
Enzymology,194巻,182−187(1991);
Proc. Natl. Acad. Sci. USA),75巻,1929
(1978);細胞工学別冊8 新 細胞工学実験プロトコ
ール.263−267(1995)(秀潤社発行);及
び Virology,52巻,456(1973)。
野で公知の方法に従って行うことが出来る。例えば、以
下に記載の文献を参照することが出来る。Proc. Natl.
Acad. Sci. USA,69巻,2110(1972); Ge
ne,17巻,107(1982);Molecular & General
Genetics,168巻,111(1979);Methods in
Enzymology,194巻,182−187(1991);
Proc. Natl. Acad. Sci. USA),75巻,1929
(1978);細胞工学別冊8 新 細胞工学実験プロトコ
ール.263−267(1995)(秀潤社発行);及
び Virology,52巻,456(1973)。
【0018】このようにして得られた、本発明DNA又
は本発明DNAを含む遺伝子を含有する発現ベクターで
形質転換された形質転換体は、当該技術分野で公知の方
法に従って培養することが出来る。例えば、以下に記載
の文献を参照することが出来る。例えば、宿主がエシェ
リヒア属菌の場合、培養は通常約15〜43℃で約3〜
24時間行ない、必要により、通気や撹拌を加えること
もできる。宿主がバチルス属菌の場合、培養は通常、約
30〜40℃で約6〜24時間行ない、必要により通気
や撹拌を加えることもできる。宿主が酵母である形質転
換体を培養する際、培養は通常、pH約5〜8に調整さ
れた培地を用いて約20℃〜35℃で約24〜72時間
行ない、必要に応じて通気や撹拌を加えることもでき
る。宿主が動物細胞である形質転換体を培養する際、p
Hは約6〜8に調整された培地を用いて、通常約30℃
〜40℃で約15〜60時間行ない、必要に応じて通気
や撹拌を加えることもできる。
は本発明DNAを含む遺伝子を含有する発現ベクターで
形質転換された形質転換体は、当該技術分野で公知の方
法に従って培養することが出来る。例えば、以下に記載
の文献を参照することが出来る。例えば、宿主がエシェ
リヒア属菌の場合、培養は通常約15〜43℃で約3〜
24時間行ない、必要により、通気や撹拌を加えること
もできる。宿主がバチルス属菌の場合、培養は通常、約
30〜40℃で約6〜24時間行ない、必要により通気
や撹拌を加えることもできる。宿主が酵母である形質転
換体を培養する際、培養は通常、pH約5〜8に調整さ
れた培地を用いて約20℃〜35℃で約24〜72時間
行ない、必要に応じて通気や撹拌を加えることもでき
る。宿主が動物細胞である形質転換体を培養する際、p
Hは約6〜8に調整された培地を用いて、通常約30℃
〜40℃で約15〜60時間行ない、必要に応じて通気
や撹拌を加えることもできる。
【0019】上記培養物から本発明ポリペプチド又は蛋
白質を分離精製するには、例えば、培養後、公知の方法
で菌体あるいは細胞を集め、これを適当な緩衝液に懸濁
し、超音波、リゾチームおよび/または凍結融解などに
よって菌体あるいは細胞を破壊したのち、遠心分離やろ
過により蛋白質の粗抽出液を得る。緩衝液の中に尿素や
塩酸グアニジンなどの蛋白質変性剤や、トリトンX−1
00TMなどの界面活性剤が含まれていてもよい。培養液
中に蛋白質が分泌される場合には、培養終了後、公知の
方法で菌体あるいは細胞と上清とを分離し、上清を集め
る。このようにして得られた培養上清、あるいは抽出液
中に含まれる蛋白質の精製は、公知の分離・精製法を適
切に組み合わせて行なうことができる。こうして得られ
た本発明ポリペプチド(蛋白質)は、公知の方法あるい
はそれに準じる方法によって塩に変換することができ、
逆に塩で得られた場合には公知の方法あるいはそれに準
じる方法により、遊離体または他の塩に変換することが
できる。更に、組換え体が産生する蛋白質を、精製前ま
たは精製後に、トリプシン及びキモトリプシンのような
適当な蛋白修飾酵素を作用させることにより、任意に修
飾を加えたり、ポリペプチドを部分的に除去することも
できる。本発明ポリペプチド(蛋白質)又はその塩の存
在は、様々な結合アッセイ及び特異抗体を用いたエンザ
イムイムノアッセイ等により測定することができる。
白質を分離精製するには、例えば、培養後、公知の方法
で菌体あるいは細胞を集め、これを適当な緩衝液に懸濁
し、超音波、リゾチームおよび/または凍結融解などに
よって菌体あるいは細胞を破壊したのち、遠心分離やろ
過により蛋白質の粗抽出液を得る。緩衝液の中に尿素や
塩酸グアニジンなどの蛋白質変性剤や、トリトンX−1
00TMなどの界面活性剤が含まれていてもよい。培養液
中に蛋白質が分泌される場合には、培養終了後、公知の
方法で菌体あるいは細胞と上清とを分離し、上清を集め
る。このようにして得られた培養上清、あるいは抽出液
中に含まれる蛋白質の精製は、公知の分離・精製法を適
切に組み合わせて行なうことができる。こうして得られ
た本発明ポリペプチド(蛋白質)は、公知の方法あるい
はそれに準じる方法によって塩に変換することができ、
逆に塩で得られた場合には公知の方法あるいはそれに準
じる方法により、遊離体または他の塩に変換することが
できる。更に、組換え体が産生する蛋白質を、精製前ま
たは精製後に、トリプシン及びキモトリプシンのような
適当な蛋白修飾酵素を作用させることにより、任意に修
飾を加えたり、ポリペプチドを部分的に除去することも
できる。本発明ポリペプチド(蛋白質)又はその塩の存
在は、様々な結合アッセイ及び特異抗体を用いたエンザ
イムイムノアッセイ等により測定することができる。
【0020】本発明ポリペプチド(蛋白質)は、C末端
が通常カルボキシル基(−COOH)またはカルボキシ
レート(−COO-)であるが、C末端がアミド(−CO
NH 2)またはエステル(−COOR)であってもよ
い。ここでエステルにおけるRとしては、例えば、メチ
ル、エチル、n−プロピル、イソプロピルもしくはn−
ブチルなどのC1-6アルキル基、例えば、シクロペンチ
ル、シクロヘキシルなどのC3-8シクロアルキル基、例
えば、フェニル、α−ナフチルなどのC6-12アリール
基、例えば、ベンジル、フェネチルなどのフェニル−C
1-2アルキル基もしくはα−ナフチルメチルなどのα−
ナフチル−C1-2アルキル基などのC7-14アラルキル基
のほか、経口用エステルとして汎用されるピバロイルオ
キシメチルエステルなどが用いられる。
が通常カルボキシル基(−COOH)またはカルボキシ
レート(−COO-)であるが、C末端がアミド(−CO
NH 2)またはエステル(−COOR)であってもよ
い。ここでエステルにおけるRとしては、例えば、メチ
ル、エチル、n−プロピル、イソプロピルもしくはn−
ブチルなどのC1-6アルキル基、例えば、シクロペンチ
ル、シクロヘキシルなどのC3-8シクロアルキル基、例
えば、フェニル、α−ナフチルなどのC6-12アリール
基、例えば、ベンジル、フェネチルなどのフェニル−C
1-2アルキル基もしくはα−ナフチルメチルなどのα−
ナフチル−C1-2アルキル基などのC7-14アラルキル基
のほか、経口用エステルとして汎用されるピバロイルオ
キシメチルエステルなどが用いられる。
【0021】本発明ポリペプチド(蛋白質)がC末端以
外にカルボキシル基(またはカルボキシレート)を有し
ている場合、カルボキシル基がアミド化またはエステル
化されているものも本発明の蛋白質に含まれる。この場
合のエステルとしては、例えば上記したC末端のエステ
ルなどが用いられる。さらに、本発明の蛋白質には、N
末端のメチオニン残基のアミノ基が保護基(例えば、ホ
ルミル基、アセチル基などのC1-6アシル基など)で保
護されているもの、生体内で切断されて生成するN末端
のグルタミン酸残基がピログルタミン化したもの、分子
内のアミノ酸の側鎖上にある、例えばOH、COOH、
NH2、SHなどが適当な保護基(例えば、ホルミル
基、アセチル基などのC1-6アシル基など)で保護され
ているもの、あるいは糖鎖が結合したいわゆる糖蛋白質
などの複合蛋白質なども含まれる。
外にカルボキシル基(またはカルボキシレート)を有し
ている場合、カルボキシル基がアミド化またはエステル
化されているものも本発明の蛋白質に含まれる。この場
合のエステルとしては、例えば上記したC末端のエステ
ルなどが用いられる。さらに、本発明の蛋白質には、N
末端のメチオニン残基のアミノ基が保護基(例えば、ホ
ルミル基、アセチル基などのC1-6アシル基など)で保
護されているもの、生体内で切断されて生成するN末端
のグルタミン酸残基がピログルタミン化したもの、分子
内のアミノ酸の側鎖上にある、例えばOH、COOH、
NH2、SHなどが適当な保護基(例えば、ホルミル
基、アセチル基などのC1-6アシル基など)で保護され
ているもの、あるいは糖鎖が結合したいわゆる糖蛋白質
などの複合蛋白質なども含まれる。
【0022】本発明の蛋白質の部分ポリペプチドとして
は、前記した本発明ポリペプチド(蛋白質)の部分ペプ
チドであって、実質的に同質の活性を有するものであれ
ばいずれのものでもよい。例えば、本発明ポリペプチド
(蛋白質)の構成アミノ酸配列のうち少なくとも10個
以上、好ましくは50個以上、さらに好ましくは70個
以上、より好ましくは100個以上、最も好ましくは2
00個以上のアミノ酸配列を有し、例えば、本発明のポ
リペプチドの機能と実質的に同質の生物学的活性を有す
るするペプチドなどが用いられる。本発明の部分ポリペ
プチドとしては、例えば、各機能ドメインを含むものが
好ましい。又、本発明の部分ペプチドはC末端が通常カ
ルボキシル基(−COOH)またはカルボキシレート
(−COO-)であるが、前記した本発明の蛋白質のご
とく、C末端がアミド(−CONH2)またはエステル
(−COOR)であってもよい。さらに、本発明の部分
ペプチドには、前記した本発明の蛋白質と同様に、N末
端のメチオニン残基のアミノ基が保護基で保護されてい
るもの、N端側が生体内で切断され生成したグルタミル
基がピログルタミン酸化したもの、分子内のアミノ酸の
側鎖上の置換基が適当な保護基で保護されているもの、
あるいは糖鎖が結合したいわゆる糖ペプチドなどの複合
ペプチドなども含まれる。本発明の部分ペプチドは、例
えば、試薬、実験の際の標準物質、又は免疫源若しくは
その一部として使用することが出来る。
は、前記した本発明ポリペプチド(蛋白質)の部分ペプ
チドであって、実質的に同質の活性を有するものであれ
ばいずれのものでもよい。例えば、本発明ポリペプチド
(蛋白質)の構成アミノ酸配列のうち少なくとも10個
以上、好ましくは50個以上、さらに好ましくは70個
以上、より好ましくは100個以上、最も好ましくは2
00個以上のアミノ酸配列を有し、例えば、本発明のポ
リペプチドの機能と実質的に同質の生物学的活性を有す
るするペプチドなどが用いられる。本発明の部分ポリペ
プチドとしては、例えば、各機能ドメインを含むものが
好ましい。又、本発明の部分ペプチドはC末端が通常カ
ルボキシル基(−COOH)またはカルボキシレート
(−COO-)であるが、前記した本発明の蛋白質のご
とく、C末端がアミド(−CONH2)またはエステル
(−COOR)であってもよい。さらに、本発明の部分
ペプチドには、前記した本発明の蛋白質と同様に、N末
端のメチオニン残基のアミノ基が保護基で保護されてい
るもの、N端側が生体内で切断され生成したグルタミル
基がピログルタミン酸化したもの、分子内のアミノ酸の
側鎖上の置換基が適当な保護基で保護されているもの、
あるいは糖鎖が結合したいわゆる糖ペプチドなどの複合
ペプチドなども含まれる。本発明の部分ペプチドは、例
えば、試薬、実験の際の標準物質、又は免疫源若しくは
その一部として使用することが出来る。
【0023】本発明ポリペプチド(蛋白質)又はその部
分ペプチドの塩としては、とりわけ生理学的に許容され
る酸付加塩が好ましい。この様な塩としては、例えば、
無機酸(例えば、塩酸、リン酸、臭化水素酸、硫酸)と
の塩、あるいは有機酸(例えば、酢酸、ギ酸、プロピオ
ン酸、フマル酸、マレイン酸、コハク酸、酒石酸、クエ
ン酸、リンゴ酸、蓚酸、安息香酸、メタンスルホン酸、
ベンゼンスルホン酸)との塩などが用いられる。
分ペプチドの塩としては、とりわけ生理学的に許容され
る酸付加塩が好ましい。この様な塩としては、例えば、
無機酸(例えば、塩酸、リン酸、臭化水素酸、硫酸)と
の塩、あるいは有機酸(例えば、酢酸、ギ酸、プロピオ
ン酸、フマル酸、マレイン酸、コハク酸、酒石酸、クエ
ン酸、リンゴ酸、蓚酸、安息香酸、メタンスルホン酸、
ベンゼンスルホン酸)との塩などが用いられる。
【0024】本発明ポリペプチド(蛋白質)、その部分
ペプチドもしくはそれらの塩またはそれらのアミド体
は、当該技術分野で公知の化学合成方法を用いて調製す
ることも出来る。例えば、通常市販されている蛋白質合
成用樹脂を用い、α−アミノ基と側鎖官能基を適当に保
護したアミノ酸を、目的とする蛋白質の配列通りに、当
業界において自体公知の各種縮合方法に従い、樹脂上で
縮合させる。反応の最後に樹脂から蛋白質を切り出すと
同時に各種保護基を除去し、さらに高希釈溶液中で分子
内ジスルフィド結合形成反応を実施し、目的の蛋白質、
その部分ペプチドまたはそれらのアミド体を取得する。
上記した保護アミノ酸の縮合に関しては、例えば、DC
C、N,N'-ジイソプロピルカルボジイミド、及びN-エチル
-N'-(3-ジメチルアミノプロリル)カルボジイミドのよ
うなカルボジイミド類に代表される蛋白質合成に使用で
きる各種活性化試薬を用いることができる。これらによ
る活性化にはラセミ化抑制添加剤(例えば、HOBt, HOOB
t)とともに保護アミノ酸を直接樹脂に添加するかまた
は、対称酸無水物またはHOBtエステルあるいはHOOBtエ
ステルとしてあらかじめ保護アミノ酸の活性化を行なっ
た後に樹脂に添加することができる。
ペプチドもしくはそれらの塩またはそれらのアミド体
は、当該技術分野で公知の化学合成方法を用いて調製す
ることも出来る。例えば、通常市販されている蛋白質合
成用樹脂を用い、α−アミノ基と側鎖官能基を適当に保
護したアミノ酸を、目的とする蛋白質の配列通りに、当
業界において自体公知の各種縮合方法に従い、樹脂上で
縮合させる。反応の最後に樹脂から蛋白質を切り出すと
同時に各種保護基を除去し、さらに高希釈溶液中で分子
内ジスルフィド結合形成反応を実施し、目的の蛋白質、
その部分ペプチドまたはそれらのアミド体を取得する。
上記した保護アミノ酸の縮合に関しては、例えば、DC
C、N,N'-ジイソプロピルカルボジイミド、及びN-エチル
-N'-(3-ジメチルアミノプロリル)カルボジイミドのよ
うなカルボジイミド類に代表される蛋白質合成に使用で
きる各種活性化試薬を用いることができる。これらによ
る活性化にはラセミ化抑制添加剤(例えば、HOBt, HOOB
t)とともに保護アミノ酸を直接樹脂に添加するかまた
は、対称酸無水物またはHOBtエステルあるいはHOOBtエ
ステルとしてあらかじめ保護アミノ酸の活性化を行なっ
た後に樹脂に添加することができる。
【0025】保護アミノ酸の活性化や樹脂との縮合に用
いられる溶媒としては、酸アミド類、ハロゲン化炭化水
素類、アルコール類、スルオキシド類、及びエーテル類
等、当業界において蛋白質縮合反応に使用しうることが
知られている溶媒から適宜選択されうる。反応温度は蛋
白質結合形成反応に使用され得ることが知られている範
囲から適宜選択される。活性化されたアミノ酸誘導体は
通常1.5〜4倍過剰で用いられる。ニンヒドリン反応
を用いたテストの結果、縮合が不十分な場合には保護基
の脱離を行うことなく縮合反応を繰り返すことにより十
分な縮合を行なうことができる。反応を繰り返しても十
分な縮合が得られないときには、無水酢酸またはアセチ
ルイミダゾールを用いて未反応アミノ酸をアセチル化し
て、後の反応に影響を及ぼさないようにすることができ
る。原料の各アミノ基、カルボキシル基、及びセリン水
酸基等の保護基としても、当該技術分野において、通常
使用される基を使用することができる。原料の反応に関
与すべきでない官能基の保護ならびに保護基、およびそ
の保護基の脱離、反応に関与する官能基の活性化などは
公知の基または公知の手段から適宜選択しうる。
いられる溶媒としては、酸アミド類、ハロゲン化炭化水
素類、アルコール類、スルオキシド類、及びエーテル類
等、当業界において蛋白質縮合反応に使用しうることが
知られている溶媒から適宜選択されうる。反応温度は蛋
白質結合形成反応に使用され得ることが知られている範
囲から適宜選択される。活性化されたアミノ酸誘導体は
通常1.5〜4倍過剰で用いられる。ニンヒドリン反応
を用いたテストの結果、縮合が不十分な場合には保護基
の脱離を行うことなく縮合反応を繰り返すことにより十
分な縮合を行なうことができる。反応を繰り返しても十
分な縮合が得られないときには、無水酢酸またはアセチ
ルイミダゾールを用いて未反応アミノ酸をアセチル化し
て、後の反応に影響を及ぼさないようにすることができ
る。原料の各アミノ基、カルボキシル基、及びセリン水
酸基等の保護基としても、当該技術分野において、通常
使用される基を使用することができる。原料の反応に関
与すべきでない官能基の保護ならびに保護基、およびそ
の保護基の脱離、反応に関与する官能基の活性化などは
公知の基または公知の手段から適宜選択しうる。
【0026】本発明の部分ペプチドまたはそれらの塩
は、当該技術分野において自体公知のペプチドの合成法
に従って、あるいは本発明の蛋白質を適当なペプチダー
ゼで切断することによって製造することができる。ペプ
チドの合成法としては、例えば、固相合成法、液相合成
法のいずれによっても良い。公知の縮合方法や保護基の
脱離としては、例えば、以下の(1)〜(3)に記載さ
れた方法が挙げられる。 (1)泉屋信夫他、ペプチド合成の基礎と実験、 丸善
(株) (1975年) (2)矢島治明 および榊原俊平、生化学実験講座 1、
蛋白質の化学IV、 205、(1977年) (3)矢島治明監修、続医薬品の開発 第14巻 ペプチド
合成 広川書店 反応後の精製も自体公知の方法、例えば、溶媒抽出・蒸
留・カラムクロマトグラフィー・液体クロマトグラフィ
ー・再結晶などを組み合わせて本発明の部分ペプチドを
精製単離することができる。上記方法で得られる部分ペ
プチドが遊離体である場合は、公知の方法によって適当
な塩に変換することができるし、逆に塩で得られた場合
は、公知の方法によって遊離体に変換することができ
る。
は、当該技術分野において自体公知のペプチドの合成法
に従って、あるいは本発明の蛋白質を適当なペプチダー
ゼで切断することによって製造することができる。ペプ
チドの合成法としては、例えば、固相合成法、液相合成
法のいずれによっても良い。公知の縮合方法や保護基の
脱離としては、例えば、以下の(1)〜(3)に記載さ
れた方法が挙げられる。 (1)泉屋信夫他、ペプチド合成の基礎と実験、 丸善
(株) (1975年) (2)矢島治明 および榊原俊平、生化学実験講座 1、
蛋白質の化学IV、 205、(1977年) (3)矢島治明監修、続医薬品の開発 第14巻 ペプチド
合成 広川書店 反応後の精製も自体公知の方法、例えば、溶媒抽出・蒸
留・カラムクロマトグラフィー・液体クロマトグラフィ
ー・再結晶などを組み合わせて本発明の部分ペプチドを
精製単離することができる。上記方法で得られる部分ペ
プチドが遊離体である場合は、公知の方法によって適当
な塩に変換することができるし、逆に塩で得られた場合
は、公知の方法によって遊離体に変換することができ
る。
【0027】本発明ポリペプチド(蛋白質)、その部分
ペプチドまたはそれらの塩に対する抗体は、それらを認
識し得るものであれば、ポリクローナル抗体、モノクロ
ーナル抗体の何れであってもよい。本発明ポリペプチド
(蛋白質)、その部分ペプチドまたはそれらの塩に対す
る抗体は、本発明ポリペプチド(蛋白質)又はその部分
ペプチドを抗原として用い、公知の抗体または抗血清の
製造法に従って製造することができる。本発明の抗体
は、体液や組織などの被検体中に存在する本発明ポリペ
プチド(蛋白質)等を検出するために使用することがで
きる。また、これらを精製するために使用する抗体カラ
ムの作製、精製時の各分画中の本発明ポリペプチド(蛋
白質)の検出、被検細胞内における本発明ポリペプチド
(蛋白質)の挙動の分析などのために使用することがで
きる。
ペプチドまたはそれらの塩に対する抗体は、それらを認
識し得るものであれば、ポリクローナル抗体、モノクロ
ーナル抗体の何れであってもよい。本発明ポリペプチド
(蛋白質)、その部分ペプチドまたはそれらの塩に対す
る抗体は、本発明ポリペプチド(蛋白質)又はその部分
ペプチドを抗原として用い、公知の抗体または抗血清の
製造法に従って製造することができる。本発明の抗体
は、体液や組織などの被検体中に存在する本発明ポリペ
プチド(蛋白質)等を検出するために使用することがで
きる。また、これらを精製するために使用する抗体カラ
ムの作製、精製時の各分画中の本発明ポリペプチド(蛋
白質)の検出、被検細胞内における本発明ポリペプチド
(蛋白質)の挙動の分析などのために使用することがで
きる。
【0028】更に、本発明の抗体は、公知の方法による
被検液中の本発明ポリペプチド(蛋白質)等の定量、特
に、モノクローナル抗体を使用したサンドイッチ免疫測
定法による定量、及び組織染色等による検出などに使用
することができる。それによって、例えば、本発明ポリ
ペプチド(蛋白質)等が関与する疾病の診断を行なうこ
とができる。これらの目的には、抗体分子そのものを用
いてもよく、また、抗体分子のF(ab')2 、Fab'、
あるいはFab画分を用いてもよい。本発明の抗体を用
いる本発明の蛋白質等の定量法は、特に制限されるべき
ものではなく、被測定液中の抗原量(例えば、蛋白質
量)に対応した抗体、抗原もしくは抗体−抗原複合体の
量を化学的または物理的手段により検出し、これを既知
量の抗原を含む標準液を用いて作製した標準曲線より算
出する測定法であれば、いずれの測定法を用いてもよ
い。例えば、ネフロメトリー、競合法、イムノメトリッ
ク法およびサンドイッチ法が好適に用いられるが、感
度、特異性の点で、後述するサンドイッチ法を用いるの
が好ましい。標識物質を用いる測定法に用いられる標識
剤としては、当該技術分野で公知の、例えば、放射性同
位元素、酵素、蛍光物質、発光物質などを用いることが
出来る。
被検液中の本発明ポリペプチド(蛋白質)等の定量、特
に、モノクローナル抗体を使用したサンドイッチ免疫測
定法による定量、及び組織染色等による検出などに使用
することができる。それによって、例えば、本発明ポリ
ペプチド(蛋白質)等が関与する疾病の診断を行なうこ
とができる。これらの目的には、抗体分子そのものを用
いてもよく、また、抗体分子のF(ab')2 、Fab'、
あるいはFab画分を用いてもよい。本発明の抗体を用
いる本発明の蛋白質等の定量法は、特に制限されるべき
ものではなく、被測定液中の抗原量(例えば、蛋白質
量)に対応した抗体、抗原もしくは抗体−抗原複合体の
量を化学的または物理的手段により検出し、これを既知
量の抗原を含む標準液を用いて作製した標準曲線より算
出する測定法であれば、いずれの測定法を用いてもよ
い。例えば、ネフロメトリー、競合法、イムノメトリッ
ク法およびサンドイッチ法が好適に用いられるが、感
度、特異性の点で、後述するサンドイッチ法を用いるの
が好ましい。標識物質を用いる測定法に用いられる標識
剤としては、当該技術分野で公知の、例えば、放射性同
位元素、酵素、蛍光物質、発光物質などを用いることが
出来る。
【0029】これらの測定・検出方法に関する一般的な
技術手段の詳細については、総説、成書などを参照する
ことができる。例えば、入江 寛編「続ラジオイムノア
ッセイ〕(講談社、昭和54年発行)、石川栄治ら編
「酵素免疫測定法」(第3版)(医学書院、昭和62年
発行)、「Methods in ENZYMOLOGY」Vol. 70(Immunoche
mical Techniques(Part A))、 同書 Vol. 73(Immunoche
mical Techniques(PartB))、 同書 Vol. 74(Immunochem
ical Techniques(Part C))、 同書 Vol. 84(Immunochem
ical Techniques(Part D:Selected Immunoassays))、
同書 Vol. 92(Immunochemical Techniques(Part E:Mono
clonal Antibodies and General Immunoassay Method
s))、 同書 Vol. 121(Immunochemical Techniques(Part
I:HybridomaTechnology and Monoclonal Antibodies))
(以上、アカデミックプレス社発行)などを参照すること
ができる。
技術手段の詳細については、総説、成書などを参照する
ことができる。例えば、入江 寛編「続ラジオイムノア
ッセイ〕(講談社、昭和54年発行)、石川栄治ら編
「酵素免疫測定法」(第3版)(医学書院、昭和62年
発行)、「Methods in ENZYMOLOGY」Vol. 70(Immunoche
mical Techniques(Part A))、 同書 Vol. 73(Immunoche
mical Techniques(PartB))、 同書 Vol. 74(Immunochem
ical Techniques(Part C))、 同書 Vol. 84(Immunochem
ical Techniques(Part D:Selected Immunoassays))、
同書 Vol. 92(Immunochemical Techniques(Part E:Mono
clonal Antibodies and General Immunoassay Method
s))、 同書 Vol. 121(Immunochemical Techniques(Part
I:HybridomaTechnology and Monoclonal Antibodies))
(以上、アカデミックプレス社発行)などを参照すること
ができる。
【0030】本発明ポリペプチド(蛋白質)又はその部
分ポリペプチドをコードするDNAに実質的に相補的な
塩基配列を有するアンチセンスオリゴヌクレオチド(D
NA)としては、当該DNAの塩基配列に実質的に相補
的な塩基配列を有し、該DNAの発現を抑制し得る作用
を有するものであれば、いずれのアンチセンスDNAで
あってもよい。実質的に相補的な塩基配列とは、例え
ば、本発明DNAに相補的な塩基配列の全塩基配列また
は部分塩基配列と好ましくは約90以上、より好ましく
は約95%以上、最も好ましくは100%の相同性を有
する塩基配列などが挙げられる。又、これらアンチセン
スDNAと同様の作用を有する核酸配列(RNAまたは
DNAの修飾体)も本発明でいうアンチセンスDNAに
含まれる。これらのアンチセンスDNAは、公知のDN
A合成装置などを用いて製造することができる。
分ポリペプチドをコードするDNAに実質的に相補的な
塩基配列を有するアンチセンスオリゴヌクレオチド(D
NA)としては、当該DNAの塩基配列に実質的に相補
的な塩基配列を有し、該DNAの発現を抑制し得る作用
を有するものであれば、いずれのアンチセンスDNAで
あってもよい。実質的に相補的な塩基配列とは、例え
ば、本発明DNAに相補的な塩基配列の全塩基配列また
は部分塩基配列と好ましくは約90以上、より好ましく
は約95%以上、最も好ましくは100%の相同性を有
する塩基配列などが挙げられる。又、これらアンチセン
スDNAと同様の作用を有する核酸配列(RNAまたは
DNAの修飾体)も本発明でいうアンチセンスDNAに
含まれる。これらのアンチセンスDNAは、公知のDN
A合成装置などを用いて製造することができる。
【0031】更に、本発明ポリペプチド(蛋白質)等
は、これら物質と特異的に相互作用する化合物をスクリ
ーニングする為の試薬として有用である。すなわち、本
発明は、本発明ポリペプチド(蛋白質)、その部分ペプ
チド若しくはそれらの塩、又はそれらに対する抗体を用
いることを特徴とする、該物質又はそれらの塩と特異的
に相互作用する化合物のスクリーニング方法、及びその
為のスクリーニング用キットを提供する。本発明のスク
リーニング方法またはスクリーニング用キットを用いて
同定される化合物またはその塩は、上記した試験化合物
から選ばれた化合物であり、本発明ポリペプチド(蛋白
質)等と相互作用し、その生物学的活性を調節、阻害、
促進、又は拮抗等する化合物である。該化合物またはそ
の塩は、本発明の蛋白質等の活性に直接作用するもので
あってもよいし、本発明ポリペプチド(蛋白質)等の発
現に作用することによって間接的に本発明ポリペプチド
(蛋白質)等の活性に作用するものであってもよい。該
化合物の塩としては、例えば、薬学的に許容可能な塩な
どが用いられる。例えば、無機塩基との塩、有機塩基と
の塩、無機酸との塩、有機酸との塩、塩基性または酸性
アミノ酸との塩などがあげられる。本発明ポリペプチド
(蛋白質)等の生物学的活性を阻害する化合物も上記各
種疾病に対する治療・予防剤などの医薬として使用でき
る可能性がある。
は、これら物質と特異的に相互作用する化合物をスクリ
ーニングする為の試薬として有用である。すなわち、本
発明は、本発明ポリペプチド(蛋白質)、その部分ペプ
チド若しくはそれらの塩、又はそれらに対する抗体を用
いることを特徴とする、該物質又はそれらの塩と特異的
に相互作用する化合物のスクリーニング方法、及びその
為のスクリーニング用キットを提供する。本発明のスク
リーニング方法またはスクリーニング用キットを用いて
同定される化合物またはその塩は、上記した試験化合物
から選ばれた化合物であり、本発明ポリペプチド(蛋白
質)等と相互作用し、その生物学的活性を調節、阻害、
促進、又は拮抗等する化合物である。該化合物またはそ
の塩は、本発明の蛋白質等の活性に直接作用するもので
あってもよいし、本発明ポリペプチド(蛋白質)等の発
現に作用することによって間接的に本発明ポリペプチド
(蛋白質)等の活性に作用するものであってもよい。該
化合物の塩としては、例えば、薬学的に許容可能な塩な
どが用いられる。例えば、無機塩基との塩、有機塩基と
の塩、無機酸との塩、有機酸との塩、塩基性または酸性
アミノ酸との塩などがあげられる。本発明ポリペプチド
(蛋白質)等の生物学的活性を阻害する化合物も上記各
種疾病に対する治療・予防剤などの医薬として使用でき
る可能性がある。
【0032】本発明DNA及び該DNAを含む遺伝子を
プローブとして使用することにより、本発明ポリペプチ
ド又はその部分ペプチドをコードするDNAまたはmR
NAの異常(遺伝子異常)を検出することができるの
で、例えば、該DNAまたはmRNAの損傷、突然変異
あるいは発現低下や、該DNAまたはmRNAの増加あ
るいは発現過多などの遺伝子診断剤として有用である。
本発明のDNAを用いる上記の遺伝子診断は、例えば、
公知のノーザンハイブリダイゼーションやPCR−SS
CP法(Genomics,第5巻,874〜879頁(198
9年)、Proceedings of the National Academy of Sci
ences of the United States of America,第86巻,
2766〜2770頁(1989年))などにより実施
することができる。更に、本発明DNA又は遺伝子に異
常があったり、欠損している場合あるいは発現量が減少
している場合、生体内において正常な機能を発揮できな
い患者に対しては、公知手段に従って(1)レトロウイ
ルスベクター、アデノウイルスベクター、アデノウイル
スアソシエーテッドウイルスベクターなどの適当なベク
ターをベヒクルとして使用する遺伝子治療によって、本
発明DNA又は遺伝子を該患者体内に導入し、発現させ
るか、又は(2)本発明の蛋白質等を該患者に注入する
こと等によって、該患者において本発明の蛋白質等の機
能を発揮させることができるものと考えられる。本発明
DNA又は遺伝子を、該DNAを単独、又は、摂取促進
のための補助剤とともに、遺伝子銃やハイドロゲルカテ
ーテルのようなカテーテルによって投与することも可能
である。
プローブとして使用することにより、本発明ポリペプチ
ド又はその部分ペプチドをコードするDNAまたはmR
NAの異常(遺伝子異常)を検出することができるの
で、例えば、該DNAまたはmRNAの損傷、突然変異
あるいは発現低下や、該DNAまたはmRNAの増加あ
るいは発現過多などの遺伝子診断剤として有用である。
本発明のDNAを用いる上記の遺伝子診断は、例えば、
公知のノーザンハイブリダイゼーションやPCR−SS
CP法(Genomics,第5巻,874〜879頁(198
9年)、Proceedings of the National Academy of Sci
ences of the United States of America,第86巻,
2766〜2770頁(1989年))などにより実施
することができる。更に、本発明DNA又は遺伝子に異
常があったり、欠損している場合あるいは発現量が減少
している場合、生体内において正常な機能を発揮できな
い患者に対しては、公知手段に従って(1)レトロウイ
ルスベクター、アデノウイルスベクター、アデノウイル
スアソシエーテッドウイルスベクターなどの適当なベク
ターをベヒクルとして使用する遺伝子治療によって、本
発明DNA又は遺伝子を該患者体内に導入し、発現させ
るか、又は(2)本発明の蛋白質等を該患者に注入する
こと等によって、該患者において本発明の蛋白質等の機
能を発揮させることができるものと考えられる。本発明
DNA又は遺伝子を、該DNAを単独、又は、摂取促進
のための補助剤とともに、遺伝子銃やハイドロゲルカテ
ーテルのようなカテーテルによって投与することも可能
である。
【0033】本明細書および図面において、塩基やアミ
ノ酸などを略号で表示する場合、IUPAC−IUB C
ommision on Biochemical Nomenclature による略号あ
るいは当該分野における慣用略号に基づくものであり、
またアミノ酸に関し光学異性体があり得る場合は、特に
明示しなければL体を示すものとする。
ノ酸などを略号で表示する場合、IUPAC−IUB C
ommision on Biochemical Nomenclature による略号あ
るいは当該分野における慣用略号に基づくものであり、
またアミノ酸に関し光学異性体があり得る場合は、特に
明示しなければL体を示すものとする。
【0034】
【実施例】以下に、実施例により本発明をさらに具体的
に説明するが、本発明はそれに限定されるものではな
い。なお、実施例における各種遺伝子操作は、上記のCu
rrent protocols in molecular biology(edited by Fr
ederick M. Ausubel et al.,1987)に記載されている方
法に従った。
に説明するが、本発明はそれに限定されるものではな
い。なお、実施例における各種遺伝子操作は、上記のCu
rrent protocols in molecular biology(edited by Fr
ederick M. Ausubel et al.,1987)に記載されている方
法に従った。
【0035】(1)ヒト成人全脳、ヒト成人海馬及びヒ
ト胎児全脳由来cDNAライブラリーの構築 NotI部位を有するオリゴヌクレオチド(GACTA
GTTCTAGATCGCGAGCGGCCGCCC
(T)15)(インビトロジェン)をプライマーとし
て、ヒト成人全脳、ヒト成人海馬及びヒト胎児全脳由来
mRNA(クローンテック社)を鋳型にSuperScriptII
逆転写酵素キット(インビトロジェン)で2本鎖cDN
Aを合成した。SalI部位を有するアダプター(イン
ビトロジェン)をcDNAとライゲーションした。その
後、NotI消化し、1%濃度の低融解アガロース電気
泳動により、3kb以上のDNA断片を精製した。精製
cDNA断片を、SalI−NotI制限酵素処理した
pBluescript IISK+ プラスミドとライゲーションし
た。大腸菌 ElectroMax DH10B 株(インビトロジェン)
にエレクトロポレーション法によりこの組換えプラスミ
ドを導入した。
ト胎児全脳由来cDNAライブラリーの構築 NotI部位を有するオリゴヌクレオチド(GACTA
GTTCTAGATCGCGAGCGGCCGCCC
(T)15)(インビトロジェン)をプライマーとし
て、ヒト成人全脳、ヒト成人海馬及びヒト胎児全脳由来
mRNA(クローンテック社)を鋳型にSuperScriptII
逆転写酵素キット(インビトロジェン)で2本鎖cDN
Aを合成した。SalI部位を有するアダプター(イン
ビトロジェン)をcDNAとライゲーションした。その
後、NotI消化し、1%濃度の低融解アガロース電気
泳動により、3kb以上のDNA断片を精製した。精製
cDNA断片を、SalI−NotI制限酵素処理した
pBluescript IISK+ プラスミドとライゲーションし
た。大腸菌 ElectroMax DH10B 株(インビトロジェン)
にエレクトロポレーション法によりこの組換えプラスミ
ドを導入した。
【0036】(2)スクリーニング
次いで、こうして構築したcDNAライブラリーからラ
ンダムにクローンをピックアップし、メンブランにスポ
ッティングした。次に、これまでに本発明者等によって
既に全長の解析が行われている約1,300個のクローンの
塩基配列に基づき作成したオリゴDNA(各21塩基)
の混合物の各3’末端をターミナルトランスフェラーゼ
でDIGラベルし、これらをプルーブとして使用してド
ットハイブリダイゼーション(Current protocols in m
olecular biology(edited by Frederick M. Ausubel e
t al., 1987))により、重複クローン(繰り返し出て
くるクローン)を除いた。次に、インビトロでの転写翻
訳(プロメガ社TNT T7 Quick Coupled Transcription/T
ranslation System cat.no.L1107)を行い、50kDa
以上の産物が認められるクローンを選択した。次に、選
択したクローンの末端塩基配列を決定し、得られた配列
をクエリーとして相同検索プログラムBLASTN 2.0.14 (A
ltschul, Stephen F., Thomas L. Madden, Alejandro
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Mill
er, andDavid J. Lipman (1997), "Gapped BLAST and P
SI-BLAST: a new generation of protein database sea
rch programs", Nucleic Acids Res. 25:3389-3402)を
用いて、nr(All GenBank+EMBL+DDBJ+PDB sequences (bu
t no EST, STS,GSS, orphase 0,1 or 2 HTGS sequence
s))データベースに対して相同検索を行った。その結
果、相同遺伝子が存在しなかったもの、即ち、新規遺伝
子であるものについて全塩基配列を決定した。配列決定
には、PEアプライドバイオシステム社製のDNAシー
クエンサー(ABI PRISM377)と同社製反応キットを使用
した。大部分の配列はショットガンクローンをダイター
ミネーター法を用いて決定した。一部の塩基配列につい
ては、決定した塩基配列を元にしてオリゴヌクレオチド
を合成し、プライマーウォーキング法で決定した。
ンダムにクローンをピックアップし、メンブランにスポ
ッティングした。次に、これまでに本発明者等によって
既に全長の解析が行われている約1,300個のクローンの
塩基配列に基づき作成したオリゴDNA(各21塩基)
の混合物の各3’末端をターミナルトランスフェラーゼ
でDIGラベルし、これらをプルーブとして使用してド
ットハイブリダイゼーション(Current protocols in m
olecular biology(edited by Frederick M. Ausubel e
t al., 1987))により、重複クローン(繰り返し出て
くるクローン)を除いた。次に、インビトロでの転写翻
訳(プロメガ社TNT T7 Quick Coupled Transcription/T
ranslation System cat.no.L1107)を行い、50kDa
以上の産物が認められるクローンを選択した。次に、選
択したクローンの末端塩基配列を決定し、得られた配列
をクエリーとして相同検索プログラムBLASTN 2.0.14 (A
ltschul, Stephen F., Thomas L. Madden, Alejandro
A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Mill
er, andDavid J. Lipman (1997), "Gapped BLAST and P
SI-BLAST: a new generation of protein database sea
rch programs", Nucleic Acids Res. 25:3389-3402)を
用いて、nr(All GenBank+EMBL+DDBJ+PDB sequences (bu
t no EST, STS,GSS, orphase 0,1 or 2 HTGS sequence
s))データベースに対して相同検索を行った。その結
果、相同遺伝子が存在しなかったもの、即ち、新規遺伝
子であるものについて全塩基配列を決定した。配列決定
には、PEアプライドバイオシステム社製のDNAシー
クエンサー(ABI PRISM377)と同社製反応キットを使用
した。大部分の配列はショットガンクローンをダイター
ミネーター法を用いて決定した。一部の塩基配列につい
ては、決定した塩基配列を元にしてオリゴヌクレオチド
を合成し、プライマーウォーキング法で決定した。
【0037】このようにして新規DNA又は遺伝子のス
クリーニングを行なった。その結果、配列表の配列番号
1乃至44(但し、配列番号7、11及び25は除く)
のいずれか一つに示された新規DNA又は遺伝子が検出
された。これらの新規DNA又は遺伝子について、上記
の配列決定方法によりその塩基配列を決定した。本発明
DNA又は遺伝子を有するクローンの名称は表1乃至表
3に示されている。
クリーニングを行なった。その結果、配列表の配列番号
1乃至44(但し、配列番号7、11及び25は除く)
のいずれか一つに示された新規DNA又は遺伝子が検出
された。これらの新規DNA又は遺伝子について、上記
の配列決定方法によりその塩基配列を決定した。本発明
DNA又は遺伝子を有するクローンの名称は表1乃至表
3に示されている。
【0038】(3)本発明DNAの相同性検索
次に、こうして得られた全塩基配列に基づき、クローン
のアミノ酸配列を既知配列ライブラリーnr release 122
に対して解析プログラムBLASTP 2.0.14 (Altschul, Ste
phen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang,Zheng Zhang, Webb Miller, and David
J. Lipman (1997), "Gapped BLAST andPSI-BLAST: a ne
w generation of protein database search programs",
Nucleic Acids Res. 25:3389-3402)を用いて検索した
ところ表4〜表6に示した各相同遺伝子と相同性を示す
ことが明らかになった。尚、表4〜表6には、これら相
同遺伝子に関する情報、即ち、その名称、データベース
ID、生物種、蛋白質長、及び記載文献が挙げられてい
る。又、これら各表中の「生物種」の略号の意味は表7
で説明されている。
のアミノ酸配列を既知配列ライブラリーnr release 122
に対して解析プログラムBLASTP 2.0.14 (Altschul, Ste
phen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang,Zheng Zhang, Webb Miller, and David
J. Lipman (1997), "Gapped BLAST andPSI-BLAST: a ne
w generation of protein database search programs",
Nucleic Acids Res. 25:3389-3402)を用いて検索した
ところ表4〜表6に示した各相同遺伝子と相同性を示す
ことが明らかになった。尚、表4〜表6には、これら相
同遺伝子に関する情報、即ち、その名称、データベース
ID、生物種、蛋白質長、及び記載文献が挙げられてい
る。又、これら各表中の「生物種」の略号の意味は表7
で説明されている。
【0039】更に、各クローンに含まれる本発明DNA
又は遺伝子と表4〜表6に示した各相同遺伝子との相同
性に関する各種データを表8〜表9にまとめた。これら
表中の各項目の意味は以下の通りである。 「相同領域 クローン」クローンの相同領域の起点及び
終点 「相同領域 相同遺伝子」相同遺伝子の相同領域の起点
及び終点 「Score」この値が高ければ高いほど信頼度が高い 「E-value」この値が0に近ければ近いほど信頼度が高い 「相同性」相同領域のアミノ酸残基の一致の割合 「相同範囲率」相同遺伝子中の相同領域の割合
又は遺伝子と表4〜表6に示した各相同遺伝子との相同
性に関する各種データを表8〜表9にまとめた。これら
表中の各項目の意味は以下の通りである。 「相同領域 クローン」クローンの相同領域の起点及び
終点 「相同領域 相同遺伝子」相同遺伝子の相同領域の起点
及び終点 「Score」この値が高ければ高いほど信頼度が高い 「E-value」この値が0に近ければ近いほど信頼度が高い 「相同性」相同領域のアミノ酸残基の一致の割合 「相同範囲率」相同遺伝子中の相同領域の割合
【0040】(4)各種ドメインの検索
次に、クローンに含まれるDNAがコードするアミノ酸
配列中から、Pfam 6.0に含まれる検索ツールPfam HMM v
er 2.1 Search (HMMPFAM) (Sonnhammer ELL, Eddy SR,
Birney E, Bateman A, Durbin R (1998) Pfam: multipl
e sequence alignments and HMM-profiles of protein
domains, Nucleic Acids Research 26:320-322)を用い
て機能ドメインを検索した。更に、膜蛋白予測プログラ
ムであるSOSUI system (ver. 1.0 / 10, Mar., 1996)
(Takatsugu Hirokawa, Seah Boon-Chieng and Shigeki
Mitaku, SOSUI: Classification and Secondary Struct
ure Prediction System for Membrane Proteins, Bioin
formatics (formerly CABIOS) 1998 May;14(4):378-37
9.) を用いて膜貫通ドメインを検索した。これらの検出
された機能ドメイン及び膜貫通ドメインを表10〜表1
9にそれぞれのクローンについて示した。これら表中の
各項目の意味は以下の通りである。 「機能ドメイン」Pfam SOSUIにより検出されたドメイン 「クローン from」機能ドメインの起点 「クローン to」機能ドメインの終点 「相同遺伝子 from」機能ドメインの起点 「相同遺伝子 to」機能ドメインの終点 「Score(Pfamのみ)」この値が高ければ高いほど信頼度
が高い 「Exp(Pfamのみ)」この値が0に近ければ近いほど信頼度
が高い 又、各機能ドメインの完全標記を表20〜表21に示し
た。
配列中から、Pfam 6.0に含まれる検索ツールPfam HMM v
er 2.1 Search (HMMPFAM) (Sonnhammer ELL, Eddy SR,
Birney E, Bateman A, Durbin R (1998) Pfam: multipl
e sequence alignments and HMM-profiles of protein
domains, Nucleic Acids Research 26:320-322)を用い
て機能ドメインを検索した。更に、膜蛋白予測プログラ
ムであるSOSUI system (ver. 1.0 / 10, Mar., 1996)
(Takatsugu Hirokawa, Seah Boon-Chieng and Shigeki
Mitaku, SOSUI: Classification and Secondary Struct
ure Prediction System for Membrane Proteins, Bioin
formatics (formerly CABIOS) 1998 May;14(4):378-37
9.) を用いて膜貫通ドメインを検索した。これらの検出
された機能ドメイン及び膜貫通ドメインを表10〜表1
9にそれぞれのクローンについて示した。これら表中の
各項目の意味は以下の通りである。 「機能ドメイン」Pfam SOSUIにより検出されたドメイン 「クローン from」機能ドメインの起点 「クローン to」機能ドメインの終点 「相同遺伝子 from」機能ドメインの起点 「相同遺伝子 to」機能ドメインの終点 「Score(Pfamのみ)」この値が高ければ高いほど信頼度
が高い 「Exp(Pfamのみ)」この値が0に近ければ近いほど信頼度
が高い 又、各機能ドメインの完全標記を表20〜表21に示し
た。
【0041】(5)発現部位
RT-PCR Coupled ELISAを用いて、組織と脳の部位での発
現を、それぞれで一番強い発現を示したものを表22〜
表23に示した。尚、組織及び脳の部位の完全標記を表
24に示した。 (6)染色体位置 クローンのDNA配列に対応する、既知配列ライブラリーG
enbank release122中のヒトゲノム配列を解析プログラ
ムBLASTN 2.0.14 (Altschul, Stephen F., Thomas L. M
adden, Alejandro A. Schaffer, Jinghui Zhang, Zheng
Zhang, Webb Miller, and David J. Lipman (1997), "
Gapped BLAST and PSI-BLAST: a new generation of pr
otein database search programs", Nucleic Acids Re
s. 25:3389-3402)を用いて検索した。適合したクローン
の説明(Definition)の中からこのクローンが由来した染
色体の番号の記述を抽出し、これを表22〜表23に示
した。
現を、それぞれで一番強い発現を示したものを表22〜
表23に示した。尚、組織及び脳の部位の完全標記を表
24に示した。 (6)染色体位置 クローンのDNA配列に対応する、既知配列ライブラリーG
enbank release122中のヒトゲノム配列を解析プログラ
ムBLASTN 2.0.14 (Altschul, Stephen F., Thomas L. M
adden, Alejandro A. Schaffer, Jinghui Zhang, Zheng
Zhang, Webb Miller, and David J. Lipman (1997), "
Gapped BLAST and PSI-BLAST: a new generation of pr
otein database search programs", Nucleic Acids Re
s. 25:3389-3402)を用いて検索した。適合したクローン
の説明(Definition)の中からこのクローンが由来した染
色体の番号の記述を抽出し、これを表22〜表23に示
した。
【0042】以上の、相同性、相同性遺伝子に関する情
報、各種ドメイン、発現部位、及び染色体位置、等に基
づき、当業者であれば、本発明のDNA又は遺伝子が表
1〜表3に示した各機能を有するものと予測することが
出来る。尚、以下のクローンに含まれる遺伝子が完全長
であることは、以下の理由により判定した。bg00381
は、ヒトのオメガクラスのグルタチオン転移酵素とのア
ミノ酸配列のアラインメントから、ヒトのオメガクラス
のグルタチオン転移酵素の選択的スプライシングの一つ
のタイプの完全長であると考えられる。又、fk03350
は、マウスのネトリン G1とのアミノ酸配列のアライン
メントから、ヒトのネトリン G1の完全長であると考え
られる。又、fk06388は、ウサギのカルシウム依存性のS
olute carrier 蛋白質とのアミノ酸配列のアラインメン
トから、ヒトのカルシウム依存性のSolute carrier 蛋
白質の完全長であると考えられる。
報、各種ドメイン、発現部位、及び染色体位置、等に基
づき、当業者であれば、本発明のDNA又は遺伝子が表
1〜表3に示した各機能を有するものと予測することが
出来る。尚、以下のクローンに含まれる遺伝子が完全長
であることは、以下の理由により判定した。bg00381
は、ヒトのオメガクラスのグルタチオン転移酵素とのア
ミノ酸配列のアラインメントから、ヒトのオメガクラス
のグルタチオン転移酵素の選択的スプライシングの一つ
のタイプの完全長であると考えられる。又、fk03350
は、マウスのネトリン G1とのアミノ酸配列のアライン
メントから、ヒトのネトリン G1の完全長であると考え
られる。又、fk06388は、ウサギのカルシウム依存性のS
olute carrier 蛋白質とのアミノ酸配列のアラインメン
トから、ヒトのカルシウム依存性のSolute carrier 蛋
白質の完全長であると考えられる。
【0043】本発明で得られた新規なDNA又は遺伝子
を所謂DNAチップ等に集積させ、これに、例えば、精
神病等の脳が関与する疾患の患者と対照としての正常人
の血液又は組織等から作成したプローブをハイブリダイ
ゼーションさせることによって、これら疾患の診断、治
療等に役立てることが出来る。又、本発明のDNA若し
くは遺伝子又はそれらの一部の塩基配列に基づき作成し
た合成DNAプライマーを使用し、ヒトの血液又は組織
から抽出した染色体DNAを用いてPCRを行い、その
産物の塩基配列を決定することにより、本発明のDNA
又は遺伝子中にある個体によって異なる一塩基の変異、
即ち、cSNPsを見出すことが出来る。これにより、
個体の体質等が予測され、各自に適した医薬の開発等が
可能となる。又、クロスハイブリダイゼーションによ
り、マウス等のモデル生物における本発明のDNA又は
遺伝子に対するオルソログ(ホモログ、カウンターパー
ト)遺伝子を単離し、例えば、これら遺伝子をノックア
ウトすることによってヒトの疾患モデル動物を作成し、
ヒトの病因となる遺伝子を探索・同定することも可能で
ある。更に、本発明のDNA又は遺伝子に対する抗体を
網羅的に作成し、それらを集積させて所謂PROTEI
Nチップを作成し、患者と正常人との蛋白質発現量の差
異を検出する等のプロテオーム解析から、病気の診断・
治療等に役立てることが出来る。
を所謂DNAチップ等に集積させ、これに、例えば、精
神病等の脳が関与する疾患の患者と対照としての正常人
の血液又は組織等から作成したプローブをハイブリダイ
ゼーションさせることによって、これら疾患の診断、治
療等に役立てることが出来る。又、本発明のDNA若し
くは遺伝子又はそれらの一部の塩基配列に基づき作成し
た合成DNAプライマーを使用し、ヒトの血液又は組織
から抽出した染色体DNAを用いてPCRを行い、その
産物の塩基配列を決定することにより、本発明のDNA
又は遺伝子中にある個体によって異なる一塩基の変異、
即ち、cSNPsを見出すことが出来る。これにより、
個体の体質等が予測され、各自に適した医薬の開発等が
可能となる。又、クロスハイブリダイゼーションによ
り、マウス等のモデル生物における本発明のDNA又は
遺伝子に対するオルソログ(ホモログ、カウンターパー
ト)遺伝子を単離し、例えば、これら遺伝子をノックア
ウトすることによってヒトの疾患モデル動物を作成し、
ヒトの病因となる遺伝子を探索・同定することも可能で
ある。更に、本発明のDNA又は遺伝子に対する抗体を
網羅的に作成し、それらを集積させて所謂PROTEI
Nチップを作成し、患者と正常人との蛋白質発現量の差
異を検出する等のプロテオーム解析から、病気の診断・
治療等に役立てることが出来る。
【0044】
【表1】
【0045】
【表2】
【0046】
【表3】
【0047】
【表4】
【0048】
【表5】
【0049】
【表6】
【0050】
【表7】
【0051】
【表8】
【0052】
【表9】
【0053】
【表10】
【0054】
【表11】
【0055】
【表12】
【0056】
【表13】
【0057】
【表14】
【0058】
【表15】
【0059】
【表16】
【0060】
【表17】
【0061】
【表18】
【0062】
【表19】
【0063】
【表20】
【0064】
【表21】
【0065】
【表22】
【0066】
【表23】
【0067】
【表24】
【0068】
【配列表】
SEQUENCE LISTING
<110> Kazusa DNA Research Institute
<120> Novel Genes and Proteins Encoded by the Genes
<130> AB02016
<160> 44
<210> 1
<211> 5302
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2278)..(2700)
<400> 1
ctgagatgtt gtctgaactt ccagtcactc attgtgaagt ggaactgcag taactacatt 60
tggtgcaaaa tttcatgtcg atggaagctg atatgtcctg cagtctcaga gggacagaat 120
tctggtgtat ctccctttgg cctatgagga atgaagattt aggctctgag ggttttgtca 180
aaatttataa gatgtaagaa atggggacgt ctttaggcca tgaggcaagg cctagtggtg 240
taaagaaaca gcttctgacc cagggctcat gaagtcagca tggtgtcagg ggcgatggtg 300
gagtgagcat agtatggttt ttggaaccat tccttcctct tatttcttca caggccaggt 360
atgcacaacc agggggcatc tcgggcctga tgagcagagt cctgaagaga caaaggggca 420
atcctgagga agagacttga cagggaggaa ggagccacca atttcactcc tgagggaatt 480
cttcctcccc agtaaaccat aggattttcc tttgtctcag tgagagtttt cctgagtgaa 540
gagaaaagag agagaaaggg actgatgtgt tatctgttac caccacctga agtgattctg 600
aaacgttggg ggaaaggaga attatcttca gagtcataag caatccagag gccatggtta 660
cacataagag agtggaagtt cttatctctg gggggtaggc actaggcatt gatattggtt 720
aaagctcatt cactgattac aaagtgagac aaataaatac tccactgcaa gatttattca 780
tgacatttta tttcacacac aattcacttg ccttcacgca aagagctctt ggttccctgc 840
ctatggacac actgtaactg ggtcttccaa attcattctt ctggatgtaa aagaggacat 900
aggcctgttg actcaggaca gaagtgatac cggaggcagt gacctcggca tcatccattt 960
tataccaatg gccttcttga gtttgacata agataagtaa tgtccgttgt gacaactcca 1020
cccagcgtgg accagcacag catagaggac atagacaaga ggtcctgtgt tctgctgaga 1080
catgtatggc tgcatgtcaa ggcactcagg atattgcaca ttcttggcaa gtttgttgcc 1140
tgtgacctca gagaatctct tcaatacaag gatgaggacc ttggcagaag tgtgtaaagt 1200
taacgtcctg gaggcaggcg ccttctggag acaaagacca caatgatagg cattttatcc 1260
attgagttct tcgggcttca ccaactgtta caaagcttgc ttgacactct gagctgcctg 1320
gatatccagg gcgatgtcca ggtaagggtc aaaggtgtct gaaatcccgt ggcagtggaa 1380
acagttgatg tgagatctcc agtaccctcc aaatatttgg tggatgaggg tggtgtcctt 1440
agagtgatta tctaaatgct tgtgcccggg aaggaatgcc tttttcatgg catccacagt 1500
gaacatgaga aattcatggg catcttcctg ctcgcctcta tggaagccag cagccaacac 1560
ctgtgagggc tggatgacat ggccaggact gtggaggggc catgtgatgt gagcttgcat 1620
agtacagagc atgcagcact tgtgacgatg acacgtttga gagtgctccc gggacagcat 1680
gtagttggca aggggcggtg tgtatgtcag gcactgcagg gaagcgttca cgtagcaggt 1740
atttcccata ttctggagcc cagcccccac cgcagcaggt ctcctgctac tcagaggaag 1800
cttctccctg ggagcgagct gtcttgccac aggagccaaa tcatcacaga ggtcgacgcg 1860
ggtctcagat gagagtgatg acttctcaga gagagaagtc cgctggattt cagcaaaagc 1920
tgcatctggc cgagaagatg tgagttttga aaagtggttg aactgccact cacctcccaa 1980
gtagagtgag tcgtcctcca tgtcgcccgg aacaaggatc acaaggtttt tcggctggga 2040
ccgcaggttg cagaaagacg ctatctcttc cgagtcttca aatgacgggc tctctggccg 2100
catcagccct tatataactc acccccacca accgcgaaca ccccacccac ccatcaggtg 2160
cgcgataaac caatcaaata tcagcactca attaaggaat gagtcacagg gtgtgtcccc 2220
ttgcatcgct gggaattcaa cagacacagc ccacatcatg acttctagaa cacctga 2277
atc aaa cta ctc ctc agg ctg ata gac aca tgt aat atg agt gta acc 2325
Ile Lys Leu Leu Leu Arg Leu Ile Asp Thr Cys Asn Met Ser Val Thr
1 5 10 15
ggg ttg gga cag tgg cca cac agt tgc ctt att tta ggt aaa aca atg 2373
Gly Leu Gly Gln Trp Pro His Ser Cys Leu Ile Leu Gly Lys Thr Met
20 25 30
tca ggg aag aaa tct tta cct atg aaa ccg tgt gtg tgt gtg tgt gtg 2421
Ser Gly Lys Lys Ser Leu Pro Met Lys Pro Cys Val Cys Val Cys Val
35 40 45
tgt gtt ttt ctc ccc tac gtg tgg gtc ggc act tcc act gag atc act 2469
Cys Val Phe Leu Pro Tyr Val Trp Val Gly Thr Ser Thr Glu Ile Thr
50 55 60
ggc aca caa gca gag ccc tct tgc agt gtt tgt tct tcc ctt tgg ctc 2517
Gly Thr Gln Ala Glu Pro Ser Cys Ser Val Cys Ser Ser Leu Trp Leu
65 70 75 80
tcc tgg tcc tcc ctt gca gag aag cga gtg tgc cag tgt tca tgg act 2565
Ser Trp Ser Ser Leu Ala Glu Lys Arg Val Cys Gln Cys Ser Trp Thr
85 90 95
cct gat ctg tcg ggt tcg tcg aag aga ggt tta gca ggg agc ttt gct 2613
Pro Asp Leu Ser Gly Ser Ser Lys Arg Gly Leu Ala Gly Ser Phe Ala
100 105 110
gtt cag gat gat ggt ttt tca tcc cac act tgt att ttg att gat gaa 2661
Val Gln Asp Asp Gly Phe Ser Ser His Thr Cys Ile Leu Ile Asp Glu
115 120 125
tca caa gta cgt tgg gag gca ggg tac ctt caa gtt ttc tgacgttgaa 2710
Ser Gln Val Arg Trp Glu Ala Gly Tyr Leu Gln Val Phe
130 135 140
ctcaggcttc gttttgtttt gctcttggag gaatttccag tggtctaagg tgctttcctg 2770
agtggctctt tccaccaagt gctcgtccaa ctcgggtacc tggaggcagg ggtagtctct 2830
cttgagctct ccttgcgttg ctcgcctgtc tgtgtcttca gcgccgaggg ctcttggttc 2890
cctgcctctt gacacactct cactgtgtct ttcccattca ctcttgtgga tgtaaaagag 2950
gacataggcc tgttgactca ggacagaggt gatgccagag gcagtgacct cggcatcatc 3010
cattttatac cactggcctt cttgaacttt gacataagag aagtaatgtc cgttgtgaca 3070
actccacccg gcgtggacca gcacagcata gaggacatag acaagaggtc ctgtgttctg 3130
ctgagacatg tatggctgca tgtcaaggca ctcaggatat tgcacattct tggcaatttt 3190
gttgcctgtg acatcggaga atctcttcaa tacgaggatg aggatcttgg cagaagtgtg 3250
taaagtttac gtcttggagg ccggcgccct ctggagacaa agaccacaat gataggcatt 3310
ctctccattg agttcttcgg gcttcaccaa ctgttccaaa gcttgcttga cactctgagc 3370
ttcctggata tccagggcga tgtccaggta agggccaaaa gtgtctgaaa tgccgtggca 3430
gtggagacac ttgatttgag atctccagta ccctccaaat atttggtgga tgagggtggt 3490
gtccttggag tgatgatcta cctgcttgtg cccgggaagg catgcctttt tcatggcatc 3550
cacagtgaac atgagaaatt caagggcagc ttcctgcttg cctctatgga agccagcagc 3610
caatgcctgt gagggctgga tgacatggcc aggaatgtgg aggggccatg tgatgtgagc 3670
ttccatggta cagagcatgc agcacttgtg acgatgacat gtttgagagt gctcccggga 3730
cagcatgtag ttggcaaggg gcggtgtgta tgtcagacac tgctgggaag cgttcacgta 3790
gcaggtattt cccatattct ggagcccagc ccccaccgca gcaggtctcc tgctactcag 3850
aggaagcttc tccctgggag caagctgtct tgccacaggc gccaaatcat cgcagaagtc 3910
gacgcgggtc tcagttgaga gttgtgactt ctcagggaga gaagtccgct ggatttcagc 3970
aaaggctgca tctggccgag aagatgtgag ttttgaaaag tggttgaact gccactcacc 4030
tcccaagtag agtgagtcgt cctccatgtc gcccggaaca aggatcacaa ggtttttcgg 4090
ctgggacctt atgttgcaga aagactctat ctcttccgag agagtcttca aatgacgagc 4150
tctctggccg catcagccct tatataactc acccccacca acggcgaaca cctcacccac 4210
tcatcaggtg cgcgataagc caatcaaatg tcagcattta attaaggaat gagtcacagg 4270
gtgtgtcccc ttgcatcgct gggaattcaa cagacacagc ccacatcatg acttctagaa 4330
cacctgaatc acattactcc tcaggatgat aggcagatgt aatatgagtg taaccaggtt 4390
gggacagtgg ccacacagtt gccttatttt aggtaaaaca atgtcaggga agaaatcttt 4450
acctatgaag ccgtgtgtgt gtgtgtttgt gtgtttgtgt gtgtgtgttt gtgctgggat 4510
gaacctccaa gtatgtgctt ttggcagcta ccatcatcct ctcagcgacg gaaagagaag 4570
aagtcggaag tgcgctttct gacctgagaa tagtcaatga agtatagtat ttagcacagc 4630
gtattttttt ccctaataag aaaggagaga tccgtggaaa caaacaaacc ttccagcgat 4690
aagctttcca cgttcagcct attgattctc tatccgaatg aaattagctg ccagtggata 4750
acaagacaag tctttgcatg aaatgctctt ttggaagcta ggttgccaaa taataaagca 4810
tcatatggta gaaacacact gaagtttgaa gagatactca gtgcacaaag tagactgtga 4870
aagactttgt ggaaatcatg cagtcactga gagactaatt gatgacaatc cccaaattta 4930
tgtttgccag gaaagagaga tggtcatgac attgtatagt gaatgatttc ggatgtgcga 4990
tggcagttta agaaaacatg aaacggccgg gcgcggtggc tcacgcctgt aatcccagca 5050
ctttgggagg ccgaggcggg tggatcatga ggtcaggaga tcgagaccat cctggctaac 5110
aaagtgaaac cccgtctcta ctaaaaatac aaaaaattag ccgggcgcgg tggcgggcgc 5170
ctgtagtccc agctactcgg gaggctgagg caggagaatg gcgtgaaccc gggaagcgga 5230
gcttgcagtg agccgagatt gcgccactgc agtccgcagt ccggcctggg cgacagagcg 5290
agaccccgtc tc 5302
<210> 2
<211> 6113
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(1779)
<400> 2
ggg aag gaa atg aag agc aaa gtg gac acg att gtg aac ttc acc cac 48
Gly Lys Glu Met Lys Ser Lys Val Asp Thr Ile Val Asn Phe Thr His
1 5 10 15
cag cac ttc acc tcc cag ttc gag gtc act gtc tgg gca ccc agg ctc 96
Gln His Phe Thr Ser Gln Phe Glu Val Thr Val Trp Ala Pro Arg Leu
20 25 30
ccc ctg cag att gag atc tca gac acc gag ctg agc cag atc aag ggc 144
Pro Leu Gln Ile Glu Ile Ser Asp Thr Glu Leu Ser Gln Ile Lys Gly
35 40 45
tgg agg atc ccg gtt gct gcc aac aga agg cct acc cgg gaa agc gat 192
Trp Arg Ile Pro Val Ala Ala Asn Arg Arg Pro Thr Arg Glu Ser Asp
50 55 60
gac gag gac gat gag gag aag aag gga cga ggc tgc tcc ctg cag tac 240
Asp Glu Asp Asp Glu Glu Lys Lys Gly Arg Gly Cys Ser Leu Gln Tyr
65 70 75 80
cag cac gcc aca gtg cgt gtc ctc acc cag ttt gtg gcc gag tca cct 288
Gln His Ala Thr Val Arg Val Leu Thr Gln Phe Val Ala Glu Ser Pro
85 90 95
gac tta ggg cag ctg acc tac atg ctg ggc ccc gac tgg cag ttt gac 336
Asp Leu Gly Gln Leu Thr Tyr Met Leu Gly Pro Asp Trp Gln Phe Asp
100 105 110
atc act gac ctt gtg acc gag ttc atg aag gtg gag gag ccg aaa atc 384
Ile Thr Asp Leu Val Thr Glu Phe Met Lys Val Glu Glu Pro Lys Ile
115 120 125
gct cag tta cag gac ggc agg acc ctg gct ggt cgg gag ccg gga ata 432
Ala Gln Leu Gln Asp Gly Arg Thr Leu Ala Gly Arg Glu Pro Gly Ile
130 135 140
acc acg gtg cag gtc ctc tcg ccg ttg tct gac tcc atc ctg gct gag 480
Thr Thr Val Gln Val Leu Ser Pro Leu Ser Asp Ser Ile Leu Ala Glu
145 150 155 160
aag acg gtg att gtc ctg gat gac cga gtc acc atc gcg gag ctg gga 528
Lys Thr Val Ile Val Leu Asp Asp Arg Val Thr Ile Ala Glu Leu Gly
165 170 175
gtg cag ctc gta gct ggc atg tct ctc tcc ctg cag cca cac cga gca 576
Val Gln Leu Val Ala Gly Met Ser Leu Ser Leu Gln Pro His Arg Ala
180 185 190
gac aaa agg gcc atc gtc tcc aca gct gct gcc ctg gat gtt ctt cag 624
Asp Lys Arg Ala Ile Val Ser Thr Ala Ala Ala Leu Asp Val Leu Gln
195 200 205
tcc cca cag cag gaa gca ata gta agt tct tgg att ttg ttc agt gat 672
Ser Pro Gln Gln Glu Ala Ile Val Ser Ser Trp Ile Leu Phe Ser Asp
210 215 220
ggt tcg gtg aca cct tta gac att tac gat cct aag gat tat tct gtt 720
Gly Ser Val Thr Pro Leu Asp Ile Tyr Asp Pro Lys Asp Tyr Ser Val
225 230 235 240
act gtc tca tca ttg gat gaa atg gtg gtg tct gtc cag gca aac ctt 768
Thr Val Ser Ser Leu Asp Glu Met Val Val Ser Val Gln Ala Asn Leu
245 250 255
gag tcc aaa tgg cca att gtg gtt gca gag ggt gaa gga caa ggg cct 816
Glu Ser Lys Trp Pro Ile Val Val Ala Glu Gly Glu Gly Gln Gly Pro
260 265 270
ttg att aag tta gaa atg atg ata agt gaa cct tgt cag aag acc aag 864
Leu Ile Lys Leu Glu Met Met Ile Ser Glu Pro Cys Gln Lys Thr Lys
275 280 285
agg aag agt gtt ctt gcc gtg ggt aaa gga aat gtc aag gtc aaa ttc 912
Arg Lys Ser Val Leu Ala Val Gly Lys Gly Asn Val Lys Val Lys Phe
290 295 300
gaa cca agt agt gat gag cac caa gga ggc agc aat gat att gag ggc 960
Glu Pro Ser Ser Asp Glu His Gln Gly Gly Ser Asn Asp Ile Glu Gly
305 310 315 320
ata aat cgg gaa tat aaa gac cac ctc agt aat tcc ata gag cgc gaa 1008
Ile Asn Arg Glu Tyr Lys Asp His Leu Ser Asn Ser Ile Glu Arg Glu
325 330 335
gga aac cag gag aga gca gtc cag gaa tgg ttc cac cgt ggc aca cct 1056
Gly Asn Gln Glu Arg Ala Val Gln Glu Trp Phe His Arg Gly Thr Pro
340 345 350
gtt ggc caa gag gaa agt acc aac aaa agc aca acc ccc cag tct ccc 1104
Val Gly Gln Glu Glu Ser Thr Asn Lys Ser Thr Thr Pro Gln Ser Pro
355 360 365
atg gaa ggg aag aat aag tta ctc aaa agt ggt ggt cca gat gcc ttt 1152
Met Glu Gly Lys Asn Lys Leu Leu Lys Ser Gly Gly Pro Asp Ala Phe
370 375 380
aca agc ttc ccc act caa ggg aag tca ccg gac ccc aat aat cct agt 1200
Thr Ser Phe Pro Thr Gln Gly Lys Ser Pro Asp Pro Asn Asn Pro Ser
385 390 395 400
gac ctc aca gtg acc tca agg ggg cta acg gac ttg gag att ggc atg 1248
Asp Leu Thr Val Thr Ser Arg Gly Leu Thr Asp Leu Glu Ile Gly Met
405 410 415
tat gcc ttg ctc tgc gtc ttc tgt ctg gcc att ctg gtc ttc ttg atc 1296
Tyr Ala Leu Leu Cys Val Phe Cys Leu Ala Ile Leu Val Phe Leu Ile
420 425 430
aac tgc gtg gcg ttt gcc tgg aaa tac aga cac aaa agg ttt gct gtg 1344
Asn Cys Val Ala Phe Ala Trp Lys Tyr Arg His Lys Arg Phe Ala Val
435 440 445
agt gag cag ggc aac atc ccc cat tcc cac gac tgg gtc tgg ctt ggg 1392
Ser Glu Gln Gly Asn Ile Pro His Ser His Asp Trp Val Trp Leu Gly
450 455 460
aat gaa gtg gaa ctt ttg gag aac cct gtt gac att aca ctc cca tca 1440
Asn Glu Val Glu Leu Leu Glu Asn Pro Val Asp Ile Thr Leu Pro Ser
465 470 475 480
gag gag tgc aca acc atg ata gac agg ggc ctg cag ttc gag gag agg 1488
Glu Glu Cys Thr Thr Met Ile Asp Arg Gly Leu Gln Phe Glu Glu Arg
485 490 495
aac ttc ctt ctg aat ggc agt tcc cag aag act ttt cat agt caa cta 1536
Asn Phe Leu Leu Asn Gly Ser Ser Gln Lys Thr Phe His Ser Gln Leu
500 505 510
ctc aga ccc tct gac tat gtc tat gag aaa gaa att aaa aat gaa cct 1584
Leu Arg Pro Ser Asp Tyr Val Tyr Glu Lys Glu Ile Lys Asn Glu Pro
515 520 525
atg aat tct tcg ggc cca aag agg aag aga gtc aag ttc act tcc tac 1632
Met Asn Ser Ser Gly Pro Lys Arg Lys Arg Val Lys Phe Thr Ser Tyr
530 535 540
acc acc atc ctc cca gag gac ggc ggc cca tac acc aac tcc atc ctg 1680
Thr Thr Ile Leu Pro Glu Asp Gly Gly Pro Tyr Thr Asn Ser Ile Leu
545 550 555 560
ttt gac agc gat gat aac atc aag tgg gtc tgc caa gat atg ggg ctg 1728
Phe Asp Ser Asp Asp Asn Ile Lys Trp Val Cys Gln Asp Met Gly Leu
565 570 575
ggg gat tca cag gac ttt aga gac tat atg gaa agt ctg caa gac cag 1776
Gly Asp Ser Gln Asp Phe Arg Asp Tyr Met Glu Ser Leu Gln Asp Gln
580 585 590
atg taaactcctt tcttatgttt gtattcacct ttatgccttc tgttttttga 1829
Met
atgctggagc agtgagtttg atcagcaata ggggatgatt taacaaagtt tgatttgtgg 1889
aggtctggtg ggattcattt ctaagcaggt aaaagaggtt tggagagcta tagaagctgg 1949
gttttaagtt tggaaatgcc tctaaaacag ccacatgtgg ggactggaga aattctaaga 2009
caacagtttt atggactgcc tggtacgagc tcagtgcaaa tgtattaaac ctgaccccac 2069
agacattgtt agtcatctcg tgacaaatgg ccatgtggaa ttagaaaaga tttgggtgtt 2129
gatttttcta tttctagacc ttttaaagca cagtggacac ttattgcccc ttggcctgag 2189
tttagacata catgaaagat gactgaatgt agctatcctg atttgtcatg agcgctgctc 2249
attattttta tatacatttg cacctgcacc tgttcctttg acctctggaa ttctttttga 2309
aacatgaaga gaagaaattt cagtcttttc cttgagctct gtttgattta cacatgagtt 2369
ttcttccctg ggatttgcca tgccatgtta tttcctgggt ctccatgcag gatgtcaggt 2429
ttttccagtt ttctgcactt catcatctag ggatggtatc caagaagctt attctcactt 2489
tgttttttcc ttccttttgt tgttgaactt tcttgtataa cactgggaca aagggtttta 2549
cttaaaatgt taatatacag tgagggaggg ccatctcttt actgttttct atgtgaagtt 2609
tcaaacattt ggaaaaaagt tgtcaatcac aaacatttat ttttattctc tgcttgttga 2669
aaactcatgg ggaactcctg gagtgcattt ctgagggtaa aaaggcccca acatagctgt 2729
atttatacaa gtgctttgtg gccaacacaa tttactatcc aaactgaagt cttaaagtag 2789
aattgaccta cacagaaagg aaattgccta ggcaatagat gcagattaca ctttccattt 2849
taccagaaca aaagtttttt tttttttaaa taagtagaat gacagttcaa gttagcaata 2909
tatagaaatc catccataca atgaccaaaa taaataaata aataaaactc aatcatgact 2969
ttggcagaga taaaacattg acaagggtgt gcatttgtgg agagaagcag tgggtttccc 3029
cccaaaatcc atacaaataa atgactattc cagaactcaa gaagcagcta ttatagaaaa 3089
aaaaattaaa aaccacagct attgaaaaat caaaacggca agagaaaaaa agaaacaact 3149
cctgcataat gtttaaataa aaatgttatt ttaatagcat catgaatggt cttaaggcaa 3209
aaatttcaga ttagcaaaat gtgatgacat atttattaag attgtattat tggaaatgta 3269
gacactggac tagtatgcct cattatgagc tttactgagc catttgcatc tctaagaaat 3329
atgattttaa aggcccaaag taggaaggat ctgcaggggg tcacctgaag cagtgccctg 3389
actccaaatg gctgataact gagctaatcc aaggagaggc gttcagttgt tctcttatta 3449
tagatttcct tgggtgggaa catgacaacc ccgcctcctc tagccatgac gtggcagcca 3509
aaaagttctt cctgtatctc tggcccacag gactgtagtg tgtttgggaa cagtgctgaa 3569
attgcaagtg atgtaggcct gacctccaga gcccagcgtt cagttagttg tataaaatgg 3629
ggcttaggaa ccgaagctga tgaagttcca aaagtagttc agtttgatgg gataaggaag 3689
caaggagagc caggcattag ctgcaaaagg agcttgcaga gagatggctg acactttatg 3749
ggaagattcc tcagtcagcc aaaagctttt caaagagttt ggtgcagagg tagctcagag 3809
ccaaccatcc aggaccgaag aaagctcatg tgctgattga cacatagcta tccaggtgag 3869
tcggctcctt ctatgaggct tttcaatcgt ctctcccttt ggatctctgg cattccagaa 3929
agactgggaa ttcatttctg aaaatgatga acctccgctg aaacggggag gaaatctctt 3989
acacacatac agcagggtcc tgcccactgg taaagtgaga cgttgggtag ggggtgtaac 4049
tggggaggag gtgaaggaaa ttatccctgc catggggagt gttattcagg aacataaaaa 4109
ttagagcatt ccctctgatg gcagtgcttg acggggtgca gggaacaaac tggggagcgt 4169
tcaaacatgc cagtggtttc tcagcagggc tggaaactgg ccacagtcct gaagatgctt 4229
cttgtgacaa cttagagtca ctgggagtca cacagctcct gtgtagccgc ctttgctcct 4289
ctgtgttaaa acccagtcag ttcaaaataa caatcaaatg gaaggccatg ggggaagggt 4349
tccttccacc atataaccca cacccagatg tatataaacg catatacata catatacctg 4409
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg aatactgaag agccagacaa caaactattt 4469
agtaagaatc aacagtgact gtcaattgtt ctcagatttg gcatggatac tatttctcat 4529
gaaataatgt cattaaagta tttcctcctc taaatatttc ctgaagatca taaaccatgg 4589
gttatagcac cttgttttcc agctagtcaa tatacaacta ttaggtgctg tgccccaaaa 4649
tatctaaaat gggggtaaag atataaacta ttaagccatg aaatctaatt ttctggggac 4709
catgcatttg cctggatttt ccaggatttt atggctgata gacccaaatt attagtcttg 4769
ataaacaacc tgaaccaaac atttataaag tcccttcctc tcccatggga atggaatgct 4829
cagactcctg ctgagggaga caccagcaag cctcttgaga aggcaccaga cagtgaactc 4889
tgtggtcaca tgaagggagg ttcagatgct agacgaccag cgtatgacct ggaaccattg 4949
aggattgcaa ccgctgcctc cagggtccag atgtgctgca gtagagggca gagcacagga 5009
atcaagaagc agctgccaca ccaaagaact gatgcttaaa cacacaggtg tgttgcattt 5069
gagagaaggc aagaagaaaa gacaatagaa ggttaccctg gaaaaacaac acctagcaaa 5129
cctcagaata ctccatgtgt cccaagagaa ggaataattg tctttctagc tgttatatat 5189
tgcacactgg ccaggataaa ttcccagact ataataatcc tgactatgta gacggaagtt 5249
tctatttata aagtcatgca gggggaaagt tgattaattt ttgtcccatt gccacatctg 5309
tagactaagt aacttttaac ttgctaactt ttagaatgtc cacctttgtg cattttcgta 5369
gcattcttgt ttcattattt tagaaagggg tcttcccatt actgggatgt cggggccctg 5429
tcagagaact attattaagg ctctgtgagg gaactgttga gtctcacagc aaggaccaca 5489
gagcgtggtt tcatctgaac cctacaagtc agggacacca gtgtgttttc tactcgagtt 5549
acctaaaaat ggtctcaatt ttccatttca ttctctgtaa atgacacaat agatggttta 5609
tagactcagc agccctcacc ttttattaaa attatatatg tgtgatgtaa tgcatatcac 5669
ctgtcatgta aagggacggt atggatggtg gaaaagttat gctaaaatat ggattgcaga 5729
tatttttgta tgtaatatag gcaatataat gaaacaccga gttttttaaa gtgaaagcat 5789
gcaaaatcgt agcttttaaa tgtacagaca tcccactcaa aaatatctaa actgatagtg 5849
ggaaaaacat ttgagaccta gtaacatcat gaaatgcact gaatttggaa ttctggccta 5909
gaaaggctgt ggcttatgtt gggattgatg atggaatctg ccagaacatt ttcatcttat 5969
tcttcttgac ttttggattt ttttcttttc tttttttctg gaaatatttc ggaaataaag 6029
tgacttcatt tttcagcata aaagtatatt ctaaccacag ggtaacacat cgtttttaac 6089
atgaaaataa acatttaaac attc 6113
<210> 3
<211> 5213
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(5213)
<400> 3
ag cgg ttc ggg cca cag acg ctg gag cgc atc aca cgg gac gac gcg 47
Arg Phe Gly Pro Gln Thr Leu Glu Arg Ile Thr Arg Asp Asp Ala
1 5 10 15
gcc atc tgc acc acc gag tac tca cgc atc gtg ccc ctg gag aac gga 95
Ala Ile Cys Thr Thr Glu Tyr Ser Arg Ile Val Pro Leu Glu Asn Gly
20 25 30
gag atc gtg gtg tcc ctg gtg aac gga cgt ccg ggc gcc atg aat ttc 143
Glu Ile Val Val Ser Leu Val Asn Gly Arg Pro Gly Ala Met Asn Phe
35 40 45
tcc tac tcg ccg ctg cta cgt gag ttc acc aag gcc acc aac gtc cgc 191
Ser Tyr Ser Pro Leu Leu Arg Glu Phe Thr Lys Ala Thr Asn Val Arg
50 55 60
ctg cgc ttc ctg cgt acc aac acg ctg ctg ggc cat ctc atg ggg aag 239
Leu Arg Phe Leu Arg Thr Asn Thr Leu Leu Gly His Leu Met Gly Lys
65 70 75
gcg ctg cgg gac ccc acg gtc acc cgc cgg tat tat tac agc atc aag 287
Ala Leu Arg Asp Pro Thr Val Thr Arg Arg Tyr Tyr Tyr Ser Ile Lys
80 85 90 95
gat atc agc atc gga ggc cgc tgt gtc tgc cac ggc cac gcg gat gcc 335
Asp Ile Ser Ile Gly Gly Arg Cys Val Cys His Gly His Ala Asp Ala
100 105 110
tgc gat gcc aaa gac ccc acg gac ccg ttc agg ctg cag tgc acc tgc 383
Cys Asp Ala Lys Asp Pro Thr Asp Pro Phe Arg Leu Gln Cys Thr Cys
115 120 125
cag cac aac acc tgc ggg ggc acc tgc gac cgc tgc tgc ccc ggc ttc 431
Gln His Asn Thr Cys Gly Gly Thr Cys Asp Arg Cys Cys Pro Gly Phe
130 135 140
aat cag cag ccg tgg aag cct gcg act gcc aac agt gcc aac gag tgc 479
Asn Gln Gln Pro Trp Lys Pro Ala Thr Ala Asn Ser Ala Asn Glu Cys
145 150 155
cag tcc tgt aac tgc tac ggc cat gcc acc gac tgt tac tac gac cct 527
Gln Ser Cys Asn Cys Tyr Gly His Ala Thr Asp Cys Tyr Tyr Asp Pro
160 165 170 175
gag gtg gac cgg cgc cgc gcc agc cag agc ctg gat ggc acc tat cag 575
Glu Val Asp Arg Arg Arg Ala Ser Gln Ser Leu Asp Gly Thr Tyr Gln
180 185 190
ggt ggg ggt gtc tgt atc gac tgc cag cac cac acc acc ggc gtc aac 623
Gly Gly Gly Val Cys Ile Asp Cys Gln His His Thr Thr Gly Val Asn
195 200 205
tgt gag cgc tgc ctg ccc ggc ttc tac cgc tct ccc aac cac cct ctc 671
Cys Glu Arg Cys Leu Pro Gly Phe Tyr Arg Ser Pro Asn His Pro Leu
210 215 220
gac tcg ccc cac gtc tgc cgc cgc tgc aac tgc gag tcc gac ttc acg 719
Asp Ser Pro His Val Cys Arg Arg Cys Asn Cys Glu Ser Asp Phe Thr
225 230 235
gat ggc acc tgc gag gac ctg acg ggt cga tgc tac tgc cgg ccc aac 767
Asp Gly Thr Cys Glu Asp Leu Thr Gly Arg Cys Tyr Cys Arg Pro Asn
240 245 250 255
ttc tct ggg gag cgg tgt gac gtg tgt gcc gag ggc ttc acg ggc ttc 815
Phe Ser Gly Glu Arg Cys Asp Val Cys Ala Glu Gly Phe Thr Gly Phe
260 265 270
cca agc tgc tac ccg acg ccc tcg tcc tcc aat gac acc agg gag cag 863
Pro Ser Cys Tyr Pro Thr Pro Ser Ser Ser Asn Asp Thr Arg Glu Gln
275 280 285
gtg ctg cca gcc ggc cag att gtg aat tgt gac tgc agc gcg gca ggg 911
Val Leu Pro Ala Gly Gln Ile Val Asn Cys Asp Cys Ser Ala Ala Gly
290 295 300
acc cag ggc aac gcc tgc cgg aag gac cca agg gtg gga cgc tgt ctg 959
Thr Gln Gly Asn Ala Cys Arg Lys Asp Pro Arg Val Gly Arg Cys Leu
305 310 315
tgc aaa ccc aac ttc caa ggc acc cat tgt gag ctc tgc gcg cca ggg 1007
Cys Lys Pro Asn Phe Gln Gly Thr His Cys Glu Leu Cys Ala Pro Gly
320 325 330 335
ttc tac ggc ccc ggc tgc cag ccc tgc cag tgt tcc agc cct gga gtg 1055
Phe Tyr Gly Pro Gly Cys Gln Pro Cys Gln Cys Ser Ser Pro Gly Val
340 345 350
gcc gat gac cgc tgt gac cct gac aca ggc cag tgc agg tgc cga gtg 1103
Ala Asp Asp Arg Cys Asp Pro Asp Thr Gly Gln Cys Arg Cys Arg Val
355 360 365
ggc ttc gag ggg gcc aca tgt gat cgc tgt gcc ccc ggc tac ttt cac 1151
Gly Phe Glu Gly Ala Thr Cys Asp Arg Cys Ala Pro Gly Tyr Phe His
370 375 380
ttc cct ctc tgc cag ttg tgt ggc tgc agc cct gca gga acc ttg ccc 1199
Phe Pro Leu Cys Gln Leu Cys Gly Cys Ser Pro Ala Gly Thr Leu Pro
385 390 395
gag ggc tgc gat gag gcc ggc cgc tgc cta tgc cag cct gag ttt gct 1247
Glu Gly Cys Asp Glu Ala Gly Arg Cys Leu Cys Gln Pro Glu Phe Ala
400 405 410 415
gga cct cat tgt gac cgg tgc cgc cct ggc tac cat ggt ttc ccc aac 1295
Gly Pro His Cys Asp Arg Cys Arg Pro Gly Tyr His Gly Phe Pro Asn
420 425 430
tgc caa gca tgc acc tgc gac cct cgg gga gcc ctg gac cag ctc tgt 1343
Cys Gln Ala Cys Thr Cys Asp Pro Arg Gly Ala Leu Asp Gln Leu Cys
435 440 445
ggg gcg gga ggt ttg tgc cgc tgc cgc ccc ggc tac aca ggc act gcc 1391
Gly Ala Gly Gly Leu Cys Arg Cys Arg Pro Gly Tyr Thr Gly Thr Ala
450 455 460
tgc cag gaa tgc agc ccc ggc ttt cac ggc ttc ccc agc tgt gtc ccc 1439
Cys Gln Glu Cys Ser Pro Gly Phe His Gly Phe Pro Ser Cys Val Pro
465 470 475
tgc cac tgc tct gct gaa ggc tcc ctg cac gca gcc tgt gac ccc cgg 1487
Cys His Cys Ser Ala Glu Gly Ser Leu His Ala Ala Cys Asp Pro Arg
480 485 490 495
agt ggg cag tgc agc tgc cgg ccc cgt gtg acg ggg ctg cgg tgt gac 1535
Ser Gly Gln Cys Ser Cys Arg Pro Arg Val Thr Gly Leu Arg Cys Asp
500 505 510
aca tgt gtg ccc ggt gcc tac aac ttc ccc tac tgc gaa gct ggc tct 1583
Thr Cys Val Pro Gly Ala Tyr Asn Phe Pro Tyr Cys Glu Ala Gly Ser
515 520 525
tgc cac cct gcc ggt ctg gcc cca gtg gat cct gcc ctt cct gag gca 1631
Cys His Pro Ala Gly Leu Ala Pro Val Asp Pro Ala Leu Pro Glu Ala
530 535 540
cag gtt ccc tgt atg tgc cgg gct cac gtg gag ggg ccg agc tgt gac 1679
Gln Val Pro Cys Met Cys Arg Ala His Val Glu Gly Pro Ser Cys Asp
545 550 555
cgc tgc aaa cct ggg ttc tgg gga ctg agc ccc agc aac ccc gag ggc 1727
Arg Cys Lys Pro Gly Phe Trp Gly Leu Ser Pro Ser Asn Pro Glu Gly
560 565 570 575
tgt acc cgc tgc agc tgc gac ctc agg ggc aca ctg ggt gga gtt gct 1775
Cys Thr Arg Cys Ser Cys Asp Leu Arg Gly Thr Leu Gly Gly Val Ala
580 585 590
gag tgc cag ccg ggc acc ggc cag tgc ttc tgc aag ccc cac gtg tgc 1823
Glu Cys Gln Pro Gly Thr Gly Gln Cys Phe Cys Lys Pro His Val Cys
595 600 605
ggc cag gcc tgc gcg tcc tgc aag gat ggc ttc ttt gga ctg gat cag 1871
Gly Gln Ala Cys Ala Ser Cys Lys Asp Gly Phe Phe Gly Leu Asp Gln
610 615 620
gct gac tat ttt ggc tgc cgc agc tgc cgg tgt gac att ggc ggt gca 1919
Ala Asp Tyr Phe Gly Cys Arg Ser Cys Arg Cys Asp Ile Gly Gly Ala
625 630 635
ctg ggc cag agc tgt gaa ccg agg acg ggc gtc tgc cgg tgc cgc ccc 1967
Leu Gly Gln Ser Cys Glu Pro Arg Thr Gly Val Cys Arg Cys Arg Pro
640 645 650 655
aac acc cag ggc ccc acc tgc agc gag cct gcg agg gac cac tac ctc 2015
Asn Thr Gln Gly Pro Thr Cys Ser Glu Pro Ala Arg Asp His Tyr Leu
660 665 670
ccg gac ctg cac cac ctg cgc ctg gag ctg gag gag gct gcc aca cct 2063
Pro Asp Leu His His Leu Arg Leu Glu Leu Glu Glu Ala Ala Thr Pro
675 680 685
gag ggt cac gcc atg cgc ttt ggc ttc aac ccc ctc gag ttc gag aac 2111
Glu Gly His Ala Met Arg Phe Gly Phe Asn Pro Leu Glu Phe Glu Asn
690 695 700
ttc agc tgg agg ggc tac gcg cag atg gca cct gtc cag ccc agg atc 2159
Phe Ser Trp Arg Gly Tyr Ala Gln Met Ala Pro Val Gln Pro Arg Ile
705 710 715
gtg gcc agg ctg aac ctg acc tcc cct gac ctt ttc tgg ctc gtc ttc 2207
Val Ala Arg Leu Asn Leu Thr Ser Pro Asp Leu Phe Trp Leu Val Phe
720 725 730 735
cga tac gtc aac cgg ggg gcc atg agt gtg agc ggg cgg gtc tct gtg 2255
Arg Tyr Val Asn Arg Gly Ala Met Ser Val Ser Gly Arg Val Ser Val
740 745 750
cga gag gag ggc agg tcg gcc acc tgc gcc aac tgc aca gca cag agt 2303
Arg Glu Glu Gly Arg Ser Ala Thr Cys Ala Asn Cys Thr Ala Gln Ser
755 760 765
cag ccc gtg gcc ttc cca ccc agc acg gag cct gcc ttc atc acc gtg 2351
Gln Pro Val Ala Phe Pro Pro Ser Thr Glu Pro Ala Phe Ile Thr Val
770 775 780
ccc cag agg ggc ttc gga gag ccc ttt gtg ctg aac cct ggc acc tgg 2399
Pro Gln Arg Gly Phe Gly Glu Pro Phe Val Leu Asn Pro Gly Thr Trp
785 790 795
gcc ctg cgt gtg gag gcc gaa ggg gtg ctc ctg gac tac gtg gtt ctg 2447
Ala Leu Arg Val Glu Ala Glu Gly Val Leu Leu Asp Tyr Val Val Leu
800 805 810 815
ctg cct agc gca tac tac gag gcg gcg ctc ctg cag ctg cgg gtg act 2495
Leu Pro Ser Ala Tyr Tyr Glu Ala Ala Leu Leu Gln Leu Arg Val Thr
820 825 830
gag gcc tgc aca tac cgt ccc tct gcc cag cag tct ggc gac aac tgc 2543
Glu Ala Cys Thr Tyr Arg Pro Ser Ala Gln Gln Ser Gly Asp Asn Cys
835 840 845
ctc ctc tac aca cac ctc ccc ctg gat ggc ttc ccc tcg gcc gcc ggg 2591
Leu Leu Tyr Thr His Leu Pro Leu Asp Gly Phe Pro Ser Ala Ala Gly
850 855 860
ctg gag gcc ctg tgt cgc cag gac aac agc ctg ccc cgg ccc tgc ccc 2639
Leu Glu Ala Leu Cys Arg Gln Asp Asn Ser Leu Pro Arg Pro Cys Pro
865 870 875
acg gag cag ctc agc ccg tcg cac ccg cca ctg atc acc tgc acg ggc 2687
Thr Glu Gln Leu Ser Pro Ser His Pro Pro Leu Ile Thr Cys Thr Gly
880 885 890 895
agt gat gtg gac gtc cag ctt caa gtg gca gtg cca cag cca ggc cgc 2735
Ser Asp Val Asp Val Gln Leu Gln Val Ala Val Pro Gln Pro Gly Arg
900 905 910
tat gcc cta gtg gtg gag tac gcc aat gag gat gcc cgc cag gag gtg 2783
Tyr Ala Leu Val Val Glu Tyr Ala Asn Glu Asp Ala Arg Gln Glu Val
915 920 925
ggc gtg gcc gtg cac acc cca cag cgg gcc ccc cag cag ggg ctg ctc 2831
Gly Val Ala Val His Thr Pro Gln Arg Ala Pro Gln Gln Gly Leu Leu
930 935 940
tcc ctg cac ccc tgc ctg tac agc acc ctg tgc cgg ggc act gcc cgg 2879
Ser Leu His Pro Cys Leu Tyr Ser Thr Leu Cys Arg Gly Thr Ala Arg
945 950 955
gat acc cag gac cac ctg gct gtc ttc cac ctg gac tcg gag gcc agc 2927
Asp Thr Gln Asp His Leu Ala Val Phe His Leu Asp Ser Glu Ala Ser
960 965 970 975
gtg agg ctc aca gcc gaa cag gca cgc ttc ttc ctg cac ggg gtc act 2975
Val Arg Leu Thr Ala Glu Gln Ala Arg Phe Phe Leu His Gly Val Thr
980 985 990
ctg gtg ccc att gag gag ttc agc ccg gag ttc gtg gag ccc cgg gtc 3023
Leu Val Pro Ile Glu Glu Phe Ser Pro Glu Phe Val Glu Pro Arg Val
995 1000 1005
agc tgc atc agc agc cac ggc gcc ttt ggc ccc aac agt gcc gcc tgt 3071
Ser Cys Ile Ser Ser His Gly Ala Phe Gly Pro Asn Ser Ala Ala Cys
1010 1015 1020
ctg ccc tcg cgc ttc cca aag ccg ccc cag ccc atc atc ctc agg gac 3119
Leu Pro Ser Arg Phe Pro Lys Pro Pro Gln Pro Ile Ile Leu Arg Asp
1025 1030 1035
tgc cag gtg atc ccg ctg ccg ccc ggc ctc ccg ctg acc cac gcg cag 3167
Cys Gln Val Ile Pro Leu Pro Pro Gly Leu Pro Leu Thr His Ala Gln
1040 1045 1050 1055
gat ctc act cca gcc atg tcc cca gct gga ccc cga cct cgg ccc ccc 3215
Asp Leu Thr Pro Ala Met Ser Pro Ala Gly Pro Arg Pro Arg Pro Pro
1060 1065 1070
acc gct gtg gac cct gat gca gag ccc acc ctg ctg cgt gag ccc cag 3263
Thr Ala Val Asp Pro Asp Ala Glu Pro Thr Leu Leu Arg Glu Pro Gln
1075 1080 1085
gcc acc gtg gtc ttc acc acc cat gtg ccc acg ctg ggc cgc tat gcc 3311
Ala Thr Val Val Phe Thr Thr His Val Pro Thr Leu Gly Arg Tyr Ala
1090 1095 1100
ttc ctg ctg cac ggc tac cag cca gcc cac ccc acc ttc ccc gtg gaa 3359
Phe Leu Leu His Gly Tyr Gln Pro Ala His Pro Thr Phe Pro Val Glu
1105 1110 1115
gtc ctc atc aac gcc ggc cgc gtg tgg cag ggc cac gcc aac gcc agc 3407
Val Leu Ile Asn Ala Gly Arg Val Trp Gln Gly His Ala Asn Ala Ser
1120 1125 1130 1135
ttc tgt cca cat ggc tac ggc tgc cgc acc ctg gtg gtg tgt gag ggc 3455
Phe Cys Pro His Gly Tyr Gly Cys Arg Thr Leu Val Val Cys Glu Gly
1140 1145 1150
cag gcc ctg ctg gac gtg acc cac agc gag ctc act gtg acc gtg cgt 3503
Gln Ala Leu Leu Asp Val Thr His Ser Glu Leu Thr Val Thr Val Arg
1155 1160 1165
gtg ccc gag ggc cgg tgg ctc tgg ctg gat tat gta ctc gtg gtc cct 3551
Val Pro Glu Gly Arg Trp Leu Trp Leu Asp Tyr Val Leu Val Val Pro
1170 1175 1180
gag aac gtc tac agc ttt ggc tac ctc cgg gag gag ccc ctg gat aaa 3599
Glu Asn Val Tyr Ser Phe Gly Tyr Leu Arg Glu Glu Pro Leu Asp Lys
1185 1190 1195
tcc tat gac ttc atc agc cac tgc gca gcc cag ggc tac cac atc agc 3647
Ser Tyr Asp Phe Ile Ser His Cys Ala Ala Gln Gly Tyr His Ile Ser
1200 1205 1210 1215
ccc agc agc tca tcc ctg ttc tgc cga aac gct gct gct tcc ctc tcc 3695
Pro Ser Ser Ser Ser Leu Phe Cys Arg Asn Ala Ala Ala Ser Leu Ser
1220 1225 1230
ctc ttc tat aac aac gga gcc cgt cca tgt ggc tgc cac gaa gta ggt 3743
Leu Phe Tyr Asn Asn Gly Ala Arg Pro Cys Gly Cys His Glu Val Gly
1235 1240 1245
gct aca ggc ccc acg tgt gag ccc ttc ggg ggc cag tgt ccc tgc cat 3791
Ala Thr Gly Pro Thr Cys Glu Pro Phe Gly Gly Gln Cys Pro Cys His
1250 1255 1260
gcc cat gtc att ggc cgt gac tgc tcc cgc tgt gcc acc gga tac tgg 3839
Ala His Val Ile Gly Arg Asp Cys Ser Arg Cys Ala Thr Gly Tyr Trp
1265 1270 1275
ggc ttc ccc aac tgc agg ccc tgt gac tgc ggt gcc cgc ctc tgt gac 3887
Gly Phe Pro Asn Cys Arg Pro Cys Asp Cys Gly Ala Arg Leu Cys Asp
1280 1285 1290 1295
gag ctc acg ggc cag tgc atc tgc ccg cca cgc acc atc ccg ccc gac 3935
Glu Leu Thr Gly Gln Cys Ile Cys Pro Pro Arg Thr Ile Pro Pro Asp
1300 1305 1310
tgc ctg ctg tgc cag ccc cag acc ttt ggc tgc cac ccc ctg gtc ggc 3983
Cys Leu Leu Cys Gln Pro Gln Thr Phe Gly Cys His Pro Leu Val Gly
1315 1320 1325
tgt gag gag tgt aac tgc tca ggg ccc ggc atc cag gag ctc aca gac 4031
Cys Glu Glu Cys Asn Cys Ser Gly Pro Gly Ile Gln Glu Leu Thr Asp
1330 1335 1340
cct acc tgt gac aca gac agc ggc cag tgc aag tgc aga ccc aac gtg 4079
Pro Thr Cys Asp Thr Asp Ser Gly Gln Cys Lys Cys Arg Pro Asn Val
1345 1350 1355
act ggg cgc cgc tgt gat acc tgc tct ccg ggc ttc cat ggc tac ccc 4127
Thr Gly Arg Arg Cys Asp Thr Cys Ser Pro Gly Phe His Gly Tyr Pro
1360 1365 1370 1375
cgc tgc cgc ccc tgt gac tgt cac gag gcg ggc act gcg cct ggc gtg 4175
Arg Cys Arg Pro Cys Asp Cys His Glu Ala Gly Thr Ala Pro Gly Val
1380 1385 1390
tgt gac ccc ctc aca ggg cag tgc tac tgt aag gag aac gtg cag ggc 4223
Cys Asp Pro Leu Thr Gly Gln Cys Tyr Cys Lys Glu Asn Val Gln Gly
1395 1400 1405
ccc aaa tgt gac cag tgc agc ctt ggg acc ttc tca ctg gat gct gcc 4271
Pro Lys Cys Asp Gln Cys Ser Leu Gly Thr Phe Ser Leu Asp Ala Ala
1410 1415 1420
aac ccc aaa ggt tgc acc cgc tgc ttc tgc ttt ggg gcc acg gag cgc 4319
Asn Pro Lys Gly Cys Thr Arg Cys Phe Cys Phe Gly Ala Thr Glu Arg
1425 1430 1435
tgc cgg agc tcg tcc tac acc cgc cag gag ttc gtg gat atg gag gga 4367
Cys Arg Ser Ser Ser Tyr Thr Arg Gln Glu Phe Val Asp Met Glu Gly
1440 1445 1450 1455
tgg gtg ctg ctg agc act gac cgg cag gtg gtg ccc cac gag cgg cag 4415
Trp Val Leu Leu Ser Thr Asp Arg Gln Val Val Pro His Glu Arg Gln
1460 1465 1470
cca ggg acg gag atg ctc cgt gca gac ctg cgg cac gtg cct gag gct 4463
Pro Gly Thr Glu Met Leu Arg Ala Asp Leu Arg His Val Pro Glu Ala
1475 1480 1485
gtg ccc gag gct ttc ccc gag ctg tac tgg cag gcc cca ccc tcc tac 4511
Val Pro Glu Ala Phe Pro Glu Leu Tyr Trp Gln Ala Pro Pro Ser Tyr
1490 1495 1500
ctg ggg gac cgg gtg tca tcc tac ggt ggg acc ctc cgt tat gaa ctg 4559
Leu Gly Asp Arg Val Ser Ser Tyr Gly Gly Thr Leu Arg Tyr Glu Leu
1505 1510 1515
cac tca gag acc cag cgg gga gat gtc ttt gtc ccc atg gag agc agg 4607
His Ser Glu Thr Gln Arg Gly Asp Val Phe Val Pro Met Glu Ser Arg
1520 1525 1530 1535
ccg gat gtg gtg ctg cag ggc aac cag atg agc atc aca ttc ctg gag 4655
Pro Asp Val Val Leu Gln Gly Asn Gln Met Ser Ile Thr Phe Leu Glu
1540 1545 1550
ccg gca tac ccc acg cct ggc cac gtt cac cgt ggg cag ctg cag ctg 4703
Pro Ala Tyr Pro Thr Pro Gly His Val His Arg Gly Gln Leu Gln Leu
1555 1560 1565
gtg gag ggg aac ttc cgg cat acg gag act cgc aac act gtg tcc cgc 4751
Val Glu Gly Asn Phe Arg His Thr Glu Thr Arg Asn Thr Val Ser Arg
1570 1575 1580
gag gag ctc atg atg gtg ctg gcc agc ctg gag cag ctg cag atc cgt 4799
Glu Glu Leu Met Met Val Leu Ala Ser Leu Glu Gln Leu Gln Ile Arg
1585 1590 1595
gcc ctc ttc tca cag atc tcc tcg gct gtc tcc ctg cgc agg gtg gca 4847
Ala Leu Phe Ser Gln Ile Ser Ser Ala Val Ser Leu Arg Arg Val Ala
1600 1605 1610 1615
ctg gag gtg gcc agc cca gca ggc cag ggg gcc ctg gcc agc aat gtg 4895
Leu Glu Val Ala Ser Pro Ala Gly Gln Gly Ala Leu Ala Ser Asn Val
1620 1625 1630
gag ctg tgc ctg tgc ccc gcc agc tac cgg ggg gac tca tgc cag gaa 4943
Glu Leu Cys Leu Cys Pro Ala Ser Tyr Arg Gly Asp Ser Cys Gln Glu
1635 1640 1645
tgt gcc ccc ggc ttc tat cgg gac gtc aaa ggt ctc ttc ctg ggc cga 4991
Cys Ala Pro Gly Phe Tyr Arg Asp Val Lys Gly Leu Phe Leu Gly Arg
1650 1655 1660
tgt gtc cct tgt cag tgc cat gga cac tca gac cgc tgc ctc cct ggc 5039
Cys Val Pro Cys Gln Cys His Gly His Ser Asp Arg Cys Leu Pro Gly
1665 1670 1675
tct ggc gtc tgt gtg gac tgc cag cac aac acc gaa ggg gcc cac tgt 5087
Ser Gly Val Cys Val Asp Cys Gln His Asn Thr Glu Gly Ala His Cys
1680 1685 1690 1695
gag cgc tgc cag gct ggc ttc atg agc agc agg gac gac ccc agc gcc 5135
Glu Arg Cys Gln Ala Gly Phe Met Ser Ser Arg Asp Asp Pro Ser Ala
1700 1705 1710
ccc tgt gtc agc tgc ccc tgc ccc ctc tca gtg cct tcc aac aac ttc 5183
Pro Cys Val Ser Cys Pro Cys Pro Leu Ser Val Pro Ser Asn Asn Phe
1715 1720 1725
gcc gag ggc tgt gtc ctg cga ggc ggc cgc 5213
Ala Glu Gly Cys Val Leu Arg Gly Gly Arg
1730 1735
<210> 4
<211> 5834
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2480)..(5464)
<400> 4
tgctgaagga aagagtaaag ataaaggcga tatggaggtg tacatccaac ctggtccata 60
tgtggatcct ttcacgacag tgaccctggg ctggccagac aatgacaagg agttacgctt 120
ccaatggtca tgtgggtctt gctgggctct gtggagcagc tgtgttgaga ggcagctgct 180
tcgcacagac cagagggagc tggtggttcc agcatcctgc ctgccgccgc ctgactctgc 240
tgtcaccctg cgcctggctg ttctgagagg ccaagagctg gagaacaggg cagagcagtg 300
cctctacgtg tctgcgccct gggaactcag gcctcgagtc agctgtgaga ggaactgcag 360
gccagttaat gccagcaaag acattctgct cagggtcacc atgggggagg actctccagt 420
ggctatgttc agctggtatt tggacaacac cccaacagag caggctgagc ccctcccgga 480
tgcctgcaga ctcagaggat tttggccaag gtccttaacc ctcctccaga gcaacacctc 540
cacgttgctg ttgaacagct cgtttctgca gtcccgggga gaggtcatcc gaatcagagc 600
cacagcactg accaggcatg cctatgggga ggacacctat gtgatcagca ctgtgcctcc 660
ccgtgaggtg cctgcctgca ctattgcccc agaggagggc accgttctga cgagctttgc 720
catcttctgc aacgcctcca cagccctggg acccctggag ttctgcttct gtctggaatc 780
aggttcctgc ctacactgtg gccctgaacc tgccctccca tcagtgtatc tgccacttgg 840
agaggagaac aatgactttg tgctgacagt agttatttct gccaccaatc gtgcagggga 900
cacgcagcag acccaggcca tggctaaggt ggtactcgga gacacatgtg ttgaggatgt 960
agcattccag gctgccgtgt cagagaaaat ccccacagct ctgcaaggcg agggtggccc 1020
cgagcagctc ctccagctgg ccaaggctgt gtcctccatg ctgaaccaag agcatgaaag 1080
ccagggctca ggacagtcac tgagcataga cgtcagacag aaggtcagag agcatgtgct 1140
gggatcactg tctgcagtca ccaccggctt ggaggacgtg cagagggtgc aggagctggc 1200
cgaggtgctg agagaggtga cctgccggag taaggaactc acaccctcgg cccagtggga 1260
ggccagcttg gctctacagc atgccagtga ggccctgttg acagtgagtg ccaaggcccg 1320
ccctgaggac cagaggcgcc aggcagccac cagggacctg tttcaggctg tgggcagtgt 1380
gctggaagct tccctgagca acagaccaga agagcctgcg gaggccagca gcagccagat 1440
tgccacagtg ccacggctgc tttgagtcgt ggagcatgtg cagaccgccc tcctgctggg 1500
aaaactgcca gggggccttc cagccatgct ggccaccccc tccatctctg tgtacacaaa 1560
caggtacaaa ccagctgtgt tacaggtccc caagaaagct catcccctaa cctggttcat 1620
tccacaccgg cactccaaag ccctgtgcta ggcagagctc aggagaccag ccttggagct 1680
agagaagtga tgggacctag agctgggaag ggcttcccac cgagagaagg cccccttgtc 1740
accatgttga gtgtttgcac agcacaggaa gggaggcatg gagtgccaag cccagtgtta 1800
gatccacctg aaggagtcct gagagacggt cacatggcag ctgtgcaggt aagggggctg 1860
aatgaatggg actagcctgc tgttaaccat gctgtgcatg tgtctgacca gccactgctc 1920
cttgggcctg cagggactca agatcctctt acttttggcg tgcgcgcgca cacacaaaca 1980
cacacacaca cagagggctc atctctcagt gagcactgat gccatacctg ggatgctcca 2040
caccttccat tcctccacct gacaagcagg agctggacaa tgctgctggc aacactggga 2100
ccctctgccc caagcctcag ttcctcctgc cagggtttcc ctcttagatc tcacagacag 2160
tttcccaaag cagtatcctg attagaggaa cgatctgaag tggggtcccc aacaagatca 2220
gaaaggggca aggtcaaaga ccagaaaaga tggggcgggg ttaagacatg acgtcagagg 2280
tccaggtggg aaggggagat aaccctggag aaaaacctgc atccaagctc atttattcat 2340
tcaacgaaaa tactgatgga ctacctgtta tatgccaggc gctgctgtag acattggggg 2400
tacagtagta aaccaggctg gcaaagcccc accctcatgg agcctccagt ctagctgcct 2460
cttgtgctgc ttctgctga aac aaa cat gct gaa cca cct atg gga atc aaa 2512
Asn Lys His Ala Glu Pro Pro Met Gly Ile Lys
1 5 10
aca cac ccc cac tct gat ggt gaa gct ggg ctc cca gta gtg cag gag 2560
Thr His Pro His Ser Asp Gly Glu Ala Gly Leu Pro Val Val Gln Glu
15 20 25
gtg tgg ggg ctc cct cga aca ctg act ttc tcc tcc ctc ccc tgc cac 2608
Val Trp Gly Leu Pro Arg Thr Leu Thr Phe Ser Ser Leu Pro Cys His
30 35 40
ctc tgt cct cac ctt gtg gcc aga ata caa ccc tgg agt tgg caa ggc 2656
Leu Cys Pro His Leu Val Ala Arg Ile Gln Pro Trp Ser Trp Gln Gly
45 50 55
tcc tcc ctg cgc cct gat gcc gca gac tct gca acc ttc atg ctg ccc 2704
Ser Ser Leu Arg Pro Asp Ala Ala Asp Ser Ala Thr Phe Met Leu Pro
60 65 70 75
gct gcc tcc tcc ctc agc tct ctg gag ggc ggc cag gag ccc gtg gat 2752
Ala Ala Ser Ser Leu Ser Ser Leu Glu Gly Gly Gln Glu Pro Val Asp
80 85 90
ata aag atc atg agt ttc cca aag agc ccc ttt cca gcc cga agc cac 2800
Ile Lys Ile Met Ser Phe Pro Lys Ser Pro Phe Pro Ala Arg Ser His
95 100 105
ttt gat gtc agc ggg act gtc ggt ggc ctc cgt gtg acc agc cct agt 2848
Phe Asp Val Ser Gly Thr Val Gly Gly Leu Arg Val Thr Ser Pro Ser
110 115 120
ggt caa ctc ata cct gtg aag aat ctg tcg gag aat atc gag atc ctg 2896
Gly Gln Leu Ile Pro Val Lys Asn Leu Ser Glu Asn Ile Glu Ile Leu
125 130 135
ctg ccc cgg cat tca caa aga cac agc cag ccg acc gtg ttg aac ctg 2944
Leu Pro Arg His Ser Gln Arg His Ser Gln Pro Thr Val Leu Asn Leu
140 145 150 155
acc agt cct gaa gct ttg tgg gtg aac gtg act tca ggg gag gca acc 2992
Thr Ser Pro Glu Ala Leu Trp Val Asn Val Thr Ser Gly Glu Ala Thr
160 165 170
ttg ggg atc cag ctg cac tgg aga ccg gac att gca ctc acg ctt agc 3040
Leu Gly Ile Gln Leu His Trp Arg Pro Asp Ile Ala Leu Thr Leu Ser
175 180 185
ctg ggc tat ggc tac cac ccc aac aag agc agc tac gat gcc caa act 3088
Leu Gly Tyr Gly Tyr His Pro Asn Lys Ser Ser Tyr Asp Ala Gln Thr
190 195 200
cac ctc gta cca atg gtg gct cca gat gag ctg ccc acg tgg atc ctg 3136
His Leu Val Pro Met Val Ala Pro Asp Glu Leu Pro Thr Trp Ile Leu
205 210 215
agc cca cag gac ctg cgt ttt gga gaa ggg gtc tac tat ttg act gtg 3184
Ser Pro Gln Asp Leu Arg Phe Gly Glu Gly Val Tyr Tyr Leu Thr Val
220 225 230 235
gtc cct gag tct gac ctg gag cca gcc ccc ggc agg gac ctc acg gtt 3232
Val Pro Glu Ser Asp Leu Glu Pro Ala Pro Gly Arg Asp Leu Thr Val
240 245 250
ggc atc acc acc ttc ctg tct cac tgt gtg ttc tgg gat gag gtc cag 3280
Gly Ile Thr Thr Phe Leu Ser His Cys Val Phe Trp Asp Glu Val Gln
255 260 265
gag act tgg gac gac tca gga tgc cag gtg ggg cct cgg acc agc ccc 3328
Glu Thr Trp Asp Asp Ser Gly Cys Gln Val Gly Pro Arg Thr Ser Pro
270 275 280
tac cag aca cac tgc ctc tgc aac cac ctc act ttc ttc gga agc acg 3376
Tyr Gln Thr His Cys Leu Cys Asn His Leu Thr Phe Phe Gly Ser Thr
285 290 295
ttc ctg gtg atg ccc aat gcc atc gac gtc cac cag act gct gag ctc 3424
Phe Leu Val Met Pro Asn Ala Ile Asp Val His Gln Thr Ala Glu Leu
300 305 310 315
ttt gcc acc ttt gag gac aac cct gtg gtc gtg acc acc gtg ggc tgc 3472
Phe Ala Thr Phe Glu Asp Asn Pro Val Val Val Thr Thr Val Gly Cys
320 325 330
ctg tgt gtg gtc tac gtg ctg gtg gtg atc tgg gcg agg agg aag gac 3520
Leu Cys Val Val Tyr Val Leu Val Val Ile Trp Ala Arg Arg Lys Asp
335 340 345
gct cag gat cag gcc aag gtg aag gtc aca gtg ctg gaa gac aat gat 3568
Ala Gln Asp Gln Ala Lys Val Lys Val Thr Val Leu Glu Asp Asn Asp
350 355 360
ccc ttt gct cag tac cac tac ctg gtg aca gtc tac aca gga cac cga 3616
Pro Phe Ala Gln Tyr His Tyr Leu Val Thr Val Tyr Thr Gly His Arg
365 370 375
cga ggg gca gcc acg tcc tca aag gtg act gtc acc ctg tat ggc ctg 3664
Arg Gly Ala Ala Thr Ser Ser Lys Val Thr Val Thr Leu Tyr Gly Leu
380 385 390 395
gat gga gag aga gag ccc cac cac ctg gct gat cct gac act ccg gtt 3712
Asp Gly Glu Arg Glu Pro His His Leu Ala Asp Pro Asp Thr Pro Val
400 405 410
ttt gag cga gga gca gtg gat gcc ttc ctc ctc tcc acc ctg ttc ccc 3760
Phe Glu Arg Gly Ala Val Asp Ala Phe Leu Leu Ser Thr Leu Phe Pro
415 420 425
ctg gga gaa ctg cgg agc ctc cgg ctg tgg cat gac aac tca ggg gac 3808
Leu Gly Glu Leu Arg Ser Leu Arg Leu Trp His Asp Asn Ser Gly Asp
430 435 440
cgg cca tcg tgg tat gtg agc cgg gtg ctg gtc tat gac ctg gtg atg 3856
Arg Pro Ser Trp Tyr Val Ser Arg Val Leu Val Tyr Asp Leu Val Met
445 450 455
gac cgg aag tgg tat ttc ctg tgc aac tcc tgg cta tcc atc aat gtt 3904
Asp Arg Lys Trp Tyr Phe Leu Cys Asn Ser Trp Leu Ser Ile Asn Val
460 465 470 475
gga gat tgc gtc ctc gac aag gtg ttt cct gtg gcc acg gag cag gac 3952
Gly Asp Cys Val Leu Asp Lys Val Phe Pro Val Ala Thr Glu Gln Asp
480 485 490
aga aaa caa ttc agc cac ctg ttt ttc atg aag act tcc gcg ggc ttc 4000
Arg Lys Gln Phe Ser His Leu Phe Phe Met Lys Thr Ser Ala Gly Phe
495 500 505
cag gat gga cac atc tgg tat tcg atc ttc agc cgc tgc gct cgc agc 4048
Gln Asp Gly His Ile Trp Tyr Ser Ile Phe Ser Arg Cys Ala Arg Ser
510 515 520
agc ttc acc cgc gtc cag agg gtg tcc tgc tgc ttc tcc ctg ctg ctg 4096
Ser Phe Thr Arg Val Gln Arg Val Ser Cys Cys Phe Ser Leu Leu Leu
525 530 535
tgc acc atg ctg acc agc atc atg ttc tgg ggg gtc ccc aag gac cca 4144
Cys Thr Met Leu Thr Ser Ile Met Phe Trp Gly Val Pro Lys Asp Pro
540 545 550 555
gct gag caa aag atg gac ttg ggt aaa att gaa ttc acc tgg cag gag 4192
Ala Glu Gln Lys Met Asp Leu Gly Lys Ile Glu Phe Thr Trp Gln Glu
560 565 570
gtg atg att ggc ctg gag agc tcc atc ctc atg ttc ccc atc aac ctc 4240
Val Met Ile Gly Leu Glu Ser Ser Ile Leu Met Phe Pro Ile Asn Leu
575 580 585
ctg att gtt cag atc ttt cag aac acc cgt ccc cgg gtc gcg aag gag 4288
Leu Ile Val Gln Ile Phe Gln Asn Thr Arg Pro Arg Val Ala Lys Glu
590 595 600
cag aac act gga aaa tgg gac cgg ggg tcc ccc aac ctg act ccc tcc 4336
Gln Asn Thr Gly Lys Trp Asp Arg Gly Ser Pro Asn Leu Thr Pro Ser
605 610 615
cca cag ccc atg gag gac ggc ctt ctg aca cct gag gca gtg acc aag 4384
Pro Gln Pro Met Glu Asp Gly Leu Leu Thr Pro Glu Ala Val Thr Lys
620 625 630 635
gat gtg tca aga atc gtc agc tcc ctc ttc aaa gct ctc aag gtg cca 4432
Asp Val Ser Arg Ile Val Ser Ser Leu Phe Lys Ala Leu Lys Val Pro
640 645 650
tcc ccc gcc ttg ggc tgg gac tca gtg aac ttg atg gac atc aac agt 4480
Ser Pro Ala Leu Gly Trp Asp Ser Val Asn Leu Met Asp Ile Asn Ser
655 660 665
ctc ctc gcc ttg gtg gaa gat gtc att tat cca cag aac aca tca ggg 4528
Leu Leu Ala Leu Val Glu Asp Val Ile Tyr Pro Gln Asn Thr Ser Gly
670 675 680
cag gtg ttc tgg gag gaa gcc aaa aag aga gag gac cct gta aca ctc 4576
Gln Val Phe Trp Glu Glu Ala Lys Lys Arg Glu Asp Pro Val Thr Leu
685 690 695
act ttg ggg tca tca gaa atg aaa gag aaa tca cag tgt ccc aag ccc 4624
Thr Leu Gly Ser Ser Glu Met Lys Glu Lys Ser Gln Cys Pro Lys Pro
700 705 710 715
aag gcg gca cgg agt ggc ccc tgg aag gac agc gcc tac agg cag tgt 4672
Lys Ala Ala Arg Ser Gly Pro Trp Lys Asp Ser Ala Tyr Arg Gln Cys
720 725 730
ctg tac ctt cag ctg gaa cac gtg gag caa gag ctg cgg ctg gtg ggg 4720
Leu Tyr Leu Gln Leu Glu His Val Glu Gln Glu Leu Arg Leu Val Gly
735 740 745
ccc cga ggc ttc ccc cag cac cac agc cat gcc cag gcc ctc agg cag 4768
Pro Arg Gly Phe Pro Gln His His Ser His Ala Gln Ala Leu Arg Gln
750 755 760
ctg cag acc ctg aag ggc ggc ctg ggg gta cag ccg ggc acc tgg gcc 4816
Leu Gln Thr Leu Lys Gly Gly Leu Gly Val Gln Pro Gly Thr Trp Ala
765 770 775
cct gca cat gcc agc gct ctt cag gtg agc aaa ccc cct caa ggc ctg 4864
Pro Ala His Ala Ser Ala Leu Gln Val Ser Lys Pro Pro Gln Gly Leu
780 785 790 795
ccc tgg tgg tgc atc ctg gtg ggc tgg ctc ctg gta gcg gcc acc agt 4912
Pro Trp Trp Cys Ile Leu Val Gly Trp Leu Leu Val Ala Ala Thr Ser
800 805 810
ggg gtg gcg gcc ttc ttc acc atg ctc tac ggc ctg cac tac ggg agg 4960
Gly Val Ala Ala Phe Phe Thr Met Leu Tyr Gly Leu His Tyr Gly Arg
815 820 825
gcc agc tcc ctc agg tgg ctc atc tcc atg gct gtc tcc ttc gtg gag 5008
Ala Ser Ser Leu Arg Trp Leu Ile Ser Met Ala Val Ser Phe Val Glu
830 835 840
agc gtg ttc gtc acc cag ccc ctg aag gtg ctg gga ttc gct gct ttc 5056
Ser Val Phe Val Thr Gln Pro Leu Lys Val Leu Gly Phe Ala Ala Phe
845 850 855
ttt gca ctg gtc ttg aag aga gtg gac gat gag gag gat act gtg gcc 5104
Phe Ala Leu Val Leu Lys Arg Val Asp Asp Glu Glu Asp Thr Val Ala
860 865 870 875
ccg ctg cca gga cat ctg ttg ggc cca gac ccc tat gcc ttg ttc cga 5152
Pro Leu Pro Gly His Leu Leu Gly Pro Asp Pro Tyr Ala Leu Phe Arg
880 885 890
gca cga aga aac agc agc agg gat gtc tac cag cca cct ctc acc gct 5200
Ala Arg Arg Asn Ser Ser Arg Asp Val Tyr Gln Pro Pro Leu Thr Ala
895 900 905
gcc att gag aag atg aaa acc acc cac ctc aag gaa cag aaa gca ttt 5248
Ala Ile Glu Lys Met Lys Thr Thr His Leu Lys Glu Gln Lys Ala Phe
910 915 920
gcc ctc atc aga gaa atc ctg gca tac ttg ggc ttc ctg tgg atg cta 5296
Ala Leu Ile Arg Glu Ile Leu Ala Tyr Leu Gly Phe Leu Trp Met Leu
925 930 935
ctg ctc gtg gcc tac ggg cag agg gac ccc agc gcc tac cac ctc aac 5344
Leu Leu Val Ala Tyr Gly Gln Arg Asp Pro Ser Ala Tyr His Leu Asn
940 945 950 955
aga cac ctc cag cac agc ttc acc agg ggc ttt tca ggt gtg ctc ggc 5392
Arg His Leu Gln His Ser Phe Thr Arg Gly Phe Ser Gly Val Leu Gly
960 965 970
ttc cga gag ttc ttc aag tgg gcc aac acc acc ctc gtg agt aac ctg 5440
Phe Arg Glu Phe Phe Lys Trp Ala Asn Thr Thr Leu Val Ser Asn Leu
975 980 985
tat ggt cac ccc cca ggt aag tcc tgaagccctg gggctgtgtc agcctccttg 5494
Tyr Gly His Pro Pro Gly Lys Ser
990 995
tgccttctgg ctaggaacag acgtggggca ctgtcagcat cagcctcata agaaaaccag 5554
gccaagtgtg gtggcttaca cctgtaatcc tagtactttg ggaggctgag gcgggaggat 5614
tgcttgaggc caggaattcg agaccagcct ggcaacatgg caagaccctg cctctttgaa 5674
aaatacaaaa attagctgga tgtgatgcct catgcctata gtccctgcaa ctctggaggc 5734
tgaggtggga ggattgctta agcccgggag gcagaacctg cagtgagcca aaattgcacc 5794
actgcactcc agcctgggag acagagcaag actctgtctt 5834
<210> 5
<211> 4247
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(3528)
<400> 5
ggc cgc aga ggc tgc tcg gcc ccg ctg ccc gcc agg tcg tgg gct ccc 48
Gly Arg Arg Gly Cys Ser Ala Pro Leu Pro Ala Arg Ser Trp Ala Pro
1 5 10 15
gct ccg ggc ccg ggg cca ccg gct gtg gac ttc aaa atg agc gtc cct 96
Ala Pro Gly Pro Gly Pro Pro Ala Val Asp Phe Lys Met Ser Val Pro
20 25 30
gac tac atg cag tgt gct gag gac cac cag acg ctg ctc gtg gtg gtc 144
Asp Tyr Met Gln Cys Ala Glu Asp His Gln Thr Leu Leu Val Val Val
35 40 45
cag cct gtg ggc atc gtc tcc gag gag aac ttc ttc agg atc tat aag 192
Gln Pro Val Gly Ile Val Ser Glu Glu Asn Phe Phe Arg Ile Tyr Lys
50 55 60
agg att tgc tct gtg agt cag atc agc gtg cgg gac tcc cag cga gtc 240
Arg Ile Cys Ser Val Ser Gln Ile Ser Val Arg Asp Ser Gln Arg Val
65 70 75 80
ctc tac atc cgc tac agg cac cac tac cca ccc gag aac aac gag tgg 288
Leu Tyr Ile Arg Tyr Arg His His Tyr Pro Pro Glu Asn Asn Glu Trp
85 90 95
ggc gac ttc cag acc cac cgc aaa gtc gtg ggc ctc atc acc atc aca 336
Gly Asp Phe Gln Thr His Arg Lys Val Val Gly Leu Ile Thr Ile Thr
100 105 110
gac tgc ttc tcg gcc aag gac tgg cca cag acc ttc gag aag ttc cac 384
Asp Cys Phe Ser Ala Lys Asp Trp Pro Gln Thr Phe Glu Lys Phe His
115 120 125
gtg cag aag gag atc tac ggc tcc aca ctg tat gac tcc cgg ctc ttt 432
Val Gln Lys Glu Ile Tyr Gly Ser Thr Leu Tyr Asp Ser Arg Leu Phe
130 135 140
gtc ttc ggg ctg cag ggg gag atc gtg gag cag ccg cgc acc gac gtg 480
Val Phe Gly Leu Gln Gly Glu Ile Val Glu Gln Pro Arg Thr Asp Val
145 150 155 160
gct ttc tac ccc aat tac gag gac tgc cag acg gtg gag aag aga atc 528
Ala Phe Tyr Pro Asn Tyr Glu Asp Cys Gln Thr Val Glu Lys Arg Ile
165 170 175
gag gac ttc atc gag tca ctg ttc atc gtg ctg gag tcc aag cgt ctg 576
Glu Asp Phe Ile Glu Ser Leu Phe Ile Val Leu Glu Ser Lys Arg Leu
180 185 190
gac aga gcc aca gac aag tct ggg gat aag atc ccc ctt ctc tgt gtc 624
Asp Arg Ala Thr Asp Lys Ser Gly Asp Lys Ile Pro Leu Leu Cys Val
195 200 205
ccg ttt gag aaa aag gac ttt gta gga ctg gac aca gac agc aga cat 672
Pro Phe Glu Lys Lys Asp Phe Val Gly Leu Asp Thr Asp Ser Arg His
210 215 220
tac aag aag cgg tgc caa ggc cgc atg cgg aag cac gtg ggg gac ctg 720
Tyr Lys Lys Arg Cys Gln Gly Arg Met Arg Lys His Val Gly Asp Leu
225 230 235 240
tgc ctg cag gca ggg atg ctg cag gac tcc ctg gtg cat tac cac atg 768
Cys Leu Gln Ala Gly Met Leu Gln Asp Ser Leu Val His Tyr His Met
245 250 255
tcg gtg gag ctg ctg cgt tct gtg aat gac ttt ctg tgg ctt gga gct 816
Ser Val Glu Leu Leu Arg Ser Val Asn Asp Phe Leu Trp Leu Gly Ala
260 265 270
gcc ctg gaa gga ttg tgt tca gct tct gtc atc tat cac tat cct ggt 864
Ala Leu Glu Gly Leu Cys Ser Ala Ser Val Ile Tyr His Tyr Pro Gly
275 280 285
gga act ggt ggg aag agt gga gct cgg agg ttc cag ggc agc acc ctt 912
Gly Thr Gly Gly Lys Ser Gly Ala Arg Arg Phe Gln Gly Ser Thr Leu
290 295 300
cct gct gaa gca gcc aat aga cac cgg cca ggg gca cag gaa gtt ctc 960
Pro Ala Glu Ala Ala Asn Arg His Arg Pro Gly Ala Gln Glu Val Leu
305 310 315 320
att gat cca ggt gcc ctc acc acc aat ggc atc aac cct gac acc agt 1008
Ile Asp Pro Gly Ala Leu Thr Thr Asn Gly Ile Asn Pro Asp Thr Ser
325 330 335
act gag atc gga cgt gct aag aac tgc ctt agc cct gaa gac ata att 1056
Thr Glu Ile Gly Arg Ala Lys Asn Cys Leu Ser Pro Glu Asp Ile Ile
340 345 350
gac aag tat aaa gag gcg att tcc tat tac agc aag tat aag aat gcg 1104
Asp Lys Tyr Lys Glu Ala Ile Ser Tyr Tyr Ser Lys Tyr Lys Asn Ala
355 360 365
gga gtg att gag tta gaa gcg tgc atc aag gct gta cgt gtc ctt gca 1152
Gly Val Ile Glu Leu Glu Ala Cys Ile Lys Ala Val Arg Val Leu Ala
370 375 380
att cag aaa cgg agc atg gaa gca tca gaa ttt ctt cag aat gca gtt 1200
Ile Gln Lys Arg Ser Met Glu Ala Ser Glu Phe Leu Gln Asn Ala Val
385 390 395 400
tac att aac ctt cga cag ctt tct gag gaa gag aaa att cag cgc tac 1248
Tyr Ile Asn Leu Arg Gln Leu Ser Glu Glu Glu Lys Ile Gln Arg Tyr
405 410 415
agc atc ctc tcc gag ctc tat gag ctg atc ggc ttc cat cgc aag tct 1296
Ser Ile Leu Ser Glu Leu Tyr Glu Leu Ile Gly Phe His Arg Lys Ser
420 425 430
gcg ttc ttc aag cgc gtg gcc gcc atg cag tgc gtg gcc cca agc atc 1344
Ala Phe Phe Lys Arg Val Ala Ala Met Gln Cys Val Ala Pro Ser Ile
435 440 445
gcg gag cct ggg tgg agg gcc tgc tac aaa ctc ctc ctg gaa acg ctg 1392
Ala Glu Pro Gly Trp Arg Ala Cys Tyr Lys Leu Leu Leu Glu Thr Leu
450 455 460
ccc ggc tac agt ctg tcg ctg gat ccc aaa gat ttc agc aga ggc acg 1440
Pro Gly Tyr Ser Leu Ser Leu Asp Pro Lys Asp Phe Ser Arg Gly Thr
465 470 475 480
cac aga ggc tgg gct gcg gtc cag atg cgt ttg ctc cat gaa ttg gtc 1488
His Arg Gly Trp Ala Ala Val Gln Met Arg Leu Leu His Glu Leu Val
485 490 495
tac gcc tcc cga agg atg ggg aac cct gcc ctc tct gtc aga cac ctg 1536
Tyr Ala Ser Arg Arg Met Gly Asn Pro Ala Leu Ser Val Arg His Leu
500 505 510
tcc ttc ctt cta cag acc atg ctg gac ttc ttg tcg gat cag gaa aag 1584
Ser Phe Leu Leu Gln Thr Met Leu Asp Phe Leu Ser Asp Gln Glu Lys
515 520 525
aaa gat gtg gcc caa agc cta gag aac tat acg tcc aag tgt cct ggg 1632
Lys Asp Val Ala Gln Ser Leu Glu Asn Tyr Thr Ser Lys Cys Pro Gly
530 535 540
acc atg gag ccc atc gcc ctc cct ggc ggc ctc acc ctg cca ccg gtg 1680
Thr Met Glu Pro Ile Ala Leu Pro Gly Gly Leu Thr Leu Pro Pro Val
545 550 555 560
ccc ttc acc aag ctt ccc atc gtc agg cat gtg aaa cta ttg aac ctt 1728
Pro Phe Thr Lys Leu Pro Ile Val Arg His Val Lys Leu Leu Asn Leu
565 570 575
cct gct agc ctc cgg cca cac aaa atg aaa agc ttg ctg ggt cag aac 1776
Pro Ala Ser Leu Arg Pro His Lys Met Lys Ser Leu Leu Gly Gln Asn
580 585 590
gtg tca acc aaa agt cct ttc atc tat tca cca att atc gca cac aac 1824
Val Ser Thr Lys Ser Pro Phe Ile Tyr Ser Pro Ile Ile Ala His Asn
595 600 605
cgt gga gaa gag cgg aac aag aaa ata gat ttc cag tgg gtt caa gga 1872
Arg Gly Glu Glu Arg Asn Lys Lys Ile Asp Phe Gln Trp Val Gln Gly
610 615 620
gat gtg tgt gaa gtt cag ctg atg gta tat aac cca atg ccg ttt gaa 1920
Asp Val Cys Glu Val Gln Leu Met Val Tyr Asn Pro Met Pro Phe Glu
625 630 635 640
ctt cga gtt gaa aac atg ggg ctg ctc acc agc gga gtg gag ttc gag 1968
Leu Arg Val Glu Asn Met Gly Leu Leu Thr Ser Gly Val Glu Phe Glu
645 650 655
tct ctc cct gcg gcg ctt tct ctt ccg gct gaa tct ggt ctg tac cca 2016
Ser Leu Pro Ala Ala Leu Ser Leu Pro Ala Glu Ser Gly Leu Tyr Pro
660 665 670
gtg acg ctc gtc ggg gtc ccg cag acg act gga acg att act gtg aac 2064
Val Thr Leu Val Gly Val Pro Gln Thr Thr Gly Thr Ile Thr Val Asn
675 680 685
ggt tac cat acc acg gtc ttc ggt gtg ttc agt gac tgt ttg ctg gat 2112
Gly Tyr His Thr Thr Val Phe Gly Val Phe Ser Asp Cys Leu Leu Asp
690 695 700
aac ctg ccg gga ata aaa acc agt ggc tcc aca gtg gaa gtc att ccc 2160
Asn Leu Pro Gly Ile Lys Thr Ser Gly Ser Thr Val Glu Val Ile Pro
705 710 715 720
gcg ttg cca aga ctg cag atc agc acc tct ctg ccc aga tct gca cat 2208
Ala Leu Pro Arg Leu Gln Ile Ser Thr Ser Leu Pro Arg Ser Ala His
725 730 735
tca ttg caa cct tct tct ggt gat gaa ata tct act aat gta tct gtc 2256
Ser Leu Gln Pro Ser Ser Gly Asp Glu Ile Ser Thr Asn Val Ser Val
740 745 750
cag ctt tac aat gga gaa agt cag caa cta atc att aaa ttg gaa aat 2304
Gln Leu Tyr Asn Gly Glu Ser Gln Gln Leu Ile Ile Lys Leu Glu Asn
755 760 765
att gga atg gaa cca ttg gag aaa ctg gag gtc acc tcg aaa gtt ctc 2352
Ile Gly Met Glu Pro Leu Glu Lys Leu Glu Val Thr Ser Lys Val Leu
770 775 780
acc act aaa gaa aaa ttg tat ggc gac ttc ttg agc tgg aag cta gag 2400
Thr Thr Lys Glu Lys Leu Tyr Gly Asp Phe Leu Ser Trp Lys Leu Glu
785 790 795 800
gaa acc ctt gcc cag ttc cct ttg cag cct ggg aag gtg gcc acg ttc 2448
Glu Thr Leu Ala Gln Phe Pro Leu Gln Pro Gly Lys Val Ala Thr Phe
805 810 815
aca atc aac atc aaa gtg aag ctg gat ttc tcc tgc cag gag aat ctc 2496
Thr Ile Asn Ile Lys Val Lys Leu Asp Phe Ser Cys Gln Glu Asn Leu
820 825 830
ctg cag gat ctc agt gat gat gga atc agt gtg agt ggc ttt ccc ctg 2544
Leu Gln Asp Leu Ser Asp Asp Gly Ile Ser Val Ser Gly Phe Pro Leu
835 840 845
tcc agt cct ttt cgg cag gtc gtt cgg ccc cga gtg gag ggc aaa cct 2592
Ser Ser Pro Phe Arg Gln Val Val Arg Pro Arg Val Glu Gly Lys Pro
850 855 860
gtg aac cca ccc gag agc aac aaa gca ggc gac tac agc cac gtg aag 2640
Val Asn Pro Pro Glu Ser Asn Lys Ala Gly Asp Tyr Ser His Val Lys
865 870 875 880
acc ctg gaa gct gtc ctg aat ttc aaa tac tct gga ggc ccg ggc cac 2688
Thr Leu Glu Ala Val Leu Asn Phe Lys Tyr Ser Gly Gly Pro Gly His
885 890 895
act gaa gga tat tac agg aat ctc tcc ctg ggg ctg cat gta gaa gtc 2736
Thr Glu Gly Tyr Tyr Arg Asn Leu Ser Leu Gly Leu His Val Glu Val
900 905 910
gag ccg tct gta ttt ttc acc cga gtc agc acc ctc cca gca acc agt 2784
Glu Pro Ser Val Phe Phe Thr Arg Val Ser Thr Leu Pro Ala Thr Ser
915 920 925
acc cgg cag tgt cac ctg ctc ctg gat gtc ttc aac tcc acc gag cat 2832
Thr Arg Gln Cys His Leu Leu Leu Asp Val Phe Asn Ser Thr Glu His
930 935 940
gag ctg acc gtc agc acc agg agc agc gag gca ctc atc ctg cac gcc 2880
Glu Leu Thr Val Ser Thr Arg Ser Ser Glu Ala Leu Ile Leu His Ala
945 950 955 960
ggc gag tgc cag cga atg gct att caa gtg gac aag ttc aac ttt gag 2928
Gly Glu Cys Gln Arg Met Ala Ile Gln Val Asp Lys Phe Asn Phe Glu
965 970 975
agt ttc ccg gag tcc cct ggg gag aag ggg caa ttt gca aac ccc aag 2976
Ser Phe Pro Glu Ser Pro Gly Glu Lys Gly Gln Phe Ala Asn Pro Lys
980 985 990
cag ctg gag gaa gag cgg cgg gaa gcc cga ggc ctg gag atc cac agc 3024
Gln Leu Glu Glu Glu Arg Arg Glu Ala Arg Gly Leu Glu Ile His Ser
995 1000 1005
aag ctg ggc atc tgc tgg aga atc ccc tcc ctg aag cgc agt ggc gag 3072
Lys Leu Gly Ile Cys Trp Arg Ile Pro Ser Leu Lys Arg Ser Gly Glu
1010 1015 1020
gcg agt gtg gaa gga ctc ctg aac cag ctc gtc ctg gag cac ctg cag 3120
Ala Ser Val Glu Gly Leu Leu Asn Gln Leu Val Leu Glu His Leu Gln
1025 1030 1035 1040
ctg gcg cct ctg cag tgg gat gtg ctg gtg gac gga cag cca tgt gac 3168
Leu Ala Pro Leu Gln Trp Asp Val Leu Val Asp Gly Gln Pro Cys Asp
1045 1050 1055
cgc gag gct gtg gcg gcc tgc cag gtg ggc gac ccc gtg cgc ctg gag 3216
Arg Glu Ala Val Ala Ala Cys Gln Val Gly Asp Pro Val Arg Leu Glu
1060 1065 1070
gtg cgg ctg acc aac cgg agc ccg cgc agc gta ggg ccc ttc gcc ctc 3264
Val Arg Leu Thr Asn Arg Ser Pro Arg Ser Val Gly Pro Phe Ala Leu
1075 1080 1085
act gtg gtc ccc ttc cag gac cac cag aac ggc gtg cac aac tac gac 3312
Thr Val Val Pro Phe Gln Asp His Gln Asn Gly Val His Asn Tyr Asp
1090 1095 1100
ctg cac gac acc gtc tcc ttc gtg ggc tcc agc acc ttc tac ctc gac 3360
Leu His Asp Thr Val Ser Phe Val Gly Ser Ser Thr Phe Tyr Leu Asp
1105 1110 1115 1120
gcg gtg cag ccg tcc ggc cag tcg gcc tgc ctc ggg gcc ctc ctc ttc 3408
Ala Val Gln Pro Ser Gly Gln Ser Ala Cys Leu Gly Ala Leu Leu Phe
1125 1130 1135
ctc tac acg gga gac ttc ttc ctc cac atc cgg ttc cac gag gac agc 3456
Leu Tyr Thr Gly Asp Phe Phe Leu His Ile Arg Phe His Glu Asp Ser
1140 1145 1150
acc agc aag gag ctg cca ccc tct tgg ttc tgc ctg ccc agt gtg cac 3504
Thr Ser Lys Glu Leu Pro Pro Ser Trp Phe Cys Leu Pro Ser Val His
1155 1160 1165
gtg tgt gcc ctg gag gcg cag gcc tgagcccgcc tacttccgtc cctctttctg 3558
Val Cys Ala Leu Glu Ala Gln Ala
1170 1175
cagggccaga ggtgaccctg cctggcctcc cacaccccct gcaatgagca aggccttcac 3618
tgcagcccca tctcctcctc ctcccccaga cccctcccag ccctctcctc ctgttcctcc 3678
tgtagcatct ttgctgggct acgcagaagc cccggacatg gcagccccac cccatgccac 3738
gccccttcct acactgttcc ctggaccata cacaggctga agcagaggaa atcccaaagc 3798
gggtgcccat ccagcccagg tcccaggatc cctgcaccca tttctgtgac ctggggcccc 3858
agccgtgctg tgctgctcat cccagcagag ggacctccct cgtccagcga cttccctttg 3918
gccatagaaa gaaatggtga gcatgagact gggcacagcc tgagggcgtg ggcagcttcc 3978
caccctccct gggccttgga atcccccaag gctggttttc ttcctggaga cccccatggg 4038
caacttggca ggagagatgg tgccgtagga ggtcgtggat ggttgatgcc aagagaggcc 4098
ctccacccgt ggtgggcaaa tgtccaggcc tgggctggca gcccagggct gtttctgggt 4158
gctccctggc cccagggtgg cgtctggtta ccatggctgt gtgtgtccat gtctgcaagc 4218
agttcttcaa taaatggcct gcctccccc 4247
<210> 6
<211> 6640
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (361)..(987)
<400> 6
cagctgaagc tttcacaagt ttgtacaaaa aagcaggctc ttcctccttg ctgctgctgc 60
cgccgccaat cctggtccgg ttgcccgagt tcccggaggt ctctcgcggg acctctctca 120
ccgccaccgc tcctactctc gggcttccaa atctggggcg atgtctcccc aggttaaatt 180
accctagctc ctgctccaga tcgcttcccc gtgccccgcc agagcccagt agttcaaaaa 240
ttaaatttgg ggcaaggggt gcgcgccaga gcgcagctgt ttctggagcc tgcggcagcg 300
gtggcgagcc acagggcggc gaccgtgagc tccgggagct gcgcaaacca cctggagacc 360
atg tct ggg gat gcg acc agg acc ctg ggg aaa gga agc cag ccc cca 408
Met Ser Gly Asp Ala Thr Arg Thr Leu Gly Lys Gly Ser Gln Pro Pro
1 5 10 15
ggg cca gtc ccg gag ggg ctg atc cgc atc tac agc atg agg ttc tgc 456
Gly Pro Val Pro Glu Gly Leu Ile Arg Ile Tyr Ser Met Arg Phe Cys
20 25 30
ccc tat tct cac agg acc cgc ctc gtc ctc aag gcc aaa gac atc aga 504
Pro Tyr Ser His Arg Thr Arg Leu Val Leu Lys Ala Lys Asp Ile Arg
35 40 45
cat gaa gtg gtc aac att aac ctg aga aac aag cct gaa tgg tac tat 552
His Glu Val Val Asn Ile Asn Leu Arg Asn Lys Pro Glu Trp Tyr Tyr
50 55 60
aca aag cac cct ttt ggc cac att cct gtc ctg gag acc agc caa tgt 600
Thr Lys His Pro Phe Gly His Ile Pro Val Leu Glu Thr Ser Gln Cys
65 70 75 80
caa ctg atc tat gaa tct gtt att gct tgt gag tac ctg gat gat gct 648
Gln Leu Ile Tyr Glu Ser Val Ile Ala Cys Glu Tyr Leu Asp Asp Ala
85 90 95
tat cca gga agg aag ctg ttt cca tat gac cct tat gaa cga gct cgc 696
Tyr Pro Gly Arg Lys Leu Phe Pro Tyr Asp Pro Tyr Glu Arg Ala Arg
100 105 110
caa aag atg tta ttg gag cta ttt tgt aag att ctt gag tat cag aac 744
Gln Lys Met Leu Leu Glu Leu Phe Cys Lys Ile Leu Glu Tyr Gln Asn
115 120 125
acc acc ttc ttt ggt gga acc tgt ata tcc atg att gat tac ctc ctc 792
Thr Thr Phe Phe Gly Gly Thr Cys Ile Ser Met Ile Asp Tyr Leu Leu
130 135 140
tgg ccc tgg ttt gag cgg ctg gat gtg tat ggg ata ctg gac tgt gtg 840
Trp Pro Trp Phe Glu Arg Leu Asp Val Tyr Gly Ile Leu Asp Cys Val
145 150 155 160
agc cac acg cca gcc ctg cgg ctc tgg ata tca gcc atg aag tgg gac 888
Ser His Thr Pro Ala Leu Arg Leu Trp Ile Ser Ala Met Lys Trp Asp
165 170 175
ccc aca gtc tgt gct ctt ctc atg gat aag agc att ttc cag ggc ttc 936
Pro Thr Val Cys Ala Leu Leu Met Asp Lys Ser Ile Phe Gln Gly Phe
180 185 190
ttg aat ctc tat ttt cag aac aac cct aat gcc ttt gac ttt ggg ctg 984
Leu Asn Leu Tyr Phe Gln Asn Asn Pro Asn Ala Phe Asp Phe Gly Leu
195 200 205
tgc tgagtctcac tgtccacccc ttcgctgtcc agaattcccc agcttgttgg 1037
Cys
gagtctacgt cacggcttgt cttgggaacc aatccgtctc tctttctttt ctttgaagtt 1097
cccaataaaa tgaaaacagg aaatgtattc ttctgataat catttgtctg actcctctag 1157
cctgtagctg ctgctactgc tgcttttttt tccttttttt ttttgaggca agatcttgct 1217
tcgttactca ggctggagtg cagtgggaca gtcggctcac tgcagccttg aactcctggg 1277
ctcagttgat tctcccgcct cagcctcctg agaagctagg actacaggta tgtgtcacca 1337
cgcccagcta atttttaaaa aaatgttgtt gagacagggt ctcactatgt tgctcaggct 1397
ggtctccatc tcctggccgc aagccatcca cccaacttgg tctccaaagt gttgagatta 1457
caggcatgag ccacctggcc tgcctagcct gtagcttcta actattttct agaaagtgcc 1517
tggtgctaat gtcagagcac attttgggag cctgtgtgcc tcctaactcc ttggccttca 1577
gcctagtttg atctccagat tattggtgca gatgctgatg ctgaggttca gggtcaacct 1637
atgactgatg cttgtcacta cagaatggca ccctccaaag acctccctgg tgaaagctta 1697
agaagaggga gaaggaagaa gaaaacactc caaacctaaa taccttccat ttggcaaatg 1757
aatgtcgctg tccacgtggg cgagtatgag tgtaattctc caaactttgg ccatgggcct 1817
ggagacaccg agggtcttgt gactggaaag aattccaaag caggaaatga aacaccggtt 1877
tatcacccag gactaactac gtcagaattt cactggtgct gtcagggctc aggatgtctc 1937
acaaaatgtt ctatagccgg catcctgcct taaaaaaata cacacgcccc aaacccagcc 1997
acaagcaaag gtcattccgt tggatttgcg tcacactctc ttccaattca tctcaacaaa 2057
tgcctctggt tgcctgttgt gtaccaagcc caggggcaca ctggggataa agttgaaaaa 2117
gacattggct ctgtccctgg gagcccccat ctctccactt atcttggttg aggttttaaa 2177
cctagcgctt tggagctgct tgtctgttgt agggatgctt gtctctgatg tggcccctct 2237
gccagcttcg ccaggcagta ccaccccact gaagacagca aagagctgag gatggctggt 2297
gccaacaggt cccttgggct gggagcatct actgcctgca ctatgggagg gggaccaagt 2357
ttgtccaatg ctgtttcatc tcagtgtctg aggatcagcc tgtgatcatt tcagctttgc 2417
tcatgctgga tcctttccca tcctcctagc tgtgctgaga caccaggcag caacaggagc 2477
gatgttttgc cttgtctgtg ctccctaagg agctgtctgc cacctctgcc tctgagacag 2537
cccaaatcag ccctcctctc tccccacgtc tcagtctggc cctgccatct gctggctctg 2597
ggtatatttt gacccatgta aatagttacc aggtgttact ctgtgccagc tggacagaaa 2657
caagcctgct tggtgaggtg gccagaactt ccagaagagg cagcgcatga caggggcagt 2717
ctgccgcacc atctgctccg gatgtggagc cctgttgcca tggcacccta gttaatgctg 2777
agtgaggtct gtgaatcacc gttttcctgg gatctttgct ccaagaagag gggaatcttc 2837
tcctgtggcc tctcatagga cttaggagat tgatcagcct tttgccggta caaatcctgc 2897
atgctagtga gtgcttgtca taagaccaaa gttaccactg atgcaacagg aaatggcatg 2957
cctctggctc accaaaatga aggtgttctc ttttggggct ccgcacacta atttagaaag 3017
cttgaagaac caggaattat agaatggaga aaagcactca tgggggaggt ttgcagcctg 3077
gtggcctata gcccagtttg ggatttgagt gcctcttggc caggcatgct ccctgctgtt 3137
taccagggtc ctggtcacct gcacatcctt acacttcctc catctcgctt ctttgtcact 3197
tgtctgaccc acagagaaaa ctgagttatc aatctttgaa taaaactaga aggatgtggc 3257
ctggagaaaa gagggaagga ggtaattaaa taatgctcca tgaagaactt taaaaatatg 3317
cttcttaatt aatacatgtg gaaagaatac atatgggtta aaaatacaat aatgtaaaag 3377
aatagaaaat cttccattat cttgtgtcct agagataacc acttttagta attggtgtct 3437
accttttttt ccccttttct tttaaataat agagatgggg tttcgctatg ttgaccaggc 3497
tggtcttgaa ctcctggctt caagtgatcc tcccatctcg gcatcccaaa gtgctgggat 3557
tataggtctg agtcatcatg cctggtgggt gtctatctta taaaaagcac taaacttaaa 3617
gatttaaagg tgagagcctg tattaggcca ttctcatggt gctataaaga actgcccaag 3677
actgagtaat ttataaagga aagaggttta attgactcac aattccacat ggctggggaa 3737
gcctcaggaa actttcaatc atggtggaag gggaagcaaa catgtccttc tgcacaaggt 3797
ggcagaagag agaagtacag agccaaagag ggaaaaactc cttataaaac cattagatct 3857
catgagagct cgctcactgt catgagaaaa gcatagggaa aacactccca caatcgaatc 3917
acctcccttg aggtccctcc cccaacatgt ggggattata attcagatta aaattgaaga 3977
tgagatttta ggtggagcac agccaaacca tatcagagcc attctctatc ttttgcagat 4037
tggaaaaata aaaaaattag tttaaaccac agcatttaaa agaaatggtc atagatgatg 4097
ttaacatcag aatggaggtt gtgtggagtt cccctccttg gaggttctga gctacagcct 4157
agcctggact cagtccagtt acaatctaac atggctaggg ggaggaggga gcatgatggg 4217
gcagggcttg tgtggctggg acagctggct atggtcgtgg tgctggtgac taaatggtac 4277
ttgaaatttc ttttagatct aaaattctgt gatatggttt ggatctgaga ccccaccaaa 4337
tttcgtgttg aaatgtagtc cccaatgctg gtgggaggtg attggatcat gggggtagat 4397
ttctcatgaa tggtttagca ccattctctt ggtgctgttc tcatgatagt gagtgagttc 4457
tcatgagttc tggttgttta aaagtgtgca gcatctccct gacccctctt gcgccagctc 4517
ccaccatggg agatgcctcg ctcccccttt gctttttgcc atgagaggaa gcttcctgag 4577
ggctctccag aagcagaatc tactacactt ctcatacagc ctgcagaacc atgagccaat 4637
taaacctctt tttaaaaaca cttttatttt aagttcaggg gtacatgtgc aggatgtgca 4697
ggttcgttcc ataggtaaac atgcgtcatg gggtttgtta tgcagattat ttcatcacct 4757
agatattaag tctagtatcc attagttatt tttcctgatc ctctccctcc tcccaccctg 4817
caccctctga taggcctcag tgtgtattgt taccctccac atgtccatgt gttctcatga 4877
tttagctccc acttataagt gagaacatgc agtttttgta aacctgtttt ctttagggaa 4937
agtttagaat tttctagaga ctggttgtat ggttgtgacc aaaatgccag tagtgatatg 4997
gacagtgaag gccaggctga ggaggtgatg gaaatgagga acttactggg aactggagca 5057
aaggtcactt ttgttatgct ttagcaaaga acttggctgc attgtgctcc tggcctaggg 5117
acctgtggaa gtttgaactt gagagtgatg attttagggt atctggcaga aaaaatttct 5177
aagcagcaaa gcattcaaga tgcgccctgg ctgctttcaa caacctatgc tcatatttgt 5237
gagcaaagaa attatgttaa gttggaactt atatttacag gggaagcaga gcgtaaaagt 5297
ttggagaatt tgcagccagc tatgtggtag aaaagaagag cccattttca gggaaggaat 5357
tcaagcaggc tgcagaaatt tgcataagta aaaaggagcc aagtgctaat ggccaagaaa 5417
atggggaaaa ggccttgaag gcatttcaga gacctttgtg gcagcccctc ccatcacaga 5477
ctcagaggcc taggaggaca gaatgactct gagtattaaa gtagatttaa tctaaatcat 5537
aaagtacaag ctagagcaat ccctccagtg ctgcagaaac tgcctgcatg tggctcccac 5597
attgtttctt ggagctttgg tggtggtagg ggtggcaggg gcaggtgtta tgtttcttgg 5657
tttctcacct tcaggtctcc ttacccatgt ttcctttgtc agagtcattg gccttgatgg 5717
aggggagact gggttactct gagttgagaa gcaagcttga ctgccctgtg gtttatatca 5777
gcctagctta gtttgttttc tgaagtcttc acaaataacc aaatatccag cccagtgcag 5837
ttctcaatga gtagtctcca gttcccatcc cccaccacac acacacacca aacactgtgc 5897
ccacctggga gacccatttt catttatctt cttctttgag tgcctggaag gctgaacatt 5957
tgatcttttc aaacattttt ctctcatctg gcctgttagc gctcttgagc ctttgaaaat 6017
gccatttcct ctacatacta ttcttcctca gagcagccct ctgttttttt ttccaaccag 6077
tcttttctct tgcaaagttc agatcaaaat tcttctttat gtaaagagaa gtggcgtgcc 6137
ataggggtta caaacagggc ttcaaggtta gcactgggat caaattccga ctttgcaact 6197
taactttctt gagtaagttg ctacacctct attagcctca gtttcttatc tgtaaaatgg 6257
gacggctaag tacctatttc attgggctgt tgaataatga tggtctcatc tatgcaaggg 6317
cttagtacca gctgggtgaa actcagtgaa tggcaggcaa ccctattatg actaggcttc 6377
tgccaatgat cagacttgaa ccctgtgctc atccccccat cactttgtgg aactagatgt 6437
cctacgttgc acatggcttg attcatggtt tggagttgtt ttctggttgt ttcctgtgtg 6497
caggtttaat ctcaacctgt gatcccttga aagctgcaat tttatccctc cctctcaccc 6557
caaggccaag tttaatactg tgcattctga aatcttgcaa tgttgatgat gtaaaataaa 6617
tcttctctcc tgtgtttctg aag 6640
<210> 7
<211> 2145
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(1714)
<400> 7
g gca tcc gcc tcc tcg ccc gac gac ggc ggc cgc acc gac cgc ttt gcc 49
Ala Ser Ala Ser Ser Pro Asp Asp Gly Gly Arg Thr Asp Arg Phe Ala
1 5 10 15
ttc cag ctg ccc ttt gct gag ggc gcg ggc gat ggg gcg cgc ctc gac 97
Phe Gln Leu Pro Phe Ala Glu Gly Ala Gly Asp Gly Ala Arg Leu Asp
20 25 30
ttc gtg gtg cgc tat gag acc cct gag ggc act ttc tgg gcc aac aac 145
Phe Val Val Arg Tyr Glu Thr Pro Glu Gly Thr Phe Trp Ala Asn Asn
35 40 45
cac ggc cgc aac tac aca gtc ctg ctc cgg atc gca ccc gct ccc aca 193
His Gly Arg Asn Tyr Thr Val Leu Leu Arg Ile Ala Pro Ala Pro Thr
50 55 60
ccc act gat gcc gaa ggg ctg ccc cag cag cag cag ctg ccg cag ctg 241
Pro Thr Asp Ala Glu Gly Leu Pro Gln Gln Gln Gln Leu Pro Gln Leu
65 70 75 80
gag cca cag ccc gag tgc cag ggt ccc gtg gag gct gag gcc agg cag 289
Glu Pro Gln Pro Glu Cys Gln Gly Pro Val Glu Ala Glu Ala Arg Gln
85 90 95
ctg aag agc tgc atg aag ccg gtg agg cgc agg cct gcc gag gag gaa 337
Leu Lys Ser Cys Met Lys Pro Val Arg Arg Arg Pro Ala Glu Glu Glu
100 105 110
ctg aag acg aag aac atg gat gat aac acc ttt gcc atg gca gag cat 385
Leu Lys Thr Lys Asn Met Asp Asp Asn Thr Phe Ala Met Ala Glu His
115 120 125
cct gat gtc cag gag tca gtg ggt cca ctg gta gcc ccc acc cct ctc 433
Pro Asp Val Gln Glu Ser Val Gly Pro Leu Val Ala Pro Thr Pro Leu
130 135 140
cgt cca tgg ccc cag atg aca ctt cag gtt tct gac gtt ccg atg act 481
Arg Pro Trp Pro Gln Met Thr Leu Gln Val Ser Asp Val Pro Met Thr
145 150 155 160
ggc aac ccc gca gaa gaa ggt gat gtc ccc aga agc agt cca cct gtg 529
Gly Asn Pro Ala Glu Glu Gly Asp Val Pro Arg Ser Ser Pro Pro Val
165 170 175
gct ttt aca gag gtc ctc cag gca ccg gcc atc agg att ccc ccc tcc 577
Ala Phe Thr Glu Val Leu Gln Ala Pro Ala Ile Arg Ile Pro Pro Ser
180 185 190
tcc cct ctc tgt ggc ctg ggt ggc tcc ccc aga gac cag gcc tca ggg 625
Ser Pro Leu Cys Gly Leu Gly Gly Ser Pro Arg Asp Gln Ala Ser Gly
195 200 205
ccc gat gcg agc gag ggg gcc acc ggg cct ttc ctg gag ccc agt cag 673
Pro Asp Ala Ser Glu Gly Ala Thr Gly Pro Phe Leu Glu Pro Ser Gln
210 215 220
cag cag gca gag gcc aca tgg gga gta tcg agt gag aat gga ggg ggg 721
Gln Gln Ala Glu Ala Thr Trp Gly Val Ser Ser Glu Asn Gly Gly Gly
225 230 235 240
ctg gag gct gtg agt ggg tca gag gag ctg ctc ggt gag gac acc atc 769
Leu Glu Ala Val Ser Gly Ser Glu Glu Leu Leu Gly Glu Asp Thr Ile
245 250 255
gac cag gag ctg gag cag ctc tac ctg tct cac ctg agc cgc cta cgg 817
Asp Gln Glu Leu Glu Gln Leu Tyr Leu Ser His Leu Ser Arg Leu Arg
260 265 270
gct gct gtg gct gcg ggt ggg gca ggg ggt ggt ggg gag ggc tcc aca 865
Ala Ala Val Ala Ala Gly Gly Ala Gly Gly Gly Gly Glu Gly Ser Thr
275 280 285
gat gga ggg atg tcc ccc agc cat ccc ctg ggc ata ctg acg gac cgc 913
Asp Gly Gly Met Ser Pro Ser His Pro Leu Gly Ile Leu Thr Asp Arg
290 295 300
gac ctg atc ttg aag tgg cct ggc cct gag cgg gcc ctg aac agc gcc 961
Asp Leu Ile Leu Lys Trp Pro Gly Pro Glu Arg Ala Leu Asn Ser Ala
305 310 315 320
ctg gct gag gag atc acg ctg cac tat gcc cgg ctg ggg cgt ggc gtg 1009
Leu Ala Glu Glu Ile Thr Leu His Tyr Ala Arg Leu Gly Arg Gly Val
325 330 335
gag ctc atc aag gac acc gaa gac cct gat gat gaa ggg gag ggt gaa 1057
Glu Leu Ile Lys Asp Thr Glu Asp Pro Asp Asp Glu Gly Glu Gly Glu
340 345 350
gag ggg ctc tct gtc aca ccc tcc agc cca gaa ggg gac agc ccc aag 1105
Glu Gly Leu Ser Val Thr Pro Ser Ser Pro Glu Gly Asp Ser Pro Lys
355 360 365
gaa tcg cct cca gaa atc ctc tcc ggg gcc cgt tct gtg gta gcc acg 1153
Glu Ser Pro Pro Glu Ile Leu Ser Gly Ala Arg Ser Val Val Ala Thr
370 375 380
atg gga gat gtg tgg ctc cca tgg gca gag ggc tca gga tgt gac ggc 1201
Met Gly Asp Val Trp Leu Pro Trp Ala Glu Gly Ser Gly Cys Asp Gly
385 390 395 400
cct gtg gtt ctg ggt aca gag ggt cag ttc att ggg gat cct gag aaa 1249
Pro Val Val Leu Gly Thr Glu Gly Gln Phe Ile Gly Asp Pro Glu Lys
405 410 415
ggg atg ggc aag gac acc agc tct ttg cac atg aat agg gtg ata gct 1297
Gly Met Gly Lys Asp Thr Ser Ser Leu His Met Asn Arg Val Ile Ala
420 425 430
ggg gtg act gag tcc ctg ggg gag gcc ggg aca gaa gcc cag ata gag 1345
Gly Val Thr Glu Ser Leu Gly Glu Ala Gly Thr Glu Ala Gln Ile Glu
435 440 445
gtc acc agt gag tgg gca ggc agc ttg gat ccc ata tct ggc aag gag 1393
Val Thr Ser Glu Trp Ala Gly Ser Leu Asp Pro Ile Ser Gly Lys Glu
450 455 460
cca gcc tct ccc gtc ctt ctg cag ggg caa aat ccc acc ctc ctc agt 1441
Pro Ala Ser Pro Val Leu Leu Gln Gly Gln Asn Pro Thr Leu Leu Ser
465 470 475 480
ccc ttg ggg gcc gaa gtc tgt ctc tct agt gta gcc agg cct cat gtg 1489
Pro Leu Gly Ala Glu Val Cys Leu Ser Ser Val Ala Arg Pro His Val
485 490 495
agc tcc cag gat gaa aag gat gca ggc cca agc ctt gaa ccc cca aag 1537
Ser Ser Gln Asp Glu Lys Asp Ala Gly Pro Ser Leu Glu Pro Pro Lys
500 505 510
aag tct ccc acc cta gca gtc cct gca gaa tgt gtg tgt gca ctg cct 1585
Lys Ser Pro Thr Leu Ala Val Pro Ala Glu Cys Val Cys Ala Leu Pro
515 520 525
cct cag ctc cgg ggg ccc ttg acc cag act ctg ggg gtc ctg gcc ggg 1633
Pro Gln Leu Arg Gly Pro Leu Thr Gln Thr Leu Gly Val Leu Ala Gly
530 535 540
cta gtg gtg gtc cct gtg gct ctg aac agc ggt gtg tcc ctc ctg gtg 1681
Leu Val Val Val Pro Val Ala Leu Asn Ser Gly Val Ser Leu Leu Val
545 550 555 560
ctt gcg ctg tgc ctc tct ctg gct tgg ttc tca taggctctgc ttgtgggatc 1734
Leu Ala Leu Cys Leu Ser Leu Ala Trp Phe Ser
565 570
agcagaggct taagatggga tacatggcct gtgcagtgag gggacctggg tcctttgctt 1794
ctgagaatgc tcaactgaaa gagaggcctt ctcatcccca agctctccag tcaacacagg 1854
gctccctgtg gtgacaccag tggagatgag ggaacgggta gatggtgtga gtgaggggaa 1914
cttttagagt ggaactgggc atgtcctccg cctacccccc gagcctgtat ttatttttgt 1974
ataattctct ggatgaggga gagtggtcgt gagctggtct tggggcacaa ttacccagag 2034
atatatttat taacagccaa cctgtgcaac ctgctggagc tttattttta atttaattta 2094
tatagagtac ctattattat atgccacaat agagctctat gagaaacagt g 2145
<210> 8
<211> 2347
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(679)
<400> 8
c gcg gcg acg ggg cgc ggc ggg ctc cct cgg ggt ccc agc tgg ccg gca 49
Ala Ala Thr Gly Arg Gly Gly Leu Pro Arg Gly Pro Ser Trp Pro Ala
1 5 10 15
ctc ggc ggc cgc ggc gcg atg gag gcg ccg gcc gag cta ctg gcc gcg 97
Leu Gly Gly Arg Gly Ala Met Glu Ala Pro Ala Glu Leu Leu Ala Ala
20 25 30
ctg cct gcg ctg gcc acc gcg ctg gcc ctt ctg ctc gcc tgg cta ctg 145
Leu Pro Ala Leu Ala Thr Ala Leu Ala Leu Leu Leu Ala Trp Leu Leu
35 40 45
gtg cgg cgt ggg gcg gcc gcg agc ccg gag cct gcc cgc gcg ccc ccg 193
Val Arg Arg Gly Ala Ala Ala Ser Pro Glu Pro Ala Arg Ala Pro Pro
50 55 60
gaa ccc gcg ccc ccg gcc gag gcc acc ggg gcc ccg gcg ccg tcc cgc 241
Glu Pro Ala Pro Pro Ala Glu Ala Thr Gly Ala Pro Ala Pro Ser Arg
65 70 75 80
ccc tgc gcc ccc gag ccg gcg gcc tcg ccc gcg ggg ccg gag gag cct 289
Pro Cys Ala Pro Glu Pro Ala Ala Ser Pro Ala Gly Pro Glu Glu Pro
85 90 95
gga gag ccc gcg ggg ctg ggg gag ctc ggg gag cct gcg gga ccg ggg 337
Gly Glu Pro Ala Gly Leu Gly Glu Leu Gly Glu Pro Ala Gly Pro Gly
100 105 110
gag ccc gaa ggg cca ggg gat ccc gcg gcg gcg cca gcg gag gcg gag 385
Glu Pro Glu Gly Pro Gly Asp Pro Ala Ala Ala Pro Ala Glu Ala Glu
115 120 125
gag cag gcg gtg gag gcg agg cag gaa gag gag cag gac ttg gat ggt 433
Glu Gln Ala Val Glu Ala Arg Gln Glu Glu Glu Gln Asp Leu Asp Gly
130 135 140
gag aag ggg cca tca tcg gaa ggg cct gag gag gag gac gga gaa ggc 481
Glu Lys Gly Pro Ser Ser Glu Gly Pro Glu Glu Glu Asp Gly Glu Gly
145 150 155 160
ttc tcc ttc aaa tac agc ccc ggg aag ctg agg gga aac cag tac aag 529
Phe Ser Phe Lys Tyr Ser Pro Gly Lys Leu Arg Gly Asn Gln Tyr Lys
165 170 175
aag atg atg acc aaa gag gag ctg gag gag gag cag aga gtt cag aag 577
Lys Met Met Thr Lys Glu Glu Leu Glu Glu Glu Gln Arg Val Gln Lys
180 185 190
gaa cag ctg gct gcc atc ttc aag ctc atg aaa gac aac aag gag acg 625
Glu Gln Leu Ala Ala Ile Phe Lys Leu Met Lys Asp Asn Lys Glu Thr
195 200 205
ttt ggc gag atg tcc gac ggc gac gtg cag gag cag ctc cgg ctc tac 673
Phe Gly Glu Met Ser Asp Gly Asp Val Gln Glu Gln Leu Arg Leu Tyr
210 215 220
gac atg tagactgcgc ccacgggatg cacagcggcc atgcctgatg ctgtccccac 729
Asp Met
225
ccttgcccca tgcccttgtt ctttccagac ttctgtaggc ctaattttcc cttataaaca 789
tagatgcagg cgtacattct ataggtactt gccgagttct tgcagtgtgt atattacttt 849
gcaggaatca aaattgtttt attcagaaga caaagtccct gacccctctg gtttctgctt 909
gctggtgagg ttccagctgt tattgggctc tgaggccctc tcctgatcca aatctttttt 969
tgtttgtttg tttgagatgg agtctcactc tgtctcccag gctggagtgc agtggcgtga 1029
tcttggctca ctgcaacctc cacctccctg attcaagcga ctctcctgcc tcagcctccc 1089
gagtagctgg gattgcaggc atgcaccatc acacctggct aattttgtat tttcagtaga 1149
gacagcgttt tgccatgctg gccaggctcg tctcgaactc ctgacctcag gtgatccgcc 1209
caccttggcc tcccaaagtg ctgggattac acgcgtgagc caccatgccc ggccaatact 1269
acactcttat tctgccttac gaggatttct ggaagcttct gtaactgtag ggaagaaagc 1329
tgtaggggag catggttttt ttgttttgtt tttgtttttg agacgaggtc tcactgtcac 1389
ccaggctgga gtgcagtggt gccatcatga ctcactgcag ccttgacctc ccaggcgcac 1449
aaaatcctcc tctcagcctg agtaactggg actacaggtg tgtaccacta catccagcta 1509
aaggagagcg ttttttcctt tttctttttg agacagagtc tcactctggc gcctggtcac 1569
tgtgcccttc accttctggg ttcaagtgat tctcctgcct cagcctccct agtagctggg 1629
attacaggca tgcaccacca tacccagcta atttctgtat ttttagtaga gatggagttt 1689
caccatgttg gccaggctgg tctcgaactc ctgacctcag gtgatccacc tgcctcggcc 1749
tcccaaagtg ctaggattac aggcgtgagc caccatgccc agctgggagc gttctttcag 1809
caaaaagtct aattcagaga ctcaccttcc atcctggcac taaacaatag tactagaggt 1869
aagatctgcc aagaatttct ctcacttaag caggatactt actttaaata taattaggca 1929
agacttattt gagtacttga ctaattgtat ttaattagaa ctatgctgga tgctgtagac 1989
agaagaaata aaatactcaa cttggtccct atcttccagc aggttagagg cctttaggga 2049
agaaaagcct gtgtgcaggg aaagagaacc tcaccagaaa gtattcaata aaatgccaaa 2109
gtgatcttac agaacagtgc ctgtgacacc ggaggggatt gttctgtctt tgtgagtgaa 2169
catacaatga tggggataac tggtcctcat accatgcagg tcccatttct gatcacccca 2229
ctggcaggac aattcacagt tcttgagcaa actgagttag ggagggtatg ctaacgacag 2289
aacctatgca agttcttaga aatatttttt gtttgctgaa ataaagctca atcaacac 2347
<210> 9
<211> 1920
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(827)
<400> 9
gg cgg ccg gct ggg cgt gcg ctc gct ccc cga agc cgg ggc tgg gcc 47
Arg Pro Ala Gly Arg Ala Leu Ala Pro Arg Ser Arg Gly Trp Ala
1 5 10 15
gga gcc ggg cga ggg ctg gga gct ggg ccg ggt ccg ggg aca gcg ggc 95
Gly Ala Gly Arg Gly Leu Gly Ala Gly Pro Gly Pro Gly Thr Ala Gly
20 25 30
gag ggg cag ctg ccg gag ccg ggc agc cag gcc gct cag ggc agg gga 143
Glu Gly Gln Leu Pro Glu Pro Gly Ser Gln Ala Ala Gln Gly Arg Gly
35 40 45
cag ctg gcg ccg gtt ctg cgg tct ccg ggg ccc aga tgt gag gcg gcg 191
Gln Leu Ala Pro Val Leu Arg Ser Pro Gly Pro Arg Cys Glu Ala Ala
50 55 60
gcg ccc ccg gcc cga gag cgc acg atg ggg gcc ccg ctc gcc gta gcg 239
Ala Pro Pro Ala Arg Glu Arg Thr Met Gly Ala Pro Leu Ala Val Ala
65 70 75
ctg ggc gcc ctc cac tac ctg gca ctt ttc ctg caa ctc ggc ggc gcc 287
Leu Gly Ala Leu His Tyr Leu Ala Leu Phe Leu Gln Leu Gly Gly Ala
80 85 90 95
acg cgg ccc gcc ggc cac gcg ccc tgg gac aac cac gtc tcc ggc cac 335
Thr Arg Pro Ala Gly His Ala Pro Trp Asp Asn His Val Ser Gly His
100 105 110
gcc ctg ttc aca gag aca ccc cat gac atg aca gca cgg acg ggc gag 383
Ala Leu Phe Thr Glu Thr Pro His Asp Met Thr Ala Arg Thr Gly Glu
115 120 125
gac gtg gag atg gcc tgc tcc ttc cgc ggc agc ggc tcc ccc tcc tac 431
Asp Val Glu Met Ala Cys Ser Phe Arg Gly Ser Gly Ser Pro Ser Tyr
130 135 140
tcg ctg gag atc cag tgg tgg tat gta cgg agc cac cgg gac tgg acc 479
Ser Leu Glu Ile Gln Trp Trp Tyr Val Arg Ser His Arg Asp Trp Thr
145 150 155
gac aag cag gcg tgg gcc tcg aac cag cta aaa gca tct cag cag gaa 527
Asp Lys Gln Ala Trp Ala Ser Asn Gln Leu Lys Ala Ser Gln Gln Glu
160 165 170 175
gac gca ggg aag gag gca acc aaa ata agt gtg gtc aag gtg gtg ggc 575
Asp Ala Gly Lys Glu Ala Thr Lys Ile Ser Val Val Lys Val Val Gly
180 185 190
agc aac atc tcc cac aag ctg cgc ctg tcc cgg gtg aag ccc acg gac 623
Ser Asn Ile Ser His Lys Leu Arg Leu Ser Arg Val Lys Pro Thr Asp
195 200 205
gaa ggc acc tac gag tgc cgc gtc atc gac ttc agc gac ggc aag gcc 671
Glu Gly Thr Tyr Glu Cys Arg Val Ile Asp Phe Ser Asp Gly Lys Ala
210 215 220
cgg cac cac aag gtc aag gcc tac ctg cgg gtg cag cca ggg gag aac 719
Arg His His Lys Val Lys Ala Tyr Leu Arg Val Gln Pro Gly Glu Asn
225 230 235
tcc gtc ctg cat ctg ccc gaa gcc cct ccc gcc gcg ccc gcc ccg ccg 767
Ser Val Leu His Leu Pro Glu Ala Pro Pro Ala Ala Pro Ala Pro Pro
240 245 250 255
ccc ccc aag cca ggc aag gag ctg agg aag cgc tcg gtg gac cag gag 815
Pro Pro Lys Pro Gly Lys Glu Leu Arg Lys Arg Ser Val Asp Gln Glu
260 265 270
gcc tgc agc ctc tagactgatg cccctgcccc cgcccatccg cccccacgct 867
Ala Cys Ser Leu
275
gtacagagtg catgaggagc cgccggacca ccggggaccg actgcctgcg tccagccgcg 927
ccccatcccc gaggccgcct gtggccacca tgtcggccct ctttccacca ccccttgctc 987
agcatgtaag ccccacccac ccctgccctt tcagacccct gcggtgacct ggctcggaga 1047
aggtggccct gggcaccaag gggccaaccg ccctgaacac tggggcaggg accatgctgg 1107
ggcccggggc cacccccttc ctgtcaccag cttctgtgga gtccagtgtt ttgctttgct 1167
tgcttgtccc ccatcctgtc ctgagccggg gccccccagc ctcgcctccc tcctcctacc 1227
atccctcact tggacctggg ggtgtggaca gtgacccctc cctgaatatg gacttgaatc 1287
ttctgagcag aactagggcc tctcccctgg tgaagaccca gggaacccag gagggccctt 1347
ctggggcagt ggctctgcag ggtcactcat ggaggcctag gggaacagcg agatgcccca 1407
ccacctcctg gcgagtcctt cctgttcagc tccctgtgcg accctccagg gatgcagggg 1467
atccaggatt ctctgccctg tcacacggcg agtcagaagg gaggggcctt tccctcggac 1527
ccatggcccc aggcagagtt ttgcaccagc aggacccctt tgagggcctt caaggctctc 1587
ccaggagtcc ccctctgccg gccccccaat gccccagctc cctcttgggt cctgtgccaa 1647
gtccgcccca gggcctgggg ctgttgggag ccaagggccc cctggtactc agttccctca 1707
cgattcccga tcacgggcac acctgccccc tggttatttg taaatatttc tattggaccc 1767
aattctcctc ggaattggct ggcacctctg gttgccacag ctcagtgatg acgtggggga 1827
ggtgggagag gccgagggct ttgcctaggg gtgggttgcc ctgtatacat gatccagtct 1887
gtgactacca gccaacctga ataaagcggt ttt 1920
<210> 10
<211> 6385
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2801)..(4435)
<400> 10
gatgggccag cagcacccca agccattcac tccatggggc ccggtaggac aggacaagaa 60
atggcttggg gcagggacag aacagccacc tgtggaggac gcagccctcg tcagcgacca 120
gggtcaccca tcacgggggt gtcccctctg gggtccctgt tcagagagaa gaggcctaag 180
gagcactggg gtccagtccc aagacagtct gatgtccagg gatacagagg cgcccccacc 240
ccctcacagc tctgcaggcc atgcacgaag gcctgtgtgg atgagaaccg tgtgtgttgg 300
taagaggcac agccctcgcc ctctggggcc tcgcctcctc tttgaactca gattctgaaa 360
gagcccccat cacattcccc tgtgcctctg cggccattcc ctgcagatca tctgcccagc 420
ggccccctcc accccacagg tgaccctcac aaggggccag gaggaacaga gccacagcca 480
tgcctcctgt ggccaggggt cctcctcagt ctagacagca cctgcgatgc tccacggcac 540
ctgtgtccct cctcctgtgc ctggtggcca ggacccactc cctggctcag actgtgggca 600
agacaggcag cacccctcag ggcagccatg tgagtcccat cgcacctgtc actcagccat 660
ggactgtccg tctgcccagc acccctcccc accgggggac ttcacggctc ttagggtcag 720
tgcatatgtg tgcgtgtgca tgtgctcgca catcctgaag tccccctggg ctcagccctt 780
cactggcttt gagctcctct gcctgaggac tcctgggtgg acaagtgggc tggccagagg 840
tgaaggcagc agtgactggg ggcctcggga gaaggctgtg gcccagtgtg acctccacag 900
ggcagaaagg gcatgcagag gagaggctgg acagcggctt gtaggaatga agtcccaggc 960
agcaaagaca ggagtacgag gttgaccaga tggctctggg gccgggggtg ttgaggattt 1020
ctgtgatttt aagtgatcac gtagtgaatt atttttacgc ttaacaaagc caccgaggta 1080
tggaaatcca tcctgacagc agaagactag gtgtgtgaat tcactttaga agtcactctg 1140
tgttgggggg tttggagggg cctctgtgga gtccacacct tgccccctct gccttctggg 1200
ctggccccgg gcttggaggc agaggcccct tggggtcctg gtgggatgag ccgttgtgtg 1260
gggagggcct actgtggcca cgctgggaga gaggctggac gccagctccg cagcaggatt 1320
gcacctcccc ctctcctcct ggccatgcct gtgctgaggc cctgggaggg cggtgcctga 1380
cctcagagaa tgtcgctcta atggtggaat ttcaggtact gccacgtagg gaagagtctc 1440
gtgggaaggg cattttccag cctctgtcac aaaatctcca agggaattca gacctcatct 1500
agttccatcc tcttattttg cagaagagga ggctgagggc ccgggagacc aaggctgccc 1560
acctgtgggt ggcagagccc aggcacccag cgcccccagc tccctgcccc ctgctccttc 1620
cgcccaccca cacacccagc agctggcctc tctgcggcca gcctgctcgc ttgctggaca 1680
cctcgacagc cccaccgcca gccgcaggtg cgcgcaaggg acccgtcaca agcatcttgc 1740
cctgtggcct cctgagaagc gagtgagtgg accacagagg tcatttccca gccacgtggc 1800
caggcccagc ccgaggcagg accgggctca gggtgggaag cagggctgct cagaggagcc 1860
aggatggggc ggccaagccg acggcctgcg gggccgcctc cggactccta acgcggcagg 1920
tgtttttact cctgctgtca cttatgtatc agtgtcactc ttgcttcatc ttatctgttt 1980
ccttccaggt ccagctttag gggcagctgc caggccttag gagcctttgc aatggttttg 2040
tattcctgta gacaccagat ataaacagat ttatatattt gtgtccctcc cctgctcccc 2100
ccagttcccc agcccgacgc acactctaca cgcaggccac tctgacctct gaccccgtgt 2160
tccatgtggc aggagggccc gccccgggcc aggccccagg tcatggctgc tggctgtctg 2220
tggcggaagt ccttggttgt ttcctctccc ctcctcccgg ctctcaccat tctgtccgtc 2280
tatctcgtct tgtcctcagt ttcaaagcaa tgtcacgaag ggcttccttt ccatgcctaa 2340
aaggaaacaa tatgtttttg gaaagaggtg gggttggagc gggttctggg gcaacttggg 2400
gccctcggtc ttggggcgat cagaggatat ctctggattg gctggcatca cagccagttg 2460
ggagcctccc aagcctgggt cacgttggag aatctgccac ccaaggcagg cagggccggg 2520
ctgggatgcg tcggcctgct ttggtggaga caccagggcc tgcagtgggc tgaaccgtat 2580
tgtttccctt tgaggccagt cctggtcctc gtccccatcc atagctcatg ctttttgatt 2640
tgcatctttt tttgcatgga catgccgagt agtgataagc agggttttct gttcttttgg 2700
gcgtctctgg tcagactctt ctctttcaga tcccgtgtca cagcattagg ttcaggtgtg 2760
tcattgccgt gtcctttgag ccctaaaacc aagggcatga cca agg tgt gca ggg 2815
Pro Arg Cys Ala Gly
1 5
aga tct agc cct gtg agg aag cgg cac ggt ggc cgc agg gca gga ggt 2863
Arg Ser Ser Pro Val Arg Lys Arg His Gly Gly Arg Arg Ala Gly Gly
10 15 20
aag gac acc ctg gtc tct gtg cct agg tcc gtg caa gac agc ggc cag 2911
Lys Asp Thr Leu Val Ser Val Pro Arg Ser Val Gln Asp Ser Gly Gln
25 30 35
ggc ggc cgg gag aag ctg gag ctc gtc ctg tcg aac ctg cag gca gac 2959
Gly Gly Arg Glu Lys Leu Glu Leu Val Leu Ser Asn Leu Gln Ala Asp
40 45 50
gtc ctg gag ttg ctg ctg gag ttt gtc tac acg ggc tcc ctg gtc atc 3007
Val Leu Glu Leu Leu Leu Glu Phe Val Tyr Thr Gly Ser Leu Val Ile
55 60 65
gac tcg gcc aac gcc aag aca ctg ctg gag gcg gcc agc aag ttc cag 3055
Asp Ser Ala Asn Ala Lys Thr Leu Leu Glu Ala Ala Ser Lys Phe Gln
70 75 80 85
ttc cac acc ttc tgc aaa gtc tgc gtg tcc ttt ctc gag aag cag ctg 3103
Phe His Thr Phe Cys Lys Val Cys Val Ser Phe Leu Glu Lys Gln Leu
90 95 100
acg gcc agc aac tgc ctg ggc gtg ctg gcc atg gcc gag gcc atg cag 3151
Thr Ala Ser Asn Cys Leu Gly Val Leu Ala Met Ala Glu Ala Met Gln
105 110 115
tgc agc gag ctc tac cac atg gcc aag gcc ttc gcg ctg cag atc ttc 3199
Cys Ser Glu Leu Tyr His Met Ala Lys Ala Phe Ala Leu Gln Ile Phe
120 125 130
ccc gag gtg gcc gcc cag gag gag atc ctc agc atc tcc aag gac gac 3247
Pro Glu Val Ala Ala Gln Glu Glu Ile Leu Ser Ile Ser Lys Asp Asp
135 140 145
ttc atc gcc tac gtc tcc aac gac agc ctc aac acc aag gct gag gag 3295
Phe Ile Ala Tyr Val Ser Asn Asp Ser Leu Asn Thr Lys Ala Glu Glu
150 155 160 165
ctg gtg tac gag aca gtc atc aag tgg atc aag aag gac ccc gcg aca 3343
Leu Val Tyr Glu Thr Val Ile Lys Trp Ile Lys Lys Asp Pro Ala Thr
170 175 180
cgc aca cag tac gcg gct gag ctc ctg gcc gtg gtc cgc ctc ccc ttc 3391
Arg Thr Gln Tyr Ala Ala Glu Leu Leu Ala Val Val Arg Leu Pro Phe
185 190 195
atc cac ccc agc tac ctg ctc aat gtg gtt gac aat gaa gag ctg atc 3439
Ile His Pro Ser Tyr Leu Leu Asn Val Val Asp Asn Glu Glu Leu Ile
200 205 210
aag tca tca gaa gcc tgc cgg gac ctg gtg aac gag gcc aaa cgc tac 3487
Lys Ser Ser Glu Ala Cys Arg Asp Leu Val Asn Glu Ala Lys Arg Tyr
215 220 225
cat atg ctg ccc cac gcc cgc cag gag atg cag acg ccc cga acc cgg 3535
His Met Leu Pro His Ala Arg Gln Glu Met Gln Thr Pro Arg Thr Arg
230 235 240 245
ccg cgc ctc tct gca ggt gtg gct gag gtc atc gtc ttg gtt ggg ggc 3583
Pro Arg Leu Ser Ala Gly Val Ala Glu Val Ile Val Leu Val Gly Gly
250 255 260
cgt cag atg gtg ggg atg acc cag cgc tcg ctg gtg gct gtc acc tgc 3631
Arg Gln Met Val Gly Met Thr Gln Arg Ser Leu Val Ala Val Thr Cys
265 270 275
tgg aac ccg cag aac aac aag tgg tac ccc ttg gcc tcg ctg ccc ttc 3679
Trp Asn Pro Gln Asn Asn Lys Trp Tyr Pro Leu Ala Ser Leu Pro Phe
280 285 290
tat gac cgc gag ttc ttc agt gta gtg agt gca ggg gac aac atc tac 3727
Tyr Asp Arg Glu Phe Phe Ser Val Val Ser Ala Gly Asp Asn Ile Tyr
295 300 305
ctc tca ggt gga atg gaa tca ggg gtg acg ctg gct gat gtc tgg tgc 3775
Leu Ser Gly Gly Met Glu Ser Gly Val Thr Leu Ala Asp Val Trp Cys
310 315 320 325
tac atg tcc ctg ctt gat aac tgg aac ctc gtc tcc aga atg aca gtc 3823
Tyr Met Ser Leu Leu Asp Asn Trp Asn Leu Val Ser Arg Met Thr Val
330 335 340
ccc cgc tgt cgg cac aat agc ctc gtc tac gat ggg aag att tac acc 3871
Pro Arg Cys Arg His Asn Ser Leu Val Tyr Asp Gly Lys Ile Tyr Thr
345 350 355
ctc ggg gga ctt ggc gtg gca ggc aac gtg gac cac gtg gag agg tac 3919
Leu Gly Gly Leu Gly Val Ala Gly Asn Val Asp His Val Glu Arg Tyr
360 365 370
gac acc atc acc aac caa tgg gag gcg gtg gcc cct ctg ccc aag gca 3967
Asp Thr Ile Thr Asn Gln Trp Glu Ala Val Ala Pro Leu Pro Lys Ala
375 380 385
gta cac tct gct gca gcc aca gtg tgt ggc ggc aag atc tac gtg ttt 4015
Val His Ser Ala Ala Ala Thr Val Cys Gly Gly Lys Ile Tyr Val Phe
390 395 400 405
ggt ggg gtg aac gag gca ggc cga gct gcc ggc gtc ctc cag tct tac 4063
Gly Gly Val Asn Glu Ala Gly Arg Ala Ala Gly Val Leu Gln Ser Tyr
410 415 420
gtt cct cag acc aac acg tgg agc ttc atc gag tcc cca atg att gac 4111
Val Pro Gln Thr Asn Thr Trp Ser Phe Ile Glu Ser Pro Met Ile Asp
425 430 435
aac aag tat gcc ccc gct gtc acg ctc aat ggc ttc gtt ttc atc ctg 4159
Asn Lys Tyr Ala Pro Ala Val Thr Leu Asn Gly Phe Val Phe Ile Leu
440 445 450
ggc ggg gct tat gcc aga gct acc acc atc tac gac cct gag aaa gga 4207
Gly Gly Ala Tyr Ala Arg Ala Thr Thr Ile Tyr Asp Pro Glu Lys Gly
455 460 465
aac att aag gcg ggc cca aac atg aac cac tct cgc cag ttc tgc agt 4255
Asn Ile Lys Ala Gly Pro Asn Met Asn His Ser Arg Gln Phe Cys Ser
470 475 480 485
gct gtg gtg ctt gat ggc aag att tat gca act gga ggt att gtc agc 4303
Ala Val Val Leu Asp Gly Lys Ile Tyr Ala Thr Gly Gly Ile Val Ser
490 495 500
agt gaa ggg ccc gcg ctg ggc aac atg gag gcc tac gag ccc aca acc 4351
Ser Glu Gly Pro Ala Leu Gly Asn Met Glu Ala Tyr Glu Pro Thr Thr
505 510 515
aac aca tgg acc ctc ctc ccc cac atg ccc tgc cct gtg ttc aga cac 4399
Asn Thr Trp Thr Leu Leu Pro His Met Pro Cys Pro Val Phe Arg His
520 525 530
ggc tgc gtc gtg ata aag aaa tat att caa agc ggc tgacatcagc 4445
Gly Cys Val Val Ile Lys Lys Tyr Ile Gln Ser Gly
535 540 545
agaaagccca cgataagact gtggacaagt ctggtgaggc aagtgccacg caatgataat 4505
tttccagcga caccaacaag aggccaacaa aacacaatca aggaactcac tgcgctcaac 4565
atgttgaata ttctctacat tgaatgtaga aaatcatcct cgcctttgga tgaaacggag 4625
gcaccgcgct tggagccgca ggaaccacga tcccgccatg gggctggctg cctcctgaac 4685
aggggcgctc gctctgccag gtgcaataga gtttcacgta tttttcaact gggagagaga 4745
agctgttttt tccttcctgc agagcaagct tgatccctaa acaaccatag atcagttatc 4805
ttatgacaac attaggcatc aggctctctt ggaataagat caaagtgtcc ttatcacttt 4865
gattcctact tttgtttctt aaccgatcta cactttcagt ggccgacaga aaacgaggga 4925
caatactgtg catcacaagg cctaggaggc tgctggtccc cactggggct gaagagaagc 4985
ccagctgccc acgcggagcc aggggtggca gctgtgggac agccagggag cagggacagc 5045
ggtctgtcct tcacaggttt ttctactgtg tttttgctgg agaaggacag tgattgcgct 5105
agctttctct tacccggtat gaattattta gatttctgag gcattttctt gataaacaaa 5165
aggctatttt taagtactga gaggaggagc aggccacaag agggataatg ttgtgggaat 5225
tcccaaagct ctttgtaggt agtgccagag gggggctttt gctctcattt ttctatgtgc 5285
agaatagagg atctctcctg gggtgggcga tgcccccatt ttatttttag aaaaagtaac 5345
tcccagacag ccccataaaa gctgtgccca aggaagaaga gtctgctcta gaaggagccc 5405
ggttctggct caggacaccg gcccagctcc ctccatgagg tcaagctgag gaccaggcca 5465
gtgggaaggg aaggagggag aattagcgtc tataaagcac aggagactat ttttgatatt 5525
catagctata tattaaggca cctgccacaa gagctctcag gatggggaca gccttcttag 5585
tggagccatg gcagcaaggc ctgagggcat gagcagaacc actcttcttg tcacatacga 5645
acctgagaaa agggaagcca ggagggaggt cacaccatgg ctcaaaaggg aaaggccttc 5705
ccacttgtcc ttagcccctc aaacctcaca cggtcaacag tttccattcc agggcaggag 5765
aatgctgccg ccactgcgct gttgagttga agttggtacc aaatacacat ttaccacttt 5825
tatatctggg aagtcaactt gccatcgttt catgataaca accatttata agagaaaaag 5885
acaggacacg ctttccatcg ttcagtattt gatgacacaa aattccagtt ctaacgttgg 5945
gcatcaactt ctagcactac gagtgtggct cccacttgga caagataccg agcttcgtta 6005
tgcagttttt aatattattt attattttaa aaagtaataa gcacaaaact acatacattg 6065
tatgtcattt aaagtattta tgtcaaacag ggtgcaagtg tgaacccaag gactggagca 6125
caaattccta actgcctggg gcagggctaa tgttagcatt ggtgtgcgtc tgcctccaaa 6185
ggaggttcta gttgtcagcg agactcaaca cagatgacat tgaaattcgt ttctctcctc 6245
atctatcaca ctggagcaaa actggctatt tctgtgaatg atataaaaca gggttctctg 6305
taatggtatt gtacatagta tatgtttact gttaagttct tgttatatta taataaatat 6365
atttatagat ctagacttgg 6385
<210> 11
<211> 6522
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (5610)..(6137)
<400> 11
ggccgtcctt tttttttttt tttttttagc aatatatata tataatttat ttacattcac 60
gcccgataaa acccctatgt gccccggcgg ccgggcaagg ctgtgtacat aaggccaaga 120
gtaagtgcgt gaatgcactt aagacaaagt caggacacga gcttcacatg acaggccccg 180
cgtggggcac cagccagccc tggggacggg cacgccacgc cacacacaca ctcaccactg 240
tacagcctgg gactcccatt gcatattcac aggccccgcc gggcagggca cctcaaggct 300
gggggagggg caggggcagg gaggagccgt ggggtgtccc tgggtgggtg gagagggcag 360
catgtgagag gcaaatgtgc accaacactg ggcgtgagac gtgagcagcc tcaggtgtac 420
ggcatgagat gtgtgtggtt ggggggtgtc tgcgtgaccc gggagggggg tgtgtgtgag 480
atgagcacac gaggcatgcg tggcacgtgc tcgtgtggtg gtcgtgtgcc tgaatccagg 540
ggctaccccc tgtccggctg tggccctcgg tcctgcaggc ttggagcagg gcccctcaga 600
cgtgccccta cccagcaggc acagaaatgt ttgcataagg tccagctcag gcaggagctc 660
tggggccctg gcccaggccc agtgtgtgcg tgcatggccg tgtgtatgcg gggcccctgg 720
agagggacgg gaggagaggt agcatcacac gcacatacac acacacagat gggctgggcc 780
tggcccaggc tccaggcact gtccctcctg ggccctgggg ccgagtgtga gtggcagtgc 840
cagcctggct aggcaggcca tggagaggtg ggagttcggg cagggtctga aggaccagca 900
ttaagggaca gcttcggccg ggagggggcc tgtccgagca gctctccccg gtgtggggaa 960
gggcccctgt ccggcttctg cctgtccttg gtgggtgcca aagctggtga gagtgtaggt 1020
gtcgggcctg gccggcctca gcactcagcc tccgacactg tgaagaggcc agcgtctggc 1080
tggcccagtc cagccactgc tgcagccagc tggctgaggt tgccgttggg gaagtgccgc 1140
gcctcccaca ggttgaggat catggctgtg gggctgggct tggaggcaaa gaagctgaga 1200
tggctgtgga ggggaggggg gccgtcagca gcctggcctg gccccagctg acacctcctg 1260
ccccccctgg cccaccccag ggcagggcct ccccgaagcc aaacacaatg actgccccgg 1320
ccactagtag atgcggcaca ggccccacag ggcctcctgt ccacctcacc caccccgccg 1380
ccgggcagga ctgtcggtgg cctgcgcagg ccctctctgc ccctctcccg cccacctgtc 1440
caggtggagt ttctgggcca gagtccgcca gtcggcaccc cgcctacagg gtgggtccag 1500
gctggaaatt atcttctgcc gaatgaggaa ggggatcttg aaggcactgg ggcccaccag 1560
ggctgggacc cccgcttcac tctccagagc cagcagctca gcaaaccttg tgtcctggag 1620
gtgggaggga aagaggtgcc tgttagaact ttccccaggc ccgggtccac cgccacctcc 1680
tgctggggga gctcccagac tcccccagag catgggggcg gcatgagcac tggtacttcc 1740
tgggcacctc ccatgtgcca aggcccttct tactggggct cggaggtgca gcgacctgcc 1800
caaagccaca tagccagaga gcagaggagc cctctgtggc ctccctccct ctccgccctt 1860
ctcccctcca agtcactctg cccagcctct gctgacactc aacacacaca gctctcttca 1920
cggcctgagt cccccctgcc cagctctcac cgccagaaat gctcttcccc ttaatgccgc 1980
agccaaactg gctcattctt tggggcccag tacaaagccc ctcctcaggg aagccctcca 2040
gggtctcccc tcctgctgct cccatgtccc ctgcactccc ctccccaagt cccatttgca 2100
cctgtgcctc cctgtctctg agtctgggag gaagctgcca aggggtttct aggaggaggt 2160
gggcctgggg acaggcggga gggagagtgg gtggcagggc aggcagggca ggagcgtttt 2220
gggggctggt ggggagttgg tagtggaagc acgtcacggc ggtgcggcag cccctcccgt 2280
ccaccttggt gatgttgaag ttgatgctga agctctgccc gtcgccctcc acctgccaca 2340
cccacagctt gcaggccagg tcactagtgc tggggctgac acgctccagg gtgaaggtgc 2400
agtgcaagta ccgctgcgtg ccattccaga tgtgataaaa ggggatctcc tgatggggtt 2460
gcagagagga ggtgtcaggt gagcccaagc tccaggcctc ttgtgcgctg gccacggccc 2520
tggccctcac ctggtagctg acaaggagct tactcttcca cagggagctg ggcacatcgt 2580
ggatggatag gcgcaggttg tggtaactgt ccttgaagtg caggacccgt ggctcctgga 2640
tcagctgtcc ccccagctgc ttctccagct gcaccacctc ctgcagtgcc ccaggtggtc 2700
agcccaggcc cctccccact gcgcccaagg caggcagtcc acagggccag gcatagctcg 2760
gcctcccagc ggcggggggt gaggggcggg agataccttg agtgcatcgt gggtgtcatg 2820
caggcagtag acccggatgt tgtactcgag ggaggtgcag gccaccggcg caaacagaag 2880
cagcttgagg cgcttggcgg cagccacgct gagggcctct cccaccaggg caaagcggcc 2940
cagctgctcg gtgaagacgt agcaggcact ggcctccagc tggcagtagt agaggtggga 3000
gggcgcctcc tcgcccaggt gcagcacatc ctgtgggcag gggtcagtgg gtcaattcgg 3060
cccagcctca gagccctagg gccagctgtg gcttcaagaa gtggccggag actggggagg 3120
ggtgggggtg cggctcagga ccacccagcg gtggaaggcc gggtggagcg tgcagttagc 3180
ccacgttccc ttctggagcc cgggtcagtt ccctgctcac ctcccagctg ccctcgcacg 3240
actgcttttt gaggcgcagg ctccagctgt cagggctggg ctccccacag tggtccatag 3300
ccaggatgac tggccgggtg agcaggacgc cagggggtcc acagctaacg atgggactca 3360
gcagggtctg acagccagct aggggcaacc tcaagtgggg aaatatgggt ggggagggaa 3420
aggtgtcagt ggcccatcct gggtcctgag ggctaggcca agggcagggc agcagcatcc 3480
caggggccag gggcttgggc tgcaggacca cgtggggctt aggcacagct ctttcccccg 3540
ccgcctcggg ccaggcacaa cagtgggtgg gcagggggct tcctggggac cgaaggcagc 3600
aggcaggtcc ctcccacccc cggcaacagg gcccgcggcc acacctcacg tcttccggct 3660
tgtgcagcgt gaggtagatc tcatagatct tccctcgggg tatggcatct ggggggatga 3720
ggaggctgat tcctggaacg gcaggaagag ggccagggct tacccactgc caaagcaccc 3780
aacaccccac cacacagccc ctccaggagc cttcctgtga ggactgactg gtgccagccc 3840
ggcctgggga cagaggggcc tcaaagactg ccgtgctgac caaccacccc cacaggcatg 3900
aagctgagtc aggggcaccc catgagcaac tgagggaacc caggtgggga aagcatgcct 3960
ggctagctca ggcacagaaa acttggggac agaaggtatg cggtaccgac ctcccttgtc 4020
ccctgaaagg cactggaggg gaccagggta tcccttgctt aaaaccttcc ctgcccccgg 4080
atgaagtcca gctccctggc ctggcattga agggcctttg cctctggccc ccactgcctc 4140
tctagcccag ccagtgctac aaagttggtg gttctggaac cactgcctga gcccccactg 4200
ccctcccgtc ctgctccagc tcccctctcc cttgctcctt ctactctctg ccatgccatg 4260
gtccctgcag gtggggtggg tcttgaataa cctatctcct gggctggatg ccctcccgtc 4320
ggctggcttg cctctgctcc aacgtcttgg aggagcagga tcagcgtgtg agtggagagg 4380
ggagtggttc aacagggaga tggtgggaca ggatccggca aagacggggg ccagggagtc 4440
cctctagagg tggtgagcag ggactccccc agacagggag cccttacatg aggaggggct 4500
cccagctcag gacatgggga gggagaggaa ttccatgact ttaccccacc aagtccaaaa 4560
gcaaatgcac actccctcgt gggcagcgca gagcagggct gggctggtat caccgtttca 4620
ccggagaggg cagagtcaca gccactgctc ccttcactgg cctgcactgc cccgcggtcc 4680
cagctgtagt gttggtgaca atgccagaca tcctcgcagg aaggctctcc atggggaagg 4740
tacacgcact tacatcctca ttcatcaact cctaatgctg ggtactcagg acggcgaagg 4800
aaaaccacag accccactct catgaagttc ttagcccagc aggggagaag gacaagtcaa 4860
ccacagcaca ggatgcgggg gaagtggcag ccaccccccg gcctggggag cgaagagcaa 4920
gggctgccgg gccccactgt tcgcctgcgt ctctgtaaga gctgacagat gccgtctcac 4980
ctcgtcatcc caacagcccc atgaggggtt cgttatcatc cgtgtggacg tgaggcacag 5040
acggcaagag ctgtcaggcg gggacccaaa agagaagtcc cagctgagct ggtgggaggg 5100
gcagggggtc ggtgcatgca gagggcagca ccgtgtgtcg gggatgttga gtccatggtg 5160
gctgccgtgt gcgggacgat gtgtgtgagg gaagaggctg aagcagctgg cctgcagggg 5220
acaggtcttc tctgtggtat catgtgggct ttgcaggggg cagaggggag cccagaaggg 5280
attcgagcac gagagggtca tgatcaactc tggcttcatt tggtgaagtt ttacagcaag 5340
gaaccaggta aggggtttgg tggcccgtgt caggagaggc agcaagggag gacaagagca 5400
ggaggtggca ggagctattg aggagggact ggcgtgggac tgaagctggc ggggggcaag 5460
agaggacgag cagcaggtca aggtgacccc aagtttccgg cttgagcaac tagacgatcc 5520
gtgaactggg ggaagccagg aatggggagg ctctggggga gggtgggaga cggttctgtt 5580
ttaggcacca ggagtctgag gtgatgtga gac atc cag atg gag ctg gcc agg 5633
Asp Ile Gln Met Glu Leu Ala Arg
1 5
agg cag ctg ggt gta cag ttc tgt cgc ccg gga gaa ctg cag gct aag 5681
Arg Gln Leu Gly Val Gln Phe Cys Arg Pro Gly Glu Leu Gln Ala Lys
10 15 20
gat ggg gga gtg gga gtt atc tgc aca gac acg gcg att aaa gcc atg 5729
Asp Gly Gly Val Gly Val Ile Cys Thr Asp Thr Ala Ile Lys Ala Met
25 30 35 40
gag tgg gcg agg ccg ctc ggg gag aag ggc act ggg agg gag ctc gga 5777
Glu Trp Ala Arg Pro Leu Gly Glu Lys Gly Thr Gly Arg Glu Leu Gly
45 50 55
ggc aca cct aca ttt atg gaa cag gca gct gga gag ggg caa gca aac 5825
Gly Thr Pro Thr Phe Met Glu Gln Ala Ala Gly Glu Gly Gln Ala Asn
60 65 70
cag gag cgc atg ggg tca tct aag ctg agg gac ttg gga att tcg ggt 5873
Gln Glu Arg Met Gly Ser Ser Lys Leu Arg Asp Leu Gly Ile Ser Gly
75 80 85
ggg agt ggc cag cca tgc cag agg ctg cag cca ctt ctt gca agg cag 5921
Gly Ser Gly Gln Pro Cys Gln Arg Leu Gln Pro Leu Leu Ala Arg Gln
90 95 100
cca tgg aaa gag ttc ttc aga ggg ttt cgg tgc tgg ggt ggg tgg aag 5969
Pro Trp Lys Glu Phe Phe Arg Gly Phe Arg Cys Trp Gly Gly Trp Lys
105 110 115 120
cca gga tgc agt ggg gac tcc agg atg gcc tct gcc cag gcc cgt gct 6017
Pro Gly Cys Ser Gly Asp Ser Arg Met Ala Ser Ala Gln Ala Arg Ala
125 130 135
cat ctg ttt gcc ctt ggt tgt ctc cct gct cag ttg caa gct cct tgg 6065
His Leu Phe Ala Leu Gly Cys Leu Pro Ala Gln Leu Gln Ala Pro Trp
140 145 150
ggc cgg ggc ttg ccc tac ctc tgg gcc cag ggc ccc aca tgg aga ctg 6113
Gly Arg Gly Leu Pro Tyr Leu Trp Ala Gln Gly Pro Thr Trp Arg Leu
155 160 165
gaa caa gta ggc tgc ttg gct gtg taaatgaatg gacaattacc ttgtgcccac 6167
Glu Gln Val Gly Cys Leu Ala Val
170 175
aggcaacaag aggaagacag ctctggccac ggcctcggtg ccagccaaga ctgggttctc 6227
cccaaaactc cacccccttt ccttgctgat ctgtcacggg acaccgaggg gcacgagagg 6287
aacgtccggc tgtcccgtcc tcctccagga gggaagtgga gctctcagag ccccctgggg 6347
tccttcctac ctgtattagg gatcatcagc cggcccccga ggaagttgaa ggtcccatag 6407
gtcatgttgc tggtgcctcg gggcagggag cggaagtagt tctgggtgga gaggcgggag 6467
acgaactcct cggcctcaga ggtgggagag ctgtggtgca gtgtgtggcg gccgc 6522
<210> 12
<211> 5155
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (745)..(1179)
<400> 12
gaaagtatta taatagaata aagtaattcg gccgggcgtg gtggctcaca cctgtaatcc 60
caggactttg ggaggtggag gcgggcggat cacgaggtca agagatcgag cccatcctgg 120
ccaacatggt gaaaccccat ctctactaaa gatacaaaaa ttagctgggc gtggtggcgt 180
gcgcccagtc cgagtagctg ggactactcg ggaggctgag gcaggagaat cgcttggacc 240
caggaggtgg agtttgcagt gagccgaggt cccgctactg cactccagcc tggcgacaga 300
tcgagactcc atctcaaaaa taaaaataaa acaaaataat aataattcag aagctgtctg 360
aggggacatg acttatctga caggcttaga aacaaaataa gcctttttgt ccagaagatc 420
tgaaataaag ctgtgtggtg accgccatca gtttattcaa aagattactc agggactact 480
ccccattttt tcaagacaga attctgtcat acctgagaag aaatgatcaa gtaagttcag 540
ataataggat ccaaaggaaa caaaagataa gacatttagt ataacaacgt aaatgtagag 600
aacaaagaca ttgtctcaag tccctttctt ttgggggaac atcttcctgt taacaacaac 660
aacaacaaca aaaactaaaa aagcccacat gtgtttcaag ttaacctatt gttattattg 720
gaataattat gaacaaggct ttaa aaa tta aaa tca agg tca aac aaa aag 771
Lys Leu Lys Ser Arg Ser Asn Lys Lys
1 5
aaa aat agt gtt aat agg tgg cca tat aac ctt cca agt ttt tgt tta 819
Lys Asn Ser Val Asn Arg Trp Pro Tyr Asn Leu Pro Ser Phe Cys Leu
10 15 20 25
ctg att tca gta atc gtc tgc ttc tct gta ttg ata gtc atg ttt tat 867
Leu Ile Ser Val Ile Val Cys Phe Ser Val Leu Ile Val Met Phe Tyr
30 35 40
aat aat gat tat att ggc caa cct gat ttt tct att gac agt ata tct 915
Asn Asn Asp Tyr Ile Gly Gln Pro Asp Phe Ser Ile Asp Ser Ile Ser
45 50 55
gaa ata ttt cct gtg tcc tta agt att ttt cac agc ttc cca ttt tac 963
Glu Ile Phe Pro Val Ser Leu Ser Ile Phe His Ser Phe Pro Phe Tyr
60 65 70
atg tac gat ggt ctg cca tat cta gta agc ttt gtc aga tcc aaa ata 1011
Met Tyr Asp Gly Leu Pro Tyr Leu Val Ser Phe Val Arg Ser Lys Ile
75 80 85
cta aat gct gcc ttt ctg tcc cca gtg tct ttg ctc agt tca aaa gcc 1059
Leu Asn Ala Ala Phe Leu Ser Pro Val Ser Leu Leu Ser Ser Lys Ala
90 95 100 105
aca gaa gaa gtt ctg gac gat gtg ggc ctt caa tct tct aag atg gga 1107
Thr Glu Glu Val Leu Asp Asp Val Gly Leu Gln Ser Ser Lys Met Gly
110 115 120
ata gaa aga aga gga tct acc cag acc agg acc ctg ctg ccc aat ctc 1155
Ile Glu Arg Arg Gly Ser Thr Gln Thr Arg Thr Leu Leu Pro Asn Leu
125 130 135
agg tct aca tct gcc tca cca aaa taatacagtg gccagaaacc tccctgctaa 1209
Arg Ser Thr Ser Ala Ser Pro Lys
140 145
catcatagtt tgtctgtgat gcagaaatgc ctctgttttg ttgttgtttt gtttttgtag 1269
agacagggtc ttgctctgtc acctaggttg gagtgcactg gcacaatcat agttcactgc 1329
agccttgaac tcctggactc aagcgatcct accgcctcag cctcccaagt agctgggatt 1389
atgggcatgc gccatcatgc ctagctgatt tttattttgt agagactggg tctcactatg 1449
ttgcccaggc tggccttgaa ctcttgacct caagcgatct tcctgcctca gcctccccag 1509
ttgctgggat tacaggcgtg agccatcgtg cctggcccca gaaatgcttc tcttctatga 1569
ctttgttcag tggagacctc tgtgatttgg gggaggaaat agatttatcc agaaccccaa 1629
tttgcccgaa ttatgtatga tgggaaatag tttgagacct ggaatggaag agcatcacag 1689
caatattaag taacttaagg tctgtcagga acactaatgc aaaactggtc tcaaggtctt 1749
tcttcttcct ccagctctgg ctttctggat ctgtgcttct ctggtagagg aaccctgagg 1809
actgatctac tcttgcagtt ctaaaaccat ggcccatgta agttcacgtt tttctccttg 1869
ttgaaatatg tgtgtgttgg ccaggcacgg tggctcaggc ctgtcatccc aacactttgg 1929
gaggccaagg cgggtggatc acctgaggtc aggagtttga caccagcctg gccaacatgg 1989
tgaaaccctg tctctactaa aaatacaaaa acagcccagc atgatggcag atgcctgtaa 2049
tcccagctac tcaggaggct gaggcaggag aatcgtttga acctgggagg cagaggttgt 2109
agtgagccaa gattgtgcca ctacactcca gcctggacaa gagcgaaact tggtctcaaa 2169
aaaaaaaaaa agaaatatgt gtgtgttttt aggtcatgaa ggtattttta ttaattttcc 2229
tgaaatttcc taccaggaga ggctcatcaa ataacatgct tccagctctt attcttccag 2289
ttcccagcat ccacccaata tgtcccatta taaacctcct ttgttcatgt aattactgag 2349
caaattattc agcatgtctt aagagccctc tatcaagggc agtggtggag ttaagtcctt 2409
ttaagtaaag atttgtgctt tttgggtcaa gtctagtttg gtaagataga gtcaatggtt 2469
gtatttgacc atccataaca aaacttaaat tatcaccgac tggagatttc caggaagtga 2529
agatgaaaat ttggtattga atgatcaaac tcagtcattc attgggtgaa atctcatgct 2589
ttgtgtggtt cagtaaatga ctggggactt gcaaacttgg aggttatgga ttttggcaag 2649
aaacccctca ggaataaata tcaggactat tctgatgttg gatggaacct actaaattta 2709
gatgacatgt atataattca ttattcttcc ttagtattct ttaactcaat gcatttgtta 2769
tctattatta cataacaaat tagaccaaaa cttaatggct taagacagca aacatttatc 2829
tcacagtttc tttggatcag aaatccagac atggcttagc ttgcctctct ggctctcgat 2889
ctcttacaag gagatccagg ttttggtcag gactacagtc atcttaaggc ttaacttggg 2949
gtggatcttt ttctaagctc actcaactag ttgttaacag gatccagttt ctcacaggct 3009
tcagactgac agtcttagtt cctcattcgc tgttgcctgg aagctgctct tggtttattg 3069
tcatgtgggt ctgtccatag aatagcttaa aacatggtgg ctgtgtttta atcagtgagc 3129
aaatgagaca tcaagagagg gagagaaata cagaaaaagt gctgaagtaa catttcagca 3189
ctgtagctat tctgcttatt aagagcaagt cacttggttc agcccacgtg caagggaagg 3249
agattacaca agcatgtaat actatcggat ggggatcttt ggtaacctat atagcaagcc 3309
aggtgctcag tgagagatgg tagagagcca caaaagagaa tgtcattata agtagcaagt 3369
gaacacagaa tcctagaact ggggaagacc ttaaaaatgt tgtgtatcct aactgcattg 3429
taaacaagta agtctgatat tgccctaaga tataattctc atgtgccccc aaaatacatg 3489
gatttgtcaa aacaagagaa aaatgcttat ggtgagatct gatgtaatga agtatttaat 3549
gcttgagaga attcttaatt ttatgtaaac cctattttct aaaatcaagg ccaggtgctg 3609
taatccctgt aattccaggt gttcacacct gtaatctcac cacttaggaa ggctgatatg 3669
ggagcattgc ttgagaccag gagttcaaga ccagcctggc caacacagca agaccctgtc 3729
tctattaaaa ataataataa tcaagacttt atacttcatt tggggaaaat atctgattaa 3789
attttttgct cctcttaaaa gcaaaaattt tccccagcaa ctcatatgta tttaaaaatt 3849
tatatggcag ataccaatca gacattatgc ttatattcac tgtttacttt cataggcaaa 3909
cattattatt ttatttattc ccaaatggtg caagtatttt agagtatgcc acatagtttt 3969
ctatatgcca gcagtctttt tcaaaaatac gtatacattc cacttcagtc cccctcatac 4029
tgtaaatttg tagacaacat gcttgttaca ttgctctctt cctgcaatac agtccttatt 4089
tcccctccat ccatcttgaa agtattcctt taatttctcc agtaacacta tgggtttgcc 4149
atttcgggca tcaatgacat tcagggatgt ggccatagac ttctcgcagc aggagtggga 4209
atgtctcgac catgttcaga agaccttgta ttgggatgtg atgatggaga actgtagtaa 4269
cttggtatgt tttctgcctc agacaattta ggataacttt gaatctgctc tctaggatga 4329
tttttcattg tgatttttag ggctgtcttt caagagacta gttaaatttc ttttcccttt 4389
tccccaaaga agtggtttgg gctttgttgg gtaggaagtg acagcttcac tagacatact 4449
catcttccct gtccctcagt gatgttcaaa cttttctctg gtcatagtta cagttgtctt 4509
tcttctttta taaggaaggg ctagattagg attcagcagc ctatattata taaccattgt 4569
aaaaggatcc aggtttccta tttctttatg ctccgaatta ctgtggatac tccattatag 4629
catctgatcc acttgcagat taccttgaca taggatcctg aattgctcta attatcaaaa 4689
gaacctttat aatgtatata gctttggatt agataaattg aattcattta aaactgaagc 4749
aaacacattc tatttcctaa gcaaatttta acttattcgt ggttgaaagt tgtggtttaa 4809
agttgtcatc tataaacatt acattttaaa tggaatgaat atcctataaa tatcttttct 4869
tgtatactct taacaaaaca acaatatcaa taataacctt agtagtgaac actgtgtcta 4929
ttaaaggtag tgttctaagc atttcacata tatgtctcaa ccaatttaat ccttattaaa 4989
ttcgacaatc ttattatatc ctcattttcc agttgagaat attaaagccc agtggattta 5049
aatgacttgc tgaaggtcat gcacagctgg gaggtggtaa aggcaggaat tgataccagg 5109
ctctagagtc tgtattttca accatggtat tatgtgtact ctcagg 5155
<210> 13
<211> 5344
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (113)..(2017)
<400> 13
ctccacatcg cggctccggc acctgaaggg acgcgggcgg gcgcgggcag ctccgaccgg 60
cggcggcggg gcgggacagg cagcccggcg gcctccgatg gccccgccgt ga gag gcc 118
Glu Ala
1
gga ccc gcg gcg ggg acc agc agc ggt cta gag gag tcc cag gag cag 166
Gly Pro Ala Ala Gly Thr Ser Ser Gly Leu Glu Glu Ser Gln Glu Gln
5 10 15
cca gga cag gcg gaa gca gtg gct gcc atg gag gag gac aag ctc tta 214
Pro Gly Gln Ala Glu Ala Val Ala Ala Met Glu Glu Asp Lys Leu Leu
20 25 30
tct gca gtg cct gag gaa ggc gat gcc acc cgt gac ccc ggt cca gag 262
Ser Ala Val Pro Glu Glu Gly Asp Ala Thr Arg Asp Pro Gly Pro Glu
35 40 45 50
cct gaa gag gag cca ggg gtc cgg aat ggg atg gcc agt gag ggc ctg 310
Pro Glu Glu Glu Pro Gly Val Arg Asn Gly Met Ala Ser Glu Gly Leu
55 60 65
aac agc agc ctc tgc agc cca ggg cac gag cga agg ggc acc cca gcg 358
Asn Ser Ser Leu Cys Ser Pro Gly His Glu Arg Arg Gly Thr Pro Ala
70 75 80
gac act gag gaa ccc acg aag gac cca gat gtg gcc ttc cat ggc ctc 406
Asp Thr Glu Glu Pro Thr Lys Asp Pro Asp Val Ala Phe His Gly Leu
85 90 95
agc ctt ggc ctc tct ctc acc aat ggc cta gcc ctg ggg cca gac ttg 454
Ser Leu Gly Leu Ser Leu Thr Asn Gly Leu Ala Leu Gly Pro Asp Leu
100 105 110
aac att ctg gaa gat tca gcg gag tcc agg ccc tgg agg gct ggc gtg 502
Asn Ile Leu Glu Asp Ser Ala Glu Ser Arg Pro Trp Arg Ala Gly Val
115 120 125 130
ctg gca gag ggg gac aat gct tcc agg agc ctc tac cca gat gct gag 550
Leu Ala Glu Gly Asp Asn Ala Ser Arg Ser Leu Tyr Pro Asp Ala Glu
135 140 145
gac cct cag ctg ggg ttg gat ggt ccc ggg gag cca gat gtg cgg gat 598
Asp Pro Gln Leu Gly Leu Asp Gly Pro Gly Glu Pro Asp Val Arg Asp
150 155 160
ggc ttc agc gcc acg ttt gag aag att ctg gag tca gag ctg ctg cgg 646
Gly Phe Ser Ala Thr Phe Glu Lys Ile Leu Glu Ser Glu Leu Leu Arg
165 170 175
ggc acc cag tac agc agc ctc gac tcc cta gac ggg ctg agc ctc acg 694
Gly Thr Gln Tyr Ser Ser Leu Asp Ser Leu Asp Gly Leu Ser Leu Thr
180 185 190
gat gag agc gac agc tgc gtc agc ttc gag gcc ccc ctc aca ccc ctc 742
Asp Glu Ser Asp Ser Cys Val Ser Phe Glu Ala Pro Leu Thr Pro Leu
195 200 205 210
atc cag cag cgg gcc cgt gac agc cct gag cca ggg gct ggg ttg ggc 790
Ile Gln Gln Arg Ala Arg Asp Ser Pro Glu Pro Gly Ala Gly Leu Gly
215 220 225
att ggg gac atg gcg ttt gag ggg gac atg ggg gca gct ggt ggt gat 838
Ile Gly Asp Met Ala Phe Glu Gly Asp Met Gly Ala Ala Gly Gly Asp
230 235 240
ggg gag ctg ggc agc ccc ctg cgg cgc tcc atc tcc agc agc cgc tct 886
Gly Glu Leu Gly Ser Pro Leu Arg Arg Ser Ile Ser Ser Ser Arg Ser
245 250 255
gag aat gtc ctg agc cgc ctg tct ctc atg gcc atg ccc aat gga ttc 934
Glu Asn Val Leu Ser Arg Leu Ser Leu Met Ala Met Pro Asn Gly Phe
260 265 270
cat gaa gat ggc cct cag ggc cca ggg ggg gat gag gat gat gat gag 982
His Glu Asp Gly Pro Gln Gly Pro Gly Gly Asp Glu Asp Asp Asp Glu
275 280 285 290
gag gac acg gac aag ttg ctg aac tca gcc agt gac ccc agc ctg aag 1030
Glu Asp Thr Asp Lys Leu Leu Asn Ser Ala Ser Asp Pro Ser Leu Lys
295 300 305
gat ggc ctg tca gac tca gac tct gag ctc agc agc tcg gag ggg ttg 1078
Asp Gly Leu Ser Asp Ser Asp Ser Glu Leu Ser Ser Ser Glu Gly Leu
310 315 320
gag cct ggt agt gca gac cct ctg gcc aac ggg tgc cag ggg gtc agt 1126
Glu Pro Gly Ser Ala Asp Pro Leu Ala Asn Gly Cys Gln Gly Val Ser
325 330 335
gaa gct gct cat cgg ctg gca cgc cgt ctc tac cac ctc gag ggc ttc 1174
Glu Ala Ala His Arg Leu Ala Arg Arg Leu Tyr His Leu Glu Gly Phe
340 345 350
cag cgc tgt gat gtg gcc cgg cag ctg ggc aag aac aac gag ttt agc 1222
Gln Arg Cys Asp Val Ala Arg Gln Leu Gly Lys Asn Asn Glu Phe Ser
355 360 365 370
agg ctg gtg gcc ggg gag tac ctc agt ttc ttc gac ttc tcg ggc ttg 1270
Arg Leu Val Ala Gly Glu Tyr Leu Ser Phe Phe Asp Phe Ser Gly Leu
375 380 385
act ctg gac gga gca ctc aga aca ttc ttg aag gcc ttc ccg ctg atg 1318
Thr Leu Asp Gly Ala Leu Arg Thr Phe Leu Lys Ala Phe Pro Leu Met
390 395 400
ggg gag aca caa gag cgt gag cgg gtc ctc aca cac ttc tcc cgc cgg 1366
Gly Glu Thr Gln Glu Arg Glu Arg Val Leu Thr His Phe Ser Arg Arg
405 410 415
tac tgc cag tgc aac cct gat gac agc act tcg gaa gat ggg atc cac 1414
Tyr Cys Gln Cys Asn Pro Asp Asp Ser Thr Ser Glu Asp Gly Ile His
420 425 430
acg ctc acc tgt gcc ctg atg ctg ctc aac acg gac ctg cac ggc cac 1462
Thr Leu Thr Cys Ala Leu Met Leu Leu Asn Thr Asp Leu His Gly His
435 440 445 450
aac att ggc aaa aag atg tcc tgt cag caa ttc att gcc aac ttg gac 1510
Asn Ile Gly Lys Lys Met Ser Cys Gln Gln Phe Ile Ala Asn Leu Asp
455 460 465
cag ctg aat gat ggc caa gac ttt gcc aaa gac ctg ctg aag acc ctt 1558
Gln Leu Asn Asp Gly Gln Asp Phe Ala Lys Asp Leu Leu Lys Thr Leu
470 475 480
tac aac tcc atc aag aat gaa aag ctg gaa tgg gcc att gat gag gat 1606
Tyr Asn Ser Ile Lys Asn Glu Lys Leu Glu Trp Ala Ile Asp Glu Asp
485 490 495
gag ctg agg aaa tcc ctg tct gag ctg gtg gat gac aag ttc ggg aca 1654
Glu Leu Arg Lys Ser Leu Ser Glu Leu Val Asp Asp Lys Phe Gly Thr
500 505 510
ggc acg aag aag gtg acg cga atc ctg gat ggt ggc aac ccc ttc ctg 1702
Gly Thr Lys Lys Val Thr Arg Ile Leu Asp Gly Gly Asn Pro Phe Leu
515 520 525 530
gat gtc cca cag gcg ctc agt gcc acc acc tac aag cac ggc gtc ctg 1750
Asp Val Pro Gln Ala Leu Ser Ala Thr Thr Tyr Lys His Gly Val Leu
535 540 545
acc cgg aag act cac gct gac atg gat ggc aag agg acg ccc cgt ggg 1798
Thr Arg Lys Thr His Ala Asp Met Asp Gly Lys Arg Thr Pro Arg Gly
550 555 560
agg cgt ggc tgg aag aaa ttc tac gca gtg ctc aaa ggg acc atc ctg 1846
Arg Arg Gly Trp Lys Lys Phe Tyr Ala Val Leu Lys Gly Thr Ile Leu
565 570 575
tac ctg cag aag gat gag tac agg cct gac aaa gct cta tcg gag ggt 1894
Tyr Leu Gln Lys Asp Glu Tyr Arg Pro Asp Lys Ala Leu Ser Glu Gly
580 585 590
gac ctg aag aac gcc att cgc gtg cat cac gct ctg gcc acc agg gcc 1942
Asp Leu Lys Asn Ala Ile Arg Val His His Ala Leu Ala Thr Arg Ala
595 600 605 610
tct gac tac agc aag aag tcc aac gtg ctg aag ctt aag aca gcc gac 1990
Ser Asp Tyr Ser Lys Lys Ser Asn Val Leu Lys Leu Lys Thr Ala Asp
615 620 625
tgg agg gta ttc ctc ttc cag gca ccg tgagtaggag ctggagccct 2037
Trp Arg Val Phe Leu Phe Gln Ala Pro
630 635
tcactcccac ctggggccca gggccacagt gacccggcac acaacccctc tccttcccgt 2097
gagtcagcaa cagagcatgt gtatccacat gacacagacc gacagctggg tccctccaaa 2157
gcagtggttc ccaatgtgtg gttcccctca gcatcatttg cagtctctct ggccccacct 2217
tagacctact gaacccaaaa ctctgggatg gctccagcaa ctgtgtttaa caagccctgc 2277
aggggctgct gagtgcgctg gagtctgcaa tgtggggcag gtggcatctt tacgaatcat 2337
ccttcacttt aagtgtggga aagatgcacc ctccaggtaa cctgtaaggc tgtcctacca 2397
cctctcgtcc tcctgtgact tgagattggg gtgcacatgt cacatgagct tgcaaggagg 2457
caggctagtg ggtgaggcct caggatcaca gtgccagtgc tgggcaggca gcgggggagg 2517
gcaggagggc tgtgggggtc acccagttgc tattgcctct gatgatcccc agtccacagt 2577
cactcccacc tattcactcc agggccacta gctgtggacc tgggaagttt gatcttcaca 2637
tgagaattcg ggtgtgcctg gaagctggga cagaaaggga gctccgtgca tagctctggc 2697
caggcccatc cagcccccat gtgtcctccc tctccccttt atccacccat ctagaggtac 2757
tggtgccttc tcagtgccag gcccagtgct gggtacggga tgctgagtag gggacaggga 2817
gtcctgtgtg agaggccggc accctctccc cctcctgtcc ccaggagcaa ggaagaaatg 2877
ctgtcctgga tcctcaggat caacctggtg gcagccatct tctctgcccc ggccttccca 2937
gccgctgtca gctccatgaa gaagttctgt cggcccctgc tgccctcctg caccacccgc 2997
ctctgccagg aggagcaact gcggtctcat gagaataagt tgaggcagct gactgcggag 3057
ctggccgaac acaggtgtca cccagtcgag aggggcatca agtccaagga ggccgaggag 3117
taccggttga aggagcacta tctcaccttc gagaaaagcc gttatgagac ctatatccac 3177
ctcctggcta tgaaaatcaa agtgggctca gatgatctgg agcggattga ggcccggctg 3237
gccactctgg aaggggatga cccttctctc cggaagacac attcaagccc tgccctcagc 3297
cagggccatg tgactggcag caaaaccaca aaggatgcca ctgggcctga tacttagctg 3357
acatggattt gcagacccca gggtgggcag atgtctccag tggggtcagt gagcacaatt 3417
ccagccaggg gccacttgga ccaagctcca gtcagttgat gggcagctag aggggtgcag 3477
aaagcctgtg ggcccaggag atggagatgc cgtttgtggc gttgatctcc ttgcgtcctt 3537
gggcatctcc gggcatcaga ccctctccct ggcccttgtt ttcctctcca ccatggagcc 3597
tcattttgta ggccagttgt gtgcatgctc tagacaccac ctcgctggag aagctggaag 3657
ggctgttgtc ttcccaggtc tttctcttct catcaagctc ctctcctcat cttttttgtg 3717
tgtgagggca ggtcttgact ctaggtctca gctggaaccc caccctttct cctcctcctt 3777
cctctgagtt gaccagcagc aggtctgccg accaccagca ccatcctctc ctcccagcag 3837
cctccagaac catgcccagg tctcctgcct cacatcacaa taatctggga cccaggcttg 3897
tgccctttca gtgtaaagct gactccatca catgtgcatc cacttctttt catccattga 3957
gatcacactg cctccttttt atacagacac aaatatacat ctataagaat aatatataca 4017
taaggaaccc ctgaaagatg gttttggaac tggaatcagt tagaggatga aatcagataa 4077
aggaaaagcc tattttggag cttcccctgt taggaaggat ggctgcacct ggccccctgg 4137
cattcctgac gctctaggag ggaaggggga ggcagtgctg gcctcccttg ccctgttttt 4197
ccctcttcca gctgacctgt gacttatact gctcttaccg atgatacttt tggaaaaaat 4257
agagcgtgta tgcaccgccc cgtttgtccc atggatatcc tggggtgtga gtcggatggg 4317
accacggccc tgtttatatt tgggtcttta tgttggtgct gccaggtctc tgagctccag 4377
aggtggcctc ttggacagat ctactgctat aggaataaaa gacactctgt ctcgcaaatg 4437
gctgcttgtc aacaagccca aagatgcttg tcggaggacg gttatggaag cccttaattc 4497
ttggttgtgg gaaaaggtgg aatgacaagt tattgattgt ttttctgtcg ctatttcttt 4557
catttgtcta gtgaatcaga aaggcttagc caaggccaca tctgggaaga gtggagaaat 4617
ttgccacttg acgatcacgg attagctagc acctttaagc cctgcatttc tccaactgac 4677
aagtgggtgg gggtgatggc acattcagtg tggctatgaa gagcgaatcc tctctattgt 4737
ttaaatagat tactgtagtt tggccaggaa tttggcgtca gtggtaacac acttagttaa 4797
taaaataagc caggcttgca actaagtatc taactttaca ggcccactca catttgaggc 4857
aaggggctat tgagtatgtg gagagatgta gtgatttaaa ttcagattat ttaagttgga 4917
tcagctgaag tgtgttttag acccaaacca tctggcccct tcgttttgct cagaggaagt 4977
aaatgttcac ttaaatgaaa ttgaaaacgc catgtggcac cacaaaagag ctctctgtac 5037
tttccccatg ctgcctcaaa agttctgtga gtttcggggt cagtgtccca cccttcactt 5097
cccgagggcg ggtgagtgga gagcagagcc aggagctctg gcagctgtgg acagatgtgc 5157
ttcctgagca tgggttgtgc ctcccatcag taaaaaaatg tttagttcac ttccttaatt 5217
gtataattat ttatttgtaa attatataca tgtactactg tactaaaata ttatgtacat 5277
tataaaacat acacaaaaat agaaatttaa aaaagatgag atgaaaataa atctaagtca 5337
aagttct 5344
<210> 14
<211> 5432
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(1188)
<400> 14
cag gtc tcc ctg gac aag cca ctg tcc tca gct gcc cac ttg gac gat 48
Gln Val Ser Leu Asp Lys Pro Leu Ser Ser Ala Ala His Leu Asp Asp
1 5 10 15
gca gcc aag atg cct tct gca tcc agt ggt gaa gaa gca gac gct ggc 96
Ala Ala Lys Met Pro Ser Ala Ser Ser Gly Glu Glu Ala Asp Ala Gly
20 25 30
agc ctc ctg ccc acc acc aat gag ctc tcc caa gcc tta gct ggg gct 144
Ser Leu Leu Pro Thr Thr Asn Glu Leu Ser Gln Ala Leu Ala Gly Ala
35 40 45
gac tcc ctg gac agt cct ccc aga cct ctg gag aga tcc gtg ggc cag 192
Asp Ser Leu Asp Ser Pro Pro Arg Pro Leu Glu Arg Ser Val Gly Gln
50 55 60
ctc ccc agc ccc cca ctg ctg ccc act ccg cca ccc aag gca agc tcc 240
Leu Pro Ser Pro Pro Leu Leu Pro Thr Pro Pro Pro Lys Ala Ser Ser
65 70 75 80
aaa acc aca aaa aat gtc aca ggc caa gcc aca ctc ttc caa gcc tcc 288
Lys Thr Thr Lys Asn Val Thr Gly Gln Ala Thr Leu Phe Gln Ala Ser
85 90 95
agc atg aag agt gcc gac cct tcc ctc cgg ggc cag ctc tcc aca ccc 336
Ser Met Lys Ser Ala Asp Pro Ser Leu Arg Gly Gln Leu Ser Thr Pro
100 105 110
acg ggg tct ccg cat ctc acc acg gtc cac cgg cct ctt ccc cca agc 384
Thr Gly Ser Pro His Leu Thr Thr Val His Arg Pro Leu Pro Pro Ser
115 120 125
cgc gtc att gag gag ctg cac agg gcg ctg gcc acg aag cac cgc cag 432
Arg Val Ile Glu Glu Leu His Arg Ala Leu Ala Thr Lys His Arg Gln
130 135 140
gac agt ttt caa gga aga gaa agt aaa ggg tct cca aag aag cgg ctg 480
Asp Ser Phe Gln Gly Arg Glu Ser Lys Gly Ser Pro Lys Lys Arg Leu
145 150 155 160
gat gtc cgt ctg tcg aga acg tcc agc gtg gag cgg ggc aag gag agg 528
Asp Val Arg Leu Ser Arg Thr Ser Ser Val Glu Arg Gly Lys Glu Arg
165 170 175
gag gag gct tgg agc ttt gac ggg gca ttg gag aac aag cga act gcc 576
Glu Glu Ala Trp Ser Phe Asp Gly Ala Leu Glu Asn Lys Arg Thr Ala
180 185 190
gct aag gaa tct gag gag aac aag gag aac ctg atc ata aat tct gaa 624
Ala Lys Glu Ser Glu Glu Asn Lys Glu Asn Leu Ile Ile Asn Ser Glu
195 200 205
ctc aaa gac gac ttg ctt ttg tat cag gac gag gag gcg ctg aac gac 672
Leu Lys Asp Asp Leu Leu Leu Tyr Gln Asp Glu Glu Ala Leu Asn Asp
210 215 220
tcc att att tct gga aca ctg cca cgg aaa tgc aag aag gag ctc ctg 720
Ser Ile Ile Ser Gly Thr Leu Pro Arg Lys Cys Lys Lys Glu Leu Leu
225 230 235 240
gcc gtg aag cta agg aac cgg cca agc aaa cag gaa cta gaa gac cgg 768
Ala Val Lys Leu Arg Asn Arg Pro Ser Lys Gln Glu Leu Glu Asp Arg
245 250 255
aac att ttc ccc aga agg act gat gaa gaa aga cag gag atc cgg cag 816
Asn Ile Phe Pro Arg Arg Thr Asp Glu Glu Arg Gln Glu Ile Arg Gln
260 265 270
cag atc gag atg aag ctt tcc aaa cgg ctg agc caa aga cct gcc gtg 864
Gln Ile Glu Met Lys Leu Ser Lys Arg Leu Ser Gln Arg Pro Ala Val
275 280 285
gaa gag ctg gag aga aga aat atc ttg aaa caa agg aat gat cag aca 912
Glu Glu Leu Glu Arg Arg Asn Ile Leu Lys Gln Arg Asn Asp Gln Thr
290 295 300
gag cag gaa gaa aga aga gaa atc aag caa aga ttg aca aga aag ctt 960
Glu Gln Glu Glu Arg Arg Glu Ile Lys Gln Arg Leu Thr Arg Lys Leu
305 310 315 320
aat cag aga ccc act gtt gat gaa tta aga gac aga aaa att ctg ata 1008
Asn Gln Arg Pro Thr Val Asp Glu Leu Arg Asp Arg Lys Ile Leu Ile
325 330 335
cga ttc agt gat tac gtg gaa gta gca aaa gcg cag gac tat gac agg 1056
Arg Phe Ser Asp Tyr Val Glu Val Ala Lys Ala Gln Asp Tyr Asp Arg
340 345 350
agg gca gac aaa ccc tgg acg aga ctg tca gca gca gat aag gca gca 1104
Arg Ala Asp Lys Pro Trp Thr Arg Leu Ser Ala Ala Asp Lys Ala Ala
355 360 365
att cgt aaa gaa tta aat gag tac aaa agt aat gaa atg gag gta cat 1152
Ile Arg Lys Glu Leu Asn Glu Tyr Lys Ser Asn Glu Met Glu Val His
370 375 380
gca tca agc aag cac ttg aca aga ttc cac agg cca tagagatttt 1198
Ala Ser Ser Lys His Leu Thr Arg Phe His Arg Pro
385 390 395
cttctgagaa gaatttgtgt ttaatttttt gataccaaca ctgaacattc atcagggaac 1258
tttcctgaag ttcagctcaa gactacccta cctgctgtgt ttgtgagaag agtaggatca 1318
cacacacagg tgcaatcttg accacactta cctgcaagag gagtaaccag aggacacact 1378
tccttccttc tttggtgtct gaggagtgtg aactgttggg gtcagttaag acccaacata 1438
actctatcag aagaaaactg ttgtttgcct ttcaaccttg ttttacagtt ctgcagtgta 1498
atggaggacg ggcaacgtgc atgtgcaggc tcaccactcc caggcctctg acatgaggga 1558
catgtgacag tgtcattcag tattatgttc aaaagacatt tttatcctga tcataattaa 1618
tttgaaaact ctttaagttc atgttataca agatgattta ctgtattata cttttccttt 1678
tttatataat gtctaacaaa aaatacagct gcaacatttt gattcctgtt aattttgttc 1738
tttaattaaa tgactactta ttgcaggaaa ttaacccagg ctttacattt tcttgtggtt 1798
gggatgagtg gtactttaga acagggatct gtgaaacaag tcaattttta cttgtgaacg 1858
atcagaatcc aaacacatga atgagaattc caacttttta tgtgtttggt ttgtgcctta 1918
gaaaatcaaa ttgagtgcct ttaaccaaga aataagcaat aatttgttga aaaattctat 1978
caaaatgaaa agtatgcctc tttagagtat ttttaagctc acctctgcaa acactgttaa 2038
agaaaggaaa tttaccccag cacaaaggtg aaaatcttgg cacaaagggg gaaactaaca 2098
atacatgttc cctgaaaaat atttttaatt ggctttcaaa taagtgtcct tacagagatg 2158
agccttccct cctccttccc tcctggtata ctggtgaggc aggaagcaca gtaacttgga 2218
gcatgacttt cagagcttgc agaacttgtg cacaaatcca agctctgtgc tattcaacaa 2278
gttacttgtt ttttctatag tatggtcata tctctctcac acaacattgc tgtagtatct 2338
aaataagaca aagtagctaa agtgctggca cattgctgag gacgttctgg gaatccaata 2398
aatgttagct actgatatgg ttgccaggta gcacctggag caactgagtg tcctctatga 2458
cttcttgcca catatgctac ctcattgctt tccactattt ttccttttct acttaagact 2518
gctactgggt cagacgccca cagaaactgc caagtcttta tgttagcact ttgtccaaag 2578
tgacttatga tcagtcctag agactagcaa aacacactat tcattgaatt atagtttttt 2638
ttaataagaa attgttatgg tagaaaaatt ccagctctac ctcttagctg ctctgtaacc 2698
ttctgtgaat cagttgactt ctctgagcct caatttgttt gtgaaatgta tctaaatcct 2758
ttctaattag aagtcagtta aatttaaagg atttgatata tatatatata tacattatat 2818
attttctgtt attcataacc cattatttgt agccaaaact ttacctcttt tctatcttaa 2878
gcaacatatt agaagctctc tgtatttaat attcaattta aaatattaat gggagcccaa 2938
attatgtaaa actaagattt caatttccct cacatagaaa tattaatata tttactatga 2998
aacagaaaag gaaaaggtgc tactgtttag aaaccacctt ggcccccaaa gtacaaaaga 3058
tcagtactgc ttaacaaaag aaccaggcag atcctatagt tagaaatgac ctggttagat 3118
gggacatata tttagttttt tccccaagtc ccataactag tggataataa aactaaaaag 3178
gtaaagtgca ggataaccaa aggagggttt ggactggttc agactcaaag gttctaagat 3238
atcctaaaac cagcagacat attcacttaa atcaaaattt gtggcaatag catctaactt 3298
ctaccattgt gtttttatta tgcacaaact attactctat acttacatat ttcatgttct 3358
tatcaggtat tataagggtt aagagataaa ctaaaattga atcaagtata tttactctcc 3418
acctgcacat cacaactatc aagttagagg aataaaatta aaaagcaatt cacaagtcat 3478
attacactga tgacatacat gtatatgtac acgcatttgc atatatttat aatttgaaga 3538
gaaagtttat atacttaaca gccaacagat ttagaaaaaa ctatttgtcc tactgaataa 3598
cagtatatat gaattatcag aattcttcat tactagtatt caataatgag ctttttatgc 3658
aataaattta caaatcattt gcccctattt tcagaaaaaa ttattaaagc aaccatgaat 3718
atttaaatca taagttctgt ttttaaggag ttcaaaacat ttttacaagc atgttagtaa 3778
tttagtgaga ggaaattaaa tatcttctag aaacatctag taacttttaa gctaatgtaa 3838
tagaggaaat actacattat atgactttgt aaaaagtatt tctaaaaaaa tttaaagggg 3898
gctgggcatg gtggctcatg cctgtaatcc cagcactttg ggaggccgag atgagtggat 3958
tgctttagtc caggagttcg acaccagcct gggcaacata gcgaaactcc atctctacaa 4018
aacaaaaaat tagctgggca tggtggtatg cgcctgtagt cccagctact agggaggctg 4078
aggtggcagg atcacttgag cccgggaggt taaggccata gtgagccgtg attgcaccac 4138
tgcactccaa ccagggcaac agagcaagac cctgtctcaa aaaatataaa aataaaataa 4198
agagactttg aaatcatatc acctaagtaa ttttttgaca attgaaatgg tggtgggagc 4258
ggggaggaag aacctaacat caagacatcc accttgtgga aagatatggt aattacaggc 4318
ctacatttat tatacattgg aagtaaagtc aaagttttta gctatgtacc tacagtcctt 4378
caacaacaaa ttcaagctca gaaatgcaaa atcctaattg tactattaaa actgctatta 4438
ctatatgctt ccgataccca tttgtctagt ttttttccag agaccttttt ctccattttt 4498
catatgaaaa catctcagaa gatattcgta agcaagtgaa catatgaaac tgaatatatg 4558
aaaaatattt tcaaaaagta gatttctata aatcccattt gcttatttag ctgcaaattt 4618
atttacagtg atgatactag gaacatacta ctagttcaaa caaaaaatgt ttgttcgaat 4678
gaacaacaga aagtactggt tagcagttcc atagttctat gaaaatagag cagtattttt 4738
aaagtaataa aaattttctt aaaaaacctt ttcaaagttt ttaatctcaa ctgtaaatct 4798
taaagatata aatggggaaa ttctaaaaat tcagttttct tactaagcaa tgggaggaaa 4858
gaaatctgga tatatggtac atatgacaat aatttaatgt ttcaagttgc agacatgtta 4918
catatgtgta ttattccttt ctaaatgcag tttattatcc cttctcaaaa atacatacac 4978
tgaatgtatt actacaattc aggtgtgttc cctaaaagac tggtctatac caccagtcca 5038
ttctctgctt tccagatagc ctacttctta attataatgg tgacacctga tgtggcagaa 5098
agcacaatct ccagaaccac ctatcatctt gcttcaattt ttacattcca agttagattt 5158
gtttgttgat attatttgct ggtattttat tttgctcaaa attctacttg tttattcata 5218
gcatatatga ttcttccata ctttaagttt taattagata agaaacttat acactaactg 5278
tatgtggttt aactgttaat aggtggtaac agaatggtta aagaactaac tccagcaact 5338
gtgaagctca acactaggca ctgaactcag agttggcaga atttgtcact gtagactaca 5398
aatttcatgt tgaattaaat gctgtatgaa aacc 5432
<210> 15
<211> 3727
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1522)..(3444)
<400> 15
gtgaatctcc agagccacac ccacttcaag ataatgggtc atttttgtgg ttttccatga 60
tgtctcaaag catgggtggt gataacctca gtagcttaga tactaatgaa gcagaaattg 120
aaccagaaaa catgagagaa aagttcttca gaagcttagc aaggttactg gaaaacaaaa 180
gtaataatac taagatattt tctaaagcaa agtactgtca gttgataaag gaagtgaaag 240
aagctaaagc taaggcgaaa aaggaatcag ttgactaccg tcgcttggct agatttgatg 300
ttatccttgt acaaggaaat gagaagctaa ttgaggctgt aaatggggaa acagataaaa 360
tacggtatta cttacacagt gaggacttat ttgacattct gcataataca catctcagca 420
ttggacatgg tggacgtact cgcatggaga aagagttaca agcgaaatac aagaacatca 480
caaaagaagt tataatgctg tatctgaccc tctgtaaacc atgccaacag aaaaattcaa 540
aactcaagaa ggttctaaca tcaaaatcaa ttaaggaagt tagttcaaga tgccaagtag 600
atcttataga catgcagttg aatcctgatg gggagtacag atttattttg cattatcaag 660
atctctgtac aaagttaact tttttgcggt cattaaagtc taaaaggcct acggaagttg 720
cacatgctct tttagatata tttacaatta ttggagcacc cagtgtccta caatctgaca 780
atgggaggga attttcaagc caggttgtca gtgaactcag taatatttgg ccagaattga 840
aaattgtcca tgggaagtct cagacctgcc aaagccagag ttctgcagaa caaactgagg 900
atatccgaaa gaggattttc tcctggatgc aaactaacaa ctcatcacac tggactgaat 960
ttttgtggtt cattcagatg tcccaaaatc agccctatca cagaagcatg caacagactc 1020
catgtgaaag tgcatttagc tctgaagcta aactgggctt gtcccattct cagctaactg 1080
aagaacttgt tgccagcttg catacagaaa atgaattaga tcaggctgac aaagagttag 1140
aaaatacttt aagagcccag tatgaagaaa acattgagac tggaacagac agtagtgata 1200
ttgaagagaa tctttctgtc actcctaagg tggctgaaaa aagccctcct gagagcagac 1260
taagattttt atcctgtgta gtttgtgaaa aagaatgcac aggtgttaat agttgtatat 1320
catgtgatgg aaatatccat gcaatttgtg gagtgccctc tcaacatggg actgagggct 1380
gtggtcggca aataacttgt agcctttgct atgaaaccag cacaatgaag aggaaacatg 1440
atgagattca aagaagtttg cctgttaaac cttccaaaat gctaaagcca tcagggacac 1500
cattttcacc agacaaagta g gag act gga tgg cga aac aag ctt cac tgg 1551
Glu Thr Gly Trp Arg Asn Lys Leu His Trp
1 5 10
act ttt ttt gtc aag aaa aga cat gcc ttt tct gaa cac agt agt agt 1599
Thr Phe Phe Val Lys Lys Arg His Ala Phe Ser Glu His Ser Ser Ser
15 20 25
aat aaa aga aat gtt aac aat aga agt tat cct gaa gaa ggg aaa acc 1647
Asn Lys Arg Asn Val Asn Asn Arg Ser Tyr Pro Glu Glu Gly Lys Thr
30 35 40
aaa aga gtt cat gct agt ttc act cgg aaa tat gat cct tca tat att 1695
Lys Arg Val His Ala Ser Phe Thr Arg Lys Tyr Asp Pro Ser Tyr Ile
45 50 55
gag ttt ggt ttt gta gct gta att gat ggt gaa gta cta aaa cca cag 1743
Glu Phe Gly Phe Val Ala Val Ile Asp Gly Glu Val Leu Lys Pro Gln
60 65 70
tgt att att tgt gga gat gta ctg gct aat gaa gca atg aaa cca tca 1791
Cys Ile Ile Cys Gly Asp Val Leu Ala Asn Glu Ala Met Lys Pro Ser
75 80 85 90
aaa ctt aag cga cat tta tat tca aaa cat aaa gaa ata agt tca caa 1839
Lys Leu Lys Arg His Leu Tyr Ser Lys His Lys Glu Ile Ser Ser Gln
95 100 105
cca aaa gaa ttc ttt gaa aga aag agt agt gaa ttg aaa agc caa cca 1887
Pro Lys Glu Phe Phe Glu Arg Lys Ser Ser Glu Leu Lys Ser Gln Pro
110 115 120
aag cag gtg ttc aac gtt tct cat ata aac att agt gct ttg cgg gct 1935
Lys Gln Val Phe Asn Val Ser His Ile Asn Ile Ser Ala Leu Arg Ala
125 130 135
tca tat aaa gta gca ctt ccg gtt gcc aag tct aaa aca cca tac aca 1983
Ser Tyr Lys Val Ala Leu Pro Val Ala Lys Ser Lys Thr Pro Tyr Thr
140 145 150
att gct gag aca cta gtg aaa gac tgc atc aaa gaa gtt tgc ttg gaa 2031
Ile Ala Glu Thr Leu Val Lys Asp Cys Ile Lys Glu Val Cys Leu Glu
155 160 165 170
atg ttg ggt gaa tct gca gca aag aag gta gct cag gta cca ctt tcc 2079
Met Leu Gly Glu Ser Ala Ala Lys Lys Val Ala Gln Val Pro Leu Ser
175 180 185
aat gac acc ata gct cga cgt att cag gaa ctg gct aat gat atg gaa 2127
Asn Asp Thr Ile Ala Arg Arg Ile Gln Glu Leu Ala Asn Asp Met Glu
190 195 200
gat caa ctc ata gaa caa ata aaa cta gca aag tat ttt tca ttg caa 2175
Asp Gln Leu Ile Glu Gln Ile Lys Leu Ala Lys Tyr Phe Ser Leu Gln
205 210 215
ctt gat gaa tgc aga gat att gct aac atg ata att ctt tta gtc tat 2223
Leu Asp Glu Cys Arg Asp Ile Ala Asn Met Ile Ile Leu Leu Val Tyr
220 225 230
gtg agg ttt gaa cat gat gat gat ata aag gaa gag ttc ttt ttt tca 2271
Val Arg Phe Glu His Asp Asp Asp Ile Lys Glu Glu Phe Phe Phe Ser
235 240 245 250
gcc tct ttg cct aca aac aca act agc tca gaa ctg tat gaa gct gta 2319
Ala Ser Leu Pro Thr Asn Thr Thr Ser Ser Glu Leu Tyr Glu Ala Val
255 260 265
aag aat tat att gtt aac aaa tgt ggt ttg gaa ttt aaa ttt tgt gta 2367
Lys Asn Tyr Ile Val Asn Lys Cys Gly Leu Glu Phe Lys Phe Cys Val
270 275 280
gga gta tgt tct gat ggt gca gct tca atg aca gga aaa cat tct gaa 2415
Gly Val Cys Ser Asp Gly Ala Ala Ser Met Thr Gly Lys His Ser Glu
285 290 295
gtg gta acc cag att aag gaa ctt gcg cca gaa tgt aaa aca aca cat 2463
Val Val Thr Gln Ile Lys Glu Leu Ala Pro Glu Cys Lys Thr Thr His
300 305 310
tgc ttc att cat cga gaa agt ctt gcc atg aaa aaa ata tca gct gaa 2511
Cys Phe Ile His Arg Glu Ser Leu Ala Met Lys Lys Ile Ser Ala Glu
315 320 325 330
cta aat agt gta ctt aat gat ata gta aaa att gtg aat tat ata aaa 2559
Leu Asn Ser Val Leu Asn Asp Ile Val Lys Ile Val Asn Tyr Ile Lys
335 340 345
tct aat tca ttg aat tca aga tta ttc tct tta tta tgt gat aat atg 2607
Ser Asn Ser Leu Asn Ser Arg Leu Phe Ser Leu Leu Cys Asp Asn Met
350 355 360
gaa gct gat cat aag caa ctg tta ctg cat gct gag ata cgg tgg tta 2655
Glu Ala Asp His Lys Gln Leu Leu Leu His Ala Glu Ile Arg Trp Leu
365 370 375
tca cgg gga aaa gtt ctg tca aga atg ttt gaa ata cga aat gaa ctc 2703
Ser Arg Gly Lys Val Leu Ser Arg Met Phe Glu Ile Arg Asn Glu Leu
380 385 390
tta gtg ttt ctg caa ggc aag aaa ccc atg tgg tcc caa ctt ttt aaa 2751
Leu Val Phe Leu Gln Gly Lys Lys Pro Met Trp Ser Gln Leu Phe Lys
395 400 405 410
gat gtg aat tgg aca gcc aga ctt gct tat ttg tct gat atc ttc agt 2799
Asp Val Asn Trp Thr Ala Arg Leu Ala Tyr Leu Ser Asp Ile Phe Ser
415 420 425
att ttt aat gat ctt aat gct tct atg caa ggg aag aat gca act tat 2847
Ile Phe Asn Asp Leu Asn Ala Ser Met Gln Gly Lys Asn Ala Thr Tyr
430 435 440
ttt tca atg gca gat aaa gtt gaa gga caa aaa cag aag tta gaa gct 2895
Phe Ser Met Ala Asp Lys Val Glu Gly Gln Lys Gln Lys Leu Glu Ala
445 450 455
tgg aaa aac aga att tct aca gat tgt tat gac atg ttt cat aat tta 2943
Trp Lys Asn Arg Ile Ser Thr Asp Cys Tyr Asp Met Phe His Asn Leu
460 465 470
aca aca att atc aat gaa gta ggt aat gat ctt gat att gca cat ctg 2991
Thr Thr Ile Ile Asn Glu Val Gly Asn Asp Leu Asp Ile Ala His Leu
475 480 485 490
cga aaa gtt atc agt gaa cat ctt aca aat ttg tta gaa tgt ttt gaa 3039
Arg Lys Val Ile Ser Glu His Leu Thr Asn Leu Leu Glu Cys Phe Glu
495 500 505
ttt tat ttt cca tca aaa gaa gat cca cgc ata gga aat ttg tgg atc 3087
Phe Tyr Phe Pro Ser Lys Glu Asp Pro Arg Ile Gly Asn Leu Trp Ile
510 515 520
caa aat cca ttt ctt tca tca aaa gat aac tta aat tta act gta act 3135
Gln Asn Pro Phe Leu Ser Ser Lys Asp Asn Leu Asn Leu Thr Val Thr
525 530 535
cta cag gat aag ttg ttg aag ctg gct acc gac gaa gga ttg aaa atc 3183
Leu Gln Asp Lys Leu Leu Lys Leu Ala Thr Asp Glu Gly Leu Lys Ile
540 545 550
agt ttt gaa aat aca gca tca ctt cct tca ttt tgg ata aaa gct aaa 3231
Ser Phe Glu Asn Thr Ala Ser Leu Pro Ser Phe Trp Ile Lys Ala Lys
555 560 565 570
aat gac tat cct gag ctt gct gag att gct tta aaa ttg ctg ctt ctt 3279
Asn Asp Tyr Pro Glu Leu Ala Glu Ile Ala Leu Lys Leu Leu Leu Leu
575 580 585
ttc ccc tca aca tac ctc tgt gag acc gga ttc tct act tta agt gtt 3327
Phe Pro Ser Thr Tyr Leu Cys Glu Thr Gly Phe Ser Thr Leu Ser Val
590 595 600
att aaa aca aaa cat aga aac agt tta aat ata cat tat ccc ctg agg 3375
Ile Lys Thr Lys His Arg Asn Ser Leu Asn Ile His Tyr Pro Leu Arg
605 610 615
gta gca ttg tca tca atc caa cct aga tta gac aaa tta aca agc aag 3423
Val Ala Leu Ser Ser Ile Gln Pro Arg Leu Asp Lys Leu Thr Ser Lys
620 625 630
aag caa gct cac tta tca cat taaaagcttt aaatattgat atgtaaggta 3474
Lys Gln Ala His Leu Ser His
635 640
ttggttcaaa gtatgcatat aagcattgag tgtgaggaat ttgctatttc actttaaact 3534
ttctgtctag ttacagttat ggaagtatga gaagttatga gtgaaacagc aattttctat 3594
ataaattgcc tatatgtata ttttcaatta agaatgtgta cagtttttat aattctattt 3654
ttcctcatat ttgtcgtatt tattaaaata taattttaaa tctgttgatt ctaatattaa 3714
aacatttgat ctt 3727
<210> 16
<211> 4070
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (528)..(3821)
<400> 16
gccagctgct gcgcatgcgc cggccggggc cccgccccca tgcgccgcgc ggctccaggg 60
ccacgttcca gggtcgggtt tggtggattc ctcagtccct gccgccgcgg ggcgccctgg 120
gatagcggcg gggcctcctg gtgagcgcgc gccggggcgg cctccgggaa gtgggagacg 180
ctgcgggtcc tgggcccagg ccttgggatg ggcgggaagg cttggccgcg ccgggctgtg 240
ggcactgcag gaggcccctg tgcaggtgga gatcgccgcg gccctggcgg gactccttgc 300
tggctcttgg gcgcgctgat gcccatcatc tccctgagtt tctgagccct atctctcatg 360
tgtcagtggt caccgccgaa tccagacact ccggccctgt tccggaagag ccctgatatc 420
cgtggctcca tggtgctgtc tgtcgatacc atgcactcta gctctcaagg aggaaaggtt 480
ttgtggaagg gaatagagac ttggaataac agacctgtgc taattag aga cag gaa 536
Arg Gln Glu
1
aga tgg aac aag ggg agt gac cct tct cca ccc cca tat cct aat gtg 584
Arg Trp Asn Lys Gly Ser Asp Pro Ser Pro Pro Pro Tyr Pro Asn Val
5 10 15
ctc tct ctc tat cca gaa cag atc tcg gcc cct ttc caa aca ctc ctg 632
Leu Ser Leu Tyr Pro Glu Gln Ile Ser Ala Pro Phe Gln Thr Leu Leu
20 25 30 35
atg cct cat ttg cct ctc gcc tct ttt cga cca cca ttt tgg ggg ctg 680
Met Pro His Leu Pro Leu Ala Ser Phe Arg Pro Pro Phe Trp Gly Leu
40 45 50
agg cac tca cgg ggc ctc ccc agg ttt cac tcc gtt tct aca cag tcg 728
Arg His Ser Arg Gly Leu Pro Arg Phe His Ser Val Ser Thr Gln Ser
55 60 65
gag ccc cat gga tct ccc atc tcc cgg agg aac cgt gaa gcc aaa cag 776
Glu Pro His Gly Ser Pro Ile Ser Arg Arg Asn Arg Glu Ala Lys Gln
70 75 80
aag cgc ctg cga gag aag cag gcg act ctg gag gct gag ata gca ggg 824
Lys Arg Leu Arg Glu Lys Gln Ala Thr Leu Glu Ala Glu Ile Ala Gly
85 90 95
gag agc aag tca cct gca gaa tcc att aag gcc tgg agg cct aag gag 872
Glu Ser Lys Ser Pro Ala Glu Ser Ile Lys Ala Trp Arg Pro Lys Glu
100 105 110 115
tta gta ttg tat gaa atc cct acg aaa ccc ggt gaa aag aaa gat gtc 920
Leu Val Leu Tyr Glu Ile Pro Thr Lys Pro Gly Glu Lys Lys Asp Val
120 125 130
tct ggg ccc ctg cct cct gca tac agc ccc cga tat gtt gag gct gcc 968
Ser Gly Pro Leu Pro Pro Ala Tyr Ser Pro Arg Tyr Val Glu Ala Ala
135 140 145
tgg tac ccg tgg tgg gta cga gag ggc ttc ttc aaa cca gaa tat cag 1016
Trp Tyr Pro Trp Trp Val Arg Glu Gly Phe Phe Lys Pro Glu Tyr Gln
150 155 160
gcc cgg ctg ccc caa gct aca ggg gag acc ttt tcc atg tgt atc cca 1064
Ala Arg Leu Pro Gln Ala Thr Gly Glu Thr Phe Ser Met Cys Ile Pro
165 170 175
cct ccc aat gtc act ggc tcc ctg cac att ggc cac gca ctc acg gtg 1112
Pro Pro Asn Val Thr Gly Ser Leu His Ile Gly His Ala Leu Thr Val
180 185 190 195
gcc ata cag gat gcc ctc gtg cgc tgg cac cgg atg cgt ggg gat caa 1160
Ala Ile Gln Asp Ala Leu Val Arg Trp His Arg Met Arg Gly Asp Gln
200 205 210
gtg ctg tgg gtc cct ggt tca gat cat gca gga att gct aca caa gct 1208
Val Leu Trp Val Pro Gly Ser Asp His Ala Gly Ile Ala Thr Gln Ala
215 220 225
gtg gtg gag aaa caa ctg tgg aag gaa cgg gga gtg agg aga cat gag 1256
Val Val Glu Lys Gln Leu Trp Lys Glu Arg Gly Val Arg Arg His Glu
230 235 240
ctg agc cgg gag gcc ttc ctt agg gag gtg tgg cag tgg aag gag gcg 1304
Leu Ser Arg Glu Ala Phe Leu Arg Glu Val Trp Gln Trp Lys Glu Ala
245 250 255
aaa ggt gga gag atc tgt gag cag ctg cga gct ctg ggt gcc tcc ctg 1352
Lys Gly Gly Glu Ile Cys Glu Gln Leu Arg Ala Leu Gly Ala Ser Leu
260 265 270 275
gac tgg gat cga gag tgt ttt acc atg gat gtt ggc tcc tca gtg gct 1400
Asp Trp Asp Arg Glu Cys Phe Thr Met Asp Val Gly Ser Ser Val Ala
280 285 290
gtg act gaa gct ttt gtg cgg ctc tac aag gcg ggg ttg ctg tac cgg 1448
Val Thr Glu Ala Phe Val Arg Leu Tyr Lys Ala Gly Leu Leu Tyr Arg
295 300 305
aac cat cag ctt gtc aac tgg tca tgt gct tta aga tca gcc atc tcg 1496
Asn His Gln Leu Val Asn Trp Ser Cys Ala Leu Arg Ser Ala Ile Ser
310 315 320
gac att gag gtg gag aac cgg ccc ctg cct ggc cac aca cag ctt cga 1544
Asp Ile Glu Val Glu Asn Arg Pro Leu Pro Gly His Thr Gln Leu Arg
325 330 335
ctg cct ggc tgc ccc acc ccc gtg tct ttt ggc ctc cta ttt tct gtt 1592
Leu Pro Gly Cys Pro Thr Pro Val Ser Phe Gly Leu Leu Phe Ser Val
340 345 350 355
gcc ttc ccc gtg gat gga gag cct gat gca gag gtt gtg gta gga acc 1640
Ala Phe Pro Val Asp Gly Glu Pro Asp Ala Glu Val Val Val Gly Thr
360 365 370
aca agg cca gag acg ctg cct gga gat gtg gct gtg gcc gtt cat cca 1688
Thr Arg Pro Glu Thr Leu Pro Gly Asp Val Ala Val Ala Val His Pro
375 380 385
gac gac tcg cga tac aca cat cta cac ggg cga cag ctt cgt cac ccc 1736
Asp Asp Ser Arg Tyr Thr His Leu His Gly Arg Gln Leu Arg His Pro
390 395 400
ttg atg ggg cag cct ctt ccc ctc atc aca gac tat gct gtt cag cca 1784
Leu Met Gly Gln Pro Leu Pro Leu Ile Thr Asp Tyr Ala Val Gln Pro
405 410 415
cat gtg ggc acg ggg gca gtg aag gtg act cca gct cac agt cct gcc 1832
His Val Gly Thr Gly Ala Val Lys Val Thr Pro Ala His Ser Pro Ala
420 425 430 435
gat gct gag atg ggg gcc cga cat ggc ttg agc ccc ttg aat gtc att 1880
Asp Ala Glu Met Gly Ala Arg His Gly Leu Ser Pro Leu Asn Val Ile
440 445 450
gcg gag gat ggg acc atg acc tcc ctc tgc ggg gac tgg ctg cag ggt 1928
Ala Glu Asp Gly Thr Met Thr Ser Leu Cys Gly Asp Trp Leu Gln Gly
455 460 465
ctt cac cgg ttt gtg gcc cgg gaa aag ata atg tct gtg ctg agt gaa 1976
Leu His Arg Phe Val Ala Arg Glu Lys Ile Met Ser Val Leu Ser Glu
470 475 480
tgg ggc ctg ttc cgg ggc ctc cag aac cac ccc atg gta ctg ccc atc 2024
Trp Gly Leu Phe Arg Gly Leu Gln Asn His Pro Met Val Leu Pro Ile
485 490 495
tgc agc cgt tct ggg gat gtg ata gaa tac ctg ctg aag aac cag tgg 2072
Cys Ser Arg Ser Gly Asp Val Ile Glu Tyr Leu Leu Lys Asn Gln Trp
500 505 510 515
ttt gtc cgc tgc cag gaa atg ggg gcc cga gct gcc aag gct gtg gag 2120
Phe Val Arg Cys Gln Glu Met Gly Ala Arg Ala Ala Lys Ala Val Glu
520 525 530
tcg ggg gcc ctg gag ctc agt ccc tcc ttc cac cag aag aac tgg cag 2168
Ser Gly Ala Leu Glu Leu Ser Pro Ser Phe His Gln Lys Asn Trp Gln
535 540 545
cac tgg ttt tcc cat att ggg gac tgg tgt gtc tcc cgg cag ctg tgg 2216
His Trp Phe Ser His Ile Gly Asp Trp Cys Val Ser Arg Gln Leu Trp
550 555 560
tgg ggc cat cag att cca gcc tac ctg gtt gta gag gac cat gcg cag 2264
Trp Gly His Gln Ile Pro Ala Tyr Leu Val Val Glu Asp His Ala Gln
565 570 575
gga gaa gag gac tgt tgg gtg gtt ggg cgg tca gag gct gag gcc aga 2312
Gly Glu Glu Asp Cys Trp Val Val Gly Arg Ser Glu Ala Glu Ala Arg
580 585 590 595
gag gta gca gcg gaa ctg aca ggg agg cca ggg gca gag ctg acc ctg 2360
Glu Val Ala Ala Glu Leu Thr Gly Arg Pro Gly Ala Glu Leu Thr Leu
600 605 610
gag agg gat cct gat gtc cta gac aca tgg ttt tct tct gcc ctg ttc 2408
Glu Arg Asp Pro Asp Val Leu Asp Thr Trp Phe Ser Ser Ala Leu Phe
615 620 625
ccc ttt tct gcc ctg ggc tgg ccc caa gag acc cca gac ctt gct cgt 2456
Pro Phe Ser Ala Leu Gly Trp Pro Gln Glu Thr Pro Asp Leu Ala Arg
630 635 640
ttc tac ccc ctg tca ctt ttg gaa acg ggc agc gac ctt ctg ctg ttc 2504
Phe Tyr Pro Leu Ser Leu Leu Glu Thr Gly Ser Asp Leu Leu Leu Phe
645 650 655
tgg gtg ggc cgc atg gtc atg ttg ggg acc cag ctc aca ggg cag ctg 2552
Trp Val Gly Arg Met Val Met Leu Gly Thr Gln Leu Thr Gly Gln Leu
660 665 670 675
ccc ttc agc aag gtg ctt ctt cat ccc atg gtt cgg gac agg cag ggc 2600
Pro Phe Ser Lys Val Leu Leu His Pro Met Val Arg Asp Arg Gln Gly
680 685 690
cgg aag atg agc aag tcc ctg ggg aat gtg ctg gac cca aga gac atc 2648
Arg Lys Met Ser Lys Ser Leu Gly Asn Val Leu Asp Pro Arg Asp Ile
695 700 705
atc agt ggg gtg gag atg cag gtg ctg cag gaa aag ctg aga agc gga 2696
Ile Ser Gly Val Glu Met Gln Val Leu Gln Glu Lys Leu Arg Ser Gly
710 715 720
aat ttg gac cct gca gag ctg gcc att gtg gct gca gca cag aaa aag 2744
Asn Leu Asp Pro Ala Glu Leu Ala Ile Val Ala Ala Ala Gln Lys Lys
725 730 735
gac ttt cct cac ggg atc cct gag tgt ggg aca gat gcc ctg aga ttc 2792
Asp Phe Pro His Gly Ile Pro Glu Cys Gly Thr Asp Ala Leu Arg Phe
740 745 750 755
aca ctc tgc tcc cat gga gtt cag gcg ggc gac ttg cac ctg tca gtc 2840
Thr Leu Cys Ser His Gly Val Gln Ala Gly Asp Leu His Leu Ser Val
760 765 770
tct gag gtc cag agc tgc cga cat ttc tgc aac aag atc tgg aat gct 2888
Ser Glu Val Gln Ser Cys Arg His Phe Cys Asn Lys Ile Trp Asn Ala
775 780 785
ctt cgc ttt atc ctc aat gct tta ggg gag aaa ttt gtg cca cag cct 2936
Leu Arg Phe Ile Leu Asn Ala Leu Gly Glu Lys Phe Val Pro Gln Pro
790 795 800
gct gag gag ctg tct ccc tcc tcc ccg atg gat gcc tgg atc ctg agc 2984
Ala Glu Glu Leu Ser Pro Ser Ser Pro Met Asp Ala Trp Ile Leu Ser
805 810 815
cgc ctt gcc ctg gct gcc cag gag tgt gag cgg ggc ttc ctc acc cga 3032
Arg Leu Ala Leu Ala Ala Gln Glu Cys Glu Arg Gly Phe Leu Thr Arg
820 825 830 835
gag ctc tcg ctc gtc act cat gcc ctg cac cac ttc tgg ctt cac aac 3080
Glu Leu Ser Leu Val Thr His Ala Leu His His Phe Trp Leu His Asn
840 845 850
ctc tgt gac gtc tac ctg gag gct gtg aag ccc gtg ctg tgg cac tcg 3128
Leu Cys Asp Val Tyr Leu Glu Ala Val Lys Pro Val Leu Trp His Ser
855 860 865
ccc cgc ccc ctg ggg ccc cct cag gtc ctg ttc tcc tgc gct gac ctc 3176
Pro Arg Pro Leu Gly Pro Pro Gln Val Leu Phe Ser Cys Ala Asp Leu
870 875 880
ggc ctc cgc ctc ctg gcc cca ctg atg ccc ttc ctg gct gaa gag ctc 3224
Gly Leu Arg Leu Leu Ala Pro Leu Met Pro Phe Leu Ala Glu Glu Leu
885 890 895
tgg cag agg ctg ccc ccc agg cct ggt tgc ccc cct gcc ccc agc atc 3272
Trp Gln Arg Leu Pro Pro Arg Pro Gly Cys Pro Pro Ala Pro Ser Ile
900 905 910 915
tcg gtt gcc ccc tac ccc agc gcc tgc agc ttg gag cac tgg cgc cag 3320
Ser Val Ala Pro Tyr Pro Ser Ala Cys Ser Leu Glu His Trp Arg Gln
920 925 930
cca gag ctg gag cgg cgc ttc tcc cgg gtc caa gag gtc gtg cag gtg 3368
Pro Glu Leu Glu Arg Arg Phe Ser Arg Val Gln Glu Val Val Gln Val
935 940 945
cta agg gct ctc cga gcc acg tac cag ctc acc aaa gcc cgg ccc cga 3416
Leu Arg Ala Leu Arg Ala Thr Tyr Gln Leu Thr Lys Ala Arg Pro Arg
950 955 960
gtg ctg ctg cag agc tca gag cct ggg gac cag ggc ctc ttc gag gcc 3464
Val Leu Leu Gln Ser Ser Glu Pro Gly Asp Gln Gly Leu Phe Glu Ala
965 970 975
ttc ttg gag ccc ctg ggc acc ctg ggc tac tgt ggg gct gtg ggc ctg 3512
Phe Leu Glu Pro Leu Gly Thr Leu Gly Tyr Cys Gly Ala Val Gly Leu
980 985 990 995
tta ccc cca ggc gca gca gct ccc tcc ggc tgg gcc cag gct cca ctc 3560
Leu Pro Pro Gly Ala Ala Ala Pro Ser Gly Trp Ala Gln Ala Pro Leu
1000 1005 1010
agt gac acg gct caa gtc tac atg gag ctg cag ggc ctg gtg gac ccg 3608
Ser Asp Thr Ala Gln Val Tyr Met Glu Leu Gln Gly Leu Val Asp Pro
1015 1020 1025
cag atc cag cta cct ctg tta gcc gcc cga agg tac aag ttg cag aag 3656
Gln Ile Gln Leu Pro Leu Leu Ala Ala Arg Arg Tyr Lys Leu Gln Lys
1030 1035 1040
cag ctt gat agc ctc aca gcc agg acc cca tca gaa ggg gag gca ggg 3704
Gln Leu Asp Ser Leu Thr Ala Arg Thr Pro Ser Glu Gly Glu Ala Gly
1045 1050 1055
act cag agg caa caa aag ctt tct tcc ctc cag ctg gaa ttg tca aaa 3752
Thr Gln Arg Gln Gln Lys Leu Ser Ser Leu Gln Leu Glu Leu Ser Lys
1060 1065 1070 1075
ctg gac aag gca gcc tct cac ctc cag cag ctg atg gat gag cct cca 3800
Leu Asp Lys Ala Ala Ser His Leu Gln Gln Leu Met Asp Glu Pro Pro
1080 1085 1090
gcc cca ggg agc ccg gag ctc taactcatca tccccatcag ttttcctccc 3851
Ala Pro Gly Ser Pro Glu Leu
1095
tctcagacct gtctttgagg acaaacagat ttgtcagctg tcagggtgca gtgggacgtc 3911
agagactatg tggtccatcg ccttcattgt gtaaatgagg acacagactg gcttggtcgc 3971
agtgactgtg gtgtccttga gatgctcaca ttactgcccg gcctgcctcc cacctggaag 4031
tctgggaatg aggagattga gataaacttt tgaaatccc 4070
<210> 17
<211> 3960
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1645)..(3636)
<400> 17
gcggccgcag cgccgtctgc tcagcgcccg ggtcaatagg agccagtcct tcgcaggcgt 60
cctcggcagc cacgagcggg ggcccaggag tttcccggtc ttcagcccgc cggggccccc 120
acggaagccc cccgcgctct cccgagtgtc caggatgttt tccgtggctc acccagccgc 180
caaggtgccg cagcccgagc ggctggacct ggtgtacacg gcgctgaagc ggggcctgac 240
ggcctacttg gaagtgcacc agcaggagca agagaaactc caggggcaga taagggagtc 300
caagaggaat tcccgcttgg gcttcctgta tgatctggac aagcaagtca agtccattga 360
acgcttcctg cgacgactgg agttccatgc cagcaagatc gatgagctgt atgaggcata 420
ctgtgtccag cggcgtctcc gggatggtgc ctacaacatg gtccgtgcct acaccactgg 480
gtccccggga agccgagagg cccgggacag cctggcagag gccactcggg ggcatcgcga 540
gtacacggag agcatgtgtc tgctggagag cgagctggag gcacagctgg gcgagtttca 600
tctccgaatg aaagggctgg ctggcttcgc caggctgtgt gtaggcgatc agtatgagat 660
ctgcatgaaa tatgggcgtc agcgctggaa actacggggc cgaattgagg gtagtggaaa 720
gcaggtgtgg gacagtgaag aaaccatctt tctccctcta ctcacggaat ttctgtctat 780
taaggtgaca gaactgaagg gcctggccaa ccatgtggtt gtgggcagtg tctcctgtga 840
gaccaaggac ctgtttgccg ccctgcccca ggttgtggct gtggatatca atgaccttgg 900
taccatcaag ctcagcctgg aagtcacatg gagccccttc gacaaggatg accagccctc 960
agctgcttct tctgtcaaca aggcctccac agtcaccaag cgcttctcca cctatagcca 1020
gagcccaccg gacacaccct cacttcggga acaggctttc tataacatgc tgcgacggca 1080
ggaggagctg gagaatggga cagcatggtc cctgtcatct gaatcttcag acgactcatc 1140
cagcccacag ctctcaggca ctgcccgcca ctcaccagcc cctaggcccc tggtgcagca 1200
gcccgagccc cttcccatcc aagttgcctt ccgcaggcct gagaccccca gctctgggcc 1260
cttggatgag gagggggccg tggccccagt cctggcaaat gggcatgcac cctacagtcg 1320
gactctgagc cacatcagtg aggctagtgt agatgctgcc ttggctgagg cttcagtgga 1380
ggccgttggc ccagaaagcc tagcctgggg acctagccca cctacacacc cagctcccac 1440
ccatggagag caccccagtc ctgttcctcc tgccctggac cctggccact ctgccacaag 1500
ctctaccctc ggtacaacag gctctgtccc cacatctaca gaccctgccc catctgcaca 1560
cctagactca gttcataagt ccacagactc tggcccttca gaactgccag gccccactca 1620
caccactaca ggctctacct atag tgc cat tac cac tac cca cag tgc tcc 1671
Cys His Tyr His Tyr Pro Gln Cys Ser
1 5
aag ccc cct cac tca cac tac tac agg ctc cac ccc aag ccc ata atc 1719
Lys Pro Pro His Ser His Tyr Tyr Arg Leu His Pro Lys Pro Ile Ile
10 15 20 25
tct acc ctt act act aca ggc cct acc ctc aat atc ata ggc cca gtc 1767
Ser Thr Leu Thr Thr Thr Gly Pro Thr Leu Asn Ile Ile Gly Pro Val
30 35 40
cag act acc aca agc ccc acc cac act atg cca agc cct acc cat acc 1815
Gln Thr Thr Thr Ser Pro Thr His Thr Met Pro Ser Pro Thr His Thr
45 50 55
aca gca agc ccc act cat act tcc aca agc ccc acc cat acc ccc aca 1863
Thr Ala Ser Pro Thr His Thr Ser Thr Ser Pro Thr His Thr Pro Thr
60 65 70
agt ccc acc cac aaa acc agt atg tca cct ccc acc act aca agt cct 1911
Ser Pro Thr His Lys Thr Ser Met Ser Pro Pro Thr Thr Thr Ser Pro
75 80 85
acc ccc agt ggt atg ggc cta gtc cag act gcc aca agt ccc acc cat 1959
Thr Pro Ser Gly Met Gly Leu Val Gln Thr Ala Thr Ser Pro Thr His
90 95 100 105
cct acc aca agc ccc acc cat ccc acc aca agc ccc atc ctt ata aat 2007
Pro Thr Thr Ser Pro Thr His Pro Thr Thr Ser Pro Ile Leu Ile Asn
110 115 120
gta agc cct tcc act tct cta gaa ctt gct acc ctc tcc agc ccc tcc 2055
Val Ser Pro Ser Thr Ser Leu Glu Leu Ala Thr Leu Ser Ser Pro Ser
125 130 135
aaa cac tca gac ccc acc ctc cca ggc act gac tcc ctt ccc tgt agt 2103
Lys His Ser Asp Pro Thr Leu Pro Gly Thr Asp Ser Leu Pro Cys Ser
140 145 150
ccc cca gtc tcc aat tcc tac act cag gca gac cct atg gcc ccc aga 2151
Pro Pro Val Ser Asn Ser Tyr Thr Gln Ala Asp Pro Met Ala Pro Arg
155 160 165
act ccc cac cca agt cct gcc cat tcc agt agg aaa ccc ctc aca agc 2199
Thr Pro His Pro Ser Pro Ala His Ser Ser Arg Lys Pro Leu Thr Ser
170 175 180 185
cct gcc cca gat ccc tca gag tct acg gtt cag agt cta agc ccc act 2247
Pro Ala Pro Asp Pro Ser Glu Ser Thr Val Gln Ser Leu Ser Pro Thr
190 195 200
ccc tca ccc cca acc cct gca ccc cag cat tca gac ctt tgc ctg gcc 2295
Pro Ser Pro Pro Thr Pro Ala Pro Gln His Ser Asp Leu Cys Leu Ala
205 210 215
atg gct gtc cag acc cca gtc cca acg gca gcc gga ggg tct ggg gac 2343
Met Ala Val Gln Thr Pro Val Pro Thr Ala Ala Gly Gly Ser Gly Asp
220 225 230
agg agc ctg gag gag gca ctg ggg gcc cta atg gct gcc ctg gat gac 2391
Arg Ser Leu Glu Glu Ala Leu Gly Ala Leu Met Ala Ala Leu Asp Asp
235 240 245
tac cgt ggc cag ttt cct gag ctg cag ggc ctg gag cag gag gtg acc 2439
Tyr Arg Gly Gln Phe Pro Glu Leu Gln Gly Leu Glu Gln Glu Val Thr
250 255 260 265
cgc cta gaa agt ctg ctc atg aga caa ggt ctg act cgc agc cgg gcc 2487
Arg Leu Glu Ser Leu Leu Met Arg Gln Gly Leu Thr Arg Ser Arg Ala
270 275 280
tcc agt ctc agc atc act gtg gag cat gcc ttg gag agc ttc agc ttc 2535
Ser Ser Leu Ser Ile Thr Val Glu His Ala Leu Glu Ser Phe Ser Phe
285 290 295
ctc aat gaa gac gaa gat gaa gac aat gat gtt cct ggg gac agg cct 2583
Leu Asn Glu Asp Glu Asp Glu Asp Asn Asp Val Pro Gly Asp Arg Pro
300 305 310
cca agc agc ccg gag gct ggg gct gag gac agc ata gac tca ccc agt 2631
Pro Ser Ser Pro Glu Ala Gly Ala Glu Asp Ser Ile Asp Ser Pro Ser
315 320 325
gcc cgc ccc ctc agc acg ggg tgt cca gct ctg gat gct gcc ttg gtc 2679
Ala Arg Pro Leu Ser Thr Gly Cys Pro Ala Leu Asp Ala Ala Leu Val
330 335 340 345
cgg cac ctg tac cac tgc agt cgc ctc ctg ctg aaa ctg ggc aca ttt 2727
Arg His Leu Tyr His Cys Ser Arg Leu Leu Leu Lys Leu Gly Thr Phe
350 355 360
ggg ccc ctg cgc tgc cag gag gca tgg gcc ctg gag cgg ctg ctg cgg 2775
Gly Pro Leu Arg Cys Gln Glu Ala Trp Ala Leu Glu Arg Leu Leu Arg
365 370 375
gaa gcc cga gta ctg gag gca gta tgc gag ttc agc agg cgg tgg gag 2823
Glu Ala Arg Val Leu Glu Ala Val Cys Glu Phe Ser Arg Arg Trp Glu
380 385 390
atc ccg gcc agc tct gcc cag gaa gtg gtg cag ttc tcg gcc tct cgg 2871
Ile Pro Ala Ser Ser Ala Gln Glu Val Val Gln Phe Ser Ala Ser Arg
395 400 405
cct ggc ttc ctg acc ttc tgg gac cag tgc aca gag aga ctc agc tgc 2919
Pro Gly Phe Leu Thr Phe Trp Asp Gln Cys Thr Glu Arg Leu Ser Cys
410 415 420 425
ttc ctc tgc ccg gtg gag cgg gtg ctt ctc acc ttc tgc aac cag tat 2967
Phe Leu Cys Pro Val Glu Arg Val Leu Leu Thr Phe Cys Asn Gln Tyr
430 435 440
ggt gcc cgc ctc tcc ctg cgc cag cca ggc ttg gct gag gct gtg tgt 3015
Gly Ala Arg Leu Ser Leu Arg Gln Pro Gly Leu Ala Glu Ala Val Cys
445 450 455
gtg aag ttc ctg gag gat gcc ctg ggg cag aag ctg ccc aga agg ccc 3063
Val Lys Phe Leu Glu Asp Ala Leu Gly Gln Lys Leu Pro Arg Arg Pro
460 465 470
cag cca ggg cct gga gag cag ctc aca gtc ttc cag ttc tgg agt ttt 3111
Gln Pro Gly Pro Gly Glu Gln Leu Thr Val Phe Gln Phe Trp Ser Phe
475 480 485
gtg gaa acc ttg gac agc ccc acc atg gag gcc tac gtg act gag acc 3159
Val Glu Thr Leu Asp Ser Pro Thr Met Glu Ala Tyr Val Thr Glu Thr
490 495 500 505
gct gag gag gtg cta ctg gtg cgg aat ctg aac tcg gat gat cag gct 3207
Ala Glu Glu Val Leu Leu Val Arg Asn Leu Asn Ser Asp Asp Gln Ala
510 515 520
gtt gtg ctg aag gcc ctg aga ttg gcg ccc gag ggg cgt ctg cga agg 3255
Val Val Leu Lys Ala Leu Arg Leu Ala Pro Glu Gly Arg Leu Arg Arg
525 530 535
gac ggg ctg cgg gcc ctc agc tcc ctg ctc gtc cat ggc aac aac aag 3303
Asp Gly Leu Arg Ala Leu Ser Ser Leu Leu Val His Gly Asn Asn Lys
540 545 550
gtc atg gct gct gtc agc acc cag ctc cgg agc ctg tca ctg ggc cct 3351
Val Met Ala Ala Val Ser Thr Gln Leu Arg Ser Leu Ser Leu Gly Pro
555 560 565
acc ttc cgg gag agg gcc ctc ctg tgc ttc ctg gac cag ctg gag gat 3399
Thr Phe Arg Glu Arg Ala Leu Leu Cys Phe Leu Asp Gln Leu Glu Asp
570 575 580 585
gag gac gtg cag act cga gtg gct ggc tgc ctg gcc cta ggc tgc atc 3447
Glu Asp Val Gln Thr Arg Val Ala Gly Cys Leu Ala Leu Gly Cys Ile
590 595 600
aag gct ccc gag ggc att gag ccc ctg gtg tac ctc tgc caa act gac 3495
Lys Ala Pro Glu Gly Ile Glu Pro Leu Val Tyr Leu Cys Gln Thr Asp
605 610 615
aca gaa gct gtg agg gaa gct gcc cgg caa agc cta cag cag tgt gga 3543
Thr Glu Ala Val Arg Glu Ala Ala Arg Gln Ser Leu Gln Gln Cys Gly
620 625 630
gaa gag gga cag tct gcc cat cga cgg ctg gag gag tcc ctg gac gcc 3591
Glu Glu Gly Gln Ser Ala His Arg Arg Leu Glu Glu Ser Leu Asp Ala
635 640 645
ctg ccc cgc atc ttt ggg cct ggc agc atg gcc agc aca gca ttc 3636
Leu Pro Arg Ile Phe Gly Pro Gly Ser Met Ala Ser Thr Ala Phe
650 655 660
taaactattc acccatgggt tcctggtgcc cctttccccc cactttcagg gctcaccagg 3696
cactggcagg gagggtaagg gctggctcca gatacccctc ccccacagat tcctagcaat 3756
gaaaatctaa tatattcttc tgttgcccct ggggttggag agtcagtgcc tgcagtcaag 3816
tgcctcccag cctcggctca gcacatccct tgccacaaat cagtgtctgg ggcttggcca 3876
ccctgccgct gcccagccac atcccttggt tttgtatttt atttacagag ttttacagaa 3936
aataaaaaag caaaatgtct ttcc 3960
<210> 18
<211> 3957
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(2401)
<400> 18
c gcg ggg ttg ctg cgg gag tcc ggg gat gtg gtc ctg tct ggc tgt agc 49
Ala Gly Leu Leu Arg Glu Ser Gly Asp Val Val Leu Ser Gly Cys Ser
1 5 10 15
acc ctg agc ctg ctg act ccc aca ctg caa cag ctg aac cac gta ttt 97
Thr Leu Ser Leu Leu Thr Pro Thr Leu Gln Gln Leu Asn His Val Phe
20 25 30
gag ctg cac ctg ggg cca tgg ggc cct ggc cag aca ggc ttt gtg gct 145
Glu Leu His Leu Gly Pro Trp Gly Pro Gly Gln Thr Gly Phe Val Ala
35 40 45
ctg ccc tcc cat cct gcc gac tcc cct gtt att ctt cag ctt cag ttt 193
Leu Pro Ser His Pro Ala Asp Ser Pro Val Ile Leu Gln Leu Gln Phe
50 55 60
ctc ttc gat gtg ctg cag aaa aca ctt tca ctc aag ctg gtc cat gtt 241
Leu Phe Asp Val Leu Gln Lys Thr Leu Ser Leu Lys Leu Val His Val
65 70 75 80
gct ggt cct ggc ccc aca ggg ccc atc aag att ttc ccc ttc aaa tcc 289
Ala Gly Pro Gly Pro Thr Gly Pro Ile Lys Ile Phe Pro Phe Lys Ser
85 90 95
ctt cgg cac ctg gag ctc cga ggt gtt ccc ctc cac tgt ctg cat ggc 337
Leu Arg His Leu Glu Leu Arg Gly Val Pro Leu His Cys Leu His Gly
100 105 110
ctc cga ggc atc tac tcc cag ctg gag acc ctg att tgc agc agg agc 385
Leu Arg Gly Ile Tyr Ser Gln Leu Glu Thr Leu Ile Cys Ser Arg Ser
115 120 125
ctc cag gca tta gag gag ctc ctc tca gcc tgc ggc ggc gac ttc tgc 433
Leu Gln Ala Leu Glu Glu Leu Leu Ser Ala Cys Gly Gly Asp Phe Cys
130 135 140
tct gcc ctc cct tgg ctg gct ctg ctt tct gcc aac ttc agc tac aat 481
Ser Ala Leu Pro Trp Leu Ala Leu Leu Ser Ala Asn Phe Ser Tyr Asn
145 150 155 160
gca ctg acc gcc tta gac agc tcc ctg cgc ctc ttg tca gct ctg cgt 529
Ala Leu Thr Ala Leu Asp Ser Ser Leu Arg Leu Leu Ser Ala Leu Arg
165 170 175
ttc ttg aac cta agc cac aat caa gtc cag gac tgt cag gga ttc ctg 577
Phe Leu Asn Leu Ser His Asn Gln Val Gln Asp Cys Gln Gly Phe Leu
180 185 190
atg gat ttg tgt gag ctc cac cat ctg gac atc tcc tat aat cgc ctg 625
Met Asp Leu Cys Glu Leu His His Leu Asp Ile Ser Tyr Asn Arg Leu
195 200 205
cat ttg gtg cca aga atg gga ccc tca ggg gct gct ctg ggg gtc ctg 673
His Leu Val Pro Arg Met Gly Pro Ser Gly Ala Ala Leu Gly Val Leu
210 215 220
ata ctg cga ggc aat gag ctt cgg agc ctg cat ggc cta gag cag ctg 721
Ile Leu Arg Gly Asn Glu Leu Arg Ser Leu His Gly Leu Glu Gln Leu
225 230 235 240
agg aat ctg cgg cac ctg gat ttg gca tac aac ctg ctg gaa gga cac 769
Arg Asn Leu Arg His Leu Asp Leu Ala Tyr Asn Leu Leu Glu Gly His
245 250 255
cgg gag ctg tca cca ctg tgg ctg ctg gct gag ctc cgc aag ctc tac 817
Arg Glu Leu Ser Pro Leu Trp Leu Leu Ala Glu Leu Arg Lys Leu Tyr
260 265 270
ctg gag ggg aac cct ctt tgg ttc cac cct gag cac cga gca gcc act 865
Leu Glu Gly Asn Pro Leu Trp Phe His Pro Glu His Arg Ala Ala Thr
275 280 285
gcc cag tac ttg tca ccc cgg gcc agg gat gct gct act ggc ttc ctt 913
Ala Gln Tyr Leu Ser Pro Arg Ala Arg Asp Ala Ala Thr Gly Phe Leu
290 295 300
ctc gat ggc aag gtc ttg tca ctg aca gat ttt cag act cac aca tcc 961
Leu Asp Gly Lys Val Leu Ser Leu Thr Asp Phe Gln Thr His Thr Ser
305 310 315 320
ttg ggg ctc agc ccc atg ggc cca cct ttg ccc tgg cca gtg ggg agt 1009
Leu Gly Leu Ser Pro Met Gly Pro Pro Leu Pro Trp Pro Val Gly Ser
325 330 335
act cct gaa acc tca ggt ggc cct gac ctg agt gac agc ctc tcc tca 1057
Thr Pro Glu Thr Ser Gly Gly Pro Asp Leu Ser Asp Ser Leu Ser Ser
340 345 350
ggg ggt gtt gtg acc cag ccc ctg ctt cat aag gtt aag agc cga gtc 1105
Gly Gly Val Val Thr Gln Pro Leu Leu His Lys Val Lys Ser Arg Val
355 360 365
cgt gtg agg cgg gca agc atc tct gaa ccc agt gat acg gac ccg gag 1153
Arg Val Arg Arg Ala Ser Ile Ser Glu Pro Ser Asp Thr Asp Pro Glu
370 375 380
ccc cga act ctg aac ccc tct ccg gct gga tgg ttc gtg cag cag cac 1201
Pro Arg Thr Leu Asn Pro Ser Pro Ala Gly Trp Phe Val Gln Gln His
385 390 395 400
ccg gag ctg gag ctc atg agc agc ttc cgg gaa cgg ttc ggc cgc aac 1249
Pro Glu Leu Glu Leu Met Ser Ser Phe Arg Glu Arg Phe Gly Arg Asn
405 410 415
tgg ctg cag tac agg agt cac ctg gag ccc tcc gga aac cct ctg ccg 1297
Trp Leu Gln Tyr Arg Ser His Leu Glu Pro Ser Gly Asn Pro Leu Pro
420 425 430
gcc acc ccc act act tct gca ccc agt gca cct cca gcc agc tcc cag 1345
Ala Thr Pro Thr Thr Ser Ala Pro Ser Ala Pro Pro Ala Ser Ser Gln
435 440 445
ggc ccc gac act gca ccc aga cct tca ccc ccg cag gag gaa gcc aga 1393
Gly Pro Asp Thr Ala Pro Arg Pro Ser Pro Pro Gln Glu Glu Ala Arg
450 455 460
ggc ccc cag gag tca cca cag aaa atg tca gag gag gtc agg gcg gag 1441
Gly Pro Gln Glu Ser Pro Gln Lys Met Ser Glu Glu Val Arg Ala Glu
465 470 475 480
cca cag gag gag gaa gag gag aag gag ggg aag gag gag aag gag gag 1489
Pro Gln Glu Glu Glu Glu Glu Lys Glu Gly Lys Glu Glu Lys Glu Glu
485 490 495
ggg gag atg gtg gaa cag gga gaa gag gag gca gga gag gag gaa gaa 1537
Gly Glu Met Val Glu Gln Gly Glu Glu Glu Ala Gly Glu Glu Glu Glu
500 505 510
gag gag cag gac cag aag gaa gtg gaa gcg gaa ctc tgt cgc ccc ttg 1585
Glu Glu Gln Asp Gln Lys Glu Val Glu Ala Glu Leu Cys Arg Pro Leu
515 520 525
ttg gtg tgt ccc ctg gag ggg cct gag ggc gta cgg ggc agg gaa tgc 1633
Leu Val Cys Pro Leu Glu Gly Pro Glu Gly Val Arg Gly Arg Glu Cys
530 535 540
ttt ctc agg gtc act tct gcc cac ctg ttt gag gtg gaa ctc caa gca 1681
Phe Leu Arg Val Thr Ser Ala His Leu Phe Glu Val Glu Leu Gln Ala
545 550 555 560
gct cgc acc ttg gag cga ctg gag ctc cag agt ctg gag gca gct gag 1729
Ala Arg Thr Leu Glu Arg Leu Glu Leu Gln Ser Leu Glu Ala Ala Glu
565 570 575
ata gag ccg gag gcc cag gcc cag agg tcg ccc agg ccc acg ggc tca 1777
Ile Glu Pro Glu Ala Gln Ala Gln Arg Ser Pro Arg Pro Thr Gly Ser
580 585 590
gat ctg ctc cct gga gcc ccc atc ctc agt ctg cgc ttc tcc tac atc 1825
Asp Leu Leu Pro Gly Ala Pro Ile Leu Ser Leu Arg Phe Ser Tyr Ile
595 600 605
tgc cct gac cgg cag ttg cgt cgc tat ttg gtg ctg gag cct gat gcc 1873
Cys Pro Asp Arg Gln Leu Arg Arg Tyr Leu Val Leu Glu Pro Asp Ala
610 615 620
cac gca gct gtc cag gag ctg ctt gcc gtg ttg acc cca gtc acc aat 1921
His Ala Ala Val Gln Glu Leu Leu Ala Val Leu Thr Pro Val Thr Asn
625 630 635 640
gtg gct cgg gaa cag ctt ggg gag gcc agg gac ctc ctg ctg ggt aga 1969
Val Ala Arg Glu Gln Leu Gly Glu Ala Arg Asp Leu Leu Leu Gly Arg
645 650 655
ttc cag tgt cta cgc tgt ggc cat gag ttc aag cca gag gag ccc agg 2017
Phe Gln Cys Leu Arg Cys Gly His Glu Phe Lys Pro Glu Glu Pro Arg
660 665 670
atg gga tta gac agt gag gaa ggc tgg agg cct ctg ttc caa aag aca 2065
Met Gly Leu Asp Ser Glu Glu Gly Trp Arg Pro Leu Phe Gln Lys Thr
675 680 685
ggg agc gga aac agg gag agc agt ctc tgg ctc ctt ctc cgt ttg cca 2113
Gly Ser Gly Asn Arg Glu Ser Ser Leu Trp Leu Leu Leu Arg Leu Pro
690 695 700
gcc ctg tct gcc acc ctc ctg gcc atg gtg acc acc ttg aca ggg cca 2161
Ala Leu Ser Ala Thr Leu Leu Ala Met Val Thr Thr Leu Thr Gly Pro
705 710 715 720
aga aca gcc cac ctc agg cac cga gca ccc gtg acc atg gta gtt gga 2209
Arg Thr Ala His Leu Arg His Arg Ala Pro Val Thr Met Val Val Gly
725 730 735
gcc tca gtc ccc ccc ctg agc gct gtg gcc tcc gct ctg tgg acc acc 2257
Ala Ser Val Pro Pro Leu Ser Ala Val Ala Ser Ala Leu Trp Thr Thr
740 745 750
gac tcc ggc tct tcc tgg atg ttg agg tgt tca gcg atg ccc agg agg 2305
Asp Ser Gly Ser Ser Trp Met Leu Arg Cys Ser Ala Met Pro Arg Arg
755 760 765
agt tcc agt gct gcc tca agg tgc cag tgg cat tgg cag gcc aca ctg 2353
Ser Ser Ser Ala Ala Ser Arg Cys Gln Trp His Trp Gln Ala Thr Leu
770 775 780
ggg agt tca tgt gcc ttg tgg ttg tgt ctg acc gca ggc tgt acc tgt 2401
Gly Ser Ser Cys Ala Leu Trp Leu Cys Leu Thr Ala Gly Cys Thr Cys
785 790 795 800
tgaaggtgac tggggagatg cgtgagcctc cagctagctg gctgcagctg accctggctg 2461
ttcccctgca ggatctgagt ggcatagagc tgggcctggc aggccagagc ctgcggctag 2521
agtgggcagc tggggcgggc cgctgtgtgc tgctgccccg agatgccagg cattgccggg 2581
ccttcctaga ggagctcctt gatgtcttgc agtctctgcc ccctgcctgg aggaactgtg 2641
tcagtgccac agaggaggag gtcacccccc agcaccggct ctggccattg ctggaaaaag 2701
actcatcctt ggaggctcgc cagttcttct accttcgggc gttcctggtt gaaggtgaag 2761
cctctgtgca gctgatgctt ccctggtctc tgtaccccac cttgtcacag gcactggccc 2821
acgcgcagca cctgtacagt gcccttgcag caactgtcag ttcctggaac ctggagaacc 2881
tctcacttga gggcatagct gagtaacagc ctcagtgtgc cctgaccctc tgggtggggt 2941
taggaggcac ccaggatcct cagttaccca gagcctttgc tctcgggccc aggctttcta 3001
tgaagtggag cagcgtggct gagtcggccc tgcctcatca gtcacagaag gagagggtgg 3061
agccttgttt ctgagtaggg tgggtgtggg gcagtggggg gcggtaggcc tcagaggtat 3121
tccagagtag agattcatgg agcttgggaa agggagggtc ttggccccag gggcacaggc 3181
tgctttgtga cccacctcag agtgtggttc acacctcttc tccccagcag cactgggacg 3241
ctggccccag ggatctgggc ccctccatga ccttccacac tggatgcctc tttccctgca 3301
ggcccttcca cctgcctcgt atccctgttg ctgactccgt ccaccctgtt cctgttagat 3361
gaggatgctg cagggtcccc ggcagagccc tctcctccag cagcatctgg cgaagcctct 3421
gagaaggtgc ctccctcggg gccgggccct gctgtgcgtg tcagggagca gcagccactc 3481
agcagcctga gctccgtgct gctctaccgc tcagcccctg aggacttgcg gctgctcttc 3541
tacgatgagg tgtcccggct ggagagcttt tgggcactcc gtgtggtgtg tcaggagcag 3601
ctgacagccc tgcttgcctg gatccgggaa ccatgggagg agctgttttc catcggactc 3661
cggacagtga tccaagaggc gctggccctt gaccgatgag ggtcccacgc tgaccttggc 3721
cctgacctca ggagccacgc tgtagacatt ccctctcctg gtctctgggt ctggcttcca 3781
ggctctggct gtggatgtct tcagcctctg ggtgctggcc agtgaggtcc caaatgaccc 3841
agggcttaag ggagaggcga gagaatgatc tggcctcagg ggacaggcca cctggtcagg 3901
aggaatattt ttcctgcact ttttctcagg tatcaataaa gttgtttcca actcat 3957
<210> 19
<211> 3037
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(787)
<400> 19
g ttt gat ggt cca ccc cag cca gag aat ctg cgt acg agg ctc aca ggc 49
Phe Asp Gly Pro Pro Gln Pro Glu Asn Leu Arg Thr Arg Leu Thr Gly
1 5 10 15
ttt cag ctg cca gcc acc att gtt agt gca gcc acc acc ctc tct ctg 97
Phe Gln Leu Pro Ala Thr Ile Val Ser Ala Ala Thr Thr Leu Ser Leu
20 25 30
cgc ctc atc agc gac tat gca gtc agt gcc caa ggc ttc cac gcc acc 145
Arg Leu Ile Ser Asp Tyr Ala Val Ser Ala Gln Gly Phe His Ala Thr
35 40 45
tat gaa gtt ctc ccc agc cac aca tgt ggg aac cca ggg agg ctg ccc 193
Tyr Glu Val Leu Pro Ser His Thr Cys Gly Asn Pro Gly Arg Leu Pro
50 55 60
aat ggc atc cag cag ggt tca acc ttc aac ctc ggt gac aag gtc cgc 241
Asn Gly Ile Gln Gln Gly Ser Thr Phe Asn Leu Gly Asp Lys Val Arg
65 70 75 80
tac agc tgc aac ctt ggc ttc ttc ctg gag ggc cac gcc gtg ctc acc 289
Tyr Ser Cys Asn Leu Gly Phe Phe Leu Glu Gly His Ala Val Leu Thr
85 90 95
tgc cac gct ggc tct gag aac agc gcc acg tgg gac ttc ccc ctg cct 337
Cys His Ala Gly Ser Glu Asn Ser Ala Thr Trp Asp Phe Pro Leu Pro
100 105 110
tcc tgc aga gct gat gat gcc tgt ggt ggg acc ctg cgg ggc cag agt 385
Ser Cys Arg Ala Asp Asp Ala Cys Gly Gly Thr Leu Arg Gly Gln Ser
115 120 125
ggc atc atc tcc agc ccc cac ttc ccc tcg gag tac cat aac aat gcc 433
Gly Ile Ile Ser Ser Pro His Phe Pro Ser Glu Tyr His Asn Asn Ala
130 135 140
gac tgc aca tgg acc atc ctg gct gag ctg ggg gac acc atc gcc ctg 481
Asp Cys Thr Trp Thr Ile Leu Ala Glu Leu Gly Asp Thr Ile Ala Leu
145 150 155 160
gtg ttt att gac ttc cag ctg gag gat ggt tac gac ttt ctg gaa gtc 529
Val Phe Ile Asp Phe Gln Leu Glu Asp Gly Tyr Asp Phe Leu Glu Val
165 170 175
act ggg aca gaa ggc tcc tcc ctc tgg ttc acc gga gcc agc ctc cca 577
Thr Gly Thr Glu Gly Ser Ser Leu Trp Phe Thr Gly Ala Ser Leu Pro
180 185 190
gcc ccc gtt atc agc agc aag aac tgg ctg cga ctg cac ttc aca tcg 625
Ala Pro Val Ile Ser Ser Lys Asn Trp Leu Arg Leu His Phe Thr Ser
195 200 205
gat ggc aac cac cgg cag cgc gga ttc agt gcc caa tac caa gtc aag 673
Asp Gly Asn His Arg Gln Arg Gly Phe Ser Ala Gln Tyr Gln Val Lys
210 215 220
aag caa att gag ttg aag tct cga ggt gtg aag ctg atg ccc agc aaa 721
Lys Gln Ile Glu Leu Lys Ser Arg Gly Val Lys Leu Met Pro Ser Lys
225 230 235 240
gac aac agc cag aag acg tct gtg tgt aag tgt ccg ccc gcc agc ccg 769
Asp Asn Ser Gln Lys Thr Ser Val Cys Lys Cys Pro Pro Ala Ser Pro
245 250 255
cct gcc ggg cct ctt gct tgaggtcaca cgtgcaggca ctaaaccgtg 817
Pro Ala Gly Pro Leu Ala
260
gggaagccag atgatgctgg aatgactttc cttcaaccct gcttcctcca tcctgggctc 877
tgcgcttgga cacactgacc caacggccag agagcagccc tggggcctgc cctttgcctt 937
ctcctcctcc tcgtcccatt ttcctccttg tcctctttct ccctcacttc gcatttgaac 997
tgcagtccag ggagtcatta ctgctacatt tctctcttta ttttcctttt gaatcacttc 1057
cacctgctaa gacagtcctc cttcagctcc tcttttagga gtcagtaacc ttgggctggg 1117
atcttggggg ctgtagggcc ccagttccag ctcccaaggc cagattggga aagggaggca 1177
ggggaatgcc tagtaacctc ctccaagggc ccctggacct ggctgagcaa agagaggact 1237
ggggaagtgt gggggtgggt cctgcctcgg ggcgatgggg aggcactggg gggctctctt 1297
cttgggcgtg gggaggatga catctttcag aacatgttcc tgctaccagt gtttgtaagg 1357
ttaatagggg tctagttctg gggaatggtc tggaatgtgg gtcccagaaa cacctgatca 1417
gggcctgtgc taaggggctt ttgagctgaa gcccaggaag ggctggggaa ggttctgggc 1477
acctccacaa gactgggctt aaaagcaggg gaggtttttt tccagaaacc ttccactttc 1537
cttctgggcc gagtggtagg gagccttcct tggtcctcca aatgactgac ctggttttcc 1597
ttctccttcc ctccctttgc ccttctcctg catctctggt cctctctccc ttcacttttt 1657
ctcctcttgc cactttctgc tccatcgttc ttctcatcct tctgctcccg cctcttctct 1717
ccctgcttgt cttcctgtcc ctgacttgct cctcccaccc tccccattcc tccccttcca 1777
tctcacctcc atgtgctcac tttcctggag atgctagaag tacttattga ggagagaagg 1837
gcatagagtg gcaccggtgg ttttcgagct ctgtgccgat gtgtgtggcc gggagaacat 1897
caggagaaga ggcactttgg tctcatgatc cgactgaacc gagcctgggt tggggattgt 1957
ctgtggtctt tgaagttgtc tgagtggaga cagttgccac aggctgttgg atacagccag 2017
gaactggcat ctccagtcac caccctcagg gactggatgg cagagggcac cagaagagcc 2077
ttcttctgtg cttgactggg ggtttagggg aaaggacagg gtgctaggag agcccagggt 2137
caggccagga acctgcagag ggctcaagcc atggccctga ccctcacagg ctttttcttt 2197
ctgtagccac aggagaccca atgaactgag gcaggcacag tttaatttca cagtttgccc 2257
tggagccaca ggcaaactta aggaaatggc cctgacttgg gagatcaaat atttatccca 2317
actgtgtcct cagggagcat gttaccccag acactaatgt ttcagttttc acatctgtag 2377
aatggggatg caaaatccta ccctgccttc ctcccaagag tgttgccaga tctaagatgg 2437
gtcttgttga cctaggactt caggaggccc atggactagc caaaaaagta catgtgatgc 2497
tctgggtgca tttttctgga gacaggatgc agggttttca ttaagccgac taagaagatg 2557
ccaaaaactc cctcaggaag ataaagagaa accctgttct aatgaaagaa tgcaaatgct 2617
ctttgcaaaa taaatatata tatctctatt gagaagatat tagcctccat tctaagagtt 2677
cctcagaaag aatgggtgtg tgactccagc tgcctcagac ccaggtagcc acatgtctgc 2737
aggtggctgg cctcctgtgt tctgcccaaa ttccctgtgg acagctgccc tgaagttcat 2797
ggggagagtg gacccaactc tctacctagc catcagttct gaccacccac ccacctgtga 2857
ctccctgcgt tgggtagaag gtggtgcctt cctgagcaca ttgcggcttc tccctcgcca 2917
tagtggccct cccagagtag agtaaggatt actccccatt cctgtccctc agtagccaac 2977
aggaaagcaa tgctgtaaac acaagcagct ggttttgagt aggcagagaa gactggaatt 3037
<210> 20
<211> 2883
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(1967)
<400> 20
cc ttg tat gca ggg cag cgt ctg ccc caa cat ggg tat cct ggg cct 47
Leu Tyr Ala Gly Gln Arg Leu Pro Gln His Gly Tyr Pro Gly Pro
1 5 10 15
ccc cag gcc cag cca ctg ccc cga cag ggg gtc aag aga acc tac tct 95
Pro Gln Ala Gln Pro Leu Pro Arg Gln Gly Val Lys Arg Thr Tyr Ser
20 25 30
gag gtg tat cca ggg cag cag tat ctg caa gga ggc cag tat gca ccc 143
Glu Val Tyr Pro Gly Gln Gln Tyr Leu Gln Gly Gly Gln Tyr Ala Pro
35 40 45
agc acc gcc cag ttt gcg ccc agc cct ggg cag ccc cct gcc ccc tcc 191
Ser Thr Ala Gln Phe Ala Pro Ser Pro Gly Gln Pro Pro Ala Pro Ser
50 55 60
cct tcc tac cct ggg cac agg ctg ccc ctg cag cag ggc atg acc cag 239
Pro Ser Tyr Pro Gly His Arg Leu Pro Leu Gln Gln Gly Met Thr Gln
65 70 75
tcc ctg tcc gtg cct ggc ccc acg gga ctg cat tat aag cct acc cgt 287
Ser Leu Ser Val Pro Gly Pro Thr Gly Leu His Tyr Lys Pro Thr Arg
80 85 90 95
tcc atc ccg ggc tat ccc agt tcc cca ctg cca ggg aac ccc acg cca 335
Ser Ile Pro Gly Tyr Pro Ser Ser Pro Leu Pro Gly Asn Pro Thr Pro
100 105 110
ccc atg acc cca agc agc agc gtc cct tac atg tca cca aac caa gag 383
Pro Met Thr Pro Ser Ser Ser Val Pro Tyr Met Ser Pro Asn Gln Glu
115 120 125
gtc aag tct ccc ttc ttg cct gat ctc aag ccc aac ctc aac tcc ttg 431
Val Lys Ser Pro Phe Leu Pro Asp Leu Lys Pro Asn Leu Asn Ser Leu
130 135 140
cac tca tcg ccc tct gga agc ggg cct tgt gac gag ttg cgg ctg acc 479
His Ser Ser Pro Ser Gly Ser Gly Pro Cys Asp Glu Leu Arg Leu Thr
145 150 155
ttc cct gtg cgc gat ggg gtg gtc ctg gag ccc ttc cgc ctg cag cac 527
Phe Pro Val Arg Asp Gly Val Val Leu Glu Pro Phe Arg Leu Gln His
160 165 170 175
aac ctg gct gta agc aac cat gtc ttc cag ctg cga gac tca gtc tac 575
Asn Leu Ala Val Ser Asn His Val Phe Gln Leu Arg Asp Ser Val Tyr
180 185 190
aag acc ctg ata atg agg cct gac ctg gag ctg caa ttc aag tgc tac 623
Lys Thr Leu Ile Met Arg Pro Asp Leu Glu Leu Gln Phe Lys Cys Tyr
195 200 205
cac cac gag gac cgg cag atg aac acc aac tgg cca gcc tcg gtg cag 671
His His Glu Asp Arg Gln Met Asn Thr Asn Trp Pro Ala Ser Val Gln
210 215 220
gtc agc gtc aat gcc acg ccg ctc acc atc gag cgt ggc gac aac aag 719
Val Ser Val Asn Ala Thr Pro Leu Thr Ile Glu Arg Gly Asp Asn Lys
225 230 235
acc tcg cac aag cca ctc tac ctg aag cat gtg tgc cag cca ggc cgc 767
Thr Ser His Lys Pro Leu Tyr Leu Lys His Val Cys Gln Pro Gly Arg
240 245 250 255
aac acc atc cag atc acc gtc acc gcc tgc tgc tgc tcc cac ctc ttc 815
Asn Thr Ile Gln Ile Thr Val Thr Ala Cys Cys Cys Ser His Leu Phe
260 265 270
gtg ctg cag cta gtg cac cgc cca tcc gtc cgc tcg gtg ctg cag ggc 863
Val Leu Gln Leu Val His Arg Pro Ser Val Arg Ser Val Leu Gln Gly
275 280 285
ctc ctc aaa aag cgc ctc ctg cct gct gag cac tgc atc acc aag ata 911
Leu Leu Lys Lys Arg Leu Leu Pro Ala Glu His Cys Ile Thr Lys Ile
290 295 300
aag cgg aac ttc agc agc ggc acc atc cct ggc acc cct ggg ccc aac 959
Lys Arg Asn Phe Ser Ser Gly Thr Ile Pro Gly Thr Pro Gly Pro Asn
305 310 315
gga gag gac ggg gtg gag cag aca gct atc aag gtg tcc ctg aag tgc 1007
Gly Glu Asp Gly Val Glu Gln Thr Ala Ile Lys Val Ser Leu Lys Cys
320 325 330 335
ccc atc acc ttc cgc agg atc cag ctc cct gcc cga ggt cat gac tgt 1055
Pro Ile Thr Phe Arg Arg Ile Gln Leu Pro Ala Arg Gly His Asp Cys
340 345 350
cgc cac ata cag tgc ttt gac ctg gag tcg tac ctg cag ctc aac tgt 1103
Arg His Ile Gln Cys Phe Asp Leu Glu Ser Tyr Leu Gln Leu Asn Cys
355 360 365
gag cgg ggg act tgg agg tgt cct gtg tgc aac aag aca gct ttg ctg 1151
Glu Arg Gly Thr Trp Arg Cys Pro Val Cys Asn Lys Thr Ala Leu Leu
370 375 380
gag ggc ctg gag gtg gac cag tac atg ctg ggc atc ctg att tac att 1199
Glu Gly Leu Glu Val Asp Gln Tyr Met Leu Gly Ile Leu Ile Tyr Ile
385 390 395
cag aac tct gac tat gag gag atc acc atc gac ccc acg tgc agc tgg 1247
Gln Asn Ser Asp Tyr Glu Glu Ile Thr Ile Asp Pro Thr Cys Ser Trp
400 405 410 415
aag cca gtg ccc gtg aag cct gac atg cac atc aag gag gag ccg gat 1295
Lys Pro Val Pro Val Lys Pro Asp Met His Ile Lys Glu Glu Pro Asp
420 425 430
ggg cca gca ctg aag cgc tgc cgc acc gtg agc ccc gcc cac gtg ctc 1343
Gly Pro Ala Leu Lys Arg Cys Arg Thr Val Ser Pro Ala His Val Leu
435 440 445
atg ccc agc gtg atg gag atg atc gcc gcc ctg ggc ccc ggc gct gcc 1391
Met Pro Ser Val Met Glu Met Ile Ala Ala Leu Gly Pro Gly Ala Ala
450 455 460
ccc ttt gcc ccc ctg cag ccc ccc tca gtc cct gcc ccc agc gac tac 1439
Pro Phe Ala Pro Leu Gln Pro Pro Ser Val Pro Ala Pro Ser Asp Tyr
465 470 475
cct ggc cag ggt tcc agc ttc ctg ggg cct gga act ttc cct gag tcc 1487
Pro Gly Gln Gly Ser Ser Phe Leu Gly Pro Gly Thr Phe Pro Glu Ser
480 485 490 495
ttc cca ccc acc atg ccc agc acc cca acc ctt gct gag ttc acc ccg 1535
Phe Pro Pro Thr Met Pro Ser Thr Pro Thr Leu Ala Glu Phe Thr Pro
500 505 510
gga cca ccc ccc atc tcc tac cag tct gac att ccc agc agc ctc ctg 1583
Gly Pro Pro Pro Ile Ser Tyr Gln Ser Asp Ile Pro Ser Ser Leu Leu
515 520 525
act tca gag aag tct acc gcc tgc ctc cca agc cag atg gca cca gca 1631
Thr Ser Glu Lys Ser Thr Ala Cys Leu Pro Ser Gln Met Ala Pro Ala
530 535 540
ggt cac ctg gac ccc act cac aat cct ggg aca cca gga cta cac acc 1679
Gly His Leu Asp Pro Thr His Asn Pro Gly Thr Pro Gly Leu His Thr
545 550 555
tcc aac ctt ggg gcc cct cca ggt ccc cag ctg cac cat tca aac cct 1727
Ser Asn Leu Gly Ala Pro Pro Gly Pro Gln Leu His His Ser Asn Pro
560 565 570 575
ccc cca gcg tcc cgg cag tcc ttg ggc caa gcg agc tta gga cct acg 1775
Pro Pro Ala Ser Arg Gln Ser Leu Gly Gln Ala Ser Leu Gly Pro Thr
580 585 590
ggt gaa ctg gcc ttc agt cct gcc aca ggc gtg atg ggg ccc ccc agc 1823
Gly Glu Leu Ala Phe Ser Pro Ala Thr Gly Val Met Gly Pro Pro Ser
595 600 605
atg tct gga gcc ggg gag gcc cca gaa cca gct ctg gac ctg ctc ccg 1871
Met Ser Gly Ala Gly Glu Ala Pro Glu Pro Ala Leu Asp Leu Leu Pro
610 615 620
gaa ctg acc aac cct gat gag cta ctg tcc tac ttg ggc cca ccc gac 1919
Glu Leu Thr Asn Pro Asp Glu Leu Leu Ser Tyr Leu Gly Pro Pro Asp
625 630 635
ctc cct acg aac aac aat gac gac ctg ctt tct ctg ttt gag aac aac 1967
Leu Pro Thr Asn Asn Asn Asp Asp Leu Leu Ser Leu Phe Glu Asn Asn
640 645 650 655
tgatcctgtg tttaccccaa gcccggcggg gacacgctca cagatgtcac cacagccctg 2027
cccttcatgc ccagccccat gggacacccg gtggtctttc ccaaacctcc cccaaaacac 2087
acctggagcc agagccttct gccgccagcc ctgcccctga attggaagca gccctgtgct 2147
cgatgggagg ggctcccagg ccggcagccc ttgccacctc cctctgccaa gcctgctgct 2207
gcagaacggt ttttgctgag gtgcccctgc ccagccctgt ccagccttgt ccacacacac 2267
atctcacgcc cctggtctca cagcctcaca ccttgtcctt ccacccctgc ctgcccccac 2327
ccagcctgct tcttgtccag cattgatcct tctgtttcaa caactcctcc actgggcaga 2387
gctgggcatc tggcagggct ggctctgtcc cctgggcctt tggctccagt ggcccctgtg 2447
cccagcagtc cagctcttgg aacctcgctg aatggcagcc tcttgggggc ctggagctct 2507
ggcagcccag ccgtgtgtgg tgtcaggttc ctctccccac cccagcttca agcagaggcc 2567
tcggggtggg ggagctacaa agcacaacaa tgtacatagt gtagaaacac taacagctgg 2627
gagaggggag ccagctgtcc agccagcatg ttcctgttgt acagcccggt ctccctgcgc 2687
gctctgctcc ttcccatgtg cagacagatc agggaggaac ctccaatggg tgggggacag 2747
tgagcttcgg cctggccgag gtgggtgggt gggctctcag attcagctct gtgtaaagat 2807
tctctagcgg gctgcgctcc caagttcccc ttctctgtga aagtgaagaa ttagtacagc 2867
tgtgtttttt aaagcc 2883
<210> 21
<211> 3441
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(1516)
<400> 21
c ggg ctc acg ctc ttc ttc gtg gtg ctc ggc tct ctg tcg gtg caa gtg 49
Gly Leu Thr Leu Phe Phe Val Val Leu Gly Ser Leu Ser Val Gln Val
1 5 10 15
ttc agc ttc cgc tgg ttt gtg cac gat ttc agc acc gag gac agc gcc 97
Phe Ser Phe Arg Trp Phe Val His Asp Phe Ser Thr Glu Asp Ser Ala
20 25 30
acg gcc gct gct gcc tcc agc tgc ccg cag cct gga gcc gat tgc aag 145
Thr Ala Ala Ala Ala Ser Ser Cys Pro Gln Pro Gly Ala Asp Cys Lys
35 40 45
acg gtg gtc ggc ggt ggg tct gca gcc ggg gaa ggc gag gct cgt cct 193
Thr Val Val Gly Gly Gly Ser Ala Ala Gly Glu Gly Glu Ala Arg Pro
50 55 60
tcc acg ccg caa agg caa gca tct aac gcc agc aag agc aac atc gcc 241
Ser Thr Pro Gln Arg Gln Ala Ser Asn Ala Ser Lys Ser Asn Ile Ala
65 70 75 80
gcg gcc aac agc ggc agc aac agc agc ggg gct acc cgg gcc agt ggc 289
Ala Ala Asn Ser Gly Ser Asn Ser Ser Gly Ala Thr Arg Ala Ser Gly
85 90 95
aag cac agg tct gcg tcc tgc tcc ttc tgc atc tgg ctc ctg cag tca 337
Lys His Arg Ser Ala Ser Cys Ser Phe Cys Ile Trp Leu Leu Gln Ser
100 105 110
ctc atc cac atc ttg cag ctc ggg caa atc tgg aga tat ttc cac aca 385
Leu Ile His Ile Leu Gln Leu Gly Gln Ile Trp Arg Tyr Phe His Thr
115 120 125
ata tac tta ggt att cga agc cga cag agt ggg gag aat gac aga tgg 433
Ile Tyr Leu Gly Ile Arg Ser Arg Gln Ser Gly Glu Asn Asp Arg Trp
130 135 140
agg ttt tac tgg aaa atg gta tat gag tat gcg gat gtg agt atg ctg 481
Arg Phe Tyr Trp Lys Met Val Tyr Glu Tyr Ala Asp Val Ser Met Leu
145 150 155 160
cat ttg cta gcc acc ttt ctg gaa agt gct cca cag ctg gtc ctg cag 529
His Leu Leu Ala Thr Phe Leu Glu Ser Ala Pro Gln Leu Val Leu Gln
165 170 175
ctc tgc att atc gta cag act cat agc tta cag gcc ctc caa ggt ttc 577
Leu Cys Ile Ile Val Gln Thr His Ser Leu Gln Ala Leu Gln Gly Phe
180 185 190
aca gcg gca gct tcc ctc gtg tcc ctg gcc tgg gcc ttg gcc tcc tac 625
Thr Ala Ala Ala Ser Leu Val Ser Leu Ala Trp Ala Leu Ala Ser Tyr
195 200 205
cag aag gcc ctc cgg gac tct cga gat gac aag aag ccc atc agc tac 673
Gln Lys Ala Leu Arg Asp Ser Arg Asp Asp Lys Lys Pro Ile Ser Tyr
210 215 220
atg gcc gtc atc atc cag ttc tgc tgg cac ttc ttc acc atc gcc gcc 721
Met Ala Val Ile Ile Gln Phe Cys Trp His Phe Phe Thr Ile Ala Ala
225 230 235 240
agg gtc atc acg ttt gcc ctc ttt gcc tcg gtt ttc cag ctg tac ttt 769
Arg Val Ile Thr Phe Ala Leu Phe Ala Ser Val Phe Gln Leu Tyr Phe
245 250 255
ggg atc ttc atc gtc ctt cac tgg tgc atc atg acc ttc tgg atc gtc 817
Gly Ile Phe Ile Val Leu His Trp Cys Ile Met Thr Phe Trp Ile Val
260 265 270
cac tgt gag aca gaa ttc tgt atc acc aaa tgg gaa gag att gtg ttc 865
His Cys Glu Thr Glu Phe Cys Ile Thr Lys Trp Glu Glu Ile Val Phe
275 280 285
gac atg gtg gtg ggg att atc tat atc ttc agt tgg ttc aat gtc aag 913
Asp Met Val Val Gly Ile Ile Tyr Ile Phe Ser Trp Phe Asn Val Lys
290 295 300
gaa ggc agg aca cgc tgc agg cta ttc att tac tat ttt gtg atc ctt 961
Glu Gly Arg Thr Arg Cys Arg Leu Phe Ile Tyr Tyr Phe Val Ile Leu
305 310 315 320
ttg gaa aat aca gcc ttg agt gcc ctc tgg tac ctc tac aag gct ccc 1009
Leu Glu Asn Thr Ala Leu Ser Ala Leu Trp Tyr Leu Tyr Lys Ala Pro
325 330 335
cag att gca gac gca ttt gcc att cca gcg ctg tgt gtg gtg ttc agc 1057
Gln Ile Ala Asp Ala Phe Ala Ile Pro Ala Leu Cys Val Val Phe Ser
340 345 350
agc ttt tta act ggc gtt gtt ttt atg ctg atg tat tat gcc ttc ttt 1105
Ser Phe Leu Thr Gly Val Val Phe Met Leu Met Tyr Tyr Ala Phe Phe
355 360 365
cat ccc aat gga ccc aga ttc ggg cag tca cca agt tgt gct tgt gag 1153
His Pro Asn Gly Pro Arg Phe Gly Gln Ser Pro Ser Cys Ala Cys Glu
370 375 380
gac cca gcc gct gcc ttc act ttg ccc cca gac gtg gcc aca agc acc 1201
Asp Pro Ala Ala Ala Phe Thr Leu Pro Pro Asp Val Ala Thr Ser Thr
385 390 395 400
cta cgg tcc atc tcc aac aac cgc agt gtt gtc agc gac cgc gat cag 1249
Leu Arg Ser Ile Ser Asn Asn Arg Ser Val Val Ser Asp Arg Asp Gln
405 410 415
aaa ttc gca gag cgg gat ggg tgt gta cct gtc ttt caa gtg agg ccc 1297
Lys Phe Ala Glu Arg Asp Gly Cys Val Pro Val Phe Gln Val Arg Pro
420 425 430
act gcc cca tcc acc cca tca tct cgc cca cca cgg att gaa gaa tca 1345
Thr Ala Pro Ser Thr Pro Ser Ser Arg Pro Pro Arg Ile Glu Glu Ser
435 440 445
gtc att aaa att gac ttg ttc agg aat agg tac cca gca tgg gag aga 1393
Val Ile Lys Ile Asp Leu Phe Arg Asn Arg Tyr Pro Ala Trp Glu Arg
450 455 460
cat gtt ttg gac cga agc ctc cga aag gct att tta gct ttt gaa tgt 1441
His Val Leu Asp Arg Ser Leu Arg Lys Ala Ile Leu Ala Phe Glu Cys
465 470 475 480
tcc cca tct cct cca agg ctg cag tac aaa gat gat gcc ctt att cag 1489
Ser Pro Ser Pro Pro Arg Leu Gln Tyr Lys Asp Asp Ala Leu Ile Gln
485 490 495
gag cgg ttg gag tac gaa acc act tta taaagcaaaa ggagttgcag 1536
Glu Arg Leu Glu Tyr Glu Thr Thr Leu
500 505
gacccacaac atccagatga aggggtgaca gcagggctgt ggccataatg acacttcatc 1596
ctagagcagg gcagtgagcc gtgaagttcc tagtgggacc gtcatcacca ttatcatttg 1656
atcctgtcgg ctgggggcgg ctggtctcct tccaaagcag ctgcacccga gagtctctga 1716
ctccacctga aagaatgacg ctggcttaat aggactctcc attgctacca aactcctcct 1776
gcacggtctt gggtgcaccc accagagggt actactatta tggaaaaatt ttgcctccaa 1836
tcattagggt gtcttgatgg cgttaactga tctttccata aaaatagatt cagtcataca 1896
cacatacaca cactaacaca cataagttac accagtcctc tgtcaaaaaa gcttaggtga 1956
cttttcttga tgcaaagctc tgattcccac aggaatataa aaacaaagaa agagggaaac 2016
atccctcgag aaaaaaaata gtattgctta gaaaagaaac cattttctca tttggaaatc 2076
cataccatgt gtgaaaatcc tatccaacgg acagcaaacc caaatgttgt ctacacatgt 2136
gttagcattg atggagtggt tcattttcta cacatttcag gatttgtttt atattttaaa 2196
ttttcagttg cgaacatcct ttttgacaga aatcctatgc agcccatgta cggctttcaa 2256
caagaccaag gagctcaata acttcatgat gtaaattaaa tagtaatcat gattcagtat 2316
tcaattgcaa aaatgtaaca ggtacacaaa gaggaagtgg ggaaaaaggc aaaatgagag 2376
tctgattccc aggcatgtgc agcgcccatt gggacataac ggcagtgcgg cgcgagccag 2436
aggaatgggc tggaaccgga tctgtttcca gacgcagaat gagtggctct gtgtgaccat 2496
aggcagatgc tgactctgga agactccgtg ccactccttt ctagtgccaa acaccatcca 2556
accacaggac tgacgtggaa gccccaaaca actgagaatg agtggcatga gccccctaaa 2616
agcaggcgag agaacgagca atcaagttct ccactgtgta cagacttttc ctccccccaa 2676
tccaaggtca aagtgatgtg tcttttagag gctttgggac actttttagt aagtatgagc 2736
agacaaatgc aatgaatatg ctatgaaaaa acccttctga actgagagag ggcttatcac 2796
tatatccagc taagatttgt atttgaatca tctgtaaagt cgcactctta caacaagctt 2856
ctgggtttta aatacctccg tacagcaagt aaacgttccc cgctttctgt tctcagtgtc 2916
ctcggtcatg gtgcttttcg ttgcattaaa agtgccggtc aaactttgat agtatttttt 2976
tatagttggt gcagagtgga ataactcatg gattatttca atatttttgt aataaaaaat 3036
atagggtata cacataggca tcatcacatt ttttatagac ctggaatcgt ttaaaatact 3096
ttaagcatca taattacttg ggatgtcaga aactggtcca caaattccat cagcctgcct 3156
cagcagattg aaaacatttg tctcttgcaa gatcacccta ctttgcaagt tggtgccccc 3216
aggaacctgg ccaggggtgc tatcagaata tcaggtgaag agagaatcag cttaaataga 3276
aagggcttgt caagactggc caatgtttcc caggaaatca aagatgtaaa tgattacttt 3336
catccatcca ttataacaaa cctgaccaca gtggaagctg tcttaaactt ccttccctgg 3396
ttttatatta acccaactga tagattaagt attagtcaaa ccact 3441
<210> 22
<211> 3289
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(3144)
<400> 22
ggg aaa gga gtt caa atg atc ttt cac acc ttt cat ctt gag agt tcc 48
Gly Lys Gly Val Gln Met Ile Phe His Thr Phe His Leu Glu Ser Ser
1 5 10 15
cac gac tat tta ctg atc aca gag gat gga agt ttt tcc gag ccc gtt 96
His Asp Tyr Leu Leu Ile Thr Glu Asp Gly Ser Phe Ser Glu Pro Val
20 25 30
gcc agg ctc acc ggg tcg gtg ttg cct cat acg atc aag gca ggc ctg 144
Ala Arg Leu Thr Gly Ser Val Leu Pro His Thr Ile Lys Ala Gly Leu
35 40 45
ttt gga aac ttc act gcc cag ctt cgg ttt ata tca gac ttc tca att 192
Phe Gly Asn Phe Thr Ala Gln Leu Arg Phe Ile Ser Asp Phe Ser Ile
50 55 60
tcg tac gag ggc ttc aat atc aca ttt tca gaa tat gac ctg gag cca 240
Ser Tyr Glu Gly Phe Asn Ile Thr Phe Ser Glu Tyr Asp Leu Glu Pro
65 70 75 80
tgt gat gat cct gga gtc cct gcc ttc agc cga aga att ggt ttt cac 288
Cys Asp Asp Pro Gly Val Pro Ala Phe Ser Arg Arg Ile Gly Phe His
85 90 95
ttt ggt gtg gga gac tct ctg acg ttt tcc tgc ttc ctg gga tat cgt 336
Phe Gly Val Gly Asp Ser Leu Thr Phe Ser Cys Phe Leu Gly Tyr Arg
100 105 110
tta gaa ggt gcc acc aag ctt acc tgc ctg ggt ggg ggc cgc cgt gtg 384
Leu Glu Gly Ala Thr Lys Leu Thr Cys Leu Gly Gly Gly Arg Arg Val
115 120 125
tgg agt gca cct ctg cca agg tgt gtg gcc gaa tgt gga gca agt gtc 432
Trp Ser Ala Pro Leu Pro Arg Cys Val Ala Glu Cys Gly Ala Ser Val
130 135 140
aaa gga aat gaa gga aca tta ctg tct cca aat ttt cca tcc aat tat 480
Lys Gly Asn Glu Gly Thr Leu Leu Ser Pro Asn Phe Pro Ser Asn Tyr
145 150 155 160
gat aat aac cat gag tgt atc tat aaa ata gaa aca gaa gcc ggc aag 528
Asp Asn Asn His Glu Cys Ile Tyr Lys Ile Glu Thr Glu Ala Gly Lys
165 170 175
ggc atc cac ctt aga aca cga agc ttc cag ctg ttt gaa gga gat act 576
Gly Ile His Leu Arg Thr Arg Ser Phe Gln Leu Phe Glu Gly Asp Thr
180 185 190
cta aag gta tat gat gga aaa gac agt tcc tca cgt cca ctg ggc acg 624
Leu Lys Val Tyr Asp Gly Lys Asp Ser Ser Ser Arg Pro Leu Gly Thr
195 200 205
ttc act aaa aat gaa ctt ctg ggg ctg atc cta aac agc aca tcc aat 672
Phe Thr Lys Asn Glu Leu Leu Gly Leu Ile Leu Asn Ser Thr Ser Asn
210 215 220
cac cta tgg cta gag ttc aac acc aat gga tct gac acc gac caa ggt 720
His Leu Trp Leu Glu Phe Asn Thr Asn Gly Ser Asp Thr Asp Gln Gly
225 230 235 240
ttt caa ctc acc tat acc agt ttt gat ctg gta aaa tgt gag gat ccg 768
Phe Gln Leu Thr Tyr Thr Ser Phe Asp Leu Val Lys Cys Glu Asp Pro
245 250 255
ggc atc cct aac tac ggc tat agg atc cgt gat gaa ggc cac ttt acc 816
Gly Ile Pro Asn Tyr Gly Tyr Arg Ile Arg Asp Glu Gly His Phe Thr
260 265 270
gac act gta gtt ctg tac agt tgc aac ccg ggg tac gcc atg cat ggc 864
Asp Thr Val Val Leu Tyr Ser Cys Asn Pro Gly Tyr Ala Met His Gly
275 280 285
agc aac acc ctg acc tgt ttg agt gga gac agg aga gtg tgg gac aaa 912
Ser Asn Thr Leu Thr Cys Leu Ser Gly Asp Arg Arg Val Trp Asp Lys
290 295 300
cca cta cct tcg tgc ata gcg gaa tgt ggt ggt cag atc cat gca gcc 960
Pro Leu Pro Ser Cys Ile Ala Glu Cys Gly Gly Gln Ile His Ala Ala
305 310 315 320
aca tca gga cga ata ttg tcc cct ggc tat cca gct ccg tat gac aac 1008
Thr Ser Gly Arg Ile Leu Ser Pro Gly Tyr Pro Ala Pro Tyr Asp Asn
325 330 335
aac ctc cac tgc acc tgg att ata gag gca gac cca gga aag acc att 1056
Asn Leu His Cys Thr Trp Ile Ile Glu Ala Asp Pro Gly Lys Thr Ile
340 345 350
agc ctc cat ttc att gtt ttc gac acg gag atg gct cac gac atc ctc 1104
Ser Leu His Phe Ile Val Phe Asp Thr Glu Met Ala His Asp Ile Leu
355 360 365
aag gtc tgg gac ggg ccg gtg gac agt gac atc ctg ctg aag gag tgg 1152
Lys Val Trp Asp Gly Pro Val Asp Ser Asp Ile Leu Leu Lys Glu Trp
370 375 380
agt ggc tcc gcc ctt ccg gag gac atc cac agc acc ttc aac tca ctc 1200
Ser Gly Ser Ala Leu Pro Glu Asp Ile His Ser Thr Phe Asn Ser Leu
385 390 395 400
acc ctg cag ttc gac agc gac ttc ttc atc agc aag tct ggc ttc tcc 1248
Thr Leu Gln Phe Asp Ser Asp Phe Phe Ile Ser Lys Ser Gly Phe Ser
405 410 415
atc cag ttc tcc acc tca att gca gcc acc tgt aac gat cca ggt atg 1296
Ile Gln Phe Ser Thr Ser Ile Ala Ala Thr Cys Asn Asp Pro Gly Met
420 425 430
ccc caa aat ggc acc cgc tat gga gac agc aga gag gct gga gac acc 1344
Pro Gln Asn Gly Thr Arg Tyr Gly Asp Ser Arg Glu Ala Gly Asp Thr
435 440 445
gtc aca ttc cag tgt gac cct ggc tat cag ctc caa gga caa gcc aaa 1392
Val Thr Phe Gln Cys Asp Pro Gly Tyr Gln Leu Gln Gly Gln Ala Lys
450 455 460
atc acc tgt gtg cag ctg aat aac cgg ttc ttt tgg caa cca gac cct 1440
Ile Thr Cys Val Gln Leu Asn Asn Arg Phe Phe Trp Gln Pro Asp Pro
465 470 475 480
cct aca tgc ata gct gct tgt gga ggg aat ctg acg ggc cca gca ggt 1488
Pro Thr Cys Ile Ala Ala Cys Gly Gly Asn Leu Thr Gly Pro Ala Gly
485 490 495
gtt att ttg tca ccc aac tac cca cag ccg tat cct cct ggg aag gaa 1536
Val Ile Leu Ser Pro Asn Tyr Pro Gln Pro Tyr Pro Pro Gly Lys Glu
500 505 510
tgt gac tgg aga gta aaa gtg aac ccg gac ttt gtc atc gcc ttg ata 1584
Cys Asp Trp Arg Val Lys Val Asn Pro Asp Phe Val Ile Ala Leu Ile
515 520 525
ttc aaa agt ttc aac atg gag ccc agc tat gac ttc cta cac atc tat 1632
Phe Lys Ser Phe Asn Met Glu Pro Ser Tyr Asp Phe Leu His Ile Tyr
530 535 540
gaa ggg gaa gat tcc aac agc ccc ctc att ggg agt tac cag ggc tct 1680
Glu Gly Glu Asp Ser Asn Ser Pro Leu Ile Gly Ser Tyr Gln Gly Ser
545 550 555 560
cag gcc cca gaa aga ata gag agt agc gga aac agc ctg ttt ctg gca 1728
Gln Ala Pro Glu Arg Ile Glu Ser Ser Gly Asn Ser Leu Phe Leu Ala
565 570 575
ttt cgg agt gat gcc tcc gtg ggc ctt tca ggg ttc gcc att gaa ttt 1776
Phe Arg Ser Asp Ala Ser Val Gly Leu Ser Gly Phe Ala Ile Glu Phe
580 585 590
aaa gag aaa cca cgg gaa gct tgt ttt gac cca gga aat ata atg aat 1824
Lys Glu Lys Pro Arg Glu Ala Cys Phe Asp Pro Gly Asn Ile Met Asn
595 600 605
ggg aca aga gtt gga aca gac ttc aag ctt ggc tcc acc atc acc tac 1872
Gly Thr Arg Val Gly Thr Asp Phe Lys Leu Gly Ser Thr Ile Thr Tyr
610 615 620
cag tgt gac tct ggc tat aag att ctt gac ccc tca tcc atc acc tgt 1920
Gln Cys Asp Ser Gly Tyr Lys Ile Leu Asp Pro Ser Ser Ile Thr Cys
625 630 635 640
gtg att ggg gct gat ggg aaa ccc tcc tgg gac caa gtg ctg ccc tcc 1968
Val Ile Gly Ala Asp Gly Lys Pro Ser Trp Asp Gln Val Leu Pro Ser
645 650 655
tgc aat gct ccc tgt gga ggc cag tac acg gga tca gaa ggg gta gtt 2016
Cys Asn Ala Pro Cys Gly Gly Gln Tyr Thr Gly Ser Glu Gly Val Val
660 665 670
tta tca cca aac tac ccc cat aat tac aca gct ggt caa ata tgc ctc 2064
Leu Ser Pro Asn Tyr Pro His Asn Tyr Thr Ala Gly Gln Ile Cys Leu
675 680 685
tat tcc atc acg gta cca aag gaa ttc gtg gtc ttt gga cag ttt gcc 2112
Tyr Ser Ile Thr Val Pro Lys Glu Phe Val Val Phe Gly Gln Phe Ala
690 695 700
tat ttc cag aca gcc ctg aat gat ttg gca gaa tta ttt gat gga acc 2160
Tyr Phe Gln Thr Ala Leu Asn Asp Leu Ala Glu Leu Phe Asp Gly Thr
705 710 715 720
cat gca cag gcc aga ctt ctc agc tca ctc tcg ggg tct cac tca ggg 2208
His Ala Gln Ala Arg Leu Leu Ser Ser Leu Ser Gly Ser His Ser Gly
725 730 735
gaa aca ttg ccc ttg gct acg tca aat caa att ctg ctc cga ttc agt 2256
Glu Thr Leu Pro Leu Ala Thr Ser Asn Gln Ile Leu Leu Arg Phe Ser
740 745 750
gca aag agc ggt gcc tct gcc cgc ggc ttc cac ttc gtg tat caa gct 2304
Ala Lys Ser Gly Ala Ser Ala Arg Gly Phe His Phe Val Tyr Gln Ala
755 760 765
gtt cct cgt acc agt gac acc caa tgc agc tct gtc ccc gag ccc aga 2352
Val Pro Arg Thr Ser Asp Thr Gln Cys Ser Ser Val Pro Glu Pro Arg
770 775 780
tac gga agg aga att ggt tct gag ttt tct gcc ggc tcc atc gtc cga 2400
Tyr Gly Arg Arg Ile Gly Ser Glu Phe Ser Ala Gly Ser Ile Val Arg
785 790 795 800
ttc gag tgc aac ccg gga tac ctg ctt cag ggt tcc acg gcg ctc cac 2448
Phe Glu Cys Asn Pro Gly Tyr Leu Leu Gln Gly Ser Thr Ala Leu His
805 810 815
tgc cag tcc gtg ccc aac gcc ttg gca cag tgg aac gac acg atc ccc 2496
Cys Gln Ser Val Pro Asn Ala Leu Ala Gln Trp Asn Asp Thr Ile Pro
820 825 830
agc tgt gtg gta ccc tgc agt ggc aat ttc act caa cga aga ggt aca 2544
Ser Cys Val Val Pro Cys Ser Gly Asn Phe Thr Gln Arg Arg Gly Thr
835 840 845
atc ctg tcc ccc ggc tac cct gag cca tac gga aac aac ttg aac tgt 2592
Ile Leu Ser Pro Gly Tyr Pro Glu Pro Tyr Gly Asn Asn Leu Asn Cys
850 855 860
ata tgg aag atc ata gtt acg gag ggc tcg gga att cag atc caa gtg 2640
Ile Trp Lys Ile Ile Val Thr Glu Gly Ser Gly Ile Gln Ile Gln Val
865 870 875 880
atc agt ttt gcc acg gag cag aac tgg gac tcc ctt gag atc cac gat 2688
Ile Ser Phe Ala Thr Glu Gln Asn Trp Asp Ser Leu Glu Ile His Asp
885 890 895
ggt ggg gat gtg acc gca ccc aga ctg gga agc ttc tca ggc acc aca 2736
Gly Gly Asp Val Thr Ala Pro Arg Leu Gly Ser Phe Ser Gly Thr Thr
900 905 910
gta ccg gca ctg ctg aac agt act tcc aac caa ctc tac ctg cat ttc 2784
Val Pro Ala Leu Leu Asn Ser Thr Ser Asn Gln Leu Tyr Leu His Phe
915 920 925
cag tct gac att agt gtg gca gct gct ggt ttc cac ctg gaa tac aaa 2832
Gln Ser Asp Ile Ser Val Ala Ala Ala Gly Phe His Leu Glu Tyr Lys
930 935 940
act gta ggt ctt gct gca tgc caa gaa cca gcc ctc ccc agc aac agc 2880
Thr Val Gly Leu Ala Ala Cys Gln Glu Pro Ala Leu Pro Ser Asn Ser
945 950 955 960
atc aaa atc gga gat cgg tac atg gtg aac gac gtg ctc tcc ttc cag 2928
Ile Lys Ile Gly Asp Arg Tyr Met Val Asn Asp Val Leu Ser Phe Gln
965 970 975
tgc gag ccc ggg tac acc ctg cag ggc cgt tcc cac att tcc tgt atg 2976
Cys Glu Pro Gly Tyr Thr Leu Gln Gly Arg Ser His Ile Ser Cys Met
980 985 990
cca ggg acc gtt cgc cgt tgg aac tat ccg tct ccc ctg tgc att gca 3024
Pro Gly Thr Val Arg Arg Trp Asn Tyr Pro Ser Pro Leu Cys Ile Ala
995 1000 1005
acc tgt gga ggg acg ctg agc acc ttg ggt ggt gtg atc ctg agc ccc 3072
Thr Cys Gly Gly Thr Leu Ser Thr Leu Gly Gly Val Ile Leu Ser Pro
1010 1015 1020
ggc ttc cca ggt tct tac ccc aac aac tta gac tgc acc tgg agg atc 3120
Gly Phe Pro Gly Ser Tyr Pro Asn Asn Leu Asp Cys Thr Trp Arg Ile
1025 1030 1035 1040
tca tta ccc atc ggc tat ggt aag tgaaaatgtt accaattcct aaaagttcat 3174
Ser Leu Pro Ile Gly Tyr Gly Lys
1045
ttttttcatt tccatcttgg aaagaattaa gcagaataaa atgccttatt ctgttggcat 3234
aactgtatct gtaattgaaa gtggcgtaca gatttaaata aagctttggc agagt 3289
<210> 23
<211> 3373
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(2341)
<400> 23
a aaa gat gca agt atg acc caa gcc ctt tgc aga atg att gac tgg cta 49
Lys Asp Ala Ser Met Thr Gln Ala Leu Cys Arg Met Ile Asp Trp Leu
1 5 10 15
tcc tgg cca ttg gct cag cat gtg gat aca tgg gta att gca ctc ctg 97
Ser Trp Pro Leu Ala Gln His Val Asp Thr Trp Val Ile Ala Leu Leu
20 25 30
aaa gga ctg gca gct gtc cag aag ttt act att ttg ata gat gtt act 145
Lys Gly Leu Ala Ala Val Gln Lys Phe Thr Ile Leu Ile Asp Val Thr
35 40 45
ttg ctg aaa ata gaa ctg gtt ttt aat cga ctt tgg ttt cct ctt gtg 193
Leu Leu Lys Ile Glu Leu Val Phe Asn Arg Leu Trp Phe Pro Leu Val
50 55 60
aga cct ggt gct ctt gca gtt ctt tct cac atg ctg ctt agc ttt cag 241
Arg Pro Gly Ala Leu Ala Val Leu Ser His Met Leu Leu Ser Phe Gln
65 70 75 80
cat tct cca gag gcg ttc cat ttg att gtt cct cat gtg gtt aat ttg 289
His Ser Pro Glu Ala Phe His Leu Ile Val Pro His Val Val Asn Leu
85 90 95
gtt cat tct ttc aaa aat gat ggt ctg cct tca agt aca gcc ttc tta 337
Val His Ser Phe Lys Asn Asp Gly Leu Pro Ser Ser Thr Ala Phe Leu
100 105 110
gta caa tta aca gaa ttg ata cac tgt atg atg tat cat tat tct gga 385
Val Gln Leu Thr Glu Leu Ile His Cys Met Met Tyr His Tyr Ser Gly
115 120 125
ttt cca gat ctc tat gaa cct att ctg gag gca ata aag gat ttt cct 433
Phe Pro Asp Leu Tyr Glu Pro Ile Leu Glu Ala Ile Lys Asp Phe Pro
130 135 140
aag ccc agt gaa gag aag att aag tta att ctc aat caa agt gcc tgg 481
Lys Pro Ser Glu Glu Lys Ile Lys Leu Ile Leu Asn Gln Ser Ala Trp
145 150 155 160
act tct caa tcc aat tct ttg gcg tct tgc ttg tct aga ctt tct gga 529
Thr Ser Gln Ser Asn Ser Leu Ala Ser Cys Leu Ser Arg Leu Ser Gly
165 170 175
aaa tct gaa act ggg aaa act ggt ctt att aac cta gga aat aca tgt 577
Lys Ser Glu Thr Gly Lys Thr Gly Leu Ile Asn Leu Gly Asn Thr Cys
180 185 190
tat atg aac agt gtt ata caa gcc ttg ttt atg gcc aca gat ttc agg 625
Tyr Met Asn Ser Val Ile Gln Ala Leu Phe Met Ala Thr Asp Phe Arg
195 200 205
aga caa gta tta tct tta aat cta aat ggg tgc aat tca tta atg aaa 673
Arg Gln Val Leu Ser Leu Asn Leu Asn Gly Cys Asn Ser Leu Met Lys
210 215 220
aaa tta cag cat ctt ttt gcc ttt ctg gcc cat aca cag agg gaa gca 721
Lys Leu Gln His Leu Phe Ala Phe Leu Ala His Thr Gln Arg Glu Ala
225 230 235 240
tac gca cct cgg ata ttc ttt gag gct tcc aga cct cca tgg ttt act 769
Tyr Ala Pro Arg Ile Phe Phe Glu Ala Ser Arg Pro Pro Trp Phe Thr
245 250 255
ccc aga tca cag caa gac tgt tct gaa tac ctc aga ttt ctc ctt gac 817
Pro Arg Ser Gln Gln Asp Cys Ser Glu Tyr Leu Arg Phe Leu Leu Asp
260 265 270
agg ctc cat gaa gaa gaa aag atc ttg aaa gtt cag gcc tca cac aag 865
Arg Leu His Glu Glu Glu Lys Ile Leu Lys Val Gln Ala Ser His Lys
275 280 285
cct tct gaa att ctg gaa tgc agt gaa act tct tta cag gaa gta gct 913
Pro Ser Glu Ile Leu Glu Cys Ser Glu Thr Ser Leu Gln Glu Val Ala
290 295 300
agt aaa gca gca gta cta aca gag acc cct cgt aca agt gac ggt gag 961
Ser Lys Ala Ala Val Leu Thr Glu Thr Pro Arg Thr Ser Asp Gly Glu
305 310 315 320
aag act tta ata gaa aaa atg ttt gga gga aaa cta cga act cac ata 1009
Lys Thr Leu Ile Glu Lys Met Phe Gly Gly Lys Leu Arg Thr His Ile
325 330 335
cgt tgt ttg aac tgc agg agt acc tca caa aaa gtg gaa gcc ttt aca 1057
Arg Cys Leu Asn Cys Arg Ser Thr Ser Gln Lys Val Glu Ala Phe Thr
340 345 350
gat ctt tcg ctt gcc ttt tgt cct tcc tct tct ttg gaa aac atg tct 1105
Asp Leu Ser Leu Ala Phe Cys Pro Ser Ser Ser Leu Glu Asn Met Ser
355 360 365
gtc caa gat cca gca tca tca ccc agt ata caa gat ggt ggt cta atg 1153
Val Gln Asp Pro Ala Ser Ser Pro Ser Ile Gln Asp Gly Gly Leu Met
370 375 380
caa gcc tct gta ccc ggt cct tca gaa gaa cca gta gtt tat aat cca 1201
Gln Ala Ser Val Pro Gly Pro Ser Glu Glu Pro Val Val Tyr Asn Pro
385 390 395 400
aca aca gct gcc ttc atc tgt gac tca ctt gtg aat gaa aaa acc ata 1249
Thr Thr Ala Ala Phe Ile Cys Asp Ser Leu Val Asn Glu Lys Thr Ile
405 410 415
ggc agt cct cct aat gag ttt tac tgt tct gaa aac act tct gtc cct 1297
Gly Ser Pro Pro Asn Glu Phe Tyr Cys Ser Glu Asn Thr Ser Val Pro
420 425 430
aac gaa tct aac aag att ctt gtt aat aaa gat gta cct cag aaa cca 1345
Asn Glu Ser Asn Lys Ile Leu Val Asn Lys Asp Val Pro Gln Lys Pro
435 440 445
gga ggt gaa acc aca cct tca gta act gac tta cta aat tat ttt ttg 1393
Gly Gly Glu Thr Thr Pro Ser Val Thr Asp Leu Leu Asn Tyr Phe Leu
450 455 460
gct cca gag att ctt act ggt gat aac caa tat tat tgt gaa aac tgt 1441
Ala Pro Glu Ile Leu Thr Gly Asp Asn Gln Tyr Tyr Cys Glu Asn Cys
465 470 475 480
gcc tct ctg caa aat gct gag aaa act atg caa atc acg gag gaa cct 1489
Ala Ser Leu Gln Asn Ala Glu Lys Thr Met Gln Ile Thr Glu Glu Pro
485 490 495
gaa tac ctt att ctt act ctc ctg aga ttt tca tat gat cag aag tat 1537
Glu Tyr Leu Ile Leu Thr Leu Leu Arg Phe Ser Tyr Asp Gln Lys Tyr
500 505 510
cat gtg aga agg aaa att tta gac aat gta tca ctg cca ctg gtt ttg 1585
His Val Arg Arg Lys Ile Leu Asp Asn Val Ser Leu Pro Leu Val Leu
515 520 525
gag ttg cca gtt aaa aga att act tct ttc tct tca ttg tca gaa agt 1633
Glu Leu Pro Val Lys Arg Ile Thr Ser Phe Ser Ser Leu Ser Glu Ser
530 535 540
tgg tct gta gat gtt gac ttc act gat ctt agt gag aac ctt gct aaa 1681
Trp Ser Val Asp Val Asp Phe Thr Asp Leu Ser Glu Asn Leu Ala Lys
545 550 555 560
aaa tta aag cct tca ggg act gat gaa gct tcc tgc aca aaa ttg gtg 1729
Lys Leu Lys Pro Ser Gly Thr Asp Glu Ala Ser Cys Thr Lys Leu Val
565 570 575
ccc tat cta tta agt tcc gtt gtg gtt cac tct ggt ata tcc tct gaa 1777
Pro Tyr Leu Leu Ser Ser Val Val Val His Ser Gly Ile Ser Ser Glu
580 585 590
agt ggg cat tac tat tct tat gcc aga aat atc aca agt aca gac tct 1825
Ser Gly His Tyr Tyr Ser Tyr Ala Arg Asn Ile Thr Ser Thr Asp Ser
595 600 605
tca tat cag atg tac cac cag tct gag gct ctg gca tta gca tcc tcc 1873
Ser Tyr Gln Met Tyr His Gln Ser Glu Ala Leu Ala Leu Ala Ser Ser
610 615 620
cag agt cat tta cta ggg aga gat agt ccc agt gca gtt ttt gaa cag 1921
Gln Ser His Leu Leu Gly Arg Asp Ser Pro Ser Ala Val Phe Glu Gln
625 630 635 640
gat ttg gaa aat aag gaa atg tca aaa gaa tgg ttt tta ttt aat gac 1969
Asp Leu Glu Asn Lys Glu Met Ser Lys Glu Trp Phe Leu Phe Asn Asp
645 650 655
agt aga gtg aca ttt act tca ttt cag tca gtc cag aaa att acg agc 2017
Ser Arg Val Thr Phe Thr Ser Phe Gln Ser Val Gln Lys Ile Thr Ser
660 665 670
agg ttt cca aag gac aca gct tat gtg ctt ttg tat aaa aaa cag cat 2065
Arg Phe Pro Lys Asp Thr Ala Tyr Val Leu Leu Tyr Lys Lys Gln His
675 680 685
agt act aat ggt tta agt ggt aat aac cca acc agt gga ctc tgg ata 2113
Ser Thr Asn Gly Leu Ser Gly Asn Asn Pro Thr Ser Gly Leu Trp Ile
690 695 700
aat gga gac cca cct cta cag aaa gaa ctt atg gat gct ata aca aaa 2161
Asn Gly Asp Pro Pro Leu Gln Lys Glu Leu Met Asp Ala Ile Thr Lys
705 710 715 720
gac aat aaa cta tat tta cag gaa caa gag ttg aat gct cga gcc cgg 2209
Asp Asn Lys Leu Tyr Leu Gln Glu Gln Glu Leu Asn Ala Arg Ala Arg
725 730 735
gcc ctc caa gct gca tct gct tca tgt tca ttt cgg ccc aat gga ttt 2257
Ala Leu Gln Ala Ala Ser Ala Ser Cys Ser Phe Arg Pro Asn Gly Phe
740 745 750
gat gac aac gac cca cca gga agc tgt gga cca act ggt gga ggg ggt 2305
Asp Asp Asn Asp Pro Pro Gly Ser Cys Gly Pro Thr Gly Gly Gly Gly
755 760 765
gga gga gga ttt aat aca gtt ggc aga ctc gta ttt tgatcctgag 2351
Gly Gly Gly Phe Asn Thr Val Gly Arg Leu Val Phe
770 775 780
agagtccaaa atgcactggt cacgaaacgt ctaatactat gactgttaaa atgtcagact 2411
ataacaaata tctatctttt atttttcatt agacccttat acttcaagag aacacactca 2471
gtgcttgttt ttattttctt gacacattta ttaacaaaat gcatcatgga aaaaaaatct 2531
acctcttaaa attccatttg cttttatggt tagacatgct tgaccaaaaa tgttcagaag 2591
aaaatatgta cctggtccct aattaagctg cgttaaattt ggtagaagca tttaaatggt 2651
ctatcttcag ttttactgaa caaaaaatgt aatttattta gcattcttta taaaagaatt 2711
gatgctagag gtaaaaaaaa atacttgttt ttaaaaaatc ctttacgtct tgtgtaatta 2771
ccccattatt aaattcaagt ccttgaaaat caactagaga ttataaagtc tctaaagaag 2831
gcaataacaa aatttatcaa gatatagtac ttttcagttt ttgtttagtg tcttcagcat 2891
cactgtgtct gtatttcaag tacaaatgtt tttaaaaagg attctttata catatgtgct 2951
gaattgattt taagaaagtt gcatgatcct gtaggagcaa catttttacc taaaaaatgc 3011
taactttata gtatttctaa ttgttcaagg attttaaaat tctatttcag ggagtatatc 3071
ttctgtggtt ttgaaggagg tgagttctgt atgtgccttg cagtactgta attcaaaaat 3131
aggaatcttt ggctgcaaaa ttttaatgaa atgttaggaa gtaattttcg tgctaacatt 3191
aaaattataa ctttttgaaa ggtaatagat tttccagaag taaaatctga tggttctaaa 3251
tcaatcaatg tgatagttca tttttaactc ttagaagaat tcagaggaaa ttaacccagc 3311
taagtaaaaa atctgtcttg attttgttac ttattcctca gaatattaaa cattgatcac 3371
at 3373
<210> 24
<211> 3439
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(1032)
<400> 24
gga gaa gct agg aag aaa atg gcg gcc gtg gct gca gag gcg gca gcg 48
Gly Glu Ala Arg Lys Lys Met Ala Ala Val Ala Ala Glu Ala Ala Ala
1 5 10 15
act gca gcg tcc ccc ggg gag ggg ggc gcc ggc gag gcc gag ccg gag 96
Thr Ala Ala Ser Pro Gly Glu Gly Gly Ala Gly Glu Ala Glu Pro Glu
20 25 30
atg gag ccc atc ccc ggc agt gag gcc ggc act gac ccc ctc ccg gtc 144
Met Glu Pro Ile Pro Gly Ser Glu Ala Gly Thr Asp Pro Leu Pro Val
35 40 45
acg gcc act gaa gcg tct gtg ccg gat ggc gag act gac ggg cag caa 192
Thr Ala Thr Glu Ala Ser Val Pro Asp Gly Glu Thr Asp Gly Gln Gln
50 55 60
tcc gct cct cag gcc gac gag ccg ccg ctc ccg ccg cca ccg ccg ccg 240
Ser Ala Pro Gln Ala Asp Glu Pro Pro Leu Pro Pro Pro Pro Pro Pro
65 70 75 80
ccg ggg gag ctc gcc cgc agc cca gag gcg gtg ggg ccg gag ctg gag 288
Pro Gly Glu Leu Ala Arg Ser Pro Glu Ala Val Gly Pro Glu Leu Glu
85 90 95
gct gag gag aaa ctg tcc gtt cgg gtg gcg gag tcg gcg gca gcc gcg 336
Ala Glu Glu Lys Leu Ser Val Arg Val Ala Glu Ser Ala Ala Ala Ala
100 105 110
cct cag gga ggg ccg gaa ctt cca cct tct cct gca tcg ccg ccg gag 384
Pro Gln Gly Gly Pro Glu Leu Pro Pro Ser Pro Ala Ser Pro Pro Glu
115 120 125
cag ccc ccg gct ccc gag gag cgc gag gag ccg ccg ctg cct cag ccc 432
Gln Pro Pro Ala Pro Glu Glu Arg Glu Glu Pro Pro Leu Pro Gln Pro
130 135 140
gta gcc ccg gcg ctc gtg ccg ccg gcg ggc ggg gac tcc acg gtg tcg 480
Val Ala Pro Ala Leu Val Pro Pro Ala Gly Gly Asp Ser Thr Val Ser
145 150 155 160
caa ctg atc ccg ggc tcg gag gtg cgg gtc acg ctg gac cac atc att 528
Gln Leu Ile Pro Gly Ser Glu Val Arg Val Thr Leu Asp His Ile Ile
165 170 175
gag gac gcg ctt gtc gtg tcg ttc cgc ttc ggg gag aag ctc ttc tcc 576
Glu Asp Ala Leu Val Val Ser Phe Arg Phe Gly Glu Lys Leu Phe Ser
180 185 190
ggg gtc ctc atg gat ctg tcc aaa agg tac cgc gcc cgc ccc cag cac 624
Gly Val Leu Met Asp Leu Ser Lys Arg Tyr Arg Ala Arg Pro Gln His
195 200 205
cct ccg gtc cgc tgg gtc ccc agg gag gga gtg agc ccg ggc ggc ccc 672
Pro Pro Val Arg Trp Val Pro Arg Glu Gly Val Ser Pro Gly Gly Pro
210 215 220
ctc tcc ccg agc cct cgc ttc ccc gct gcc cgg ggc ccc gcc ctc tgg 720
Leu Ser Pro Ser Pro Arg Phe Pro Ala Ala Arg Gly Pro Ala Leu Trp
225 230 235 240
tcc tgc gcg gcg cgt ggc gcg cgc tcc gag tgg ggc agg cgc gcg aga 768
Ser Cys Ala Ala Arg Gly Ala Arg Ser Glu Trp Gly Arg Arg Ala Arg
245 250 255
ccg gga gag ggc ggt ccc ggc ccc agg tgg gcg cct gcg ggg cag gtg 816
Pro Gly Glu Gly Gly Pro Gly Pro Arg Trp Ala Pro Ala Gly Gln Val
260 265 270
tgc cac cag gtg gga ggg tgc agg tgc aga tgg gct cac agg tct cac 864
Cys His Gln Val Gly Gly Cys Arg Cys Arg Trp Ala His Arg Ser His
275 280 285
ccc tca cgc ctg gtc ttc ttc tgc gac ttg gcc tgg agt cac cgt cac 912
Pro Ser Arg Leu Val Phe Phe Cys Asp Leu Ala Trp Ser His Arg His
290 295 300
tgg gcg ggt ggg tta atg caa gat gca gat gga ttt ttg cag aga acg 960
Trp Ala Gly Gly Leu Met Gln Asp Ala Asp Gly Phe Leu Gln Arg Thr
305 310 315 320
gag tgt tgg acg ggg gga cgg ggg ggc ggg ggg cat tgc ttg cga gga 1008
Glu Cys Trp Thr Gly Gly Arg Gly Gly Gly Gly His Cys Leu Arg Gly
325 330 335
ggc act tgt ctg tcc tct aac gtt taactttagc gggattggag aaaggggtgt 1062
Gly Thr Cys Leu Ser Ser Asn Val
340
gtacattggg gttcccaaaa ggttacctgt cactccttac agggggaaga gcatttgtta 1122
aaatgaacct tttaactgta attcttacgg ttagaagtca ttcccgttag gttaaccatt 1182
caacagagaa gcatattttg ctaattttaa tgatagttta agaaaagttt aaagaagaga 1242
ttggagattt ttaaaaaatc ctttaatgtc aaggtggtgt agcataaaaa tgttcaagta 1302
acttctaaca cctgggattc agtttatgta ttaaggacaa gaaggttgag ttcatctaca 1362
aatttggctt aaccacattt tcattttttg atttaaaatg tttttgggct ctgctttcaa 1422
caggggaaga gaacctgttc taagtctgga gttaaccttt ttattctctt cttttgcaga 1482
tatgaagtgc agagactccc attcacggct ttccattctg tttcctgcaa cagcacacgc 1542
tctgatcgta tctcacctgt ttgcaatggc ttattagtat gttttactct taaacttgct 1602
tgctctccca tcaagaccca ggaatccttc tttcccattt ggggaaaaaa gccatttaaa 1662
atttcatatt gaagtatggt gtttgagggg aaagatgaga gcagttattc tgtattattt 1722
aatctcaaat atcttttttt ttttttttga gacggagtct cactctgtca cccaggctgg 1782
agtgcagtgg cgccatatcg tctcactgca acctccgcct cctgggttca agcgattctc 1842
ccgcctcagc ctcccaagta gctgggacta caggcgtgcg ccaccatgct ggtctaattt 1902
ttgtattttt taactagagc cgggatttca ccatgttggc caggctggtc ttgaaatcct 1962
gacctcaggt tatccacccg cctcagcctc ccaaagggct gggattacag gcatgagccg 2022
ccgtgcccgg cctcaagtat cttttttaaa aataattttg gattagggca tttgcacttc 2082
tgtttatgtt tgtttgttta cttatttatt agagacaggg tcttgctgtg ctgcctaggc 2142
tggtctcaga ctcctgggct caagcgatcc tactgccttg gccttccaaa gtgctgaaat 2202
tacaggcgtg agccactgtg cccggcctgc actcctgttt tcataattag aaagcagaat 2262
aaggtgagtg tgctttctat tgatgtattt tatttattta tttatttatt tatttttgag 2322
acagagtttc gctcttgttg tccaggctgc agtggtacaa tctctgctca ctgcaccctc 2382
cacctccctg gttcaagtga ttctcctacc tcagcctccc aagtagctgg gattacaggc 2442
atgtgccacc acgcctggct aattttgtat ttttagtaga gatggggttt caccatgttg 2502
gtcaggttgg tctcaaactc ctgacctcag gtgatccacc caccttggcc tcccaaagtg 2562
ctgagattac aggcatgagc cacctcgccc ggcctctact gatgtatata atagtaagag 2622
acactctggt gtgtatgctg ttgaatgaaa cagaagagat ggccagtgat tgtgtgagaa 2682
tctggatttc aaatctgcaa aaggaaggtg cagaaggttt gggccccatg gtatccctgt 2742
gacagtattt cccaaaaggg aatataagga taaaccagaa gccatgccgc tccaaagtaa 2802
tacattccaa gaagggacag aagtcaagtg tgaagcaaat ggtgctgttc ccgatgaccc 2862
ttctcctgtc ccgcatcccg agctgagctt ggctgaaagc ctgtggactt ccaaaccacc 2922
acctctcttc catgaaggag caccttatcc tccccctttg tttatcaggg acacatataa 2982
ccaatcaata cctcagccac ctcctcggaa aattaagcga cccaaacgaa aaatgtacag 3042
ggaagaaccc acttcaataa tgaatgctat taaactacga cccaggcaag ttctgtgtga 3102
taaatgtaaa aacagtgttg ttgctgaaaa aaaggaaatt agaaaaggta gtagtgcaac 3162
tgactcttct aaatatgaag ataaaaaacg gagaaatgaa agtgtaacta ctgtgaacaa 3222
aaaactgaaa actgaccata aagtggatgg gaaaaaccaa aatgaaagcc agaaaagaaa 3282
tgctgtggtt aaggtttcaa atattgctca cagcagaggc agagtagtaa aagtttctgc 3342
tcaggcaaat acatcaaaag ctcagttaag tactaaaaaa gttctccaga gtaagaacat 3402
ggatcatgcg aaagctcggg aagtgttaaa aattgcc 3439
<210> 25
<211> 3083
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (777)..(2366)
<400> 25
cgctcagccc gggcgggcga tgcgggcggc gcgggcggcc ccctcccccg gcccgcgtct 60
ccgggacggc tgcgggcggc ccccccggcg gccggagggc tccctggccc cgatctgacg 120
gcggcggcgg cggcggccac agcggcggga gcggcgcggg gaaggagcag cggctcgcag 180
ccctcggccc gcgcccccac ccagcgccag cccgaggggg gaggcgcagc gccggagggt 240
ggcggtcctc ggccctccca ggtctccgcg ccgggaagcc gctccgagcc ggggctggag 300
ggttgttttg ccgttgtgtt gagcacgtca cccattaaga gccctttaaa gacctggatt 360
gattggaagg acaaaaatta aaagcaatct gatccagcct catgcaggat ccctgcggat 420
tttctcctta tcccatttcc atccactgtc acaatttgag aatctgcctg atttgatcag 480
attcacctcc aggggaggtg tgataccagg gttaggagga cgtgaagtta tgggcaactt 540
tctgatctgt ccatcagcag tctgagaaac gctggctctg aattttccgt gtcggccttt 600
tggaaacaac aagttcctcg ctgtttgcaa agcttcagtg ctcgggtccc tgggacaccc 660
cggccaccct cgcctggtag atgtggcatt tccatgctga ggccgcgagt cccgcctgac 720
cccgtcgctg cctctccagg gcttctctgg gccgcgcctc tgcagactgc gcagcc atg 779
Met
1
ctg cat ctg ctg gcg ctc ttc ctg cac tgc ctc cct ctg gcc tct ggg 827
Leu His Leu Leu Ala Leu Phe Leu His Cys Leu Pro Leu Ala Ser Gly
5 10 15
gac tat gac atc tgc aaa tcc tgg gtg acc aca gat gag ggc ccc acc 875
Asp Tyr Asp Ile Cys Lys Ser Trp Val Thr Thr Asp Glu Gly Pro Thr
20 25 30
tgg gag ttc tac gcc tgc cag ccc aag gtg atg cgc ctg aag gac tac 923
Trp Glu Phe Tyr Ala Cys Gln Pro Lys Val Met Arg Leu Lys Asp Tyr
35 40 45
gtc aag gtg aag gtg gag ccc tca ggc atc aca tgt gga gac ccc cct 971
Val Lys Val Lys Val Glu Pro Ser Gly Ile Thr Cys Gly Asp Pro Pro
50 55 60 65
gag agg ttc tgc tcc cat gag aat ccc tac cta tgc agc aac gag tgt 1019
Glu Arg Phe Cys Ser His Glu Asn Pro Tyr Leu Cys Ser Asn Glu Cys
70 75 80
gac gcc tcc aac ccg gac ctg gcc cac ccg ccc agg ctc atg ttc gac 1067
Asp Ala Ser Asn Pro Asp Leu Ala His Pro Pro Arg Leu Met Phe Asp
85 90 95
aag gag gag gag ggc ctg gcc acc tac tgg cag agc atc acc tgg agc 1115
Lys Glu Glu Glu Gly Leu Ala Thr Tyr Trp Gln Ser Ile Thr Trp Ser
100 105 110
cgc tac ccc agc ccg ctg gaa gcc aac atc acc ctt tcg tgg aac aag 1163
Arg Tyr Pro Ser Pro Leu Glu Ala Asn Ile Thr Leu Ser Trp Asn Lys
115 120 125
acc gtg gag ctg acc gac gac gtg gtg atg acc ttc gag tac ggc cgg 1211
Thr Val Glu Leu Thr Asp Asp Val Val Met Thr Phe Glu Tyr Gly Arg
130 135 140 145
ccc acg gtc atg gtc ctg gag aag tcc ctg gac aac ggg cgc acc tgg 1259
Pro Thr Val Met Val Leu Glu Lys Ser Leu Asp Asn Gly Arg Thr Trp
150 155 160
cag ccc tac cag ttc tac gcc gag gac tgc atg gag gcc ttc ggt atg 1307
Gln Pro Tyr Gln Phe Tyr Ala Glu Asp Cys Met Glu Ala Phe Gly Met
165 170 175
tcc gcc cgc cgg gcc cgc gac atg tca tcc tcc agc gcg cac cgc gtg 1355
Ser Ala Arg Arg Ala Arg Asp Met Ser Ser Ser Ser Ala His Arg Val
180 185 190
ctc tgc acc gag gag tac tcg cgc tgg gca ggc tcc aag aag gag aag 1403
Leu Cys Thr Glu Glu Tyr Ser Arg Trp Ala Gly Ser Lys Lys Glu Lys
195 200 205
cac gtg cgc ttc gag gtg cgg gac cgc ttc gcc atc ttt gcc ggc ccc 1451
His Val Arg Phe Glu Val Arg Asp Arg Phe Ala Ile Phe Ala Gly Pro
210 215 220 225
gac ctg cgc aac atg gac aac ctc tac acg cgg ctg gag agc gcc aag 1499
Asp Leu Arg Asn Met Asp Asn Leu Tyr Thr Arg Leu Glu Ser Ala Lys
230 235 240
ggc ctc aag gag ttc ttc acc ctc acc gac ctg cgc atg cgg ctg ctg 1547
Gly Leu Lys Glu Phe Phe Thr Leu Thr Asp Leu Arg Met Arg Leu Leu
245 250 255
cgc ccg gcg ctg ggc ggc acc tat gtg cag cgg gag aac ctc tac aag 1595
Arg Pro Ala Leu Gly Gly Thr Tyr Val Gln Arg Glu Asn Leu Tyr Lys
260 265 270
tac ttc tac gcc atc tcc aac atc gag gtc atc ggc agg tgc aag tgc 1643
Tyr Phe Tyr Ala Ile Ser Asn Ile Glu Val Ile Gly Arg Cys Lys Cys
275 280 285
aac ctg cat gcc aac ctg tgc tcc atg cgc gag ggc agc ctg cag tgc 1691
Asn Leu His Ala Asn Leu Cys Ser Met Arg Glu Gly Ser Leu Gln Cys
290 295 300 305
gag tgc gag cac aac acc acc ggc ccc gac tgc ggc aag tgc aag aag 1739
Glu Cys Glu His Asn Thr Thr Gly Pro Asp Cys Gly Lys Cys Lys Lys
310 315 320
aat ttc cgc acc cgg tcc tgg cgg gcc ggc tcc tac ctg ccg ctg ccc 1787
Asn Phe Arg Thr Arg Ser Trp Arg Ala Gly Ser Tyr Leu Pro Leu Pro
325 330 335
cat ggc tct ccc aac gcc tgt gcc gct gca ggt tcc ttt ggc aac tgc 1835
His Gly Ser Pro Asn Ala Cys Ala Ala Ala Gly Ser Phe Gly Asn Cys
340 345 350
gaa tgc tac ggt cac tcc aac cgc tgc agc tac att gac ttc ctg aat 1883
Glu Cys Tyr Gly His Ser Asn Arg Cys Ser Tyr Ile Asp Phe Leu Asn
355 360 365
gtg gtg acc tgc gtc agc tgc aag cac aac acg cga ggt cag cac tgc 1931
Val Val Thr Cys Val Ser Cys Lys His Asn Thr Arg Gly Gln His Cys
370 375 380 385
cag cac tgc cgg ctg ggc tac tac cgc aac ggc tcg gca gag ctg gat 1979
Gln His Cys Arg Leu Gly Tyr Tyr Arg Asn Gly Ser Ala Glu Leu Asp
390 395 400
gat gag aac gtc tgc att gag tgt aac tgc aac cag ata ggc tcc gtg 2027
Asp Glu Asn Val Cys Ile Glu Cys Asn Cys Asn Gln Ile Gly Ser Val
405 410 415
cac gac cgg tgc aac gag acc ggc ttc tgc gag tgc cgc gag ggc gcg 2075
His Asp Arg Cys Asn Glu Thr Gly Phe Cys Glu Cys Arg Glu Gly Ala
420 425 430
gcg ggc ccc aag tgc gac gac tgc ctc ccc acg cac tac tgg cgc cag 2123
Ala Gly Pro Lys Cys Asp Asp Cys Leu Pro Thr His Tyr Trp Arg Gln
435 440 445
ggc tgc tac ccc aac gtg tgc gac gac gac cag ctg ctg tgc cag aac 2171
Gly Cys Tyr Pro Asn Val Cys Asp Asp Asp Gln Leu Leu Cys Gln Asn
450 455 460 465
gga ggc acc tgc ctg cag aac cag cgc tgc gcc tgc ccg cgc ggc tac 2219
Gly Gly Thr Cys Leu Gln Asn Gln Arg Cys Ala Cys Pro Arg Gly Tyr
470 475 480
acc ggc gtg cgc tgc gag cag ccc cgc tgc gac ccc gcc gac gat gac 2267
Thr Gly Val Arg Cys Glu Gln Pro Arg Cys Asp Pro Ala Asp Asp Asp
485 490 495
ggc ggt ctg gac tgc gac cgc gcg ccc ggg gcc gcc ccg cgc ccc gcc 2315
Gly Gly Leu Asp Cys Asp Arg Ala Pro Gly Ala Ala Pro Arg Pro Ala
500 505 510
acc ctg ctc ggc tgc ctg ctg ctg ctg ggg ctg gcc gcc cgc ctg ggc 2363
Thr Leu Leu Gly Cys Leu Leu Leu Leu Gly Leu Ala Ala Arg Leu Gly
515 520 525
cgc tgagccccgc ccggaggacg ctccccgcac ccggaggccg ggggtcccgg 2416
Arg
530
ggtcccgggg cggggccggc gtccgaggcc gggcggtgag aagggtgcgg cccgaggtgc 2476
tcccaggtgc tactcagcag ggccccccgc ccggcccgcg ctcccgcccg cactgccctc 2536
cccccgcagc aggggcgcct tgggactccg gtccccgcgc ctgcgatttg gtttcgtttt 2596
tcttttgtat tatccgccgc ccagttcctt tttttgtctt tctctctctc tctttttttt 2656
tttttttttc tggcggtgag ccagagggtc gggagaaacg ctgctcgccc cacaccccgt 2716
cctgcctccc accacactta cacacacggg actgtggccg acaccccctg gcctgtgcca 2776
ggctcacggg cggcggcgga ccccgacctc cagttgccta caattccagt cgctgacttg 2836
gtcctgtttt ctattcttta tttttcctgc aacccaccag accccaggcc tcaccggagg 2896
cccggtgacc acggaactca ccgtctgggg gaggaggaga gaaggaaggg gtggggggcc 2956
tggaaacttc gttctgtaga gaactatttt tgtttgtatt cactgtcccc tgcaaggggg 3016
acggggcggg agcactggtc accgcggggg ccgatggtgg agaatccgag gagtaaagag 3076
tttgctc 3083
<210> 26
<211> 3016
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(1674)
<400> 26
aag atg caa gaa aag gac agt ctc aca gaa agc ttc cca gca gct ctg 48
Lys Met Gln Glu Lys Asp Ser Leu Thr Glu Ser Phe Pro Ala Ala Leu
1 5 10 15
gcc aaa acc ctt gag gat gca gcg ctg agg gag gag aat gaa caa ctt 96
Ala Lys Thr Leu Glu Asp Ala Ala Leu Arg Glu Glu Asn Glu Gln Leu
20 25 30
tcc aat gcc tca tca tcc atc ggc tta tgc gta gat cct tta aaa ggt 144
Ser Asn Ala Ser Ser Ser Ile Gly Leu Cys Val Asp Pro Leu Lys Gly
35 40 45
cgc tgt ctc gtt gcc aca aaa gat att ctc cca gga gag ctc ctg gtg 192
Arg Cys Leu Val Ala Thr Lys Asp Ile Leu Pro Gly Glu Leu Leu Val
50 55 60
cag gag gat gct ttt gtg agt gtt ctc aac cca gga gaa ctg cca cca 240
Gln Glu Asp Ala Phe Val Ser Val Leu Asn Pro Gly Glu Leu Pro Pro
65 70 75 80
ccg cat cac ggc cta gac agc aaa tgg gac acc aga gtc acc aat ggg 288
Pro His His Gly Leu Asp Ser Lys Trp Asp Thr Arg Val Thr Asn Gly
85 90 95
gac ctc tat tgt cac cga tgt ttg aag cac act ttg gcc aca gtt ccg 336
Asp Leu Tyr Cys His Arg Cys Leu Lys His Thr Leu Ala Thr Val Pro
100 105 110
tgt gac gga tgc agt tat gcc aag tat tgc agc cag gag tgt ttg cag 384
Cys Asp Gly Cys Ser Tyr Ala Lys Tyr Cys Ser Gln Glu Cys Leu Gln
115 120 125
cag gcc tgg gag ctc tac cac agg aca gaa tgt cct ctg gga ggg ctg 432
Gln Ala Trp Glu Leu Tyr His Arg Thr Glu Cys Pro Leu Gly Gly Leu
130 135 140
ctt ctc aca ctg ggt gtc ttt tgc cac att gcc ctg agg ttg act ctt 480
Leu Leu Thr Leu Gly Val Phe Cys His Ile Ala Leu Arg Leu Thr Leu
145 150 155 160
ttg gtg gga ttt gag gat gtt cgc aaa atc ata acg aag ctt tgt gat 528
Leu Val Gly Phe Glu Asp Val Arg Lys Ile Ile Thr Lys Leu Cys Asp
165 170 175
aag att agt aac aag gac atc tgt tta cct gaa agc aac aat cag gtc 576
Lys Ile Ser Asn Lys Asp Ile Cys Leu Pro Glu Ser Asn Asn Gln Val
180 185 190
aag aca ctt aat tat ggc cta ggg gag agt gag aaa aat ggc aac atc 624
Lys Thr Leu Asn Tyr Gly Leu Gly Glu Ser Glu Lys Asn Gly Asn Ile
195 200 205
gtt gag acc cca att cct gga tgc gat att aat ggg aag tat gaa aat 672
Val Glu Thr Pro Ile Pro Gly Cys Asp Ile Asn Gly Lys Tyr Glu Asn
210 215 220
aat tat aat gct gtc ttc aac ctt ttg ccc cac act gaa aac cat agc 720
Asn Tyr Asn Ala Val Phe Asn Leu Leu Pro His Thr Glu Asn His Ser
225 230 235 240
cca gag cac aaa ttc ctc tgt gct ctc tgt gtt tct gca ctg tgc aga 768
Pro Glu His Lys Phe Leu Cys Ala Leu Cys Val Ser Ala Leu Cys Arg
245 250 255
cag cta gaa gca gcc agt tta cag gcc atc cca act gag agg att gtg 816
Gln Leu Glu Ala Ala Ser Leu Gln Ala Ile Pro Thr Glu Arg Ile Val
260 265 270
aac tcc tct cag ctt aaa gca gca gtg aca cct gaa ttg tgt cct gac 864
Asn Ser Ser Gln Leu Lys Ala Ala Val Thr Pro Glu Leu Cys Pro Asp
275 280 285
gtg act att tgg gga gtg gcg atg ctg aga cac atg tta cag ctt cag 912
Val Thr Ile Trp Gly Val Ala Met Leu Arg His Met Leu Gln Leu Gln
290 295 300
tgt aac gct cag gcg atg acc acc ata caa cgc aca gga cct aaa ggg 960
Cys Asn Ala Gln Ala Met Thr Thr Ile Gln Arg Thr Gly Pro Lys Gly
305 310 315 320
agc atc gtt acc gac agc agg cag gtg cgc ctt gcc aca ggc atc ttc 1008
Ser Ile Val Thr Asp Ser Arg Gln Val Arg Leu Ala Thr Gly Ile Phe
325 330 335
cct gtt atc agc ctc ctg aac cac tcc tgt agc ccc aac acc agc gtg 1056
Pro Val Ile Ser Leu Leu Asn His Ser Cys Ser Pro Asn Thr Ser Val
340 345 350
tcc ttc att agc act gtc gcc acc atc cgg gcg tca cag tgg att aga 1104
Ser Phe Ile Ser Thr Val Ala Thr Ile Arg Ala Ser Gln Trp Ile Arg
355 360 365
aag ggg caa gag att ctc cac tgc tat ggg cct cac aag agc cgg atg 1152
Lys Gly Gln Glu Ile Leu His Cys Tyr Gly Pro His Lys Ser Arg Met
370 375 380
ggg gtt gcc gaa agg cag cag aag ctg agg tct cag tat ttc ttt gac 1200
Gly Val Ala Glu Arg Gln Gln Lys Leu Arg Ser Gln Tyr Phe Phe Asp
385 390 395 400
tgc gcc tgt cca gct tgt caa act gag gca cac agg atg gct gca ggg 1248
Cys Ala Cys Pro Ala Cys Gln Thr Glu Ala His Arg Met Ala Ala Gly
405 410 415
ccc agg tgg gaa gca ttc tgt tgc aac agt tgc gga gcg ccc atg cag 1296
Pro Arg Trp Glu Ala Phe Cys Cys Asn Ser Cys Gly Ala Pro Met Gln
420 425 430
gga gat gac gtg ctg cgc tgt ggc agc aga tct tgt gca gaa tcc gcc 1344
Gly Asp Asp Val Leu Arg Cys Gly Ser Arg Ser Cys Ala Glu Ser Ala
435 440 445
gtc agc agg gac cac ctg gtc tct cgg tta cag gac cta cag cag cag 1392
Val Ser Arg Asp His Leu Val Ser Arg Leu Gln Asp Leu Gln Gln Gln
450 455 460
gtc aga gtg gcc cag aag ctt ctc aga gat ggt gaa cta gag cga gct 1440
Val Arg Val Ala Gln Lys Leu Leu Arg Asp Gly Glu Leu Glu Arg Ala
465 470 475 480
gtt cag cgg ctg tcg ggg tgc cag cgt gac gcc gag agc ttc ctg tgg 1488
Val Gln Arg Leu Ser Gly Cys Gln Arg Asp Ala Glu Ser Phe Leu Trp
485 490 495
gca gag cac gcc gtg gtg gga gag atc gcg gat ggc ctg gcc cgg gcc 1536
Ala Glu His Ala Val Val Gly Glu Ile Ala Asp Gly Leu Ala Arg Ala
500 505 510
tgt gct gcc tta gga gac tgg caa aag tca gcc acc cat cta cag agg 1584
Cys Ala Ala Leu Gly Asp Trp Gln Lys Ser Ala Thr His Leu Gln Arg
515 520 525
agt ctc tgc gtg gtg gag gtt cgc cac ggg ccg tcc agt gtt gaa atg 1632
Ser Leu Cys Val Val Glu Val Arg His Gly Pro Ser Ser Val Glu Met
530 535 540
ggc cat gag ctc ttc aaa ttg gcc cag atc ttt ttc aac ggg 1674
Gly His Glu Leu Phe Lys Leu Ala Gln Ile Phe Phe Asn Gly
545 550 555
tgagtccctt tcccacttca cagggcacac actgttcctt tcctaggctc ttctgggtat 1734
tacttttttt ctatggtgtt tttaaaagca tgttgaaact tctttatttc ctctttagga 1794
gagatctttg ccttctgttt tgctaacaaa cataacgctt tagaatttcc tcctgggagg 1854
cggagcttgc agtgagccaa gatcgcgcca ctgcactgta gcctgggcga cagagcgaga 1914
ctccatctca aaaaaaaaaa aaaaaagaat ttcctcctgg gagatatggg gaaagctttt 1974
agatctcaag tgagtttccc agagagaact tggccaggca gccccaccca cactccagta 2034
tagctctcct tctggttgtt ctctggttcg tcacagcccc tttggctgtc tcattcccag 2094
cactctcagc tccaggctct ctgccttctc cctactctgc atcctcttcc ttttaaagca 2154
tttgcttgaa ccagggctga ttttaccccc agacaacatg tggcattgcc tggagacttt 2214
tttttttttt tttttgagaa ggggtcttgc tgtcttgccc aggctggagt acagtggctc 2274
ggtcacagct cactgcttcc tgcaactcct gggctcaagc gatctctgga gatattttgg 2334
attgtcatga cttggaaggg ggttgctgct ggcatctatt cgaaagaggc cagaggtgct 2394
gctaagcatc ctcccgggta caagacagtg ccccccccac caaccccagc aaagaaggag 2454
ccagcaacag tgtcaatagt gtcagggtcg agaaaccctg cttttttgct gctgttgttt 2514
gttttttgtt ttaaatcccc tgttctactt tgaaggttaa aaggaaagaa cagcacgctg 2574
atgagctgat gccaaatcca cttttgctac tgagaccatt tagggcccaa tttcaggatt 2634
tccgggtgga acctccaacc ctggtaaagc tgcccttggt gtatgtggct atgagaaagt 2694
gcttaaagct tcaacctaag gctgtgctga gtgggcctgg tacccaagac acagagacat 2754
tctgtttggt cagagagaga gaaaagggct gcagagttag gtcagaggtc agccctaaga 2814
tggtcaccct aaaatgaaaa ctgctatgta aacctgtggt aaataccaga tgaaaacttt 2874
gaggccgggt gtggtagctc acgcctgtaa tccagctgcc caggaggttg aggcaggaga 2934
atcacttgaa cccgggaggc ggaggttgca gtgagccgag gtcgcacaaa tacctgggcg 2994
acaagagtga aactcttgtc tc 3016
<210> 27
<211> 3228
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2158)..(3228)
<400> 27
attttttgta ttttcagttg aaacgaggtt tcaccatatt ggccaggctg gtctcgaact 60
tctgacctca ggtgatccac cccccgcctc gtcctccaaa agagctggga ttacaagtgt 120
gagccaccgc gcccggccca gttgtggact ttaacagagg gaagctttaa acatgtttaa 180
ccacaggccc aatttgaaca aagatacttc aatcattata gagaggaaaa cagtactttt 240
tgttcaattg tgcaaactct ccaagtatct aatggagaag tagagaagaa ccctaatgaa 300
actgagtgtg aatgaggctc agctaggctt ctacttgggt tcactttctc atctgtctgc 360
ctgtcctggg attgaccctc gctcctctga agaccagcct gaaagcctta aaactggtca 420
gatgatggat gagtctgatg aggactttaa agaactctgc gctagctttt tccaaagggt 480
gaaaaaacat ggaatcaagg aagtgtcagg agaaaggaag acacaaaagg ctgcctcaaa 540
cggcactcag ataagaagca aattgaaaag gaccaaacaa actgctacca agaccaaaac 600
ccttcaaggc cctgcagaga agaaacctcc gtctggcagc caggccccta ggactaaaaa 660
gcaaagggta accaaatggc aagcaagtga accggcccac tctgtgaatg gggagggggg 720
tgtgcttgcc tctgctccag atccacctgt gctccgggaa acagcacaaa acacccagac 780
gggtaaccag caagaaccat cgccaaacct ttccagagag aaaaccagag agaatgtgcc 840
caacagcgac tcccagcctc ctccttcctg tttgacaaca gcagtgccaa gtccctccaa 900
accccgcaca gcacaattgg tcctacagcg aatgcagcag ttcaagagag cagaccccga 960
gcgtttgaga cacgcttcag aagagtgctc cctcgaggct gcgcgggaag aaaatgtccc 1020
aaaggatcct caagaggaga tgatggcggg gaatgtgtat gggcttgggc cccctgcccc 1080
agagagcgac gctgcggtgg ccttgaccct gcagcaggag tttgcacggg taggagcatc 1140
ggcacatgat gatagcctgg aggaaaaggg tttgttcttc tgccagattt gtcaaaagaa 1200
cctctcagcc atgaacgtga cccgaaggga acagcatgtg aacaggtggg ggcagcttgg 1260
gccgtcgcct ctcccgtgta tgtgaccaga atccccaagc accccagggc tggagtgcgg 1320
atctccacac cttaccatga caacagcacc tcagctttgt tgtcaaccta ccccttttta 1380
aaaataactt cgttgagata caattcacag aacatatagt tcacccattt aaactgaaca 1440
aatcactact tttgggatat tcacacagtt gggcaactgt caccacaatc acttctggat 1500
tattttcatc acccccaaaa gaatccccac acccattggc agccactccc tattgcccct 1560
ctctcccctg acaaccacta atcaacattc tgtctggatg gatttgccga ttctgaaatt 1620
gcacagttgt ccttttgtgt ctgccttctt tgacttaata tgttgttttt gaggttcatc 1680
catgttgtag catggagcag gcttcattcc tttttatggc tgagcagtat ctcattgtat 1740
ggctagactg tgtttttccc attcttagat gaggaatatg accatggttt atcctttcgt 1800
ccattggcgg acatttggag catttctacc ctttgggatt gtggatagag ctgccgtgaa 1860
catgggtttc atgtatttgt ttgggtacct gctttcagtt ctttggggtc tctacttagg 1920
agtggaattt ctaagtcatc atgtaactgc atttaatctt tccttgcttt ctttagccaa 1980
ctttgctgac agatacctaa gtgtagtgtc taggggctga ctgccgggag acggagccag 2040
gctgtgtaga ggggattggc tttggggaac ttgctttgac cacagcacgt ctgtgttgac 2100
ctggacccac atttgctcca atccacattc ctggggaggg tggttctcct gtattga 2157
ctg ttt tcc ttc agg tgc ttg gat gaa gct gaa aag aca cta aga cct 2205
Leu Phe Ser Phe Arg Cys Leu Asp Glu Ala Glu Lys Thr Leu Arg Pro
1 5 10 15
tct gtg cct cag atc cct gag tgc ccg att tgt ggg aaa ccg ttt ctt 2253
Ser Val Pro Gln Ile Pro Glu Cys Pro Ile Cys Gly Lys Pro Phe Leu
20 25 30
acc tta aag agc aga acc agt cac ttg aag cag tgt gct gtg aag atg 2301
Thr Leu Lys Ser Arg Thr Ser His Leu Lys Gln Cys Ala Val Lys Met
35 40 45
gag gtt ggc ccc cag ctc ctg ctt cag gct gtg cgg ctg cag aca gca 2349
Glu Val Gly Pro Gln Leu Leu Leu Gln Ala Val Arg Leu Gln Thr Ala
50 55 60
cag cct gag ggt agc agc agc cca ccc atg ttc agc ttc agt gat cac 2397
Gln Pro Glu Gly Ser Ser Ser Pro Pro Met Phe Ser Phe Ser Asp His
65 70 75 80
agt aga ggt ctg aaa cgg aga gga ccc acc agc aag aag gag cca cgg 2445
Ser Arg Gly Leu Lys Arg Arg Gly Pro Thr Ser Lys Lys Glu Pro Arg
85 90 95
aag agg cgg aag gtg gac gag gca ccg tcc gag gac ctg ctg gtg gcc 2493
Lys Arg Arg Lys Val Asp Glu Ala Pro Ser Glu Asp Leu Leu Val Ala
100 105 110
atg gct ctg tcc cgg tcg gag atg gag ccg ggt gcg gct gta cca gcg 2541
Met Ala Leu Ser Arg Ser Glu Met Glu Pro Gly Ala Ala Val Pro Ala
115 120 125
ctc agg ctg gaa agt gcc ttt tct gag agg ata aga cca gaa gca gag 2589
Leu Arg Leu Glu Ser Ala Phe Ser Glu Arg Ile Arg Pro Glu Ala Glu
130 135 140
aat aaa agt cgc aag aag aaa ccc ccg gta tcc ccc cca ttg ttg tta 2637
Asn Lys Ser Arg Lys Lys Lys Pro Pro Val Ser Pro Pro Leu Leu Leu
145 150 155 160
gtc cag gac tct gaa acc aca ggc cga cag ata gag gac cgt gtg gcc 2685
Val Gln Asp Ser Glu Thr Thr Gly Arg Gln Ile Glu Asp Arg Val Ala
165 170 175
ctg ctc ctc tct gag gaa gtg gaa ttg tct agc acg cca cca ctt cct 2733
Leu Leu Leu Ser Glu Glu Val Glu Leu Ser Ser Thr Pro Pro Leu Pro
180 185 190
gcc agc agg att tta aag gaa ggg tgg gaa aga gcg ggc cag tgt cct 2781
Ala Ser Arg Ile Leu Lys Glu Gly Trp Glu Arg Ala Gly Gln Cys Pro
195 200 205
cct cca cct gaa cgc aag cag agc ttt ctg tgg gag ggc agc gca ctg 2829
Pro Pro Pro Glu Arg Lys Gln Ser Phe Leu Trp Glu Gly Ser Ala Leu
210 215 220
act ggg gcc tgg gcc atg gag gac ttc tac acg gcc agg ctg gtc cct 2877
Thr Gly Ala Trp Ala Met Glu Asp Phe Tyr Thr Ala Arg Leu Val Pro
225 230 235 240
cct ctc gtg ccc cag cgg cct gcc cag ggc ctt atg cag gag ccc gtg 2925
Pro Leu Val Pro Gln Arg Pro Ala Gln Gly Leu Met Gln Glu Pro Val
245 250 255
ccg cct ctg gtg cca cct gag cac tca gag ctg agc gag cga agg tca 2973
Pro Pro Leu Val Pro Pro Glu His Ser Glu Leu Ser Glu Arg Arg Ser
260 265 270
ccc gct ctc cac ggc acc ccc act gca ggc tgt ggc tcc agg ggc ccg 3021
Pro Ala Leu His Gly Thr Pro Thr Ala Gly Cys Gly Ser Arg Gly Pro
275 280 285
tcg cct tcg gcc agc cag agg gag cac cag gcc ctg cag gac ctc gtg 3069
Ser Pro Ser Ala Ser Gln Arg Glu His Gln Ala Leu Gln Asp Leu Val
290 295 300
gac ctg gcg agg gag gga ctg agc gcc agc ccg tgg ccc ggc agt ggg 3117
Asp Leu Ala Arg Glu Gly Leu Ser Ala Ser Pro Trp Pro Gly Ser Gly
305 310 315 320
ggc ctg gct ggc tcg gaa ggg act gca ggg ttg gac gtg gtg ccc ggc 3165
Gly Leu Ala Gly Ser Glu Gly Thr Ala Gly Leu Asp Val Val Pro Gly
325 330 335
ggc ctt cct ctg act ggg ttt gtg gtg cca tcg cag gac aag cac ccg 3213
Gly Leu Pro Leu Thr Gly Phe Val Val Pro Ser Gln Asp Lys His Pro
340 345 350
gac agg ggc ggc cgc 3228
Asp Arg Gly Gly Arg
355
<210> 28
<211> 3458
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (593)..(1213)
<400> 28
ctgtctcaat aaaaaaaaag aaaagttaaa caattaactt tttaaaagag tataaatgag 60
aaaaagatat tttatatgtg ctaatcttct gcctacatga tgaaaaagag aaggtacagt 120
ttaccaatga gagaatgaga gatgtaggca cagtggcgtg tgcctatact ctcatctatt 180
tgagaatctt aggtgggagg atcacttgaa tccaggagtt tgagaccagc aggggaacat 240
agtgagaccc tatctcttta ataataaaaa aaaagaatgt gaggacctca atgcagatct 300
tttagacatt aaaaggataa gtaaaattat gaacattatt atgccaataa tttttttttt 360
tttttttttt gagacagcgt ctatctctgt tgcccgggct ggagtgcggt ggcgcgatct 420
cagcccactg caacctccat ctcccgggtt caggcgattc tcctgcccca gcctcccagg 480
tggctgggat tataggtgtg tgccaccaaa cctagctaat ttctgtattt ttagtagaga 540
cggggttttg gcatgttggt cgggcaggtt tcgagctcct gacctcaggt ga cct gcc 598
Pro Ala
1
tgc ctc agc ctc cca agg tgc tgg gat tgc agg tgt gag cca cca cgc 646
Cys Leu Ser Leu Pro Arg Cys Trp Asp Cys Arg Cys Glu Pro Pro Arg
5 10 15
cca gcc tat gcc aat aac ttt tac agc tta gat ata atg gac aaa ttc 694
Pro Ala Tyr Ala Asn Asn Phe Tyr Ser Leu Asp Ile Met Asp Lys Phe
20 25 30
cat gaa aaa cac aaa cca tca aaa ttc cct cga gaa caa ata gag aac 742
His Glu Lys His Lys Pro Ser Lys Phe Pro Arg Glu Gln Ile Glu Asn
35 40 45 50
cta aat aac ttt aca tta aga act tgg tct ttt cct ctt gga agc ccc 790
Leu Asn Asn Phe Thr Leu Arg Thr Trp Ser Phe Pro Leu Gly Ser Pro
55 60 65
ctc tct ctc act aga gag aga gct gat ttc ctt tct tta tct ttc tct 838
Leu Ser Leu Thr Arg Glu Arg Ala Asp Phe Leu Ser Leu Ser Phe Ser
70 75 80
ctt ttg cct gtt aaa cct cca ctc cta aat tcc tca tgt gtg tcc gtg 886
Leu Leu Pro Val Lys Pro Pro Leu Leu Asn Ser Ser Cys Val Ser Val
85 90 95
tcc aaa att ttc ctg gca cga gat gat gaa cct cag gta ttt acc cca 934
Ser Lys Ile Phe Leu Ala Arg Asp Asp Glu Pro Gln Val Phe Thr Pro
100 105 110
gac aac gta gct gct tca tac tgg gga cct cgt ccc aga tat caa ggt 982
Asp Asn Val Ala Ala Ser Tyr Trp Gly Pro Arg Pro Arg Tyr Gln Gly
115 120 125 130
aca gca ttc atc aaa acg caa cat ctg gtg gag gca aac cag tgt tac 1030
Thr Ala Phe Ile Lys Thr Gln His Leu Val Glu Ala Asn Gln Cys Tyr
135 140 145
aac cca tcg gaa tgg cta aca gca atc aaa ctc caa atg gtg ctg cag 1078
Asn Pro Ser Glu Trp Leu Thr Ala Ile Lys Leu Gln Met Val Leu Gln
150 155 160
aca gaa cca cac atg gac gtg cct ttc ttc cga gga ctc tta gat cgg 1126
Thr Glu Pro His Met Asp Val Pro Phe Phe Arg Gly Leu Leu Asp Arg
165 170 175
ccc cag gag gag ccc tac ctg ctg ttc ccc aca caa cac ctc ttt tca 1174
Pro Gln Glu Glu Pro Tyr Leu Leu Phe Pro Thr Gln His Leu Phe Ser
180 185 190
gca gga agt agc cag aaa gag tcc tcg tcc aac agc ccc taatagtagt 1223
Ala Gly Ser Ser Gln Lys Glu Ser Ser Ser Asn Ser Pro
195 200 205
tagggttacc actccagagc ggggaatgat acaggtgtta agaagaaatt acttaggtgg 1283
atactgaggg tacagaagtc cttggtaagg ttttccattt aatgaaaagc agccccaaat 1343
tattttcttt ctaacaaaga gcagcctgta aaattcagtt gcagacatag atgttagcag 1403
ttatgaaatc atgttcaaga tgggagcttc atcttccctt cgctttgtca accatatgta 1463
cagtaaggag cagacaagat ggcaccagcc aaggggaaag ttcatttgca taataacatt 1523
agggtggggt agccagcctt cccctaaagc tatgtaaaca tcatacctga ttgaaccaat 1583
ctgtaatccc tatgtaaatc agacgccgcc gcctcaagcc tgagtaaaat ccagcacatc 1643
tgccaccaac tggtctggga gtcccctctc tcacgagaga gagctgtttt actttctctt 1703
tctttctttt ttttttcttt ttttcctatt aaacctttgc tcctacactc cttaaaaaaa 1763
aaaaaaaaga aagaaattga atttgtagtt tcaaaaatct ttcaacaaag aaaacttaag 1823
gtctagatgg tggccttact agtaaattat agcaaacatt taaggaggaa ataataccaa 1883
ttatacacac agtctttcag aaaatgggag gaagcactta ccagctcgtt ttatgaagcc 1943
agcattatac taatatcaaa accaaataaa gacatgagaa gcatagaaag ttacagcccc 2003
agtatccatc atgaacatag atgtaaaaat attctataaa acttaagcaa gtttaagtca 2063
agagatacat aaaaagaata atacatcatg aacaaatggg gttcacttga aaatcagtgc 2123
agtccactat attagtagac taaaaaataa ccacatgatc tcaagatata gcagaaaaat 2183
catttgtcta aattatcaaa aacctaggct atgcattctc atcaggtgaa aattggctct 2243
tgcttgggag ggagaatcta aaatgttaac aattttttgt gccctttcag ggggccataa 2303
tacataaaca gatatacagt atatctggta ttaaaatttc ttgggtggag agcacaatga 2363
ccacaaaaca aagtctgaaa aggctctttg gagaatgata atgaaataat gtttgaaaaa 2423
cacaactatg tataagtaga atatcatata atatgaaaga ttgacttctt cctcttaggt 2483
tcaggaacaa agcaaggatg tcagcttttg taacttctat tcagtatttt actggagtta 2543
gtggtcagta tgaagaaata aaagggaaaa aggggaaaag gaaaacaaaa gaagagacat 2603
acagattgga aagaagtaaa atgattttta ttctcagaca acatgatcat ctatgtagaa 2663
atccagagat ctacaaaaaa gctaatatta taatgagttt tttcaatatt acatgataca 2723
aagtcaacat acaaagatca attatttttc tgtatgatag caatgaacaa ttagaaattg 2783
aaatgtaatg ggaggttgaa gcaggagaat tccttgaacc tgggaggcag aagcttcagt 2843
gagccgagca tcataaaatc atgaagtgct tagggacaaa tttagccaat acctacaaga 2903
cacatacact gaagactaca acatattact gagagaaatt aaagaatacc taaataaatg 2963
aagagatata ccatatttgt ggatccaaag attgaatatg ggctgcgtgt ggtggctcat 3023
gtctgtaatc ctagcacttt gggaaactga agtgggagaa tcacttgagg ccaggagttt 3083
gagaccagcc tgggcaacac agcaagatcc catctctaca aaaaaattgt taaaagataa 3143
agaaaaaata acaataatga aaaataggcc gagcgccgtg gctaactcac gcctgtaatc 3203
ctagcgcttc gggaggccga ggcaggcaga tcacctgagg tcgggagttc gagaccagcc 3263
tgaccaacat ggagaaaccc tgtctctact aaaaatacaa aattagccag gcatggtggc 3323
acatgcctgt aatcccagct tctcgggagg ctgaggcagg agaattgctt gaacccggga 3383
ggcagaggtt gcagtgagct gagatcgtgc cattgcactc tagcctgggc aatgagcgaa 3443
actccttctt aaaag 3458
<210> 29
<211> 3326
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2)..(979)
<400> 29
g aag gaa ggg cac cgc caa gat gga gag ggg gcc tta gca gct cct gaa 49
Lys Glu Gly His Arg Gln Asp Gly Glu Gly Ala Leu Ala Ala Pro Glu
1 5 10 15
gct gag cca gca gga aag gtg cag gcc cct gag ggg ctg atc cca gcc 97
Ala Glu Pro Ala Gly Lys Val Gln Ala Pro Glu Gly Leu Ile Pro Ala
20 25 30
aca ggc cag gca gag gag cta gca gcc aaa gat cac gac tcc tgc gca 145
Thr Gly Gln Ala Glu Glu Leu Ala Ala Lys Asp His Asp Ser Cys Ala
35 40 45
gga ctg gag ggg aga gct gaa ggg caa gga gga gtg gat gtc gtg cta 193
Gly Leu Glu Gly Arg Ala Glu Gly Gln Gly Gly Val Asp Val Val Leu
50 55 60
agg acc cag gaa gct gtt gct gag gaa gat ccc ata atg gca gaa aag 241
Arg Thr Gln Glu Ala Val Ala Glu Glu Asp Pro Ile Met Ala Glu Lys
65 70 75 80
ttc agg gag gaa gcg gtg gat gag gac cca gag gag gaa gag gac aaa 289
Phe Arg Glu Glu Ala Val Asp Glu Asp Pro Glu Glu Glu Glu Asp Lys
85 90 95
gag tgc act ctg gag aca gaa gcg atg cag gac agg aac tcg gaa ggg 337
Glu Cys Thr Leu Glu Thr Glu Ala Met Gln Asp Arg Asn Ser Glu Gly
100 105 110
gac ggg gac atg gaa gga gaa gga aac aca caa aag aat gag ggc atg 385
Asp Gly Asp Met Glu Gly Glu Gly Asn Thr Gln Lys Asn Glu Gly Met
115 120 125
gga gga gga agg gtt gtg gct gtg gaa gtt cta cac gga ggt ggt gaa 433
Gly Gly Gly Arg Val Val Ala Val Glu Val Leu His Gly Gly Gly Glu
130 135 140
acg gca gaa aca gcc gca gag gag agg gag gtg ttg gca ggt tcg gag 481
Thr Ala Glu Thr Ala Ala Glu Glu Arg Glu Val Leu Ala Gly Ser Glu
145 150 155 160
aca gcc gag gag aaa aca ata gca aat aaa gcc tcc tcc ttt tca gat 529
Thr Ala Glu Glu Lys Thr Ile Ala Asn Lys Ala Ser Ser Phe Ser Asp
165 170 175
gtt gct gag gaa gaa acc tgg cac caa cag gat gag tta gta gga aaa 577
Val Ala Glu Glu Glu Thr Trp His Gln Gln Asp Glu Leu Val Gly Lys
180 185 190
aca gca gct gca ggg aag gtg gtg gta gag gaa tta gca cgg agt ggg 625
Thr Ala Ala Ala Gly Lys Val Val Val Glu Glu Leu Ala Arg Ser Gly
195 200 205
gag gaa gtg cca gca gca gag gag atg aca gtg aca tat aca aca gag 673
Glu Glu Val Pro Ala Ala Glu Glu Met Thr Val Thr Tyr Thr Thr Glu
210 215 220
gct ggg gtg ggc act cca gga gcc ctg gag cgg aag acc tca ggg cta 721
Ala Gly Val Gly Thr Pro Gly Ala Leu Glu Arg Lys Thr Ser Gly Leu
225 230 235 240
gga cag gag caa gag gaa ggg tca gag ggc cag gag gca gcc act ggg 769
Gly Gln Glu Gln Glu Glu Gly Ser Glu Gly Gln Glu Ala Ala Thr Gly
245 250 255
agt ggc gat ggg agg cag gag aca gga gca gct gaa aaa ttc cga tta 817
Ser Gly Asp Gly Arg Gln Glu Thr Gly Ala Ala Glu Lys Phe Arg Leu
260 265 270
gga tta tca cgg gag gga gag agg gaa ttg agt ccg gag agt cta cag 865
Gly Leu Ser Arg Glu Gly Glu Arg Glu Leu Ser Pro Glu Ser Leu Gln
275 280 285
gcg atg gca aca ctt cca gtg aag cct gat ttc act gaa acc cga gag 913
Ala Met Ala Thr Leu Pro Val Lys Pro Asp Phe Thr Glu Thr Arg Glu
290 295 300
aag caa cag cat atg gtg caa gga gaa agc gag act gca gat gtt tcc 961
Lys Gln Gln His Met Val Gln Gly Glu Ser Glu Thr Ala Asp Val Ser
305 310 315 320
ccc aac aac atg cag gtc taggagactt gctggcagac ggataattta 1009
Pro Asn Asn Met Gln Val
325
aagatgtctt ctgaagatgt aaagagtgga gaaagattca cgcaagcatc tcaccaggat 1069
tcttgatttt ctctctctcc tctttagttg ctggttgcgc ttgtctgaga tgattcccaa 1129
tctgtcagcc ctggtcagta gctcagtaag caccttgaga atagctcaag tagatctgta 1189
ggacccttct tagaagcagt ggttcctcat ggagaaactt gtgaggctgt tacacattct 1249
acacacctaa cattattttc aaacaaaaat gataattttc agatgcttga cttttaccaa 1309
agatcactgg aaggcccagt cctaatgtta ggggtttgtt taaagtcctt tttattttac 1369
aatacagagc cccagtcaat tccacaatct caatttcata catgggaatt ttatttaaaa 1429
atctgtggtt tggggcttta atgaattggc ctgtgaaaat gagctctaaa tttcctccca 1489
cgtacactca aaactcaaga ttgctccaaa tctctaagtt cttccagcaa aagatttctt 1549
ggcatgtata ttcacttata cttagaaata ttcattcttt taatttatgc cagaataaca 1609
aagtggaaat cttatttcaa aatgctcttt gtttttttgt gtgtgtttct gtagttctgc 1669
tttctggggt agactagtaa aatggtagct tccagcattt tgtccctggg gccttcttta 1729
tagggccact caaatttaaa taaaagtagt aaataattta gctaagtgga ataagtataa 1789
taattatagt ggtaagcata gcacatcagc attatgccaa cattctagac tctttagttg 1849
atgtcattaa atggaaaaga aacttggatt aaatgagtgt gctgctcacc ttcccaagtt 1909
ctgttatttc aaacctgtga actaaccttg cagttcatta taaatcaaca gtaacaactg 1969
cattctaaat tactccctga tattattttc tagttgtgta tcagcctgtc tcctaggggt 2029
tttcatttcc ctgaagacat acaagtgccc cagagcgcat gtatatgtct accatttctc 2089
tatatgagaa ggtaaaaaaa atttccttaa gcagtgattt tccagccaga atatacatta 2149
gattttcatg ggacgctttt ataaatgact caaccctttt ccccacccca gagattcaga 2209
cttaattcgt tttagatgga tctacacatc agtatatata tatttttaac ttttcacttg 2269
attcttctct gtagccaagg ttgagaaccg ctgttctaaa tcatcatata atccatgctg 2329
gccacattac actcaaggtc cctagggacc aggcatatta tcatagtagg tattttccat 2389
tttaatgtgt aatggagcca ttcaatgatc aaaaatacac tggaccagat agtagactgg 2449
tcccttgatc agaagcatca gcacatcagc atcacctgga aattgttccc agcctttgtc 2509
tcctacctac taaattagaa actcttggtg ggttccagta atccatagct taacaagccc 2569
tgcagttaat actgatgtac actgatgtcc aaaaactgct gtcatggact attgattgta 2629
ttgaggatta gtctcagttg gaaagccaac tacagaggca ttttgaactt tctttctttg 2689
cctctctatg tctctctgtc ttttcctgtc ttctgattta tctgtctttc tttctctagt 2749
aaatggcact caatataaaa gtggtggagt caatcttaaa cttattttta ttatgattgt 2809
attgatacat gcacgaagtc cctctgccct actccctatt caaggatatt actcactgca 2869
catcataaat ctccatcatc tgtcttaaag ttttatgagt agatttcatc tacattatat 2929
tcaagttcat ttattactga gctgtattac tgtggagctc taacagtatt tgtttcctga 2989
tttcaaactc aatgctacag agcactttga atacatcaca ccttatagga aagatagtaa 3049
atgtattaat cccattgaaa aattagtttt gtacaatgtg ctaaatagta ttgcattgga 3109
ttacttttat atttaacaca ctccatcaaa acatcccata acataatttt acaatctgca 3169
tgtgaattta actgtgaaat tcagtattgt gatattttga ataagtgaat tctttctctg 3229
caaatactat gttgataaaa ttacttgtat gttcccctga aatggtttgt ttcctgtttc 3289
ttcattatta aaacataaat caatcatatg tttacag 3326
<210> 30
<211> 3386
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(1544)
<400> 30
aa aat gtt tct tac atg tgc cag cca ggc tac acg atg gaa ttg aat 47
Asn Val Ser Tyr Met Cys Gln Pro Gly Tyr Thr Met Glu Leu Asn
1 5 10 15
ggc tcc aga atc agg act tgt aca att aat ggc aca tgg agt gga gta 95
Gly Ser Arg Ile Arg Thr Cys Thr Ile Asn Gly Thr Trp Ser Gly Val
20 25 30
atg cca act tgt aga gct gtt acc tgc cca act cct ccc cag atc tct 143
Met Pro Thr Cys Arg Ala Val Thr Cys Pro Thr Pro Pro Gln Ile Ser
35 40 45
aat gga agg ctg gaa gga aca aat ttc gac tgg ggc ttt agt att agc 191
Asn Gly Arg Leu Glu Gly Thr Asn Phe Asp Trp Gly Phe Ser Ile Ser
50 55 60
tac atc tgt tct cca ggc tat gag cta tcc ttc cct gct gtt ttg acc 239
Tyr Ile Cys Ser Pro Gly Tyr Glu Leu Ser Phe Pro Ala Val Leu Thr
65 70 75
tgt gta ggg aat ggt acc tgg agt ggt gaa gta ccg cag tgc tta cca 287
Cys Val Gly Asn Gly Thr Trp Ser Gly Glu Val Pro Gln Cys Leu Pro
80 85 90 95
aag ttt tgt ggt gac cct ggt ata cct gcc caa gga aaa aga gaa ggc 335
Lys Phe Cys Gly Asp Pro Gly Ile Pro Ala Gln Gly Lys Arg Glu Gly
100 105 110
aaa agc ttt ata tac cag tca gag gtt tca ttc agc tgc aat ttt cct 383
Lys Ser Phe Ile Tyr Gln Ser Glu Val Ser Phe Ser Cys Asn Phe Pro
115 120 125
ttc ata tta gtg gga tca agc acc aga ata tgt caa gca gat ggc act 431
Phe Ile Leu Val Gly Ser Ser Thr Arg Ile Cys Gln Ala Asp Gly Thr
130 135 140
tgg agt ggt tca tca cct cac tgc ata gag cct acc caa acc tct tgt 479
Trp Ser Gly Ser Ser Pro His Cys Ile Glu Pro Thr Gln Thr Ser Cys
145 150 155
gaa aac cca ggt gtg cct cgg cat gga tct cag aac aat aca ttc gga 527
Glu Asn Pro Gly Val Pro Arg His Gly Ser Gln Asn Asn Thr Phe Gly
160 165 170 175
ttt caa gta gga agt gtt gta cag ttc cat tgc aaa aaa gga cac ctt 575
Phe Gln Val Gly Ser Val Val Gln Phe His Cys Lys Lys Gly His Leu
180 185 190
ctc caa ggg tct aca aca cgc acc tgc ctc cct gat ctt acg tgg agt 623
Leu Gln Gly Ser Thr Thr Arg Thr Cys Leu Pro Asp Leu Thr Trp Ser
195 200 205
ggg att cag cct gaa tgc ata ccc cac agc tgt aaa cag cca gaa act 671
Gly Ile Gln Pro Glu Cys Ile Pro His Ser Cys Lys Gln Pro Glu Thr
210 215 220
cct gct cat gca aat gtc gta ggg atg gac ctt cca tct cat ggg tat 719
Pro Ala His Ala Asn Val Val Gly Met Asp Leu Pro Ser His Gly Tyr
225 230 235
aca ctg att tat acc tgt cag cct ggc ttc ttc tta gca ggt gga aca 767
Thr Leu Ile Tyr Thr Cys Gln Pro Gly Phe Phe Leu Ala Gly Gly Thr
240 245 250 255
gaa cat aga gtg tgt aga tcc gat aac acc tgg act gga aaa gtt ccc 815
Glu His Arg Val Cys Arg Ser Asp Asn Thr Trp Thr Gly Lys Val Pro
260 265 270
att tgt gaa gct ggt tct aaa ata ttg gtg aaa gat cct aga cct gca 863
Ile Cys Glu Ala Gly Ser Lys Ile Leu Val Lys Asp Pro Arg Pro Ala
275 280 285
ctg gga aca ccc agc cca aag cta agt gtt cct gat gat gta ttt gcc 911
Leu Gly Thr Pro Ser Pro Lys Leu Ser Val Pro Asp Asp Val Phe Ala
290 295 300
caa aat tat ata tgg aaa ggc tct tac aat ttc aaa gga agg aaa caa 959
Gln Asn Tyr Ile Trp Lys Gly Ser Tyr Asn Phe Lys Gly Arg Lys Gln
305 310 315
ccc atg acc tta aca gtt act agt ttc aat gct tcc act ggg aga gtt 1007
Pro Met Thr Leu Thr Val Thr Ser Phe Asn Ala Ser Thr Gly Arg Val
320 325 330 335
aac gca aca ctg agc aat agc aac atg gag ctg cta ctt tca ggg gta 1055
Asn Ala Thr Leu Ser Asn Ser Asn Met Glu Leu Leu Leu Ser Gly Val
340 345 350
tat aaa agc cag gaa gct cgc cta atg tta cgc ata tat ctt att aaa 1103
Tyr Lys Ser Gln Glu Ala Arg Leu Met Leu Arg Ile Tyr Leu Ile Lys
355 360 365
gta cct gct cat gct tct gtg aag aaa atg aag gaa gaa aat tgg gca 1151
Val Pro Ala His Ala Ser Val Lys Lys Met Lys Glu Glu Asn Trp Ala
370 375 380
atg gat ggc ttt gtt tct gct gag cct gat gga gct act tat gta ttt 1199
Met Asp Gly Phe Val Ser Ala Glu Pro Asp Gly Ala Thr Tyr Val Phe
385 390 395
caa gga ttt att caa ggc aaa gat tat gga caa ttt ggc cta caa aga 1247
Gln Gly Phe Ile Gln Gly Lys Asp Tyr Gly Gln Phe Gly Leu Gln Arg
400 405 410 415
ctg gga ctg aat atg tca gaa ggt tca aat tct tca aat caa cct cat 1295
Leu Gly Leu Asn Met Ser Glu Gly Ser Asn Ser Ser Asn Gln Pro His
420 425 430
ggt aca aat agt agt tct gta gcc att gct att ctt gtg cct ttt ttt 1343
Gly Thr Asn Ser Ser Ser Val Ala Ile Ala Ile Leu Val Pro Phe Phe
435 440 445
gca ctt ata ttt gca gga ttt gga ttt tat ctt tat aaa caa agg act 1391
Ala Leu Ile Phe Ala Gly Phe Gly Phe Tyr Leu Tyr Lys Gln Arg Thr
450 455 460
gca cct aaa aca cag tat aca gga tgt tca gtt cat gaa aat aac aat 1439
Ala Pro Lys Thr Gln Tyr Thr Gly Cys Ser Val His Glu Asn Asn Asn
465 470 475
ggc caa gca gct ttt gaa aat ccc atg tat gac acc aac gca aag tca 1487
Gly Gln Ala Ala Phe Glu Asn Pro Met Tyr Asp Thr Asn Ala Lys Ser
480 485 490 495
gtg gaa ggg aag gcg gta cga ttt gat ccc aac ttg aac acg gtt tgc 1535
Val Glu Gly Lys Ala Val Arg Phe Asp Pro Asn Leu Asn Thr Val Cys
500 505 510
aca atg gta taacgaggca acctttgcct tcttcagaag cttggaaatc 1584
Thr Met Val
gacacacaaa acagtgcaca tttagttcac tgctaaacaa attaaagcac acttttcagg 1644
acttgctggg tcaccttttc ctgaggaatg atacagaaga gtgttttttc ataaactgga 1704
ccataattct tcatgtttac catggagagt tttacagaaa tttggctgca ctctgagagt 1764
gcttactcac agtttatttg ctttttttaa aaaggagatt attctaaaat ataaacttat 1824
ttgcatatat tggaggagca tccactatca gtatttctgt gctttataaa ctatattaga 1884
gggactgggt ttaaaaaaaa agatataggg tagaaataag agctatactt acaagagtca 1944
atagatcact gacttctctt taactgtcct gagctacatc acgcccacaa atctgtcttt 2004
cataccactc tatctgccat tcattcttgt taaatgaggc attgttttct ttatttaaga 2064
ttgagttatg actttaaatg tttatgagct agtttatgcc tcggaagaca taaggtttcc 2124
cattccaatt tgaatctttc tatttcttta gcatcatatt gctcttcttc ttttagactg 2184
gttgtcatag tttctgcaaa gggcaagttc attgtgctac cctcccaatt cccccattat 2244
ggagtaacca gtattattca gtgagctatg gggtagtgtt tcactccttg cttcaatatg 2304
gttttttcct atttattttt attatttttt aatcccaaat gtgatagtag cttcagtatg 2364
tttcctcacc caaaaaccat ttggctggca gaaattatct agaaaaatta ttattttgca 2424
gttatctgag acaccaaact taccatcctt tattttacaa tatttttgat cgaccaaata 2484
aaatgtccta ataatctact ctatttatat ctacatttcc ttccagctga gtcttcaaat 2544
ttagcattat tttgacactt ttaattttca tttaacaatg gattagaaaa atattttggt 2604
ctagccacta aacataatga ctgaaaaaaa ttattattat aaatcaccaa tataaattta 2664
ttgcctacaa ttgtggactt atcatatttt tatatatttt taaatttctg tacctttatt 2724
gttaaacacc aagtttcaca tacaagaggt ttctttcatt gtcaaattaa gatatgtatt 2784
tgtaagttat tggtggctca tttaaactgt atcatacagt aatgccagca tttttattta 2844
ataggtcatc gtcttccttt tatagctgtt ttacttcatt gaatacgatc atgtacccag 2904
tgttaactct ttgttgttgt ccctttcaaa tatggataac tctgtgaaca gtcttgactt 2964
tttgtatcat gagacagagt gcctaagaga aacccttgca cctgggagcg ctgcttggct 3024
ctatctctac aacatttaat gaatttcttg taaaccactt tgagatacat tgttttggaa 3084
attcatacaa ttttctcaac atggtactct actggcagga tgcagtttta aattttatgc 3144
catcagttat ttattcttgt ttattcaaat gactgcaata acaatgattc attcattaat 3204
gttaaggcta atacatttaa gatcttgttt aaacaatgtt tcttctgcaa ctggctactc 3264
ctcttatatt aatgcataca cttttgtaaa gaagtatatt tttatgaaaa gaaactttgt 3324
aaaagtatga tgtaaatttt cagtgcaatg taaacaaaat aaaaatggat aattgctttt 3384
gt 3386
<210> 31
<211> 3515
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (1)..(2112)
<400> 31
gtt ctc ggt ggt ctc ctc ctg ctt gga gtt gct cag ctg ggt ctc ttt 48
Val Leu Gly Gly Leu Leu Leu Leu Gly Val Ala Gln Leu Gly Leu Phe
1 5 10 15
ctg gag gaa aag ctg cgg gag cat ctg gca gag aag gag aag ctg aac 96
Leu Glu Glu Lys Leu Arg Glu His Leu Ala Glu Lys Glu Lys Leu Asn
20 25 30
gag gag agg cta gag caa gag gag aag ctc aaa gcc aaa atc agg caa 144
Glu Glu Arg Leu Glu Gln Glu Glu Lys Leu Lys Ala Lys Ile Arg Gln
35 40 45
ctg acg gaa gag aag gcg gct ttg gag gag tac att act caa gag aga 192
Leu Thr Glu Glu Lys Ala Ala Leu Glu Glu Tyr Ile Thr Gln Glu Arg
50 55 60
aac aga gcg aaa gag act tta gag gaa gaa cgg aag aga atg caa gaa 240
Asn Arg Ala Lys Glu Thr Leu Glu Glu Glu Arg Lys Arg Met Gln Glu
65 70 75 80
ctg gag agc ctc ctg gcc cag cag aag aag gct ttg gcg aaa agc att 288
Leu Glu Ser Leu Leu Ala Gln Gln Lys Lys Ala Leu Ala Lys Ser Ile
85 90 95
acc cag gag aag aac aga gtg aag gaa gca tta gag gaa gag cag aca 336
Thr Gln Glu Lys Asn Arg Val Lys Glu Ala Leu Glu Glu Glu Gln Thr
100 105 110
aga gtc caa gag ctg gag gaa cgc ttg gcc cgc cag aag gag atc tca 384
Arg Val Gln Glu Leu Glu Glu Arg Leu Ala Arg Gln Lys Glu Ile Ser
115 120 125
gag agc aac att gcg tac gag aaa cgc aaa gca aag gag gcc atg gag 432
Glu Ser Asn Ile Ala Tyr Glu Lys Arg Lys Ala Lys Glu Ala Met Glu
130 135 140
aag gaa aag aaa aag gtg caa gac ctg gag aat cgc tta acc aag cag 480
Lys Glu Lys Lys Lys Val Gln Asp Leu Glu Asn Arg Leu Thr Lys Gln
145 150 155 160
aaa gag gaa tta gaa tta aaa gag caa aaa gag gac gtt tta aat aat 528
Lys Glu Glu Leu Glu Leu Lys Glu Gln Lys Glu Asp Val Leu Asn Asn
165 170 175
aaa tta agt gac gca ctg gcc atg gtt gaa gag act cag aaa aca aag 576
Lys Leu Ser Asp Ala Leu Ala Met Val Glu Glu Thr Gln Lys Thr Lys
180 185 190
gca act gaa agt cta aaa gca gag agc ctc gcc ttg aaa tta aat gaa 624
Ala Thr Glu Ser Leu Lys Ala Glu Ser Leu Ala Leu Lys Leu Asn Glu
195 200 205
aca tta gcc gaa ctg gaa act acc aag aca aaa atg atc atg gtg gaa 672
Thr Leu Ala Glu Leu Glu Thr Thr Lys Thr Lys Met Ile Met Val Glu
210 215 220
gag cgg cta atc ctg cag cag aag atg gta aag gcc ctc cag gat gag 720
Glu Arg Leu Ile Leu Gln Gln Lys Met Val Lys Ala Leu Gln Asp Glu
225 230 235 240
cag gaa tca cag aga cac ggg ttt gga gaa gag atc atg gaa tat aag 768
Gln Glu Ser Gln Arg His Gly Phe Gly Glu Glu Ile Met Glu Tyr Lys
245 250 255
gag caa atc aaa cag cac gcc cag aca att gtg agc ctc gaa gag aaa 816
Glu Gln Ile Lys Gln His Ala Gln Thr Ile Val Ser Leu Glu Glu Lys
260 265 270
ctc cag aaa gtc act cag cac cat aaa aaa ata gaa ggc gag att gca 864
Leu Gln Lys Val Thr Gln His His Lys Lys Ile Glu Gly Glu Ile Ala
275 280 285
aca ttg aag gac aat gac cca gcc cca aag gag gaa agg ccg caa gac 912
Thr Leu Lys Asp Asn Asp Pro Ala Pro Lys Glu Glu Arg Pro Gln Asp
290 295 300
cct ctg gtg gct ccc atg aca gag agc agt gcc aaa gac atg gcg tac 960
Pro Leu Val Ala Pro Met Thr Glu Ser Ser Ala Lys Asp Met Ala Tyr
305 310 315 320
gaa cat ctg ata gat gac tta ttg gct gct cag aag gaa att ctg tct 1008
Glu His Leu Ile Asp Asp Leu Leu Ala Ala Gln Lys Glu Ile Leu Ser
325 330 335
cag cag gaa gtc atc atg aag tta agg aaa gac ctt acc gaa gcc cac 1056
Gln Gln Glu Val Ile Met Lys Leu Arg Lys Asp Leu Thr Glu Ala His
340 345 350
agc aga atg tcg gat ttg aga ggg gag cta aac gag aag cag aag atg 1104
Ser Arg Met Ser Asp Leu Arg Gly Glu Leu Asn Glu Lys Gln Lys Met
355 360 365
gaa ctg gag cag aac gtg gtg ctg gtc cag cag cag agc aag gag ctg 1152
Glu Leu Glu Gln Asn Val Val Leu Val Gln Gln Gln Ser Lys Glu Leu
370 375 380
agt gtg ctc aag gag aag atg gcc cag atg agc agc ctg gta gaa aag 1200
Ser Val Leu Lys Glu Lys Met Ala Gln Met Ser Ser Leu Val Glu Lys
385 390 395 400
aaa gat cgg gag ctg aag gcc ctt gag gag gca ctc agg gct tcc caa 1248
Lys Asp Arg Glu Leu Lys Ala Leu Glu Glu Ala Leu Arg Ala Ser Gln
405 410 415
gag aaa cac aga ctc cag ctg aac aca gag aag gaa cag aag ccc cgg 1296
Glu Lys His Arg Leu Gln Leu Asn Thr Glu Lys Glu Gln Lys Pro Arg
420 425 430
aag aag acc cag acg tgt gac acc tct gtg cag ata gaa ccc gtc cac 1344
Lys Lys Thr Gln Thr Cys Asp Thr Ser Val Gln Ile Glu Pro Val His
435 440 445
act gag gcc ttc tcc agc agc caa gag cag caa tcc ttc agc gat cta 1392
Thr Glu Ala Phe Ser Ser Ser Gln Glu Gln Gln Ser Phe Ser Asp Leu
450 455 460
ggg gtc agg tgc aaa ggg tcc cgg cac gag gag gtc att cag cgt cag 1440
Gly Val Arg Cys Lys Gly Ser Arg His Glu Glu Val Ile Gln Arg Gln
465 470 475 480
aaa aag gcc tta tct gaa ctt cga gcg cga att aaa gaa ctc gag aag 1488
Lys Lys Ala Leu Ser Glu Leu Arg Ala Arg Ile Lys Glu Leu Glu Lys
485 490 495
gcg cgc tca cca gat cat aaa gac cac cag aat gaa tca ttt cta gat 1536
Ala Arg Ser Pro Asp His Lys Asp His Gln Asn Glu Ser Phe Leu Asp
500 505 510
tta aag aac ctc aga atg gaa aac aat gtc cag aaa ata cta ctg gat 1584
Leu Lys Asn Leu Arg Met Glu Asn Asn Val Gln Lys Ile Leu Leu Asp
515 520 525
gca aaa ccg gat ttg cca act ctc tca aga ata gag atc cta gcg cct 1632
Ala Lys Pro Asp Leu Pro Thr Leu Ser Arg Ile Glu Ile Leu Ala Pro
530 535 540
cag aat ggc ctt tgc aac gca agg ttc ggc tca gcc atg gag aag tca 1680
Gln Asn Gly Leu Cys Asn Ala Arg Phe Gly Ser Ala Met Glu Lys Ser
545 550 555 560
ggg aag atg gat gtg gct gag gct tta gag ctc agt gaa aag ctg tac 1728
Gly Lys Met Asp Val Ala Glu Ala Leu Glu Leu Ser Glu Lys Leu Tyr
565 570 575
ctg gat atg agc aaa acc ctc gga agt ctc atg aac atc aag aat atg 1776
Leu Asp Met Ser Lys Thr Leu Gly Ser Leu Met Asn Ile Lys Asn Met
580 585 590
tca ggc cac gtg tcc atg aaa tac ctc tcc cgc cag gag agg gag aag 1824
Ser Gly His Val Ser Met Lys Tyr Leu Ser Arg Gln Glu Arg Glu Lys
595 600 605
gtc aac cag ctt cga caa agg gac ctc gac ctg gtg ttt gat aag atc 1872
Val Asn Gln Leu Arg Gln Arg Asp Leu Asp Leu Val Phe Asp Lys Ile
610 615 620
acc caa ctc aag aac cag ctg ggg agg aaa gag gag ctg ttg aga gga 1920
Thr Gln Leu Lys Asn Gln Leu Gly Arg Lys Glu Glu Leu Leu Arg Gly
625 630 635 640
tat gaa aag gac gtt gaa cag ctc agg cgg agc aaa gtg tcc att gag 1968
Tyr Glu Lys Asp Val Glu Gln Leu Arg Arg Ser Lys Val Ser Ile Glu
645 650 655
atg tac cag tcg cag gtg gca aag ctg gag gat gat atc tac aaa gag 2016
Met Tyr Gln Ser Gln Val Ala Lys Leu Glu Asp Asp Ile Tyr Lys Glu
660 665 670
gcc gaa gag aag gcc ctg ctg aag gag gcc ctg gag cgc atg gag cac 2064
Ala Glu Glu Lys Ala Leu Leu Lys Glu Ala Leu Glu Arg Met Glu His
675 680 685
cag ctg tgc cag gag aag agg atc aac agg gcc atc cgg cag cag aag 2112
Gln Leu Cys Gln Glu Lys Arg Ile Asn Arg Ala Ile Arg Gln Gln Lys
690 695 700
tagatgggag cttccagccc ctgcctcgca gacagatggt gaggatgaaa gaatgcaggc 2172
atccctcgct atctgctgtc cactctcgct gtcgtcatac ctcaggagcc aagtgctttt 2232
ggatggtgag ggtgcttgtc ccgtctctga agcacctgcc cagggcctgg caccagccgg 2292
cggacactgc cttccccatg ctgcccatga cccatgctgc ccgtgaccca tgcaggtggt 2352
ctcctgggtg agtcctgcct cagacccggt agggtctaca cactggaaag tttccccaaa 2412
gcaagtcaca catgagcagc acccgggcgg cttccaaagc tccctgaaag ctggcagagg 2472
ggctttcctg tctgctttgc agttgacata ttcatcatcc actcctgcag tccacaaggc 2532
actggaacag tgacctctaa atagagccag actctccctt tccttcactc ttgcctaagg 2592
tggagcagat cccaaggctg gggttaggca agtcagggaa agaaccacag tcactgtttc 2652
ttcatgaccg tctccaggcc agggactggg ctcacctgag ctttgcagcc gtgatagggc 2712
ttattaatgc ctctaatagc cctgcctggt acgatgatcc ctattttaca gatgaggaaa 2772
ctgcgcctca gagagttaag aggcgcccaa gtgcacacag cggctggtaa ggacagctgg 2832
gacctgaccc cagatccctc tgaccctggt tctcctgctg tctgcagggt tggccgtgag 2892
tcccctcctt gtaactgtca gcttttatgt gtgtgtgcat tctcgtgtgt gtgtacattc 2952
tcatgtgtgt gtgtgtgtgc acatgtgtac catgtgcatg agggtttggc tgtgtgtgac 3012
actatgtgtg tgtgtttgtg tgtgttgcct gcctgagctc agagagagcc aaacccccag 3072
agaagggtgc cccctccacc aaccaggtga gctccttgca gaggcctggc cttcatccca 3132
caaaccttgc agaccacagg ctccctggct tgcagccccc aaaaatgaag gcagcgctct 3192
gctctggacg tggcctttcc agcactcacc actctgaatt aaacatcaca ggcccccatc 3252
tgcacatgtc gtggggctgc ctcgggcaga ggaccgtttc cttttacgct gtgccatgcc 3312
agggagatct gggcagcagc agagtcctga gatgtccttt gatgtaccca caggagagtt 3372
cgctccttcc cgaggcagtg tcccccaggg cttggcaggc caggcccaca gcagagacac 3432
caaaccacag aatgggacgg cagagggccc taaaaagccc attgtcatgc actggctgaa 3492
aatgaaagaa aacctgcaga ttt 3515
<210> 32
<211> 2247
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(2246)
<400> 32
ag aag atg gat ttc aag tcc tca aag cag gcc gat tcc act tcc ata 47
Lys Met Asp Phe Lys Ser Ser Lys Gln Ala Asp Ser Thr Ser Ile
1 5 10 15
gga aag gag gat cct ggg tcc tca cgg aag gca gat ccc atg ttt aca 95
Gly Lys Glu Asp Pro Gly Ser Ser Arg Lys Ala Asp Pro Met Phe Thr
20 25 30
gga aag gca gag cct gaa atc ttg gga aag ggg gat cct gtg gct cct 143
Gly Lys Ala Glu Pro Glu Ile Leu Gly Lys Gly Asp Pro Val Ala Pro
35 40 45
gga agg atg gat ccc atg act gta aga aag gaa gat ctt gga tcc ctg 191
Gly Arg Met Asp Pro Met Thr Val Arg Lys Glu Asp Leu Gly Ser Leu
50 55 60
gga aaa gta gat cct ttg tgc tcc agc aag acg tat aca gtg tca ccg 239
Gly Lys Val Asp Pro Leu Cys Ser Ser Lys Thr Tyr Thr Val Ser Pro
65 70 75
agg aag gag gat cct ggg tct ttg aga aag gtg gat cct gtg tcc tca 287
Arg Lys Glu Asp Pro Gly Ser Leu Arg Lys Val Asp Pro Val Ser Ser
80 85 90 95
gac aaa gtg gac cct gta ttc cca aga aag gag gag ccc agg tat tca 335
Asp Lys Val Asp Pro Val Phe Pro Arg Lys Glu Glu Pro Arg Tyr Ser
100 105 110
gga aaa gag cat cct gtg tcc tca gaa aag gtc gct cct aca tct gca 383
Gly Lys Glu His Pro Val Ser Ser Glu Lys Val Ala Pro Thr Ser Ala
115 120 125
gaa aag gta gat ctt gta ttg tcg gga aag aga gat cct ggg ccc tcg 431
Glu Lys Val Asp Leu Val Leu Ser Gly Lys Arg Asp Pro Gly Pro Ser
130 135 140
gga aag gca gat ccc gtg ccc ttg gaa agc atg gat tct gcg tcc aca 479
Gly Lys Ala Asp Pro Val Pro Leu Glu Ser Met Asp Ser Ala Ser Thr
145 150 155
gga aag aca gag ccg ggg ctc ctg ggc aag ctg att cca ggc tca tca 527
Gly Lys Thr Glu Pro Gly Leu Leu Gly Lys Leu Ile Pro Gly Ser Ser
160 165 170 175
ggc aag aat ggg cct gta tcc tct ggg acc ggg gct cct ggg tcc ttg 575
Gly Lys Asn Gly Pro Val Ser Ser Gly Thr Gly Ala Pro Gly Ser Leu
180 185 190
gga agg ctg gat ccc aca tgc ttg ggg atg gca gat ccc gca tct gtg 623
Gly Arg Leu Asp Pro Thr Cys Leu Gly Met Ala Asp Pro Ala Ser Val
195 200 205
gga aat gta gaa act gtg cct gcc aca aaa gag gac tcc cgg ttc ctg 671
Gly Asn Val Glu Thr Val Pro Ala Thr Lys Glu Asp Ser Arg Phe Leu
210 215 220
gga aag atg gac cct gcc tcc tca gga gag ggg cgt cct gtg tct ggc 719
Gly Lys Met Asp Pro Ala Ser Ser Gly Glu Gly Arg Pro Val Ser Gly
225 230 235
cac acg gat act acg gct tca gca aag aca gat ctc aca tct ttg aaa 767
His Thr Asp Thr Thr Ala Ser Ala Lys Thr Asp Leu Thr Ser Leu Lys
240 245 250 255
aat gtg gat ccc atg tct tca ggc aag gtg gat cca gtt tct ctg gga 815
Asn Val Asp Pro Met Ser Ser Gly Lys Val Asp Pro Val Ser Leu Gly
260 265 270
aag atg gac ccc atg tgc tca gga aag cca gag ctc ttg tct cct gga 863
Lys Met Asp Pro Met Cys Ser Gly Lys Pro Glu Leu Leu Ser Pro Gly
275 280 285
cag gca gag cgt gtg tct gtg gga aag gca gga act gta tcc cca gga 911
Gln Ala Glu Arg Val Ser Val Gly Lys Ala Gly Thr Val Ser Pro Gly
290 295 300
aaa gag gac ccg gtg tcc tcc aga agg gag gac ccc ata tct gct gga 959
Lys Glu Asp Pro Val Ser Ser Arg Arg Glu Asp Pro Ile Ser Ala Gly
305 310 315
agt aga aag aca tca tct gaa aaa gtg aat cct gag tct tca gga aag 1007
Ser Arg Lys Thr Ser Ser Glu Lys Val Asn Pro Glu Ser Ser Gly Lys
320 325 330 335
aca aac cct gtg tct tca ggt cca ggc gat ccc agg tcc ttg ggg aca 1055
Thr Asn Pro Val Ser Ser Gly Pro Gly Asp Pro Arg Ser Leu Gly Thr
340 345 350
gca ggt ccc cca tct gca gta aag gct gag cca gcg acg ggg gga aaa 1103
Ala Gly Pro Pro Ser Ala Val Lys Ala Glu Pro Ala Thr Gly Gly Lys
355 360 365
gga gat ccc ctg tcc tcg gag aag gca ggt ctg gtg gcc tct gga aag 1151
Gly Asp Pro Leu Ser Ser Glu Lys Ala Gly Leu Val Ala Ser Gly Lys
370 375 380
gcg gct ccc aca gcc tca ggg aag gcc gag ccc ctc gcg gtg ggc aag 1199
Ala Ala Pro Thr Ala Ser Gly Lys Ala Glu Pro Leu Ala Val Gly Lys
385 390 395
gag gac cct gtg agc aag gga aag gca gac gct ggc ccc tct gga caa 1247
Glu Asp Pro Val Ser Lys Gly Lys Ala Asp Ala Gly Pro Ser Gly Gln
400 405 410 415
ggg gac tct gtg tct ata ggt aaa gtg gtc tca act cca gga aaa aca 1295
Gly Asp Ser Val Ser Ile Gly Lys Val Val Ser Thr Pro Gly Lys Thr
420 425 430
gtc ccg gtg ccc tcg ggg aag gtg gat ccc gtg tcc ctg gga aaa gca 1343
Val Pro Val Pro Ser Gly Lys Val Asp Pro Val Ser Leu Gly Lys Ala
435 440 445
gaa gct atc cca gag gga aaa gtg ggt tct ctg cct cta gag aag ggg 1391
Glu Ala Ile Pro Glu Gly Lys Val Gly Ser Leu Pro Leu Glu Lys Gly
450 455 460
agt cct gtt acc acc aca aag gcg gat ccc agg gcc tcg ggg aaa gca 1439
Ser Pro Val Thr Thr Thr Lys Ala Asp Pro Arg Ala Ser Gly Lys Ala
465 470 475
cag ccg cag tct ggt ggc aaa gca gaa aca aag ctc cct ggg caa gag 1487
Gln Pro Gln Ser Gly Gly Lys Ala Glu Thr Lys Leu Pro Gly Gln Glu
480 485 490 495
ggc gct gca gca cca gga gaa gca ggg gct gtg tgt ttg aaa aag gag 1535
Gly Ala Ala Ala Pro Gly Glu Ala Gly Ala Val Cys Leu Lys Lys Glu
500 505 510
aca cca cag gcc tca gag aag gtg gat cct gga tcc tgc aga aaa gca 1583
Thr Pro Gln Ala Ser Glu Lys Val Asp Pro Gly Ser Cys Arg Lys Ala
515 520 525
gag ccc ctt gcc tca ggg aag gga gag cct gtg tcc ctg ggg aaa gcc 1631
Glu Pro Leu Ala Ser Gly Lys Gly Glu Pro Val Ser Leu Gly Lys Ala
530 535 540
gac tct gca cct tcc aga aaa acg gag tcc cca tcc ttg ggg aag gtg 1679
Asp Ser Ala Pro Ser Arg Lys Thr Glu Ser Pro Ser Leu Gly Lys Val
545 550 555
gtc ccc ctg agt ctg gag aag acc aag ccg tcc tcc tcc tcc agg cag 1727
Val Pro Leu Ser Leu Glu Lys Thr Lys Pro Ser Ser Ser Ser Arg Gln
560 565 570 575
tta gac cgc aaa gcc ctc ggc tca gcc cgg tct ccc gag ggt gcc agg 1775
Leu Asp Arg Lys Ala Leu Gly Ser Ala Arg Ser Pro Glu Gly Ala Arg
580 585 590
ggc agt gaa ggc cgc gtg gag ccg aag gcc gag ccc gtg tcc agc acc 1823
Gly Ser Glu Gly Arg Val Glu Pro Lys Ala Glu Pro Val Ser Ser Thr
595 600 605
gag gcc tcc agt ctc ggc cag aaa gac ctg gaa gcc gct ggg gcc gag 1871
Glu Ala Ser Ser Leu Gly Gln Lys Asp Leu Glu Ala Ala Gly Ala Glu
610 615 620
aga agc ccc tgc cca gag gcc gca gcg ccc ccg ccg ggg ccg cgg act 1919
Arg Ser Pro Cys Pro Glu Ala Ala Ala Pro Pro Pro Gly Pro Arg Thr
625 630 635
cgc gac aac ttc acc aag gcg ccg tcg tgg gag gcg agc gcc ccg ccg 1967
Arg Asp Asn Phe Thr Lys Ala Pro Ser Trp Glu Ala Ser Ala Pro Pro
640 645 650 655
ccg ccg cgc gag gac gcg ggc act cag gcg ggc gcg cag gcc tgc gtc 2015
Pro Pro Arg Glu Asp Ala Gly Thr Gln Ala Gly Ala Gln Ala Cys Val
660 665 670
tca gtg gcc gtg agc ccc atg tct ccg cag gac ggc gct ggg ggc tcg 2063
Ser Val Ala Val Ser Pro Met Ser Pro Gln Asp Gly Ala Gly Gly Ser
675 680 685
gcc ttc agc ttc cag gcg gcg ccg cgc gcg ccc agc ccg ccc tcg cgc 2111
Ala Phe Ser Phe Gln Ala Ala Pro Arg Ala Pro Ser Pro Pro Ser Arg
690 695 700
cga gat gcg ggc ctg cag gtg tcg ctg ggc gcc gcc gag acg cgc tcc 2159
Arg Asp Ala Gly Leu Gln Val Ser Leu Gly Ala Ala Glu Thr Arg Ser
705 710 715
gtg gcc act ggg ccc atg aca cct caa gcc gcc gcg ccg ccc gcc ttc 2207
Val Ala Thr Gly Pro Met Thr Pro Gln Ala Ala Ala Pro Pro Ala Phe
720 725 730 735
ccc gaa gtg cgg gtg cgg ccc ggc tca gcg ctg gcg gcc g 2247
Pro Glu Val Arg Val Arg Pro Gly Ser Ala Leu Ala Ala
740 745
<210> 33
<211> 3512
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (818)..(2344)
<400> 33
ccaaagatgg gttttgtctg cgggtgattt gggctctcga agtgcaactt tgtggccggg 60
acgcggatcg gccggcctgg gctcctgcag agcagatcct gtctgcgtcc tccaggagga 120
gtgggtggca ggactggggt ttcccacagg ttttggggcg gcggcgagat tggcacggtc 180
cggggtcgca ggcgcgcagc cacgcccctg gaagtccgcc ccggcccccg cccccaaccc 240
gcctcttcgg ggctttatgg cgtgaggttt ggggctggga tccatctgga gccgagcaga 300
aaacttttcc cctcccgttc ccggtccctt ttgtctttct tggacgcggt ggcggcgccg 360
cctgagcggc gactccctct cccctgcccg gcttgctgcg cccggtgccc tccgagggca 420
ggcgcgcctg gactctgcgc ccggatggcg gcggccctct gtgagcaccg gcagcggcgc 480
atcccctgcc ccgaggcctc cggtgccccc ccggcgcggg cataggggcg cccccaccct 540
ccgtccgctt gcaccccttg ccacccgccc cctcgcctga ctcatccgcc cgcggtggcc 600
gcccgagccc tgggatgggg agggagaccg cggctgcccg cggcggccga gattcccgct 660
gacgcccccg accctgccgc cttcttcgtc cgcctccaga ggcgcccgac gtcccgacag 720
ctcctggagt gagaccagga ctgagaacag ggagaggcga cccgaccccc agggcccggt 780
gctcaggaca gcacacagag ccgctgaaaa cgactga aga gag caa ggg att tcc 835
Arg Glu Gln Gly Ile Ser
1 5
tgg gac atc tgg ctc tgg aga gta aaa ggc caa gct atg ata gca act 883
Trp Asp Ile Trp Leu Trp Arg Val Lys Gly Gln Ala Met Ile Ala Thr
10 15 20
ggt gga gtg ata act ggc ctg gcc gcc ttg aaa agg caa gac tct gcc 931
Gly Gly Val Ile Thr Gly Leu Ala Ala Leu Lys Arg Gln Asp Ser Ala
25 30 35
aga tca cag cag cat gtc aac ctc agc ccg tct cct gct acc caa gag 979
Arg Ser Gln Gln His Val Asn Leu Ser Pro Ser Pro Ala Thr Gln Glu
40 45 50
aag aag ccc atc agg cgc cgg ccc cgg gca gat gtt gtg gtt gtt cgt 1027
Lys Lys Pro Ile Arg Arg Arg Pro Arg Ala Asp Val Val Val Val Arg
55 60 65 70
ggc aaa atc cgg ctt tat tcc cca tct ggt ttt ttt ctt att tta gga 1075
Gly Lys Ile Arg Leu Tyr Ser Pro Ser Gly Phe Phe Leu Ile Leu Gly
75 80 85
gtg ctc atc tcc att ata gga att gct atg gcc gtt ctt gga tat tgg 1123
Val Leu Ile Ser Ile Ile Gly Ile Ala Met Ala Val Leu Gly Tyr Trp
90 95 100
ccc caa aaa gaa cat ttt att gat gct gaa aca aca ctg tca aca aat 1171
Pro Gln Lys Glu His Phe Ile Asp Ala Glu Thr Thr Leu Ser Thr Asn
105 110 115
gaa act cag gtc att cgg aat gaa ggc ggt gtg gtg gtt cgc ttc ttt 1219
Glu Thr Gln Val Ile Arg Asn Glu Gly Gly Val Val Val Arg Phe Phe
120 125 130
gag cag cat ttg cat tct gat aag atg aaa atg ctt ggc cca ttc acc 1267
Glu Gln His Leu His Ser Asp Lys Met Lys Met Leu Gly Pro Phe Thr
135 140 145 150
atg ggg att ggc att ttc att ttc att tgt gct aat gcc att ctt cat 1315
Met Gly Ile Gly Ile Phe Ile Phe Ile Cys Ala Asn Ala Ile Leu His
155 160 165
gaa aac cgt gac aaa gag acc aaa atc ata cac atg agg gat atc tat 1363
Glu Asn Arg Asp Lys Glu Thr Lys Ile Ile His Met Arg Asp Ile Tyr
170 175 180
tcc aca gtc att gac att cac acg cta aga atc aag gag caa agg caa 1411
Ser Thr Val Ile Asp Ile His Thr Leu Arg Ile Lys Glu Gln Arg Gln
185 190 195
atg aac ggc atg tac act ggt ttg atg gga gaa aca gaa gta aaa cag 1459
Met Asn Gly Met Tyr Thr Gly Leu Met Gly Glu Thr Glu Val Lys Gln
200 205 210
aat ggg agc tcc tgt gcc tcg aga ttg gca gca aat acg atc gcc tct 1507
Asn Gly Ser Ser Cys Ala Ser Arg Leu Ala Ala Asn Thr Ile Ala Ser
215 220 225 230
ttc tcg ggt ttt cgg agc agt ttt cga atg gac agc tcc gtg gag gag 1555
Phe Ser Gly Phe Arg Ser Ser Phe Arg Met Asp Ser Ser Val Glu Glu
235 240 245
gat gaa ctt atg tta aat gaa ggt aag agt tct ggg cat ctt atg ccc 1603
Asp Glu Leu Met Leu Asn Glu Gly Lys Ser Ser Gly His Leu Met Pro
250 255 260
cct ttg ctc tct gac agc tct gtg tct gtc ttt ggc ctc tat cca cct 1651
Pro Leu Leu Ser Asp Ser Ser Val Ser Val Phe Gly Leu Tyr Pro Pro
265 270 275
cct tcc aag aca act gat gat aag acc agc ggc tct aag aaa tgt gaa 1699
Pro Ser Lys Thr Thr Asp Asp Lys Thr Ser Gly Ser Lys Lys Cys Glu
280 285 290
acc aag tca att gtg tca tcg tcc atc agt gct ttt aca ttg cct gtg 1747
Thr Lys Ser Ile Val Ser Ser Ser Ile Ser Ala Phe Thr Leu Pro Val
295 300 305 310
atc aaa ctt aat aac tgt gtt att gat gag ccc agt ata gat aac atc 1795
Ile Lys Leu Asn Asn Cys Val Ile Asp Glu Pro Ser Ile Asp Asn Ile
315 320 325
act gaa gat gct gac aac ctc aaa agt agg tca agg aat ttg tca atg 1843
Thr Glu Asp Ala Asp Asn Leu Lys Ser Arg Ser Arg Asn Leu Ser Met
330 335 340
gat tcc ctt gtg gtt cct ttg ccc aac acc agt gaa tcc ttc cag ccc 1891
Asp Ser Leu Val Val Pro Leu Pro Asn Thr Ser Glu Ser Phe Gln Pro
345 350 355
gtc agc aca gtg cta cca agg aat aat tcc att ggg gag tcg ttg tcg 1939
Val Ser Thr Val Leu Pro Arg Asn Asn Ser Ile Gly Glu Ser Leu Ser
360 365 370
agt cag tac aag tca tct atg gct ctc gga cct ggg gct gga cag ctc 1987
Ser Gln Tyr Lys Ser Ser Met Ala Leu Gly Pro Gly Ala Gly Gln Leu
375 380 385 390
ttg tct cct ggg gct gcc aga aga cag ttt ggg tcc aat aca tcc ttg 2035
Leu Ser Pro Gly Ala Ala Arg Arg Gln Phe Gly Ser Asn Thr Ser Leu
395 400 405
cat ttg ctc tcg tca cac tca aag tcc ttg gac tta gac cgg ggt ccc 2083
His Leu Leu Ser Ser His Ser Lys Ser Leu Asp Leu Asp Arg Gly Pro
410 415 420
tcc act cta act gtt cag gca gaa caa cgg aaa cat cca agt tgg cct 2131
Ser Thr Leu Thr Val Gln Ala Glu Gln Arg Lys His Pro Ser Trp Pro
425 430 435
agg ttg gat cgg aac aac agc aag gga tat atg aaa cta gag aac aaa 2179
Arg Leu Asp Arg Asn Asn Ser Lys Gly Tyr Met Lys Leu Glu Asn Lys
440 445 450
gaa gac ccg atg gat agg ttg ctt gtg ccc caa gtt gcc atc aaa aag 2227
Glu Asp Pro Met Asp Arg Leu Leu Val Pro Gln Val Ala Ile Lys Lys
455 460 465 470
gac ttt acc aat aag gag aag ctt ctt atg att tca aga tct cac aat 2275
Asp Phe Thr Asn Lys Glu Lys Leu Leu Met Ile Ser Arg Ser His Asn
475 480 485
aat ttg agt ttt gaa cat gat gag ttt ttg agt aac aac cta aag agg 2323
Asn Leu Ser Phe Glu His Asp Glu Phe Leu Ser Asn Asn Leu Lys Arg
490 495 500
gga act tct gaa aca agg ttt taatgttaaa agaatatatc attttacaag 2374
Gly Thr Ser Glu Thr Arg Phe
505
ggtatatatt ttaaaacgat tttcactggt gtttccttct taaagtattg gctgtaagcc 2434
tttttaatca aatggtttgt agtgtattag aattggctgc ttagttctgt aatgaagatg 2494
gttgtatgtt tgggttactt gtgactgcag tactctatgt taccacacat gattttattt 2554
ttctcttcct ttgaaagcat gatctctttt attaatatga atgcaaaatg cttgcatcca 2614
aattaaagct tattttcttt acttttaagt tctttgattg ccctattcat aaaatgaaat 2674
gtccagtatg gaaaacatag ggtaccaaag tgtggaccag gagtacaaat tcagtcccaa 2734
tactcaatac gtattataga tgactatgag tgcaaacctt aggatgtgat tttctgaata 2794
attgttcttt gtaggatttg gttacattat ttaaaatgaa aaagatctag ttttagtgtg 2854
agctcagtaa tgttaattgg ttaagttcat tgtgaatctt gagttttaga taagtagtta 2914
tttttttcaa tatcacttct gtttttagtg atattatatc aagaaacaac gtattcaaga 2974
gccatggctg acagtgccag atatacttag ggataaacat caaaatgcaa ttatagttgc 3034
tataacgtta gatactcgga atcaaaattt atttgcaagc tgacttgata aactaaatga 3094
accaataaaa tttgtagaaa tgctatcctg aaataattat atacatgaag acaatgttga 3154
ctaatgaatt aagatacatt atatactagt taatgctaac tagtctcagt acctgttttt 3214
agccatctgt tactgtccaa tagcacctca ttcccacatt ctattttccc ccggtattct 3274
ttagatccta gtatttggaa aacaatcggc taaccttgac atttcttttt accttcatat 3334
gccactatct cggtagttca aaaaaattta gttcttgata aattgccttg aagtttacct 3394
tgtgctggag agccttatga taactccaaa gactttctta cggtataata catgttgttt 3454
aggattgtgt ttcttagtca ctgaagataa taaatattaa aatggatgtt ttcatcag 3512
<210> 34
<211> 3700
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (224)..(1603)
<400> 34
ctgggtctcc agcacaccct ggagaaaagc aggtcagggg gtggtgcttc tgtttccaga 60
tgagcaaact agggctgagg gagagggcgt tctctggtca gggtggtaga gccagtctcg 120
tggaagccgg ctgagccact gaaacagaat cagttttctc atggaccaaa cggggatagt 180
gtctggcctg cgccacccca agtgtattgt gaagatgaaa tga gaa ggg gtt ttt 235
Glu Gly Val Phe
1
tgt tgt ggt tgc tgg aag gag atc cta gaa ggg agg aat ggg gga gaa 283
Cys Cys Gly Cys Trp Lys Glu Ile Leu Glu Gly Arg Asn Gly Gly Glu
5 10 15 20
gat ggc aaa ggg atc cta gga cga ggc gta tgg gca agg gaa gct tgt 331
Asp Gly Lys Gly Ile Leu Gly Arg Gly Val Trp Ala Arg Glu Ala Cys
25 30 35
ctt tct ggg agg aaa atc tcg aac caa gaa ttt cag gaa gaa ggc act 379
Leu Ser Gly Arg Lys Ile Ser Asn Gln Glu Phe Gln Glu Glu Gly Thr
40 45 50
ggg aaa aga ggg agg agg tgc cct cag atc cat gga ggt gag tgt cag 427
Gly Lys Arg Gly Arg Arg Cys Pro Gln Ile His Gly Gly Glu Cys Gln
55 60 65
ccc cca gat ctc ttg tgt ggc cac cac ttt cac ggc aac cac tcc tca 475
Pro Pro Asp Leu Leu Cys Gly His His Phe His Gly Asn His Ser Ser
70 75 80
ctg gga atc ctt gga gtc tgg tgc ttt cac cac cct gcc ccg gct gct 523
Leu Gly Ile Leu Gly Val Trp Cys Phe His His Pro Ala Pro Ala Ala
85 90 95 100
tca tgt gtg cca cct cgc cct gtt tcc aga ctg atg cag aga aag ttc 571
Ser Cys Val Pro Pro Arg Pro Val Ser Arg Leu Met Gln Arg Lys Phe
105 110 115
tca gag ccc aac act tac atc gat ggc ctg cct agc cag gac cgc cag 619
Ser Glu Pro Asn Thr Tyr Ile Asp Gly Leu Pro Ser Gln Asp Arg Gln
120 125 130
gag gag ctg tat gac gac gtg gac ctg tca gag ctc aca gct gcg gtg 667
Glu Glu Leu Tyr Asp Asp Val Asp Leu Ser Glu Leu Thr Ala Ala Val
135 140 145
gag cct acc gag gaa gcc acc cct gtt gca gat gac cca aat gag aga 715
Glu Pro Thr Glu Glu Ala Thr Pro Val Ala Asp Asp Pro Asn Glu Arg
150 155 160
gaa tct gac cga gtg tac ctg gac ctc aca cct gtc aag tcc ttt ctg 763
Glu Ser Asp Arg Val Tyr Leu Asp Leu Thr Pro Val Lys Ser Phe Leu
165 170 175 180
cat ggc ccc agc agt gca cag gcc cag gcc tcc tcc ccg acg ttg tcc 811
His Gly Pro Ser Ser Ala Gln Ala Gln Ala Ser Ser Pro Thr Leu Ser
185 190 195
tgc ctg gac aat gca act gag gcc ctc ccg gca gac tca ggc cca ggt 859
Cys Leu Asp Asn Ala Thr Glu Ala Leu Pro Ala Asp Ser Gly Pro Gly
200 205 210
ccc acc cca gat gag ccc tgc ata aag tgt cca gag aac ctg gga gaa 907
Pro Thr Pro Asp Glu Pro Cys Ile Lys Cys Pro Glu Asn Leu Gly Glu
215 220 225
cag cag ctg gag agt ttg gag cca gag gat cct tcc ctg aga atc acc 955
Gln Gln Leu Glu Ser Leu Glu Pro Glu Asp Pro Ser Leu Arg Ile Thr
230 235 240
acc gtc aaa atc cag acg gaa cag cag aga atc tcc ttc cca ccg agc 1003
Thr Val Lys Ile Gln Thr Glu Gln Gln Arg Ile Ser Phe Pro Pro Ser
245 250 255 260
tgc ccg gat gcc gtg gtg gcc acc cca cct ggt gcc agc cca cct gtg 1051
Cys Pro Asp Ala Val Val Ala Thr Pro Pro Gly Ala Ser Pro Pro Val
265 270 275
aag gac agg ttg cgc gtg acc agt gca gag atc aag ctt ggc aag aat 1099
Lys Asp Arg Leu Arg Val Thr Ser Ala Glu Ile Lys Leu Gly Lys Asn
280 285 290
cgg aca gaa gct gag gtg aag cgg tac aca gag gag aag gag agg ctt 1147
Arg Thr Glu Ala Glu Val Lys Arg Tyr Thr Glu Glu Lys Glu Arg Leu
295 300 305
gaa aag aag aag gaa gaa atc cgg ggg cac ctg gct cag ctc cgg aaa 1195
Glu Lys Lys Lys Glu Glu Ile Arg Gly His Leu Ala Gln Leu Arg Lys
310 315 320
gag aaa cgg gag cta aag gaa acc cta ctg aaa tgc aca gac aag gaa 1243
Glu Lys Arg Glu Leu Lys Glu Thr Leu Leu Lys Cys Thr Asp Lys Glu
325 330 335 340
gtc ctg gcg agc ctg gag cag aag ctg aag gaa att gac gag gag tgc 1291
Val Leu Ala Ser Leu Glu Gln Lys Leu Lys Glu Ile Asp Glu Glu Cys
345 350 355
cgg ggc gag gag agc agg cgc gtg gac ctg gag ctc agc atc atg gag 1339
Arg Gly Glu Glu Ser Arg Arg Val Asp Leu Glu Leu Ser Ile Met Glu
360 365 370
gtg aag gac aac ctg aag aag gct gag gca ggg cct gtg acg tta ggc 1387
Val Lys Asp Asn Leu Lys Lys Ala Glu Ala Gly Pro Val Thr Leu Gly
375 380 385
acc acc gtg gac acc acc cac ctg gag aat gtg agc ccc cgc ccc aaa 1435
Thr Thr Val Asp Thr Thr His Leu Glu Asn Val Ser Pro Arg Pro Lys
390 395 400
gct gtc aca cct gcc tct gcc cca gac tgt acc cca gtc aac tct gca 1483
Ala Val Thr Pro Ala Ser Ala Pro Asp Cys Thr Pro Val Asn Ser Ala
405 410 415 420
acc aca ctc aag aac agg cct ctc tcg gtc gtg gtc aca ggc aaa ggc 1531
Thr Thr Leu Lys Asn Arg Pro Leu Ser Val Val Val Thr Gly Lys Gly
425 430 435
act gta ctc cag aaa gcc aag gta agc agt cac tcc cag cct ccc ttg 1579
Thr Val Leu Gln Lys Ala Lys Val Ser Ser His Ser Gln Pro Pro Leu
440 445 450
ggg cca gca gaa atg tca cta agg taggggacca tcttctgacc caggtggtgc 1633
Gly Pro Ala Glu Met Ser Leu Arg
455 460
tttgtggctt ttggcaccag ctctgagcaa gggctcggca ctcaccaagg agtggccatg 1693
gctttgccat gccttcagct cagtctctgg cactgaccaa ccagaaaggt gtctgcactt 1753
ggttcgagca tccttccctg tgagcttggg gatgcttcag ttcttgccct ccacacagct 1813
atgggtagat tccaggaggc cagctgcagc tcagggacag attatgactg agttttcaaa 1873
ccaaaaagtc agtcagtgac tccacaggac ttctaccatt aaaactaaag aaacagactc 1933
acataggtgg tttgaattgc ccaaagttgc ccacccaact actggcaaaa ttctcagcaa 1993
gttgtgaatg aggaatccat gctatgatgc ggtttctccc tgatggctag tcaaaaaagt 2053
atcattaatc atttactcat ttactcattt gtacactaaa gaacatgcct agggctgcct 2113
ttgcttttgg caaattctgc tccttattcc tatcctcctt acctgctgga gatatttgga 2173
ggtcccacaa caatggctaa ccaataaatc catgcaagca tctgaacctc cactcccttg 2233
ttgctgtgag agacatcact tataggcact cgctgttgtg ccacagtctc aaaatctctg 2293
aataggcact ctaagcagtg atgaacacac agggtttgaa acatttatca cctctgtgcc 2353
taaaatagtt tttctcacat tttttctatg tatatcaagg ttgctacagt ggataactaa 2413
ctatcttttt ggtggggggt gggtgctggt atttgaatgg taccctagga atgggagaag 2473
aaaggagcaa gttagaaaac aagcttcatc taaagactct catgtcaatg tggaccttgg 2533
tgacaatcct gctttgttaa agcaaaaact atgcgaaagg gtgagtctgt ttagaagaaa 2593
aagcaaagac tgaggtactg tgaatggaga gcttcagcta agaggaggct ctgtcccttt 2653
tcagagccaa aggaaataat acaacaaaaa ggaggcttct ttggagacct aagtctattg 2713
gatgtaaaca agacgttgta tttagggatg ttctgtgttt ctttcttttt tgaagttgtc 2773
atcaattgct ttactaagat ttttaaatag tgaaaacctc ctgtttagac tttggtggaa 2833
gatgaatcaa ggaagcaggg ccctgtctta tgggtcacgt gtctttggtg agtgagaaga 2893
cctaaactcc tggccatcat ctcttatcca atacttagca gttggggatt aaaccatcct 2953
tgccttcagt tctctccaat attaccaggc ccaactcagt cttcagtgat tttaaacagc 3013
attgacatca tctgtaaaac catcatctgt aaaaccatct atgacatgag ttttgagaaa 3073
caataatggg gaaaatattt gggaccaagc tgaagcacta atcccactaa gttaaagact 3133
tctttccagt ccaaggcagg cctgaatcaa ctgtctttaa ataaaatttt aagtgatgct 3193
gtattatata taggaaaaaa tgcttaaaat cctgtcattt agaacagtga aaagtatctt 3253
ttgagattaa agtgactctt tactgtagga aaaatattac tctgtgttta cagattcatt 3313
gctgtggtca ggccattttt aagggaagag ttatttaata taaatagtct ctgattttaa 3373
gttctgttta atgttcattc tccttccaag aacaaagtgg tgatttttgg ttagggtgat 3433
cgccctctta aaattggcag tgctgttcct tgtgctgccc ctgtcttttc ctctgatggc 3493
attttttttt ttttttaaca caggttgaaa catttcatct attatctctg cctcatttct 3553
ggagggttgt gtatcagttc tctaacactt gttcctgaga actaaatgtc ttttttattc 3613
ttatttcctc tctcataaac atttggtgac cttttaccaa gtggtgagtt agttaggttt 3673
tttaaaataa aatgttcatt gtatttg 3700
<210> 35
<211> 3712
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (407)..(1909)
<400> 35
gcgaggacag ctgaataact gggagctcga agttgcttgc tgcctccggg agtggtctgc 60
ttggctgtcg cgttttcctt ttcttttttt tttaaaaaaa aaaaagccaa acgtttgctc 120
accatcacct gcagaagtga gtttttattg aagaagctcc agctgctttc tccttattgc 180
cggcagcctg gcaagcctcc cttagagtga gctacttctc aggccgggca cgacagcagt 240
gctagcacat ttgaaaaagc aggaaaccgc ccctgcatgc agagcttcca gcctgagaac 300
cccaggccag tcctcccagc agctgtgtag ccagagagcc ttggtactgt ggtcacatcc 360
ctcaaaagtg aacagtcgcc atcggaggcg tttggaggag accgtg atg ttg cag 415
Met Leu Gln
1
atg ctg tgg cat ttc cta gct agc ttt ttc ccc agg gct ggg tgc cac 463
Met Leu Trp His Phe Leu Ala Ser Phe Phe Pro Arg Ala Gly Cys His
5 10 15
ggc tcc aga gag ggg gac gat cgt gaa gtc aga ggc acc cca gcc cct 511
Gly Ser Arg Glu Gly Asp Asp Arg Glu Val Arg Gly Thr Pro Ala Pro
20 25 30 35
gcc tgg aga gac cag atg gca agc ttt ttg ggg aaa cag gac gga agg 559
Ala Trp Arg Asp Gln Met Ala Ser Phe Leu Gly Lys Gln Asp Gly Arg
40 45 50
gct gag gcc acg gaa aaa aga ccc acc att ttg ctg gtg gtt gga cct 607
Ala Glu Ala Thr Glu Lys Arg Pro Thr Ile Leu Leu Val Val Gly Pro
55 60 65
gca gag cag ttt cct aag aaa att gta caa gct gga gat aag gac ctt 655
Ala Glu Gln Phe Pro Lys Lys Ile Val Gln Ala Gly Asp Lys Asp Leu
70 75 80
gat ggg cag cta gac ttt gaa gaa ttt gtc cat tat ctc caa gat cat 703
Asp Gly Gln Leu Asp Phe Glu Glu Phe Val His Tyr Leu Gln Asp His
85 90 95
gag aag aag ctg agg ctg gtg ttt aag agt ttg gac aaa aag aat gat 751
Glu Lys Lys Leu Arg Leu Val Phe Lys Ser Leu Asp Lys Lys Asn Asp
100 105 110 115
gga cgc att gac gcg cag gag atc atg cag tcc ctg cgg gac ttg gga 799
Gly Arg Ile Asp Ala Gln Glu Ile Met Gln Ser Leu Arg Asp Leu Gly
120 125 130
gtc aag ata tct gaa cag cag gca gaa aaa att ctc aag aga ata cga 847
Val Lys Ile Ser Glu Gln Gln Ala Glu Lys Ile Leu Lys Arg Ile Arg
135 140 145
acg ggc cat ttc tgg ggc cct gtc acc tac atg gat aaa aac ggc acg 895
Thr Gly His Phe Trp Gly Pro Val Thr Tyr Met Asp Lys Asn Gly Thr
150 155 160
atg acc atc gac tgg aac gag tgg aga gac tac cac ctc ctc cac ccc 943
Met Thr Ile Asp Trp Asn Glu Trp Arg Asp Tyr His Leu Leu His Pro
165 170 175
gtg gaa aac atc ccc gag atc atc ctc tac tgg aag cat tcc acg atc 991
Val Glu Asn Ile Pro Glu Ile Ile Leu Tyr Trp Lys His Ser Thr Ile
180 185 190 195
ttt gat gtg ggt gag aat cta acg gtc ccg gat gag ttc aca gtg gag 1039
Phe Asp Val Gly Glu Asn Leu Thr Val Pro Asp Glu Phe Thr Val Glu
200 205 210
gag agg cag acg ggg atg tgg tgg aga cac ctg gtg gca gga ggt ggg 1087
Glu Arg Gln Thr Gly Met Trp Trp Arg His Leu Val Ala Gly Gly Gly
215 220 225
gca ggg gcc gta tcc aga acc tgc acg gcc ccc ctg gac agg ctc aag 1135
Ala Gly Ala Val Ser Arg Thr Cys Thr Ala Pro Leu Asp Arg Leu Lys
230 235 240
gtg ctc atg cag gtc cat gcc tcc cgc agc aac aac atg ggc atc gtt 1183
Val Leu Met Gln Val His Ala Ser Arg Ser Asn Asn Met Gly Ile Val
245 250 255
ggt ggc ttc act cag atg att cga gaa gga ggg gcc agg tca ctc tgg 1231
Gly Gly Phe Thr Gln Met Ile Arg Glu Gly Gly Ala Arg Ser Leu Trp
260 265 270 275
cgg ggc aat ggc atc aac gtc ctc aaa att gcc ccc gaa tca gcc atc 1279
Arg Gly Asn Gly Ile Asn Val Leu Lys Ile Ala Pro Glu Ser Ala Ile
280 285 290
aaa ttc atg gcc tat gag cag atc aag cgc ctt gtt ggt agt gac cag 1327
Lys Phe Met Ala Tyr Glu Gln Ile Lys Arg Leu Val Gly Ser Asp Gln
295 300 305
gag act ctg agg att cac gag agg ctt gtg gca ggg tcc ttg gca ggg 1375
Glu Thr Leu Arg Ile His Glu Arg Leu Val Ala Gly Ser Leu Ala Gly
310 315 320
gcc atc gcc cag agc agc atc tac cca atg gag gtc ctg aag acc cgg 1423
Ala Ile Ala Gln Ser Ser Ile Tyr Pro Met Glu Val Leu Lys Thr Arg
325 330 335
atg gcg ctg cgg aag aca ggc cag tac tca gga atg ctg gac tgc gcc 1471
Met Ala Leu Arg Lys Thr Gly Gln Tyr Ser Gly Met Leu Asp Cys Ala
340 345 350 355
agg agg atc ctg gcc aga gag ggg gtg gcc gcc ttc tac aaa ggc tat 1519
Arg Arg Ile Leu Ala Arg Glu Gly Val Ala Ala Phe Tyr Lys Gly Tyr
360 365 370
gtc ccc aac atg ctg ggc atc atc ccc tat gcc ggc atc gac ctt gca 1567
Val Pro Asn Met Leu Gly Ile Ile Pro Tyr Ala Gly Ile Asp Leu Ala
375 380 385
gtc tac gag acg ctc aag aat gcc tgg ctg cag cac tat gca gtg aac 1615
Val Tyr Glu Thr Leu Lys Asn Ala Trp Leu Gln His Tyr Ala Val Asn
390 395 400
agc gcg gac ccc ggc gtg ttt gtg ctc ctg gcc tgt ggc acc atg tcc 1663
Ser Ala Asp Pro Gly Val Phe Val Leu Leu Ala Cys Gly Thr Met Ser
405 410 415
agt acc tgt ggc cag ctg gcc agc tac ccc ctg gcc cta gtc agg acc 1711
Ser Thr Cys Gly Gln Leu Ala Ser Tyr Pro Leu Ala Leu Val Arg Thr
420 425 430 435
cgg atg cag gcg caa gcc tct att gag ggc gct ccg gag gtg acc atg 1759
Arg Met Gln Ala Gln Ala Ser Ile Glu Gly Ala Pro Glu Val Thr Met
440 445 450
agc agc ctc ttc aaa cat atc ctg cgg acc gag ggg gcc ttc ggg ctg 1807
Ser Ser Leu Phe Lys His Ile Leu Arg Thr Glu Gly Ala Phe Gly Leu
455 460 465
tac agg ggg ctg gcc ccc aac ttc atg aag gtc atc cca gct gtg agc 1855
Tyr Arg Gly Leu Ala Pro Asn Phe Met Lys Val Ile Pro Ala Val Ser
470 475 480
atc agc tac gtg gtc tac gag aac ctg aag atc acc ctg ggc gtg cag 1903
Ile Ser Tyr Val Val Tyr Glu Asn Leu Lys Ile Thr Leu Gly Val Gln
485 490 495
tcg cgg tgacgggggg agggccgccc ggcagtggac tcgctgatcc tgggccgcag 1959
Ser Arg
500
cctggggtgt gcagccatct cattctgtga atgtgccaac actaagctgt ctcgagccaa 2019
gctgtgaaaa ccctagacgc acccgcaggg agggtgggga gagctggcag gcccagggct 2079
tgtcctgctg accccagcag accctcctgt tggttccagc gaagaccaca ggcattcctt 2139
agggtccagg gtcagcaggc tccgggctca catgtgtaag gacaggacat tttctgcagt 2199
gcctgccaat agtgagcttg gagcctggag gccggcttag ttcttccatt tcacccttgc 2259
agccagctgt tggccacggc ccctgccctc tggtctgccg tgcatctccc tgtgccctct 2319
tgctgcctgc ctgtctgctg aggtaaggtg ggaggagggc tacagcccac atcccacccc 2379
ctcgtccaat cccataatcc atgatgaaag gtgaggtcac gtggcctccc aggcctgact 2439
tcccaaccta cagcattgac gccaacttgg ctgtgaagga agaggaaagg atctggcctt 2499
gtggtcactg gcatctgagc cctgctgatg gctggggctc tcgggcatgc ttgggagtgc 2559
agggggctcg ggctgcctgg cctggctgca cagaaggcaa gtgctggggc tcatggtgct 2619
ctgagctggc ctggaccctg tcaggatggg ccccacctca gaaccaaact cactgtcccc 2679
actgtggcat gagggcagtg gagcaccatg tttgagggcg aagggcagag cgtttgtgtg 2739
ttctggggag ggaaggaaaa ggtgttggag gccttaatta tggactgttg ggaaaagggt 2799
tttgtccaga aggacaagcc ggacaaatga gcgacttctg tgcttccaga ggaagacgag 2859
ggagcaggag cttggctgac tgctcagagt ctgttctgac gccctggggg ttcctgtcca 2919
accccagcag gggcgcagcg ggaccagccc cacattccac ttgtgtcact gcttggaacc 2979
tatttatttt gtatttattt gaacagagtt atgtcctaac tatttttata gatttgttta 3039
attaatagct tgtcattttc aagttcattt tttattcata tttatgttca tggttgattg 3099
taccttccca agcccgccca gtgggatggg aggaggagga gaaggggggc cttgggccgc 3159
tgcagtcaca tctgtccaga gaaattcctt ttgggactgg aggcagaaaa gcggccagaa 3219
ggcagcagcc ctggctcctt tcctttggca ggttggggaa gggcttgccc ccagccttag 3279
gatttcaggg tttgactggg ggcgtggaga gagagggagg aacctcaata accttgaagg 3339
tggaatccag ttatttcctg cgctgcgagg gtttctttat ttcactcttt tctgaatgtc 3399
aaggcagtga ggtgcctctc actgtgaatt tgtggtgggc gggggctgga ggagagggtg 3459
gggggctggc tccgtccctc ccagccttct gctgcccttg cttaacaatg ccggccaact 3519
ggcgacctca cggttgcact tccattccac cagaatgacc tgatgaggaa atcttcaata 3579
ggatgcaaag atcaatgcaa aaattgttat atatgaacat ataactggag tcgtcaaaaa 3639
gcaaattaag aaagaattgg acgttagaag ttgtcattta aagcagcctt ctaataaagt 3699
tgtttcaaag ctg 3712
<210> 36
<211> 3299
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (3)..(3041)
<400> 36
tt cag ttt ctc ttc gat gtg ctg cag aaa aca ctt tca ctc aag ctg 47
Gln Phe Leu Phe Asp Val Leu Gln Lys Thr Leu Ser Leu Lys Leu
1 5 10 15
gtc cat gtt gct ggt cct ggc ccc aca ggg ccc atc aag att ttc ccc 95
Val His Val Ala Gly Pro Gly Pro Thr Gly Pro Ile Lys Ile Phe Pro
20 25 30
ttc aaa tcc ctt cgg cac ctg gag ctc cga ggt gtt ccc ctc cac tgt 143
Phe Lys Ser Leu Arg His Leu Glu Leu Arg Gly Val Pro Leu His Cys
35 40 45
ctg cat ggc ctc cga ggc atc tac tcc cag ctg gag acc ctg att tgc 191
Leu His Gly Leu Arg Gly Ile Tyr Ser Gln Leu Glu Thr Leu Ile Cys
50 55 60
agc agg agc ctc cag gca tta gag gag ctc ctc tca gcc tgc ggc ggc 239
Ser Arg Ser Leu Gln Ala Leu Glu Glu Leu Leu Ser Ala Cys Gly Gly
65 70 75
gac ttc tgc tct gcc ctc cct tgg ctg gct ctg ctt tct gcc aac ttc 287
Asp Phe Cys Ser Ala Leu Pro Trp Leu Ala Leu Leu Ser Ala Asn Phe
80 85 90 95
agc tac aat gca ctg acc gcc tta gac agc tcc ctg cgc ctc ttg tca 335
Ser Tyr Asn Ala Leu Thr Ala Leu Asp Ser Ser Leu Arg Leu Leu Ser
100 105 110
gct ctg cgt ttc ttg aac cta agc cac aat caa gtc cag gac tgt cag 383
Ala Leu Arg Phe Leu Asn Leu Ser His Asn Gln Val Gln Asp Cys Gln
115 120 125
gga ttc ctg atg gat ttg tgt gag ctc cac cat ctg gac atc tcc tat 431
Gly Phe Leu Met Asp Leu Cys Glu Leu His His Leu Asp Ile Ser Tyr
130 135 140
aat cgc ctg cat ttg gtg cca aga atg gga ccc tca ggg gct gct ctg 479
Asn Arg Leu His Leu Val Pro Arg Met Gly Pro Ser Gly Ala Ala Leu
145 150 155
ggg gtc ctg ata ctg cga ggc aat gag ctt cgg agc ctg cat ggc cta 527
Gly Val Leu Ile Leu Arg Gly Asn Glu Leu Arg Ser Leu His Gly Leu
160 165 170 175
gag cag ctg agg aat ctg cgg cac ctg gat ttg gca tac aac ctg ctg 575
Glu Gln Leu Arg Asn Leu Arg His Leu Asp Leu Ala Tyr Asn Leu Leu
180 185 190
gaa gga cac cgg gag ctg tca cca ctg tgg ctg ctg gct gag ctc cgc 623
Glu Gly His Arg Glu Leu Ser Pro Leu Trp Leu Leu Ala Glu Leu Arg
195 200 205
aag ctc tac ctg gag ggg aac cct ctt tgg ttc cac cct gag cac cga 671
Lys Leu Tyr Leu Glu Gly Asn Pro Leu Trp Phe His Pro Glu His Arg
210 215 220
gca gcc act gcc cag tac ttg tca ccc cgg gcc agg gat gct gct act 719
Ala Ala Thr Ala Gln Tyr Leu Ser Pro Arg Ala Arg Asp Ala Ala Thr
225 230 235
ggc ttc ctt ctc gat ggc aag gtc ttg tca ctg aca gat ttt cag act 767
Gly Phe Leu Leu Asp Gly Lys Val Leu Ser Leu Thr Asp Phe Gln Thr
240 245 250 255
cac aca tcc ttg ggg ctc agc ccc atg ggc cca cct ttg ccc tgg cca 815
His Thr Ser Leu Gly Leu Ser Pro Met Gly Pro Pro Leu Pro Trp Pro
260 265 270
gtg ggg agt act cct gaa acc tca ggt ggc cct gac ctg agt gac agc 863
Val Gly Ser Thr Pro Glu Thr Ser Gly Gly Pro Asp Leu Ser Asp Ser
275 280 285
ctc tcc tca ggg ggt gtt gtg acc cag ccc ctg ctt cat aag gtt aag 911
Leu Ser Ser Gly Gly Val Val Thr Gln Pro Leu Leu His Lys Val Lys
290 295 300
agc cga gtc cgt gtg agg cgg gca agc atc tct gaa ccc agt gat acg 959
Ser Arg Val Arg Val Arg Arg Ala Ser Ile Ser Glu Pro Ser Asp Thr
305 310 315
gac ccg gag ccc cga act ctg aac ccc tct ccg gct gga tgg ttc gtg 1007
Asp Pro Glu Pro Arg Thr Leu Asn Pro Ser Pro Ala Gly Trp Phe Val
320 325 330 335
cag cag cac ccg gag ctg gag ctc atg agc agc ttc cgg gaa cgg ttc 1055
Gln Gln His Pro Glu Leu Glu Leu Met Ser Ser Phe Arg Glu Arg Phe
340 345 350
ggc cgc aac tgg ctg cag tac agg agt cac ctg gag ccc tcc gga aac 1103
Gly Arg Asn Trp Leu Gln Tyr Arg Ser His Leu Glu Pro Ser Gly Asn
355 360 365
cct ctg ccg gcc acc ccc act act tct gca ccc agt gca cct cca gcc 1151
Pro Leu Pro Ala Thr Pro Thr Thr Ser Ala Pro Ser Ala Pro Pro Ala
370 375 380
agc tcc cag ggc ccc gac act gca ccc aga cct tca ccc ccg cag gag 1199
Ser Ser Gln Gly Pro Asp Thr Ala Pro Arg Pro Ser Pro Pro Gln Glu
385 390 395
gaa gcc aga ggc ccc cag gag tca cca cag aaa atg tca gag gag gtc 1247
Glu Ala Arg Gly Pro Gln Glu Ser Pro Gln Lys Met Ser Glu Glu Val
400 405 410 415
agg gcg gag cca cag gag gag gaa gag gag aag gag ggg aag gag gag 1295
Arg Ala Glu Pro Gln Glu Glu Glu Glu Glu Lys Glu Gly Lys Glu Glu
420 425 430
aag gag gag ggg gag atg gtg gaa cag gga gaa gag gag gca gga gag 1343
Lys Glu Glu Gly Glu Met Val Glu Gln Gly Glu Glu Glu Ala Gly Glu
435 440 445
gag gaa gaa gag gag cag gac cag aag gaa gtg gaa gcg gaa ctc tgt 1391
Glu Glu Glu Glu Glu Gln Asp Gln Lys Glu Val Glu Ala Glu Leu Cys
450 455 460
cgc ccc ttg ttg gtg tgt ccc ctg gag ggg cct gag ggc gta cgg ggc 1439
Arg Pro Leu Leu Val Cys Pro Leu Glu Gly Pro Glu Gly Val Arg Gly
465 470 475
agg gaa tgc ttt ctc agg gtc act tct gcc cac ctg ttt gag gtg gaa 1487
Arg Glu Cys Phe Leu Arg Val Thr Ser Ala His Leu Phe Glu Val Glu
480 485 490 495
ctc caa gca gct cgc acc ttg gag cga ctg gag ctc cag agt ctg gag 1535
Leu Gln Ala Ala Arg Thr Leu Glu Arg Leu Glu Leu Gln Ser Leu Glu
500 505 510
gca gct gag ata gag ccg gag gcc cag gcc cag agg tcg ccc agg ccc 1583
Ala Ala Glu Ile Glu Pro Glu Ala Gln Ala Gln Arg Ser Pro Arg Pro
515 520 525
acg ggc tca gat ctg ctc cct gga gcc ccc atc ctc agt ctg cgc ttc 1631
Thr Gly Ser Asp Leu Leu Pro Gly Ala Pro Ile Leu Ser Leu Arg Phe
530 535 540
tcc tac atc tgc cct gac cgg cag ttg cgt cgc tat ttg gtg ctg gag 1679
Ser Tyr Ile Cys Pro Asp Arg Gln Leu Arg Arg Tyr Leu Val Leu Glu
545 550 555
cct gat gcc cac gca gct gtc cag gag ctg ctt gcc gtg ttg acc cca 1727
Pro Asp Ala His Ala Ala Val Gln Glu Leu Leu Ala Val Leu Thr Pro
560 565 570 575
gtc acc aat gtg gct cgg gaa cag ctt ggg gag gcc agg gac ctc ctg 1775
Val Thr Asn Val Ala Arg Glu Gln Leu Gly Glu Ala Arg Asp Leu Leu
580 585 590
ctg ggt aga ttc cag tgt cta cgc tgt ggc cat gag ttc aag cca gag 1823
Leu Gly Arg Phe Gln Cys Leu Arg Cys Gly His Glu Phe Lys Pro Glu
595 600 605
gag ccc agg atg gga tta gac agt gag gaa ggc tgg agg cct ctg ttc 1871
Glu Pro Arg Met Gly Leu Asp Ser Glu Glu Gly Trp Arg Pro Leu Phe
610 615 620
caa aag aca gaa tct cct gct gtg tgt cct aac tgt ggt agt gac cac 1919
Gln Lys Thr Glu Ser Pro Ala Val Cys Pro Asn Cys Gly Ser Asp His
625 630 635
gtg gtt ctc ctc gct gtg tct cgg gga acc ccc aac agg gag cgg aaa 1967
Val Val Leu Leu Ala Val Ser Arg Gly Thr Pro Asn Arg Glu Arg Lys
640 645 650 655
cag gga gag cag tct ctg gct cct tct ccg ttt gcc agc cct gtc tgc 2015
Gln Gly Glu Gln Ser Leu Ala Pro Ser Pro Phe Ala Ser Pro Val Cys
660 665 670
cac cct cct ggc cat ggt gac cac ctt gac agg gcc aag aac agc cca 2063
His Pro Pro Gly His Gly Asp His Leu Asp Arg Ala Lys Asn Ser Pro
675 680 685
cct cag gca ccg agc acc cgt gac cat ggt agt tgg agc ctc agt ccc 2111
Pro Gln Ala Pro Ser Thr Arg Asp His Gly Ser Trp Ser Leu Ser Pro
690 695 700
ccc cct gag cgc tgt ggc ctc cgc tct gtg gac cac cga ctc cgg ctc 2159
Pro Pro Glu Arg Cys Gly Leu Arg Ser Val Asp His Arg Leu Arg Leu
705 710 715
ttc ctg gat gtt gag gtg ttc agc gat gcc cag gag gag ttc cag tgc 2207
Phe Leu Asp Val Glu Val Phe Ser Asp Ala Gln Glu Glu Phe Gln Cys
720 725 730 735
tgc ctc aag gtg cca gtg gca ttg gca ggc cac act ggg gag ttc atg 2255
Cys Leu Lys Val Pro Val Ala Leu Ala Gly His Thr Gly Glu Phe Met
740 745 750
tgc ctt gtg gtt gtg tct gac cgc agg ctg tac ctg ttg aag gtg act 2303
Cys Leu Val Val Val Ser Asp Arg Arg Leu Tyr Leu Leu Lys Val Thr
755 760 765
ggg gag atg cgt gag cct cca gct agc tgg ctg cag ctg acc ctg gct 2351
Gly Glu Met Arg Glu Pro Pro Ala Ser Trp Leu Gln Leu Thr Leu Ala
770 775 780
gtt ccc ctg cag gat ctg agt ggc ata gag ctg ggc ctg gca ggc cag 2399
Val Pro Leu Gln Asp Leu Ser Gly Ile Glu Leu Gly Leu Ala Gly Gln
785 790 795
agc ctg cgg cta gag tgg gca gct ggg gcg ggc cgc tgt gtg ctg ctg 2447
Ser Leu Arg Leu Glu Trp Ala Ala Gly Ala Gly Arg Cys Val Leu Leu
800 805 810 815
ccc cga gat gcc agg cat tgc cgg gcc ttc cta gag gag ctc ctt gat 2495
Pro Arg Asp Ala Arg His Cys Arg Ala Phe Leu Glu Glu Leu Leu Asp
820 825 830
gtc ttg cag tct ctg ccc cct gcc tgg agg aac tgt gtc agt gcc aca 2543
Val Leu Gln Ser Leu Pro Pro Ala Trp Arg Asn Cys Val Ser Ala Thr
835 840 845
gag gag gag gtc acc ccc cag cac cgg ctc tgg cca ttg ctg gaa aaa 2591
Glu Glu Glu Val Thr Pro Gln His Arg Leu Trp Pro Leu Leu Glu Lys
850 855 860
gac tca tcc ttg gag gct cgc cag ttc ttc tac ctt cgg gcg ttc ctg 2639
Asp Ser Ser Leu Glu Ala Arg Gln Phe Phe Tyr Leu Arg Ala Phe Leu
865 870 875
gtt gaa ggc cct tcc acc tgc ctc gta tcc ctg ttg ctg act ccg tcc 2687
Val Glu Gly Pro Ser Thr Cys Leu Val Ser Leu Leu Leu Thr Pro Ser
880 885 890 895
acc ctg ttc ctg tta gat gag gat gct gca ggg tcc ccg gca gag ccc 2735
Thr Leu Phe Leu Leu Asp Glu Asp Ala Ala Gly Ser Pro Ala Glu Pro
900 905 910
tct cct cca gca gca tct ggc gaa gcc tct gag aag gtg cct ccc tcg 2783
Ser Pro Pro Ala Ala Ser Gly Glu Ala Ser Glu Lys Val Pro Pro Ser
915 920 925
ggg ccg ggc cct gct gtg cgt gtc agg gag cag cag cca ctc agc agc 2831
Gly Pro Gly Pro Ala Val Arg Val Arg Glu Gln Gln Pro Leu Ser Ser
930 935 940
ctg agc tcc gtg ctg ctc tac cgc tca gcc cct gag gac ttg cgg ctg 2879
Leu Ser Ser Val Leu Leu Tyr Arg Ser Ala Pro Glu Asp Leu Arg Leu
945 950 955
ctc ttc tac gat gag gtg tcc cgg ctg gag agc ttt tgg gca ctc cgt 2927
Leu Phe Tyr Asp Glu Val Ser Arg Leu Glu Ser Phe Trp Ala Leu Arg
960 965 970 975
gtg gtg tgt cag gag cag ctg aca gcc ctg ctt gcc tgg atc cgg gaa 2975
Val Val Cys Gln Glu Gln Leu Thr Ala Leu Leu Ala Trp Ile Arg Glu
980 985 990
cca tgg gag gag ctg ttt tcc atc gga ctc cgg aca gtg atc caa gag 3023
Pro Trp Glu Glu Leu Phe Ser Ile Gly Leu Arg Thr Val Ile Gln Glu
995 1000 1005
gcg ctg gcc ctt gac cga tgagggtccc acgctgacct tggccctgac 3071
Ala Leu Ala Leu Asp Arg
1010
ctcaggagcc acgctgtaga cattccctct cctggtctct gggtctggct tccaggctct 3131
ggctgtggat gtcttcagcc tctgggtgct ggccagtgag gtcccaaatg acccagggct 3191
taagggagag gcgagagaat gatctggcct caggggacag gccacctggt caggaggaat 3251
atttttcctg cactttttct caggtatcaa taaagttgtt tccaactc 3299
<210> 37
<211> 6814
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (517)..(849)
<400> 37
ggttcattgc cttgtgtcct ctgctccggg agacttcggt gattctgcca ctcactccgt 60
cgctctgtga cctgctggca ttgaatgatt tatagctaag actccaggac acccctgaag 120
ccgagaaatg tggaaaagcg agattgtctt tgaccatttg ctatggacat tgtcagggat 180
ggtgcagcct gaatgagaat gtctgcggta tttcccgaaa taagtatccc tcaaggatga 240
tactgtggac ctcactcagg aagaatgtgt ccttcaggga aagctgcaca gagacgtgat 300
gctggagaag cagtcatctg gttactttag catgaatgag agatcctttc gcttcacgtc 360
tttgccagca tttggtgttg tcagtgttct agattttggc cattatagta catgacttca 420
ggctcctatg tgcagaagtc ctcaaccccc aggctgcaga ccagtacctg tccatggcct 480
gttaggaaac tggccgcaca gcaggaggtg aaatga tta ttg cag ttt cga ggg 534
Leu Leu Gln Phe Arg Gly
1 5
gag atg atg tca caa gat ctc gaa ccc ata tct cag aac cgc ctg cct 582
Glu Met Met Ser Gln Asp Leu Glu Pro Ile Ser Gln Asn Arg Leu Pro
10 15 20
cct ccc aca cac tct ggc tgt cct gac cat cac ttc aag ccc acc agc 630
Pro Pro Thr His Ser Gly Cys Pro Asp His His Phe Lys Pro Thr Ser
25 30 35
cct gca gca aaa aaa atc aaa cga gta gtt cag aag gca aag tct gca 678
Pro Ala Ala Lys Lys Ile Lys Arg Val Val Gln Lys Ala Lys Ser Ala
40 45 50
agg gaa gga agt ata ttg gga ggt aca gta agt att cct gtc tgg aga 726
Arg Glu Gly Ser Ile Leu Gly Gly Thr Val Ser Ile Pro Val Trp Arg
55 60 65 70
agg tct tct ttg ggg cta gaa act gag cca gtg cta aag cca tgc agg 774
Arg Ser Ser Leu Gly Leu Glu Thr Glu Pro Val Leu Lys Pro Cys Arg
75 80 85
aag gag gaa ata atc agt gag cca cgg gct gaa ctt gtg gaa aag aaa 822
Lys Glu Glu Ile Ile Ser Glu Pro Arg Ala Glu Leu Val Glu Lys Lys
90 95 100
tgg agg gca agg tca caa acc agt ccc taactgcttc taatttaatg 869
Trp Arg Ala Arg Ser Gln Thr Ser Pro
105 110
taatcctcac tgtttgtcat tattgctttt gatggccatg aaatctgttt tttcccagtt 929
ctctagtgta atttggaatt aatttcccag ctgctttatt ttttttctag aagagtcggg 989
gacattttca ggattagtag aggtgtttct acaacacctt catgccttcg atagtgtgta 1049
agagttcacc aattgaatta ccttattctg ttcagaagta gtaactatgg agtttaacca 1109
ctctggacat attatatact taatggagac catagctgtg tggtgactga aaacatgcat 1169
tgtctactcc ttatggtgag actgctttcc ttcatgaagt cagtaaagaa gggtaacttt 1229
tctgtagatt tttgtatttt taaaaaaggt caagatcttc ggcagttttc tgaattgaat 1289
tgtttgtgat atttgctctg tttttaaaat cttctaaggc cattaccagg ttcagtgctt 1349
atcttagtac aatttaaatt gattccgtga ataatttttc ctgcctccag tttccctttt 1409
acatagtggg ctcatgtttt aatatgttaa acaattctga cattgaaata attttatttt 1469
atatttttgc atccacaaat actaattttc tccatatatt aatggtgatg tagtttctcc 1529
tctgcaatat tttattgcct actcatgttt tatgtcccct ggattacagc actgtcattc 1589
cactgtgccc tatttcttgg ttatgatgaa aagactctta attccagtat gaaatcattt 1649
ctgcattcca taaatttcat ttggcatctg tggaattaag gcatcttcat tgtatttgga 1709
tctctgaatc ataagtgata tttgcctata aattaagcaa tgatatttaa ataagagatt 1769
accacaatca tattttataa atatacaacc atttttaaaa acctgacaaa atctacaaaa 1829
aaaagaaaat gaaatatatg tggagtttta gaaaagactg actaggtcca aggttgaggc 1889
acatagtggt taagcacatt gtcagctccg gactctgcct cttacaagga aggagggggc 1949
tttagatgtt atttagctgt tatttagcct ttctacacct aagttcccct cacctgtaaa 2009
aaaaaaaata ataataaatt gcttactttg tagaatcatt gtaaatgttc tatgtaaaat 2069
atgaatatag tactcacagt aaatatgtgt gtgtgtggtg tgtggtgtgt ggtgtgtagt 2129
tcagcaatag aaaatggaat ttaataagaa cgttatgagt tgaaaacaat ggtctgtgga 2189
caatattgaa tttcagttgc caaaataaaa tagacctcat gtttcccttc aggatcctct 2249
aactctgaat atgaatagaa gtatcagtca ctggaaaaga acaaatgtcc tctagggatg 2309
tggcaatagg aataatctta tctcagacca taattgggtt cctgatgaaa atattttttc 2369
tttagcattg tatttctctt tatttcacca aatgcacatt aaggtctaca gatctgattc 2429
tcaaacacct ggctgtagcc acctcctttg tgatacttta taaaggagcc ccacagagaa 2489
tgcctgcttt tgggttgaaa cattttctca atcataatta ggtacaaact tgttttctat 2549
gttcacaaag tgggcagggc tgtgtgcatt ggcaccactt gccttttgaa tatcttctag 2609
gccatcacag tcagccccat gtactcctgc tgggcagacc tgaaactaca agctcccaaa 2669
tagaatgggt gctccaacat cctctgctgg attgtgatat gctggtaacc gcacgtgact 2729
ggcaagtgga atggcgaaaa caacacaaag aaggatttgg ttactgttct gcagtagtta 2789
ataactgata tggtttggct atgtccccac tcaaacctca tcttgaattc ctatgtgttg 2849
tgggaggaac ttggtgggag gtgattgaat tatgggggca ggtcttttct gtattgttct 2909
cgggatggtg aatgagtctc atgagacctg atggttttaa aaacaggagt ttccctgtac 2969
aagctctctc tttgcctgct cccatccatg taagacctga cttgctcctg cttaccttcc 3029
gccatgattg tgaggcctcc ccaaccatgt ggaactgaaa gtttgttaaa cctttttttc 3089
ttcccagtct tgggtatgtc tttatcagca atgtgaaaaa aaatggacta atacagcaaa 3149
gtggtatcag tagagtgggg tgctgctgaa aagataccca aaaatttgga agcgactttg 3209
gaactgggta acagggaaag gttggaatac ttgagatggc tcaaaagaag aaaggaaaat 3269
gtgggaaagt ttggaacttc ctagagactt gttgaatggt tttacccaaa atgctgatag 3329
caataaagtc caggctaagg tggtctcaga tggagatgag gaacttgttg gaaactggag 3389
caaaggtgac tctcgttatg ttttagcaaa gagactgatg gtattttgcc cctgccctag 3449
agatttgtgg aactttgaac ttgagagaga tgatttaggg tatctggcag aagaaatttc 3509
taagcagcaa agcattcaag aggtgacttg ggtgctgtta aaggcattca ttttttaaat 3569
caatacctcc tgaagtcctc ctgagagcgc cagatatgca ttcgggccac atggagcaga 3629
aaaggagggc tccatcactc cctgctggcc atcagggtgc agtgcaggac actcagagct 3689
caatacttag ttgtttccca gtgccccctg ctagacctcc taccgaatcc tgaaattctc 3749
aatttcttga tctgtctgct cctttggctg aacatgcgtc aaatccacgg gagcatcaca 3809
aaaggattct ttctcagtca tgcctgtgtg ataggacctt cttccagttt ctttcaaaaa 3869
ttgaaaaaaa catctgaatg atgcagtcta cacttctatc ctcagactct gctgcagagc 3929
ttcatcctgc aggacacaga tgactacagg gctgtttgag atggaatctt cctttctaaa 3989
tcttctaggt ttgatcacag tcacaaatga gcagagcaga tgaaagtgaa aaatcttctg 4049
gcttgtccct atttggggag agctctccag aagtattttc aagaaattag tagctattcg 4109
agggagtgaa atacagctgg tttgccttcg agaaaggaaa gtgaagtgtg cttttgtttc 4169
ctctcattta acagttgcaa aaggcaaatt ataaaatagg aatactcatg acactttctt 4229
ataaatactg ctaaacaaaa taatctacat tcatattttt gtactggtaa aatatctgaa 4289
agcaaaatca tgggaagata tttaagtata ttatacatca tgaatttcaa aaacagagtg 4349
aagatcatga tacaggtttc ttatatgaat aactgtatgg tttatatttt ccttcgcctt 4409
ttgaaatact caggaaatgt ttgttgtaca cagaatttga ataaaaattg tccaggccag 4469
gtgcagtggc tcacacctat aatcccagca ctttgggagg ccaaggcagg cagatcacct 4529
gaggtcagaa gttcaagacc agcctgacca atatggtgaa accttgcctc tactaaaaat 4589
gcaaacatta gctgggcgtg atggcaagtg cctgtagtcc cagctacttg agaggctgag 4649
ataggagaat tgcttgaacc caggaggcgg aggttgcagt gagccgagat tgtgccactg 4709
cactccagcc tgggcgacag agtgagactc catctcaaaa taaataaata aataaataaa 4769
taaataaata aatataaaaa taaaaatttc catgcaaact tttcttcaat atatttttgt 4829
tacagttaac gtaaaattag ctgttgtaat agacaaacct gaatttatca gcagtttgat 4889
catcatgtaa tatttatcat tgctggcata taccacctca acacaggctt tcaaatgtca 4949
tgtgactctg cttgtggatt ttcctccaaa tatggattta gggaccccca taccttctat 5009
tggtgtattt ctcattgcta acctgtgtct tcaaatctgg ctatggtgct tgggaaaatt 5069
tcatgcagtt gtatggcaaa aaatatttgg ggcttcacag agggtgtttt cacaggtcag 5129
gcctgtacca ggtaaatgtc gtttccaccc actcagctag gatgcttgta tggccagacc 5189
tcagaacaaa aagcctgggg aagggagtct tgaactgtga gaccagcggg aatacaagat 5249
tggtctggtg aataagacca ctgtttttct tacactatgt gaaaaaaatg ctgttaatat 5309
gagcaacagc cggttttcct gctttcatac atttacgcaa gaaaatgtga cttaaaatct 5369
aattccacag attaaaccta gagaatatat ttcagggaag agggcaaaaa ggtcaataag 5429
gctcacactg ttgaccatct gaaaggggat tcccattgaa tcccaaaagc acagagtaga 5489
gttgaaatat tccctagaca agtgttgtca aaaaaaatat gaaagtacat tacatatgta 5549
ggaatattta gatatgaact gacactagtg ccaacactca gtcaccacga gaggccctca 5609
agcatccagg atagcctgtt cttccctcag cataaccata atgtaacaag ctaacctttt 5669
cacaaagggt ggctatgatt ataaaccgag tatcagtaat gagttaacac aggcaacgat 5729
aatatcttgc tactgtaact taaaaacgtg tttttatgtc catcaaaatc aagtagtgat 5789
tgtgcataaa attttgtaat atttaagggt taggtatcca tcaaacatgt tacagaccag 5849
gcagttactc cagaaatacc tttaaaagga gagaattatt gcaaaccaca taagatattt 5909
agtctgcatt aggaaatgaa aatgcaagaa acattctgca atgtggaatg aacaaatatg 5969
attccttaaa atgcataaaa ctcagcaagg acagaaaacc acgctgtttc tccactgtgc 6029
tgcaaacaca gtggtcaaca aaaccacatt aacattttgc catgcctttg tcaataactt 6089
ttcttggcag cctcagatat agccttcact tgcaagacaa gttgaaaggg tcttgtaaca 6149
tctttctctt ttgaaaacca atttatcaaa atgaaaacct tttattgttg atttcaccat 6209
aaagttagta gaatctcaga aaagatacag ataatgcata tgtgtctgag aaagagctcc 6269
tgtaaagccc aacaccaaat acaggtcata tcatagaagt attggtgtcc tgcattttgc 6329
tcttgtaatt atacctgcgt ttattactct ggggagattc tcaggaccag cattgctgca 6389
ttttgctttt ctatatttta attgttaata gatattgttt ggttgccagc ccctaaatgt 6449
ggaacacttt ataatcgtaa tatgcaaggt atgagaacac agtacttctt tcatcatctg 6509
tgatgtatga ttattcaccc taattttctt gagtctgagc agtttaacgg aggtatattt 6569
ttattagttc actattgttt aaatttcttg tcttttcatg ctaaaaggaa agttgatatt 6629
tatctatatc tatgtctgta tatgcatgta tatctataga agtagatcca tttgtttgaa 6689
caaccctctg taattaattg tgatgaatta tgcctgtttt agaaagtatt ttgtacacat 6749
atttttaata aaaattttcc ccccgaatat ttaaccagta tgacattaaa agagtcagga 6809
atttg 6814
<210> 38
<211> 6734
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2994)..(4055)
<400> 38
caggagacct cccaggaccc tgcaggaaac attcctcatt cacagcatgg gggcctcagc 60
tcctcacctt cccagagcaa cccaggctcc cagaggcccc cgcctacaca gtgccttttc 120
tggcactgct gtgactgtct ccatcactct gtgcctgtga cagtagcact tccttgcaag 180
tttattttct cccaaatctc tttaagtcca ggacattttc tggaaaactg gaagactgga 240
cacaggactg cccccaagca ctggggaagg agggagaatc ccatcttcct gcagctttta 300
tcagagggaa tgcatttaga taagggccaa gggagaaaac agtcctcaga aagtgtggag 360
ggaagtgggg tacaggcctg gacgcctgga attggagaaa ggggagacct ggagaggcag 420
acgtggggat ttcttcaccc cacttttact gcccctatca taagaggctc ccagagggcc 480
atgtgtgagt cactgttctg cactgtcccc tgcgtagact cgactgaagc aggaggaagg 540
agcccccggt aacaccctgg gtgggactgg ggaggccagg gctttggggc ttttcctttc 600
ctaagggcgg tgccatggaa tcttttggct ggagatgaca ttggggtttc tttccctgtg 660
cccagctcca agcaggtcgc tgtgtggcca ccccacccaa aggaatgcac gggttgctgg 720
ctgggagatt cttcaagagc acacactctc aggcccgatc gagctgttct cagtttttgg 780
gtttcccata gaaaacccaa agttttatct tccagcttct tgacaggaaa agtgggcaga 840
aagccttttc cccaccacac tgataaaagg cgtttgagta ctgaaacgtt cttaaaggct 900
gtgtggcttg gctcaccaaa gcagcctcct ggtgcttctc aggtggcctt gaacacagcc 960
tgggccactt gcctgccggc ctcgcccagg aggcacactg tggacccttc caggctgccc 1020
tggccctggg cacagccttg gtttggagtg gggagagttg ggggtgatgc ctggcctggg 1080
gctggactgc tgggagaatg tgctgtcatt gagttctttt cccggcagcc actccgggca 1140
tgggcacaca cccttgttct gtcattggtc aggggacacc aggacaagct gtcccgggag 1200
gcaggcatcc tggggaagga aggagaattc catcttcctg tagcttttat cagagtgaat 1260
gcatttagat aagggccaag agagaatact acctagagag tgtggaagga agtgggggtg 1320
caggcctgta ctcctggaat tagagaaggg ggaggcctgg ctcttgtctc acagactcca 1380
ggctctctcc ttctgagccc agttgttcct ccagcagtac ctggcatggg gttggttctg 1440
taggatggag cctgaagctg cgttttctgt gggttccgtc ttcctctggg atggaggctt 1500
ctgagcaggg agggatttga gctttcatgc agcagtgagg ggctgagggc atgagctgtg 1560
gagggggctc ctgctcctca gagtggaagc agagccatcg aatcagccag gacaggagtc 1620
atcccaaagg ccagaggccc caagtgttga tgctccccat agaaggagcc cgctaataac 1680
actctccagc acataacccc tggcatcttc gccaccacca agagtgctgc cccagactct 1740
cggcacccac cgcacgctaa tcatctctgg cagtttctgt tggcctttgt ctccttaact 1800
ctcctcctct tctaaattgt aagtagtcag ctttcatcca cgcgataaca cgtgcatttc 1860
aagcaaaaca gccaaccaca agcccagccc accaacgcac tgcgcgcctc gttagaggtt 1920
agcgtccatc tctgacaaat gcatgttctc ctggctttgt tttctaaccc acgcctcctg 1980
tctgctgagc cttgctgcct gtttggtgag gagtgtgtgt gtctggtgtg cttttgtgct 2040
attaactctt acacgggttt gctctccgga cctattcgct ggttacccta aaacccgtaa 2100
aggacacacc aggagctggt aaccagtgca gagggacagt cctatccatt tttatagccc 2160
tttggagggg aacactctgc ctgcttctgc atgctgtgtt catgagcagt tttgtaacta 2220
gaccctccta gggatctgga gtcaaatgtg aaattcccaa ggataccggc ccagtgcagg 2280
tttatgaaga tggcaagaaa gaggtctggg gacagggagt tttgttcctc cccattgtga 2340
ggggaaggaa agggtcagag agaccagcct ccctccggag ggtgtcaggg tttatagtca 2400
ctgggctcag ccagcggctc accgtgcacc ctcaaacaag caggctcctc tcctgggtct 2460
tagttgcctg tctaggtccc atgatctaaa tctttgacct cactacttgc ccttataaag 2520
aacaaagtta caagcattct gtcctgcaga cagagcaggt ggcctcgtag gaagttcaga 2580
gccagtgccc actgctctgc aacattttcc accccgggga cccttgtcaa tttgcaaaat 2640
ttcatctgag cctctgtacg tgggggaaag aaaggtcccc tcatagtcat gggtgaaaag 2700
tggtaggccc ccgtgtgtat tgtaggttgg ctgccctggt aaggcttaga taacgaactg 2760
aaagaccctt gtaattagcc aagactgggc cccagaagag aggcgaaaag agcatcaatt 2820
gcccacgagt ggtcctggag caaggggaca cactgggctt cacacactag gtcagcaggt 2880
gtttcaaagc tagtgaccca ctttgcctga catgaggagt tgaagccagg catccagact 2940
tcccggtggc tgccagctgc caggaccaca gcctagggca gctcacaagg tga ggg 2996
Gly
1
gaa gtc agt ggc ccc agc act tgg gtg ttg tcc tca gag aga tca gca 3044
Glu Val Ser Gly Pro Ser Thr Trp Val Leu Ser Ser Glu Arg Ser Ala
5 10 15
cta atc acg gtg ttg cgc tca aca gtg aag gag aag cca cag tat ggc 3092
Leu Ile Thr Val Leu Arg Ser Thr Val Lys Glu Lys Pro Gln Tyr Gly
20 25 30
aag aac ccc gtg gtg atg gtg gac gag att atg agc tcc agc cct ccc 3140
Lys Asn Pro Val Val Met Val Asp Glu Ile Met Ser Ser Ser Pro Pro
35 40 45
aag ttc acc ttc cct gaa gca ggc tta cga atc atg atc acc aat aag 3188
Lys Phe Thr Phe Pro Glu Ala Gly Leu Arg Ile Met Ile Thr Asn Lys
50 55 60 65
ttt gga ccc agg acc cga cta cgg atg gcc agc agg atc atc att aat 3236
Phe Gly Pro Arg Thr Arg Leu Arg Met Ala Ser Arg Ile Ile Ile Asn
70 75 80
gag cgg cag aga ctg atc aac tcg gcc aat ggt gtg agc agt aag ccg 3284
Glu Arg Gln Arg Leu Ile Asn Ser Ala Asn Gly Val Ser Ser Lys Pro
85 90 95
ctt caa aac ggg agg cac gag aac att gag aac ggg aat gtt cct gtg 3332
Leu Gln Asn Gly Arg His Glu Asn Ile Glu Asn Gly Asn Val Pro Val
100 105 110
gaa aac ccc gaa gac cct cag cag aat cag gag cag cag ccg ccg cca 3380
Glu Asn Pro Glu Asp Pro Gln Gln Asn Gln Glu Gln Gln Pro Pro Pro
115 120 125
cag cca cca ccg cca gag cca gag ccg gtg gag gct gac ttc ctg tcc 3428
Gln Pro Pro Pro Pro Glu Pro Glu Pro Val Glu Ala Asp Phe Leu Ser
130 135 140 145
ccc ttc tcc gtg ccg gag gcc aga ggg gac aag gtc aag tgg gtg ttc 3476
Pro Phe Ser Val Pro Glu Ala Arg Gly Asp Lys Val Lys Trp Val Phe
150 155 160
acc tgg ccc ctc atc ttc ctc ctg tgc gtc acc att ccc aac tgc agc 3524
Thr Trp Pro Leu Ile Phe Leu Leu Cys Val Thr Ile Pro Asn Cys Ser
165 170 175
aag ccc cgc tgg gag aag ttc ttc atg gtc acc ttc atc acc gcc acg 3572
Lys Pro Arg Trp Glu Lys Phe Phe Met Val Thr Phe Ile Thr Ala Thr
180 185 190
ctg tgg atc gct gtg ttc tcc tac atc atg gtg tgg ctg gtg act att 3620
Leu Trp Ile Ala Val Phe Ser Tyr Ile Met Val Trp Leu Val Thr Ile
195 200 205
atc gga tac aca ctt ggg atc ccg gat gtc atc atg ggc att act ttc 3668
Ile Gly Tyr Thr Leu Gly Ile Pro Asp Val Ile Met Gly Ile Thr Phe
210 215 220 225
ctg gca gca ggg aca agt gtt cca gac tgc atg gcc agc cta att gtg 3716
Leu Ala Ala Gly Thr Ser Val Pro Asp Cys Met Ala Ser Leu Ile Val
230 235 240
gcg aga caa ggc ctt ggg gac atg gca gtc tcc aac acc ata gga agc 3764
Ala Arg Gln Gly Leu Gly Asp Met Ala Val Ser Asn Thr Ile Gly Ser
245 250 255
aac gtg ttt gac atc ctg gta gga ctt ggt gta ccg tgg ggc ctg cag 3812
Asn Val Phe Asp Ile Leu Val Gly Leu Gly Val Pro Trp Gly Leu Gln
260 265 270
acc atg gtt gtt aat tat gga tca aca gtg aag atc aac agc cgg ggg 3860
Thr Met Val Val Asn Tyr Gly Ser Thr Val Lys Ile Asn Ser Arg Gly
275 280 285
ctg gtc tat tcc gtg gtc ctg ttg ctg ggc tct gtc gct ctc acc gtc 3908
Leu Val Tyr Ser Val Val Leu Leu Leu Gly Ser Val Ala Leu Thr Val
290 295 300 305
ctc ggc atc cac cta aac aag tgg cga ctg gac cgg aag ctg ggt gtc 3956
Leu Gly Ile His Leu Asn Lys Trp Arg Leu Asp Arg Lys Leu Gly Val
310 315 320
tac gtg ctg gtt ctc tac gcc atc ttc ttg tgc ttc tcc ata atg ata 4004
Tyr Val Leu Val Leu Tyr Ala Ile Phe Leu Cys Phe Ser Ile Met Ile
325 330 335
gag ttt aac gtc ttt acc ttc gtc aac ttg ccg atg tgc cgg gaa gac 4052
Glu Phe Asn Val Phe Thr Phe Val Asn Leu Pro Met Cys Arg Glu Asp
340 345 350
gat tagcgctgag tcgcggcccc tgggagctga tctggacacc ctgtgacact 4105
Asp
ggcgtcctcc tctcccctcc ttcccccacc acaggtctct cctgcatagg cagccactgt 4165
ccgttctttc acacactgga aggaagagcc atcgtggtct ttgtctggcc acaggccagg 4225
ctgctgggca tcctcctcct ccttggagtt ccacccctgc aaggctggat ttgggggcca 4285
ttatctgagc agcttcaaag acccctgagc tgccaaccac ggagatgtgc caagcatctc 4345
atctctcctg cacactttag tcagaaggac ttctgcatgc agtttgtctt tctgttctgc 4405
aggcagcttc agaattgagg tcatttgtga gcacaagatc tcatagggca ggtgcaaaat 4465
aggaatgttg ttctcaagtg tcacctccag cccagaggtg gttccttagg cagcatgtgc 4525
tcctgggagc ctctgacttt tgctggaagc acccacagtt tggaaggggc aagacctcaa 4585
cctgttgggg tttagggccc atgatggcag acattctacc ccttttcctg gaaaaactgg 4645
aagaatgaaa ataatttttt tctgtggaag agagaaaatg agtgaatatt cttctcactt 4705
ttattgatgc attcagagaa taagcaatga aatattaaaa aatgaaacat catataggtc 4765
atcatacttg aaaattatca ttccatatga aaggatcatg atacacacca aaaaagtaat 4825
gatcgtaaag acacaaatcc tctgtatgcc atcttgcatt ggcactgagg tgtttggttt 4885
ggaataggga aaaaggtaag agactaacgt ggaaaggtgc taactcagag actggagatt 4945
atagtttaca gctgtacttt ccagatcttc tatgtgacac aatgcactgt ccttgtgggt 5005
ttgtcattta ttggttaatg ctctagtttc aaaaccaccc tgttgaaagt tccagttatt 5065
tatatgccca acaaatttca tagcctgctg aactgaactg agtgtgtcag aagtgctggt 5125
taatgacgag aagagattgc ctgaaaaaca acaaactgct ttctggttag ctgaaggcaa 5185
gtgtgaaaat cagaatttag aatatttaga gctaagcttc tggaaccacg tagtttctac 5245
acgtggcagg ccaagaatgg gaggctgact caaaactaga tagaaaaata taaaataatc 5305
ttcgaccact tgatagctct caaatatata tttaaaagat ttatgaatac aaaccattta 5365
tggtttatga tttctaaaaa gaaagcacaa ttaattttat agagaggttt tttatttttt 5425
taatatttct attgcaaaag tctatccgat ttgatgcact ttgaatattg agatattttg 5485
cacggatgaa tgtatgggaa ctacccatga tgatgtaaga ggaaagaaca tttttttgtg 5545
attcaccaga catcacttta aacttggtga tgagtttaaa tccagtagct aatcccttcc 5605
tgagactcaa agatcgtgac gctggttgga atttctgact gtgcccttta gggcctcctg 5665
agtttcaaaa ggaggaagtg ttcgtgcttg tgtccctgaa gttccctgtt gcatgagcct 5725
gcgacaggac ctcaccccca ccaccaggct tctatttggg attcacatca gtattagtat 5785
cgtagctaca ccaagttcag gcttctcttt ttgttttttt acctagaaat tgggctcagt 5845
ggtcttcaac ttgaggacga gggtgatttt cctaagaaat cagcaaagag ggaaggcagg 5905
gcccctgtag attcaccagt ataaacttca gctgcaggga ttccagagcc ctcgggacca 5965
ctctgtcacc ttaatagcca agttctcctg gttcctccga tcttacaggc tcatccaggt 6025
tccaaagtgc ttctgtctct gttttgattc tccaaactgc tctgtgatgt atgtagggat 6085
tattctcccc acttaacaga aagtagtgtc ttggagaggt caagggtctc tagttcaatg 6145
gccagtcata gcagaaggga ggccaagcac cagtccatca cccctcccag gccagcctct 6205
gtaagttggc cacacttggg gagtgagtgt gggtatgact ttaccctcct ggttggttct 6265
tactgtttga gtcaaaacct catcaatata tcattgactc ctgggttcct caggtcattt 6325
cctaatatct gtccctatcc aatgcctcta ttttatcttg aaaaaaggac caaaaattat 6385
ttttagctat ggcaaggcac aggccacatg gcccctgatg gcgtccctgc tggttttcaa 6445
ttctctgaag ccttgtgtag ctttcagagc acacgtatcc taattaccct cctcttcctc 6505
agcagaaccc atttgagatt ctaaatgaat actcttagtc tctaaagttg cagttagaaa 6565
ctaaaataat gttttttaat atgtaatatg ctcctcttgg ctaattttct tttgacttta 6625
atgtgccaat gtaacttcct ttaaaggatc tatgcattta ttaaatctgg aaaactatat 6685
gtacactgta ggtggaaaat tctctttttt aactaaatat ttttccatc 6734
<210> 39
<211> 5499
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (83)..(1921)
<400> 39
aatggcggaa gtggcaccgt tgccaggcag ccgttgcctg gcgtcgcggg gcgtactctg 60
cgctgggcgc gcggaggcct ag gcg gga agc tcg agc ggc ggc gcc atg gcc 112
Ala Gly Ser Ser Ser Gly Gly Ala Met Ala
1 5 10
cga ggt agc gcg cgg ctg gcg ggg gtc ccg agg atc ccg ggt tcc ggg 160
Arg Gly Ser Ala Arg Leu Ala Gly Val Pro Arg Ile Pro Gly Ser Gly
15 20 25
gtt ccg ggg tcc agg gtc ggc ggc cgg agc gct cag ggc ccc cgc cta 208
Val Pro Gly Ser Arg Val Gly Gly Arg Ser Ala Gln Gly Pro Arg Leu
30 35 40
ggc cct gag gcc aga gtc ccc gct cag gca ggg tcc cgt ctg ccg ggc 256
Gly Pro Glu Ala Arg Val Pro Ala Gln Ala Gly Ser Arg Leu Pro Gly
45 50 55
gcc cac cct cgg gct ccc gga gct tgg ctt ccc tcg ggc gcc cgg cct 304
Ala His Pro Arg Ala Pro Gly Ala Trp Leu Pro Ser Gly Ala Arg Pro
60 65 70
cgt ccc gcc agc cta gct ggc gtt ggc gtc gcc ggg gaa ggc gcg gag 352
Arg Pro Ala Ser Leu Ala Gly Val Gly Val Ala Gly Glu Gly Ala Glu
75 80 85 90
agc ccc ggc tcc tgc ctg gtc ccc gag gcc cgg ccc agc ccc gca gtg 400
Ser Pro Gly Ser Cys Leu Val Pro Glu Ala Arg Pro Ser Pro Ala Val
95 100 105
gcc ccg gct cgc gtg gcc gag tcc cct gac gcc cgg cgc ccg ctc ccc 448
Ala Pro Ala Arg Val Ala Glu Ser Pro Asp Ala Arg Arg Pro Leu Pro
110 115 120
gca gcg tgg cag cac ccg ttc ctc aac gtc ttc aga cac ttc cgg gtg 496
Ala Ala Trp Gln His Pro Phe Leu Asn Val Phe Arg His Phe Arg Val
125 130 135
gac gag tgg aag cgc tcc gcc aag cag ggg gac gtg gcc gtg gtc acg 544
Asp Glu Trp Lys Arg Ser Ala Lys Gln Gly Asp Val Ala Val Val Thr
140 145 150
gac aag acc ctg aag ggc gcc gtg tat cgc att cgg ggc tca gtc tct 592
Asp Lys Thr Leu Lys Gly Ala Val Tyr Arg Ile Arg Gly Ser Val Ser
155 160 165 170
gcc gcc aac tac atc cag ctc cct aag agc agc acc cag tct ctg ggg 640
Ala Ala Asn Tyr Ile Gln Leu Pro Lys Ser Ser Thr Gln Ser Leu Gly
175 180 185
ctg acg gga cga tac ctg tat gtg ctc ttt cgg ccc ctg ccc agc aag 688
Leu Thr Gly Arg Tyr Leu Tyr Val Leu Phe Arg Pro Leu Pro Ser Lys
190 195 200
cac ttc gtc atc cac ctc gat gtg tcc tcc aag gac aac caa gtc atc 736
His Phe Val Ile His Leu Asp Val Ser Ser Lys Asp Asn Gln Val Ile
205 210 215
cgt gtg tct ttc tcc aac ctc ttc aag gag ttt aag tct acg gcc acg 784
Arg Val Ser Phe Ser Asn Leu Phe Lys Glu Phe Lys Ser Thr Ala Thr
220 225 230
tgg ctc cag ttt ccc ttg gtc ctg gag gcc agg aca cct cag aga gat 832
Trp Leu Gln Phe Pro Leu Val Leu Glu Ala Arg Thr Pro Gln Arg Asp
235 240 245 250
ctg gtg ggt ttg gcc ccc tcc gga gcc cgc tgg acc tgc ctg cag ctc 880
Leu Val Gly Leu Ala Pro Ser Gly Ala Arg Trp Thr Cys Leu Gln Leu
255 260 265
gat ctg cag gac gtt ctc ctg gtc tac ctg aac cgg tgc tac ggc cat 928
Asp Leu Gln Asp Val Leu Leu Val Tyr Leu Asn Arg Cys Tyr Gly His
270 275 280
ctc aag agc atc agg ctg tgc gcc agc ctg ctg gtc agg aac ctg tac 976
Leu Lys Ser Ile Arg Leu Cys Ala Ser Leu Leu Val Arg Asn Leu Tyr
285 290 295
acc agt gac ctg tgc ttt gag cct gcc atc tct ggg gcc cag tgg gca 1024
Thr Ser Asp Leu Cys Phe Glu Pro Ala Ile Ser Gly Ala Gln Trp Ala
300 305 310
aag ctg ccc gtg act cct atg cct cgg gaa atg gca ttc cct gtg ccc 1072
Lys Leu Pro Val Thr Pro Met Pro Arg Glu Met Ala Phe Pro Val Pro
315 320 325 330
aag gga gag agc tgg cat gac cgc tac atc cac gtc cgg ttt cca agt 1120
Lys Gly Glu Ser Trp His Asp Arg Tyr Ile His Val Arg Phe Pro Ser
335 340 345
gag agc ttg aaa gtg cct tcc aag ccg att gag aag agc tgt tcc cct 1168
Glu Ser Leu Lys Val Pro Ser Lys Pro Ile Glu Lys Ser Cys Ser Pro
350 355 360
cct gag gca gtc ctc ctg ggg ccg ggg cca cag cct ctc cct tgc ccg 1216
Pro Glu Ala Val Leu Leu Gly Pro Gly Pro Gln Pro Leu Pro Cys Pro
365 370 375
gtg gcc tcc agc aaa cct gtg cgg ttc agt gtg tct cca gtg gtc cag 1264
Val Ala Ser Ser Lys Pro Val Arg Phe Ser Val Ser Pro Val Val Gln
380 385 390
acg ccc agc ccc aca gcc cag tcc ggc cgg gcc gcc ttg gca ccc agg 1312
Thr Pro Ser Pro Thr Ala Gln Ser Gly Arg Ala Ala Leu Ala Pro Arg
395 400 405 410
ccc ttc ccg gag gtc agc ctg tcc caa gag cgc tca gac gcc tcc aac 1360
Pro Phe Pro Glu Val Ser Leu Ser Gln Glu Arg Ser Asp Ala Ser Asn
415 420 425
gcg gat ggc ccc ggt ttc cat agc ctt gag ccc tgg gcc cag ctg gag 1408
Ala Asp Gly Pro Gly Phe His Ser Leu Glu Pro Trp Ala Gln Leu Glu
430 435 440
gcc tct gac atc cac acg gct gct gcc ggc acc cac gtg ttg act cac 1456
Ala Ser Asp Ile His Thr Ala Ala Ala Gly Thr His Val Leu Thr His
445 450 455
gag tcg gct gag gtg ccc gtg gcc cgc acc ggc tcc tgc gaa ggc ttc 1504
Glu Ser Ala Glu Val Pro Val Ala Arg Thr Gly Ser Cys Glu Gly Phe
460 465 470
ctc cca gac cca gtc ctg agg ctc aag ggc gtc atc ggc ttt ggg ggc 1552
Leu Pro Asp Pro Val Leu Arg Leu Lys Gly Val Ile Gly Phe Gly Gly
475 480 485 490
cac ggc acc aga cag gcc ctg tgg acc cca gac ggg gcg gct gtc gtg 1600
His Gly Thr Arg Gln Ala Leu Trp Thr Pro Asp Gly Ala Ala Val Val
495 500 505
tac ccc tgc cat gcg gtc atc gtc gtc ctg ctc gtg gac acg ggg gag 1648
Tyr Pro Cys His Ala Val Ile Val Val Leu Leu Val Asp Thr Gly Glu
510 515 520
cag cgc ttc ttc ctt ggc cac aca gac aag gtc tcc gcc ctg gcg ctg 1696
Gln Arg Phe Phe Leu Gly His Thr Asp Lys Val Ser Ala Leu Ala Leu
525 530 535
gat ggc agc agc tca cta ttg gcc tcg gcc cag gca agg gcc cct agt 1744
Asp Gly Ser Ser Ser Leu Leu Ala Ser Ala Gln Ala Arg Ala Pro Ser
540 545 550
gtg atg cgg ctc tgg gac ttc cag acc ggg cgg tgc ttg tgc ctg ttc 1792
Val Met Arg Leu Trp Asp Phe Gln Thr Gly Arg Cys Leu Cys Leu Phe
555 560 565 570
cgg agc cca atg cac gtt gtc tgc tct ctc agc ttc tct gac agc ggg 1840
Arg Ser Pro Met His Val Val Cys Ser Leu Ser Phe Ser Asp Ser Gly
575 580 585
gcc ctt ctc tgc ggg gtt ggc aag gac cac cac ggg agg acg gta aca 1888
Ala Leu Leu Cys Gly Val Gly Lys Asp His His Gly Arg Thr Val Thr
590 595 600
ggg ccc tgg ctg cgg gtt ggg gtg ggg ctg tcc tgatgcacgc agacagctgg 1941
Gly Pro Trp Leu Arg Val Gly Val Gly Leu Ser
605 610
aagggtcttg gttttctgaa actccagctt catgtgaccc tgggtccctg ctctgtgtcc 2001
tccctgtggt ggggctccct gcactctggt gttcatgccg cccccgtgcc ctgcacaaaa 2061
accaccacca gcagctcaca tttcacgtct cggcttttcg gccatcggag tcggtttaag 2121
ccagcgttta cacacctcgc ctcgttctct ctgctggtga tgcggttgta agattttcac 2181
gtggcagagc ccctgcagct gacccacgct tgggcatcaa tggcacctcc acacagccga 2241
agcacccggc atcttggtgg gggcaccttt ttaaacattt tttttttgag agagagcctt 2301
gttctgtcac ctaggctgga atgcagtggt gcaatcatag ctcactgcag ccttgacccc 2361
ctccctgacc tccccgccag ctggactcca gccatcctcc tgcctcagct tcccagatag 2421
ctgggccaca ggcgtgtgcc accacacctg gctaactttt tttttttttt tttttttgag 2481
atggcatctt gctctgtcgc ccaggatgga gtgcagtggc gcgatcttag ctcactgcaa 2541
gctccgcctc ccaggttcac accattctcc tgcctcagcc tccccagtag ctgggactac 2601
aggcgcctgc caacatgcct ggctaatttt ttttttttgt atttttagta gagataggct 2661
ttcaccttgt tagccaggat ggtctcacgt tttgtttttt taactttttg tagagatagg 2721
gtttcattac attgctcagg ctggtctcaa actccagggc tcaagcagtc ctcccacctc 2781
agcctcccaa agtgctggga tcacaggcat gagccactgt gcccagccac gagctgcttt 2841
ttatccacct tccggcttgc gggggggttt gtacttctct gcgaatgggt tgcagtgctc 2901
agcattgcgg gtctctgtag aagtcccaaa gcttcaggat gagaagatgg aggggctggt 2961
ttccttccac tctccacggc cgcacgctca gaggctggtc tccgcatgag agaaaagcca 3021
cctccgggtg gggctcggct gggccttgca gaatcgaggt ggccccgact ggccctgccg 3081
tgcgggctca gcctgggctt gttgcagatg gtggtggcct ggggcaccgg ccaggtgggc 3141
ctcggtggcg aggtggtcgt tctggcaaag gcgcacactg actttgacgt ccaggccttc 3201
cgggtcacct tttttgatga aaccaggatg gcgtcgtgcg ggcagggcag tgtgcggctc 3261
tggcggctgc gtggcggggt gctgcgttcc tgccccgtgg acttagggga gcaccacgcg 3321
ctgcagttca ccgacctggc cttcaagcag gcccgggacg gctgcccgga gccctcggct 3381
gccatgctct tcgtgtgcag ccgcagtggc cacatcttgg agattgactg tcagcgcatg 3441
gtcgtgcggc atgcccgccg cctgctcccc acacggactc caggcggtcc ccacccacag 3501
aagcagacct tcagctcagg ccccggcatt gccatcagca gcctcagcgt ctccccggcc 3561
atgtgtgctg tgggctctga ggacggcttc ttgcggctct ggcccctgga cttctcctcg 3621
gtgctcctgg aggcaggtga tgctgtgggc acgctctccc aactccggga gagcctcgcc 3681
tggatgctgg ggcggggaag gcccagtccc cggggctctg ctgtcagccg ccctggctcc 3741
tggctgcaca gggtgccacg gggccaagtg gcatatccag agccctgggg cggctgatgc 3801
cagggcggcc cgcggggcag cctcacggaa gggcccggtg ggacgtgggt agtgtgtgaa 3861
gcccctcagc ctgggtcccg cctagcctaa gagtggtggc ctcgagggag ctgcatcttg 3921
cagccgtgtg gagcctggga ctttgagcag cagggagtgc atttgctggg gtgtcgggga 3981
cccgaagcct gagcatgcgg ctcaccccgg ggtggccctg gaggcccctg accccacccc 4041
acccacagag cacgagggcc ccgtcagctc agtctgtgtc agccccgatg gcctccgtgt 4101
gctgtctgcc acctcctcgg gccacctggg cttcctggac acgctgtccc gggtgtacca 4161
catgctggct cgctcccaca ccgccccggt gttggccctc gccatggagc agaggcgggg 4221
acagctggcc accgtgtccc aggaccgtac cgtccgcatc tgggacctgg ccaccctgca 4281
gcagctatac gacttcacat catcagagga cgccccgtgc gctgtcacct tccaccccac 4341
aaggccaacc tttttctgtg gctttagcag tggggccgtg cgctccttca gcctggaggc 4401
cgctgaggtc ctggtggaac acacgtgcca ccgaggagct gtcaccggcc tgaccgccac 4461
ccctgacggc cgcctgctct tcagctcctg ctcccagggc tccctggccc agtacagctg 4521
tgcggacccc cagtggcatg tcctccgagt ggcagcggac atggtatgcc cggatgcccc 4581
cgcgagcccc agcgccctgg cagtcagcag ggatggccgc ctgctggcct ttgtgggacc 4641
ctccaggtgc acagtgacag tcatgggctc ggcctccctt gatgagctgc tgcgagttga 4701
catcggcact ctggacctgg ccagcagccg cctggactca gccatggctg tgtgctttgg 4761
ccctgcagct ctgggccacc tgctggtgtc cacctcgtcc aacagagtcg tggtgctgga 4821
tgctgtgtcg ggccgcatca tccgggagct gcccggtgtc caccctgagc cctgcccctc 4881
cttgacgctc agtgaggacg cccgcttcct gctgattgcc gccggccgga ccatcaaggt 4941
gtgggactac gccacacagg ccagcccagg cccccaggtg tacatcggcc actcggaacc 5001
cgtgcaggct gtggccttct ctcctgacca gcagcaggtc ctcagcgcag gggacgccgt 5061
cttcctctgg gatgtcctgg cccctactga gagcgaccaa agcttccccg gggccccccc 5121
agcctgcaag acaggcccgg gcgcaggacc gctggaggac gcagcgtcca gggccagcga 5181
gctcccccgg cagcaggtcc ccaagccatg tcaggcatct ccaccacggc tgggcgtctg 5241
tgccaggcct cccgaaggtg gcgatggcgc cagggacacc aggaattcgg gggccccacg 5301
caccacctac ctggcttcct gcaaggcctt cacgcctgcc agggtcagct gcagccccca 5361
ctctgccaag ggcacttgcc cgcctcccgc cagcggtggg tggctgcgtc tgaaggctgt 5421
cgtcggttac agcgggaatg ggcgggccaa catggtctgg aggccggaca caggcttctt 5481
tgcctacacg tgcggccg 5499
<210> 40
<211> 5497
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (472)..(4266)
<400> 40
cagtcaagta gcttcccagt cccgaacgcc gcccgtcccc accccgccgt ggccactagc 60
aacgacctct gtgaagttgg agaggcggta acggaggcac tccccctgct gcaccccgcc 120
gtttctacgg ggctcagaaa ccagtttgtt tgtttcgtcg gggtagtgtc gacctgtctt 180
acgggcgtcg cccgagacag gacggagtca aacccgtggt atcaactgaa gacgagtgtc 240
aggtgtggag agtctcagtg ccccctttca gtctggactg tgagctgctg ctggttagac 300
agtcttggtt tctctttcag gatgtcattt tcaaaatgcg ggatggtacc tctgctttat 360
taagccccgt aggaagactg ccacacctag actgatgctt attagtcatc accgttattc 420
ctactaacgt cctgtgtcac tgagtttttt aaatgtctag catatctgta a aga tgc 477
Arg Cys
1
ctt aga aaa aga atc atg gag aag tat gtt aga cta cag aag att gga 525
Leu Arg Lys Arg Ile Met Glu Lys Tyr Val Arg Leu Gln Lys Ile Gly
5 10 15
gaa ggt tca ttt gga aaa gcc att ctt gtt aaa tct aca gaa gat ggc 573
Glu Gly Ser Phe Gly Lys Ala Ile Leu Val Lys Ser Thr Glu Asp Gly
20 25 30
aga cag tat gtt atc aag gaa att aac atc tca aga atg tcc agt aaa 621
Arg Gln Tyr Val Ile Lys Glu Ile Asn Ile Ser Arg Met Ser Ser Lys
35 40 45 50
gaa aga gaa gaa tca agg aga gaa gtt gca gta ttg gca aac atg aag 669
Glu Arg Glu Glu Ser Arg Arg Glu Val Ala Val Leu Ala Asn Met Lys
55 60 65
cat cca aat att gtc cag tat aga gaa tca ttt gaa gaa aat ggc tct 717
His Pro Asn Ile Val Gln Tyr Arg Glu Ser Phe Glu Glu Asn Gly Ser
70 75 80
ctc tac ata gta atg gat tac tgt gag gga ggg gat ctg ttt aag cga 765
Leu Tyr Ile Val Met Asp Tyr Cys Glu Gly Gly Asp Leu Phe Lys Arg
85 90 95
ata aat gct cag aaa ggc gtt ttg ttt caa gag gat cag att ttg gac 813
Ile Asn Ala Gln Lys Gly Val Leu Phe Gln Glu Asp Gln Ile Leu Asp
100 105 110
tgg ttt gta cag ata tgt ttg gcc ctg aaa cat gta cat gat aga aaa 861
Trp Phe Val Gln Ile Cys Leu Ala Leu Lys His Val His Asp Arg Lys
115 120 125 130
att ctt cat cga gac att aaa tct cag aac ata ttt tta act aaa gat 909
Ile Leu His Arg Asp Ile Lys Ser Gln Asn Ile Phe Leu Thr Lys Asp
135 140 145
gga aca gta caa ctt gga gat ttt gga att gct aga gtt ctt aat agt 957
Gly Thr Val Gln Leu Gly Asp Phe Gly Ile Ala Arg Val Leu Asn Ser
150 155 160
act gta gag ctg gct cga act tgc ata ggg acc cca tac tac ttg tca 1005
Thr Val Glu Leu Ala Arg Thr Cys Ile Gly Thr Pro Tyr Tyr Leu Ser
165 170 175
cct gaa atc tgt gaa aac aaa cct tac aat aat aaa agt gac att tgg 1053
Pro Glu Ile Cys Glu Asn Lys Pro Tyr Asn Asn Lys Ser Asp Ile Trp
180 185 190
gct ctg ggg tgt gtc ctt tat gag ctg tgt aca ctt aaa cat gct ttt 1101
Ala Leu Gly Cys Val Leu Tyr Glu Leu Cys Thr Leu Lys His Ala Phe
195 200 205 210
gaa gct ggc agt atg aaa aac ctg gta ctg aag ata ata tct gga tct 1149
Glu Ala Gly Ser Met Lys Asn Leu Val Leu Lys Ile Ile Ser Gly Ser
215 220 225
ttt cca cct gtg tct ttg cat tat tcc tat gat ctc cgc agt ttg gtg 1197
Phe Pro Pro Val Ser Leu His Tyr Ser Tyr Asp Leu Arg Ser Leu Val
230 235 240
tct cag tta ttt aaa aga aat cct agg gat aga cca tca gtc aac tcc 1245
Ser Gln Leu Phe Lys Arg Asn Pro Arg Asp Arg Pro Ser Val Asn Ser
245 250 255
ata ttg gag aaa ggt ttt ata gcc aaa cgc att gaa aag ttt ctc tct 1293
Ile Leu Glu Lys Gly Phe Ile Ala Lys Arg Ile Glu Lys Phe Leu Ser
260 265 270
cct cag ctt att gca gaa gaa ttt tgt cta aaa aca ttt tcg aag ttt 1341
Pro Gln Leu Ile Ala Glu Glu Phe Cys Leu Lys Thr Phe Ser Lys Phe
275 280 285 290
gga tca cag cct ata cca gct aaa aga cca gct tca gga caa aac tcg 1389
Gly Ser Gln Pro Ile Pro Ala Lys Arg Pro Ala Ser Gly Gln Asn Ser
295 300 305
att tct gtt atg cct gct cag aaa att aca aag cct gcc gct aaa tat 1437
Ile Ser Val Met Pro Ala Gln Lys Ile Thr Lys Pro Ala Ala Lys Tyr
310 315 320
gga ata cct tta gca tat aag aaa tat gga gat aaa aaa tta cac gaa 1485
Gly Ile Pro Leu Ala Tyr Lys Lys Tyr Gly Asp Lys Lys Leu His Glu
325 330 335
aag aaa cca ctg caa aaa cat aaa cag gcc cat caa act cca gag aag 1533
Lys Lys Pro Leu Gln Lys His Lys Gln Ala His Gln Thr Pro Glu Lys
340 345 350
aga gtg aat act gga gaa gaa agg agg aaa ata tct gag gaa gca gca 1581
Arg Val Asn Thr Gly Glu Glu Arg Arg Lys Ile Ser Glu Glu Ala Ala
355 360 365 370
aga aag aga agg ctg gaa ttt att gaa aaa gaa aag aaa caa aag gat 1629
Arg Lys Arg Arg Leu Glu Phe Ile Glu Lys Glu Lys Lys Gln Lys Asp
375 380 385
cag att att agt tta atg aag gct gaa caa atg aaa agg caa gaa aag 1677
Gln Ile Ile Ser Leu Met Lys Ala Glu Gln Met Lys Arg Gln Glu Lys
390 395 400
gaa agg ttg gaa aga ata aat agg gcc agg gaa caa gga tgg aga aat 1725
Glu Arg Leu Glu Arg Ile Asn Arg Ala Arg Glu Gln Gly Trp Arg Asn
405 410 415
gtg cta agt gct ggt gga agt ggt gaa gta aag gct cct ttt ctg ggc 1773
Val Leu Ser Ala Gly Gly Ser Gly Glu Val Lys Ala Pro Phe Leu Gly
420 425 430
agt gga ggg act ata gct cca tca tct ttt tct tct cga gga cag tat 1821
Ser Gly Gly Thr Ile Ala Pro Ser Ser Phe Ser Ser Arg Gly Gln Tyr
435 440 445 450
gaa cat tac cat gcc att ttt gac caa atg cag caa caa aga gca gaa 1869
Glu His Tyr His Ala Ile Phe Asp Gln Met Gln Gln Gln Arg Ala Glu
455 460 465
gat aat gaa gct aaa tgg aaa aga gaa ata tat ggt cga ggt ctt cca 1917
Asp Asn Glu Ala Lys Trp Lys Arg Glu Ile Tyr Gly Arg Gly Leu Pro
470 475 480
gaa aga gga att ctg cct gga gtt cgt cca gga ttt cct tat ggg gct 1965
Glu Arg Gly Ile Leu Pro Gly Val Arg Pro Gly Phe Pro Tyr Gly Ala
485 490 495
gca ggt cat cac cat ttt cct gat gct gat gat att aga aaa act ttg 2013
Ala Gly His His His Phe Pro Asp Ala Asp Asp Ile Arg Lys Thr Leu
500 505 510
aaa aga ttg aag gcg gtg tct aaa caa gcc aat gca aac agg caa aaa 2061
Lys Arg Leu Lys Ala Val Ser Lys Gln Ala Asn Ala Asn Arg Gln Lys
515 520 525 530
ggg cag cta gct gta gaa aga gct aaa caa gta gaa gag ttc ctg cag 2109
Gly Gln Leu Ala Val Glu Arg Ala Lys Gln Val Glu Glu Phe Leu Gln
535 540 545
cga aaa cgg gaa gct atg cag aat aaa gct cga gcc gaa gga cat atg 2157
Arg Lys Arg Glu Ala Met Gln Asn Lys Ala Arg Ala Glu Gly His Met
550 555 560
gtt tat ctg gca aga ctg agg caa ata aga cta cag aat ttc aat gag 2205
Val Tyr Leu Ala Arg Leu Arg Gln Ile Arg Leu Gln Asn Phe Asn Glu
565 570 575
cgc caa cag att aaa gcc aaa ctt cgt ggt gaa aag aaa gaa gct aat 2253
Arg Gln Gln Ile Lys Ala Lys Leu Arg Gly Glu Lys Lys Glu Ala Asn
580 585 590
cat tct gaa gga caa gaa gga agt gaa gag gct gac atg agg cgc aaa 2301
His Ser Glu Gly Gln Glu Gly Ser Glu Glu Ala Asp Met Arg Arg Lys
595 600 605 610
aaa atc gaa tca ctg aag gcc cat gca aat gca cgt gct gct gta cta 2349
Lys Ile Glu Ser Leu Lys Ala His Ala Asn Ala Arg Ala Ala Val Leu
615 620 625
aaa gaa caa cta gaa cga aag aga aag gag gct tat gag aga gaa aaa 2397
Lys Glu Gln Leu Glu Arg Lys Arg Lys Glu Ala Tyr Glu Arg Glu Lys
630 635 640
aaa gtg tgg gaa gag cat ttg gtg gct aaa gga gtt aag agt tct gat 2445
Lys Val Trp Glu Glu His Leu Val Ala Lys Gly Val Lys Ser Ser Asp
645 650 655
gtt tct cca cct ttg gga cag cat gaa aca ggt ggc tct cca tca aag 2493
Val Ser Pro Pro Leu Gly Gln His Glu Thr Gly Gly Ser Pro Ser Lys
660 665 670
caa cag atg aga tct gtt att tct gta act tca gct ttg aaa gaa gtt 2541
Gln Gln Met Arg Ser Val Ile Ser Val Thr Ser Ala Leu Lys Glu Val
675 680 685 690
ggc gtg gac agt agt tta act gat acc cgg gaa act tca gaa gag atg 2589
Gly Val Asp Ser Ser Leu Thr Asp Thr Arg Glu Thr Ser Glu Glu Met
695 700 705
caa aag acc aac aat gct att tca agt aag cga gaa ata ctt cgt aga 2637
Gln Lys Thr Asn Asn Ala Ile Ser Ser Lys Arg Glu Ile Leu Arg Arg
710 715 720
tta aat gaa aat ctt aaa gct caa gaa gat gaa aaa gga aag cag aat 2685
Leu Asn Glu Asn Leu Lys Ala Gln Glu Asp Glu Lys Gly Lys Gln Asn
725 730 735
ctc tct gat act ttt gag ata aat gtt cat gaa gat gcc aaa gag cat 2733
Leu Ser Asp Thr Phe Glu Ile Asn Val His Glu Asp Ala Lys Glu His
740 745 750
gaa aaa gaa aaa tca gtt tca tct gat cgc aag aag tgg gag gca gga 2781
Glu Lys Glu Lys Ser Val Ser Ser Asp Arg Lys Lys Trp Glu Ala Gly
755 760 765 770
ggt caa ctt gtg att cct ctg gat gag tta aca cta gat aca tcc ttc 2829
Gly Gln Leu Val Ile Pro Leu Asp Glu Leu Thr Leu Asp Thr Ser Phe
775 780 785
tct aca act gaa aga cat aca gtg gga gaa gtt att aaa tta ggt cct 2877
Ser Thr Thr Glu Arg His Thr Val Gly Glu Val Ile Lys Leu Gly Pro
790 795 800
aat gga tct cca aga aga gcc tgg ggg aaa agt ccg aca gat tct gtt 2925
Asn Gly Ser Pro Arg Arg Ala Trp Gly Lys Ser Pro Thr Asp Ser Val
805 810 815
cta aag ata ctt gga gaa gct gaa cta caa ctt cag aca gaa cta tta 2973
Leu Lys Ile Leu Gly Glu Ala Glu Leu Gln Leu Gln Thr Glu Leu Leu
820 825 830
gaa aat aca act att aga agt gag att tct ccc gaa ggg gaa aag tac 3021
Glu Asn Thr Thr Ile Arg Ser Glu Ile Ser Pro Glu Gly Glu Lys Tyr
835 840 845 850
aaa ccc tta att act gga gaa aaa aaa gta caa tgt att tca cat gaa 3069
Lys Pro Leu Ile Thr Gly Glu Lys Lys Val Gln Cys Ile Ser His Glu
855 860 865
ata aac cca tca gct att gtt gat tct cct gtt gag aca aaa agt ccc 3117
Ile Asn Pro Ser Ala Ile Val Asp Ser Pro Val Glu Thr Lys Ser Pro
870 875 880
gag ttc agt gag gca tct cca cag atg tca ttg aaa ctg gaa gga aat 3165
Glu Phe Ser Glu Ala Ser Pro Gln Met Ser Leu Lys Leu Glu Gly Asn
885 890 895
tta gaa gaa cct gat gat ttg gaa aca gaa att cta caa gag cca agt 3213
Leu Glu Glu Pro Asp Asp Leu Glu Thr Glu Ile Leu Gln Glu Pro Ser
900 905 910
gga aca aac aaa gat gag agc ttg cca tgc act att act gat gtg tgg 3261
Gly Thr Asn Lys Asp Glu Ser Leu Pro Cys Thr Ile Thr Asp Val Trp
915 920 925 930
att agt gag gaa aaa gaa aca aag gaa act cag tcg gca gat agg atc 3309
Ile Ser Glu Glu Lys Glu Thr Lys Glu Thr Gln Ser Ala Asp Arg Ile
935 940 945
acc att cag gaa aat gaa gtt tct gaa gat gga gtc tcg agt act gtg 3357
Thr Ile Gln Glu Asn Glu Val Ser Glu Asp Gly Val Ser Ser Thr Val
950 955 960
gac caa ctt agt gac att cat ata gag cct gga acc aat gat tct cag 3405
Asp Gln Leu Ser Asp Ile His Ile Glu Pro Gly Thr Asn Asp Ser Gln
965 970 975
cac tct aaa tgt gat gta gat aag tct gtg caa ccg gaa cca ttt ttc 3453
His Ser Lys Cys Asp Val Asp Lys Ser Val Gln Pro Glu Pro Phe Phe
980 985 990
cat aag gtg gtt cat tct gaa cac ttg aac tta gtc cct caa gtt caa 3501
His Lys Val Val His Ser Glu His Leu Asn Leu Val Pro Gln Val Gln
995 1000 1005 1010
tca gtt cag tgt tca cca gaa gaa tcc ttt gca ttt cga tct cac tcg 3549
Ser Val Gln Cys Ser Pro Glu Glu Ser Phe Ala Phe Arg Ser His Ser
1015 1020 1025
cat tta cca cca aaa aat aaa aac aag aat tcc ttg ctg att gga ctt 3597
His Leu Pro Pro Lys Asn Lys Asn Lys Asn Ser Leu Leu Ile Gly Leu
1030 1035 1040
tca act ggt ctg ttt gat gca aac aac cca aag atg tta agg aca tgt 3645
Ser Thr Gly Leu Phe Asp Ala Asn Asn Pro Lys Met Leu Arg Thr Cys
1045 1050 1055
tca ctt cca gat ctc tca aag ctg ttc aga acc ctt atg gat gtt ccc 3693
Ser Leu Pro Asp Leu Ser Lys Leu Phe Arg Thr Leu Met Asp Val Pro
1060 1065 1070
acc gta gga gat gtt cgt caa gac aat ctt gaa ata gat gaa att gaa 3741
Thr Val Gly Asp Val Arg Gln Asp Asn Leu Glu Ile Asp Glu Ile Glu
1075 1080 1085 1090
gat gaa aac att aaa gaa gga cct tct gat tct gaa gac att gtg ttt 3789
Asp Glu Asn Ile Lys Glu Gly Pro Ser Asp Ser Glu Asp Ile Val Phe
1095 1100 1105
gaa gaa act gac aca gat tta caa gag ctg cag gcc tcg atg gaa cag 3837
Glu Glu Thr Asp Thr Asp Leu Gln Glu Leu Gln Ala Ser Met Glu Gln
1110 1115 1120
tta ctt agg gaa caa cct ggt gaa gaa tac agt gaa gaa gaa gag tca 3885
Leu Leu Arg Glu Gln Pro Gly Glu Glu Tyr Ser Glu Glu Glu Glu Ser
1125 1130 1135
gtc ttg aag aac agt gat gtg gag cca act gca aat ggg aca gat gtg 3933
Val Leu Lys Asn Ser Asp Val Glu Pro Thr Ala Asn Gly Thr Asp Val
1140 1145 1150
gca gat gaa gat gac aat ccc agc agt gaa agt gcc ctg aac gaa gaa 3981
Ala Asp Glu Asp Asp Asn Pro Ser Ser Glu Ser Ala Leu Asn Glu Glu
1155 1160 1165 1170
tgg cac tca gat aac agt gat ggt gaa att gct agt gaa tgt gaa tgc 4029
Trp His Ser Asp Asn Ser Asp Gly Glu Ile Ala Ser Glu Cys Glu Cys
1175 1180 1185
gat agt gtc ttt aac cat tta gag gaa ctg aga ctt cat ctg gag cag 4077
Asp Ser Val Phe Asn His Leu Glu Glu Leu Arg Leu His Leu Glu Gln
1190 1195 1200
gaa atg ggc ttt gaa aaa ttc ttt gag gtt tat gag aaa ata aag gct 4125
Glu Met Gly Phe Glu Lys Phe Phe Glu Val Tyr Glu Lys Ile Lys Ala
1205 1210 1215
att cat gaa gat gaa gat gaa aat att gaa att tgt tca aaa ata gtt 4173
Ile His Glu Asp Glu Asp Glu Asn Ile Glu Ile Cys Ser Lys Ile Val
1220 1225 1230
caa aat att ttg gga aat gaa cat cag cat ctt tat gcc aag att ctt 4221
Gln Asn Ile Leu Gly Asn Glu His Gln His Leu Tyr Ala Lys Ile Leu
1235 1240 1245 1250
cat tta gtc atg gca gat gga gcc tac caa gaa gat aat gat gaa 4266
His Leu Val Met Ala Asp Gly Ala Tyr Gln Glu Asp Asn Asp Glu
1255 1260 1265
taatcctcaa aatgtttttt aatcctcaac tatatgaaag catttgaatt tggcttatca 4326
gaataacaag cttcagtggg aaatacagca attatttatt taaaaaatca gatttaagat 4386
ggactttctt attgcatgaa aaagatggag aaacatgcca tttttcaatg aagattctaa 4446
tattttatct attttgttca ttgaattcca tggttaaatc tcataaaata tatactttat 4506
taaatcatcc aaccaaagca taggaaacat tgacccagaa cctgacttaa tggttttgaa 4566
gatttactat gcaatagggt aactttgagt ttcagcaaat gtctttaggt tgaaggaatt 4626
acctatgtca tgaaggacct gtctgtggtt tttcaatgga gtctttaagc atgatctttt 4686
ttctgtctag tacttgtttt cattctggcc agcagttcta cattaaatca ccttgtcaag 4746
ggctctgttt acatctacac attttgaaga tgaaattttt agccttaaag tttatattct 4806
caagtccttt tacaatcagt gtgtctcctg aactagcaca caggctgtag aaacagtctt 4866
agaaatcatt gaaagatttg attatgaaag aatagcaaaa ttatatttct tgacataaaa 4926
agttggttta atgcctttat ttctctttaa ggaccagaac caggaatact gtatcgaaaa 4986
attagtctgt ggatttaaca ctgacttagc atatagctta aagttgctct tttggttttt 5046
aacttcctcc atacataagc ttcaaggaca ataagatgtt aaaaaggagg aaataattat 5106
ttttattttg acactgtgac agttttggta actaggatcc tagggaggga aatgtttgcc 5166
tgttgaactt ctttctgtta tgagaggatt tagttaggtc attaagatgt tgatcacaca 5226
gcttcaatca caatatgcca agtataacct ggtttcgtta gaggtgtcta cagtccagat 5286
gttcttcgta ataaaagcaa agtttttgaa cctctgagtc caaagcaggc tggttggcat 5346
aatatgtaat ttgaaaaata aaatcttatc ttgcagcact atcagtatgt tgaatttatt 5406
atgtatatta tttctaatat ccgaaactaa atacttgatt ttttaatatg tgtgtttatt 5466
ttatgatatt gctattaaat ttttattatc t 5497
<210> 41
<211> 4237
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (2393)..(3271)
<400> 41
gcggcactgg ggctgcagcg gcgccgggct ctagagagcc gcaggatcgg ccagagtgcg 60
gactggacac ccgggtccca gatactacag acacccggag aggtggctcc ttcgccctga 120
agccttcctc ggccccctac gcactcgggc cccttccgca gaggattcgc agcgtgagcg 180
ccccgcagcc cgctcaggac cagctcacag gactaaggac caaaggcatt tctgggcact 240
gagatcctac ctctctgcct gcagctatga gcagacgtgt ggttcggcaa agcaagttcc 300
gccatgtgtt tgggcaggca gcaaaggccg accaggccta cgaggacatc cgtgtgtcca 360
aggtcacatg ggacagctcc ttctgtgccg tcaaccccaa attcctggcc attattgtgg 420
aggctggagg cgggggtgcc ttcatcgtcc tgcctctggc caaggtgtgg cagattccag 480
actatacccc catgcgcaac attacggaac ctatcatcac acttgagggc cactccaagc 540
gtgtgggcat cctctcctgg caccctactg ccaggaatgt cctgctcagt gcaggtctgg 600
gggctaagga ggggtccccc atagatgggt gggtggtggc aaaggtggga cccaggtgac 660
catgcctggc cactctgggc aggtggtgac aatgtgatca tcatctggaa tgtgggcacc 720
ggggaggtgc tgctgagcct ggatgatatg cacccagacg tcatccacag tgtgtgctgg 780
aacagcaacg gtagcctgct agccaccacc tgcaaggaca agaccttgcg catcattgac 840
cccagaaaag gccaagtggt ggcggtgagt gcctggcctg gccactcccc agaacccctt 900
gaccagtggc cagctgtcag cacccttgta cccaccatta tcagagcccc tggtcccggc 960
tttcacctgt cgcatccctg ctccagagac tagccccctt cccccgtctc tgcaaccaca 1020
ggtctctgcc ctaagtaagc ccccagcctg caattctctg cccggctcct gttccctccc 1080
ctcccttgca tctcagaagc tcagagaagg gctcctcgct ttgggattga ggaggggaga 1140
agctggaaac tccatcctcc caccagaacc tgcatgaaac ctgagctcct ggacccctct 1200
cctctcagcc ggcacccaga cgcatcttcc agggtctggg gcctgacact ggctcgaggg 1260
ctggcccttt ctccttgtgg acccgctggg cgggggatgg ggcaaccctc gcgcgttgct 1320
tatcccagcc ggatgccctg ggagggcggc ggccctggcc tcgccagcct caagtctaac 1380
ctggggtgcg ggtgcgtggg ccgcgtgcat gtctgcgtgg agcaagcgcg ccgagagcgg 1440
tcggggctgg cgccggggcg ctctaaccca ctaacccggc gagcaggagc aagcccggcc 1500
tcacgagggc gcccgcccgc tgcgggctgt cttcaccgca gacgggaagc tgctcagcac 1560
cggcttcagc aggatgagtg agcggcaact cgcgctctgg gacccggtag gccaggccgc 1620
gccgggcgcg ctagctcgct ccttctctgg gtctgccggg agcgccttcc cgggctgtgg 1680
ctggcgccca agaggccaga gcggagcggg gagctcccat tcgggaaaag cctcgctgcg 1740
actccgggga gccagcagac acccctgctc tagggatcgg cctttgagcg cgcttaaggg 1800
ttcgggatcg gctgagagcc gcgtgcctcc tgctctctcc aggcccggat gctctccagg 1860
tcaggggcca gacgtggccc cgcaggaagc tgagtgctta ggccccattt gtctcttccc 1920
cgcatccgct cctccacccc ggacacccaa gtcagcaagc tggttgctca tccctccatg 1980
ctcctcaggg caggcagctg ggtgatgtcc ctccctgtct ccccctcccc ccaggagagg 2040
tttgcggccc acgaggggat gaggcccatg cgggccgtct tcacgcgcca gggccatatc 2100
ttcaccacgg gcttcacccg catgagccag cgagagctgg gcctgtggga cccggtaacg 2160
cagctggagg cttggggtgt gtgcctcggg gactggcatc acgggagagt ggccaggcgc 2220
cccccacccg cagggcatgc ccacctggac aggactgaag gctgctgcca cctctgcagc 2280
gcctgccatt ctcacaccca cccctctgcc cagttttgct gcgtcgcgtg aaggatcctg 2340
cgctggcgcc cccacctggt gatgccgagt ccctgggggt ggtttgctgt ga ctg cat 2398
Leu His
1
gcg gcg cgc agc ggg tat atg tgc agg gga gga gga gct atg tgc gca 2446
Ala Ala Arg Ser Gly Tyr Met Cys Arg Gly Gly Gly Ala Met Cys Ala
5 10 15
ggg gag gag gag aga tgt gcc cag ggg agg ggc ctg caa ctc tgg tca 2494
Gly Glu Glu Glu Arg Cys Ala Gln Gly Arg Gly Leu Gln Leu Trp Ser
20 25 30
cta gag gtt tgg ggc ata ggg ttt ggg aag gcc agg aag cgt aaa ggg 2542
Leu Glu Val Trp Gly Ile Gly Phe Gly Lys Ala Arg Lys Arg Lys Gly
35 40 45 50
gct tct ggg ggt ccc gaa gta gca cgg gag ggt ggg gca ggg ctg gtc 2590
Ala Ser Gly Gly Pro Glu Val Ala Arg Glu Gly Gly Ala Gly Leu Val
55 60 65
acg cct cct gtg ctc ggg cag aac aac ttc gag gag cca gtg gca ctg 2638
Thr Pro Pro Val Leu Gly Gln Asn Asn Phe Glu Glu Pro Val Ala Leu
70 75 80
cag gag atg gac aca agc aac ggg gtc cta ttg ccc ttt tac gat ccc 2686
Gln Glu Met Asp Thr Ser Asn Gly Val Leu Leu Pro Phe Tyr Asp Pro
85 90 95
gac tcc agc atc gtc tac ctg tgt ggc aag ggc gac agc agc att cgg 2734
Asp Ser Ser Ile Val Tyr Leu Cys Gly Lys Gly Asp Ser Ser Ile Arg
100 105 110
tac ttt gag att acc gac gag ccg cct ttc gtg cac tac ctg aac acg 2782
Tyr Phe Glu Ile Thr Asp Glu Pro Pro Phe Val His Tyr Leu Asn Thr
115 120 125 130
ttc agc agc aaa gag ccg cag cgg ggc atg ggt ttc atg ccc aaa agg 2830
Phe Ser Ser Lys Glu Pro Gln Arg Gly Met Gly Phe Met Pro Lys Arg
135 140 145
gga ctg gat gtc agc aag tgt gag atc gcc cgg ttc tac aag cta cac 2878
Gly Leu Asp Val Ser Lys Cys Glu Ile Ala Arg Phe Tyr Lys Leu His
150 155 160
gaa aga aag tgt gaa cct atc atc atg act gtg ccc cgc aag tca gac 2926
Glu Arg Lys Cys Glu Pro Ile Ile Met Thr Val Pro Arg Lys Ser Asp
165 170 175
ctc ttc cag gac gat ctg tac ccg gat acg cca ggc ccg gag ccg gcc 2974
Leu Phe Gln Asp Asp Leu Tyr Pro Asp Thr Pro Gly Pro Glu Pro Ala
180 185 190
cta gaa gcg gac gaa tgg cta tcc ggc cag gac gcc gaa ccc gtg ctc 3022
Leu Glu Ala Asp Glu Trp Leu Ser Gly Gln Asp Ala Glu Pro Val Leu
195 200 205 210
att tcg ctg agg gac ggc tat gtg ccc ccc aag cac cgc gag ctc cgg 3070
Ile Ser Leu Arg Asp Gly Tyr Val Pro Pro Lys His Arg Glu Leu Arg
215 220 225
gtc acg aag cgc aac atc ctg gac gtg cgc ccg ccc tcc ggc ccc cgc 3118
Val Thr Lys Arg Asn Ile Leu Asp Val Arg Pro Pro Ser Gly Pro Arg
230 235 240
cgc agc cag tcg gcc agc gac gcc ccc ttg tcg cag cac acc ctg gag 3166
Arg Ser Gln Ser Ala Ser Asp Ala Pro Leu Ser Gln His Thr Leu Glu
245 250 255
acg ctg ctg gaa gag atc aag gcc ctc cgc gag cgg gtg cag gcc cag 3214
Thr Leu Leu Glu Glu Ile Lys Ala Leu Arg Glu Arg Val Gln Ala Gln
260 265 270
gag cag cgc atc acg gct ctg gag aac atg ctg tgc gag ctg gtg gac 3262
Glu Gln Arg Ile Thr Ala Leu Glu Asn Met Leu Cys Glu Leu Val Asp
275 280 285 290
ggc acg gac tagccccgcg cgccaggcag gcggagcggg gcggggcgca 3311
Gly Thr Asp
caagctcggc cccgccccgg cttttagtcc cgaactccgg accccgcctt cttgggctgg 3371
gcccgggggc gggactgggg agggaactcc gcccctcgcg ggagaccaga actcttggag 3431
cttaggggag acccacgtcg ctccagcgga ggctggactg cgagcctcgt ctgggactcg 3491
gctggagctg gcctagggag gcctggggta acctgggggg ctcagcaatg gtgctgcacg 3551
gcgaggtggt gtcccccttt gtcctccgcc cagggcaggg aaagtgctta gtattagcgt 3611
gatgcttggg gttattggag cctgagcttg acctcaaacg ggtggcgatt tgatgggtac 3671
ccccaggctg gggaaaatga cagcgcttct cctaatcagc tcactggatt ccatcaccct 3731
gagcggtaaa ccagatgggc gtcaccccag ttctgcagac acatacacaa cccgtttgct 3791
gcagagccgg acccagtggc tacacccaca gcggtctgtg gtagagaact ctcttccttc 3851
tttccaccga caggggcgag ggctgcttcc tcgcggcagc ccccgcgaag aaatctcgag 3911
agaactggca tgaggagtta ggttcatcac aaatacacac acactgcccc caaccctctg 3971
ccgttgcctc tctcagaaaa acaagacgta ctgaatgaaa tattttacta agcgttcagt 4031
ctgtgcctcc tgcatgggtg ggagtgaggg gaacgagacc cccagcctct gcaaatgcta 4091
cccccaggct cctgggagac ctggcgatgc actcctgggc tcagggtcca tcaggcagcc 4151
tcttacccta gagctctctc cactctgagg ttcagaagga ccccaaccca caccgtaggc 4211
gttcccccca agtaaagtta ggtagc 4237
<210> 42
<211> 4118
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (597)..(3377)
<400> 42
aaaccccgtc tctacttaaa atacaaaaat tagccagatg tggtggcgtg tgactgtaat 60
cccagctact ggggaggcag aggcaggaaa attgcttgaa aacgggaggt ggaagttgca 120
gtgagccgag atcacactac tgcactccag cctgggcaac agagtgagac tacatctcaa 180
aaaaaaaaaa aaaaaaggcc aggcacggtg gctcacacct gtaatcccag cactttggga 240
ggccgaggca ggtagatcac ctgaggtcag gagttcaaga ccagcctggc caacatagtg 300
aaaccccatc tttattaaga atacaaagat tagccgggta tgatgccatg cgcctgtagt 360
cccagctact agggaggctg aggcaggaga atcgcttaaa cccgggaggt ggaggttgca 420
gtgagccaag atagcgccgc tgcactcccg cctgggtgaa agagtgagac tctaaaaaaa 480
aaaaaaatag gaaaccaaag cccagtccgg acacgacgct cacgcctgta atcccagcta 540
ctcgggaggc ggaggcagga aaatcgcttg aacctgggat gaggaggttg cagtga gcc 599
Ala
1
aag atc ata cca ctg cac tcc agc ctg ggc aac aga ggg aga ctc tgt 647
Lys Ile Ile Pro Leu His Ser Ser Leu Gly Asn Arg Gly Arg Leu Cys
5 10 15
ctc aaa aat aaa aaa tat aat aaa aaa caa agc cca gag agg tgc agg 695
Leu Lys Asn Lys Lys Tyr Asn Lys Lys Gln Ser Pro Glu Arg Cys Arg
20 25 30
cac tta ccc aag gtc acg cgg caa gtg act gaa agg ggt agg aca ggg 743
His Leu Pro Lys Val Thr Arg Gln Val Thr Glu Arg Gly Arg Thr Gly
35 40 45
acc caa ggg tgc tca ctc cct gag tcc gtg ctg tgg tcc ctc aag cct 791
Thr Gln Gly Cys Ser Leu Pro Glu Ser Val Leu Trp Ser Leu Lys Pro
50 55 60 65
gcc cct gga ggc agg ttg gag gcc agg att cac ggc aca gtt tct gcc 839
Ala Pro Gly Gly Arg Leu Glu Ala Arg Ile His Gly Thr Val Ser Ala
70 75 80
ccc gtt gta cca cag agt gct atg cag gtt att ggg atc ccg ccc agc 887
Pro Val Val Pro Gln Ser Ala Met Gln Val Ile Gly Ile Pro Pro Ser
85 90 95
atc cag cag ctg gtc ctg cag ctc gtg gcg ggg atc ttg cac ctg ggg 935
Ile Gln Gln Leu Val Leu Gln Leu Val Ala Gly Ile Leu His Leu Gly
100 105 110
aac atc agt ttc tgt gaa gac ggg aat tac gcc cga gtg gag agt gtg 983
Asn Ile Ser Phe Cys Glu Asp Gly Asn Tyr Ala Arg Val Glu Ser Val
115 120 125
gac ctc ctg gcc ttt ccc gcc tac ctg ctg ggc att gac agc ggg cga 1031
Asp Leu Leu Ala Phe Pro Ala Tyr Leu Leu Gly Ile Asp Ser Gly Arg
130 135 140 145
ctg cag gag aag ctg acc agc cgc aag atg gac agc cgc tgg ggc ggg 1079
Leu Gln Glu Lys Leu Thr Ser Arg Lys Met Asp Ser Arg Trp Gly Gly
150 155 160
cgc agc gag tcc atc aat gtg acc ctc aac gtg gag cag gca gcc tac 1127
Arg Ser Glu Ser Ile Asn Val Thr Leu Asn Val Glu Gln Ala Ala Tyr
165 170 175
acc cgt gat gcc ctg gcc aag ggg ctc tat gcc cgc ctc ttc gac ttc 1175
Thr Arg Asp Ala Leu Ala Lys Gly Leu Tyr Ala Arg Leu Phe Asp Phe
180 185 190
ctc gtg gag gcc atc aac cgt gct atg cag aaa ccc cag gaa gag tac 1223
Leu Val Glu Ala Ile Asn Arg Ala Met Gln Lys Pro Gln Glu Glu Tyr
195 200 205
agc atc ggt gtg ctg gac att tac ggc ttc gag atc ttc cag aaa aat 1271
Ser Ile Gly Val Leu Asp Ile Tyr Gly Phe Glu Ile Phe Gln Lys Asn
210 215 220 225
ggc ttc gag cag ttt tgc atc aac ttc gtc aat gag aag ctg cag caa 1319
Gly Phe Glu Gln Phe Cys Ile Asn Phe Val Asn Glu Lys Leu Gln Gln
230 235 240
atc ttt atc gaa ctt acc ctg aag gcc gag cag gag gag tat gtg cag 1367
Ile Phe Ile Glu Leu Thr Leu Lys Ala Glu Gln Glu Glu Tyr Val Gln
245 250 255
gca ggc atc cgc tgg act cca atc cag tac ttc aac aac aag gtc gtc 1415
Ala Gly Ile Arg Trp Thr Pro Ile Gln Tyr Phe Asn Asn Lys Val Val
260 265 270
tgt gac ctc atc gaa aac aag ctg agc ccc cca ggc atc atg agc gtc 1463
Cys Asp Leu Ile Glu Asn Lys Leu Ser Pro Pro Gly Ile Met Ser Val
275 280 285
ttg gac gac gtg tgc gcc acc atg cac gcc acg ggc ggg gga gca gac 1511
Leu Asp Asp Val Cys Ala Thr Met His Ala Thr Gly Gly Gly Ala Asp
290 295 300 305
cag aca ctg ctg cag aag ctg cag gcg gct gtg ggg acc cac gag cat 1559
Gln Thr Leu Leu Gln Lys Leu Gln Ala Ala Val Gly Thr His Glu His
310 315 320
ttc aac agc tgg agc gcc ggc ttc gtc atc cac cac tac gct ggc aag 1607
Phe Asn Ser Trp Ser Ala Gly Phe Val Ile His His Tyr Ala Gly Lys
325 330 335
gtc tcc tac gac gtc agc ggc ttc tgc gag agg aac cga gac gtt ctc 1655
Val Ser Tyr Asp Val Ser Gly Phe Cys Glu Arg Asn Arg Asp Val Leu
340 345 350
ttc tcc gac ctc ata gag ctg atg cag acc agt gag cag gcc ttc ctc 1703
Phe Ser Asp Leu Ile Glu Leu Met Gln Thr Ser Glu Gln Ala Phe Leu
355 360 365
cgg atg ctc ttc ccc gag aag ctg gat gga gac aag aag ggg cgc ccc 1751
Arg Met Leu Phe Pro Glu Lys Leu Asp Gly Asp Lys Lys Gly Arg Pro
370 375 380 385
agc acc gcc ggc tcc aag atc aag aaa caa gcc aac gac ctg gtg gcc 1799
Ser Thr Ala Gly Ser Lys Ile Lys Lys Gln Ala Asn Asp Leu Val Ala
390 395 400
aca ctg atg agg tgc aca ccc cac tac atc cgc tgc atc aaa ccc aac 1847
Thr Leu Met Arg Cys Thr Pro His Tyr Ile Arg Cys Ile Lys Pro Asn
405 410 415
gag acc aag agg ccc cga gac tgg gag gag aac aga gtc aag cac cag 1895
Glu Thr Lys Arg Pro Arg Asp Trp Glu Glu Asn Arg Val Lys His Gln
420 425 430
gtg gaa tac ctg ggc ctg aag gag aac atc agg gtg cgc aga gcc ggc 1943
Val Glu Tyr Leu Gly Leu Lys Glu Asn Ile Arg Val Arg Arg Ala Gly
435 440 445
ttc gcc tac cgc cgc cag ttc gcc aaa ttc ctg cag agg tat gcc att 1991
Phe Ala Tyr Arg Arg Gln Phe Ala Lys Phe Leu Gln Arg Tyr Ala Ile
450 455 460 465
ctg acc ccc gag acg tgg ccg cgg tgg cgt ggg gac gaa cgc cag ggc 2039
Leu Thr Pro Glu Thr Trp Pro Arg Trp Arg Gly Asp Glu Arg Gln Gly
470 475 480
gtc cag cac ctg ctt cgg gcg gtc aac atg gag ccc gac cag tac cag 2087
Val Gln His Leu Leu Arg Ala Val Asn Met Glu Pro Asp Gln Tyr Gln
485 490 495
atg ggg agc acc aag gtc ttt gtc aag aac cca gag tcg ctt ttc ctc 2135
Met Gly Ser Thr Lys Val Phe Val Lys Asn Pro Glu Ser Leu Phe Leu
500 505 510
ctg gag gag gtg cga gag cga aag ttc gat ggc ttt gcc cga acc atc 2183
Leu Glu Glu Val Arg Glu Arg Lys Phe Asp Gly Phe Ala Arg Thr Ile
515 520 525
cag aag gcc tgg cgg cgc cac gtg gct gtc cgg aag tac gag gag atg 2231
Gln Lys Ala Trp Arg Arg His Val Ala Val Arg Lys Tyr Glu Glu Met
530 535 540 545
cgg gag gaa gct tcc aac atc ctg ctg aac aag aag gag cgg agg cgc 2279
Arg Glu Glu Ala Ser Asn Ile Leu Leu Asn Lys Lys Glu Arg Arg Arg
550 555 560
aac agc atc aat cgg aac ttc gtc ggg gac tac ctg ggg ctg gag gag 2327
Asn Ser Ile Asn Arg Asn Phe Val Gly Asp Tyr Leu Gly Leu Glu Glu
565 570 575
cgg ccc gag ctg cgt cag ttc ctg ggc aag agg gag cgg gtg gac ttc 2375
Arg Pro Glu Leu Arg Gln Phe Leu Gly Lys Arg Glu Arg Val Asp Phe
580 585 590
gcc gat tcg gtc acc aag tac gac cgc cgc ttc aag ccc atc aag cgg 2423
Ala Asp Ser Val Thr Lys Tyr Asp Arg Arg Phe Lys Pro Ile Lys Arg
595 600 605
gac ttg atc ctg acg ccc aag tgt gtg tat gtg att ggg cga gag aaa 2471
Asp Leu Ile Leu Thr Pro Lys Cys Val Tyr Val Ile Gly Arg Glu Lys
610 615 620 625
gtg aag aag gga cct gag aag ggc cag gtg tgt gaa gtc ttg aag aag 2519
Val Lys Lys Gly Pro Glu Lys Gly Gln Val Cys Glu Val Leu Lys Lys
630 635 640
aaa gtg gac atc cag gct ctg cgg gga gtc tcc ctc agc acg cga cag 2567
Lys Val Asp Ile Gln Ala Leu Arg Gly Val Ser Leu Ser Thr Arg Gln
645 650 655
gac gac ttc ttc atc ctc caa gag gat gcc gcc gac agc ttc ctg gag 2615
Asp Asp Phe Phe Ile Leu Gln Glu Asp Ala Ala Asp Ser Phe Leu Glu
660 665 670
agc gtc ttc aag acc gag ttt gtc agc ctt ctg tgc aag cgc ttc gag 2663
Ser Val Phe Lys Thr Glu Phe Val Ser Leu Leu Cys Lys Arg Phe Glu
675 680 685
gag gcg acg cgg agg ccc ctg ccc ctc acc ttc agc gac aca cta cag 2711
Glu Ala Thr Arg Arg Pro Leu Pro Leu Thr Phe Ser Asp Thr Leu Gln
690 695 700 705
ttt cgg gtg aag aag gag ggc tgg ggc ggt ggc ggc acc cgc agc gtc 2759
Phe Arg Val Lys Lys Glu Gly Trp Gly Gly Gly Gly Thr Arg Ser Val
710 715 720
acc ttc tcc cgc ggc ttc ggc gac ttg gca gtg ctc aag gtt ggc ggt 2807
Thr Phe Ser Arg Gly Phe Gly Asp Leu Ala Val Leu Lys Val Gly Gly
725 730 735
cgg acc ctc acg gtc agc gtg ggc gat ggg ctg ccc aag agc tcc aag 2855
Arg Thr Leu Thr Val Ser Val Gly Asp Gly Leu Pro Lys Ser Ser Lys
740 745 750
cct acg cgg aag gga atg gcc aag gga aaa cct cgg agg tcg tcc caa 2903
Pro Thr Arg Lys Gly Met Ala Lys Gly Lys Pro Arg Arg Ser Ser Gln
755 760 765
gcc cct acc cgg gcg gcc cct gcg ccc ccc aga ggc atg gat cgc aat 2951
Ala Pro Thr Arg Ala Ala Pro Ala Pro Pro Arg Gly Met Asp Arg Asn
770 775 780 785
ggg gtg ccc ccc tct gcc aga ggg ggc ccc ctg ccc ctg gag atc atg 2999
Gly Val Pro Pro Ser Ala Arg Gly Gly Pro Leu Pro Leu Glu Ile Met
790 795 800
tct gga ggg ggc acc cac agg cct ccc cgg ggc cct ccg tcc aca tcc 3047
Ser Gly Gly Gly Thr His Arg Pro Pro Arg Gly Pro Pro Ser Thr Ser
805 810 815
ctg gga gcc agc aga cga ccc cgg gca cgt ccg ccc tca gag cac aac 3095
Leu Gly Ala Ser Arg Arg Pro Arg Ala Arg Pro Pro Ser Glu His Asn
820 825 830
aca gaa ttc ctc aac gtg cct gac cag ggc atg gcc ggc atg cag agg 3143
Thr Glu Phe Leu Asn Val Pro Asp Gln Gly Met Ala Gly Met Gln Arg
835 840 845
aag cgc agc gtg ggg caa cgg cca gtg cct ggt gtg ggc cga ccc aag 3191
Lys Arg Ser Val Gly Gln Arg Pro Val Pro Gly Val Gly Arg Pro Lys
850 855 860 865
ccc cag cct cgg aca cat ggt ccc agg tgc cgg gcc cta tac cag tac 3239
Pro Gln Pro Arg Thr His Gly Pro Arg Cys Arg Ala Leu Tyr Gln Tyr
870 875 880
gtg ggc caa gat gtg gac gag ctg agc ttc aac gtg aac gag gtc att 3287
Val Gly Gln Asp Val Asp Glu Leu Ser Phe Asn Val Asn Glu Val Ile
885 890 895
gag atc ctc atg gaa gat ccc tcg ggc tgg tgg aag ggc cgg ctt cac 3335
Glu Ile Leu Met Glu Asp Pro Ser Gly Trp Trp Lys Gly Arg Leu His
900 905 910
ggc cag gag ggc ctt ttc cca gga aac tac gtg gag aag atc 3377
Gly Gln Glu Gly Leu Phe Pro Gly Asn Tyr Val Glu Lys Ile
915 920 925
tgagctgggc cctgggatac tgccttctct ttcgcccgcc tatctgcctg ccggcctggt 3437
ggggagccag gccctgccaa tgagagcctc gtttacctgg gctgcaatag cctaaaagtc 3497
cagtcctttg gcctccagtc ctgcccaggc cctgggtcac caggtcactg ctgcagcccc 3557
cgcccctggg ccctggtctt cctccaacat cacacctgct gcccattctc catttctgtg 3617
tgtgtcaaag gggactaaca gcagaatcta cctcccaact gccatgtgat taagaaatgg 3677
gtcttgagtc ctgtgctgtt ggcaaagtgc caggcacagt tggggagggg ggggtcctta 3737
acaagcgtga ctttgctcat tctgtcatca ctaaggcaat aaacctttgc caggtgaaag 3797
cacgagttaa cttactaagt gcccaacaag gacgatgttt tcacagccct gtgaggtagg 3857
agctgtgaag gaccccatct tacaggtgga acaatggagg ttcagagagg ttcactgacc 3917
caagactgca cagagccatg ccttgcattc atatgtgacc ataaagcttg aatttgtccc 3977
agtttgggct gggcgcagtg gctcacgcct gtaatcccag cactttggga ggctgaggcg 4037
ggcagatcac gtgagaccag gagttcaaga ccagcctgga taacacagca aaaccctgtt 4097
tctactaaaa atttaaaaaa g 4118
<210> 43
<211> 4382
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (815)..(3856)
<400> 43
caactagagg agctcgcccc ctgcctgtga cagcttaggg tcgcctactt tttgtggaca 60
cactgccgtc cacggagtgc agaggccgcc tgcagttttc ttgcgtccct gtgagacgca 120
cggtggcgca attcccgagc gtgccaatcc cgggctggct ggagggcggg ctcttcaaat 180
ttgaattgcg gaagtgtttt gtgtgttaga gaacgtcgcg gaggaggtaa gcgtcgcttg 240
gcgcctggcc gccgcgggca ggatacaccg tgggcctgag gcgcgagccc ggcggcgtgc 300
ggccctctct ccgcgcggag ccgagccgga actgcggcag tctctccctg ccaggctctt 360
catccaaggt ttctgtggat cccttctgaa gttctatctg aaaattgcgc ttaagtgaat 420
tttctgttag aagaacttgg ttgctacttt cttgtcaaga tgattgcaac acctttgaaa 480
cattcaagaa tttacttacc tccagaggca tcttctcaaa ggagaaatct acccatggat 540
gcaatctttt ttgacagcat tccttcaggc acacttactc ctgtaaaaga tttggtgaaa 600
tatcagaact cctccttaaa attgaatgac cataaaaaga atcagttcct aaaaatgaca 660
acttttaaca ataaaaatat atttcaatca actatgctaa cagaggctac tacctctaac 720
agttctcttg atatcagtgc tataaagccc aacaaggatg gattaaaaaa taaagcaaac 780
tatgaatcac caggaaaaat atttctaaga atga aag aaa aag tac tgc gtg aca 835
Lys Lys Lys Tyr Cys Val Thr
1 5
agc aag aac agc cat caa gaa aca gta gtt tgt tgg aac cac aga aaa 883
Ser Lys Asn Ser His Gln Glu Thr Val Val Cys Trp Asn His Arg Lys
10 15 20
gtg gaa ata atg aaa cct tca ctc cta aca gag ttg aaa aaa aaa aaa 931
Val Glu Ile Met Lys Pro Ser Leu Leu Thr Glu Leu Lys Lys Lys Lys
25 30 35
ttg cag cat acc tac cta tgt gaa gaa aag gaa aac aac aaa tca ttc 979
Leu Gln His Thr Tyr Leu Cys Glu Glu Lys Glu Asn Asn Lys Ser Phe
40 45 50 55
cag tca gat gac agt tca cta aga gcc tca gtc caa gga gtt cct cta 1027
Gln Ser Asp Asp Ser Ser Leu Arg Ala Ser Val Gln Gly Val Pro Leu
60 65 70
gaa tca tca aat aat gat att ttc ctc ccg gtc aaa caa aag att cag 1075
Glu Ser Ser Asn Asn Asp Ile Phe Leu Pro Val Lys Gln Lys Ile Gln
75 80 85
tgc cag cag gaa aag aaa gca cca ctg cac aat tta act tac gaa ctt 1123
Cys Gln Gln Glu Lys Lys Ala Pro Leu His Asn Leu Thr Tyr Glu Leu
90 95 100
cca act ctg aac caa gaa cag gaa aat ttt ttg gct gta gaa gcc cga 1171
Pro Thr Leu Asn Gln Glu Gln Glu Asn Phe Leu Ala Val Glu Ala Arg
105 110 115
aac aag aca tta act aga gct cag ttg gct aaa caa att ttt cac tca 1219
Asn Lys Thr Leu Thr Arg Ala Gln Leu Ala Lys Gln Ile Phe His Ser
120 125 130 135
aag gag agt ata gtt gca acc act aaa tcc aaa aag gac acg ttt gtt 1267
Lys Glu Ser Ile Val Ala Thr Thr Lys Ser Lys Lys Asp Thr Phe Val
140 145 150
tta gaa agc gtt gat tct gct gat gaa caa ttt caa aat act aat gct 1315
Leu Glu Ser Val Asp Ser Ala Asp Glu Gln Phe Gln Asn Thr Asn Ala
155 160 165
gag act ctc agt act aat tgt att cct att aaa aat ggc agc ctg tta 1363
Glu Thr Leu Ser Thr Asn Cys Ile Pro Ile Lys Asn Gly Ser Leu Leu
170 175 180
atg gtt tct gat agt gag agg aca aca gaa ggg act tcg caa cag aaa 1411
Met Val Ser Asp Ser Glu Arg Thr Thr Glu Gly Thr Ser Gln Gln Lys
185 190 195
gtt aag gaa gga aat gga aaa aca gtg cct gga gag aca ggt ctt cca 1459
Val Lys Glu Gly Asn Gly Lys Thr Val Pro Gly Glu Thr Gly Leu Pro
200 205 210 215
ggt tcc atg aaa gat aca tgt aaa att gta ctt gca aca cca aga ctt 1507
Gly Ser Met Lys Asp Thr Cys Lys Ile Val Leu Ala Thr Pro Arg Leu
220 225 230
cat ata aca ata cct cgg agg tca aaa aga aat att tca aag ctt tct 1555
His Ile Thr Ile Pro Arg Arg Ser Lys Arg Asn Ile Ser Lys Leu Ser
235 240 245
cct cca aga ata ttt caa act gtt aca aat gga ctt aaa aaa aat cag 1603
Pro Pro Arg Ile Phe Gln Thr Val Thr Asn Gly Leu Lys Lys Asn Gln
250 255 260
gta gtt cag cta cag gaa tgg atg att aaa agc atc aat aat aat act 1651
Val Val Gln Leu Gln Glu Trp Met Ile Lys Ser Ile Asn Asn Asn Thr
265 270 275
gct ata tgt gta gaa gga aaa ttg ata gac gtc act aac ata tat tgg 1699
Ala Ile Cys Val Glu Gly Lys Leu Ile Asp Val Thr Asn Ile Tyr Trp
280 285 290 295
cac agt aat gta att ata gag cgg att gag cac aac aaa ctt agg act 1747
His Ser Asn Val Ile Ile Glu Arg Ile Glu His Asn Lys Leu Arg Thr
300 305 310
ata tca ggc aac gtt tat ata tta aaa ggc atg ata gac caa att tcc 1795
Ile Ser Gly Asn Val Tyr Ile Leu Lys Gly Met Ile Asp Gln Ile Ser
315 320 325
atg aaa gaa gca gga tat cca aat tat ctc ata agg aaa ttt atg ttt 1843
Met Lys Glu Ala Gly Tyr Pro Asn Tyr Leu Ile Arg Lys Phe Met Phe
330 335 340
gga ttt cca gaa aat tgg aaa gag cac att gat aat ttt ctg gaa caa 1891
Gly Phe Pro Glu Asn Trp Lys Glu His Ile Asp Asn Phe Leu Glu Gln
345 350 355
tta agg gct ggt gaa aag aac agg gaa aag acc aaa caa aaa cag aaa 1939
Leu Arg Ala Gly Glu Lys Asn Arg Glu Lys Thr Lys Gln Lys Gln Lys
360 365 370 375
act gga aga tct gtc cgt gac ata agg aaa tca atg aaa aat gat gca 1987
Thr Gly Arg Ser Val Arg Asp Ile Arg Lys Ser Met Lys Asn Asp Ala
380 385 390
caa gaa aac caa aca gat act gct caa aga gcc acc acc act tac gat 2035
Gln Glu Asn Gln Thr Asp Thr Ala Gln Arg Ala Thr Thr Thr Tyr Asp
395 400 405
ttt gat tgt gat aat ttg gaa ctg aag agt aat aag cac agt gag tca 2083
Phe Asp Cys Asp Asn Leu Glu Leu Lys Ser Asn Lys His Ser Glu Ser
410 415 420
cca gga gct aca gaa tta aac atg tgc cac agt aat tgc caa aat aaa 2131
Pro Gly Ala Thr Glu Leu Asn Met Cys His Ser Asn Cys Gln Asn Lys
425 430 435
cca aca tta agg ttc cca gat gac caa gta aat aat act att caa aat 2179
Pro Thr Leu Arg Phe Pro Asp Asp Gln Val Asn Asn Thr Ile Gln Asn
440 445 450 455
gga gga gga gat gac tta tct aat cag gaa tta att gga aaa aaa gaa 2227
Gly Gly Gly Asp Asp Leu Ser Asn Gln Glu Leu Ile Gly Lys Lys Glu
460 465 470
tat aaa atg tct tca aag aaa cta aaa att ggt gaa aga aca aat gaa 2275
Tyr Lys Met Ser Ser Lys Lys Leu Lys Ile Gly Glu Arg Thr Asn Glu
475 480 485
agg ata ata aaa agt cag aag caa gag aca act gaa gaa ttg gat gta 2323
Arg Ile Ile Lys Ser Gln Lys Gln Glu Thr Thr Glu Glu Leu Asp Val
490 495 500
tcc att gat att cta acc tca agg gaa cag ttt ttc tca gat gaa gaa 2371
Ser Ile Asp Ile Leu Thr Ser Arg Glu Gln Phe Phe Ser Asp Glu Glu
505 510 515
aga aaa tac atg gcc atc aat cag aag aaa gct tat att tta gta aca 2419
Arg Lys Tyr Met Ala Ile Asn Gln Lys Lys Ala Tyr Ile Leu Val Thr
520 525 530 535
cca ctt aaa tct aga aaa gtg ata gag caa aga tgc atg agg tat aat 2467
Pro Leu Lys Ser Arg Lys Val Ile Glu Gln Arg Cys Met Arg Tyr Asn
540 545 550
ctg tcc gct ggc acc atc aaa gca gta aca gat ttt gta ata cca gag 2515
Leu Ser Ala Gly Thr Ile Lys Ala Val Thr Asp Phe Val Ile Pro Glu
555 560 565
tgt caa aaa aaa agt ccc atc agc aag tcc atg ggg act tta gaa aat 2563
Cys Gln Lys Lys Ser Pro Ile Ser Lys Ser Met Gly Thr Leu Glu Asn
570 575 580
aca ttt gaa ggt cat aaa agt aaa aac aag gaa gat tgc gat gaa cgt 2611
Thr Phe Glu Gly His Lys Ser Lys Asn Lys Glu Asp Cys Asp Glu Arg
585 590 595
gac tta ctt act gtc aac cgg aaa ata aaa ata tct aac ctt gaa aag 2659
Asp Leu Leu Thr Val Asn Arg Lys Ile Lys Ile Ser Asn Leu Glu Lys
600 605 610 615
gaa caa atg ctc acc tct gac ttt aag aaa aat acc aga cta tta cca 2707
Glu Gln Met Leu Thr Ser Asp Phe Lys Lys Asn Thr Arg Leu Leu Pro
620 625 630
aaa ttg aag aaa ata gaa aat cag gta gct atg tca ttt tat aag cat 2755
Lys Leu Lys Lys Ile Glu Asn Gln Val Ala Met Ser Phe Tyr Lys His
635 640 645
cag tcc tca cca gat ttg tca agt gaa gaa agt gaa aca gaa aag gaa 2803
Gln Ser Ser Pro Asp Leu Ser Ser Glu Glu Ser Glu Thr Glu Lys Glu
650 655 660
att aaa agg aaa gct gaa gtt aag aaa acc aaa gca gga aac acc aaa 2851
Ile Lys Arg Lys Ala Glu Val Lys Lys Thr Lys Ala Gly Asn Thr Lys
665 670 675
gaa gca gtg gtt cac ctg aga aag agc aca aga aac aca agt aat att 2899
Glu Ala Val Val His Leu Arg Lys Ser Thr Arg Asn Thr Ser Asn Ile
680 685 690 695
cca gtg att ttg gaa cct gaa act gaa gaa agt gaa aat gaa ttt tat 2947
Pro Val Ile Leu Glu Pro Glu Thr Glu Glu Ser Glu Asn Glu Phe Tyr
700 705 710
atc aaa caa aag aaa gct aga cct tcc gtc aaa gaa act ctt cag aag 2995
Ile Lys Gln Lys Lys Ala Arg Pro Ser Val Lys Glu Thr Leu Gln Lys
715 720 725
tct ggt gtt agg aaa gag ttt cca att act gag gca gta gga tct gat 3043
Ser Gly Val Arg Lys Glu Phe Pro Ile Thr Glu Ala Val Gly Ser Asp
730 735 740
aag aca aat agg cat ccc tta gaa tgc tta cct ggt tta att cag gat 3091
Lys Thr Asn Arg His Pro Leu Glu Cys Leu Pro Gly Leu Ile Gln Asp
745 750 755
aag gaa tgg aat gag aag gag tta cag aaa ctt cat tgt gct ttt gca 3139
Lys Glu Trp Asn Glu Lys Glu Leu Gln Lys Leu His Cys Ala Phe Ala
760 765 770 775
tct ctt cca aag cac aaa cct ggt ttc tgg tca gag gta gct gcg gct 3187
Ser Leu Pro Lys His Lys Pro Gly Phe Trp Ser Glu Val Ala Ala Ala
780 785 790
gta ggt tct cga tct cct gaa gaa tgc cag agg aaa tac atg gaa aat 3235
Val Gly Ser Arg Ser Pro Glu Glu Cys Gln Arg Lys Tyr Met Glu Asn
795 800 805
ccc aga gga aaa gga tcc cag aaa cat gtc act aag aag aag cca gcc 3283
Pro Arg Gly Lys Gly Ser Gln Lys His Val Thr Lys Lys Lys Pro Ala
810 815 820
aat tcc aaa ggc caa aat ggc aag aga ggt gat gct gat cag aaa caa 3331
Asn Ser Lys Gly Gln Asn Gly Lys Arg Gly Asp Ala Asp Gln Lys Gln
825 830 835
act att aag ata act gcc aaa gtg gga act ctt aaa agg aag caa cag 3379
Thr Ile Lys Ile Thr Ala Lys Val Gly Thr Leu Lys Arg Lys Gln Gln
840 845 850 855
atg agg gaa ttt ctg gaa cag ttg cca aaa gat gac cat gat gat ttt 3427
Met Arg Glu Phe Leu Glu Gln Leu Pro Lys Asp Asp His Asp Asp Phe
860 865 870
ttc agt aca aca cct tta cag cat caa aga ata ctg ttg cca agt ttc 3475
Phe Ser Thr Thr Pro Leu Gln His Gln Arg Ile Leu Leu Pro Ser Phe
875 880 885
cag gac agt gaa gat gat gat gat att ctg cca aat atg gac aaa aat 3523
Gln Asp Ser Glu Asp Asp Asp Asp Ile Leu Pro Asn Met Asp Lys Asn
890 895 900
cca aca act cca tca tca gtt atc ttt cca ttg gta aaa act cct caa 3571
Pro Thr Thr Pro Ser Ser Val Ile Phe Pro Leu Val Lys Thr Pro Gln
905 910 915
tgt cag cat gtc agt cct ggc atg cta ggt tct ata aat agg aat gac 3619
Cys Gln His Val Ser Pro Gly Met Leu Gly Ser Ile Asn Arg Asn Asp
920 925 930 935
tgt gat aaa tat gtt ttt cgt atg caa aaa tat cat aaa agt aat ggt 3667
Cys Asp Lys Tyr Val Phe Arg Met Gln Lys Tyr His Lys Ser Asn Gly
940 945 950
ggt att gtc tgg ggc aac atc aag aaa aaa tta gtt gaa act gat ttc 3715
Gly Ile Val Trp Gly Asn Ile Lys Lys Lys Leu Val Glu Thr Asp Phe
955 960 965
tca act cca aca cca aga agg aaa acc cca ttt aac aca gac tta gga 3763
Ser Thr Pro Thr Pro Arg Arg Lys Thr Pro Phe Asn Thr Asp Leu Gly
970 975 980
gaa aac tct ggt att gga aaa ctt ttc act aat gct gtg gaa tct tta 3811
Glu Asn Ser Gly Ile Gly Lys Leu Phe Thr Asn Ala Val Glu Ser Leu
985 990 995
gat gaa gaa gag aaa gat tat tat ttt tcg aac tct gat tct gca 3856
Asp Glu Glu Glu Lys Asp Tyr Tyr Phe Ser Asn Ser Asp Ser Ala
1000 1005 1010
tagtaaaatg agaaaatatg attcctggga tttttaccat aaagcagaca gtgtttgtat 3916
tttcaactgg agtacatgta ttttctttgt aaagtagctt cctatgaaaa tgtggacttt 3976
tttgaaggtt tcatatgttt gtgttcaaag taaaatatcc tcattgctgc agcttactaa 4036
aaatgtaaag aaaattgttt ttgctcgtgt agatatctgt aaatttgttt ttgcatatta 4096
aaatatatat agataatttt ttaataagca tccaagtctg tttactttaa gaaaaccatt 4156
tcccaaacag attttttttt tatttcaaga aaattttgct accatttaag taagagaagg 4216
tgagaaggat gacagaggtt gtattggtag ctattgaatt catgaaaact tttaagttag 4276
catttgttag cagttattat ccaagccaga gtaggatttg ttaccagttg ttatccaaac 4336
ctaatgttta aattacacat tgttgaaatt aaattacaca ttgttg 4382
<210> 44
<211> 8262
<212> DNA
<213> Homo sapiens
<220>
<221> CDS
<222> (5728)..(5955)
<400> 44
gtccttttaa aaaaactcat tttggccagg cgcggtggct cacgcctgta atcccagcac 60
tttgggaggc tgagacgggc ggatcatttg aggtcaggag tttgagacca ccctggccaa 120
catggtaaaa ccctatctct actaaaaata caaaaattag cagagtgtgg tggcacgtgc 180
ctataaaccc agctactcag gaggctgagg cacgagaatc acttgaaccc gggaggtgga 240
ggttgcagtg agccgagatg gtgccactgc actccagcct gggcaacaga gtgagactct 300
gtctcaagaa aagcccccat tttaaccact ctgtctcttc ttccaggcct ttagaggcag 360
aagagaacca ttgtaaattc cagcacaagc tctttcaggc attgtttata gtttaataaa 420
tcctgccacc tctattcatc ccggtaatga aggggcgact gctcaggaga aaagccaggg 480
tgcacaaaga gatgggcatt agcatataaa aggacagagg tgggccttcc accctttgca 540
acgatgctcc ttgtacatcc acagagaatt cccttctgat cagtgcacag tgagtaaagg 600
cgccggctcc aaagatgcct gctcagagtg gctgggcgtg tctgccccag gctgtcctgt 660
tgtttcgggt tcatcatggt gggaggtggt gtcagggatc agaacaggca gagatgcctg 720
gggatggatg ccacagcctc aggaagggac catcgtcatg tatttaataa cttgaaacag 780
attaagacca cgagtgtgcg tgcacacaca cacacttgcg cttgcatacg tgcactcaca 840
gaactgtgct ttgctcagca ctttcccagc tgaatggccc tcttcaggtg cctgaaggcc 900
tctggaacac acacttgtct gtctgtcgac cacctgttca tgcacgatgc tttgagtgcc 960
tctgaacatt cggactccat gtgttctagt tctgcatgct ggacataaca gactgacaag 1020
caccatcccc tgctctggag gagccttcca ttgagagggg tcagggagta gacatacctc 1080
caccagagtc cgcgaaagcg ctcggcaggg agagctgggg agaggggctc ccctgtctgg 1140
ccacgggcag tgtctagtca cctggcttgt tttcttcttt ggcaaaccag cagcagagac 1200
cttgtgggca gaagtctcca ctctggccgg acccttgact ggggtagcca cccatccatg 1260
gcaccactgt cgaggggaga ggagccgttg cccccagggc tcttccctcc ctctctgccg 1320
gcatcattgg gtagaaagca ggcatccttg acactgctag ccatcaggag agccagtgtt 1380
accaaaggcc atcaggagag ccggtgtcac caaaggaagt catgtttctt tctctgaagt 1440
ccgtgtgtca gtcccacagg atgtgagacg tcagaagatg aagatagctc ttctggtggg 1500
gactttgatg ttacgacttg cagcaccaca gagcacctgc ttttcatata agttctttct 1560
gtggacctga aataatccaa aatcatctca tcaagtggaa tatgccatat gatatttaaa 1620
tacctgtaat ttctctaaaa ccctggggcc agaaactcta gtggttgctc tattcatgga 1680
ttcatgttgc agtaaaatgt gataattgat gaatatctgt ggaaatggat atgattgtgt 1740
ccttttaaat gagctaacaa tttcacgccg aaatctcagc agtgctattg tttagaataa 1800
aaggagacat gcaaatgagg ccagggtgca gattgtggta gaacaatgct ggcagcctct 1860
ttattggacc cattaattaa ttccccatca gccagtttcc ctgggaatga actggaaaaa 1920
taattaaaat gatacatgag gtaaggagtt tagaaattca aaaggtgaag gcgagaatta 1980
gtttaatacg taattagaca tattatttgt gactctacgt ttttttaaat gtggctcact 2040
gaggcagtag ggtcctattt ttagtcctgg atagttaata aatatcactt ggggctgttt 2100
ttcaacccac aatggctcat aaattttctt aataacatta agtggctaat tatattgtac 2160
attgttaaaa taacttctat tgcaaaatct aataatacca caaaagtttg caaaagcaag 2220
tttttagagg aattcatttg actcaatttc caagtgagat ctggcacacg tagaccaaac 2280
tgaaaaatgt atagcaaaga atctgcaggt tcacatatat ttttgttgca aacatgatcc 2340
aaacaatctc aaaacccaac ttcgttttag tgtgagtggt gcagatggaa cgtcgtacct 2400
cccagagttc agactgagtg ctggctccat ggctcttggg agtggcggaa tcttaacgga 2460
acgtgcttcc actgctgtgt cctctcggtg aggcctcggc cctagcctca gtgttgcata 2520
tgtcaggatg gacgaacgtg agtcttggca tggtgcaagt gcatctgaat tgcatgtctc 2580
tgtccttgct ggagcaaaac cctgttgttt ggatggagcg gagctgacct gacggttcca 2640
ggctgttccc tgttgtttgg ttggacgctg ttccctgttt ggatggagtg gagctgactt 2700
gacggttccc ggctgttccc tgttgtttgg ttggagcggg gctgacctga cggtcctggc 2760
tgttcccggt tgtttggttg gagcggagct gacctgacgg tcctggctgt tccctattgt 2820
ttggttggag cagagctgac ttgacggttc ccggctgttc cctgttgttt ggttggagcg 2880
gggctgacct gacggttcca ggctgttccc tgttgtttgg ttggagcgga gctgacttga 2940
cggttccagc gctgtcccct gttgtttggt tggagcagag ctgacctgac ggtcccagcg 3000
ctgtcccctg tttggttgga gcggagctga cctgacggtc ccagcgctgt cccctgttgt 3060
ttggttggag cggagctgac ctgatggttc caggctgttc cctgttgttt ggttggacgc 3120
tgtcccctgt tgtttggttg gagcggagct gacttgatgg ttcgagactg ttccctgttg 3180
tttggttgga gcagggctga cctgacggtc ctggctgttc cctgttgttt ggttggacgc 3240
tgtcccctgt tgtttggttg gagcggagct gacctgacgg ttccaggctg ttccctgttg 3300
tttggttgga cgctgtcccc tgttgtttgg ttggagcgga gctgacttga tggttccagg 3360
ctgttccctg ttgtttggtt ggagcggggc tgacctgacg gtcctggctg ttccctgttg 3420
tttggttgga gcggagctga cctgacggtt ccaggttgtt ccctgttgtt tggttggacg 3480
ctgtcccctg ttgtttggtt ggagcggagc tgacctgacg gttccaggct gttccctgtt 3540
gtttggttgg agcggagctg acctgactgt tccaggctgt tccctgttgt ttggttggac 3600
gctgtcccct gttgtttggt tggagcggag ctgacctgac ggttccaggc tgtcccctgt 3660
tgtttggctg gagcggggct gacctgacgg ttccaggctg ttccctgttg tttggctgga 3720
gcggggctga cctgacggtt ccaggctgtt ccctgttgtt tggctggagc ggggctgacc 3780
tgacggttcc aggctgttcc ctgttgtttg gctggagcgg ggctgacctg acggttccag 3840
gctgttccct gttgtttggc tggagcgggg ctgacctgac ggttccaggc tgttccctgt 3900
tgtttggctg gagcggggct gacctgacgg ttccaggctg ttccctgttg tttggttgga 3960
gcggagctga cctgacggtc ccggcgctgt tcgtgtaagt tgtttctgtt tccctgttat 4020
tttgaatata cttaaatggc ccaataattc catttctgtg aacctaagag tataaacatt 4080
tcataagatt attgaactta aagtgatagg cctggctgga aatattttac cccatacctg 4140
tgcttatttc tttttaatga gcaactgaaa agtaatttta gtaaccaacc tcccatacct 4200
ttggagaaca aataaaacca cagtaatttc cacgtattaa gagcccacat gcgcctatca 4260
ctgtcatcca cagtctgcac ctgttagctt gtttaagcct cccaggagct tgcacagagt 4320
atggccagat acccgcctta cagcctgtgc tgctgtggcc aaggaaaaga ggttttgtgc 4380
ctggggcagt cagaccttcc agtgggagtg actcttggct gctccctgcc cgtcctcagt 4440
tgctttttgg tggaagtgca ggcccctctc tccccactct gccctcccag ccacctgagc 4500
cactgagctt ctgcagcagc acaagccctt tcagtcttcc tgagtcggcc ccactctggg 4560
gtgggctccc gcttttaggc tctttgcttt cacaatcaaa gttacaacgg gaagcccttc 4620
actcggggcc ggcgggaaag gatccagccc tgctacttcc attctttcct tttaaacaag 4680
actctgtcca ttaatcatcc tctggtttga tctgtcaatc cagaaaccct tgctaatgct 4740
ttatggctgt ggtttgttgc ggttctggga aagaggaaaa gaaatgagcc agtcctaata 4800
aatgaaagtt atttcctcat ttgaatttct gagtcctgag agtcaatatg atgcagacat 4860
caatcttgtt attctcggag tggagagaca gtttctgatt gtgcctctcc ctgggttgtg 4920
atgataaaac gtgtggtgtg tttggctggg ctcagtcatg gcctctcctg agaccccctg 4980
cctttttata ttgcagataa cacacatggc ttaatgctgg aaacaggcct gtgttccctt 5040
ggtctttacg agtgttattc tagaaaactt catgtttcta aggtggtttc atttgtcatg 5100
aatactcaga tgcggttttc tctagcagcc ctccgagtgt gctcggaaag gtgaggcccc 5160
cgaggcagag gcatggccag gacgctggtg ggcttctttg ggaattagga ggaaaaagag 5220
gagtggggca gagcctggag aggctgccct cctggggtgc aggccggtgg aggatgtgga 5280
gctgcccagc tggcaggtgg cagaattggg actgaagctc agtccccact cgtccaggcc 5340
acagcttctc ctttggggcc actctgcctc attcaccatc tctccagcac caagcaattg 5400
tgtgattctt acaatgtgcc tgtcgtgctt ctggtcgcag cctccttggc tttgcagcag 5460
tgcagtgcct gggcctgcat ttggcagaag tggagttccc agtcccatcc gtggagtggc 5520
cttcccctga gagcttgcct ggtatgtttc agatgcaggg gcttctgtct ctccatgcct 5580
tatgcacttg gccacacccc tgttagcacc cctgccattt cccagggcca ctgttgcagg 5640
cacccaggaa ggtgcctgac ctggctctac atcccacacc ctgtgcaggg ggcctagcat 5700
gctggcagcc tgctaagagc tggatag gta tgt ggt gag cac atg gca ccc cga 5754
Val Cys Gly Glu His Met Ala Pro Arg
1 5
tcc gat tta ctt ggt gtc ctg tcg ggc aca ctt gtg aat acc cgc cag 5802
Ser Asp Leu Leu Gly Val Leu Ser Gly Thr Leu Val Asn Thr Arg Gln
10 15 20 25
gcc tgt gtg tgg gtg gag aca gaa cct gcc ctc tgg cca tcg agg ctg 5850
Ala Cys Val Trp Val Glu Thr Glu Pro Ala Leu Trp Pro Ser Arg Leu
30 35 40
tgc ctc ttg gcc ata cat gtc atg gtc ttg ttc atc aga cct gcc ctg 5898
Cys Leu Leu Ala Ile His Val Met Val Leu Phe Ile Arg Pro Ala Leu
45 50 55
cct gtc tgc tgg gag ggc ccg cat gga gcc tct tgc cca gct gca gac 5946
Pro Val Cys Trp Glu Gly Pro His Gly Ala Ser Cys Pro Ala Ala Asp
60 65 70
aag gga ggg tgaccggccc cacactccta tatgaaggat ggatagctct 5995
Lys Gly Gly
75
gggcagcttt ttggttctgg ggtcagattt gaatgtaaaa ctgaaaagtc ctaaatttat 6055
taaaggcttt cttttggggc tggaatatgc aatagatttt ctatccctct caattccctc 6115
ttaaaattta atactgccat aaaaatgatg gataacaaat tattccagga aaaatggtac 6175
tgaaagactc ttctaactgt tgaagcagaa gtacagactt ggcgaagaaa tagacatgac 6235
ccagaaaact cacgaggaaa gaggaaatcg tgtactttgt ggaaaccata tccacgtaat 6295
gttgttccag gctgtagaaa gcaactattg ttattccaga tactgaagca cagagtctat 6355
tttgtgccgc tggcggtttc attaacatag taacactctg cctttctaga attgggtgag 6415
cagccttccc tgtttggaac actgcagagg gagactgaac cccacaattg atatcagaag 6475
gccccgtgtg gatgcctctt cagggtgcct tccacagagc tctgccagca tgctgaggct 6535
gcggggaaaa accatgaagc ctcataaata tatacaatga ttgtgtgtac ccataatcat 6595
tagaaataaa aatttaaaaa aacacataaa accacaaggg ctctgattgc tgggtctagg 6655
acttccccac tccgacagtg atggtagcag aaacaaggcc tgacagccag accccagggc 6715
aggcagcagg gaggaaaaca gatttaaaaa tcgcaccaga cactcatttt ctacagaagt 6775
ctgcgtgtcg ttaatacatg ggcaaacagc cctttgttcc ttagtaggcc ttaaagacac 6835
caccccatgg gagctttcta aaaaagatag tattaaaaga ctatgattca ggctgggcac 6895
attagcacac acctgtaatc tcagcacttt ggaaggtcga ggagggcaga tcacttgagt 6955
ctaggagttt gagaccagcc tgggcaacat agcaagacct gatctctcta caaaaattag 7015
ctgagtatgg tggcatacac ctctggtctc agctactcgg gaggatcagt cgggccccgg 7075
aggttgaggc tgcagtgagc tatgatcgca ccactgtact ctagcctggg tgacagtgag 7135
acccagtctc aaaagcaaaa caaaacaccc cacaaagtat ataattcaga cttaacatcc 7195
tgactcttgt ttttactcca aggcaatgaa gcaatgatgt ggacgatgcg tgtcctaatc 7255
cggagcgcct cccaggctag gaatccaagc caggttatgg cttaaatgga gaccgatgga 7315
gctgctagct gagatcgccc acattggcct ctgcagactc ccttgttttc cctaatgtgt 7375
gtcaacacat ctattaaggg gaaaggactg ctcggtatca atgatttcca cttggaaatg 7435
tcaccatgac aactgaggag gtgctggaag caggcacttt gttaattctg tcttaaccca 7495
tcccaggcaa cttcaaaact ttttctctgg agagaaatag gcttatttag gaaaggcgtc 7555
tctcattgat cctcagctac atccactgaa gtgaataatt tacttctttg gcaggtttag 7615
aaaattagat gcatacaatc taatatccct tctttgtaca ttttcatgtg aaatcttttg 7675
gagtgctttt tgatagccac cccctcgggg acctggaagt ggcactgtct gaattctgtt 7735
ctgtcggttc tgtgagggac agatgctcca tgtcatcctg agagccatcc taccgccttc 7795
ctgcccagtc ctcttcactg gcacctccat tggccaacag atcaagtctg cacaccgcag 7855
cctggctgcg agggtccttt tcatccagcc tcattcatca gtgttcttgt gagcatctac 7915
tctgggttag gcccagacct cctgggttaa tcagattgaa caaaggggtg taatagagaa 7975
ccaaagggcc aggtgcggtg gctcacgcct gtaatcccac cactttggga ggccaaggca 8035
ggcagatcac gaggtcagga gattgagacc atcctggcta gcacagtgaa accctgtctc 8095
tactaaaaat acaaaaaatt agccacgcat gttggcgggc gcctgtagtc ccagctgctc 8155
ggtaggctga ggcaggagaa tggcgtgaac cagggaggcg gagcttgcag tgagacgaga 8215
tcacaccact gcagtccagc ctgggcgaca gagcgagatt ccgtctc 8262
─────────────────────────────────────────────────────
フロントページの続き
(51)Int.Cl.7 識別記号 FI テーマコート゛(参考)
A61K 48/00 A61K 37/02
(72)発明者 中島 大輔
千葉県木更津市矢那1532番3号 財団法人
かずさディー・エヌ・エー研究所内
Fターム(参考) 4B024 AA01 AA11 CA04 EA04 GA14
HA01
4B050 CC03 DD11 LL01 LL03
4C084 AA01 AA13 BA35
4C087 BC83 CA12 NA14
4H045 AA10 BA10 CA40 EA20 EA50
FA74
Claims (5)
- 【請求項1】 以下の(a)又は(b)のポリペプチド
をコードする塩基配列を含むDNA: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列と同一又は実質的に同一のアミノ酸配列から成るポリ
ペプチド、 (b)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列において、一部のアミノ酸が欠失、置換又は付加され
たアミノ酸配列から成り、(a)のポリペプチドの機能
と実質的に同質の生物学的活性を有するポリペプチド。 - 【請求項2】 以下の(a)又は(b)のDNA: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示される塩基配列に
おいて、夫々の配列で示されるアミノ酸配列をコードす
る塩基配列を含むDNA、 (b)(a)のDNAとストリンジェントな条件下でハ
イブリダイズし、(a)のポリペプチドの機能と実質的
に同質の生物学的活性を有する蛋白質をコードするDN
A。 - 【請求項3】 請求項1又は2記載のヒトDNAを含む
遺伝子。 - 【請求項4】 以下の(a)又は(b)の組換えポリペ
プチド: (a)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列と同一又は実質的に同一のアミノ酸配列から成るポリ
ペプチド、 (b)配列番号:1乃至44(但し、配列番号7、11
及び25は除く)のいずれか一つで示されるアミノ酸配
列において、一部のアミノ酸が欠失、置換又は付加され
たアミノ酸配列から成り、(a)のポリペプチドの機能
と実質的に同質の生物学的活性を有するポリペプチド。 - 【請求項5】 請求項3に記載の遺伝子にコードされる
組換え蛋白質。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2002220624A JP2003135080A (ja) | 2002-07-30 | 2002-07-30 | 新規遺伝子及びそれにコードされる蛋白質 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2002220624A JP2003135080A (ja) | 2002-07-30 | 2002-07-30 | 新規遺伝子及びそれにコードされる蛋白質 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2001168370 Division | 2001-06-04 | 2001-06-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2003135080A true JP2003135080A (ja) | 2003-05-13 |
Family
ID=19196086
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2002220624A Pending JP2003135080A (ja) | 2002-07-30 | 2002-07-30 | 新規遺伝子及びそれにコードされる蛋白質 |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP2003135080A (ja) |
-
2002
- 2002-07-30 JP JP2002220624A patent/JP2003135080A/ja active Pending
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20250041457A1 (en) | Use of adeno-associated viral vectors to correct gene defects/ express proteins in hair cells and supporting cells in the inner ear | |
AU2017267184B2 (en) | Method for assessing a prognosis and predicting the response of patients with malignant diseases to immunotherapy | |
KR101708544B1 (ko) | 세포 증식 질환을 분석하기 위한 방법 및 핵산 | |
CN107941681B (zh) | 鉴定生物样品中定量细胞组成的方法 | |
KR101421326B1 (ko) | 유방암 예후 예측을 위한 조성물 및 이를 포함하는 키트 | |
CN107223159A (zh) | 源自特定细胞类型的dna的检测及相关方法 | |
KR20180093902A (ko) | 태아와 임신 여성간에 상이하게 메틸화된 디엔에이 영역을 이용한 태아 염색체 이수성의 검출 | |
CA2941594A1 (en) | Genetic polymorphisms of the protein receptor c (procr) associated with myocardial infarction, methods of detection and uses thereof | |
KR20220024184A (ko) | 대장암의 검출 | |
KR20220025749A (ko) | 대장암의 검출 | |
WO2005014846A2 (en) | Methods for identifying risk of breast cancer and treatments thereof | |
JP2003156489A (ja) | 痛みに関連する分子の同定及び使用 | |
KR20060045950A (ko) | 혈액학적 악성종양에 대한 예후 | |
KR102046839B1 (ko) | 대장암의 시험관내 진단 또는 예후 예측 방법 | |
JP2003235573A (ja) | 糖尿病性腎症マーカーおよびその利用 | |
JP2002017376A (ja) | 分泌蛋白質、または膜蛋白質 | |
CN100516876C (zh) | 用于诊断肾细胞癌(rcc)和其他实体瘤的方法 | |
WO2006022636A1 (en) | Methods for identifying risk of type ii diabetes and treatments thereof | |
JP2002017375A (ja) | 全長cDNA合成用プライマー、およびその用途 | |
KR102763004B1 (ko) | 흑색종의 검출 방법 | |
JP2003245082A (ja) | 糸球体硬化症の疾患マーカーおよびその利用 | |
JP2003135080A (ja) | 新規遺伝子及びそれにコードされる蛋白質 | |
JP2003259875A (ja) | ヒト遺伝子の一塩基多型(4) | |
TW202313972A (zh) | 新nrg1融合物、融合接合處及檢測彼等之方法 | |
KR20150094601A (ko) | 성별에 독립적인 연령 결정 방법 |