CN102382182B - 一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 - Google Patents
一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 Download PDFInfo
- Publication number
- CN102382182B CN102382182B CN 201010270556 CN201010270556A CN102382182B CN 102382182 B CN102382182 B CN 102382182B CN 201010270556 CN201010270556 CN 201010270556 CN 201010270556 A CN201010270556 A CN 201010270556A CN 102382182 B CN102382182 B CN 102382182B
- Authority
- CN
- China
- Prior art keywords
- nek6
- ser
- protein
- plant
- pro
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 71
- 101000588540 Homo sapiens Serine/threonine-protein kinase Nek6 Proteins 0.000 title abstract description 45
- 102100031401 Serine/threonine-protein kinase Nek6 Human genes 0.000 title abstract description 44
- 102000004169 proteins and genes Human genes 0.000 claims abstract description 19
- 125000003275 alpha amino acid group Chemical group 0.000 claims abstract 3
- 230000009261 transgenic effect Effects 0.000 claims description 41
- 230000014509 gene expression Effects 0.000 claims description 22
- 239000013604 expression vector Substances 0.000 claims description 19
- 238000000034 method Methods 0.000 claims description 10
- 230000015784 hyperosmotic salinity response Effects 0.000 claims description 9
- 238000003259 recombinant expression Methods 0.000 claims description 9
- 230000024346 drought recovery Effects 0.000 claims description 7
- 241000894006 Bacteria Species 0.000 claims description 4
- 238000003780 insertion Methods 0.000 claims description 3
- 230000037431 insertion Effects 0.000 claims description 3
- 235000018102 proteins Nutrition 0.000 claims 2
- 230000008676 import Effects 0.000 claims 1
- 230000008521 reorganization Effects 0.000 claims 1
- 241000196324 Embryophyta Species 0.000 abstract description 76
- 238000002474 experimental method Methods 0.000 abstract description 7
- 125000000539 amino acid group Chemical group 0.000 abstract description 6
- 238000012217 deletion Methods 0.000 abstract description 3
- 230000037430 deletion Effects 0.000 abstract description 3
- 238000006467 substitution reaction Methods 0.000 abstract description 3
- 241000219194 Arabidopsis Species 0.000 description 53
- 230000035882 stress Effects 0.000 description 27
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 18
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 16
- 235000013339 cereals Nutrition 0.000 description 16
- 239000006870 ms-medium Substances 0.000 description 15
- 230000004083 survival effect Effects 0.000 description 14
- 238000011282 treatment Methods 0.000 description 14
- 241000219195 Arabidopsis thaliana Species 0.000 description 13
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 13
- 229930195725 Mannitol Natural products 0.000 description 13
- 235000010355 mannitol Nutrition 0.000 description 13
- 239000000594 mannitol Substances 0.000 description 13
- 230000017260 vegetative to reproductive phase transition of meristem Effects 0.000 description 13
- 101150081016 NEK6 gene Proteins 0.000 description 12
- 239000013598 vector Substances 0.000 description 12
- 108020004414 DNA Proteins 0.000 description 11
- 239000013612 plasmid Substances 0.000 description 11
- 239000012634 fragment Substances 0.000 description 9
- 239000011780 sodium chloride Substances 0.000 description 8
- 150000001413 amino acids Chemical group 0.000 description 7
- 210000004027 cell Anatomy 0.000 description 7
- 239000000047 product Substances 0.000 description 7
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 6
- 108700026244 Open Reading Frames Proteins 0.000 description 6
- 230000009466 transformation Effects 0.000 description 6
- PAJPWUMXBYXFCZ-UHFFFAOYSA-N 1-aminocyclopropanecarboxylic acid Chemical compound OC(=O)C1(N)CC1 PAJPWUMXBYXFCZ-UHFFFAOYSA-N 0.000 description 5
- 241000589158 Agrobacterium Species 0.000 description 5
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 5
- 241001460678 Napo <wasp> Species 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 239000011259 mixed solution Substances 0.000 description 5
- 239000002773 nucleotide Substances 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- 241000880493 Leptailurus serval Species 0.000 description 4
- 208000020990 adrenal cortex carcinoma Diseases 0.000 description 4
- 239000003623 enhancer Substances 0.000 description 4
- 108020004999 messenger RNA Proteins 0.000 description 4
- 230000002018 overexpression Effects 0.000 description 4
- 238000011084 recovery Methods 0.000 description 4
- 150000003839 salts Chemical class 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 3
- 241000208125 Nicotiana Species 0.000 description 3
- 235000002637 Nicotiana tabacum Nutrition 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 238000004458 analytical method Methods 0.000 description 3
- 239000005547 deoxyribonucleotide Substances 0.000 description 3
- 125000002637 deoxyribonucleotide group Chemical group 0.000 description 3
- 108091054761 ethylene receptor family Proteins 0.000 description 3
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 3
- 238000009396 hybridization Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 108091033319 polynucleotide Proteins 0.000 description 3
- 102000040430 polynucleotide Human genes 0.000 description 3
- 239000002157 polynucleotide Substances 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 108010077112 prolyl-proline Proteins 0.000 description 3
- 230000004044 response Effects 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 108010026333 seryl-proline Proteins 0.000 description 3
- 239000000126 substance Substances 0.000 description 3
- 238000013518 transcription Methods 0.000 description 3
- 230000035897 transcription Effects 0.000 description 3
- 238000013519 translation Methods 0.000 description 3
- 229910052902 vermiculite Inorganic materials 0.000 description 3
- 235000019354 vermiculite Nutrition 0.000 description 3
- 239000010455 vermiculite Substances 0.000 description 3
- UISQLSIBJKEJSS-GUBZILKMSA-N Arg-Arg-Ser Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(O)=O UISQLSIBJKEJSS-GUBZILKMSA-N 0.000 description 2
- 241000701489 Cauliflower mosaic virus Species 0.000 description 2
- 108020004705 Codon Proteins 0.000 description 2
- VGGSQFUCUMXWEO-UHFFFAOYSA-N Ethene Chemical compound C=C VGGSQFUCUMXWEO-UHFFFAOYSA-N 0.000 description 2
- 239000005977 Ethylene Substances 0.000 description 2
- 108700024394 Exon Proteins 0.000 description 2
- IESFZVCAVACGPH-PEFMBERDSA-N Glu-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CCC(O)=O IESFZVCAVACGPH-PEFMBERDSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- DBSLVQBXKVKDKJ-BJDJZHNGSA-N Leu-Ile-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O DBSLVQBXKVKDKJ-BJDJZHNGSA-N 0.000 description 2
- IRMLZWSRWSGTOP-CIUDSAMLSA-N Leu-Ser-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O IRMLZWSRWSGTOP-CIUDSAMLSA-N 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108091028043 Nucleic acid sequence Proteins 0.000 description 2
- 238000012408 PCR amplification Methods 0.000 description 2
- FRKBNXCFJBPJOL-GUBZILKMSA-N Pro-Glu-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O FRKBNXCFJBPJOL-GUBZILKMSA-N 0.000 description 2
- 108091034057 RNA (poly(A)) Proteins 0.000 description 2
- VAUMZJHYZQXZBQ-WHFBIAKZSA-N Ser-Asn-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O VAUMZJHYZQXZBQ-WHFBIAKZSA-N 0.000 description 2
- VQBCMLMPEWPUTB-ACZMJKKPSA-N Ser-Glu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O VQBCMLMPEWPUTB-ACZMJKKPSA-N 0.000 description 2
- RWDVVSKYZBNDCO-MELADBBJSA-N Ser-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)[C@H](CO)N)C(=O)O RWDVVSKYZBNDCO-MELADBBJSA-N 0.000 description 2
- XKWABWFMQXMUMT-HJGDQZAQSA-N Thr-Pro-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O XKWABWFMQXMUMT-HJGDQZAQSA-N 0.000 description 2
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 2
- 108010028230 Trp-Ser- His-Pro-Gln-Phe-Glu-Lys Proteins 0.000 description 2
- 108090000848 Ubiquitin Proteins 0.000 description 2
- 102000044159 Ubiquitin Human genes 0.000 description 2
- 241000700605 Viruses Species 0.000 description 2
- 230000036579 abiotic stress Effects 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 108010013835 arginine glutamate Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 2
- 230000001580 bacterial effect Effects 0.000 description 2
- 238000010367 cloning Methods 0.000 description 2
- 230000002596 correlated effect Effects 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 108010069495 cysteinyltyrosine Proteins 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 241001233957 eudicotyledons Species 0.000 description 2
- 108010050848 glycylleucine Proteins 0.000 description 2
- 108010081551 glycylphenylalanine Proteins 0.000 description 2
- 108010085325 histidylproline Proteins 0.000 description 2
- 108010018006 histidylserine Proteins 0.000 description 2
- 108010034529 leucyl-lysine Proteins 0.000 description 2
- 108010090333 leucyl-lysyl-proline Proteins 0.000 description 2
- 239000007788 liquid Substances 0.000 description 2
- 230000001404 mediated effect Effects 0.000 description 2
- 239000002609 medium Substances 0.000 description 2
- 108010056582 methionylglutamic acid Proteins 0.000 description 2
- 238000000746 purification Methods 0.000 description 2
- 239000000523 sample Substances 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 108010071207 serylmethionine Proteins 0.000 description 2
- 239000000243 solution Substances 0.000 description 2
- 210000001519 tissue Anatomy 0.000 description 2
- FJVAQLJNTSUQPY-CIUDSAMLSA-N Ala-Ala-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCCN FJVAQLJNTSUQPY-CIUDSAMLSA-N 0.000 description 1
- QDRGPQWIVZNJQD-CIUDSAMLSA-N Ala-Arg-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O QDRGPQWIVZNJQD-CIUDSAMLSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- LTSBJNNXPBBNDT-HGNGGELXSA-N Ala-His-Gln Chemical compound N[C@@H](C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)O LTSBJNNXPBBNDT-HGNGGELXSA-N 0.000 description 1
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 1
- OQWQTGBOFPJOIF-DLOVCJGASA-N Ala-Lys-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N OQWQTGBOFPJOIF-DLOVCJGASA-N 0.000 description 1
- NINQYGGNRIBFSC-CIUDSAMLSA-N Ala-Lys-Ser Chemical compound NCCCC[C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CO)C(O)=O NINQYGGNRIBFSC-CIUDSAMLSA-N 0.000 description 1
- AWNAEZICPNGAJK-FXQIFTODSA-N Ala-Met-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O AWNAEZICPNGAJK-FXQIFTODSA-N 0.000 description 1
- BFMIRJBURUXDRG-DLOVCJGASA-N Ala-Phe-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 BFMIRJBURUXDRG-DLOVCJGASA-N 0.000 description 1
- RMAWDDRDTRSZIR-ZLUOBGJFSA-N Ala-Ser-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RMAWDDRDTRSZIR-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- DYXOFPBJBAHWFY-JBDRJPRFSA-N Ala-Ser-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N DYXOFPBJBAHWFY-JBDRJPRFSA-N 0.000 description 1
- WQKAQKZRDIZYNV-VZFHVOOUSA-N Ala-Ser-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O WQKAQKZRDIZYNV-VZFHVOOUSA-N 0.000 description 1
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 1
- 108700031434 Arabidopsis Nek6 Proteins 0.000 description 1
- 101100516357 Arabidopsis thaliana NEK5 gene Proteins 0.000 description 1
- 101100186869 Arabidopsis thaliana NEK6 gene Proteins 0.000 description 1
- VKKYFICVTYKFIO-CIUDSAMLSA-N Arg-Ala-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N VKKYFICVTYKFIO-CIUDSAMLSA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- VNFWDYWTSHFRRG-SRVKXCTJSA-N Arg-Gln-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O VNFWDYWTSHFRRG-SRVKXCTJSA-N 0.000 description 1
- NXDXECQFKHXHAM-HJGDQZAQSA-N Arg-Glu-Thr Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NXDXECQFKHXHAM-HJGDQZAQSA-N 0.000 description 1
- UPKMBGAAEZGHOC-RWMBFGLXSA-N Arg-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O UPKMBGAAEZGHOC-RWMBFGLXSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 1
- SSZGOKWBHLOCHK-DCAQKATOSA-N Arg-Lys-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCN=C(N)N SSZGOKWBHLOCHK-DCAQKATOSA-N 0.000 description 1
- AFNHFVVOJZBIJD-GUBZILKMSA-N Arg-Met-Asp Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(O)=O)C(O)=O AFNHFVVOJZBIJD-GUBZILKMSA-N 0.000 description 1
- HNJNAMGZQZPSRE-GUBZILKMSA-N Arg-Pro-Asn Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O HNJNAMGZQZPSRE-GUBZILKMSA-N 0.000 description 1
- FRBAHXABMQXSJQ-FXQIFTODSA-N Arg-Ser-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O FRBAHXABMQXSJQ-FXQIFTODSA-N 0.000 description 1
- INOIAEUXVVNJKA-XGEHTFHBSA-N Arg-Thr-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O INOIAEUXVVNJKA-XGEHTFHBSA-N 0.000 description 1
- UVTGNSWSRSCPLP-UHFFFAOYSA-N Arg-Tyr Natural products NC(CCNC(=N)N)C(=O)NC(Cc1ccc(O)cc1)C(=O)O UVTGNSWSRSCPLP-UHFFFAOYSA-N 0.000 description 1
- LLQIAIUAKGNOSE-NHCYSSNCSA-N Arg-Val-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N LLQIAIUAKGNOSE-NHCYSSNCSA-N 0.000 description 1
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 1
- ACRYGQFHAQHDSF-ZLUOBGJFSA-N Asn-Asn-Asn Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O ACRYGQFHAQHDSF-ZLUOBGJFSA-N 0.000 description 1
- KXFCBAHYSLJCCY-ZLUOBGJFSA-N Asn-Asn-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O KXFCBAHYSLJCCY-ZLUOBGJFSA-N 0.000 description 1
- QHBMKQWOIYJYMI-BYULHYEWSA-N Asn-Asn-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QHBMKQWOIYJYMI-BYULHYEWSA-N 0.000 description 1
- WVCJSDCHTUTONA-FXQIFTODSA-N Asn-Asp-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WVCJSDCHTUTONA-FXQIFTODSA-N 0.000 description 1
- SPIPSJXLZVTXJL-ZLUOBGJFSA-N Asn-Cys-Ser Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O SPIPSJXLZVTXJL-ZLUOBGJFSA-N 0.000 description 1
- DXVMJJNAOVECBA-WHFBIAKZSA-N Asn-Gly-Asn Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(O)=O DXVMJJNAOVECBA-WHFBIAKZSA-N 0.000 description 1
- FTCGGKNCJZOPNB-WHFBIAKZSA-N Asn-Gly-Ser Chemical compound NC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O FTCGGKNCJZOPNB-WHFBIAKZSA-N 0.000 description 1
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 1
- LVHMEJJWEXBMKK-GMOBBJLQSA-N Asn-Ile-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC(=O)N)N LVHMEJJWEXBMKK-GMOBBJLQSA-N 0.000 description 1
- XLZCLJRGGMBKLR-PCBIJLKTSA-N Asn-Ile-Phe Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XLZCLJRGGMBKLR-PCBIJLKTSA-N 0.000 description 1
- NLDNNZKUSLAYFW-NHCYSSNCSA-N Asn-Lys-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(O)=O NLDNNZKUSLAYFW-NHCYSSNCSA-N 0.000 description 1
- XTMZYFMTYJNABC-ZLUOBGJFSA-N Asn-Ser-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N XTMZYFMTYJNABC-ZLUOBGJFSA-N 0.000 description 1
- OOXUBGLNDRGOKT-FXQIFTODSA-N Asn-Ser-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O OOXUBGLNDRGOKT-FXQIFTODSA-N 0.000 description 1
- REQUGIWGOGSOEZ-ZLUOBGJFSA-N Asn-Ser-Asn Chemical compound C([C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N)C(=O)N REQUGIWGOGSOEZ-ZLUOBGJFSA-N 0.000 description 1
- DOURAOODTFJRIC-CIUDSAMLSA-N Asn-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)N)N DOURAOODTFJRIC-CIUDSAMLSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- WQAOZCVOOYUWKG-LSJOCFKGSA-N Asn-Val-Val Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC(=O)N)N WQAOZCVOOYUWKG-LSJOCFKGSA-N 0.000 description 1
- NAPNAGZWHQHZLG-ZLUOBGJFSA-N Asp-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N NAPNAGZWHQHZLG-ZLUOBGJFSA-N 0.000 description 1
- ZCKYZTGLXIEOKS-CIUDSAMLSA-N Asp-Asp-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CC(=O)O)N ZCKYZTGLXIEOKS-CIUDSAMLSA-N 0.000 description 1
- SBHUBSDEZQFJHJ-CIUDSAMLSA-N Asp-Asp-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O SBHUBSDEZQFJHJ-CIUDSAMLSA-N 0.000 description 1
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 1
- RSMIHCFQDCVVBR-CIUDSAMLSA-N Asp-Gln-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CCCNC(N)=N RSMIHCFQDCVVBR-CIUDSAMLSA-N 0.000 description 1
- QCLHLXDWRKOHRR-GUBZILKMSA-N Asp-Glu-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N QCLHLXDWRKOHRR-GUBZILKMSA-N 0.000 description 1
- LTXGDRFJRZSZAV-CIUDSAMLSA-N Asp-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N LTXGDRFJRZSZAV-CIUDSAMLSA-N 0.000 description 1
- KTTCQQNRRLCIBC-GHCJXIJMSA-N Asp-Ile-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O KTTCQQNRRLCIBC-GHCJXIJMSA-N 0.000 description 1
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 1
- UZNSWMFLKVKJLI-VHWLVUOQSA-N Asp-Ile-Trp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O UZNSWMFLKVKJLI-VHWLVUOQSA-N 0.000 description 1
- UJGRZQYSNYTCAX-SRVKXCTJSA-N Asp-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UJGRZQYSNYTCAX-SRVKXCTJSA-N 0.000 description 1
- SAKCBXNPWDRWPE-BQBZGAKWSA-N Asp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N SAKCBXNPWDRWPE-BQBZGAKWSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- MVRGBQGZSDJBSM-GMOBBJLQSA-N Asp-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CC(=O)O)N MVRGBQGZSDJBSM-GMOBBJLQSA-N 0.000 description 1
- KGHLGJAXYSVNJP-WHFBIAKZSA-N Asp-Ser-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O KGHLGJAXYSVNJP-WHFBIAKZSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- MGSVBZIBCCKGCY-ZLUOBGJFSA-N Asp-Ser-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MGSVBZIBCCKGCY-ZLUOBGJFSA-N 0.000 description 1
- 241000219198 Brassica Species 0.000 description 1
- 235000003351 Brassica cretica Nutrition 0.000 description 1
- 235000003343 Brassica rupestris Nutrition 0.000 description 1
- WAJDEKCJRKGRPG-CIUDSAMLSA-N Cys-His-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N WAJDEKCJRKGRPG-CIUDSAMLSA-N 0.000 description 1
- YFAFBAPQHGULQT-HJPIBITLSA-N Cys-Ile-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CS)N YFAFBAPQHGULQT-HJPIBITLSA-N 0.000 description 1
- 108010090461 DFG peptide Proteins 0.000 description 1
- 241000620209 Escherichia coli DH5[alpha] Species 0.000 description 1
- XZWYTXMRWQJBGX-VXBMVYAYSA-N FLAG peptide Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](NC(=O)[C@@H](N)CC(O)=O)CC1=CC=C(O)C=C1 XZWYTXMRWQJBGX-VXBMVYAYSA-N 0.000 description 1
- 229930182566 Gentamicin Natural products 0.000 description 1
- CEAZRRDELHUEMR-URQXQFDESA-N Gentamicin Chemical compound O1[C@H](C(C)NC)CC[C@@H](N)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](NC)[C@@](C)(O)CO2)O)[C@H](N)C[C@@H]1N CEAZRRDELHUEMR-URQXQFDESA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- GMGKDVVBSVVKCT-NUMRIWBASA-N Gln-Asn-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GMGKDVVBSVVKCT-NUMRIWBASA-N 0.000 description 1
- LMPBBFWHCRURJD-LAEOZQHASA-N Gln-Asn-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCC(=O)N)N LMPBBFWHCRURJD-LAEOZQHASA-N 0.000 description 1
- IXFVOPOHSRKJNG-LAEOZQHASA-N Gln-Asp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O IXFVOPOHSRKJNG-LAEOZQHASA-N 0.000 description 1
- NROSLUJMIQGFKS-IUCAKERBSA-N Gln-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N NROSLUJMIQGFKS-IUCAKERBSA-N 0.000 description 1
- YRWWJCDWLVXTHN-LAEOZQHASA-N Gln-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)N)N YRWWJCDWLVXTHN-LAEOZQHASA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 1
- CGYDXNKRIMJMLV-GUBZILKMSA-N Glu-Arg-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O CGYDXNKRIMJMLV-GUBZILKMSA-N 0.000 description 1
- WPLGNDORMXTMQS-FXQIFTODSA-N Glu-Gln-Ser Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O WPLGNDORMXTMQS-FXQIFTODSA-N 0.000 description 1
- HVYWQYLBVXMXSV-GUBZILKMSA-N Glu-Leu-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O HVYWQYLBVXMXSV-GUBZILKMSA-N 0.000 description 1
- LZMQSTPFYJLVJB-GUBZILKMSA-N Glu-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCC(=O)O)N LZMQSTPFYJLVJB-GUBZILKMSA-N 0.000 description 1
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 1
- ZQYZDDXTNQXUJH-CIUDSAMLSA-N Glu-Met-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CCC(=O)O)N ZQYZDDXTNQXUJH-CIUDSAMLSA-N 0.000 description 1
- HQOGXFLBAKJUMH-CIUDSAMLSA-N Glu-Met-Ser Chemical compound CSCC[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N HQOGXFLBAKJUMH-CIUDSAMLSA-N 0.000 description 1
- KXTAGESXNQEZKB-DZKIICNBSA-N Glu-Phe-Val Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C(C)C)C(O)=O)CC1=CC=CC=C1 KXTAGESXNQEZKB-DZKIICNBSA-N 0.000 description 1
- TWYFJOHWGCCRIR-DCAQKATOSA-N Glu-Pro-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYFJOHWGCCRIR-DCAQKATOSA-N 0.000 description 1
- UDEPRBFQTWGLCW-CIUDSAMLSA-N Glu-Pro-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O UDEPRBFQTWGLCW-CIUDSAMLSA-N 0.000 description 1
- BIYNPVYAZOUVFQ-CIUDSAMLSA-N Glu-Pro-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O BIYNPVYAZOUVFQ-CIUDSAMLSA-N 0.000 description 1
- DAHLWSFUXOHMIA-FXQIFTODSA-N Glu-Ser-Gln Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(O)=O DAHLWSFUXOHMIA-FXQIFTODSA-N 0.000 description 1
- HMJULNMJWOZNFI-XHNCKOQMSA-N Glu-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CCC(=O)O)N)C(=O)O HMJULNMJWOZNFI-XHNCKOQMSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- HZISRJBYZAODRV-XQXXSGGOSA-N Glu-Thr-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O HZISRJBYZAODRV-XQXXSGGOSA-N 0.000 description 1
- JVYNYWXHZWVJEF-NUMRIWBASA-N Glu-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O JVYNYWXHZWVJEF-NUMRIWBASA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- UMZHHILWZBFPGL-LOKLDPHHSA-N Glu-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N)O UMZHHILWZBFPGL-LOKLDPHHSA-N 0.000 description 1
- CAQXJMUDOLSBPF-SUSMZKCASA-N Glu-Thr-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O CAQXJMUDOLSBPF-SUSMZKCASA-N 0.000 description 1
- MLILEEIVMRUYBX-NHCYSSNCSA-N Glu-Val-Arg Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O MLILEEIVMRUYBX-NHCYSSNCSA-N 0.000 description 1
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 1
- RLFSBAPJTYKSLG-WHFBIAKZSA-N Gly-Ala-Asp Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O RLFSBAPJTYKSLG-WHFBIAKZSA-N 0.000 description 1
- MZZSCEANQDPJER-ONGXEEELSA-N Gly-Ala-Phe Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MZZSCEANQDPJER-ONGXEEELSA-N 0.000 description 1
- UPOJUWHGMDJUQZ-IUCAKERBSA-N Gly-Arg-Arg Chemical compound NC(=N)NCCC[C@H](NC(=O)CN)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O UPOJUWHGMDJUQZ-IUCAKERBSA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 1
- IFHJOBKVXBESRE-YUMQZZPRSA-N Gly-Met-Gln Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)CN IFHJOBKVXBESRE-YUMQZZPRSA-N 0.000 description 1
- YHYDTTUSJXGTQK-UWVGGRQHSA-N Gly-Met-Leu Chemical compound CSCC[C@H](NC(=O)CN)C(=O)N[C@@H](CC(C)C)C(O)=O YHYDTTUSJXGTQK-UWVGGRQHSA-N 0.000 description 1
- HFPVRZWORNJRRC-UWVGGRQHSA-N Gly-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN HFPVRZWORNJRRC-UWVGGRQHSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- LBDXVCBAJJNJNN-WHFBIAKZSA-N Gly-Ser-Cys Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(O)=O LBDXVCBAJJNJNN-WHFBIAKZSA-N 0.000 description 1
- 244000068988 Glycine max Species 0.000 description 1
- 235000010469 Glycine max Nutrition 0.000 description 1
- DZMVESFTHXSSPZ-XVYDVKMFSA-N His-Ala-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O DZMVESFTHXSSPZ-XVYDVKMFSA-N 0.000 description 1
- NELVFWFDOKRTOR-SDDRHHMPSA-N His-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O NELVFWFDOKRTOR-SDDRHHMPSA-N 0.000 description 1
- DVHGLDYMGWTYKW-GUBZILKMSA-N His-Gln-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O DVHGLDYMGWTYKW-GUBZILKMSA-N 0.000 description 1
- IIVZNQCUUMBBKF-GVXVVHGQSA-N His-Gln-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CC1=CN=CN1 IIVZNQCUUMBBKF-GVXVVHGQSA-N 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- AKAPKBNIVNPIPO-KKUMJFAQSA-N His-His-Lys Chemical compound C([C@@H](C(=O)N[C@@H](CCCCN)C(O)=O)NC(=O)[C@@H](N)CC=1NC=NC=1)C1=CN=CN1 AKAPKBNIVNPIPO-KKUMJFAQSA-N 0.000 description 1
- FLXCRBXJRJSDHX-AVGNSLFASA-N His-Pro-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O FLXCRBXJRJSDHX-AVGNSLFASA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- WECYRWOMWSCWNX-XUXIUFHCSA-N Ile-Arg-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC(C)C)C(O)=O WECYRWOMWSCWNX-XUXIUFHCSA-N 0.000 description 1
- DCQMJRSOGCYKTR-GHCJXIJMSA-N Ile-Asp-Ser Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O DCQMJRSOGCYKTR-GHCJXIJMSA-N 0.000 description 1
- OVPYIUNCVSOVNF-KQXIARHKSA-N Ile-Gln-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N OVPYIUNCVSOVNF-KQXIARHKSA-N 0.000 description 1
- OVPYIUNCVSOVNF-ZPFDUUQYSA-N Ile-Gln-Pro Natural products CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(O)=O OVPYIUNCVSOVNF-ZPFDUUQYSA-N 0.000 description 1
- PFPUFNLHBXKPHY-HTFCKZLJSA-N Ile-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(=O)O)N PFPUFNLHBXKPHY-HTFCKZLJSA-N 0.000 description 1
- TVYWVSJGSHQWMT-AJNGGQMLSA-N Ile-Leu-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N TVYWVSJGSHQWMT-AJNGGQMLSA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- AKOYRLRUFBZOSP-BJDJZHNGSA-N Ile-Lys-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)O)N AKOYRLRUFBZOSP-BJDJZHNGSA-N 0.000 description 1
- XLXPYSDGMXTTNQ-UHFFFAOYSA-N Ile-Phe-Leu Natural products CCC(C)C(N)C(=O)NC(C(=O)NC(CC(C)C)C(O)=O)CC1=CC=CC=C1 XLXPYSDGMXTTNQ-UHFFFAOYSA-N 0.000 description 1
- JNLSTRPWUXOORL-MMWGEVLESA-N Ile-Ser-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N JNLSTRPWUXOORL-MMWGEVLESA-N 0.000 description 1
- PXKACEXYLPBMAD-JBDRJPRFSA-N Ile-Ser-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)O)N PXKACEXYLPBMAD-JBDRJPRFSA-N 0.000 description 1
- RTSQPLLOYSGMKM-DSYPUSFNSA-N Ile-Trp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)N[C@@H](CC(C)C)C(=O)O)N RTSQPLLOYSGMKM-DSYPUSFNSA-N 0.000 description 1
- 108020005350 Initiator Codon Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- FADYJNXDPBKVCA-UHFFFAOYSA-N L-Phenylalanyl-L-lysin Natural products NCCCCC(C(O)=O)NC(=O)C(N)CC1=CC=CC=C1 FADYJNXDPBKVCA-UHFFFAOYSA-N 0.000 description 1
- KWTVLKBOQATPHJ-SRVKXCTJSA-N Leu-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N KWTVLKBOQATPHJ-SRVKXCTJSA-N 0.000 description 1
- ZDSNOSQHMJBRQN-SRVKXCTJSA-N Leu-Asp-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N ZDSNOSQHMJBRQN-SRVKXCTJSA-N 0.000 description 1
- CQGSYZCULZMEDE-UHFFFAOYSA-N Leu-Gln-Pro Natural products CC(C)CC(N)C(=O)NC(CCC(N)=O)C(=O)N1CCCC1C(O)=O CQGSYZCULZMEDE-UHFFFAOYSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- QJUWBDPGGYVRHY-YUMQZZPRSA-N Leu-Gly-Cys Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N QJUWBDPGGYVRHY-YUMQZZPRSA-N 0.000 description 1
- SGIIOQQGLUUMDQ-IHRRRGAJSA-N Leu-His-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C(C)C)C(=O)O)N SGIIOQQGLUUMDQ-IHRRRGAJSA-N 0.000 description 1
- HNDWYLYAYNBWMP-AJNGGQMLSA-N Leu-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CC(C)C)N HNDWYLYAYNBWMP-AJNGGQMLSA-N 0.000 description 1
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 1
- RZXLZBIUTDQHJQ-SRVKXCTJSA-N Leu-Lys-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O RZXLZBIUTDQHJQ-SRVKXCTJSA-N 0.000 description 1
- LVTJJOJKDCVZGP-QWRGUYRKSA-N Leu-Lys-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O LVTJJOJKDCVZGP-QWRGUYRKSA-N 0.000 description 1
- REPBGZHJKYWFMJ-KKUMJFAQSA-N Leu-Lys-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N REPBGZHJKYWFMJ-KKUMJFAQSA-N 0.000 description 1
- KPYAOIVPJKPIOU-KKUMJFAQSA-N Leu-Lys-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O KPYAOIVPJKPIOU-KKUMJFAQSA-N 0.000 description 1
- RTIRBWJPYJYTLO-MELADBBJSA-N Leu-Lys-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N RTIRBWJPYJYTLO-MELADBBJSA-N 0.000 description 1
- BJWKOATWNQJPSK-SRVKXCTJSA-N Leu-Met-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BJWKOATWNQJPSK-SRVKXCTJSA-N 0.000 description 1
- DAYQSYGBCUKVKT-VOAKCMCISA-N Leu-Thr-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(O)=O DAYQSYGBCUKVKT-VOAKCMCISA-N 0.000 description 1
- 241000209510 Liliopsida Species 0.000 description 1
- 108060001084 Luciferase Proteins 0.000 description 1
- RVOMPSJXSRPFJT-DCAQKATOSA-N Lys-Ala-Arg Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O RVOMPSJXSRPFJT-DCAQKATOSA-N 0.000 description 1
- ZQCVMVCVPFYXHZ-SRVKXCTJSA-N Lys-Asn-Lys Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCCN ZQCVMVCVPFYXHZ-SRVKXCTJSA-N 0.000 description 1
- KPJJOZUXFOLGMQ-CIUDSAMLSA-N Lys-Asp-Asn Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)N)C(=O)O)N KPJJOZUXFOLGMQ-CIUDSAMLSA-N 0.000 description 1
- BYEBKXRNDLTGFW-CIUDSAMLSA-N Lys-Cys-Ser Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(O)=O BYEBKXRNDLTGFW-CIUDSAMLSA-N 0.000 description 1
- GJJQCBVRWDGLMQ-GUBZILKMSA-N Lys-Glu-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O GJJQCBVRWDGLMQ-GUBZILKMSA-N 0.000 description 1
- PBIPLDMFHAICIP-DCAQKATOSA-N Lys-Glu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O PBIPLDMFHAICIP-DCAQKATOSA-N 0.000 description 1
- PAMDBWYMLWOELY-SDDRHHMPSA-N Lys-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CCCCN)N)C(=O)O PAMDBWYMLWOELY-SDDRHHMPSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- PRCHKVGXZVTALR-KKUMJFAQSA-N Lys-His-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CCCCN)N PRCHKVGXZVTALR-KKUMJFAQSA-N 0.000 description 1
- PINHPJWGVBKQII-SRVKXCTJSA-N Lys-Leu-Cys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCCN)N PINHPJWGVBKQII-SRVKXCTJSA-N 0.000 description 1
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 1
- WVJNGSFKBKOKRV-AJNGGQMLSA-N Lys-Leu-Ile Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WVJNGSFKBKOKRV-AJNGGQMLSA-N 0.000 description 1
- WGILOYIKJVQUPT-DCAQKATOSA-N Lys-Pro-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(O)=O WGILOYIKJVQUPT-DCAQKATOSA-N 0.000 description 1
- HYSVGEAWTGPMOA-IHRRRGAJSA-N Lys-Pro-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O HYSVGEAWTGPMOA-IHRRRGAJSA-N 0.000 description 1
- SQXZLVXQXWILKW-KKUMJFAQSA-N Lys-Ser-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O SQXZLVXQXWILKW-KKUMJFAQSA-N 0.000 description 1
- CUHGAUZONORRIC-HJGDQZAQSA-N Lys-Thr-Asn Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCCN)N)O CUHGAUZONORRIC-HJGDQZAQSA-N 0.000 description 1
- VVURYEVJJTXWNE-ULQDDVLXSA-N Lys-Tyr-Val Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O VVURYEVJJTXWNE-ULQDDVLXSA-N 0.000 description 1
- MDDUIRLQCYVRDO-NHCYSSNCSA-N Lys-Val-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCCN MDDUIRLQCYVRDO-NHCYSSNCSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- WDTLNWHPIPCMMP-AVGNSLFASA-N Met-Arg-Leu Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O WDTLNWHPIPCMMP-AVGNSLFASA-N 0.000 description 1
- HLQWFLJOJRFXHO-CIUDSAMLSA-N Met-Glu-Ser Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(O)=O HLQWFLJOJRFXHO-CIUDSAMLSA-N 0.000 description 1
- AEQVPPGEJJBFEE-CYDGBPFRSA-N Met-Ile-Arg Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AEQVPPGEJJBFEE-CYDGBPFRSA-N 0.000 description 1
- HAQLBBVZAGMESV-IHRRRGAJSA-N Met-Lys-Lys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCCN)C(O)=O HAQLBBVZAGMESV-IHRRRGAJSA-N 0.000 description 1
- LLKWSEXLNFBKIF-CYDGBPFRSA-N Met-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CCSC LLKWSEXLNFBKIF-CYDGBPFRSA-N 0.000 description 1
- BJPQKNHZHUCQNQ-SRVKXCTJSA-N Met-Pro-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@@H]1CCCN1C(=O)[C@H](CCSC)N BJPQKNHZHUCQNQ-SRVKXCTJSA-N 0.000 description 1
- ZDJICAUBMUKVEJ-CIUDSAMLSA-N Met-Ser-Gln Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(N)=O ZDJICAUBMUKVEJ-CIUDSAMLSA-N 0.000 description 1
- QAVZUKIPOMBLMC-AVGNSLFASA-N Met-Val-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC(C)C QAVZUKIPOMBLMC-AVGNSLFASA-N 0.000 description 1
- 102000029749 Microtubule Human genes 0.000 description 1
- 108091022875 Microtubule Proteins 0.000 description 1
- 101710135898 Myc proto-oncogene protein Proteins 0.000 description 1
- 102100038895 Myc proto-oncogene protein Human genes 0.000 description 1
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 1
- 101150005851 NOS gene Proteins 0.000 description 1
- 206010028980 Neoplasm Diseases 0.000 description 1
- XWBJLKDCHJVKAK-KKUMJFAQSA-N Phe-Arg-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N XWBJLKDCHJVKAK-KKUMJFAQSA-N 0.000 description 1
- SCKXGHWQPPURGT-KKUMJFAQSA-N Phe-Lys-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CO)C(O)=O SCKXGHWQPPURGT-KKUMJFAQSA-N 0.000 description 1
- ZOGICTVLQDWPER-UFYCRDLUSA-N Phe-Tyr-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O ZOGICTVLQDWPER-UFYCRDLUSA-N 0.000 description 1
- JSGWNFKWZNPDAV-YDHLFZDLSA-N Phe-Val-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JSGWNFKWZNPDAV-YDHLFZDLSA-N 0.000 description 1
- 108091000080 Phosphotransferase Proteins 0.000 description 1
- 108700001094 Plant Genes Proteins 0.000 description 1
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 1
- CQZNGNCAIXMAIQ-UBHSHLNASA-N Pro-Ala-Phe Chemical compound C[C@H](NC(=O)[C@@H]1CCCN1)C(=O)N[C@@H](Cc1ccccc1)C(O)=O CQZNGNCAIXMAIQ-UBHSHLNASA-N 0.000 description 1
- CGBYDGAJHSOGFQ-LPEHRKFASA-N Pro-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 CGBYDGAJHSOGFQ-LPEHRKFASA-N 0.000 description 1
- SWXSLPHTJVAWDF-VEVYYDQMSA-N Pro-Asn-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SWXSLPHTJVAWDF-VEVYYDQMSA-N 0.000 description 1
- ZCXQTRXYZOSGJR-FXQIFTODSA-N Pro-Asp-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZCXQTRXYZOSGJR-FXQIFTODSA-N 0.000 description 1
- XUSDDSLCRPUKLP-QXEWZRGKSA-N Pro-Asp-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 XUSDDSLCRPUKLP-QXEWZRGKSA-N 0.000 description 1
- QGOZJLYCGRYYRW-KKUMJFAQSA-N Pro-Glu-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O QGOZJLYCGRYYRW-KKUMJFAQSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- JUJCUYWRJMFJJF-AVGNSLFASA-N Pro-Lys-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H]1CCCN1 JUJCUYWRJMFJJF-AVGNSLFASA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- BARPGRUZBKFJMA-SRVKXCTJSA-N Pro-Met-Arg Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@@H]1CCCN1 BARPGRUZBKFJMA-SRVKXCTJSA-N 0.000 description 1
- AJBQTGZIZQXBLT-STQMWFEESA-N Pro-Phe-Gly Chemical compound C([C@@H](C(=O)NCC(=O)O)NC(=O)[C@H]1NCCC1)C1=CC=CC=C1 AJBQTGZIZQXBLT-STQMWFEESA-N 0.000 description 1
- HWLKHNDRXWTFTN-GUBZILKMSA-N Pro-Pro-Cys Chemical compound C1C[C@H](NC1)C(=O)N2CCC[C@H]2C(=O)N[C@@H](CS)C(=O)O HWLKHNDRXWTFTN-GUBZILKMSA-N 0.000 description 1
- PRKWBYCXBBSLSK-GUBZILKMSA-N Pro-Ser-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O PRKWBYCXBBSLSK-GUBZILKMSA-N 0.000 description 1
- AIOWVDNPESPXRB-YTWAJWBKSA-N Pro-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2)O AIOWVDNPESPXRB-YTWAJWBKSA-N 0.000 description 1
- CXGLFEOYCJFKPR-RCWTZXSCSA-N Pro-Thr-Val Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O CXGLFEOYCJFKPR-RCWTZXSCSA-N 0.000 description 1
- FZXSYIPVAFVYBH-KKUMJFAQSA-N Pro-Tyr-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O FZXSYIPVAFVYBH-KKUMJFAQSA-N 0.000 description 1
- LZHHZYDPMZEMRX-STQMWFEESA-N Pro-Tyr-Gly Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(O)=O LZHHZYDPMZEMRX-STQMWFEESA-N 0.000 description 1
- BXHRXLMCYSZSIY-STECZYCISA-N Pro-Tyr-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](Cc1ccc(O)cc1)NC(=O)[C@@H]1CCCN1)C(O)=O BXHRXLMCYSZSIY-STECZYCISA-N 0.000 description 1
- YHUBAXGAAYULJY-ULQDDVLXSA-N Pro-Tyr-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O YHUBAXGAAYULJY-ULQDDVLXSA-N 0.000 description 1
- IIRBTQHFVNGPMQ-AVGNSLFASA-N Pro-Val-Lys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@@H]1CCCN1 IIRBTQHFVNGPMQ-AVGNSLFASA-N 0.000 description 1
- XRGIDCGRSSWCKE-SRVKXCTJSA-N Pro-Val-Met Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O XRGIDCGRSSWCKE-SRVKXCTJSA-N 0.000 description 1
- 102000001253 Protein Kinase Human genes 0.000 description 1
- 238000012300 Sequence Analysis Methods 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- JPIDMRXXNMIVKY-VZFHVOOUSA-N Ser-Ala-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPIDMRXXNMIVKY-VZFHVOOUSA-N 0.000 description 1
- NRCJWSGXMAPYQX-LPEHRKFASA-N Ser-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CO)N)C(=O)O NRCJWSGXMAPYQX-LPEHRKFASA-N 0.000 description 1
- OYEDZGNMSBZCIM-XGEHTFHBSA-N Ser-Arg-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OYEDZGNMSBZCIM-XGEHTFHBSA-N 0.000 description 1
- HBOABDXGTMMDSE-GUBZILKMSA-N Ser-Arg-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O HBOABDXGTMMDSE-GUBZILKMSA-N 0.000 description 1
- OBXVZEAMXFSGPU-FXQIFTODSA-N Ser-Asn-Arg Chemical compound C(C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CO)N)CN=C(N)N OBXVZEAMXFSGPU-FXQIFTODSA-N 0.000 description 1
- COAHUSQNSVFYBW-FXQIFTODSA-N Ser-Asn-Met Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(O)=O COAHUSQNSVFYBW-FXQIFTODSA-N 0.000 description 1
- RDFQNDHEHVSONI-ZLUOBGJFSA-N Ser-Asn-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O RDFQNDHEHVSONI-ZLUOBGJFSA-N 0.000 description 1
- ICHZYBVODUVUKN-SRVKXCTJSA-N Ser-Asn-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O ICHZYBVODUVUKN-SRVKXCTJSA-N 0.000 description 1
- OHKLFYXEOGGGCK-ZLUOBGJFSA-N Ser-Asp-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O OHKLFYXEOGGGCK-ZLUOBGJFSA-N 0.000 description 1
- VAIZFHMTBFYJIA-ACZMJKKPSA-N Ser-Asp-Gln Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCC(N)=O VAIZFHMTBFYJIA-ACZMJKKPSA-N 0.000 description 1
- BYIROAKULFFTEK-CIUDSAMLSA-N Ser-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CO BYIROAKULFFTEK-CIUDSAMLSA-N 0.000 description 1
- MOVJSUIKUNCVMG-ZLUOBGJFSA-N Ser-Cys-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O)N)O MOVJSUIKUNCVMG-ZLUOBGJFSA-N 0.000 description 1
- DGHFNYXVIXNNMC-GUBZILKMSA-N Ser-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CO)N DGHFNYXVIXNNMC-GUBZILKMSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- GZBKRJVCRMZAST-XKBZYTNZSA-N Ser-Glu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZBKRJVCRMZAST-XKBZYTNZSA-N 0.000 description 1
- XXXAXOWMBOKTRN-XPUUQOCRSA-N Ser-Gly-Val Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXXAXOWMBOKTRN-XPUUQOCRSA-N 0.000 description 1
- ZFVFHHZBCVNLGD-GUBZILKMSA-N Ser-His-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFVFHHZBCVNLGD-GUBZILKMSA-N 0.000 description 1
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 1
- UBRMZSHOOIVJPW-SRVKXCTJSA-N Ser-Leu-Lys Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(O)=O UBRMZSHOOIVJPW-SRVKXCTJSA-N 0.000 description 1
- YUJLIIRMIAGMCQ-CIUDSAMLSA-N Ser-Leu-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YUJLIIRMIAGMCQ-CIUDSAMLSA-N 0.000 description 1
- LRWBCWGEUCKDTN-BJDJZHNGSA-N Ser-Lys-Ile Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LRWBCWGEUCKDTN-BJDJZHNGSA-N 0.000 description 1
- XUDRHBPSPAPDJP-SRVKXCTJSA-N Ser-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CO XUDRHBPSPAPDJP-SRVKXCTJSA-N 0.000 description 1
- UGGWCAFQPKANMW-FXQIFTODSA-N Ser-Met-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O UGGWCAFQPKANMW-FXQIFTODSA-N 0.000 description 1
- ADJDNJCSPNFFPI-FXQIFTODSA-N Ser-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO ADJDNJCSPNFFPI-FXQIFTODSA-N 0.000 description 1
- NMZXJDSKEGFDLJ-DCAQKATOSA-N Ser-Pro-Lys Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CO)N)C(=O)N[C@@H](CCCCN)C(=O)O NMZXJDSKEGFDLJ-DCAQKATOSA-N 0.000 description 1
- KQNDIKOYWZTZIX-FXQIFTODSA-N Ser-Ser-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCNC(N)=N KQNDIKOYWZTZIX-FXQIFTODSA-N 0.000 description 1
- SRSPTFBENMJHMR-WHFBIAKZSA-N Ser-Ser-Gly Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O SRSPTFBENMJHMR-WHFBIAKZSA-N 0.000 description 1
- ILZAUMFXKSIUEF-SRVKXCTJSA-N Ser-Ser-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 ILZAUMFXKSIUEF-SRVKXCTJSA-N 0.000 description 1
- XQJCEKXQUJQNNK-ZLUOBGJFSA-N Ser-Ser-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O XQJCEKXQUJQNNK-ZLUOBGJFSA-N 0.000 description 1
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 1
- VGQVAVQWKJLIRM-FXQIFTODSA-N Ser-Ser-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O VGQVAVQWKJLIRM-FXQIFTODSA-N 0.000 description 1
- ZSDXEKUKQAKZFE-XAVMHZPKSA-N Ser-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N)O ZSDXEKUKQAKZFE-XAVMHZPKSA-N 0.000 description 1
- SNXUIBACCONSOH-BWBBJGPYSA-N Ser-Thr-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@@H](CO)C(O)=O SNXUIBACCONSOH-BWBBJGPYSA-N 0.000 description 1
- QYBRQMLZDDJBSW-AVGNSLFASA-N Ser-Tyr-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O QYBRQMLZDDJBSW-AVGNSLFASA-N 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- JVTHIXKSVYEWNI-JRQIVUDYSA-N Thr-Asn-Tyr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JVTHIXKSVYEWNI-JRQIVUDYSA-N 0.000 description 1
- LAFLAXHTDVNVEL-WDCWCFNPSA-N Thr-Gln-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCCCN)C(=O)O)N)O LAFLAXHTDVNVEL-WDCWCFNPSA-N 0.000 description 1
- FHDLKMFZKRUQCE-HJGDQZAQSA-N Thr-Glu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FHDLKMFZKRUQCE-HJGDQZAQSA-N 0.000 description 1
- WDFPMSHYMRBLKM-NKIYYHGXSA-N Thr-Glu-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O WDFPMSHYMRBLKM-NKIYYHGXSA-N 0.000 description 1
- JMGJDTNUMAZNLX-RWRJDSDZSA-N Thr-Glu-Ile Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JMGJDTNUMAZNLX-RWRJDSDZSA-N 0.000 description 1
- XOTBWOCSLMBGMF-SUSMZKCASA-N Thr-Glu-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XOTBWOCSLMBGMF-SUSMZKCASA-N 0.000 description 1
- WPAKPLPGQNUXGN-OSUNSFLBSA-N Thr-Ile-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WPAKPLPGQNUXGN-OSUNSFLBSA-N 0.000 description 1
- MECLEFZMPPOEAC-VOAKCMCISA-N Thr-Leu-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O MECLEFZMPPOEAC-VOAKCMCISA-N 0.000 description 1
- NCXVJIQMWSGRHY-KXNHARMFSA-N Thr-Leu-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N)O NCXVJIQMWSGRHY-KXNHARMFSA-N 0.000 description 1
- NYQIZWROIMIQSL-VEVYYDQMSA-N Thr-Pro-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O NYQIZWROIMIQSL-VEVYYDQMSA-N 0.000 description 1
- MXDOAJQRJBMGMO-FJXKBIBVSA-N Thr-Pro-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O MXDOAJQRJBMGMO-FJXKBIBVSA-N 0.000 description 1
- MROIJTGJGIDEEJ-RCWTZXSCSA-N Thr-Pro-Pro Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 MROIJTGJGIDEEJ-RCWTZXSCSA-N 0.000 description 1
- KERCOYANYUPLHJ-XGEHTFHBSA-N Thr-Pro-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O KERCOYANYUPLHJ-XGEHTFHBSA-N 0.000 description 1
- WPSKTVVMQCXPRO-BWBBJGPYSA-N Thr-Ser-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O WPSKTVVMQCXPRO-BWBBJGPYSA-N 0.000 description 1
- ZESGVALRVJIVLZ-VFCFLDTKSA-N Thr-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N)O ZESGVALRVJIVLZ-VFCFLDTKSA-N 0.000 description 1
- PWONLXBUSVIZPH-RHYQMDGZSA-N Thr-Val-Lys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N)O PWONLXBUSVIZPH-RHYQMDGZSA-N 0.000 description 1
- 108091036066 Three prime untranslated region Proteins 0.000 description 1
- 108091023040 Transcription factor Proteins 0.000 description 1
- 102000040945 Transcription factor Human genes 0.000 description 1
- 101710150448 Transcriptional regulator Myc Proteins 0.000 description 1
- UEFHVUQBYNRNQC-SFJXLCSZSA-N Trp-Phe-Thr Chemical compound C([C@@H](C(=O)N[C@@H]([C@H](O)C)C(O)=O)NC(=O)[C@@H](N)CC=1C2=CC=CC=C2NC=1)C1=CC=CC=C1 UEFHVUQBYNRNQC-SFJXLCSZSA-N 0.000 description 1
- LNGFWVPNKLWATF-ZVZYQTTQSA-N Trp-Val-Glu Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LNGFWVPNKLWATF-ZVZYQTTQSA-N 0.000 description 1
- CRWOSTCODDFEKZ-HRCADAONSA-N Tyr-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O CRWOSTCODDFEKZ-HRCADAONSA-N 0.000 description 1
- MOCXXGZHHSPNEJ-AVGNSLFASA-N Tyr-Cys-Glu Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O MOCXXGZHHSPNEJ-AVGNSLFASA-N 0.000 description 1
- KIJLSRYAUGGZIN-CFMVVWHZSA-N Tyr-Ile-Asp Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KIJLSRYAUGGZIN-CFMVVWHZSA-N 0.000 description 1
- YKCXQOBTISTQJD-BZSNNMDCSA-N Tyr-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N YKCXQOBTISTQJD-BZSNNMDCSA-N 0.000 description 1
- QMNWABHLJOHGDS-IHRRRGAJSA-N Tyr-Met-Cys Chemical compound SC[C@@H](C(O)=O)NC(=O)[C@H](CCSC)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 QMNWABHLJOHGDS-IHRRRGAJSA-N 0.000 description 1
- SYFHQHYTNCQCCN-MELADBBJSA-N Tyr-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC2=CC=C(C=C2)O)N)C(=O)O SYFHQHYTNCQCCN-MELADBBJSA-N 0.000 description 1
- MDXLPNRXCFOBTL-BZSNNMDCSA-N Tyr-Ser-Tyr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MDXLPNRXCFOBTL-BZSNNMDCSA-N 0.000 description 1
- DJIJBQYBDKGDIS-JYJNAYRXSA-N Tyr-Val-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@@H](N)Cc1ccc(O)cc1)C(C)C)C(O)=O DJIJBQYBDKGDIS-JYJNAYRXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- HZYOWMGWKKRMBZ-BYULHYEWSA-N Val-Asp-Asp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(=O)O)C(=O)O)N HZYOWMGWKKRMBZ-BYULHYEWSA-N 0.000 description 1
- FBVUOEYVGNMRMD-NAKRPEOUSA-N Val-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](C(C)C)N FBVUOEYVGNMRMD-NAKRPEOUSA-N 0.000 description 1
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 1
- VLDMQVZZWDOKQF-AUTRQRHGSA-N Val-Glu-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N VLDMQVZZWDOKQF-AUTRQRHGSA-N 0.000 description 1
- FOADDSDHGRFUOC-DZKIICNBSA-N Val-Glu-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FOADDSDHGRFUOC-DZKIICNBSA-N 0.000 description 1
- XXROXFHCMVXETG-UWVGGRQHSA-N Val-Gly-Val Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O XXROXFHCMVXETG-UWVGGRQHSA-N 0.000 description 1
- HQYVQDRYODWONX-DCAQKATOSA-N Val-His-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](CO)C(=O)O)N HQYVQDRYODWONX-DCAQKATOSA-N 0.000 description 1
- KDKLLPMFFGYQJD-CYDGBPFRSA-N Val-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N KDKLLPMFFGYQJD-CYDGBPFRSA-N 0.000 description 1
- KNYHAWKHFQRYOX-PYJNHQTQSA-N Val-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N KNYHAWKHFQRYOX-PYJNHQTQSA-N 0.000 description 1
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 1
- GVJUTBOZZBTBIG-AVGNSLFASA-N Val-Lys-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N GVJUTBOZZBTBIG-AVGNSLFASA-N 0.000 description 1
- SJRUJQFQVLMZFW-WPRPVWTQSA-N Val-Pro-Gly Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O SJRUJQFQVLMZFW-WPRPVWTQSA-N 0.000 description 1
- RYHUIHUOYRNNIE-NRPADANISA-N Val-Ser-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N RYHUIHUOYRNNIE-NRPADANISA-N 0.000 description 1
- GBIUHAYJGWVNLN-AEJSXWLSSA-N Val-Ser-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N GBIUHAYJGWVNLN-AEJSXWLSSA-N 0.000 description 1
- GBIUHAYJGWVNLN-UHFFFAOYSA-N Val-Ser-Pro Natural products CC(C)C(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O GBIUHAYJGWVNLN-UHFFFAOYSA-N 0.000 description 1
- UJMCYJKPDFQLHX-XGEHTFHBSA-N Val-Ser-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](C(C)C)N)O UJMCYJKPDFQLHX-XGEHTFHBSA-N 0.000 description 1
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 1
- PGBMPFKFKXYROZ-UFYCRDLUSA-N Val-Tyr-Phe Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)O)N PGBMPFKFKXYROZ-UFYCRDLUSA-N 0.000 description 1
- AEFJNECXZCODJM-UWVGGRQHSA-N Val-Val-Gly Chemical compound CC(C)[C@H]([NH3+])C(=O)N[C@@H](C(C)C)C(=O)NCC([O-])=O AEFJNECXZCODJM-UWVGGRQHSA-N 0.000 description 1
- 241000082085 Verticillium <Phyllachorales> Species 0.000 description 1
- 241000607479 Yersinia pestis Species 0.000 description 1
- 240000008042 Zea mays Species 0.000 description 1
- 235000016383 Zea mays subsp huehuetenangensis Nutrition 0.000 description 1
- 235000002017 Zea mays subsp mays Nutrition 0.000 description 1
- 239000002253 acid Substances 0.000 description 1
- 239000011543 agarose gel Substances 0.000 description 1
- 238000000246 agarose gel electrophoresis Methods 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010005233 alanylglutamic acid Proteins 0.000 description 1
- 230000000692 anti-sense effect Effects 0.000 description 1
- 108010089442 arginyl-leucyl-alanyl-arginine Proteins 0.000 description 1
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010047857 aspartylglycine Proteins 0.000 description 1
- 230000003115 biocidal effect Effects 0.000 description 1
- 239000003181 biological factor Substances 0.000 description 1
- 238000010170 biological method Methods 0.000 description 1
- 230000004790 biotic stress Effects 0.000 description 1
- QKSKPIVNLNLAAV-UHFFFAOYSA-N bis(2-chloroethyl) sulfide Chemical compound ClCCSCCCl QKSKPIVNLNLAAV-UHFFFAOYSA-N 0.000 description 1
- 238000009395 breeding Methods 0.000 description 1
- 210000004899 c-terminal region Anatomy 0.000 description 1
- FPPNZSSZRUTDAP-UWFZAAFLSA-N carbenicillin Chemical compound N([C@H]1[C@H]2SC([C@@H](N2C1=O)C(O)=O)(C)C)C(=O)C(C(O)=O)C1=CC=CC=C1 FPPNZSSZRUTDAP-UWFZAAFLSA-N 0.000 description 1
- 229960003669 carbenicillin Drugs 0.000 description 1
- 238000010523 cascade reaction Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000003153 chemical reaction reagent Substances 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000000875 corresponding effect Effects 0.000 description 1
- 238000012272 crop production Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 239000005549 deoxyribonucleoside Substances 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000018109 developmental process Effects 0.000 description 1
- 201000010099 disease Diseases 0.000 description 1
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 210000001339 epidermal cell Anatomy 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000009313 farming Methods 0.000 description 1
- 230000008014 freezing Effects 0.000 description 1
- 238000007710 freezing Methods 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 102000034356 gene-regulatory proteins Human genes 0.000 description 1
- 108091006104 gene-regulatory proteins Proteins 0.000 description 1
- 238000012214 genetic breeding Methods 0.000 description 1
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 1
- 108010075431 glycyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010089804 glycyl-threonine Proteins 0.000 description 1
- 108010015792 glycyllysine Proteins 0.000 description 1
- 108010037850 glycylvaline Proteins 0.000 description 1
- 101150054900 gus gene Proteins 0.000 description 1
- 238000003306 harvesting Methods 0.000 description 1
- 230000002363 herbicidal effect Effects 0.000 description 1
- 239000004009 herbicide Substances 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 229930027917 kanamycin Natural products 0.000 description 1
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 1
- 229960000318 kanamycin Drugs 0.000 description 1
- 229930182823 kanamycin A Natural products 0.000 description 1
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 1
- 108010057821 leucylproline Proteins 0.000 description 1
- 108010003700 lysyl aspartic acid Proteins 0.000 description 1
- 108010009298 lysylglutamic acid Proteins 0.000 description 1
- 108010038320 lysylphenylalanine Proteins 0.000 description 1
- 108010017391 lysylvaline Proteins 0.000 description 1
- 238000010841 mRNA extraction Methods 0.000 description 1
- 235000009973 maize Nutrition 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000012528 membrane Substances 0.000 description 1
- 108010085203 methionylmethionine Proteins 0.000 description 1
- 238000000520 microinjection Methods 0.000 description 1
- 210000004688 microtubule Anatomy 0.000 description 1
- 235000010460 mustard Nutrition 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 230000001717 pathogenic effect Effects 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 230000026731 phosphorylation Effects 0.000 description 1
- 238000006366 phosphorylation reaction Methods 0.000 description 1
- 102000020233 phosphotransferase Human genes 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 230000008488 polyadenylation Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 108010014614 prolyl-glycyl-proline Proteins 0.000 description 1
- 108010031719 prolyl-serine Proteins 0.000 description 1
- 108010079317 prolyl-tyrosine Proteins 0.000 description 1
- 108010004914 prolylarginine Proteins 0.000 description 1
- 108010070643 prolylglutamic acid Proteins 0.000 description 1
- 108060006633 protein kinase Proteins 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 230000001105 regulatory effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000021892 response to abiotic stimulus Effects 0.000 description 1
- 238000010839 reverse transcription Methods 0.000 description 1
- 238000012216 screening Methods 0.000 description 1
- 108010048818 seryl-histidine Proteins 0.000 description 1
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 1
- 230000008054 signal transmission Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 108010071097 threonyl-lysyl-proline Proteins 0.000 description 1
- 230000005026 transcription initiation Effects 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000014621 translational initiation Effects 0.000 description 1
- 238000011144 upstream manufacturing Methods 0.000 description 1
- IBIDRSSEHFLGSD-UHFFFAOYSA-N valinyl-arginine Natural products CC(C)C(N)C(=O)NC(C(O)=O)CCCN=C(N)N IBIDRSSEHFLGSD-UHFFFAOYSA-N 0.000 description 1
- 238000005406 washing Methods 0.000 description 1
Images
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
Abstract
本发明公开了一种与植物耐逆性相关的蛋白NEK6及其编码基因与应用。本发明提供的蛋白质,是如下1)或2)的蛋白质:1)由序列表中序列2所示的氨基酸序列组成的蛋白质;2)将序列2的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且与植物耐逆性和/或籽粒产量相关的由1)衍生的蛋白质。本发明的实验证明,所述蛋白及其编码基因对培育耐逆且籽粒高产植物品种具有重要意义。
Description
技术领域
本发明涉及一种与植物耐逆性相关的蛋白NEK6及其编码基因与应用。
背景技术
环境中物理、化学因素的变化,例如干旱、盐碱、冷害、冻害、水涝等胁迫因素以及生物因素,例如病虫害对植物的生长发育具有重要的影响,严重时会造成农作物大规模减产,培育耐逆性作物是种植业的主要目标之一。提高作物的耐逆性,除了利用传统的育种方法,目前,分子遗传育种已经成为科技工作者所关注的领域之一。在非生物和生物逆境的胁迫下,高等植物细胞有多种途径感受和应答外界环境中物化参数的变化,将胞外的信号变为胞内信号,经过一系列磷酸化级联反应将信号传递到细胞核,经转录因子调控相关的功能基因,启动逆境应答基因的表达,提高植物的耐逆性。专利号为ZL99119096.3的专利和以下参考文献中均已证明烟草的乙烯受体NTHK1参与了植物对逆境的应答反应,与植物耐逆性相关。NTHK1在植物对逆境应答中处于上游(Wanhong Cao,Jinsong Zhang & Shouyi Chen,et al.,Expression of tobaccoethylene receptor NTHK1 alters plant responses to salt stress,Plant,Cell andEnvironment(2006)29,1210-1219。)。
在过量表达的NTHK1的拟南芥中,发现NTHK1调控一系列基因,阐明NTHK1调控蛋白与耐逆性的关系,对于进一步了解NTHK1所参与的植物应答非生物逆境的信号传递,并将其应用于植物耐逆性的改良,既具有重要的理论意义也具有重大的实用价值。
发明内容
本发明的一个目的是提供一种与植物耐逆性相关的NEK6蛋白及其编码基因。
本发明所提供的蛋白质,名称为NEK6,来源于拟南芥(Arabidopsis thaliana),是如下1)或2)的蛋白质:
1)由序列表中序列2所示的氨基酸序列组成的蛋白质;
2)将序列表中序列2的氨基酸序列经过一个或几个氨基酸残基的取代和/或缺失和/或添加且与植物耐逆性和/或籽粒产量相关的由1)衍生的蛋白质。
上述序列表中序列2由956个氨基酸残基组成。所述一个或几个氨基酸残基的取代和/或缺失和/或添加为不超过10个氨基酸残基的取代和/或缺失和/或添加。
为了使1)中的NEK6便于纯化,可在由序列表中序列2所示的氨基酸序列组成的蛋白质的氨基末端或羧基末端连接上如表1所示的标签。
表1.标签的序列
标签 | 残基 | 序列 |
Poly-Arg | 5-6(通常为5个) | RRRRR |
Poly-His | 2-10(通常为6个) | HHHHHH |
FLAG | 8 | DYKDDDDK |
Strep-tag II | 8 | WSHPQFEK |
c-myc | 10 | EQKLISEEDL |
上述2)中的NEK6可人工合成,也可先合成其编码基因,再进行生物表达得到。上述2)中的NEK6的编码基因可通过将序列表中序列1的自5′末端第1-2871位碱基所示的DNA序列中缺失一个或几个氨基酸残基的密码子,和/或进行一个或几个碱基对的错义突变,和/或在其5′端和/或3′端连上表1所示的标签的编码序列得到。
上述与植物耐逆性相关的蛋白NEK6的编码基因也属于本发明的保护范围,该基因命名为NEK6。
本发明所提供的NEK6蛋白质的编码基因具体可为如下1)或2)或3)中任一所述的基因:
1)序列表中序列1所示的DNA分子;
2)在严格条件下与1)限定的DNA分子杂交且编码所述植物耐逆性和/或籽粒产量相关蛋白质的DNA分子;
3)与1)限定的DNA序列至少具有70%、至少具有75%、至少具有80%、至少具有85%、至少具有90%、至少具有95%、至少具有96%、至少具有97%、至少具有98%或至少具有99%同源性且编码所述植物耐逆性和/或籽粒产量相关蛋白质的DNA分子。
其中,序列表中序列1由2871位脱氧核糖核苷酸组成,其开放阅读框架(ORF)为自5′末端第1-2871位碱基,自5’端的第1至3位脱氧核糖核苷酸为NEK6的起始密码子ATG,自5’端的第2869至2871位脱氧核糖核苷酸为NEK6的终止密码子TAG,编码氨基酸序列是序列表中序列2所示的NEK6。
所述特异性杂交条件可为如下:50℃,在7%十二烷基硫酸钠(SDS)、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在50℃,2X SSC,0.1%SDS中漂洗;还可为:50℃,在7%十二烷基硫酸钠(SDS)、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在50℃,1X SSC,0.1%SDS中漂洗;还可为:50℃,在7%十二烷基硫酸钠(SDS)、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在50℃,0.5X SSC,0.1%SDS中漂洗;还可为:50℃,在7%十二烷基硫酸钠(SDS)、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在50℃,0.1X SSC,0.1%SDS中漂洗;还可为:50℃,在7%十二烷基硫酸钠(SDS)、0.5M NaPO4和1mM EDTA的混合溶液中杂交,在65℃,0.1X SSC,0.1%SDS中漂洗。
所述严格条件也可为在6×SSC,0.5%SDS的溶液中,在65℃下杂交,然后用2×SSC,0.1%SDS和1×SSC,0.1%SDS各洗膜一次。
含有上述蛋白NEK6编码基因NEK6的重组表达载体、重组菌、转基因细胞系或表达盒也属于本发明的保护范围。
所述重组表达载体具体可为在pROKII的BamHI和KpnI位点间插入序列表中序列1的自5’末端的第1-2871位脱氧核苷酸得到的pROKII-NEK6。
可用现有的植物表达载体构建含有NEK6基因的重组表达载体。
所述植物表达载体包括双元农杆菌载体和可用于植物微弹轰击的载体等。所述植物表达载体还可包含外源基因的3’端非翻译区域,即包含聚腺苷酸信号和任何其它参与mRNA加I或基因表达的DNA片段。所述聚腺苷酸信号可引导聚腺苷酸加入到mRNA前体的3’端,如农杆菌冠瘿瘤诱导(Ti)质粒基因(如胭脂合成酶Nos基因)、植物基因(如大豆贮存蛋白基因)3’端转录的非翻译区均具有类似功能。
使用NEK6构建重组植物表达载体时,在其转录起始核苷酸前可加上任何一种增强型启动子或组成型启动子,如花椰菜花叶病毒(CAMV)35S启动子、玉米的泛素启动子(Ubiquitin),它们可单独使用或与其它植物启动子结合使用;此外,使用本发明的基因构建植物表达载体时,还可使用增强子,包括翻译增强子或转录增强子,这些增强子区域可以是ATG起始密码子或邻接区域起始密码子等,但必需与编码序列的阅读框相同,以保证整个序列的正确翻译。所述翻译控制信号和起始密码子的来源是广泛的,可以是天然的,也可以是合成的。翻译起始区域可以来自转录起始区域或结构基因。
为了便于对转基因植物细胞或植物进行鉴定及筛选,可对所用植物表达载体进行加工,如加入可在植物中表达的编码可产生颜色变化的酶或发光化合物的基因(GUS基因、萤光素酶基因等)、具有抗性的抗生素标记物(庆大霉素标记物、卡那霉素标记物等)或是抗化学试剂标记基因(如抗除莠剂基因)等。从转基因植物的安全性考虑,可不加任何选择性标记基因,直接以逆境筛选转化植株。
扩增上述任一所述编码基因全长或其任意片段的引物对也属于本发明的保护范围。
本发明的另一个目的是提供培育籽粒产量提高和/或耐逆性提高的转基因植物的方法。
本发明所提供的培育籽粒产量提高和/或耐逆性提高的转基因植物的方法,是将所述的编码基因导入目的植物得到的转基因植物,所述转基因植物的籽粒产量和/或耐逆性高于所述目的植物。
所述转基因植物的籽粒产量高于所述目的植物由籽粒数目增加引起。
所述编码基因是通过所述重组表达载体导入所述目的植物中。
所述耐逆性为耐旱性和/或耐盐性。
所述植物为双子叶植物或单子叶植物,所述双子叶植物优选为拟南芥。
转化的细胞、组织或植物理解为不仅包含转化过程的最终产物,也包含其转基因子代。
本发明中所述的“多核苷酸”、“多核苷酸分子”、“多核苷酸序列”、“编码序列”、“开放阅读框(ORF)”等包括单链或双链的DNA和RNA分子,可包含一个或多个原核序列,cDNA序列,包含外显子和内含子的基因组DNA序列,化学合成的DNA和RNA序列,以及有义和相应的反义链。
本发明基因可通过如下方式导入宿主中:将本发明基因插入表达盒中,再将表达盒通过植物表达载体、非致病自我复制的病毒或弄杆菌导入宿主。携带本发明基因的表达载体可通过使用Ti质粒、Ri质粒、植物病毒载体、直接DNA转化、显微注射、电导、农杆菌介导等常规生物学方法转化植物细胞或组织。
本发明的实验证明,本发明的蛋白及其编码基因具有提高植物耐逆性功能,转入本发明基因的植物的耐逆性明显高于野生型植物,如转基因拟南芥,经过盐或干旱处理后,转基因植物的恢复情况明显好于野生型拟南芥,且存活率、抽薹率等都较对照组有显著提高。同时,转入本发明基因的转基因植物的每株籽粒总产量也明显高于对照,过量表达本发明所述基因的各转基因株系其耐旱、耐盐性及单株籽粒产量均与转基因植株中本发明所述基因的表达量呈正相关,进一步表明本发明所述蛋白及其编码基因对培育耐逆且籽粒高产植物品种,特别是培育耐非生物胁迫如耐干旱和/或耐盐且高产植物品种,从而提高农作物产量具有重要意义。因此,本发明蛋白及其编码基因有广阔的应用前景。
附图说明
图1为NEK6基因在NaCl和ACC处理时的表达情况。
图2为NEK6的过量表达载体的示意图和转基因植株的分子鉴定。
图3为NEK6突变体nek6的分子鉴定。
图4为正常生长条件下转基因植株和突变体nek6的表型分析。
图5为转基因植株和突变体nek6的耐盐性鉴定。
图6为转基因植株和突变体nek6的耐旱性鉴定。
图7为转基因植株和突变体nek6在不同浓度甘露醇处理后的存活率和开花率的统计。
具体实施方式
下述实施例中所使用的实验方法如无特殊说明,均为常规方法。
下述实施例中所用的材料、试剂等,如无特殊说明,均可从商业途径得到。
实施例1、拟南芥NEK6基因的克隆
1、NEK6基因的克隆
采用异硫氰酸胍-酚-氯仿的方法进行提取野生型拟南芥(Arabidopsis thaliana)Col-0(G Arabidopsis Research Science.1965Nov 26;150(3700):1192,公众可从中国科学院遗传与发育生物学研究所获得。)总RNA(Zhang et al.2001 Atwo-component gene(NTHK1)encoding a putative ethylene-receptor homolog isboth developmentally and stress-regulated in tobacco.Theor Appl Genet 102:815-824),使用Promega试剂盒(购自Promega公司)PolyAT tract mRNA isolationsystem IV进行mRNA分离纯化,分离得到的mRNA用紫外分光光度计进行定量,然后反转录得到cDNA。以反转录得到的cDNA为模板,正向引物:5’-cgcgggatccatggagtcacgaatggaccc g-3’和反向引物:5’-cgcggtcgactcatgaacaattcctggagctgccactg-3’进行PCR扩增。对PCR产物进行0.8%琼脂糖凝胶电泳检测,结果表明,PCR扩增产物的大小约为2.9kb,与预期结果相符。用琼脂糖凝胶回收试剂盒(TIANGEN)回收该片段。将该回收片段与pGEM-T Easy(Promega)连接,参照Cohen等的方法,将连接产物转化大肠杆菌DH5α感受态细胞,根据pGEM-T Easy载体上的羧卞青霉素抗性标记筛选阳性克隆,得到含有回收片段的重组质粒。以该重组质粒载体上的T7和SP6启动子序列为引物对其进行核苷酸序列测定,测序结果表明该PCR产物具有序列表中的序列1的核苷酸序列,该PCR产物的基因命名为NEK6,其ORF为序列1的自5’末端第1-2871位核苷酸。该基因编码的蛋白命名为NEK6,该蛋白的氨基酸序列为序列表的序列2,序列表中的序列2由956个氨基酸组成。序列分析表明,NEK6由14个外显子构成,ORF为2871个脱氧核糖核苷酸组成,编码956个氨基酸。
2、NEK6基因的表达模式乙烯前体ACC处理
将在MS培养基中生长10天的野生型拟南芥分别植入含200mM NaCl的MS培养基和含10μM ACC(一氨基环丙烷羧酸)的MS培养基中,在处理2、4、6、12小时时分别取样,制备每个样本的总RNA,每泳道上样量为15μg,以NEK6为探针,进行NORTHERN分析。结果如图1所示,可以看出,在乙烯的前体ACC(10μM)处理后诱导NEK6基因的表达,并在6小时表达量达到最高;同样NaCl(200mM)处理也诱导NEK6基因的表达,并在4小时表达量达到最高。因此NEK6是乙烯和盐响应基因。
实施例2、转NEK6拟南芥培育和NEK6基因突变体nek6的鉴定
1、植物表达载体的构建及转基因纯系的获得
将由实施例1获得的PCR产物连接在pMD-18T载体(Promega)上构建重组质粒pMD-18T-NEK6,将pMD-18T-NEK6经BamH1和Kpn1酶切得到的小片段连接到同样经BamH1和Kpn1酶切PBSK载体(Promega)得到重组质粒PBSK-NEK6。将重组质粒PBSK-NEK6用BamHI和KpnI酶切获得的小片段连接到经同样酶切双元载体pROKII(购自Stratagene公司)获得的大片段上,得到重组载体pROK II-NEK6。经测序,该重组载体pROK II-NEK6为将序列表中序列1的5’末端第1-2871位核苷酸插入载体pROK II的BamH1和Kpn1酶切位点间得到的。
将重组载体pROK II-NEK6用电击转化法导入农杆菌GV3101菌株(Clough-SJ,Bent-AF.Floral dip:a simplified method for Agrobacterium-mediatedtransformation of Arabidopsis thaliana.Plant-Journal.1998,16:6,735-743;公众可从中国科学院遗传与发育生物学研究所获得。)得到重组菌GV3101/pROK II-NEK6。将重组菌进行菌液PCR鉴定,引物为正向引物:5’-cgcgggatccatggagtcac gaatggaccc g-3’和反向引物:5’-cgcggtcgactcatgaacaattcctggagctgccactg-3’,得到2.9kb片段的重组菌提取质粒测序,结果为该质粒为pROK II-NEK6,将含有质粒的重组菌命名为GV3101/pROK II-NEK6。
挑取GV3101/pROK II-NEK6的单菌落在5mlLB中,于28℃培养8-12小时,再转接到200mlLB中继续培养3-6小时,收菌后重悬于LB培养基中得到转化液。将野生型拟南芥(Arabidopsis thaliana)Col-0的花浸泡于转化液中10s,取出后放入MS培养基中避光培养8-12小时,获得T0代转化种子,将其播于含卡那霉素(50mg/L)的MS培养基上,获得30株T0代转NEK6拟南芥。
采用同样的方法将空载体pROKII转入野生型拟南芥中,获得转pROKII拟南芥。
提取上述30株T0代转NEK6拟南芥植株的RNA,并反转录获得cDNA,分别进行RT-PCR鉴定,引物为5’-cgcgggatccatggagtcacgaatggatcag-3’和5’-cgcggtcgacacaattcctggagctgccactg-3’,结果如图2所示,其中图2A为表达载体pROK II-NEK6的部分结构示意图,图2B为T0代转NEK6拟南芥的分子鉴定,对照Col为转pROKII拟南芥,1-8分别为8株T0代转NEK6拟南芥,图2B显示,转pROKII拟南芥没有表达NEK6;在转基因株系中,NEK6基因表达较高的为编号1(NEK6-OE1)和编号2(NEK6-OE2)株系,NEK6-OE2株系中NEK6的表达量高于NEK6-OE1中NEK6的表达量。
T0代转NEK6拟南芥株系经过3代连续筛选,分别获得T3代转NEK6拟南芥纯系。将T3代转NEK6拟南芥纯系中的NEK6-OE1纯系和NEK6-OE2纯系采用同样的方法进行RT-PCR鉴定,结果为NEK6-OE1纯系和NEK6-OE2纯系的NEK6均得到表达。
2、NEK6T-DNA突变体鉴定:
NEK6突变体nek6记载在Motose H,Tominaga R,Wada T,Sugiyama M,Watanabe Y,ANIMA-related protein kinase suppresses ectopic outgrowth of epidermal cells through its kinaseactivity and the association with microtubules,Plant J.2008Jun;54(5):829-44;公众可从中国科学院遗传与发育生物学研究所获得。
提取野生型拟南芥叶片的DNA,以分别为NEK6基因两端的引物LP+BP序列为LP:CTTTGAAAGCAGGTCGATACG;BP:TTTGGTTACAGGGCTGAGTTG及以T-DNA边界序列设计的引物LBb1:GCGTGGACCGCTTGCTGCAACT共3条引物进行PCR,野生型拟南芥中,LP+BP引物可获得大于800bp的条带,而突变体nek6中,由于T-DNA的插入,不以LP+BP为引物进行扩增而以BP和LBb1或LP和LBb1为引物,扩出的2条PCR条带均小于800bp,因此PCR产物为两条带证明是杂合体。T-DNA的插入位点示于图3A。
提取nek6突变体的RNA后转录出CDNA进行RT-PCR鉴定,引物为5’-cgcgggatccatggagtcacgaatggatcag-3’和5’-cgcggtcgacacaattcctggagctgccactg-3’,以野生型拟南芥为对照,结果显示在nek6突变体中未检测到NEK6的转录(如图3B所示),其中col为对照。说明在NEK6突变体中,NEK6基因被沉默,没有转录子,不能正确表达。
实施例3、转NEK6拟南芥、突变体nek6和对照的表型分析
将由实施例2获得NEK6-OE1纯系和NEK6-OE2纯系及nek6突变体培养约2个月收获种子,观察各植株表型,以野生型拟南芥和实施例2获得的转pROKII拟南芥为对照,统计结果如表1所示,表型见图4,实验重复三次,结果取平均值±标准差。
表1Nek6-OE株系和突变体nek6的表型统计
n为株数;*表示显著差异;**表示极显著差异。(种子重为单株种子重。)
图中和表中,对照(col)是指野生型拟南芥。NEK6-OE1(OE1)和NEK6-OE2(OE2)的单株种子重(单株籽粒产量)、果荚长均大于野生型拟南芥,而千粒重均小于野生型拟南芥或者与野生型拟南芥相当,说明两个转基因株系的单株籽粒产量均高于野生型拟南芥是由籽粒数量增多、果荚数及长度增加所致。
表1和图4的结果均说明,在正常条件下,与野生型拟南芥(col)相比,nek6的莲座明显变小,荚变短。而NEK6-OE1(OE1)、NEK6-OE2(OE2的表型和突变体相反,莲座明显变大,叶宽、荚长,荚数增加与野生型拟南芥相比均呈显著或极显著。NEK6-OE2株系的上述表型,如叶宽、荚长、千粒重、单株籽粒产量(单株种子重)等性状比NEK6-OE1株系的变化更显著,NEK6-OE2中NEK6的表达量明显高于NEK6-OE1,说明转基因株系在上述性状上的变化是与NEK6基因的过量表达相关。野生型拟南芥和转pROKII拟南芥的结果无显著差异。
综上所述,可以说明转NEK6拟南芥的表型变化与NEK6的表达水平相关,且NEK6-OE2植株中NEK6的表达高于NEK6-OE1。
实施例4、NEK6过量表达转基因株系和突变体nek6的耐盐性和耐旱性鉴定
1、耐盐性鉴定
将由实施例2获得NEK6-OE1纯系、由实施例2获得NEK6-OE2纯系、nek6突变体、转pROKII拟南芥和野生型拟南芥在MS培养基上生长5天且生长一致的苗移至含有125mM NaClMS培养基上继续生长14天,移苗至蛭石中恢复生长,14天后统计存活率和开花率。以未经125mM NaCl处理为对照。NEK6-OE1(OE1)、NEK6-OE2(OE2)、nek6突变体、转pROKII拟南芥和野生型拟南芥各15株。实验重复三次,结果取平均值。
结果见图5所示,
图5A为苗期:在MS培养基上生长5天后移至含有125mM NaCl的MS培养基上继续生长14天的苗记作NaCl(125mM);一直在MS培养基生长的苗作为对照记作MS;从图中看出,125mM NaCl处理均影响了NEK6-OE1、NEK6-OE2、nek6突变体(nek6)和野生型拟南芥(col)的生长,但是2个转基因株系明显优于野生型对照,而野生型对照明显优于nek6突变体。
图5B为对照、nek6突变体和转基因转系经盐胁迫后恢复生长的比较。图中第二排和第三排是不同角度拍的上述株系的生长情况。从图中看出,在经过125mM NaCl处理14天后恢复生长14天时,NEK6-OE1和NEK6-OE2株系的生长明显优于nek6突变体(nek6)和野生型拟南芥(col),而野生型对照略优于nek6突变体。
图5C为显示了各株系经上述处理后的存活率和开花率比较。从图中看出,野生型拟南芥(col)存活率为73%,nek6突变体(nek6)存活率52%,NEK6-OE1和NEK6-OE2存活率分别为90%和98%,三者的明显差异与所观察的表型一致。开花率的差异则更为显著,野生型拟南芥(col)为32%,nek6突变体(nek6)为14%,而NEK6-OE1和NEK6-OE2开花率分别为64%和86%。
图5的结果表明,在转基因植株OE-2中,NEK6的表达量高于OE-1中NEK6的表达量,从表1列出的表型统计说明,OE-2株系的单株籽粒产量(单株种子重)明显高于野生型拟南芥和OE-1株系,从图5看出,OE-2株系的耐盐性明显高于野生型拟南芥和OE-1株系。因此NEK6的表达与植株的耐盐性和单株籽粒产量呈正相关。野生型拟南芥和转pROKII拟南芥的结果无显著差异。
2、耐旱性鉴定
将由实施例2获得NEK6-OE1和NEK6-OE2纯系、nek6突变体和野生型拟南芥在MS培养基上生长5天,将5天的苗移至分别含90、120、150、200、300mM甘露醇的MS培养基中14天,之后再移栽至蛭石中恢复14天,对存活率及抽薹率作统计。以未经任何浓度甘露醇处理为对照。实验中各株系均取15株,实验重复三次,结果取平均值。
结果如图6所示,
图6A为苗期:在MS培养基上生长5天移至含200mM甘露醇的MS培养基中14天的苗记作200mM甘露醇;在MS培养基生长未作处理的苗作为对照记作MS;从图中看出,NEK6-OE1、NEK6-OE2、nek6突变体(nek6)和野生型拟南芥(col)在经过200mM甘露醇处理后均出现不同程度的黄萎,但是2个转基因株系明显优于野生型对照,而野生型对照明显优于nek6突变体。
图6B为成熟期,经过200mM甘露醇处理14天,恢复生长14天后,NEK6-OE1和NEK6-OE2的生长明显优于nek6突变体(nek6)和野生型拟南芥(col)。
野生型拟南芥和转pROKII拟南芥的结果无显著差异。
将由实施例2获得NEK6-OE1和NEK6-OE2纯系、nek6突变体、野生型拟南芥分别经90、120、150、200、300mM甘露醇的MS培养基处理14天,之后再移栽至蛭石中恢复14天,统计存活率及开花率。
结果如图7所示,其中图7A为存活率统计;图7B为开花率统计,其中col为野生型拟南芥。从图7中可以看出,未加入甘露醇时,NEK6-OE1、NEK6-OE2、nek6和野生型拟南芥(col)的存活率和开花率均为100%,未显示差异。当甘露醇浓度达到120mM后,NEK6-OE1和NEK6-OE2的存活率分别为95%和100%,而nek6和野生型拟南芥(col)的存活率分别为50%和70%。开花率的差异在经90mM甘露醇处理后即显示了差异,NEK6-OE1和NEK6-OE2的开花率分别为90%和93%,而nek6和野生型拟南芥(col)的开花率分别为70%和73%,甘露醇浓度达120mM时,它们间的差异趋于显著,NEK6-OE1和NEK6-OE2的开花率分别为74%和80%,nek6和野生型拟南芥(col)的开花率分别仅为15%和35%。随着甘露醇浓度升高,三种株系间的存活率和开花率的差异也更加明显,因此NEK6基因的过表达可以提高植物的耐旱性。野生型拟南芥和转pROKII拟南芥的结果无显著差异。
序列表
<110>中国科学院遗传与发育生物学研究所
<120>一种与植物耐逆性相关的蛋白NEK6及其编码基因与应用
<130>CGGNACB 102635
<160>2
<210>1
<211>2871
<212>DNA
<213>拟南芥(Arabidopsis thaliana)
<400>1
atggagtcac gaatggaccc gtacgagctt atggaacaga tcgggagagg agcgtttggt 60
gctgctattt tagttcatca taaagctgag agaaagaagt atgttttgaa gaagatcaga 120
ttagctagac aaacggaacg ttgccggaga tctgctcatc aagagatgtc tttgattgca 180
agagttcaac atccttatat tgtggagttt aaagaagctt gggttgagaa aggttgctat 240
gtttgtattg tcactggata ctgtgaagga ggagacatgg ctgagttgat gaaaaagtca 300
aatggtgttt attttcctga ggagaaactt tgcaagtggt ttactcaatt attattagct 360
gttgagtatc tgcattctaa ctatgttcta catcgggatc taaagtgttc caacattttc 420
cttacaaaag atcaagatgt tcgccttggg gactttggtc ttgccaaaac tttgaaggcg 480
gatgacctaa cttcctcggt tgttggaact ccaaactaca tgtgcccgga actgcttgct 540
gatatccctt atggttttaa gtcagatatc tggtccttag gctgttgtat atatgagatg 600
gctgcgtatc gacctgcttt caaagctttt gatatggcag ggcttataag caaggttaat 660
cgctcctcaa ttggtccgtt gcctccgtgc tattcaccat ctttaaaagc actcatcaag 720
ggaatgttaa ggaagaaccc cgagtatcgg ccaaacgcct ccgagatttt gaagcatcct 780
tatctgcagc catatgttga gcagtaccgt ccaacgcttt ctgcggcgtc tataactcca 840
gaaaagcctc tcaattctcg cgagggtcgg aggagtatgg ctgaaagtca gaatagcaat 900
agttcaagtg aaaaagataa cttctacgtg agtgacaaaa atatccgata tgtggttcca 960
agtaatggta ataaagtcac tgagaccgat tcaggttttg tcgatgatga agacatctta 1020
gaccatgtgc aacaatcagc tgaaaatggt aatctccaga gtgtttcagc aacaaaacca 1080
gatggccatg ggattttaaa gcccgtacac agtgaccagc gaccagacgt tattcaacca 1140
aggcacccaa aaactatcag aaacatcatg atggttttga aagaagagaa agctcgagaa 1200
aatggttcac ccatgagatc taatcgaagt agaccttcta gtgttccgac acagaagaat 1260
aatgttgaaa ctccatcaaa gattcccaaa cttggtgaca ttgctcatag ctccaagact 1320
aacgcaagta cgcctattcc accatcaaag ctggcctctg attccgcaag gactccagga 1380
tcatttccac caaagcatca tatgccagtg attgactctt ctccaaaact taagcctaga 1440
aacgacagaa tttcaccttc tcctgctgct aaacatgagg ctgaagaggc gatgtcagtt 1500
aagcgtaggc aaagaacacc tcctactttg ccaagaagaa cgtctttgat agcgcaccag 1560
tcgagacaac taggagcaga tatttcaaat atggcagcaa aggaaaccgc gaagctacat 1620
ccatctgtgc catcagagtc tgaaactaat tctcatcaat ctcgtgttca tgcttctcct 1680
gtgagtacga cgccagagcc aaagagaacc tctgttggat ctgccaaagg aatgcagagt 1740
gaaagcagca actctatctc gtcgtcgttg tccatgcagg catttgagct ctgtgatgat 1800
gcttccaccc catatattga catgacagaa catacaactc ctgatgacca cagaagatcc 1860
tgtcacagtg aatactcata ttcatttcca gatatctcct cagagatgat gatccgtaga 1920
gatgaacata gcaccagtat gcggctgact gaaattcccg actctgtttc cggtgtacaa 1980
aacaccattg cccttcatca gccagaaaga gagcaaggaa gctgtcccac tgtgctcaaa 2040
gatgatagtc ctgcaacatt acagagctac gagcctaaca catcacaaca tcaacacggt 2100
gatgacaaat tcacagtcaa agaattcgtc tcctctgttc ccggacccgc accattgcct 2160
ttgcatgttg aaccatcaca ccaagtgaat tcccactcgg ataacaaaac aagcgtaatg 2220
tctcagaact cagctcttga gaagaacaac agtcactccc atcctcatcc cgtggttgat 2280
gatgtcatac atgttattcg ccacagcagt ttccgagtag ggagcgacca gccggtcatg 2340
gaaagtgttg aagtcggtgt ccaaaacgtc gatatgggga aactcataaa cgttgtgaga 2400
gatgaaatgg aagtgagaaa aggagccact ccctcagaat cacccaccac aagatcgatc 2460
atctcagagc ctgactcaag aaccgagcct cgtcctaaag aaccagaccc catcaccaat 2520
tactcagaaa caaagagctt taactcctgc tcggattctt caccagctga gaccagaact 2580
aactcattcg tgcctgaaga ggaaacaact ccgactccac ctgttaaaga gacgttggac 2640
atcaagtcgt tcagacagag agcagaagcg cttgaaggtc tattggaact atctgcggat 2700
ttgctggaac agagcaggct cgaagagcta gccattgtgt cgcagccgtt tgggaagaac 2760
aaagtttcgc ctcgagaaac cgctatttgg ttggccaaga gtttgaaagg aatgatgatc 2820
gaagacatca ataataataa tagcagtggc agctccagga attgttcatg a 2871
<210>2
<211>956
<212>PRT
<213>拟南芥(Arabidopsis thaliana)
<400>2
Met Glu Ser Arg Met Asp Pro Tyr Glu Leu Met Glu Gln Ile Gly Arg
1 5 10 15
Gly Ala Phe Gly Ala Ala Ile Leu Val His His Lys Ala Glu Arg Lys
20 25 30
Lys Tyr Val Leu Lys Lys Ile Arg Leu Ala Arg Gln Thr Glu Arg Cys
35 40 45
Arg Arg Ser Ala His Gln Glu Met Ser Leu Ile Ala Arg Val Gln His
50 55 60
Pro Tyr Ile Val Glu Phe Lys Glu Ala Trp Val Glu Lys Gly Cys Tyr
65 70 75 80
Val Cys Ile Val Thr Gly Tyr Cys Glu Gly Gly Asp Met Ala Glu Leu
85 90 95
Met Lys Lys Ser Asn Gly Val Tyr Phe Pro Glu Glu Lys Leu Cys Lys
100 105 110
Trp Phe Thr Gln Leu Leu Leu Ala Val Glu Tyr Leu His Ser Asn Tyr
115 120 125
Val Leu His Arg Asp Leu Lys Cys Ser Asn Ile Phe Leu Thr Lys Asp
130 135 140
Gln Asp Val Arg Leu Gly Asp Phe Gly Leu Ala Lys Thr Leu Lys Ala
145 150 155 160
Asp Asp Leu Thr Ser Ser Val Val Gly Thr Pro Asn Tyr Met Cys Pro
165 170 175
Glu Leu Leu Ala Asp Ile Pro Tyr Gly Phe Lys Ser Asp Ile Trp Ser
180 185 190
Leu Gly Cys Cys Ile Tyr Glu Met Ala Ala Tyr Arg Pro Ala Phe Lys
195 200 205
Ala Phe Asp Met Ala Gly Leu Ile Ser Lys Val Asn Arg Ser Ser Ile
210 215 220
Gly Pro Leu Pro Pro Cys Tyr Ser Pro Ser Leu Lys Ala Leu Ile Lys
225 230 235 240
Gly Met Leu Arg Lys Asn Pro Glu Tyr Arg Pro Asn Ala Ser Glu Ile
245 250 255
Leu Lys His Pro Tyr Leu Gln Pro Tyr Val Glu Gln Tyr Arg Pro Thr
260 265 270
Leu Ser Ala Ala Ser Ile Thr Pro Glu Lys Pro Leu Asn Ser Arg Glu
275 280 285
Gly Arg Arg Ser Met Ala Glu Ser Gln Asn Ser Asn Ser Ser Ser Glu
290 295 300
Lys Asp Asn Phe Tyr Val Ser Asp Lys Asn Ile Arg Tyr Val Val Pro
305 310 315 320
Ser Asn Gly Asn Lys Val Thr Glu Thr Asp Ser Gly Phe Val Asp Asp
325 330 335
Glu Asp Ile Leu Asp His Val Gln Gln Ser Ala Glu Asn Gly Asn Leu
340 345 350
Gln Ser Val Ser Ala Thr Lys Pro Asp Gly His Gly Ile Leu Lys Pro
355 360 365
Val His Ser Asp Gln Arg Pro Asp Val Ile Gln Pro Arg His Pro Lys
370 375 380
Thr Ile Arg Asn Ile Met Met Val Leu Lys Glu Glu Lys Ala Arg Glu
385 390 395 400
Asn Gly Ser Pro Met Arg Ser Asn Arg Ser Arg Pro Ser Ser Val Pro
405 410 415
Thr Gln Lys Asn Asn Val Glu Thr Pro Ser Lys Ile Pro Lys Leu Gly
420 425 430
Asp Ile Ala His Ser Ser Lys Thr Asn Ala Ser Thr Pro Ile Pro Pro
435 440 445
Ser Lys Leu Ala Ser Asp Ser Ala Arg Thr Pro Gly Ser Phe Pro Pro
450 455 460
Lys His His Met Pro Val Ile Asp Ser Ser Pro Lys Leu Lys Pro Arg
465 470 475 480
Asn Asp Arg Ile Ser Pro Ser Pro Ala Ala Lys His Glu Ala Glu Glu
485 490 495
Ala Met Ser Val Lys Arg Arg Gln Arg Thr Pro Pro Thr Leu Pro Arg
500 505 510
Arg Thr Ser Leu Ile Ala His Gln Ser Arg Gln Leu Gly Ala Asp Ile
515 520 525
Ser Asn Met Ala Ala Lys Glu Thr Ala Lys Leu His Pro Ser Val Pro
530 535 540
Ser Glu Ser Glu Thr Asn Ser His Gln Ser Arg Val His Ala Ser Pro
545 550 555 560
Val Ser Thr Thr Pro Glu Pro Lys Arg Thr Ser Val Gly Ser Ala Lys
565 570 575
Gly Met Gln Ser Glu Ser Ser Asn Ser Ile Ser Ser Ser Leu Ser Met
580 585 590
Gln Ala Phe Glu Leu Cys Asp Asp Ala Ser Thr Pro Tyr Ile Asp Met
595 600 605
Thr Glu His Thr Thr Pro Asp Asp His Arg Arg Ser Cys His Ser Glu
610 615 620
Tyr Ser Tyr Ser Phe Pro Asp Ile Ser Ser Glu Met Met Ile Arg Arg
625 630 635 640
Asp Glu His Ser Thr Ser Met Arg Leu Thr Glu Ile Pro Asp Ser Val
645 650 655
Ser Gly Val Gln Asn Thr Ile Ala Leu His Gln Pro Glu Arg Glu Gln
660 665 670
Gly Ser Cys Pro Thr Val Leu Lys Asp Asp Ser Pro Ala Thr Leu Gln
675 680 685
Ser Tyr Glu Pro Asn Thr Ser Gln His Gln His Gly Asp Asp Lys Phe
690 695 700
Thr Val Lys Glu Phe Val Ser Ser Val Pro Gly Pro Ala Pro Leu Pro
705 710 715 720
Leu His Val Glu Pro Ser His Gln Val Asn Ser His Ser Asp Asn Lys
725 730 735
Thr Ser Val Met Ser Gln Asn Ser Ala Leu Glu Lys Asn Asn Ser His
740 745 750
Ser His Pro His Pro Val Val Asp Asp Val Ile His Val Ile Arg His
755 760 765
Ser Ser Phe Arg Val Gly Ser Asp Gln Pro Val Met Glu Ser Val Glu
770 775 780
Val Gly Val Gln Asn Val Asp Met Gly Lys Leu Ile Asn Val Val Arg
785 790 795 800
Asp Glu Met Glu Val Arg Lys Gly Ala Thr Pro Ser Glu Ser Pro Thr
805 810 815
Thr Arg Ser Ile Ile Ser Glu Pro Asp Ser Arg Thr Glu Pro Arg Pro
820 825 830
Lys Glu Pro Asp Pro Ile Thr Asn Tyr Ser Glu Thr Lys Ser Phe Asn
835 840 845
Ser Cys Ser Asp Ser Ser Pro Ala Glu Thr Arg Thr Asn Ser Phe Val
850 855 860
Pro Glu Glu Glu Thr Thr Pro Thr Pro Pro Val Lys Glu Thr Leu Asp
865 870 875 880
Ile Lys Ser Phe Arg Gln Arg Ala Glu Ala Leu Glu Gly Leu Leu Glu
885 890 895
Leu Ser Ala Asp Leu Leu Glu Gln Ser Arg Leu Glu Glu Leu Ala Ile
900 905 910
Val Ser Gln Pro Phe Gly Lys Asn Lys Val Ser Pro Arg Glu Thr Ala
915 920 925
Ile Trp Leu Ala Lys Ser Leu Lys Gly Met Met Ile Glu Asp Ile Asn
930 935 940
Asn Asn Asn Ser Ser Gly Ser Ser Arg Asn Cys Ser
945 950 955
Claims (10)
1.一种蛋白质,是由序列表中序列2所示的氨基酸序列组成的蛋白质。
2.权利要求1所述蛋白质的编码基因。
3.根据权利要求2所述的编码基因,其特征在于:所述编码基因为序列表中序列1所示的DNA分子。
4.含有权利要求2或3所述编码基因的表达盒。
5.含有权利要求2或3所述编码基因的重组菌。
6.含有权利要求2或3所述编码基因的转基因细胞系。
7.含有权利要求2或3所述编码基因的重组表达载体。
8.根据权利要求7所述的重组表达载体,其特征在于:所述重组表达载体为将权利要求2或3所述的编码基因插入载体pROKII的多克隆位点得到的重组表达载体。
9.一种培育籽粒产量提高和/或耐逆性提高的转基因植物的方法,是将权利要求2或3所述的编码基因导入目的植物得到转基因植物,所述转基因植物的籽粒产量和/或耐逆性高于所述目的植物;所述耐逆性为耐旱性和/或耐盐性;所述植物为拟南芥。
10.根据权利要求9所述的方法,其特征在于:权利要求2或3所述编码基因是通过权利要求7或8所述重组表达载体导入所述目的植物中。
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010270556 CN102382182B (zh) | 2010-09-01 | 2010-09-01 | 一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN 201010270556 CN102382182B (zh) | 2010-09-01 | 2010-09-01 | 一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102382182A CN102382182A (zh) | 2012-03-21 |
CN102382182B true CN102382182B (zh) | 2013-07-24 |
Family
ID=45822066
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN 201010270556 Expired - Fee Related CN102382182B (zh) | 2010-09-01 | 2010-09-01 | 一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102382182B (zh) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007053974A1 (fr) * | 2005-11-08 | 2007-05-18 | Beijing North Elite Biotechnology Co., Ltd. | Protéine en rapport avec la croissance et la résistance aux contraintes de plantes, ses gènes de codage et son utilisation |
CN101619096A (zh) * | 2009-07-31 | 2010-01-06 | 中国科学院遗传与发育生物学研究所 | 一种植物耐逆性相关蛋白及其编码基因与应用 |
CN101659699A (zh) * | 2008-08-25 | 2010-03-03 | 中国科学院遗传与发育生物学研究所 | 植物耐逆性相关蛋白GmSIK2及其编码基因与应用 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1273483C (zh) * | 2002-07-30 | 2006-09-06 | 中国农业科学院生物技术研究所 | 一个玉米bZIP类转录因子及其编码基因与应用 |
-
2010
- 2010-09-01 CN CN 201010270556 patent/CN102382182B/zh not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2007053974A1 (fr) * | 2005-11-08 | 2007-05-18 | Beijing North Elite Biotechnology Co., Ltd. | Protéine en rapport avec la croissance et la résistance aux contraintes de plantes, ses gènes de codage et son utilisation |
CN101659699A (zh) * | 2008-08-25 | 2010-03-03 | 中国科学院遗传与发育生物学研究所 | 植物耐逆性相关蛋白GmSIK2及其编码基因与应用 |
CN101619096A (zh) * | 2009-07-31 | 2010-01-06 | 中国科学院遗传与发育生物学研究所 | 一种植物耐逆性相关蛋白及其编码基因与应用 |
Also Published As
Publication number | Publication date |
---|---|
CN102382182A (zh) | 2012-03-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101921321B (zh) | 与植物株型相关的蛋白ipa1及其编码基因与应用 | |
CN101607989B (zh) | 一种水稻矮化相关蛋白及其编码基因与应用 | |
CN102010466A (zh) | 植物抗性相关蛋白myb及其编码基因与应用 | |
CN101747419A (zh) | 耐盐相关蛋白及其编码基因与应用 | |
CN112795588B (zh) | OsSGD1蛋白在调控水稻籽粒大小中的应用 | |
CN110628808A (zh) | 拟南芥AtTCP5基因及其在调控株高上的应用 | |
CN110804090B (zh) | 蛋白质CkWRKY33及其编码基因与应用 | |
CN107759676B (zh) | 一种植物直链淀粉合成相关蛋白Du15与其编码基因及应用 | |
CN114369147B (zh) | Bfne基因在番茄株型改良和生物产量提高中的应用 | |
CN101412751B (zh) | 一种与植物耐冷性相关的蛋白及其编码基因与应用 | |
CN104140462B (zh) | 植物耐盐性相关蛋白GhSnRK2-6及其编码基因与应用 | |
CN108864265B (zh) | 蛋白质TabZIP60在调控植物根系发育中的应用 | |
CN103172714B (zh) | 水稻叶片卷曲相关蛋白OsMYB103L及其编码基因与应用 | |
CN101525379B (zh) | 植物耐旱相关蛋白及其编码基因与它们的应用 | |
CN111394500B (zh) | 一种用于鉴定待测植物样品是否来源于SbSNAC1-382事件或其后代的方法 | |
CN104650204B (zh) | 与水稻atp运输和叶绿体发育相关的蛋白及其编码基因和应用 | |
CN104592370A (zh) | OsPYL9蛋白及其编码基因和应用 | |
CN104844699B (zh) | 大豆GmNEK1蛋白及其编码基因与应用 | |
CN114539373A (zh) | 与甘薯茎线虫病抗性相关蛋白IbPIF1及其编码基因与应用 | |
CN107417780A (zh) | Ubc32蛋白及其编码基因在调控植物耐旱性中的应用 | |
CN101993479B (zh) | 植物耐逆性相关转录因子TaWRKY1及其编码基因与应用 | |
CN102382182B (zh) | 一种与植物耐逆性相关的蛋白nek6及其编码基因与应用 | |
CN111303262A (zh) | 一种植物耐热性和根发育相关蛋白及其编码基因与应用 | |
CN114349833A (zh) | 钙调素结合蛋白cold12在调控植物耐冷性中的应用 | |
CN103709241B (zh) | 来源于苔藓植物的抗旱蛋白PpLEA3-25及其编码基因和应用 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20130724 Termination date: 20190901 |
|
CF01 | Termination of patent right due to non-payment of annual fee |