CN101880669B - Major bacterial blight resistance gene Xa3/Xa26-3 from small-grain wild rice (Oryza minuta) and its application in improving rice disease resistance
- Publication number
- CN101880669B, CN2010101397937A, CN201010139793A
- Authority
- CN
- China
- Prior art keywords
- leu
- ser
- gly
- asn
- ile
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 110
- 230000001580 bacterial effect Effects 0.000 title claims abstract description 39
- 235000007164 Oryza sativa Nutrition 0.000 title abstract description 39
- 235000009566 rice Nutrition 0.000 title abstract description 32
- 241000209094 Oryza Species 0.000 title abstract description 6
- 240000000125 Oryza minuta Species 0.000 title abstract description 5
- 208000035240 Disease Resistance Diseases 0.000 title description 14
- 239000002773 nucleotide Substances 0.000 claims description 12
- 125000003729 nucleotide group Chemical group 0.000 claims description 12
- 239000012634 fragment Substances 0.000 abstract description 24
- 201000010099 disease Diseases 0.000 abstract description 22
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 abstract description 22
- 230000009261 transgenic effect Effects 0.000 abstract description 7
- 230000001717 pathogenic effect Effects 0.000 abstract description 5
- 238000010367 cloning Methods 0.000 abstract description 4
- 238000002955 isolation Methods 0.000 abstract description 4
- 102000004169 proteins and genes Human genes 0.000 abstract description 4
- 241000894006 Bacteria Species 0.000 abstract description 3
- 241000907138 Xanthomonas oryzae pv. oryzae Species 0.000 abstract description 3
- 238000012795 verification Methods 0.000 abstract description 3
- 101710201625 Leucine-rich protein Proteins 0.000 abstract 1
- 108700001094 Plant Genes Proteins 0.000 abstract 1
- 241000196324 Embryophyta Species 0.000 description 45
- 240000007594 Oryza sativa Species 0.000 description 35
- 108020004414 DNA Proteins 0.000 description 18
- 230000002068 genetic effect Effects 0.000 description 18
- 241000282326 Felis catus Species 0.000 description 17
- 210000004436 artificial bacterial chromosome Anatomy 0.000 description 17
- 238000000034 method Methods 0.000 description 17
- 238000004458 analytical method Methods 0.000 description 14
- 230000009466 transformation Effects 0.000 description 14
- 239000013598 vector Substances 0.000 description 13
- 108010061238 threonyl-glycine Proteins 0.000 description 11
- FHETWELNCBMRMG-HJGDQZAQSA-N Asn-Leu-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FHETWELNCBMRMG-HJGDQZAQSA-N 0.000 description 10
- JKGHDYGZRDWHGA-SRVKXCTJSA-N Leu-Asn-Leu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JKGHDYGZRDWHGA-SRVKXCTJSA-N 0.000 description 8
- 239000013612 plasmid Substances 0.000 description 8
- 238000003752 polymerase chain reaction Methods 0.000 description 8
- 108010050848 glycylleucine Proteins 0.000 description 7
- 238000012163 sequencing technique Methods 0.000 description 7
- 108091028043 Nucleic acid sequence Proteins 0.000 description 6
- KCGIREHVWRXNDH-GARJFASQSA-N Ser-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N KCGIREHVWRXNDH-GARJFASQSA-N 0.000 description 6
- 108010034529 leucyl-lysine Proteins 0.000 description 6
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 5
- MKJBPDLENBUHQU-CIUDSAMLSA-N Asn-Ser-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O MKJBPDLENBUHQU-CIUDSAMLSA-N 0.000 description 5
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 5
- 108010006444 Leucine-Rich Repeat Proteins Proteins 0.000 description 5
- FQPDRTDDEZXCEC-SVSWQMSJSA-N Thr-Ile-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O FQPDRTDDEZXCEC-SVSWQMSJSA-N 0.000 description 5
- 241000746966 Zizania Species 0.000 description 5
- 235000002636 Zizania aquatica Nutrition 0.000 description 5
- 239000002299 complementary DNA Substances 0.000 description 5
- 238000009396 hybridization Methods 0.000 description 5
- 238000011081 inoculation Methods 0.000 description 5
- 230000003902 lesion Effects 0.000 description 5
- 108010015796 prolylisoleucine Proteins 0.000 description 5
- 230000004044 response Effects 0.000 description 5
- 238000010839 reverse transcription Methods 0.000 description 5
- 239000000523 sample Substances 0.000 description 5
- DJIMLSXHXKWADV-CIUDSAMLSA-N Asn-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(N)=O DJIMLSXHXKWADV-CIUDSAMLSA-N 0.000 description 4
- QOVWVLLHMMCFFY-ZLUOBGJFSA-N Asp-Asp-Asn Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O QOVWVLLHMMCFFY-ZLUOBGJFSA-N 0.000 description 4
- UMHUHHJMEXNSIV-CIUDSAMLSA-N Asp-Leu-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(O)=O UMHUHHJMEXNSIV-CIUDSAMLSA-N 0.000 description 4
- OFYVKOXTTDCUIL-FXQIFTODSA-N Asp-Ser-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N OFYVKOXTTDCUIL-FXQIFTODSA-N 0.000 description 4
- 108091026890 Coding region Proteins 0.000 description 4
- KCTIFOCXAIUQQK-QXEWZRGKSA-N Ile-Pro-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O KCTIFOCXAIUQQK-QXEWZRGKSA-N 0.000 description 4
- DPWGZWUMUUJQDT-IUCAKERBSA-N Leu-Gln-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O DPWGZWUMUUJQDT-IUCAKERBSA-N 0.000 description 4
- KUEVMUXNILMJTK-JYJNAYRXSA-N Leu-Gln-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 KUEVMUXNILMJTK-JYJNAYRXSA-N 0.000 description 4
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 4
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 4
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 4
- 150000001413 amino acids Chemical group 0.000 description 4
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 4
- 210000004901 leucine-rich repeat Anatomy 0.000 description 4
- 244000052769 pathogen Species 0.000 description 4
- 108010048818 seryl-histidine Proteins 0.000 description 4
- BVBKBQRPOJFCQM-DCAQKATOSA-N Arg-Asn-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BVBKBQRPOJFCQM-DCAQKATOSA-N 0.000 description 3
- 108700003861 Dominant Genes Proteins 0.000 description 3
- FBEJIDRSQCGFJI-GUBZILKMSA-N Glu-Leu-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FBEJIDRSQCGFJI-GUBZILKMSA-N 0.000 description 3
- JVWPPCWUDRJGAE-YUMQZZPRSA-N Gly-Asn-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O JVWPPCWUDRJGAE-YUMQZZPRSA-N 0.000 description 3
- BPQYBFAXRGMGGY-LAEOZQHASA-N Gly-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)CN BPQYBFAXRGMGGY-LAEOZQHASA-N 0.000 description 3
- OWSWUWDMSNXTNE-GMOBBJLQSA-N Ile-Pro-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(=O)O)C(=O)O)N OWSWUWDMSNXTNE-GMOBBJLQSA-N 0.000 description 3
- DLCOFDAHNMMQPP-SRVKXCTJSA-N Leu-Asp-Leu Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O DLCOFDAHNMMQPP-SRVKXCTJSA-N 0.000 description 3
- OYQUOLRTJHWVSQ-SRVKXCTJSA-N Leu-His-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O OYQUOLRTJHWVSQ-SRVKXCTJSA-N 0.000 description 3
- FAELBUXXFQLUAX-AJNGGQMLSA-N Leu-Leu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CC(C)C FAELBUXXFQLUAX-AJNGGQMLSA-N 0.000 description 3
- VDIARPPNADFEAV-WEDXCCLWSA-N Leu-Thr-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O VDIARPPNADFEAV-WEDXCCLWSA-N 0.000 description 3
- 244000118056 Oryza rufipogon Species 0.000 description 3
- 240000002582 Oryza sativa Indica Group Species 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- MSIYNSBKKVMGFO-BHNWBGBOSA-N Thr-Gly-Pro Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N1CCC[C@@H]1C(=O)O)N)O MSIYNSBKKVMGFO-BHNWBGBOSA-N 0.000 description 3
- GMXIJHCBTZDAPD-QPHKQPEJSA-N Thr-Ile-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N GMXIJHCBTZDAPD-QPHKQPEJSA-N 0.000 description 3
- 230000003321 amplification Effects 0.000 description 3
- 238000009395 breeding Methods 0.000 description 3
- 230000001488 breeding effect Effects 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 108010051673 leucyl-glycyl-phenylalanine Proteins 0.000 description 3
- 230000001404 mediated effect Effects 0.000 description 3
- 238000003199 nucleic acid amplification method Methods 0.000 description 3
- 108010051242 phenylalanylserine Proteins 0.000 description 3
- 230000001105 regulatory effect Effects 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 108091008146 restriction endonucleases Proteins 0.000 description 3
- 238000003757 reverse transcription PCR Methods 0.000 description 3
- 238000005204 segregation Methods 0.000 description 3
- XVZCXCTYGHPNEM-IHRRRGAJSA-N (2s)-1-[(2s)-2-[[(2s)-2-amino-4-methylpentanoyl]amino]-4-methylpentanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O XVZCXCTYGHPNEM-IHRRRGAJSA-N 0.000 description 2
- IMMKUCQIKKXKNP-DCAQKATOSA-N Ala-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCN=C(N)N IMMKUCQIKKXKNP-DCAQKATOSA-N 0.000 description 2
- HGRBNYQIMKTUNT-XVYDVKMFSA-N Ala-Asn-His Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N HGRBNYQIMKTUNT-XVYDVKMFSA-N 0.000 description 2
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 2
- RZZMZYZXNJRPOJ-BJDJZHNGSA-N Ala-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](C)N RZZMZYZXNJRPOJ-BJDJZHNGSA-N 0.000 description 2
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 2
- AWZKCUCQJNTBAD-SRVKXCTJSA-N Ala-Leu-Lys Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCCN AWZKCUCQJNTBAD-SRVKXCTJSA-N 0.000 description 2
- NLOMBWNGESDVJU-GUBZILKMSA-N Ala-Met-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLOMBWNGESDVJU-GUBZILKMSA-N 0.000 description 2
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 2
- VJVQKGYHIZPSNS-FXQIFTODSA-N Ala-Ser-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N VJVQKGYHIZPSNS-FXQIFTODSA-N 0.000 description 2
- NCQMBSJGJMYKCK-ZLUOBGJFSA-N Ala-Ser-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O NCQMBSJGJMYKCK-ZLUOBGJFSA-N 0.000 description 2
- KUFVXLQLDHJVOG-SHGPDSBTSA-N Ala-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C)N)O KUFVXLQLDHJVOG-SHGPDSBTSA-N 0.000 description 2
- IETUUAHKCHOQHP-KZVJFYERSA-N Ala-Thr-Val Chemical compound CC(C)[C@H](NC(=O)[C@@H](NC(=O)[C@H](C)N)[C@@H](C)O)C(O)=O IETUUAHKCHOQHP-KZVJFYERSA-N 0.000 description 2
- 108700028369 Alleles Proteins 0.000 description 2
- OTOXOKCIIQLMFH-KZVJFYERSA-N Arg-Ala-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N OTOXOKCIIQLMFH-KZVJFYERSA-N 0.000 description 2
- CPSHGRGUPZBMOK-CIUDSAMLSA-N Arg-Asn-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O CPSHGRGUPZBMOK-CIUDSAMLSA-N 0.000 description 2
- GMFAGHNRXPSSJS-SRVKXCTJSA-N Arg-Leu-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O GMFAGHNRXPSSJS-SRVKXCTJSA-N 0.000 description 2
- CMLGVVWQQHUXOZ-GHCJXIJMSA-N Asn-Ala-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O CMLGVVWQQHUXOZ-GHCJXIJMSA-N 0.000 description 2
- DAPLJWATMAXPPZ-CIUDSAMLSA-N Asn-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC(N)=O DAPLJWATMAXPPZ-CIUDSAMLSA-N 0.000 description 2
- NVGWESORMHFISY-SRVKXCTJSA-N Asn-Asn-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NVGWESORMHFISY-SRVKXCTJSA-N 0.000 description 2
- VKCOHFFSTKCXEQ-OLHMAJIHSA-N Asn-Asn-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O VKCOHFFSTKCXEQ-OLHMAJIHSA-N 0.000 description 2
- HYQYLOSCICEYTR-YUMQZZPRSA-N Asn-Gly-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O HYQYLOSCICEYTR-YUMQZZPRSA-N 0.000 description 2
- OWUCNXMFJRFOFI-BQBZGAKWSA-N Asn-Gly-Met Chemical compound [H]N[C@@H](CC(N)=O)C(=O)NCC(=O)N[C@@H](CCSC)C(O)=O OWUCNXMFJRFOFI-BQBZGAKWSA-N 0.000 description 2
- OLISTMZJGQUOGS-GMOBBJLQSA-N Asn-Ile-Arg Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N OLISTMZJGQUOGS-GMOBBJLQSA-N 0.000 description 2
- SEKBHZJLARBNPB-GHCJXIJMSA-N Asn-Ile-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O SEKBHZJLARBNPB-GHCJXIJMSA-N 0.000 description 2
- IBLAOXSULLECQZ-IUKAMOBKSA-N Asn-Ile-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC(N)=O IBLAOXSULLECQZ-IUKAMOBKSA-N 0.000 description 2
- NLRJGXZWTKXRHP-DCAQKATOSA-N Asn-Leu-Arg Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NLRJGXZWTKXRHP-DCAQKATOSA-N 0.000 description 2
- JEEFEQCRXKPQHC-KKUMJFAQSA-N Asn-Leu-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JEEFEQCRXKPQHC-KKUMJFAQSA-N 0.000 description 2
- VOKWBBBXJONREA-DCAQKATOSA-N Asn-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC(=O)N)N VOKWBBBXJONREA-DCAQKATOSA-N 0.000 description 2
- BKFXFUPYETWGGA-XVSYOHENSA-N Asn-Phe-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O BKFXFUPYETWGGA-XVSYOHENSA-N 0.000 description 2
- WLVLIYYBPPONRJ-GCJQMDKQSA-N Asn-Thr-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O WLVLIYYBPPONRJ-GCJQMDKQSA-N 0.000 description 2
- AMGQTNHANMRPOE-LKXGYXEUSA-N Asn-Thr-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O AMGQTNHANMRPOE-LKXGYXEUSA-N 0.000 description 2
- ULZOQOKFYMXHPZ-AQZXSJQPSA-N Asn-Trp-Thr Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O ULZOQOKFYMXHPZ-AQZXSJQPSA-N 0.000 description 2
- GHWWTICYPDKPTE-NGZCFLSTSA-N Asn-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)N)N GHWWTICYPDKPTE-NGZCFLSTSA-N 0.000 description 2
- BUVNWKQBMZLCDW-UGYAYLCHSA-N Asp-Asn-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BUVNWKQBMZLCDW-UGYAYLCHSA-N 0.000 description 2
- VPSHHQXIWLGVDD-ZLUOBGJFSA-N Asp-Asp-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O VPSHHQXIWLGVDD-ZLUOBGJFSA-N 0.000 description 2
- APYNREQHZOGYHV-ACZMJKKPSA-N Asp-Cys-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC(=O)O)N APYNREQHZOGYHV-ACZMJKKPSA-N 0.000 description 2
- SNDBKTFJWVEVPO-WHFBIAKZSA-N Asp-Gly-Ser Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(O)=O SNDBKTFJWVEVPO-WHFBIAKZSA-N 0.000 description 2
- KLYPOCBLKMPBIQ-GHCJXIJMSA-N Asp-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N KLYPOCBLKMPBIQ-GHCJXIJMSA-N 0.000 description 2
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 2
- JSHWXQIZOCVWIA-ZKWXMUAHSA-N Asp-Ser-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O JSHWXQIZOCVWIA-ZKWXMUAHSA-N 0.000 description 2
- RKXVTTIQNKPCHU-KKHAAJSZSA-N Asp-Val-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CC(O)=O RKXVTTIQNKPCHU-KKHAAJSZSA-N 0.000 description 2
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 2
- NLCZGISONIGRQP-DCAQKATOSA-N Cys-Arg-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CS)N NLCZGISONIGRQP-DCAQKATOSA-N 0.000 description 2
- XABFFGOGKOORCG-CIUDSAMLSA-N Cys-Asp-Leu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O XABFFGOGKOORCG-CIUDSAMLSA-N 0.000 description 2
- CVLIHKBUPSFRQP-WHFBIAKZSA-N Cys-Gly-Ala Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H](C)C(O)=O CVLIHKBUPSFRQP-WHFBIAKZSA-N 0.000 description 2
- ZMWOJVAXTOUHAP-ZKWXMUAHSA-N Cys-Ile-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CS)N ZMWOJVAXTOUHAP-ZKWXMUAHSA-N 0.000 description 2
- JUNZLDGUJZIUCO-IHRRRGAJSA-N Cys-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CS)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O JUNZLDGUJZIUCO-IHRRRGAJSA-N 0.000 description 2
- CMYVIUWVYHOLRD-ZLUOBGJFSA-N Cys-Ser-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CMYVIUWVYHOLRD-ZLUOBGJFSA-N 0.000 description 2
- KVCJEMHFLGVINV-ZLUOBGJFSA-N Cys-Ser-Asn Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KVCJEMHFLGVINV-ZLUOBGJFSA-N 0.000 description 2
- 239000003298 DNA probe Substances 0.000 description 2
- 102000007260 Deoxyribonuclease I Human genes 0.000 description 2
- 108010008532 Deoxyribonuclease I Proteins 0.000 description 2
- WUAYFMZULZDSLB-ACZMJKKPSA-N Gln-Ala-Asn Chemical compound NC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O WUAYFMZULZDSLB-ACZMJKKPSA-N 0.000 description 2
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 2
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 2
- HSHCEAUPUPJPTE-JYJNAYRXSA-N Gln-Leu-Tyr Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N HSHCEAUPUPJPTE-JYJNAYRXSA-N 0.000 description 2
- LURQDGKYBFWWJA-MNXVOIDGSA-N Gln-Lys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(=O)N)N LURQDGKYBFWWJA-MNXVOIDGSA-N 0.000 description 2
- XBWGJWXGUNSZAT-CIUDSAMLSA-N Gln-Met-Asp Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CCC(=O)N)N XBWGJWXGUNSZAT-CIUDSAMLSA-N 0.000 description 2
- XZUUUKNKNWVPHQ-JYJNAYRXSA-N Gln-Phe-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(O)=O XZUUUKNKNWVPHQ-JYJNAYRXSA-N 0.000 description 2
- NHMRJKKAVMENKJ-WDCWCFNPSA-N Gln-Thr-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O NHMRJKKAVMENKJ-WDCWCFNPSA-N 0.000 description 2
- IIMZHVKZBGSEKZ-SZMVWBNQSA-N Gln-Trp-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O IIMZHVKZBGSEKZ-SZMVWBNQSA-N 0.000 description 2
- NVHJGTGTUGEWCG-ZVZYQTTQSA-N Gln-Trp-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](C(C)C)C(O)=O NVHJGTGTUGEWCG-ZVZYQTTQSA-N 0.000 description 2
- ITYRYNUZHPNCIK-GUBZILKMSA-N Glu-Ala-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O ITYRYNUZHPNCIK-GUBZILKMSA-N 0.000 description 2
- AKJRHDMTEJXTPV-ACZMJKKPSA-N Glu-Asn-Ala Chemical compound C[C@H](NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O)C(O)=O AKJRHDMTEJXTPV-ACZMJKKPSA-N 0.000 description 2
- GLWXKFRTOHKGIT-ACZMJKKPSA-N Glu-Asn-Asn Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O GLWXKFRTOHKGIT-ACZMJKKPSA-N 0.000 description 2
- ZOXBSICWUDAOHX-GUBZILKMSA-N Glu-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CCC(O)=O ZOXBSICWUDAOHX-GUBZILKMSA-N 0.000 description 2
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 2
- PXHABOCPJVTGEK-BQBZGAKWSA-N Glu-Gln-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O PXHABOCPJVTGEK-BQBZGAKWSA-N 0.000 description 2
- DVLZZEPUNFEUBW-AVGNSLFASA-N Glu-His-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CCC(=O)O)N DVLZZEPUNFEUBW-AVGNSLFASA-N 0.000 description 2
- ZHNHJYYFCGUZNQ-KBIXCLLPSA-N Glu-Ile-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O ZHNHJYYFCGUZNQ-KBIXCLLPSA-N 0.000 description 2
- DNPCBMNFQVTHMA-DCAQKATOSA-N Glu-Leu-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O DNPCBMNFQVTHMA-DCAQKATOSA-N 0.000 description 2
- MWMJCGBSIORNCD-AVGNSLFASA-N Glu-Leu-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O MWMJCGBSIORNCD-AVGNSLFASA-N 0.000 description 2
- UGSVSNXPJJDJKL-SDDRHHMPSA-N Glu-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCC(=O)O)N UGSVSNXPJJDJKL-SDDRHHMPSA-N 0.000 description 2
- GJBUAAAIZSRCDC-GVXVVHGQSA-N Glu-Leu-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O GJBUAAAIZSRCDC-GVXVVHGQSA-N 0.000 description 2
- MIIGESVJEBDJMP-FHWLQOOXSA-N Glu-Phe-Tyr Chemical compound C([C@H](NC(=O)[C@H](CCC(O)=O)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CC=CC=C1 MIIGESVJEBDJMP-FHWLQOOXSA-N 0.000 description 2
- ALMBZBOCGSVSAI-ACZMJKKPSA-N Glu-Ser-Asn Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)N)C(=O)O)N ALMBZBOCGSVSAI-ACZMJKKPSA-N 0.000 description 2
- SYAYROHMAIHWFB-KBIXCLLPSA-N Glu-Ser-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O SYAYROHMAIHWFB-KBIXCLLPSA-N 0.000 description 2
- HAGKYCXGTRUUFI-RYUDHWBXSA-N Glu-Tyr-Gly Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CCC(=O)O)N)O HAGKYCXGTRUUFI-RYUDHWBXSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- BGVYNAQWHSTTSP-BYULHYEWSA-N Gly-Asn-Ile Chemical compound [H]NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O BGVYNAQWHSTTSP-BYULHYEWSA-N 0.000 description 2
- LXXANCRPFBSSKS-IUCAKERBSA-N Gly-Gln-Leu Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O LXXANCRPFBSSKS-IUCAKERBSA-N 0.000 description 2
- UUWOBINZFGTFMS-UWVGGRQHSA-N Gly-His-Met Chemical compound [H]NCC(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCSC)C(O)=O UUWOBINZFGTFMS-UWVGGRQHSA-N 0.000 description 2
- UHPAZODVFFYEEL-QWRGUYRKSA-N Gly-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)CN UHPAZODVFFYEEL-QWRGUYRKSA-N 0.000 description 2
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- MIIVFRCYJABHTQ-ONGXEEELSA-N Gly-Leu-Val Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O MIIVFRCYJABHTQ-ONGXEEELSA-N 0.000 description 2
- CLNSYANKYVMZNM-UWVGGRQHSA-N Gly-Lys-Arg Chemical compound NCCCC[C@H](NC(=O)CN)C(=O)N[C@H](C(O)=O)CCCN=C(N)N CLNSYANKYVMZNM-UWVGGRQHSA-N 0.000 description 2
- DHNXGWVNLFPOMQ-KBPBESRZSA-N Gly-Phe-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)CN DHNXGWVNLFPOMQ-KBPBESRZSA-N 0.000 description 2
- IEGFSKKANYKBDU-QWHCGFSZSA-N Gly-Phe-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CC=CC=C2)NC(=O)CN)C(=O)O IEGFSKKANYKBDU-QWHCGFSZSA-N 0.000 description 2
- SSFWXSNOKDZNHY-QXEWZRGKSA-N Gly-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)CN SSFWXSNOKDZNHY-QXEWZRGKSA-N 0.000 description 2
- OHUKZZYSJBKFRR-WHFBIAKZSA-N Gly-Ser-Asp Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O OHUKZZYSJBKFRR-WHFBIAKZSA-N 0.000 description 2
- VNNRLUNBJSWZPF-ZKWXMUAHSA-N Gly-Ser-Ile Chemical compound [H]NCC(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNNRLUNBJSWZPF-ZKWXMUAHSA-N 0.000 description 2
- RIYIFUFFFBIOEU-KBPBESRZSA-N Gly-Tyr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=C(O)C=C1 RIYIFUFFFBIOEU-KBPBESRZSA-N 0.000 description 2
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 2
- VOKCBYNCZVSILJ-KKUMJFAQSA-N His-Asn-Tyr Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC2=CN=CN2)N)O VOKCBYNCZVSILJ-KKUMJFAQSA-N 0.000 description 2
- UPGJWSUYENXOPV-HGNGGELXSA-N His-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N UPGJWSUYENXOPV-HGNGGELXSA-N 0.000 description 2
- XMENRVZYPBKBIL-AVGNSLFASA-N His-Glu-His Chemical compound N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O XMENRVZYPBKBIL-AVGNSLFASA-N 0.000 description 2
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 2
- VFBZWZXKCVBTJR-SRVKXCTJSA-N His-Leu-Asp Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC1=CN=CN1)N VFBZWZXKCVBTJR-SRVKXCTJSA-N 0.000 description 2
- AIPUZFXMXAHZKY-QWRGUYRKSA-N His-Leu-Gly Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AIPUZFXMXAHZKY-QWRGUYRKSA-N 0.000 description 2
- KFQDSSNYWKZFOO-LSJOCFKGSA-N His-Val-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KFQDSSNYWKZFOO-LSJOCFKGSA-N 0.000 description 2
- DRKZDEFADVYTLU-AVGNSLFASA-N His-Val-Val Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DRKZDEFADVYTLU-AVGNSLFASA-N 0.000 description 2
- RGSOCXHDOPQREB-ZPFDUUQYSA-N Ile-Asp-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N RGSOCXHDOPQREB-ZPFDUUQYSA-N 0.000 description 2
- NHJKZMDIMMTVCK-QXEWZRGKSA-N Ile-Gly-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N NHJKZMDIMMTVCK-QXEWZRGKSA-N 0.000 description 2
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 2
- IITVUURPOYGCTD-NAKRPEOUSA-N Ile-Pro-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O IITVUURPOYGCTD-NAKRPEOUSA-N 0.000 description 2
- XHBYEMIUENPZLY-GMOBBJLQSA-N Ile-Pro-Asn Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(O)=O XHBYEMIUENPZLY-GMOBBJLQSA-N 0.000 description 2
- CAHCWMVNBZJVAW-NAKRPEOUSA-N Ile-Pro-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(=O)O)N CAHCWMVNBZJVAW-NAKRPEOUSA-N 0.000 description 2
- KTNGVMMGIQWIDV-OSUNSFLBSA-N Ile-Pro-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O KTNGVMMGIQWIDV-OSUNSFLBSA-N 0.000 description 2
- VGSPNSSCMOHRRR-BJDJZHNGSA-N Ile-Ser-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N VGSPNSSCMOHRRR-BJDJZHNGSA-N 0.000 description 2
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- SENJXOPIZNYLHU-UHFFFAOYSA-N L-leucyl-L-arginine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CCCN=C(N)N SENJXOPIZNYLHU-UHFFFAOYSA-N 0.000 description 2
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 2
- MDVZJYGNAGLPGJ-KKUMJFAQSA-N Leu-Asn-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 MDVZJYGNAGLPGJ-KKUMJFAQSA-N 0.000 description 2
- ULXYQAJWJGLCNR-YUMQZZPRSA-N Leu-Asp-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O ULXYQAJWJGLCNR-YUMQZZPRSA-N 0.000 description 2
- JQSXWJXBASFONF-KKUMJFAQSA-N Leu-Asp-Phe Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JQSXWJXBASFONF-KKUMJFAQSA-N 0.000 description 2
- QDSKNVXKLPQNOJ-GVXVVHGQSA-N Leu-Gln-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O QDSKNVXKLPQNOJ-GVXVVHGQSA-N 0.000 description 2
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 2
- VWHGTYCRDRBSFI-ZETCQYMHSA-N Leu-Gly-Gly Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)NCC(O)=O VWHGTYCRDRBSFI-ZETCQYMHSA-N 0.000 description 2
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 2
- HRTRLSRYZZKPCO-BJDJZHNGSA-N Leu-Ile-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O HRTRLSRYZZKPCO-BJDJZHNGSA-N 0.000 description 2
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 2
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 2
- YOKVEHGYYQEQOP-QWRGUYRKSA-N Leu-Leu-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YOKVEHGYYQEQOP-QWRGUYRKSA-N 0.000 description 2
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 2
- ONPJGOIVICHWBW-BZSNNMDCSA-N Leu-Lys-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 ONPJGOIVICHWBW-BZSNNMDCSA-N 0.000 description 2
- DRWMRVFCKKXHCH-BZSNNMDCSA-N Leu-Phe-Leu Chemical compound CC(C)C[C@H]([NH3+])C(=O)N[C@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)CC1=CC=CC=C1 DRWMRVFCKKXHCH-BZSNNMDCSA-N 0.000 description 2
- WMIOEVKKYIMVKI-DCAQKATOSA-N Leu-Pro-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WMIOEVKKYIMVKI-DCAQKATOSA-N 0.000 description 2
- AAKRWBIIGKPOKQ-ONGXEEELSA-N Leu-Val-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O AAKRWBIIGKPOKQ-ONGXEEELSA-N 0.000 description 2
- ALSRJRIWBNENFY-DCAQKATOSA-N Lys-Arg-Asn Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O ALSRJRIWBNENFY-DCAQKATOSA-N 0.000 description 2
- YVMQJGWLHRWMDF-MNXVOIDGSA-N Lys-Gln-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N YVMQJGWLHRWMDF-MNXVOIDGSA-N 0.000 description 2
- RZHLIPMZXOEJTL-AVGNSLFASA-N Lys-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCCN)N RZHLIPMZXOEJTL-AVGNSLFASA-N 0.000 description 2
- QKXZCUCBFPEXNK-KKUMJFAQSA-N Lys-Leu-His Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 QKXZCUCBFPEXNK-KKUMJFAQSA-N 0.000 description 2
- MGKFCQFVPKOWOL-CIUDSAMLSA-N Lys-Ser-Asp Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(=O)O)C(=O)O)N MGKFCQFVPKOWOL-CIUDSAMLSA-N 0.000 description 2
- KDBDVESGGJYVEH-PMVMPFDFSA-N Lys-Trp-Phe Chemical compound C([C@H](NC(=O)[C@H](CC=1C2=CC=CC=C2NC=1)NC(=O)[C@@H](N)CCCCN)C(O)=O)C1=CC=CC=C1 KDBDVESGGJYVEH-PMVMPFDFSA-N 0.000 description 2
- VHGIWFGJIHTASW-FXQIFTODSA-N Met-Ala-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O VHGIWFGJIHTASW-FXQIFTODSA-N 0.000 description 2
- QEVRUYFHWJJUHZ-DCAQKATOSA-N Met-Ala-Leu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(C)C QEVRUYFHWJJUHZ-DCAQKATOSA-N 0.000 description 2
- VTKPSXWRUGCOAC-GUBZILKMSA-N Met-Ala-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCSC VTKPSXWRUGCOAC-GUBZILKMSA-N 0.000 description 2
- ULNXMMYXQKGNPG-LPEHRKFASA-N Met-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCSC)N ULNXMMYXQKGNPG-LPEHRKFASA-N 0.000 description 2
- HZVXPUHLTZRQEL-UWVGGRQHSA-N Met-Leu-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O HZVXPUHLTZRQEL-UWVGGRQHSA-N 0.000 description 2
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 2
- GMMLGMFBYCFCCX-KZVJFYERSA-N Met-Thr-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O GMMLGMFBYCFCCX-KZVJFYERSA-N 0.000 description 2
- WXJLBSXNUHIGSS-OSUNSFLBSA-N Met-Thr-Ile Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O WXJLBSXNUHIGSS-OSUNSFLBSA-N 0.000 description 2
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 2
- 108010002311 N-glycylglutamic acid Proteins 0.000 description 2
- 239000004677 Nylon Substances 0.000 description 2
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 2
- LXVFHIBXOWJTKZ-BZSNNMDCSA-N Phe-Asn-Tyr Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O LXVFHIBXOWJTKZ-BZSNNMDCSA-N 0.000 description 2
- FSPGBMWPNMRWDB-AVGNSLFASA-N Phe-Cys-Gln Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N FSPGBMWPNMRWDB-AVGNSLFASA-N 0.000 description 2
- FIRWJEJVFFGXSH-RYUDHWBXSA-N Phe-Glu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 FIRWJEJVFFGXSH-RYUDHWBXSA-N 0.000 description 2
- KJJROSNFBRWPHS-JYJNAYRXSA-N Phe-Glu-Leu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O KJJROSNFBRWPHS-JYJNAYRXSA-N 0.000 description 2
- HBGFEEQFVBWYJQ-KBPBESRZSA-N Phe-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC1=CC=CC=C1 HBGFEEQFVBWYJQ-KBPBESRZSA-N 0.000 description 2
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 2
- VZFPYFRVHMSSNA-JURCDPSOSA-N Phe-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CC1=CC=CC=C1 VZFPYFRVHMSSNA-JURCDPSOSA-N 0.000 description 2
- YCCUXNNKXDGMAM-KKUMJFAQSA-N Phe-Leu-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YCCUXNNKXDGMAM-KKUMJFAQSA-N 0.000 description 2
- JLLJTMHNXQTMCK-UBHSHLNASA-N Phe-Pro-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CC1=CC=CC=C1 JLLJTMHNXQTMCK-UBHSHLNASA-N 0.000 description 2
- APKRGYLBSCWJJP-FXQIFTODSA-N Pro-Ala-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O APKRGYLBSCWJJP-FXQIFTODSA-N 0.000 description 2
- IWNOFCGBMSFTBC-CIUDSAMLSA-N Pro-Ala-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O IWNOFCGBMSFTBC-CIUDSAMLSA-N 0.000 description 2
- DEDANIDYQAPTFI-IHRRRGAJSA-N Pro-Asp-Tyr Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O DEDANIDYQAPTFI-IHRRRGAJSA-N 0.000 description 2
- DIZLUAZLNDFDPR-CIUDSAMLSA-N Pro-Cys-Gln Chemical compound NC(=O)CC[C@@H](C(O)=O)NC(=O)[C@H](CS)NC(=O)[C@@H]1CCCN1 DIZLUAZLNDFDPR-CIUDSAMLSA-N 0.000 description 2
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 2
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 2
- QUBVFEANYYWBTM-VEVYYDQMSA-N Pro-Thr-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(O)=O QUBVFEANYYWBTM-VEVYYDQMSA-N 0.000 description 2
- YIPFBJGBRCJJJD-FHWLQOOXSA-N Pro-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@@H]3CCCN3 YIPFBJGBRCJJJD-FHWLQOOXSA-N 0.000 description 2
- 108700008625 Reporter Genes Proteins 0.000 description 2
- MWMKFWJYRRGXOR-ZLUOBGJFSA-N Ser-Ala-Asn Chemical compound N[C@H](C(=O)N[C@H](C(=O)N[C@H](C(=O)O)CC(N)=O)C)CO MWMKFWJYRRGXOR-ZLUOBGJFSA-N 0.000 description 2
- BTKUIVBNGBFTTP-WHFBIAKZSA-N Ser-Ala-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)NCC(O)=O BTKUIVBNGBFTTP-WHFBIAKZSA-N 0.000 description 2
- FCRMLGJMPXCAHD-FXQIFTODSA-N Ser-Arg-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(N)=O)C(O)=O FCRMLGJMPXCAHD-FXQIFTODSA-N 0.000 description 2
- GHPQVUYZQQGEDA-BIIVOSGPSA-N Ser-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N)C(=O)O GHPQVUYZQQGEDA-BIIVOSGPSA-N 0.000 description 2
- SWSRFJZZMNLMLY-ZKWXMUAHSA-N Ser-Asp-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O SWSRFJZZMNLMLY-ZKWXMUAHSA-N 0.000 description 2
- OJPHFSOMBZKQKQ-GUBZILKMSA-N Ser-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CO OJPHFSOMBZKQKQ-GUBZILKMSA-N 0.000 description 2
- GWMXFEMMBHOKDX-AVGNSLFASA-N Ser-Gln-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GWMXFEMMBHOKDX-AVGNSLFASA-N 0.000 description 2
- JFWDJFULOLKQFY-QWRGUYRKSA-N Ser-Gly-Phe Chemical compound [H]N[C@@H](CO)C(=O)NCC(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JFWDJFULOLKQFY-QWRGUYRKSA-N 0.000 description 2
- KDGARKCAKHBEDB-NKWVEPMBSA-N Ser-Gly-Pro Chemical compound C1C[C@@H](N(C1)C(=O)CNC(=O)[C@H](CO)N)C(=O)O KDGARKCAKHBEDB-NKWVEPMBSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 2
- QBUWQRKEHJXTOP-DCAQKATOSA-N Ser-His-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O QBUWQRKEHJXTOP-DCAQKATOSA-N 0.000 description 2
- RJHJPZQOMKCSTP-CIUDSAMLSA-N Ser-His-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(O)=O RJHJPZQOMKCSTP-CIUDSAMLSA-N 0.000 description 2
- IFPBAGJBHSNYPR-ZKWXMUAHSA-N Ser-Ile-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O IFPBAGJBHSNYPR-ZKWXMUAHSA-N 0.000 description 2
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 2
- KCNSGAMPBPYUAI-CIUDSAMLSA-N Ser-Leu-Asn Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O KCNSGAMPBPYUAI-CIUDSAMLSA-N 0.000 description 2
- XNCUYZKGQOCOQH-YUMQZZPRSA-N Ser-Leu-Gly Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O XNCUYZKGQOCOQH-YUMQZZPRSA-N 0.000 description 2
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 2
- VZQRNAYURWAEFE-KKUMJFAQSA-N Ser-Leu-Phe Chemical compound OC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 VZQRNAYURWAEFE-KKUMJFAQSA-N 0.000 description 2
- JUTGONBTALQWMK-NAKRPEOUSA-N Ser-Met-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N JUTGONBTALQWMK-NAKRPEOUSA-N 0.000 description 2
- GDUZTEQRAOXYJS-SRVKXCTJSA-N Ser-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CO)N GDUZTEQRAOXYJS-SRVKXCTJSA-N 0.000 description 2
- UGTZYIPOBYXWRW-SRVKXCTJSA-N Ser-Phe-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(O)=O UGTZYIPOBYXWRW-SRVKXCTJSA-N 0.000 description 2
- JURQXQBJKUHGJS-UHFFFAOYSA-N Ser-Ser-Ser-Ser Chemical compound OCC(N)C(=O)NC(CO)C(=O)NC(CO)C(=O)NC(CO)C(O)=O JURQXQBJKUHGJS-UHFFFAOYSA-N 0.000 description 2
- FVFUOQIYDPAIJR-XIRDDKMYSA-N Ser-Trp-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H](CO)N FVFUOQIYDPAIJR-XIRDDKMYSA-N 0.000 description 2
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 2
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 2
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 2
- SKHPKKYKDYULDH-HJGDQZAQSA-N Thr-Asn-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(O)=O SKHPKKYKDYULDH-HJGDQZAQSA-N 0.000 description 2
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 2
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 2
- VOHWDZNIESHTFW-XKBZYTNZSA-N Thr-Glu-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N)O VOHWDZNIESHTFW-XKBZYTNZSA-N 0.000 description 2
- QQWNRERCGGZOKG-WEDXCCLWSA-N Thr-Gly-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(O)=O QQWNRERCGGZOKG-WEDXCCLWSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- YOOAQCZYZHGUAZ-KATARQTJSA-N Thr-Leu-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YOOAQCZYZHGUAZ-KATARQTJSA-N 0.000 description 2
- RVMNUBQWPVOUKH-HEIBUPTGSA-N Thr-Ser-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O RVMNUBQWPVOUKH-HEIBUPTGSA-N 0.000 description 2
- TZQWJCGVCIJDMU-HEIBUPTGSA-N Thr-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CS)C(=O)O)N)O TZQWJCGVCIJDMU-HEIBUPTGSA-N 0.000 description 2
- JAWUQFCGNVEDRN-MEYUZBJRSA-N Thr-Tyr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CC(C)C)C(=O)O)N)O JAWUQFCGNVEDRN-MEYUZBJRSA-N 0.000 description 2
- RERRMBXDSFMBQE-ZFWWWQNUSA-N Trp-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N RERRMBXDSFMBQE-ZFWWWQNUSA-N 0.000 description 2
- BGWSLEYVITZIQP-DCPHZVHLSA-N Trp-Phe-Ala Chemical compound C[C@H](NC(=O)[C@H](Cc1ccccc1)NC(=O)[C@@H](N)Cc1c[nH]c2ccccc12)C(O)=O BGWSLEYVITZIQP-DCPHZVHLSA-N 0.000 description 2
- AYHSJESDFKREAR-KKUMJFAQSA-N Tyr-Asn-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 AYHSJESDFKREAR-KKUMJFAQSA-N 0.000 description 2
- UNUZEBFXGWVAOP-DZKIICNBSA-N Tyr-Glu-Val Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O UNUZEBFXGWVAOP-DZKIICNBSA-N 0.000 description 2
- FNWGDMZVYBVAGJ-XEGUGMAKSA-N Tyr-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CC1=CC=C(C=C1)O)N FNWGDMZVYBVAGJ-XEGUGMAKSA-N 0.000 description 2
- GULIUBBXCYPDJU-CQDKDKBSSA-N Tyr-Leu-Ala Chemical compound [O-]C(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CC1=CC=C(O)C=C1 GULIUBBXCYPDJU-CQDKDKBSSA-N 0.000 description 2
- GYBVHTWOQJMYAM-HRCADAONSA-N Tyr-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC2=CC=C(C=C2)O)N GYBVHTWOQJMYAM-HRCADAONSA-N 0.000 description 2
- HURRXSNHCCSJHA-AUTRQRHGSA-N Val-Gln-Gln Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N HURRXSNHCCSJHA-AUTRQRHGSA-N 0.000 description 2
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 2
- BVWPHWLFGRCECJ-JSGCOSHPSA-N Val-Gly-Tyr Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N BVWPHWLFGRCECJ-JSGCOSHPSA-N 0.000 description 2
- XTDDIVQWDXMRJL-IHRRRGAJSA-N Val-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C(C)C)N XTDDIVQWDXMRJL-IHRRRGAJSA-N 0.000 description 2
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 2
- ZHQWPWQNVRCXAX-XQQFMLRXSA-N Val-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZHQWPWQNVRCXAX-XQQFMLRXSA-N 0.000 description 2
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 2
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 2
- HPOSMQWRPMRMFO-GUBZILKMSA-N Val-Pro-Cys Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CS)C(=O)O)N HPOSMQWRPMRMFO-GUBZILKMSA-N 0.000 description 2
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 2
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 2
- KSFXWENSJABBFI-ZKWXMUAHSA-N Val-Ser-Asn Chemical compound [H]N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(O)=O KSFXWENSJABBFI-ZKWXMUAHSA-N 0.000 description 2
- JQTYTBPCSOAZHI-FXQIFTODSA-N Val-Ser-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CS)C(=O)O)N JQTYTBPCSOAZHI-FXQIFTODSA-N 0.000 description 2
- YQYFYUSYEDNLSD-YEPSODPASA-N Val-Thr-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O YQYFYUSYEDNLSD-YEPSODPASA-N 0.000 description 2
- WFTKOJGOOUJLJV-VKOGCVSHSA-N Val-Trp-Ile Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O)NC(=O)[C@@H]([NH3+])C(C)C)=CNC2=C1 WFTKOJGOOUJLJV-VKOGCVSHSA-N 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010008355 arginyl-glutamine Proteins 0.000 description 2
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 2
- 108010068380 arginylarginine Proteins 0.000 description 2
- 108010062796 arginyllysine Proteins 0.000 description 2
- 108010060035 arginylproline Proteins 0.000 description 2
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 2
- 108010092854 aspartyllysine Proteins 0.000 description 2
- 239000003153 chemical reaction reagent Substances 0.000 description 2
- 238000010835 comparative analysis Methods 0.000 description 2
- 238000010276 construction Methods 0.000 description 2
- 230000029087 digestion Effects 0.000 description 2
- 108010054813 diprotin B Proteins 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000001962 electrophoresis Methods 0.000 description 2
- 239000000499 gel Substances 0.000 description 2
- 108010074027 glycyl-seryl-phenylalanine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 101150054900 gus gene Proteins 0.000 description 2
- 108010040030 histidinoalanine Proteins 0.000 description 2
- 108010036413 histidylglycine Proteins 0.000 description 2
- 108010025306 histidylleucine Proteins 0.000 description 2
- 108010092114 histidylphenylalanine Proteins 0.000 description 2
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 2
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 2
- 108010047926 leucyl-lysyl-tyrosine Proteins 0.000 description 2
- 108010073472 leucyl-prolyl-proline Proteins 0.000 description 2
- 108010000761 leucylarginine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 108010017391 lysylvaline Proteins 0.000 description 2
- 239000012528 membrane Substances 0.000 description 2
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 2
- 229920001778 nylon Polymers 0.000 description 2
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010031719 prolyl-serine Proteins 0.000 description 2
- 108020003175 receptors Proteins 0.000 description 2
- 102000005962 receptors Human genes 0.000 description 2
- 108010048397 seryl-lysyl-leucine Proteins 0.000 description 2
- 238000001228 spectrum Methods 0.000 description 2
- 238000012546 transfer Methods 0.000 description 2
- 108700004896 tripeptide FEG Proteins 0.000 description 2
- 108010035534 tyrosyl-leucyl-alanine Proteins 0.000 description 2
- 108010078580 tyrosylleucine Proteins 0.000 description 2
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 2
- 108020005345 3' Untranslated Regions Proteins 0.000 description 1
- 108020003589 5' Untranslated Regions Proteins 0.000 description 1
- 241000589158 Agrobacterium Species 0.000 description 1
- KVWLTGNCJYDJET-LSJOCFKGSA-N Ala-Arg-His Chemical compound C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KVWLTGNCJYDJET-LSJOCFKGSA-N 0.000 description 1
- MQIGTEQXYCRLGK-BQBZGAKWSA-N Ala-Gly-Pro Chemical compound C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O MQIGTEQXYCRLGK-BQBZGAKWSA-N 0.000 description 1
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- ANNKVZSFQJGVDY-XUXIUFHCSA-N Ala-Val-Pro-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 ANNKVZSFQJGVDY-XUXIUFHCSA-N 0.000 description 1
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 1
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 1
- GIVWETPOBCRTND-DCAQKATOSA-N Arg-Gln-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O GIVWETPOBCRTND-DCAQKATOSA-N 0.000 description 1
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 1
- BTJVOUQWFXABOI-IHRRRGAJSA-N Arg-Lys-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@@H](N)CCCNC(N)=N BTJVOUQWFXABOI-IHRRRGAJSA-N 0.000 description 1
- XYOVHPDDWCEUDY-CIUDSAMLSA-N Asn-Ala-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O XYOVHPDDWCEUDY-CIUDSAMLSA-N 0.000 description 1
- APHUDFFMXFYRKP-CIUDSAMLSA-N Asn-Asn-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)N)N APHUDFFMXFYRKP-CIUDSAMLSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- SPCONPVIDFMDJI-QSFUFRPTSA-N Asn-Ile-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O SPCONPVIDFMDJI-QSFUFRPTSA-N 0.000 description 1
- ORJQQZIXTOYGGH-SRVKXCTJSA-N Asn-Lys-Leu Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ORJQQZIXTOYGGH-SRVKXCTJSA-N 0.000 description 1
- KEUNWIXNKVWCFL-FXQIFTODSA-N Asn-Met-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CO)C(O)=O KEUNWIXNKVWCFL-FXQIFTODSA-N 0.000 description 1
- HPBNLFLSSQDFQW-WHFBIAKZSA-N Asn-Ser-Gly Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O HPBNLFLSSQDFQW-WHFBIAKZSA-N 0.000 description 1
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 1
- VLDRQOHCMKCXLY-SRVKXCTJSA-N Asn-Ser-Phe Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O VLDRQOHCMKCXLY-SRVKXCTJSA-N 0.000 description 1
- SNYCNNPOFYBCEK-ZLUOBGJFSA-N Asn-Ser-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O SNYCNNPOFYBCEK-ZLUOBGJFSA-N 0.000 description 1
- ZAESWDKAMDVHLL-RCOVLWMOSA-N Asn-Val-Gly Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O ZAESWDKAMDVHLL-RCOVLWMOSA-N 0.000 description 1
- HPNDBHLITCHRSO-WHFBIAKZSA-N Asp-Ala-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)NCC(O)=O HPNDBHLITCHRSO-WHFBIAKZSA-N 0.000 description 1
- ATYWBXGNXZYZGI-ACZMJKKPSA-N Asp-Asn-Gln Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O ATYWBXGNXZYZGI-ACZMJKKPSA-N 0.000 description 1
- KNMRXHIAVXHCLW-ZLUOBGJFSA-N Asp-Asn-Ser Chemical compound C([C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N)C(=O)O KNMRXHIAVXHCLW-ZLUOBGJFSA-N 0.000 description 1
- SVFOIXMRMLROHO-SRVKXCTJSA-N Asp-Asp-Phe Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SVFOIXMRMLROHO-SRVKXCTJSA-N 0.000 description 1
- AYFVRYXNDHBECD-YUMQZZPRSA-N Asp-Leu-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O AYFVRYXNDHBECD-YUMQZZPRSA-N 0.000 description 1
- QJHOOKBAHRJPPX-QWRGUYRKSA-N Asp-Phe-Gly Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 QJHOOKBAHRJPPX-QWRGUYRKSA-N 0.000 description 1
- RPUYTJJZXQBWDT-SRVKXCTJSA-N Asp-Phe-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CC(=O)O)N RPUYTJJZXQBWDT-SRVKXCTJSA-N 0.000 description 1
- ZQFRDAZBTSFGGW-SRVKXCTJSA-N Asp-Ser-Phe Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O ZQFRDAZBTSFGGW-SRVKXCTJSA-N 0.000 description 1
- BYLPQJAWXJWUCJ-YDHLFZDLSA-N Asp-Tyr-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O BYLPQJAWXJWUCJ-YDHLFZDLSA-N 0.000 description 1
- 108020000946 Bacterial DNA Proteins 0.000 description 1
- 208000035143 Bacterial infection Diseases 0.000 description 1
- JXVFJOMFOLFPMP-KKUMJFAQSA-N Cys-Leu-Tyr Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O JXVFJOMFOLFPMP-KKUMJFAQSA-N 0.000 description 1
- 108020003215 DNA Probes Proteins 0.000 description 1
- 238000001712 DNA sequencing Methods 0.000 description 1
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 1
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 1
- 241000238557 Decapoda Species 0.000 description 1
- 102100031780 Endonuclease Human genes 0.000 description 1
- 241001302160 Escherichia coli str. K-12 substr. DH10B Species 0.000 description 1
- 108700024394 Exon Proteins 0.000 description 1
- 241000233866 Fungi Species 0.000 description 1
- 101000606500 Gallus gallus Inactive tyrosine-protein kinase 7 Proteins 0.000 description 1
- LJEPDHWNQXPXMM-NHCYSSNCSA-N Gln-Arg-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(O)=O LJEPDHWNQXPXMM-NHCYSSNCSA-N 0.000 description 1
- AQPZYBSRDRZBAG-AVGNSLFASA-N Gln-Phe-Asn Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCC(=O)N)N AQPZYBSRDRZBAG-AVGNSLFASA-N 0.000 description 1
- OAGVHWYIBZMWLA-YFKPBYRVSA-N Glu-Gly-Gly Chemical compound OC(=O)CC[C@H](N)C(=O)NCC(=O)NCC(O)=O OAGVHWYIBZMWLA-YFKPBYRVSA-N 0.000 description 1
- QXDXIXFSFHUYAX-MNXVOIDGSA-N Glu-Ile-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H](N)CCC(O)=O QXDXIXFSFHUYAX-MNXVOIDGSA-N 0.000 description 1
- 102000053187 Glucuronidase Human genes 0.000 description 1
- 108010060309 Glucuronidase Proteins 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- XCLCVBYNGXEVDU-WHFBIAKZSA-N Gly-Asn-Ser Chemical compound NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O XCLCVBYNGXEVDU-WHFBIAKZSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- SWQALSGKVLYKDT-UHFFFAOYSA-N Gly-Ile-Ala Natural products NCC(=O)NC(C(C)CC)C(=O)NC(C)C(O)=O SWQALSGKVLYKDT-UHFFFAOYSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 1
- YGHSQRJSHKYUJY-SCZZXKLOSA-N Gly-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN YGHSQRJSHKYUJY-SCZZXKLOSA-N 0.000 description 1
- MAABHGXCIBEYQR-XVYDVKMFSA-N His-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC1=CN=CN1)N MAABHGXCIBEYQR-XVYDVKMFSA-N 0.000 description 1
- HRGGKHFHRSFSDE-CIUDSAMLSA-N His-Asn-Ser Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CO)C(=O)O)N HRGGKHFHRSFSDE-CIUDSAMLSA-N 0.000 description 1
- LCNNHVQNFNJLGK-AVGNSLFASA-N His-Gln-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N LCNNHVQNFNJLGK-AVGNSLFASA-N 0.000 description 1
- OEROYDLRVAYIMQ-YUMQZZPRSA-N His-Gly-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O OEROYDLRVAYIMQ-YUMQZZPRSA-N 0.000 description 1
- PZAJPILZRFPYJJ-SRVKXCTJSA-N His-Ser-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O PZAJPILZRFPYJJ-SRVKXCTJSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- KIAOPHMUNPPGEN-PEXQALLHSA-N Ile-Gly-His Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N KIAOPHMUNPPGEN-PEXQALLHSA-N 0.000 description 1
- DFFTXLCCDFYRKD-MBLNEYKQSA-N Ile-Gly-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N DFFTXLCCDFYRKD-MBLNEYKQSA-N 0.000 description 1
- KYLIZSDYWQQTFM-PEDHHIEDSA-N Ile-Ile-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N KYLIZSDYWQQTFM-PEDHHIEDSA-N 0.000 description 1
- BBQABUDWDUKJMB-LZXPERKUSA-N Ile-Ile-Ile Chemical compound CC[C@H](C)[C@H]([NH3+])C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C([O-])=O BBQABUDWDUKJMB-LZXPERKUSA-N 0.000 description 1
- OUUCIIJSBIBCHB-ZPFDUUQYSA-N Ile-Leu-Asp Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O OUUCIIJSBIBCHB-ZPFDUUQYSA-N 0.000 description 1
- HPCFRQWLTRDGHT-AJNGGQMLSA-N Ile-Leu-Leu Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O HPCFRQWLTRDGHT-AJNGGQMLSA-N 0.000 description 1
- FFAUOCITXBMRBT-YTFOTSKYSA-N Ile-Lys-Ile Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FFAUOCITXBMRBT-YTFOTSKYSA-N 0.000 description 1
- SHVFUCSSACPBTF-VGDYDELISA-N Ile-Ser-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N SHVFUCSSACPBTF-VGDYDELISA-N 0.000 description 1
- NJGXXYLPDMMFJB-XUXIUFHCSA-N Ile-Val-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCCN)C(=O)O)N NJGXXYLPDMMFJB-XUXIUFHCSA-N 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 1
- 108091026898 Leader sequence (mRNA) Proteins 0.000 description 1
- NTRAGDHVSGKUSF-AVGNSLFASA-N Leu-Arg-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O NTRAGDHVSGKUSF-AVGNSLFASA-N 0.000 description 1
- FIJMQLGQLBLBOL-HJGDQZAQSA-N Leu-Asn-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O FIJMQLGQLBLBOL-HJGDQZAQSA-N 0.000 description 1
Landscapes
- Breeding Of Plants And Reproduction By Means Of Culturing (AREA)
Abstract
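The present invention relates to the technical field of plant genetic engineering. Specifically, it relates to the isolation, cloning and functional verification of a DNA fragment containing Xa3/Xa26-3, a major gene for bacterial blight resistance from the small-grain wild rice Oryza minuta. The Xa3/Xa26-3 gene encodes a leucine-rich repeat (LRR) protein kinase-like protein. It confers on rice resistance to the disease caused by the bacterial pathogen Xanthomonas oryzae pv. oryzae. When the fragment is transferred directly into rice together with its own regulatory sequences, transgenic rice carrying Xa3/Xa26-3 shows markedly enhanced resistance to bacterial blight.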
Description
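Technical Field
The present invention relates to the technical field of plant genetic engineering. Specifically, it relates to the isolation, cloning, functional verification and application of Xa3/Xa26-3, a major gene for resistance to rice bacterial blight. The DNA fragment described confers on rice resistance to the disease caused by the bacterial blight pathogen. When the fragment is transferred directly into a plant together with its endogenous regulatory sequences, the transgenic rice mounts a defense response against the bacterial blight pathogen that is mediated by this gene.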
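Background Art
During growth, plants are attacked by many kinds of pathogens. Plant pathogens are diverse and include viruses, bacteria, fungi and nematodes. Pathogen invasion has one of two outcomes: (1) the pathogen multiplies successfully in the host plant and causes the associated disease symptoms; or (2) the host plant mounts a resistance response that kills the pathogen or arrests its growth. Using resistance gene resources to improve plant disease resistance is the fundamental way to prevent disease while also protecting the environment.
The plant disease-resistance response is a complex process regulated by multiple genes. The genes involved fall into two classes: (1) disease resistance (major) genes, also called R (resistance) genes, and (2) disease resistance-related genes.
According to the current understanding of resistance gene function, the products of resistance genes act mainly as receptors that interact directly or indirectly with pathogen proteins and activate disease-resistance signal transduction pathways in the plant (Chisholm et al., 2006). Resistance mediated by R genes is strong, which makes these genes valuable genetic resources. However, the resistance gene resources available in cultivated crop species are limited. Mining new resistance genes from closely related species for the genetic improvement of crops has long been an important research goal.
Rice is one of the world's major food crops, but disease often reduces its yield and quality. Rice bacterial blight, caused by Xanthomonas oryzae pv. oryzae, is one of the most damaging bacterial diseases of rice worldwide (Guo, 1995). Resistance gene resources in cultivated rice (Oryza sativa L.) are very limited; for example, only about 30 bacterial blight resistance genes are currently known (Chu and Wang, 2007). Because wild rice has long grown in the wild and has been subjected to natural selection by a variety of adverse environments and disasters, it has developed extremely rich genetic diversity. Wild rice harbors valuable resistance gene resources and is an important reservoir for the discovery of new useful genes. Several of the approximately 30 known bacterial blight resistance genes were introduced into cultivated rice from wild rice. For example, the dominant bacterial blight resistance gene Xa21 originates from the West African wild rice Oryza longistaminata and confers a high level of resistance to most strains of the pathogen; however, its resistance is of the adult-plant type, that is, full resistance is expressed only at the adult stage (Khush et al., 1990; Wang et al., 1996). Xa21 has been isolated and cloned (Song et al., 1995). Xa23, a dominant bacterial blight resistance gene that has not yet been isolated and cloned, originates from common wild rice (Oryza rufipogon) and confers broad-spectrum resistance (Zhang et al., 2000). Another bacterial blight resistance gene that has been cloned, Xa27, comes from the tetraploid small-grain wild rice Oryza minuta and confers resistance to Philippine races 2, 3, 5 and 6 of the pathogen (Amante-Bordeos et al., 1992; Gu et al., 2005). In addition, the bacterial blight resistance genes Xa29(t) and Xa30(t), which have not yet been cloned, were identified from the medicinal wild rice Oryza officinalis and from common wild rice, respectively (Tan et al., 2004; Wang et al., 2004).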
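Summary of the Invention
The object of the present invention is to isolate and clone the bacterial blight resistance gene Xa3/Xa26-3 carried by the small-grain wild rice Oryza minuta, together with a DNA fragment containing the promoter that regulates this gene, and to use this gene to improve the ability of rice varieties or other plants to resist disease.
The present invention relates to the isolation and use of a DNA fragment containing the Xa3/Xa26-3 gene. Its nucleotide sequence is shown in SEQ ID NO:1 of the sequence listing, and its coding sequence corresponds to positions 6072-8979 and 9084-9451 of SEQ ID NO:1. This gene (fragment) enables rice to mount a specific resistance response to the disease caused by Xanthomonas oryzae pv. oryzae. The invention is applicable to all plants susceptible to this pathogen, including both monocotyledonous and dicotyledonous plants. In addition to the DNA fragment shown in SEQ ID NO:1, the gene as defined in the present invention also includes DNA sequences substantially equivalent to SEQ ID NO:1 and subfragments that are functionally equivalent to the sequence shown in SEQ ID NO:1. Bioinformatic analysis of SEQ ID NO:1 shows that the DNA sequence encodes a receptor protein kinase comprising an extracellular leucine-rich repeat (LRR) domain and an intracellular protein kinase domain.
The cloned Xa3/Xa26-3 gene can be used as a probe to screen cDNA and genomic libraries for the gene of the present invention or for homologous genes. Likewise, the polymerase chain reaction (PCR) can be used to amplify the Xa3/Xa26-3 gene of the present invention, or any DNA segment of interest or segment homologous to it, from genomic DNA, mRNA or cDNA. With these techniques, a sequence containing the whole Xa3/Xa26-3 gene or a part of it can be isolated, ligated into a suitable vector and transferred into plant cells, where Xa3/Xa26-3 is expressed to produce disease-resistant transgenic plants. Creating disease-resistant plants by such transgenic techniques is something conventional breeding cannot achieve.
Transferring a cloned resistance gene into a susceptible plant helps to produce new disease-resistant plants. In particular, genetic transformation can be used to pyramid several resistance genes in one plant without introducing into the variety under improvement the linkage drag that accompanies conventional breeding. Cloned resistance genes also overcome the inability of conventional breeding to transfer resistance genes between plant species. The present invention further provides, or makes use of, disease-resistant transgenic plants obtained with the above DNA fragment and the corresponding seeds; the gene of the present invention can also be transferred into other plants by sexual crossing.
The present invention provides a new method for enhancing the resistance of rice to bacterial blight. The method comprises ligating the Xa3/Xa26-3 gene from wild rice, together with its regulatory DNA sequences, into a genetic transformation vector and transferring it into susceptible rice, thereby improving the resistance of the rice to bacterial blight.
A more detailed technical solution is given in the Detailed Description of the Embodiments.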
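Brief Description of the Drawings
SEQ ID NO:1 of the sequence listing is the DNA sequence of the Xa3/Xa26-3 gene cloned in the present invention.
Fig. 1. Flow chart for identifying, isolating and cloning the rice bacterial blight resistance gene Xa3/Xa26-3 and for verifying the function of Xa3/Xa26-3.
Fig. 2. Structures of the Xa3/Xa26 and MRKa genes in the indica rice variety Minghui 63, and positions of the PCR primers (arrows) used to prepare the DNA probes.
Fig. 3. Hybridization of the Xa3/Xa26 gene-fragment DNA probe to the seven BAC clones.
Fig. 4. Distribution and sequence homology of Xa3/Xa26 gene family members in the indica rice variety Minghui 63 and in Oryza minuta. Each arrow represents one Xa3/Xa26 family member and its direction of transcription. A region of about 1.4 kb downstream of the Xa3/Xa26 and Xa3/Xa26-3 genes shares 83% nucleotide sequence homology. The 13.7-kb segment is the DNA fragment used for genetic transformation.
Fig. 5. Structure of the genetic transformation vector pCAMBIA1301.
Fig. 6. Co-segregation of the reporter gene GUS with the resistance phenotype in T1 transformed plants, indicating that the Xa3/Xa26-3 gene, which is tightly linked to GUS, co-segregates with the resistance phenotype. The control is the susceptible rice variety Mudanjiang 8 (the recipient for genetic transformation). Lesion area was recorded two weeks after inoculation with the bacterial blight strain PXO61.
Fig. 7. Compared with the control Mudanjiang 8, transformed plants carrying Xa3/Xa26-3 (D103OM26) show enhanced resistance to different bacterial blight strains.
Fig. 8. Structure of the Xa3/Xa26-3 gene. Arrows indicate the positions and orientations of the PCR primers used for gene structure analysis. "ATG" and "TGA" are the translation start and stop codons, respectively. Numbers indicate the number of nucleotides in each structural element.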
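Detailed Description of the Embodiments
Earlier results from the inventors showed that Xa3/Xa26, a dominant bacterial blight resistance gene from the indica rice (Oryza sativa L. ssp. indica) variety Minghui 63, belongs to a multigene family whose members are arranged in tandem on the long arm of rice chromosome 11 (Yang et al., 2003; Sun et al., 2004, 2006; Xiang et al., 2006). In Minghui 63 the Xa3/Xa26 family consists of four members (Xa3/Xa26, MRKa, MRKc and MRKd), whose coding regions share 60% to 78% DNA sequence homology (Sun et al., 2004). Xa3/Xa26 encodes a leucine-rich repeat (LRR) protein kinase-like protein (Sun et al., 2004). On the basis of sequence homology between the target gene of the present invention and Xa3/Xa26, and of its position relative to the other members of the Xa3/Xa26 family in Oryza minuta, the gene was determined to be another allele of Xa3/Xa26 and was therefore named Xa3/Xa26-3 (with Xa3/Xa26 itself regarded as Xa3/Xa26-1).
The invention is further defined in the following examples. Fig. 1 outlines the procedure for identifying, isolating and cloning Xa3/Xa26-3 and for verifying its function. From the description below and from these examples, a person skilled in the art can ascertain the essential characteristics of the invention and, without departing from its spirit and scope, can make various changes and modifications to adapt it to various uses and conditions.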
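Example 1: Isolation and cloning of the Xa3/Xa26-3 gene from Oryza minuta and analysis of its gene structure
1. Identification of large DNA fragments carrying sequences homologous to Xa3/Xa26
The inventors first used the Xa3/Xa26-specific PCR primers RKb-3’race2 (5’-TGGTCAAATACCGGAAGGAG-3’) and RKb-2R (5’-CAGTCCACCACATGGACAAG-3’), and the primers RKa-11L (5’-TTGGCTTGAACGGCTTAACT-3’) and RKa-1R (5’-AAGATGAAATATGCTCGGTGGT-3’) specific for MRKa, a member of the Xa3/Xa26 family, to amplify DNA fragments of Xa3/Xa26 and MRKa, each about 1 kb long, from the rice variety Minghui 63 (a variety widely grown in China) (Fig. 2). The two PCR products were mixed and used as a probe to screen a genomic BAC (bacterial artificial chromosome) library of Oryza minuta (Ammiraju et al., 2006), and seven positive BAC clones were identified. These seven positive BAC clones (Ammiraju et al., 2006) were kindly provided by Professor Rod Wing of the University of Arizona, USA. The BAC clones were grown up in culture, and BAC plasmid DNA was isolated by the standard alkaline lysis method (Sambrook and Russell, 2001). The BAC plasmids were digested to completion with HindIII, separated by electrophoresis and transferred to a nylon membrane; the membrane carrying the BAC plasmid DNA was then hybridized separately with the Xa3/Xa26 fragment probe and the MRKa fragment probe described above (Sambrook and Russell, 2001). Hybridization with the Xa3/Xa26 fragment probe showed that BAC clone OMBa0293H21 had the same banding pattern as three other BAC clones and that these clones gave the largest number of hybridizing bands (Fig. 3), suggesting that they might span the Xa3/Xa26 gene family region. OMBa0293H21 was therefore selected for sequencing.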
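As a quick sanity check on the four probe primers listed above, the following Python sketch reports the length, GC content and reverse complement of each one. The sequences are copied from the text; the helper names (`gc_percent`, `reverse_complement`) are illustrative only and are not part of the method described here.

```python
# Primer sequences copied from the description (5' -> 3').
PRIMERS = {
    "RKb-3'race2": "TGGTCAAATACCGGAAGGAG",
    "RKb-2R": "CAGTCCACCACATGGACAAG",
    "RKa-11L": "TTGGCTTGAACGGCTTAACT",
    "RKa-1R": "AAGATGAAATATGCTCGGTGGT",
}

# Translation table that swaps each base for its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an uppercase DNA string."""
    return 100.0 * sum(seq.count(base) for base in "GC") / len(seq)


def reverse_complement(seq: str) -> str:
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(f"{name}: {len(seq)} nt, GC {gc_percent(seq):.1f}%, "
              f"reverse complement {reverse_complement(seq)}")
```

A check of this kind only confirms that the primers were transcribed without obvious errors; it says nothing about their specificity on the Minghui 63 genome.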
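2. Construction of a shotgun library from the BAC clone
The inventors first constructed a shotgun library of clone OMBa0293H21 for sequencing, using the sonication method (Sambrook and Russell, 2001). The circular plasmid of clone OMBa0293H21 was sonicated for 1.6 seconds (Soniprep 150 ultrasonic disintegrator, SANYO; operated according to the instrument manual). DNA fragments of about 2.0-3.5 kb were separated by electrophoresis on a 1% agarose gel and purified from the gel with the UNIQ-10 column DNA gel recovery kit (Shanghai Sangon Biological Engineering Co., Ltd., China). The purified DNA was blunt-ended with T4 DNA polymerase, blunt-end ligated into SmaI-digested, dephosphorylated pUC19 vector (Amersham Bioscience, USA) and electroporated into Escherichia coli DH10B (Invitrogen, USA) (Eppendorf electroporator at 1800 V; operated according to the instrument manual). Colonies were picked and checked for insert size: either plasmids from positive clones were extracted and compared with empty pUC19 by electrophoresis to discard false positives and clones with small inserts, or the recombinant plasmids were double-digested with EcoRI and HindIII to determine insert size. Clones with inserts of suitable size were selected to form the shotgun library used for sequencing.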
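3. Sequencing of the shotgun library
Clones picked at random from the shotgun library were sequenced from both ends by the dideoxynucleotide chain-termination method, using the universal primers M13-F (5′-GTAAAACGACGGCCAGT-3′) and M13-R (5′-CAGGAAACAGCTATGAC-3′) (Shanghai Sangon Biological Engineering Co., Ltd., China) and the BigDye sequencing kit (Perkin Elmer, USA) according to the kit instructions.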
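4. Assembly of the raw sequences
Sequences were assembled with Sequencher 4.5 (Gene Codes Corporation, USA). Sequencher 4.5 was used to trim low-quality end sequence and pUC19 vector sequence automatically; any such sequence that the software failed to remove was deleted by hand. Fragments of BAC vector sequence and contaminating bacterial DNA sequence were removed by comparison with the BAC sequence or by BLAST analysis (Altschul et al., 1997). The parameters used in Sequencher 4.5 to join two sequences were an overlap length (Min Overlap) of more than 20 bp (base pairs) and an overlap identity (Min Match) of more than 85%. Every base call was determined with reference to the multiple shotgun reads overlapping that position. Regions covered by only one shotgun read were sequenced again to ensure base accuracy.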
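The two joining thresholds stated above (overlap longer than 20 bp, identity above 85%) can be illustrated with a minimal Python sketch. This is not the algorithm of the assembly software named in the text, whose internals are not described here; it is a simplified, ungapped suffix-to-prefix comparison, and the function names are hypothetical.

```python
from typing import Optional, Tuple


def overlap_identity(left: str, right: str, length: int) -> float:
    """Fraction of matching bases when the last `length` bases of `left`
    are laid over the first `length` bases of `right` (no gaps allowed)."""
    tail, head = left[-length:], right[:length]
    matches = sum(1 for a, b in zip(tail, head) if a == b)
    return matches / length


def best_overlap(left: str, right: str,
                 min_overlap: int = 20,
                 min_match: float = 0.85) -> Optional[Tuple[int, float]]:
    """Return (length, identity) of the longest suffix/prefix overlap that is
    longer than `min_overlap` and more identical than `min_match`, else None."""
    longest = min(len(left), len(right))
    # Try the longest candidate first; stop before reaching min_overlap,
    # because the text requires the overlap to exceed 20 bp.
    for length in range(longest, min_overlap, -1):
        identity = overlap_identity(left, right, length)
        if identity > min_match:
            return length, identity
    return None


if __name__ == "__main__":
    shared = "ACGTACGTTGCATGCAAGCTTGGCA"   # 25-bp region shared by both reads
    read_a = "A" * 10 + shared
    read_b = shared + "T" * 10
    print(best_overlap(read_a, read_b))
```

Real reads contain insertions and deletions, so a production assembler aligns with gaps; the sketch only shows how the two thresholds act as a joint filter.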
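5. DNA sequence analysis
The sequences obtained were first analyzed with Blastn and Blastx (Altschul et al., 1997). Gene structure was then predicted with Fgenesh (http://www.softberry.ru/berry.phtml) (Salamov and Solovyev, 2000) and GeneMark.hmm (http://exon.gatech.edu/GeneMark/eukhmm.cgi) (Besemer and Borodovsky, 2005), and the predictions were further confirmed with Blastp (Altschul et al., 1997). Two or more nucleotide or amino acid sequences were compared with BLAST 2 Sequences (Tatusova and Madden, 1999) and ClustalW (http://www.ebi.ac.uk).
Sequencing and assembly of BAC clone OMBa0293H21 yielded nine large contigs of 71.3 kb, 24.9 kb, 17.6 kb, 11.1 kb, 8.1 kb, 6.6 kb, 4.6 kb, 4.1 kb and 1.1 kb. Sequence analysis identified five genes on the 71.3-kb contig that encode LRR protein kinase-like proteins with varying degrees of homology to the product of the rice bacterial blight resistance gene Xa3/Xa26; they were named OmRKa, OmRKb1, OmRKb, OmRKg and OmRKc (Fig. 4). Analysis with Fgenesh, GeneMark.hmm and Blastx indicated that OmRKa and OmRKb are two intact genes, OmRKb1 and OmRKg are incomplete genes, and OmRKc is a pseudogene (Fig. 4). OmRKa and OmRKb are transcribed in the same direction, whereas OmRKc is transcribed in the opposite direction (Fig. 4).
This region was compared with the Xa3/Xa26 gene family region of the rice variety Minghui 63. OmRKa of Oryza minuta shares 80% homology with MRKa of Minghui 63, OmRKc shares 83% homology with MRKc, and OmRKb shares as much as 95% homology with Xa3/Xa26 (also called MRKb) of Minghui 63. In addition, a region of about 1.4 kb downstream of OmRKb and MRKb shares 83% homology. OmRKb is therefore an allele of Xa3/Xa26, and we named it Xa3/Xa26-3 (Fig. 4).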
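Example 2: Functional verification of the Xa3/Xa26-3 gene
1. Construction of the genetic transformation vector
The vector used in the present invention is pCAMBIA1301 (Fig. 5), a commonly used vector for genetic transformation of rice (Sun et al., 2004). BAC clone OMBa0293H21 was digested with the restriction enzyme SmaI, and a 13.7-kb fragment containing the coding region, promoter and downstream (tail) sequence of Xa3/Xa26-3 was recovered (Fig. 4). In parallel, the transformation vector pCAMBIA1301 was digested with SmaI, dephosphorylated with SAP (shrimp alkaline phosphatase), and extracted and purified with chloroform:isoamyl alcohol (24:1, v/v). The recovered fragment containing Xa3/Xa26-3 was ligated to the purified vector. Positive clones were verified by restriction digestion, and the resulting recombinant plasmid was named D103O.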
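2. Genetic transformation and analysis of T0 transformed plants
D103O was introduced into the susceptible rice variety Mudanjiang 8 (Oryza sativa L. ssp. japonica) by Agrobacterium-mediated transformation (Lin and Zhang, 2005). The transformed plants obtained were named D103OM (the first part of the name, D103O, is the name of the transformation construct, and M stands for the rice variety Mudanjiang 8). A total of 19 independent positive transformants were obtained, and all of them were inoculated with the bacterial blight strain PXO61. Plants were inoculated at the adult stage by the leaf-clipping method (Sun et al., 2004). Strain PXO61 was kindly provided by the International Rice Research Institute, Philippines (Sun et al., 2004; Wu et al., 2008). The bacteria were cultured by a published method (Sun et al., 2004). Lesion area (lesion length / diseased leaf length × 100%) was recorded 14 days after inoculation. Compared with the control Mudanjiang 8, all positive transformants showed significantly enhanced resistance (Table 1).
Table 1. Phenotypes of T0 transformed plants (D103OM) inoculated with the bacterial blight strain PXO61
(1) Three to five leaves were inoculated on each transformed plant; lesion length and leaf length were measured 14 days later, and each value is the mean ± standard deviation over several leaves.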
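The lesion-area score defined above, and the per-plant mean ± standard deviation described in the table footnote, can be sketched in Python as follows. The measurements in the usage example are invented placeholders, not data from Table 1, and the choice of the sample standard deviation (`statistics.stdev`) is an assumption, since the text does not say which estimator was used.

```python
from statistics import mean, stdev
from typing import Iterable, Tuple


def lesion_area_percent(lesion_length: float, leaf_length: float) -> float:
    """Lesion area as defined in the text: lesion length / leaf length x 100%."""
    if leaf_length <= 0:
        raise ValueError("leaf length must be positive")
    return lesion_length / leaf_length * 100.0


def summarize_plant(leaves: Iterable[Tuple[float, float]]) -> Tuple[float, float]:
    """Mean and sample standard deviation of lesion area over a plant's leaves.

    `leaves` holds (lesion_length, leaf_length) pairs in the same unit.
    The sample standard deviation is an assumption of this sketch.
    """
    percents = [lesion_area_percent(lesion, leaf) for lesion, leaf in leaves]
    return mean(percents), stdev(percents)


if __name__ == "__main__":
    # Hypothetical measurements (cm) for three inoculated leaves of one plant.
    example_leaves = [(2.0, 20.0), (4.0, 20.0), (6.0, 20.0)]
    avg, sd = summarize_plant(example_leaves)
    print(f"lesion area: {avg:.1f} ± {sd:.1f} %")
```

Because the score is a ratio, it is independent of the unit of length as long as lesion and leaf are measured in the same unit.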
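3. Co-segregation analysis of the transformed plants
To verify whether the enhanced resistance of the transformed plants was associated with the introduced Xa3/Xa26-3 gene, co-segregation of genotype and resistance was analyzed in T1 families of two transformed plants that showed enhanced resistance in the T0 generation (D103OM25 and D103OM43). Plants were inoculated with the bacterial blight strain PXO61 at the booting stage, and lesion area was recorded 14 days after inoculation. Samples were taken at the same time for DNA extraction, and positive transformants were detected by PCR with the primers Gus2F (5’-CCAGGCAGTTTTAACGATCAGTTCGC-3’) and Gus2R (5’-GAGTGAAGATCCCTTTCTTGTTACCG-3’) for the reporter gene GUS (β-glucuronidase) carried by the transformation vector. Enhanced resistance co-segregated with the GUS gene (Fig. 6). This indicates that the enhanced resistance of the transformed plants is due to the presence of Xa3/Xa26-3 and that Xa3/Xa26-3 is a bacterial blight resistance gene.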
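4. Resistance spectrum of the transformed plants
A T1 transformed family derived from the T0 plant D103OM26 and the control Mudanjiang 8 were inoculated at the booting stage with different bacterial blight strains to determine the resistance spectrum of the transformed plants. Inoculation was by the leaf-clipping method described above. The Philippine strains PXO61, PXO71, PXO112 and PXO341 were kindly provided by the International Rice Research Institute, Philippines (Sun et al., 2004; Wu et al., 2008). The Chinese strain JL691 is a commonly used strain (Sun et al., 2004), as is the Japanese strain T7174 (Cao et al., 2007). Compared with the transformation recipient Mudanjiang 8, transformed plants carrying Xa3/Xa26-3 showed significantly (P<0.01) enhanced resistance to the different strains (Fig. 7). Lesion area on the transformed plants was only 30% to 78% of that on the control Mudanjiang 8. These results indicate that Xa3/Xa26-3 confers broad-spectrum resistance to the bacterial blight pathogen.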
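5. Analysis of Xa3/Xa26-3 gene structure and its encoded product
The inventors analyzed the structure of Xa3/Xa26-3 using transgenic plants (D103OM). Total RNA was extracted from the transgenic plants with TRIzol Reagent (Invitrogen, USA) according to the manufacturer's instructions.
Total RNA for reverse transcription was first treated with RNase-free DNase I (Invitrogen, USA) at room temperature for 15 minutes to remove any contaminating DNA, and then heated at 65°C for 10 minutes to inactivate the DNase I. Reverse transcription was carried out with SuperScript III reverse transcriptase (Invitrogen, USA) according to the supplier's instructions. Genomic DNA and the reverse-transcription product were amplified with the PCR primers RKb3F and RKb2R (Table 2), which span the predicted intron. The RT-PCR product was smaller than the genomic product, showing that an intron is indeed present in this region (Fig. 8). Reverse transcription (RT)-PCR with other primers from the Xa3/Xa26-3 region (Table 2) revealed no further introns.
The ends of the gene were analyzed with the SMARTer RACE cDNA Amplification Kit (Clontech) according to the kit instructions; the 5’ and 3’ terminal sequences of Xa3/Xa26-3 were determined by RACE (rapid amplification of cDNA ends).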
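Table 2. Gene-specific primers used for RT-PCR and RACE analysis
The RT-PCR products were sequenced by the DNA sequencing method described above. Comparison of the cDNA and genomic sequences of Xa3/Xa26-3 showed that the gene consists of 3601 nucleotides and comprises two exons and one intron (Fig. 8). All positions below refer to SEQ ID NO:1 of the sequence listing. The first exon consists of 2954 nucleotides (positions 6026-8979) and comprises a 5’ untranslated region (UTR) of 46 nucleotides (positions 6026-6071) and a coding region of 2908 nucleotides (positions 6072-8979). The intron consists of 104 nucleotides (positions 8980-9083). The second exon consists of 543 nucleotides (positions 9084-9626) and comprises a coding region of 368 nucleotides (positions 9084-9451) and a 3’ UTR of 175 nucleotides (positions 9452-9626).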
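The coordinates above can be cross-checked with simple arithmetic. The Python sketch below takes the 1-based, inclusive positions exactly as stated in the text and confirms that the feature lengths add up; the dictionary layout and function names are illustrative choices, not part of the sequence listing.

```python
# 1-based, inclusive coordinates on SEQ ID NO:1, copied from the description.
FEATURES = {
    "gene": (6026, 9626),
    "exon1": (6026, 8979),
    "5'UTR": (6026, 6071),
    "CDS1": (6072, 8979),
    "intron": (8980, 9083),
    "exon2": (9084, 9626),
    "CDS2": (9084, 9451),
    "3'UTR": (9452, 9626),
}


def length(feature: str) -> int:
    """Length in nucleotides of a feature given inclusive coordinates."""
    start, end = FEATURES[feature]
    return end - start + 1


def check_structure() -> dict:
    """Verify that the parts add up to the whole and return derived sizes."""
    assert length("5'UTR") + length("CDS1") == length("exon1")
    assert length("CDS2") + length("3'UTR") == length("exon2")
    assert length("exon1") + length("intron") + length("exon2") == length("gene")
    coding = length("CDS1") + length("CDS2")
    assert coding % 3 == 0  # the coding region is a whole number of codons
    return {
        "gene_nt": length("gene"),
        "transcript_nt": length("exon1") + length("exon2"),
        "coding_nt": coding,
        "codons": coding // 3,
    }


if __name__ == "__main__":
    print(check_structure())
```

With these coordinates the two coding segments total 3276 nucleotides, that is, 1092 codons, and a primer pair flanking the intron would be expected to give an RT-PCR product 104 bp shorter than the genomic product.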
Xa3/Xa26-3 encodes an LRR receptor kinase-like protein of 1092 amino acids. The XA3/XA26-3 protein shares 92% amino acid identity and 95% amino acid similarity with the XA3/XA26 protein.
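To illustrate how the coding sequence maps to the encoded protein, the following Python sketch translates the first 13 codons of the coding region as printed in the sequence listing (starting at position 6072 of SEQ ID NO:1) using the standard genetic code. The table construction and the `translate` helper are a generic illustration, not a tool used in the patent.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (spaces allowed) into one-letter amino acids.

    Trailing bases that do not form a complete codon are ignored.
    """
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 13 codons of the coding region, copied from the sequence listing.
FIRST_CODONS = "atg gcg ctt gga ttg cca gta tgg att ttc att gcg ttg"

if __name__ == "__main__":
    print(translate(FIRST_CODONS))
```

The result corresponds to the residues Met Ala Leu Gly Leu Pro Val Trp Ile Phe Ile Ala Leu shown under the first line of coding sequence in the listing.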
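References
Chu Z, Wang S (2007) Isolation and cloning, structure and function, and molecular evolution of resistance genes. In: Zhang Q (ed) Genetics and Improvement of Resistance to Bacterial Blight in Rice. Beijing: Science Press, pp. 349-377. (in Chinese)
Guo C (1995) Diseases and Insect Pests of Crops in China. Edited by the Institute of Plant Protection, Chinese Academy of Agricultural Sciences. China Agriculture Press, pp. 14-24. (in Chinese)
Tan G, Ren X, Weng Q, Shi Z, Zhu L, He G (2004) Mapping of a new bacterial blight resistance gene in progeny derived from Oryza officinalis. Acta Genetica Sinica 31:724-729. (in Chinese)
Wang C, Zhao B, Zhang Q, Zhao K, Xing Q (2004) Identification of Y238, a new source of resistance to rice bacterial blight, and development of its near-isogenic line. Journal of Plant Genetic Resources 5:26-30. (in Chinese)
Zhang Q, Zhao B, Zhao K (2000) Identification and molecular marker mapping of Xa23, a new bacterial blight resistance gene from common wild rice. Acta Agronomica Sinica 26:536-542. (in Chinese)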
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25:3389-3402.
Amante-Bordeos A, Sitch LA, Nelson R, Dalmacio RD, Oliva NP, Aswidinnoor H, Leung H (1992) Transfer of bacterial blight and blast resistance from the tetraploid wild rice Oryza minuta to cultivated rice, Oryza sativa. Theor. Appl. Genet. 84:345-354.
Ammiraju JS, Luo M, Goicoechea JL, Wang W, Kudrna D, Mueller C, Talag J, Kim H, Sisneros NB, Blackmon B, Fang E, Tomkins JB, Brar D, MacKill D, McCouch S, Kurata N, Lambert G, Galbraith DW, Arumuganathan K, Rao K, Walling JG, Gill N, Yu Y, SanMiguel P, Soderlund C, Jackson S, Wing RA (2006) The Oryza bacterial artificial chromosome library resource: construction and analysis of 12 deep-coverage large-insert BAC libraries that represent the 10 genome types of the genus Oryza. Genome Res. 16:140-147.
Besemer J, Borodovsky M (2005) GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses. Nucleic Acids Res. 33:451-454.
Cao Y, Ding X, Cai M, Zhao J, Lin Y, Li X, Xu C, Wang S (2007) The expression pattern of a rice disease resistance gene Xa3/Xa26 is differentially regulated by the genetic backgrounds and developmental stages that influence its function. Genetics 177:523-533.
Chisholm ST, Coaker G, Day B, Staskawicz BJ (2006) Host-microbe interactions: shaping the evolution of the plant immune response. Cell 124:803-814.
Gu K, Yang B, Tian D, Wu L, Wang D, Sreekala C, Yang F, Chu Z, Wang GL, White FF, Yin Z (2005) R gene expression induced by a type-III effector triggers disease resistance in rice. Nature 435:1122-1125.
Lin YJ, Zhang Q (2005) Optimising the tissue culture conditions for high efficiency transformation of indica rice. Plant Cell Rep. 23:540-547.
Khush GS, Bacalangco E, Ogawa T (1990) A new gene for resistance to bacterial blight from O. longistaminata. Rice Genet. Newslett. 7:121-122.
Salamov A, Solovyev V (2000) Ab initio gene finding in Drosophila genomic DNA. Genome Res. 10:516-522.
Sambrook J, Russell D (2001) Molecular cloning: A laboratory manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press.
Song WY, Wang GL, Chen LL, Kim HS, Pi LY, Holsten T, Gardner J, Wang B, Zhai WX, Zhu LH, Fauquet C, Ronald P (1995) A receptor kinase-like protein encoded by the rice disease resistance gene, Xa21. Science 270:1804-1806.
Sun X, Cao Y, Yang Z, Xu C, Li X, Wang S, Zhang Q (2004) Xa26, a gene conferring resistance to Xanthomonas oryzae pv. oryzae in rice, encoding a LRR receptor kinase-like protein. Plant J. 37:517-527.
Sun X, Cao Y, Wang S (2006) Point mutations with positive selection were a major force during the evolution of a receptor-kinase resistance gene family of rice. Plant Physiol. 140:998-1008.
Tatusova TA, Madden TL (1999) BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. FEMS Microbiol. Lett. 174:247-250.
Wang GL, Song WY, Ruan DL, Sideris S, Ronald PC (1996) The cloned gene, Xa21, confers resistance to multiple Xanthomonas oryzae pv. oryzae isolates in transgenic plants. Mol. Plant-Microbe Interact. 9:850-855.
Wu X, Li X, Xu C, Wang S (2008) Fine genetic mapping of xa24, a recessive gene for resistance against Xanthomonas oryzae pv. oryzae in rice. Theor. Appl. Genet. 118:185-191.
Xiang Y, Cao Y, Xu C, Li X, Wang S (2006) Xa3, conferring resistance for rice bacterial blight and encoding a receptor kinase-like protein, is the same as Xa26. Theor. Appl. Genet. 113:1347-1355.
Yang Z, Sun X, Wang S, Zhang Q (2003) Genetic and physical mapping of a new gene for bacterial blight resistance in rice. Theor. Appl. Genet. 106:1467-1472.
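Sequence Listing
<110>Huazhong Agricultural University
<120>Xa3/Xa26-3, a major bacterial blight resistance gene from the small-grain wild rice Oryza minuta, and its use in improving disease resistance in rice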
<130>
<141>2010-03-25
<160>2
<170>PatentIn version 3.5
<210>1
<211>13653
<212>DNA
<213>Oryza sativa
<220>
<221>gene
<222>(6026)..(9626)
<220>
<221>5’UTR
<222>(6026)..(6071)
<220>
<221>CDS
<222>(6072)..(8979)
<220>
<221>Intron
<222>(8980)..(9083)
<220>
<221>CDS
<222>(9084)..(9451)
<220>
<221>3’UTR
<222>(9452)..(9626)
<400>1
gggaacgaga ggagggggag ggagtaggtc gagagggaag tcggcccgag tggaggaggg 60
aggtaaagtg gactttacca gagaaaatga aagagggagt ttgagggctc agattcgaaa 120
tccgattttg cggttttctc cgggattgag acgagagggg gaacacgaga ccaggcacac 180
gacaagactt gaccaagaac aaaattttcg caaatagggt ttttagagaa ctaggttttc 240
ccactaacgc cacgacgaaa caggcgttac atatttttaa cacaaaaaac atattatcaa 300
aatatattca atgttagatt taatgaaact aatttgatat ttcagatttt gctaaatttt 360
tctataaact taatcaaact taacaaagtt tgactagaaa aaaattcaaa tgacttataa 420
tatgaaatgg agagagtacc gatttcttgc gttgctgttg catctgtata aatctctctg 480
tgagtgtgtg ttgttatctt gttgcttcta agttccctct gttacatttt ctggttgcag 540
tccaggttcc ctggaagatc aatgctgcta caagtaaaac atccttctcc tcggcaatga 600
ttccccttca caatcacgcg aagtgagccg aaaatgcttc aatacggacg ttcggtgcgg 660
ataaaaaatc ggaccccaca tgtaaccact catttcttat cctcccatct ccttcgtctt 720
tcacgcaggg cggagaagac agccattgcg gcgtcgcctt cgcgcttccc ctacgaagcc 780
aaccgccacg cagctccctc acgcttgctc cttcctcctc cacctctgac tgctgtagca 840
tgctgggcga cggcagatgg taaggccacc ttcctcatgt cccccatcta gatccatcaa 900
tctagttgga gcgtgctttc tgtgcttggt agccggcacc gcaatggcga aagaagaggg 960
ggaaggaagc acagtttgaa agaaatggag tgctttttga tgggtccggt caccggagca 1020
gaggaagaag atggatccag ccatcgaagt agaagaagac agatctagtc acctaagtgg 1080
atgaaaaaga cagacggtag agttgttgca atctattgta tatgtttgtt gcactctatt 1140
ttatttacgt gtttcaccct cgttttatcg gatgttgcac atgatgtaca atgtgtgttt 1200
cttcgatgtt gcagagtaga gtcttgatgt tgcgggtagt gttttttcat tgtttcatct 1260
gattgaaaca ttctttacca agcgatgaaa tatttctcat cgagtaatga aatatcttaa 1320
acatctttta ttgaaacatt attcattgag tgatgaaaca tctgaaatat ttttttacat 1380
gtgatgaaac accgtctgat ttgtagtctt tttttaggtg ataatatctt aatgctattt 1440
atttatacct agcttagtgg tcccaatcta gattgtttct ctctggtgga atttcgctat 1500
gattctatac tgaaaaaaat ttcttaggag tctaaccaag ttcgtcacag gtcttgacca 1560
atcaaattat ttgcttactt ctacatgatt aagtgcatgc ctgcctgcaa tatactacct 1620
ccttttcaaa atgtacgatg ctgttgactt ctcgtatgac gttttcaaaa tgtacgatgc 1680
cgttgacttc tcgtatgacg tttgattata cgtcttattc aaaaaatcta ttatatatta 1740
aaaatccatt aaacttccta caaaaagtca ccatgtggca cttctataaa cactcctatg 1800
ccgccacgtg tcatatctcc aaattaagag aaaatctata gattctgaaa aaaggaaaaa 1860
tatccaacca tcgatttcac ttaaatcggt gatccattat tttagatcat tagattagat 1920
ctattaaata aagcaaatcc ctccctccta actcccgtat atgtacatag aagtagaaaa 1980
agaatacgta ctaccctatt ttcctggaga aaaaggtcga aaacaaaata cactattcat 2040
caggaaaaaa gcaaaaaaaa acatacaaac atatttgtac ggccacctga agtagtgaaa 2100
aaacacagaa aaaagcatac gtcccctcgc ctgattgtag aaaaacatta tattattaca 2160
attaaattta caatgctaaa cttgtattaa ctgtatattt taatttgaga aataattatt 2220
ttactttggg ttttattcta aatttatatg tcacccaaat aatcataaaa aatataaaac 2280
aactaaaact gatatttaaa aattactgac tggtatgaaa ttatgatatg ttaaaataaa 2340
tttactacta gatttataaa aaaaataaaa taaaactaat tccatttatt tatttcatca 2400
taaaaataat aacaaaaata acaatattga ttataccatt atacataact tcataaatat 2460
ggataacaaa attatatcta tgattattcg aagtaaatat tttctttgtt atgaaacttt 2520
gataagctat aataacatta aagtcctaat aacattataa aattgtcata taaactactt 2580
tcatgtattc atatttttaa ttttcattgt gaatatattg caagaaagac aagatacaaa 2640
gctcttgcta tggttcaacc tacacaggtt atacgtggat aacttaaagg tatcaatcat 2700
atgtctcatt aatcttgaat gtgtcagccc cacttacaaa ttcgtatcca ccaattgcta 2760
acgaggtatt tgtgcatcat ctcatatatc ttagatatga tgagcatgtg taccattaga 2820
cgtataataa agataaaggg cattttataa tatagattat atcattctat atcattttat 2880
aaacatacat agccaaagtc atgtgtgatt tactttaaat ctaactaaaa aaatcatgtc 2940
tacaataata agaatcatag aaatttaaaa aacagaaaaa ctcaagagca tataccctaa 3000
tagtaaataa aaatatcaat ggctaatccc taggacatat aaaattataa aggtttgtta 3060
aacgtcctat agaaccctcc tacagtgtct attaaagcct ttaacaaata agagaaaact 3120
ttaaaattct tctataatag acaaagcatc caacacttaa tcctcattaa tttaatgctc 3180
ccattaattt tagctattgg tcatgtcaat gggtatgtgt atacacacat tatttaataa 3240
aaaaaaccat ccattcaagc gtgttttgta atagaaaatt atttgagatg tcaggtgagg 3300
ttgtagtgac ctgttgcaag gttcacaggg ttgtagcaag aatgaccagc atcaagttgc 3360
atgaggttgc cagccatcta atgtatcact aaaagagccc ataggagata tttgtgtttt 3420
gttgattaga gccacaacca aaagaaggca acaagagaac cccaacacaa ataataattt 3480
ttatgaataa accttctctc aactcaagta tctatgtaat attatttaag tcgcatcagc 3540
ttgaaacatg cataatttga taacatggta tatctagaag gctatatgtt agaacattca 3600
attaacccac cttagaaata ataatactta gcaagaactc atgcatgtaa caaatttcta 3660
tctttaattt tgatatttaa ttttatcact gaaaaggggg agatagagta tgtgaatatt 3720
gtccaatttg caagctaaat gtttttttaa atgtgatagt tcagttggtt actgattctc 3780
aagtattatc atactattca agagtagagc aagcaataat catatccaaa acagatacaa 3840
agaacttaca tatcatagat aattttaaca tttagaaaca cgaagatgcc tgcccaaccg 3900
cgtgggctac cttcctagtt aatataaata tgtaaaggta taacttaaga ttataattta 3960
tttgatgata aaacaagtat ttatgtatat atatatatat ttaataaaat gaatgatcaa 4020
acgctatacg aaaaatcaac ggcgtcgtgc attttgaaat ggaggaagta tttttttcga 4080
ggtaatcgtt atgtttgagg aaatgtggca tgagttcacc acttgccaac ttggaatata 4140
acacatcgta tcgttagtgt tccatgcact aaaagtctct ttctttcccc tttggaatga 4200
gaacaatata gcttggtaat actgttcatc ttattcgcct ttttcctcct ttggaaaggg 4260
acagttgagc ggtggtctgg tggttacatc taaaacatgc catgagaagc tttggcactg 4320
tgtgtcttgt gcttcaaatg gctcgacatt acaacctaat aaatctgaac actttgttcc 4380
atcttggact tcagagcact ggtacttcag tacatgccca agggtagctt agaagcactc 4440
tgcactcaga acaaggaaag caattaggct ttctccagag gttggttatt atgctagatg 4500
tgtcaatggc aatggaatac ctacatcatg agcactatga ggtggtctta cactacgatt 4560
tgaagcctag caacgtacta tttgacgatg ataggatggc acatgtggca gactttggca 4620
ttgcaaggtt attgttaggt gatgacaact ccatgatctc agctagcatg ccaggaacag 4680
ttgggtacat ggcaccaggt aattagtact agtttttgtt gtcttgctca aacattgcct 4740
gatattttat tattatcgag tagggtgcaa ctaatttttg gttgtctatc tttctgcgca 4800
gagtatgggg ctcttggaaa agcatcacag aagagcgacg tgttcagcta cgggatcatg 4860
ctactcgaag tgttcactag gaagagatcc acagatgtta tgtttgtagg agaactgaac 4920
atcaggcagt gggttcacca ggcgtttcct gcagagcttg tccatttgat ggactgccaa 4980
cttctacagg atggctcttc ttcttcttcc agtaacatgc atggcttcct tgtgccagtg 5040
ttcgagctgg gcttgctctg ctcagctgac tccccggagc aaaggatggc gatgagcgat 5100
gtggtcgtga cactgaagaa gattaggaag gactatgtca aattgatggc aaccacaggg 5160
agcgctgtgc agcaatgatc catcgctctg tcgtggtata tgagcgaata aaatatatat 5220
catttgcatc catttcttct tctgcatcag gaatagcatc agtgcatgcc cagtgatcga 5280
ttaccctatt tgtgtacggt tgaattgaat atatctgtgg tgcttcaggt tcagcaataa 5340
tttagttggt gtaaaaatgt gattgaactg ttggtcaata aatttgcatg atgaaaatgg 5400
gagtagatga tgtgctgctt atgttttctt atttctggcc aaaataaata aaaaaaagaa 5460
tattctgggc acagcatcac aactccggct caatcagcct taaacagcca cagttaacag 5520
tcctaagcat agaaacttaa caagcttttc aggcaaacaa aacatcaaaa ggtccacaag 5580
acaacagggt cttcagggag cacatcttca ggctgtgatg caaaaaggat ctgacagccg 5640
tatgataact actgaacagg tcgtgcatat tgataaggtc cgccactaac cacccaatca 5700
acaaaagagt aaggccgttg ccgtcaaatt gtttgacaga aaaaaaaaat ttcagtgagt 5760
gtatggctga tgaccgagcg acacagctat catatctagc gtgtcgtgca caccgcgatc 5820
tgctgaatat atattttgtg atgactttat tttccagcgt ttacctagta gtgctgccaa 5880
atatttatga ctggaatttg actggaggga gtatcattta agtttctttc actttctgag 5940
agcaacagtc aaggtcgtcc gagatgttga aagcaagcta gcactacgta ctgtgctaaa 6000
taaagctcaa cttgatcgtc actgtgaagt atgatgcact cttgttgcca atgcatcaca 6060
cacaaccaga c atg gcg ctt gga ttg cca gta tgg att ttc att gcg ttg 6110
Met Ala Leu Gly Leu Pro Val Trp Ile Phe Ile Ala Leu
1 5 10
ttg atc gct ttg tcc act gtg cct tgt gct tcc tct cta ggt ccg agc 6158
Leu Ile Ala Leu Ser Thr Val Pro Cys Ala Ser Ser Leu Gly Pro Ser
15 20 25
aac agt agc ggc agt gac acc gac ctc gct gca ctt ttg gct tta aaa 6206
Asn Ser Ser Gly Ser Asp Thr Asp Leu Ala Ala Leu Leu Ala Leu Lys
30 35 40 45
tcg cag ttc tct gat cct gat aac att ctt gcc ggc aac tgg acc att 6254
Ser Gln Phe Ser Asp Pro Asp Asn Ile Leu Ala Gly Asn Trp Thr Ile
50 55 60
ggc acg cca ttc tgc caa tgg atg ggt gtc tcg tgc agc cac cgc cgg 6302
Gly Thr Pro Phe Cys Gln Trp Met Gly Val Ser Cys Ser His Arg Arg
65 70 75
cag cgc gtc acc gcc ctg gaa ctg cca aac gtt cct ctc caa gga gag 6350
Gln Arg Val Thr Ala Leu Glu Leu Pro Asn Val Pro Leu Gln Gly Glu
80 85 90
ctc agc tct cac ctt ggt aac att tct ttt ctc ttg atc ctc aac ctc 6398
Leu Ser Ser His Leu Gly Asn Ile Ser Phe Leu Leu Ile Leu Asn Leu
95 100 105
acc aac acc ggc ctc aca ggc ttg gtg ccg gat tat ata gga agg cta 6446
Thr Asn Thr Gly Leu Thr Gly Leu Val Pro Asp Tyr Ile Gly Arg Leu
110 115 120 125
cgt cgc ctt gag atc ctt gat ctc ggc cac aat gcc ttg tca ggt ggc 6494
Arg Arg Leu Glu Ile Leu Asp Leu Gly His Asn Ala Leu Ser Gly Gly
130 135 140
gtc cca atc gcc ata ggg aac ctc acg agg ctt cag cta ctt aat cta 6542
Val Pro Ile Ala Ile Gly Asn Leu Thr Arg Leu Gln Leu Leu Asn Leu
145 150 155
cag ttt aac cag cta tat ggt cca atc cca gca gag ctg cag ggg ctg 6590
Gln Phe Asn Gln Leu Tyr Gly Pro Ile Pro Ala Glu Leu Gln Gly Leu
160 165 170
cac agt ctt gac agc atg aat ctc cgt cac aat tac ctc act gga tca 6638
His Ser Leu Asp Ser Met Asn Leu Arg His Asn Tyr Leu Thr Gly Ser
175 180 185
att ccg gac aat ctg ttc aac aac aca tct ttg cta act tat ctc aac 6686
Ile Pro Asp Asn Leu Phe Asn Asn Thr Ser Leu Leu Thr Tyr Leu Asn
190 195 200 205
gtt ggt aac aat agc ctg tca gga ccg ata ccg ggt tgc atc ggt tcc 6734
Val Gly Asn Asn Ser Leu Ser Gly Pro Ile Pro Gly Cys Ile Gly Ser
210 215 220
ttg cca atc ctc caa tac ctt aac ttg cag gcc aat aac tta act ggg 6782
Leu Pro Ile Leu Gln Tyr Leu Asn Leu Gln Ala Asn Asn Leu Thr Gly
225 230 235
gcg gtg cca cca gcc atc ttc aac atg tct aaa tta agt act att tct 6830
Ala Val Pro Pro Ala Ile Phe Asn Met Ser Lys Leu Ser Thr Ile Ser
240 245 250
ctt ata tcg aat ggt tta act ggc cct atc cct ggt aat aca agt ttc 6878
Leu Ile Ser Asn Gly Leu Thr Gly Pro Ile Pro Gly Asn Thr Ser Phe
255 260 265
agc ctc cca gtt cta caa tgg ttc gcc atc agt aaa aac aat ttc ttt 6926
Ser Leu Pro Val Leu Gln Trp Phe Ala Ile Ser Lys Asn Asn Phe Phe
270 275 280 285
ggt caa att cca ctg ggg ttc gca gcg tgt cca tac ctc caa gtt att 6974
Gly Gln Ile Pro Leu Gly Phe Ala Ala Cys Pro Tyr Leu Gln Val Ile
290 295 300
gcc ctg cct tat aat tta ttc gag ggt gtt ttg cca cca tgg ctg ggc 7022
Ala Leu Pro Tyr Asn Leu Phe Glu Gly Val Leu Pro Pro Trp Leu Gly
305 310 315
aag ttg acg agt ctt aat acc atc tcc ttg ggt ggg aat aac ctt gat 7070
Lys Leu Thr Ser Leu Asn Thr Ile Ser Leu Gly Gly Asn Asn Leu Asp
320 325 330
gct ggc ccg atc cct act gaa ctt agc aac ctc acc atg ctg gca gtc 7118
Ala Gly Pro Ile Pro Thr Glu Leu Ser Asn Leu Thr Met Leu Ala Val
335 340 345
tta gat ttg acg acg tgc aac ctg aca gga aac atc cct gca gat att 7166
Leu Asp Leu Thr Thr Cys Asn Leu Thr Gly Asn Ile Pro Ala Asp Ile
350 355 360 365
ggg cac cta ggc caa ctt tca tgg ttg cat ctt gcg agg aat caa cta 7214
Gly His Leu Gly Gln Leu Ser Trp Leu His Leu Ala Arg Asn Gln Leu
370 375 380
aca gga cct att cct gct tct ctt ggc aac ctt tca tcg tta gca atc 7262
Thr Gly Pro Ile Pro Ala Ser Leu Gly Asn Leu Ser Ser Leu Ala Ile
385 390 395
ctg cta ttg aaa gga aac ttg ttg gat gga tca tta cca gcg aca gtt 7310
Leu Leu Leu Lys Gly Asn Leu Leu Asp Gly Ser Leu Pro Ala Thr Val
400 405 410
gat agc atg aac tca cta act gca gtt gat gtt act gaa aac aat cta 7358
Asp Ser Met Asn Ser Leu Thr Ala Val Asp Val Thr Glu Asn Asn Leu
415 420 425
cac gga gat ctc aac ttc ctt tct act gtt tcc aat tgt aga aag ctt 7406
His Gly Asp Leu Asn Phe Leu Ser Thr Val Ser Asn Cys Arg Lys Leu
430 435 440 445
tct acc ctt caa atg gac ttt aat tat gtc acc gga agc ctc cca gac 7454
Ser Thr Leu Gln Met Asp Phe Asn Tyr Val Thr Gly Ser Leu Pro Asp
450 455 460
tat gtt ggg aac ctg tcg tca cag ctg aaa tgg ttc acg tta tct aac 7502
Tyr Val Gly Asn Leu Ser Ser Gln Leu Lys Trp Phe Thr Leu Ser Asn
465 470 475
aac aag tta act ggc acg ctt cca gct acc att tca aat tta act ggt 7550
Asn Lys Leu Thr Gly Thr Leu Pro Ala Thr Ile Ser Asn Leu Thr Gly
480 485 490
ctt gag gtg ata gat ctt tcg cat aac caa ctg cgc aat gca att cca 7598
Leu Glu Val Ile Asp Leu Ser His Asn Gln Leu Arg Asn Ala Ile Pro
495 500 505
gaa tca atc atg acg att gag aat ctc caa tgg ctt gac cta agt gga 7646
Glu Ser Ile Met Thr Ile Glu Asn Leu Gln Trp Leu Asp Leu Ser Gly
510 515 520 525
aat agc ttg tct ggc ttc atc cca tcg aat act gca ctt cta agg aac 7694
Asn Ser Leu Ser Gly Phe Ile Pro Ser Asn Thr Ala Leu Leu Arg Asn
530 535 540
att gta aaa cta ttc ctt gaa agc aac gaa att tct ggc tcc ata cca 7742
Ile Val Lys Leu Phe Leu Glu Ser Asn Glu Ile Ser Gly Ser Ile Pro
545 550 555
aag gac atg aga aac ctc act aat cta gag cac ctt cta ttg tct gat 7790
Lys Asp Met Arg Asn Leu Thr Asn Leu Glu His Leu Leu Leu Ser Asp
560 565 570
aac caa tta acg tca acc gtg cca cca agc tta ttt cat ctt gat aaa 7838
Asn Gln Leu Thr Ser Thr Val Pro Pro Ser Leu Phe His Leu Asp Lys
575 580 585
atc atc agg cta gat ctt tct cga aac ttc ttg agt ggt gca ctg ccg 7886
Ile Ile Arg Leu Asp Leu Ser Arg Asn Phe Leu Ser Gly Ala Leu Pro
590 595 600 605
gtt gat gta ggg tac ttg aag caa att acc atc ata gat ctc tct gac 7934
Val Asp Val Gly Tyr Leu Lys Gln Ile Thr Ile Ile Asp Leu Ser Asp
610 615 620
aac agc ttt tct ggc agc atc cca gat tcg ata gga gaa ctt caa atg 7982
Asn Ser Phe Ser Gly Ser Ile Pro Asp Ser Ile Gly Glu Leu Gln Met
625 630 635
tta aca cac ctg aat cta tca gct aac gaa ttc tat gat tct gtt cca 8030
Leu Thr His Leu Asn Leu Ser Ala Asn Glu Phe Tyr Asp Ser Val Pro
640 645 650
gac tct ttt ggt aat tta act ggc ttg caa act ttg gac ata tcc cat 8078
Asp Ser Phe Gly Asn Leu Thr Gly Leu Gln Thr Leu Asp Ile Ser His
655 660 665
aac agt att tct ggt acc atc cca aac tac ttg gct aat ttt acg acc 8126
Asn Ser Ile Ser Gly Thr Ile Pro Asn Tyr Leu Ala Asn Phe Thr Thr
670 675 680 685
ctt gtt agc ttg aac cta tct ttc aat aaa cta cat ggt caa ata ccg 8174
Leu Val Ser Leu Asn Leu Ser Phe Asn Lys Leu His Gly Gln Ile Pro
690 695 700
gaa gga ggt atc ttt gca aac atc act tta caa tac ttg gtg ggg aac 8222
Glu Gly Gly Ile Phe Ala Asn Ile Thr Leu Gln Tyr Leu Val Gly Asn
705 710 715
tca ggg cta tgt ggt gct gcc cgt tta gga ttc cca cca tgc caa acc 8270
Ser Gly Leu Cys Gly Ala Ala Arg Leu Gly Phe Pro Pro Cys Gln Thr
720 725 730
acc tcc ccc aag aga aat ggt cac atg cta aaa tac ttg cta ccg act 8318
Thr Ser Pro Lys Arg Asn Gly His Met Leu Lys Tyr Leu Leu Pro Thr
735 740 745
ata atc ata gta gtt gga gtt gta gct tgt tgc ttg tat gta atg att 8366
Ile Ile Ile Val Val Gly Val Val Ala Cys Cys Leu Tyr Val Met Ile
750 755 760 765
aga aag aaa gct aac cat caa aag att tct gct ggt atg gct gac ctt 8414
Arg Lys Lys Ala Asn His Gln Lys Ile Ser Ala Gly Met Ala Asp Leu
770 775 780
atc agc cat caa ttt ctg tcc tat cat gag ctt ctt cgt gca acc gat 8462
Ile Ser His Gln Phe Leu Ser Tyr His Glu Leu Leu Arg Ala Thr Asp
785 790 795
gat ttc agt gat gat aac atg ttg ggc ttc gga agc ttt gga aaa gtt 8510
Asp Phe Ser Asp Asp Asn Met Leu Gly Phe Gly Ser Phe Gly Lys Val
800 805 810
ttt aag gga cag ttg agc aac ggt atg gtg gtt gcc ata aaa gtt ata 8558
Phe Lys Gly Gln Leu Ser Asn Gly Met Val Val Ala Ile Lys Val Ile
815 820 825
cac cag cat ctg gaa cat gcc atg aga agc ttt gac acc gag tgt cgt 8606
His Gln His Leu Glu His Ala Met Arg Ser Phe Asp Thr Glu Cys Arg
830 835 840 845
gta ctc cga att gct cga cat cgt aac ctg ata aag att ctg aac act 8654
Val Leu Arg Ile Ala Arg His Arg Asn Leu Ile Lys Ile Leu Asn Thr
850 855 860
tgt tcc aac ctg gac ttc aga gca ctc gta ctt cag tac atg ccc aag 8702
Cys Ser Asn Leu Asp Phe Arg Ala Leu Val Leu Gln Tyr Met Pro Lys
865 870 875
ggt agc tta gaa gca ctc ctg cac tca gaa caa gga aag caa tta ggc 8750
Gly Ser Leu Glu Ala Leu Leu His Ser Glu Gln Gly Lys Gln Leu Gly
880 885 890
ttt ctc aag agg ttg gat att atg cta gat gtg tca atg gca atg gaa 8798
Phe Leu Lys Arg Leu Asp Ile Met Leu Asp Val Ser Met Ala Met Glu
895 900 905
tac ctg cat cat gag cac tat gag gtg gtc tta cac tgc gat ttg aag 8846
Tyr Leu His His Glu His Tyr Glu Val Val Leu His Cys Asp Leu Lys
910 915 920 925
cct agc aac gta cta ttt gac gat gat atg acg gca cat gtg gca gac 8894
Pro Ser Asn Val Leu Phe Asp Asp Asp Met Thr Ala His Val Ala Asp
930 935 940
ttt ggc att gca agg ttg ttg tta ggt gat gac aac tcc atg atc tca 8942
Phe Gly Ile Ala Arg Leu Leu Leu Gly Asp Asp Asn Ser Met Ile Ser
945 950 955
gct agc atg cca gga aca gtt ggg tac atg gca cca g gtacttagta 8989
Ala Ser Met Pro Gly Thr Val Gly Tyr Met Ala Pro
960 965
ctagtttttg ttgtcttgct caagcattgc ctgatctttt attattatca agtagggtgc 9049
gactaatttt tggtaactaa cttttcttga gcag ag tat ggg gct cta gga aag 9103
Glu Tyr Gly Ala Leu Gly Lys
975
gcg tca cgg aag agc gat gtg ttc agt tac ggg atc atg ttg ttt gaa 9151
Ala Ser Arg Lys Ser Asp Val Phe Ser Tyr Gly Ile Met Leu Phe Glu
980 985 990
gtg ttc act ggg aag aga ccc aca gat gct atg ttt gtg gga gaa ctg 9199
Val Phe Thr Gly Lys Arg Pro Thr Asp Ala Met Phe Val Gly Glu Leu
995 1000 1005
aac atc agg cag tgg gtt cac cag gcg ttt cct gca gag ctt gtc 9244
Asn Ile Arg Gln Trp Val His Gln Ala Phe Pro Ala Glu Leu Val
1010 1015 1020
cat gtg gtg gac tgc caa ctt cta cat gat ggc tct tct tcc agt 9289
His Val Val Asp Cys Gln Leu Leu His Asp Gly Ser Ser Ser Ser
1025 1030 1035
aac atg cat ggc ttc cat gtg cca gtg ttc gag ctg ggc ttg ctc 9334
Asn Met His Gly Phe His Val Pro Val Phe Glu Leu Gly Leu Leu
1040 1045 1050
tgc tca gct gac tcc ccg gag caa agg atg gcg atg agc gat gtg 9379
Cys Ser Ala Asp Ser Pro Glu Gln Arg Met Ala Met Ser Asp Val
1055 1060 1065
gtc gtg aca ctg aag aag att agg aag gac tat gtc aaa ttg atg 9424
Val Val Thr Leu Lys Lys Ile Arg Lys Asp Tyr Val Lys Leu Met
1070 1075 1080
gca acc aca gag aac gct gtg cag cag tgattcatca ctttcttgtg 9471
Ala Thr Thr Glu Asn Ala Val Gln Gln
1085 1090
gtatatgagc gaatgaaatg tatatccttt gcatccattt cttcttctgc attaggaaca 9531
gcatcaatgc atgcccagtg atcgaataac ccttttgctt ctatttgtgt atggttgaat 9591
tgaatatatc tacggtgctt caggttcagc aacaatttag ttggtgtaaa aatgtgattg 9651
aactgctggt cgataaattt gcatcatgaa aatgaaaatg ggagtagatg atgtgctgct 9711
tatattttct atttctggcc aaatatatat aaaaaaggat attctcactt gaaaacagaa 9771
tgaggtttgc tttgtagaca ttgggccttt gtcgtgggct tcactggtac cctgacatgc 9831
tttatagccc atgggcctat gtttgtaatg ggcttttgtt tctttcgaat gactcaaggt 9891
atattaaggc ctgtttgaac acttcggaat attttgcggt ggtagaggtg gacactaaac 9951
gacgacatag gtaaaatgat atatagaaac aaaattaatt caaacgatat actcatgcta 10011
taaatttgag atatggtaat taataaaact tcacataggc gacgaaccgt atatataaca 10071
gttaatgcat aataatggca gagatttcgt aatgaatcct ccaagattta ctcttgcgac 10131
tctatataaa cagctgcaaa atgaagctag gtagtcagca tatgcagtag cttccttcta 10191
aaagaagatc tctcatctca tcatggagca acttctcaac aagaaggctg cagtgttctt 10251
gttcatagct cttatggtga tggctaccgt aaatttctca tcctgtcata ctacacaagg 10311
tatatacctc tggaattaat ttctgcatca catccatata tattgaaaac agtttaactt 10371
gttctctttt gcaatggctc catcacatca atgaagctag aataatctga tatagcctga 10431
caagtatatg tgcattttac attttctgta tgaactattt tagctttctg taaaagtcgg 10491
gtcattaaca atataatgat cttttctcta aaaacatgca ttttacatgt gtatatatat 10551
aggtggatat ggagaaatgg attcgtgcat ggtccttgaa cgttgcgata tgaacaagtg 10611
catgagtgcc tgccaagtca acaagtacaa cggaggtcag tgcgacggcg agttgaacga 10671
ccactgctgc tgtactgatg aggccccgca caaatagtag atttctcttg tccaaatcaa 10731
cgacgacggt tgtataatca acggcaatct tattattgac tgatgcagag attccataat 10791
aaacttgcag ttgatgcttg taccaattca ctcagtatgt caattagtaa aaggagaatc 10851
ccgcgcaaat gtgcgggcaa cttatttatt ctttattaaa agaataaatt ttatttaaag 10911
actatagatt atcttatgtc attaaaaaag aaatatctct accctgaaat tgaccataga 10971
ctttgataca ctaaagtaac ttataagaat atctctcata ttaaaaaaga tcactatgtc 11031
ttggttatat ataaactttc ctatcacaac tacttattat tctctcgaaa aagctaagta 11091
tgtttttaat agatagaatt gcttattaaa aaacaaacat aatacatcat taatgcaata 11151
aaaaaatcat ctctatttca ttccgaagtg cacatgtctc acttgtgtct gaatgtataa 11211
atatttaaaa atgccgtttc tgtcaattaa tgaaagaaaa ccattaaaat tagatataaa 11271
tacccatata atgtgtgaag ttggtcatag ctgctaaggc tttattctct aatgccttga 11331
aacatgggtc ctgcattgac gtagccgtgc ccctcaagac tacgtcgatc ataattttga 11391
cacctagctc actctctaca catttataaa gaattgtatt agtaagtagg ctagttgagc 11451
tctacctcca agagtgaaag aagtcatagg ggattcgatg gactgatcct atgtatgtgt 11511
gtacaaacat aataacattt tagttttaaa tttgaggatt tggagggaaa gattaatatt 11571
ttttcaattg tacatgcata catatatttt tttctagggt tttggatgaa cagtacttat 11631
ttttctgtgg tagtggtact tacatgggag attaacaggg aaggatcact ttaatacatc 11691
taatctaata ttttaaaata atgggtatat tggtttaagt ataaatcaat ggtagatgtt 11751
ttgctttttt ctcatatttt tttatatttt caccaattta ttagagcatc aggttgttgc 11811
cttaggagcg tttgtagaga ttaggtggga gtgattagtt taacagatct aatctaataa 11871
ttaaaaatat tgggtcaacc catttaagta taatcaatcg ctagatgttt tacttttttg 11931
ttaggatttc ttggattttt tctaatttat tagagcacca tgtgatggtt taagattgtt 11991
tgtagcagtg ccacatggtg gcttagaagc gtttgtatga ttttcaatgg acttttaata 12051
tataataata tttttgtctc aatgattggt ggatagcact gcatatgtag aattgataca 12111
tgagtttgag tctgacataa aattaacaca taaggtaaga catgcatgag tcaaatacat 12171
tggatcttat ttattatata cccagattaa cttaagttta aactgcagag actgaagaga 12231
aaaagatcaa catatatgcc tcctattact cttaattaag acatatcgat ccggttcatt 12291
taatcattta ttttctatag tcctttacaa tatatctaga tttatattac tgtgataatt 12351
gatattttaa agcgaagata atatttttta atggtaatca caattagagt attttagttt 12411
ggaacattgt agattgcttt aaatattaaa gttatgatga tgtcaatcat gaaagattta 12471
agtaaagtta ttttgttttg tgaaaaataa tattgtttta acatataaat attttagaac 12531
agtggtatta ttttttggga gtgctaatca aaactaatta gaatatttta ttttgtgata 12591
ttattgagct aatctaatta tgcatgattg tgaaatgata tatgatgata tgatctacat 12651
tatctatttt cttcacaacc ttttttattg ttgggatgac atgcaaaagg ttagggtatg 12711
atttaaaata ctagagggaa tattttttgg aacgtcactc acgatttaag tgtttttgtt 12771
ttttcatatc tatcatattt aatttatttg tgcatgtttt ataaagtgat atatgatcat 12831
ataatatacg ttgtcatttt ttgtatttaa cagcttcaat tgcatactaa ggttaggatg 12891
tgataaaaat taaatatttt ttatgagtta aagaatgtag tggcttatat gggtatttga 12951
atggttaatt aagcttcatg aatacaatta taaagaatga taaaaagatt tatgtgcact 13011
agaggagggg gagggggagg gggaggggaa ggggggtggg ttggtggggg agaggggagg 13071
gaggtgtatt gtggggtttt ttttaaaaaa ataaatctaa tagataaata tagtgggcct 13131
accgatctaa tagataaaaa tagtaggcct accgatttat tagagagtca tatggcaact 13191
tagaagcatt tatagaagcc ccacgtgacg gcttgagaat gtttagtaga agtttaatgg 13251
actagggtaa tagctagtgt gtttatcttc atgaaatctt catcggctgc ttctagtaat 13311
ttgctttaga catgttaaaa taggccgatg agattagatc tgtttcaatt tatttattat 13371
cttgagtaag ccgatacagc tttgccctat cggctgggtt ataacaatgt gtcatcggct 13431
tatcggccga tcgaccattg ttttctgata tggtttactt gtttcttgtt gattgcagat 13491
caaatcaact ggcacgctct gcgtacgact aggcaagcca atttggacct gcactggagt 13551
taagcagatc tcccagaccg ccgtggtaac gcggaacccg ttttgttgct agacgggtcc 13611
aaattctgcg ccaacaaacc ccaagtacta aagcaacacc cg 13653
<210>2
<211>1092
<212>PRT
<213>Oryza sativa
<400>2
Met Ala Leu Gly Leu Pro Val Trp Ile Phe Ile Ala Leu Leu Ile Ala
1 5 10 15
Leu Ser Thr Val Pro Cys Ala Ser Ser Leu Gly Pro Ser Asn Ser Ser
20 25 30
Gly Ser Asp Thr Asp Leu Ala Ala Leu Leu Ala Leu Lys Ser Gln Phe
35 40 45
Ser Asp Pro Asp Asn Ile Leu Ala Gly Asn Trp Thr Ile Gly Thr Pro
50 55 60
Phe Cys Gln Trp Met Gly Val Ser Cys Ser His Arg Arg Gln Arg Val
65 70 75 80
Thr Ala Leu Glu Leu Pro Asn Val Pro Leu Gln Gly Glu Leu Ser Ser
85 90 95
His Leu Gly Asn Ile Ser Phe Leu Leu Ile Leu Asn Leu Thr Asn Thr
100 105 110
Gly Leu Thr Gly Leu Val Pro Asp Tyr Ile Gly Arg Leu Arg Arg Leu
115 120 125
Glu Ile Leu Asp Leu Gly His Asn Ala Leu Ser Gly Gly Val Pro Ile
130 135 140
Ala Ile Gly Asn Leu Thr Arg Leu Gln Leu Leu Asn Leu Gln Phe Asn
145 150 155 160
Gln Leu Tyr Gly Pro Ile Pro Ala Glu Leu Gln Gly Leu His Ser Leu
165 170 175
Asp Ser Met Asn Leu Arg His Asn Tyr Leu Thr Gly Ser Ile Pro Asp
180 185 190
Asn Leu Phe Asn Asn Thr Ser Leu Leu Thr Tyr Leu Asn Val Gly Asn
195 200 205
Asn Ser Leu Ser Gly Pro Ile Pro Gly Cys Ile Gly Ser Leu Pro Ile
210 215 220
Leu Gln Tyr Leu Asn Leu Gln Ala Asn Asn Leu Thr Gly Ala Val Pro
225 230 235 240
Pro Ala Ile Phe Asn Met Ser Lys Leu Ser Thr Ile Ser Leu Ile Ser
245 250 255
Asn Gly Leu Thr Gly Pro Ile Pro Gly Asn Thr Ser Phe Ser Leu Pro
260 265 270
Val Leu Gln Trp Phe Ala Ile Ser Lys Asn Asn Phe Phe Gly Gln Ile
275 280 285
Pro Leu Gly Phe Ala Ala Cys Pro Tyr Leu Gln Val Ile Ala Leu Pro
290 295 300
Tyr Asn Leu Phe Glu Gly Val Leu Pro Pro Trp Leu Gly Lys Leu Thr
305 310 315 320
Ser Leu Asn Thr Ile Ser Leu Gly Gly Asn Asn Leu Asp Ala Gly Pro
325 330 335
Ile Pro Thr Glu Leu Ser Asn Leu Thr Met Leu Ala Val Leu Asp Leu
340 345 350
Thr Thr Cys Asn Leu Thr Gly Asn Ile Pro Ala Asp Ile Gly His Leu
355 360 365
Gly Gln Leu Ser Trp Leu His Leu Ala Arg Asn Gln Leu Thr Gly Pro
370 375 380
Ile Pro Ala Ser Leu Gly Asn Leu Ser Ser Leu Ala Ile Leu Leu Leu
385 390 395 400
Lys Gly Asn Leu Leu Asp Gly Ser Leu Pro Ala Thr Val Asp Ser Met
405 410 415
Asn Ser Leu Thr Ala Val Asp Val Thr Glu Asn Asn Leu His Gly Asp
420 425 430
Leu Asn Phe Leu Ser Thr Val Ser Asn Cys Arg Lys Leu Ser Thr Leu
435 440 445
Gln Met Asp Phe Asn Tyr Val Thr Gly Ser Leu Pro Asp Tyr Val Gly
450 455 460
Asn Leu Ser Ser Gln Leu Lys Trp Phe Thr Leu Ser Asn Asn Lys Leu
465 470 475 480
Thr Gly Thr Leu Pro Ala Thr Ile Ser Asn Leu Thr Gly Leu Glu Val
485 490 495
Ile Asp Leu Ser His Asn Gln Leu Arg Asn Ala Ile Pro Glu Ser Ile
500 505 510
Met Thr Ile Glu Asn Leu Gln Trp Leu Asp Leu Ser Gly Asn Ser Leu
515 520 525
Ser Gly Phe Ile Pro Ser Asn Thr Ala Leu Leu Arg Asn Ile Val Lys
530 535 540
Leu Phe Leu Glu Ser Asn Glu Ile Ser Gly Ser Ile Pro Lys Asp Met
545 550 555 560
Arg Asn Leu Thr Asn Leu Glu His Leu Leu Leu Ser Asp Asn Gln Leu
565 570 575
Thr Ser Thr Val Pro Pro Ser Leu Phe His Leu Asp Lys Ile Ile Arg
580 585 590
Leu Asp Leu Ser Arg Asn Phe Leu Ser Gly Ala Leu Pro Val Asp Val
595 600 605
Gly Tyr Leu Lys Gln Ile Thr Ile Ile Asp Leu Ser Asp Asn Ser Phe
610 615 620
Ser Gly Ser Ile Pro Asp Ser Ile Gly Glu Leu Gln Met Leu Thr His
625 630 635 640
Leu Asn Leu Ser Ala Asn Glu Phe Tyr Asp Ser Val Pro Asp Ser Phe
645 650 655
Gly Asn Leu Thr Gly Leu Gln Thr Leu Asp Ile Ser His Asn Ser Ile
660 665 670
Ser Gly Thr Ile Pro Asn Tyr Leu Ala Asn Phe Thr Thr Leu Val Ser
675 680 685
Leu Asn Leu Ser Phe Asn Lys Leu His Gly Gln Ile Pro Glu Gly Gly
690 695 700
Ile Phe Ala Asn Ile Thr Leu Gln Tyr Leu Val Gly Asn Ser Gly Leu
705 710 715 720
Cys Gly Ala Ala Arg Leu Gly Phe Pro Pro Cys Gln Thr Thr Ser Pro
725 730 735
Lys Arg Asn Gly His Met Leu Lys Tyr Leu Leu Pro Thr Ile Ile Ile
740 745 750
Val Val Gly Val Val Ala Cys Cys Leu Tyr Val Met Ile Arg Lys Lys
755 760 765
Ala Asn His Gln Lys Ile Ser Ala Gly Met Ala Asp Leu Ile Ser His
770 775 780
Gln Phe Leu Ser Tyr His Glu Leu Leu Arg Ala Thr Asp Asp Phe Ser
785 790 795 800
Asp Asp Asn Met Leu Gly Phe Gly Ser Phe Gly Lys Val Phe Lys Gly
805 810 815
Gln Leu Ser Asn Gly Met Val Val Ala Ile Lys Val Ile His Gln His
820 825 830
Leu Glu His Ala Met Arg Ser Phe Asp Thr Glu Cys Arg Val Leu Arg
835 840 845
Ile Ala Arg His Arg Asn Leu Ile Lys Ile Leu Asn Thr Cys Ser Asn
850 855 860
Leu Asp Phe Arg Ala Leu Val Leu Gln Tyr Met Pro Lys Gly Ser Leu
865 870 875 880
Glu Ala Leu Leu His Ser Glu Gln Gly Lys Gln Leu Gly Phe Leu Lys
885 890 895
Arg Leu Asp Ile Met Leu Asp Val Ser Met Ala Met Glu Tyr Leu His
900 905 910
His Glu His Tyr Glu Val Val Leu His Cys Asp Leu Lys Pro Ser Asn
915 920 925
Val Leu Phe Asp Asp Asp Met Thr Ala His Val Ala Asp Phe Gly Ile
930 935 940
Ala Arg Leu Leu Leu Gly Asp Asp Asn Ser Met Ile Ser Ala Ser Met
945 950 955 960
Pro Gly Thr Val Gly Tyr Met Ala Pro Glu Tyr Gly Ala Leu Gly Lys
965 970 975
Ala Ser Arg Lys Ser Asp Val Phe Ser Tyr Gly Ile Met Leu Phe Glu
980 985 990
Val Phe Thr Gly Lys Arg Pro Thr Asp Ala Met Phe Val Gly Glu Leu
995 1000 1005
Asn Ile Arg Gln Trp Val His Gln Ala Phe Pro Ala Glu Leu Val
1010 1015 1020
His Val Val Asp Cys Gln Leu Leu His Asp Gly Ser Ser Ser Ser
1025 1030 1035
Asn Met His Gly Phe His Val Pro Val Phe Glu Leu Gly Leu Leu
1040 1045 1050
Cys Ser Ala Asp Ser Pro Glu Gln Arg Met Ala Met Ser Asp Val
1055 1060 1065
Val Val Thr Leu Lys Lys Ile Arg Lys Asp Tyr Val Lys Leu Met
1070 1075 1080
Ala Thr Thr Glu Asn Ala Val Gln Gln
1085 1090
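The correspondence between the annotated coding regions of SEQ ID NO: 1 and the protein of SEQ ID NO: 2 can be spot-checked with a short script. The following is a minimal sketch, not part of the patent: it assumes the standard genetic code, and the helper name `translate` and the table names are hypothetical. The test fragments are copied from the listing above (the start of the coding region, the codon split across the intron, and the end of the coding region).

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order
# at each position (assumption: the standard code applies to this gene).
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter residue names, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(cds: str) -> str:
    """Translate a coding fragment (whitespace ignored) into three-letter
    residues, stopping at the first stop codon."""
    cds = "".join(cds.split()).lower()
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(THREE_LETTER[aa])
    return " ".join(residues)


# Start of the coding region (positions 6072 onward in SEQ ID NO: 1).
first_codons = "atg gcg ctt gga ttg cca gta tgg att ttc att gcg ttg"
print(translate(first_codons))

# The Glu codon is split by the intron: "g" ends the first coding
# segment and "ag" begins the second, so the segments are joined first.
exon1_end = "atg gca cca g"
exon2_start = "ag tat ggg gct cta gga aag"
print(translate(exon1_end + exon2_start))
```

Joining the two coding segments before translating is what restores the reading frame; translating each segment separately would lose the split Glu codon.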
Claims (1)
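1. An isolated gene, Xa3/Xa26-3, that confers resistance to bacterial blight, the nucleotide sequence of which is shown in SEQ ID NO: 1 of the sequence listing.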
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010101397937A CN101880669B (zh) | 2010-04-01 | 2010-04-01 | Major bacterial blight resistance gene Xa3/Xa26-3 from Oryza minuta and its use in improving rice disease resistance |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2010101397937A CN101880669B (zh) | 2010-04-01 | 2010-04-01 | Major bacterial blight resistance gene Xa3/Xa26-3 from Oryza minuta and its use in improving rice disease resistance |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101880669A CN101880669A (zh) | 2010-11-10 |
CN101880669B CN101880669B (zh) | 2012-01-25 |
Family
ID=43052811
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2010101397937A Expired - Fee Related CN101880669B (zh) | Major bacterial blight resistance gene Xa3/Xa26-3 from Oryza minuta and its use in improving rice disease resistance | 2010-04-01 | 2010-04-01 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101880669B (zh) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
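CN102703461B (zh) * | 2011-03-28 | 2014-02-26 | Huazhong Agricultural University | Rice disease resistance-related gene c3h12 and its use in improving rice disease resistance |
CN108285898B (zh) * | 2017-01-08 | 2020-11-13 | Huazhong Agricultural University | Rice Xa4 gene and its use in improving multiple agronomic traits of rice |
CN107267523A (zh) * | 2017-06-27 | 2017-10-20 | Yunnan Agricultural University | A bacterial blight resistance protein and its encoding gene |
CN109112147B (zh) * | 2017-07-22 | 2020-09-15 | Huazhong Agricultural University | Use of the rice OsMPKK10-2 gene in improving rice disease resistance and drought resistance |
CN109369790B (zh) * | 2018-12-04 | 2020-11-24 | Institute of Crop Sciences, Chinese Academy of Agricultural Sciences | Rice bacterial blight resistance-related protein OsBBR1, its encoding gene, and uses thereof |
CN110468229B (zh) * | 2019-09-03 | 2020-05-08 | Biotechnology and Germplasm Resources Institute, Yunnan Academy of Agricultural Sciences | Co-segregating molecular marker Hxjy-1 for the rice broad-spectrum, high-level bacterial blight resistance gene Xa45(t) |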
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
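CN1493692A (zh) * | 2002-10-28 | 2004-05-05 | Huazhong Agricultural University | Rice bacterial blight resistance gene Xa26(t) |
CN1498893A (zh) * | 2002-11-11 | 2004-05-26 | Huazhong Agricultural University | Rice bacterial blight resistance gene Xa4 |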
US6869601B2 (en) * | 2001-03-29 | 2005-03-22 | Council Of Scientific And Industrial Research | Bacterial mutant BX065 and a method thereof |
- 2010-04-01 CN CN2010101397937A (patent CN101880669B, zh): not active, Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6869601B2 (en) * | 2001-03-29 | 2005-03-22 | Council Of Scientific And Industrial Research | Bacterial mutant BX065 and a method thereof |
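CN1493692A (zh) * | 2002-10-28 | 2004-05-05 | Huazhong Agricultural University | Rice bacterial blight resistance gene Xa26(t) |
CN1498893A (zh) * | 2002-11-11 | 2004-05-26 | Huazhong Agricultural University | Rice bacterial blight resistance gene Xa4 |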
Also Published As
Publication number | Publication date |
---|---|
CN101880669A (zh) | 2010-11-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Su et al. | Functional divergence of duplicated genes results in a novel blast resistance gene Pi50 at the Pi2/9 locus | |
Yuan et al. | The Pik-p resistance to Magnaporthe oryzae in rice is mediated by a pair of closely linked CC-NBS-LRR genes | |
Chen et al. | A B‐lectin receptor kinase gene conferring rice blast resistance | |
Ma et al. | Pi64, encoding a novel CC-NBS-LRR protein, confers resistance to leaf and neck blast in rice | |
Swiderski et al. | The Arabidopsis PBS1 resistance gene encodes a member of a novel protein kinase subfamily | |
Das et al. | A novel blast resistance gene, Pi54rh cloned from wild species of rice, Oryza rhizomatis confers broad spectrum resistance to Magnaporthe oryzae | |
Li et al. | The Gossypium hirsutum TIR‐NBS‐LRR gene GhDSC1 mediates resistance against Verticillium wilt | |
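CN101892244B (zh) | Major bacterial blight resistance gene Xa3/Xa26-2 from Oryza officinalis and its use in improving rice disease resistance | |
CN101880669B (zh) | Major bacterial blight resistance gene Xa3/Xa26-3 from Oryza minuta and its use in improving rice disease resistance | |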
Rong et al. | Expression of a potato antimicrobial peptide SN1 increases resistance to take-all pathogen Gaeumannomyces graminis var. tritici in transgenic wheat | |
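WO2016101859A1 (zh) | Rice blast resistance gene Pi50, and preparation method and use thereof | |
CN109575114B (zh) | A gene, protein, and molecular marker related to rice grain shape and grain weight, and uses thereof | |
CN107384937A (zh) | Gene controlling rice grain length, grain weight, yield, and grain appearance quality, and use thereof | |
CN107338254B (zh) | Polynucleotides and methods for producing plants resistant to fungal pathogens | |
CN102702337A (zh) | A rice blast resistance protein, its encoding gene, and uses thereof | |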
Zhang et al. | Molecular cloning of a CC–NBS–LRR gene from Vitis quinquangularis and its expression pattern in response to downy mildew pathogen infection | |
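CN105969778B (zh) | Haynaldia villosa nam-v1 gene, its molecular marker, and uses thereof | |
CN100381465C (zh) | A method for detecting bacterial blight resistance in rice | |
CN107337720B (zh) | A plant protein OsNHX5 related to glutelin transport and storage, its encoding gene, and uses thereof | |
CN102041262B (zh) | Rice blast resistance gene Pik-p and use thereof | |
CN109134633B (zh) | Rice blast resistance protein and gene, isolated nucleic acid, and uses thereof | |
CN101993880B (zh) | Rice disease resistance-related gene gh3-2 and its use in breeding rice with broad-spectrum disease resistance | |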
Joshi et al. | Molecular cloning, characterization, and expression analysis of resistance gene candidates in Kaempferia galanga L. | |
CN102051368A (zh) | Rice blast resistance gene Pik and use thereof | |
CN100556916C (zh) | Rice blast resistance gene Pi36 and use thereof |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20120125; Termination date: 20180401 |