CN1590546B - 含生物素生物合成基因的dna片段及其利用 - Google Patents
含生物素生物合成基因的dna片段及其利用 Download PDFInfo
- Publication number
- CN1590546B CN1590546B CN2004100634785A CN200410063478A CN1590546B CN 1590546 B CN1590546 B CN 1590546B CN 2004100634785 A CN2004100634785 A CN 2004100634785A CN 200410063478 A CN200410063478 A CN 200410063478A CN 1590546 B CN1590546 B CN 1590546B
- Authority
- CN
- China
- Prior art keywords
- ala
- leu
- gene
- gly
- gcc
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 108090000623 proteins and genes Proteins 0.000 title claims abstract description 236
- 239000012634 fragment Substances 0.000 title claims abstract description 206
- YBJHBAHKTGYVGT-ZKWXMUAHSA-N (+)-Biotin Chemical compound N1C(=O)N[C@@H]2[C@H](CCCCC(=O)O)SC[C@@H]21 YBJHBAHKTGYVGT-ZKWXMUAHSA-N 0.000 title abstract description 264
- 229960002685 biotin Drugs 0.000 title abstract description 132
- 235000020958 biotin Nutrition 0.000 title abstract description 132
- 239000011616 biotin Substances 0.000 title abstract description 132
- 241000736131 Sphingomonas Species 0.000 claims abstract description 64
- 238000000034 method Methods 0.000 claims abstract description 59
- 244000005700 microbiome Species 0.000 claims abstract description 50
- 108020004414 DNA Proteins 0.000 claims description 241
- 241000736110 Sphingomonas paucimobilis Species 0.000 claims description 44
- 238000011144 upstream manufacturing Methods 0.000 claims description 44
- 239000013598 vector Substances 0.000 claims description 43
- 230000000694 effects Effects 0.000 claims description 39
- 102000004169 proteins and genes Human genes 0.000 claims description 39
- 241001135759 Sphingomonas sp. Species 0.000 claims description 31
- 230000014509 gene expression Effects 0.000 claims description 28
- 108700026244 Open Reading Frames Proteins 0.000 claims description 27
- 230000001105 regulatory effect Effects 0.000 claims description 22
- 101710117026 Biotin synthase Proteins 0.000 claims description 21
- WWUZIQQURGPMPG-UHFFFAOYSA-N (-)-D-erythro-Sphingosine Natural products CCCCCCCCCCCCCC=CC(O)C(N)CO WWUZIQQURGPMPG-UHFFFAOYSA-N 0.000 claims description 6
- 230000003362 replicative effect Effects 0.000 claims description 4
- WWUZIQQURGPMPG-KRWOKUGFSA-N sphingosine Chemical compound CCCCCCCCCCCCC\C=C\[C@@H](O)[C@@H](N)CO WWUZIQQURGPMPG-KRWOKUGFSA-N 0.000 claims description 3
- 125000003275 alpha amino acid group Chemical group 0.000 claims 4
- 239000013612 plasmid Substances 0.000 abstract description 90
- 230000015572 biosynthetic process Effects 0.000 abstract description 69
- 238000010353 genetic engineering Methods 0.000 abstract description 4
- 238000009395 breeding Methods 0.000 abstract 1
- 230000001488 breeding effect Effects 0.000 abstract 1
- 150000001413 amino acids Chemical group 0.000 description 75
- RLMISHABBKUNFO-WHFBIAKZSA-N Ala-Ala-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O RLMISHABBKUNFO-WHFBIAKZSA-N 0.000 description 69
- 238000006243 chemical reaction Methods 0.000 description 68
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 66
- 238000012217 deletion Methods 0.000 description 52
- 230000037430 deletion Effects 0.000 description 52
- 108091008146 restriction endonucleases Proteins 0.000 description 44
- 108090000790 Enzymes Proteins 0.000 description 41
- 102000004190 Enzymes Human genes 0.000 description 39
- 229940088598 enzyme Drugs 0.000 description 39
- 235000018102 proteins Nutrition 0.000 description 33
- 101150023452 bioD gene Proteins 0.000 description 32
- 101150032820 bioF gene Proteins 0.000 description 32
- 101150076754 bioA gene Proteins 0.000 description 31
- 101150085692 bioC gene Proteins 0.000 description 29
- 239000000243 solution Substances 0.000 description 29
- 235000001014 amino acid Nutrition 0.000 description 27
- 238000004519 manufacturing process Methods 0.000 description 26
- 239000000203 mixture Substances 0.000 description 25
- 241000588724 Escherichia coli Species 0.000 description 23
- 101150029327 bioB gene Proteins 0.000 description 23
- 101100381793 Bacillus subtilis (strain 168) bioK gene Proteins 0.000 description 22
- 241000894006 Bacteria Species 0.000 description 22
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 20
- 238000002360 preparation method Methods 0.000 description 20
- 238000012360 testing method Methods 0.000 description 20
- 108091026890 Coding region Proteins 0.000 description 19
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 19
- 239000013600 plasmid vector Substances 0.000 description 18
- KCEGBPIYGIWCDH-UHFFFAOYSA-N 7,8-diaminononanoic acid Chemical compound CC(N)C(N)CCCCCC(O)=O KCEGBPIYGIWCDH-UHFFFAOYSA-N 0.000 description 16
- GUAHPAJOXVYFON-UHFFFAOYSA-N 8-amino-7-oxononanoic acid Chemical compound CC([NH3+])C(=O)CCCCCC([O-])=O GUAHPAJOXVYFON-UHFFFAOYSA-N 0.000 description 16
- 230000006696 biosynthetic metabolic pathway Effects 0.000 description 16
- LYCRXMTYUZDUGA-UYRKPTJQSA-N pimeloyl-CoA Chemical compound O[C@@H]1[C@H](OP(O)(O)=O)[C@@H](COP(O)(=O)OP(O)(=O)OCC(C)(C)[C@@H](O)C(=O)NCCC(=O)NCCSC(=O)CCCCCC(O)=O)O[C@H]1N1C2=NC=NC(N)=C2N=C1 LYCRXMTYUZDUGA-UYRKPTJQSA-N 0.000 description 16
- 239000000872 buffer Substances 0.000 description 15
- 239000001963 growth medium Substances 0.000 description 15
- 239000001888 Peptone Substances 0.000 description 14
- 108010080698 Peptones Proteins 0.000 description 14
- 238000000246 agarose gel electrophoresis Methods 0.000 description 14
- 229940041514 candida albicans extract Drugs 0.000 description 14
- 108020004707 nucleic acids Proteins 0.000 description 14
- 102000039446 nucleic acids Human genes 0.000 description 14
- 150000007523 nucleic acids Chemical class 0.000 description 14
- 235000019319 peptone Nutrition 0.000 description 14
- 239000012138 yeast extract Substances 0.000 description 14
- 101100218845 Escherichia coli (strain K12) bioH gene Proteins 0.000 description 13
- 230000035772 mutation Effects 0.000 description 13
- 108090000340 Transaminases Proteins 0.000 description 12
- 238000003780 insertion Methods 0.000 description 12
- 230000037431 insertion Effects 0.000 description 12
- 239000002253 acid Substances 0.000 description 11
- 239000002609 medium Substances 0.000 description 11
- 238000006467 substitution reaction Methods 0.000 description 11
- 229940088594 vitamin Drugs 0.000 description 11
- 239000011782 vitamin Substances 0.000 description 11
- AUTOLBMXDDTRRT-JGVFFNPUSA-N (4R,5S)-dethiobiotin Chemical compound C[C@@H]1NC(=O)N[C@@H]1CCCCCC(O)=O AUTOLBMXDDTRRT-JGVFFNPUSA-N 0.000 description 10
- 229920001817 Agar Polymers 0.000 description 10
- 108090000364 Ligases Proteins 0.000 description 10
- COYHRQWNJDJCNA-NUJDXYNKSA-N Thr-Thr-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O COYHRQWNJDJCNA-NUJDXYNKSA-N 0.000 description 10
- 239000008272 agar Substances 0.000 description 10
- 238000004520 electroporation Methods 0.000 description 10
- 238000012986 modification Methods 0.000 description 10
- 230000004048 modification Effects 0.000 description 10
- 239000011780 sodium chloride Substances 0.000 description 10
- 108010061238 threonyl-glycine Proteins 0.000 description 10
- 239000004098 Tetracycline Substances 0.000 description 9
- 150000007513 acids Chemical class 0.000 description 9
- 108010005233 alanylglutamic acid Proteins 0.000 description 9
- 238000005520 cutting process Methods 0.000 description 9
- 238000002474 experimental method Methods 0.000 description 9
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 9
- 238000010369 molecular cloning Methods 0.000 description 9
- 229960002180 tetracycline Drugs 0.000 description 9
- 229930101283 tetracycline Natural products 0.000 description 9
- 235000019364 tetracycline Nutrition 0.000 description 9
- 150000003522 tetracyclines Chemical class 0.000 description 9
- QKNYBSVHEMOAJP-UHFFFAOYSA-N 2-amino-2-(hydroxymethyl)propane-1,3-diol;hydron;chloride Chemical compound Cl.OCC(N)(CO)CO QKNYBSVHEMOAJP-UHFFFAOYSA-N 0.000 description 8
- 108091081024 Start codon Proteins 0.000 description 8
- 108010049041 glutamylalanine Proteins 0.000 description 8
- 108010036413 histidylglycine Proteins 0.000 description 8
- 230000000813 microbial effect Effects 0.000 description 8
- 235000021251 pulses Nutrition 0.000 description 8
- 108030002422 Dethiobiotin synthases Proteins 0.000 description 7
- 102000003929 Transaminases Human genes 0.000 description 7
- 108010047495 alanylglycine Proteins 0.000 description 7
- 108010013835 arginine glutamate Proteins 0.000 description 7
- 230000001580 bacterial effect Effects 0.000 description 7
- 230000001851 biosynthetic effect Effects 0.000 description 7
- 229930003231 vitamin Natural products 0.000 description 7
- 235000013343 vitamin Nutrition 0.000 description 7
- 229960000723 ampicillin Drugs 0.000 description 6
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 6
- 108010043240 arginyl-leucyl-glycine Proteins 0.000 description 6
- 238000007796 conventional method Methods 0.000 description 6
- 108010052305 exodeoxyribonuclease III Proteins 0.000 description 6
- 108010050848 glycylleucine Proteins 0.000 description 6
- 108010040030 histidinoalanine Proteins 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 239000007788 liquid Substances 0.000 description 6
- 230000035484 reaction time Effects 0.000 description 6
- YLTKNGYYPIWKHZ-ACZMJKKPSA-N Ala-Ala-Glu Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O YLTKNGYYPIWKHZ-ACZMJKKPSA-N 0.000 description 5
- 240000006024 Lactobacillus plantarum Species 0.000 description 5
- 235000013965 Lactobacillus plantarum Nutrition 0.000 description 5
- XBBKIIGCUMBKCO-JXUBOQSCSA-N Leu-Ala-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XBBKIIGCUMBKCO-JXUBOQSCSA-N 0.000 description 5
- 102000003960 Ligases Human genes 0.000 description 5
- 108010040443 aspartyl-aspartic acid Proteins 0.000 description 5
- 238000010367 cloning Methods 0.000 description 5
- 230000009089 cytolysis Effects 0.000 description 5
- 230000029087 digestion Effects 0.000 description 5
- 230000012010 growth Effects 0.000 description 5
- 229940072205 lactobacillus plantarum Drugs 0.000 description 5
- 230000002906 microbiologic effect Effects 0.000 description 5
- 239000000047 product Substances 0.000 description 5
- 108010053725 prolylvaline Proteins 0.000 description 5
- 238000004445 quantitative analysis Methods 0.000 description 5
- 238000012546 transfer Methods 0.000 description 5
- COXMUHNBYCVVRG-DCAQKATOSA-N Arg-Leu-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O COXMUHNBYCVVRG-DCAQKATOSA-N 0.000 description 4
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 4
- HUFUVTYGPOUCBN-MBLNEYKQSA-N Gly-Thr-Ile Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HUFUVTYGPOUCBN-MBLNEYKQSA-N 0.000 description 4
- JBCLFWXMTIKCCB-UHFFFAOYSA-N H-Gly-Phe-OH Natural products NCC(=O)NC(C(O)=O)CC1=CC=CC=C1 JBCLFWXMTIKCCB-UHFFFAOYSA-N 0.000 description 4
- NZOCIWKZUVUNDW-ZKWXMUAHSA-N Ile-Gly-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O NZOCIWKZUVUNDW-ZKWXMUAHSA-N 0.000 description 4
- 108010086093 Mung Bean Nuclease Proteins 0.000 description 4
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 4
- SMIDBHKWSYUBRZ-ACZMJKKPSA-N Ser-Glu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O SMIDBHKWSYUBRZ-ACZMJKKPSA-N 0.000 description 4
- DDRBQONWVBDQOY-GUBZILKMSA-N Val-Ala-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O DDRBQONWVBDQOY-GUBZILKMSA-N 0.000 description 4
- 108010044940 alanylglutamine Proteins 0.000 description 4
- 108010068380 arginylarginine Proteins 0.000 description 4
- 108010047857 aspartylglycine Proteins 0.000 description 4
- 230000001419 dependent effect Effects 0.000 description 4
- 108010003700 lysyl aspartic acid Proteins 0.000 description 4
- 239000006228 supernatant Substances 0.000 description 4
- 238000003786 synthesis reaction Methods 0.000 description 4
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 4
- MZZYGYNZAOVRTG-UHFFFAOYSA-N 2-hydroxy-n-(1h-1,2,4-triazol-5-yl)benzamide Chemical compound OC1=CC=CC=C1C(=O)NC1=NC=NN1 MZZYGYNZAOVRTG-UHFFFAOYSA-N 0.000 description 3
- BTYTYHBSJKQBQA-GCJQMDKQSA-N Ala-Asp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N)O BTYTYHBSJKQBQA-GCJQMDKQSA-N 0.000 description 3
- WGDNWOMKBUXFHR-BQBZGAKWSA-N Ala-Gly-Arg Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N WGDNWOMKBUXFHR-BQBZGAKWSA-N 0.000 description 3
- VGPWRRFOPXVGOH-BYPYZUCNSA-N Ala-Gly-Gly Chemical compound C[C@H](N)C(=O)NCC(=O)NCC(O)=O VGPWRRFOPXVGOH-BYPYZUCNSA-N 0.000 description 3
- PCIFXPRIFWKWLK-YUMQZZPRSA-N Ala-Gly-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N PCIFXPRIFWKWLK-YUMQZZPRSA-N 0.000 description 3
- MNZHHDPWDWQJCQ-YUMQZZPRSA-N Ala-Leu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O MNZHHDPWDWQJCQ-YUMQZZPRSA-N 0.000 description 3
- OMDNCNKNEGFOMM-BQBZGAKWSA-N Ala-Met-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)NCC(O)=O OMDNCNKNEGFOMM-BQBZGAKWSA-N 0.000 description 3
- 101100220068 Arabidopsis thaliana CDA6 gene Proteins 0.000 description 3
- LVMUGODRNHFGRA-AVGNSLFASA-N Arg-Leu-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O LVMUGODRNHFGRA-AVGNSLFASA-N 0.000 description 3
- YBZMTKUDWXZLIX-UWVGGRQHSA-N Arg-Leu-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O YBZMTKUDWXZLIX-UWVGGRQHSA-N 0.000 description 3
- VYZBPPBKFCHCIS-WPRPVWTQSA-N Arg-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCCN=C(N)N VYZBPPBKFCHCIS-WPRPVWTQSA-N 0.000 description 3
- HXVILZUZXFLVEN-DCAQKATOSA-N Asp-Met-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O HXVILZUZXFLVEN-DCAQKATOSA-N 0.000 description 3
- 101150015687 CDA7 gene Proteins 0.000 description 3
- SNHRIJBANHPWMO-XGEHTFHBSA-N Cys-Met-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CS)N)O SNHRIJBANHPWMO-XGEHTFHBSA-N 0.000 description 3
- UTKUTMJSWKKHEM-WDSKDSINSA-N Glu-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O UTKUTMJSWKKHEM-WDSKDSINSA-N 0.000 description 3
- SYWCGQOIIARSIX-SRVKXCTJSA-N Glu-Pro-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O SYWCGQOIIARSIX-SRVKXCTJSA-N 0.000 description 3
- GUOWMVFLAJNPDY-CIUDSAMLSA-N Glu-Ser-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GUOWMVFLAJNPDY-CIUDSAMLSA-N 0.000 description 3
- PYTZFYUXZZHOAD-WHFBIAKZSA-N Gly-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)CN PYTZFYUXZZHOAD-WHFBIAKZSA-N 0.000 description 3
- 101000658622 Homo sapiens Testis-specific Y-encoded-like protein 2 Proteins 0.000 description 3
- JJQQGCMKLOEGAV-OSUNSFLBSA-N Ile-Thr-Met Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)O)N JJQQGCMKLOEGAV-OSUNSFLBSA-N 0.000 description 3
- WNGVUZWBXZKQES-YUMQZZPRSA-N Leu-Ala-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O WNGVUZWBXZKQES-YUMQZZPRSA-N 0.000 description 3
- RXGLHDWAZQECBI-SRVKXCTJSA-N Leu-Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O RXGLHDWAZQECBI-SRVKXCTJSA-N 0.000 description 3
- FDBTVENULFNTAL-XQQFMLRXSA-N Leu-Val-Pro Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N FDBTVENULFNTAL-XQQFMLRXSA-N 0.000 description 3
- IUWMQCZOTYRXPL-ZPFDUUQYSA-N Lys-Ile-Asp Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O IUWMQCZOTYRXPL-ZPFDUUQYSA-N 0.000 description 3
- JPCHYAUKOUGOIB-HJGDQZAQSA-N Met-Glu-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JPCHYAUKOUGOIB-HJGDQZAQSA-N 0.000 description 3
- 241001465754 Metazoa Species 0.000 description 3
- WUGMRIBZSVSJNP-UHFFFAOYSA-N N-L-alanyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C)C(O)=O)=CNC2=C1 WUGMRIBZSVSJNP-UHFFFAOYSA-N 0.000 description 3
- SITLTJHOQZFJGG-UHFFFAOYSA-N N-L-alpha-glutamyl-L-valine Natural products CC(C)C(C(O)=O)NC(=O)C(N)CCC(O)=O SITLTJHOQZFJGG-UHFFFAOYSA-N 0.000 description 3
- ZLGQEBCCANLYRA-RYUDHWBXSA-N Phe-Gly-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O ZLGQEBCCANLYRA-RYUDHWBXSA-N 0.000 description 3
- XDMMOISUAHXXFD-SRVKXCTJSA-N Phe-Ser-Asp Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O XDMMOISUAHXXFD-SRVKXCTJSA-N 0.000 description 3
- FMLRRBDLBJLJIK-DCAQKATOSA-N Pro-Leu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FMLRRBDLBJLJIK-DCAQKATOSA-N 0.000 description 3
- 238000012300 Sequence Analysis Methods 0.000 description 3
- 102100034917 Testis-specific Y-encoded-like protein 2 Human genes 0.000 description 3
- DNOOLPROHJWCSQ-RCWTZXSCSA-N Val-Arg-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O DNOOLPROHJWCSQ-RCWTZXSCSA-N 0.000 description 3
- UMPVMAYCLYMYGA-ONGXEEELSA-N Val-Leu-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O UMPVMAYCLYMYGA-ONGXEEELSA-N 0.000 description 3
- DIOSYUIWOQCXNR-ONGXEEELSA-N Val-Lys-Gly Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CCCCN)C(=O)NCC(O)=O DIOSYUIWOQCXNR-ONGXEEELSA-N 0.000 description 3
- NHXZRXLFOBFMDM-AVGNSLFASA-N Val-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C NHXZRXLFOBFMDM-AVGNSLFASA-N 0.000 description 3
- 108010086434 alanyl-seryl-glycine Proteins 0.000 description 3
- 108010087924 alanylproline Proteins 0.000 description 3
- 108010001271 arginyl-glutamyl-arginine Proteins 0.000 description 3
- 108010072041 arginyl-glycyl-aspartic acid Proteins 0.000 description 3
- 101150043536 bioH gene Proteins 0.000 description 3
- 230000000295 complement effect Effects 0.000 description 3
- 108010016616 cysteinylglycine Proteins 0.000 description 3
- 239000000284 extract Substances 0.000 description 3
- 238000012224 gene deletion Methods 0.000 description 3
- 108010078326 glycyl-glycyl-valine Proteins 0.000 description 3
- 108010089804 glycyl-threonine Proteins 0.000 description 3
- 108010081551 glycylphenylalanine Proteins 0.000 description 3
- 108010037850 glycylvaline Proteins 0.000 description 3
- 108010018006 histidylserine Proteins 0.000 description 3
- 230000006801 homologous recombination Effects 0.000 description 3
- 238000002744 homologous recombination Methods 0.000 description 3
- 239000003550 marker Substances 0.000 description 3
- 239000002773 nucleotide Substances 0.000 description 3
- 125000003729 nucleotide group Chemical group 0.000 description 3
- 239000002243 precursor Substances 0.000 description 3
- 238000012216 screening Methods 0.000 description 3
- 238000012163 sequencing technique Methods 0.000 description 3
- FWMNVWWHGCHHJJ-SKKKGAJSSA-N 4-amino-1-[(2r)-6-amino-2-[[(2r)-2-[[(2r)-2-[[(2r)-2-amino-3-phenylpropanoyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]hexanoyl]piperidine-4-carboxylic acid Chemical compound C([C@H](C(=O)N[C@H](CC(C)C)C(=O)N[C@H](CCCCN)C(=O)N1CCC(N)(CC1)C(O)=O)NC(=O)[C@H](N)CC=1C=CC=CC=1)C1=CC=CC=C1 FWMNVWWHGCHHJJ-SKKKGAJSSA-N 0.000 description 2
- BYXHQQCXAJARLQ-ZLUOBGJFSA-N Ala-Ala-Ala Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O BYXHQQCXAJARLQ-ZLUOBGJFSA-N 0.000 description 2
- WQVFQXXBNHHPLX-ZKWXMUAHSA-N Ala-Ala-His Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](Cc1cnc[nH]1)C(O)=O WQVFQXXBNHHPLX-ZKWXMUAHSA-N 0.000 description 2
- LGQPPBQRUBVTIF-JBDRJPRFSA-N Ala-Ala-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O LGQPPBQRUBVTIF-JBDRJPRFSA-N 0.000 description 2
- FXKNPWNXPQZLES-ZLUOBGJFSA-N Ala-Asn-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FXKNPWNXPQZLES-ZLUOBGJFSA-N 0.000 description 2
- BGNLUHXLSAQYRQ-FXQIFTODSA-N Ala-Glu-Gln Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O BGNLUHXLSAQYRQ-FXQIFTODSA-N 0.000 description 2
- PAIHPOGPJVUFJY-WDSKDSINSA-N Ala-Glu-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O PAIHPOGPJVUFJY-WDSKDSINSA-N 0.000 description 2
- ZVFVBBGVOILKPO-WHFBIAKZSA-N Ala-Gly-Ala Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O ZVFVBBGVOILKPO-WHFBIAKZSA-N 0.000 description 2
- LMFXXZPPZDCPTA-ZKWXMUAHSA-N Ala-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N LMFXXZPPZDCPTA-ZKWXMUAHSA-N 0.000 description 2
- QCTFKEJEIMPOLW-JURCDPSOSA-N Ala-Ile-Phe Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 QCTFKEJEIMPOLW-JURCDPSOSA-N 0.000 description 2
- LNNSWWRRYJLGNI-NAKRPEOUSA-N Ala-Ile-Val Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O LNNSWWRRYJLGNI-NAKRPEOUSA-N 0.000 description 2
- YHKANGMVQWRMAP-DCAQKATOSA-N Ala-Leu-Arg Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N YHKANGMVQWRMAP-DCAQKATOSA-N 0.000 description 2
- CCDFBRZVTDDJNM-GUBZILKMSA-N Ala-Leu-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O CCDFBRZVTDDJNM-GUBZILKMSA-N 0.000 description 2
- 108010011667 Ala-Phe-Ala Proteins 0.000 description 2
- IOFVWPYSRSCWHI-JXUBOQSCSA-N Ala-Thr-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@H](C)N IOFVWPYSRSCWHI-JXUBOQSCSA-N 0.000 description 2
- YJHKTAMKPGFJCT-NRPADANISA-N Ala-Val-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O YJHKTAMKPGFJCT-NRPADANISA-N 0.000 description 2
- OMSKGWFGWCQFBD-KZVJFYERSA-N Ala-Val-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OMSKGWFGWCQFBD-KZVJFYERSA-N 0.000 description 2
- GHNDBBVSWOWYII-LPEHRKFASA-N Arg-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O GHNDBBVSWOWYII-LPEHRKFASA-N 0.000 description 2
- IYMAXBFPHPZYIK-BQBZGAKWSA-N Arg-Gly-Asp Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O IYMAXBFPHPZYIK-BQBZGAKWSA-N 0.000 description 2
- AUFHLLPVPSMEOG-YUMQZZPRSA-N Arg-Gly-Glu Chemical compound NC(N)=NCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O AUFHLLPVPSMEOG-YUMQZZPRSA-N 0.000 description 2
- UBCPNBUIQNMDNH-NAKRPEOUSA-N Arg-Ile-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C)C(O)=O UBCPNBUIQNMDNH-NAKRPEOUSA-N 0.000 description 2
- OTZMRMHZCMZOJZ-SRVKXCTJSA-N Arg-Leu-Glu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O OTZMRMHZCMZOJZ-SRVKXCTJSA-N 0.000 description 2
- JOADBFCFJGNIKF-GUBZILKMSA-N Arg-Met-Ala Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O JOADBFCFJGNIKF-GUBZILKMSA-N 0.000 description 2
- VLIJAPRTSXSGFY-STQMWFEESA-N Arg-Tyr-Gly Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=C(O)C=C1 VLIJAPRTSXSGFY-STQMWFEESA-N 0.000 description 2
- FXGMURPOWCKNAZ-JYJNAYRXSA-N Arg-Val-Phe Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 FXGMURPOWCKNAZ-JYJNAYRXSA-N 0.000 description 2
- HZPSDHRYYIORKR-WHFBIAKZSA-N Asn-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CC(N)=O HZPSDHRYYIORKR-WHFBIAKZSA-N 0.000 description 2
- ODBSSLHUFPJRED-CIUDSAMLSA-N Asn-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CC(=O)N)N ODBSSLHUFPJRED-CIUDSAMLSA-N 0.000 description 2
- SGAUXNZEFIEAAI-GARJFASQSA-N Asn-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CC(=O)N)N)C(=O)O SGAUXNZEFIEAAI-GARJFASQSA-N 0.000 description 2
- NPZJLGMWMDNQDD-GHCJXIJMSA-N Asn-Ser-Ile Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O NPZJLGMWMDNQDD-GHCJXIJMSA-N 0.000 description 2
- AECPDLSSUMDUAA-ZKWXMUAHSA-N Asn-Val-Cys Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CC(=O)N)N AECPDLSSUMDUAA-ZKWXMUAHSA-N 0.000 description 2
- KRXIWXCXOARFNT-ZLUOBGJFSA-N Asp-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC(O)=O KRXIWXCXOARFNT-ZLUOBGJFSA-N 0.000 description 2
- IXIWEFWRKIUMQX-DCAQKATOSA-N Asp-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@@H](N)CC(O)=O IXIWEFWRKIUMQX-DCAQKATOSA-N 0.000 description 2
- CELPEWWLSXMVPH-CIUDSAMLSA-N Asp-Asp-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC(O)=O CELPEWWLSXMVPH-CIUDSAMLSA-N 0.000 description 2
- PMEHKVHZQKJACS-PEFMBERDSA-N Asp-Gln-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PMEHKVHZQKJACS-PEFMBERDSA-N 0.000 description 2
- VAWNQIGQPUOPQW-ACZMJKKPSA-N Asp-Glu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VAWNQIGQPUOPQW-ACZMJKKPSA-N 0.000 description 2
- WBDWQKRLTVCDSY-WHFBIAKZSA-N Asp-Gly-Asp Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@@H](CC(O)=O)C(O)=O WBDWQKRLTVCDSY-WHFBIAKZSA-N 0.000 description 2
- ILQCHXURSRRIRY-YUMQZZPRSA-N Asp-His-Gly Chemical compound C1=C(NC=N1)C[C@@H](C(=O)NCC(=O)O)NC(=O)[C@H](CC(=O)O)N ILQCHXURSRRIRY-YUMQZZPRSA-N 0.000 description 2
- HOBNTSHITVVNBN-ZPFDUUQYSA-N Asp-Ile-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N HOBNTSHITVVNBN-ZPFDUUQYSA-N 0.000 description 2
- RRUWMFBLFLUZSI-LPEHRKFASA-N Asp-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N RRUWMFBLFLUZSI-LPEHRKFASA-N 0.000 description 2
- ZKAOJVJQGVUIIU-GUBZILKMSA-N Asp-Pro-Arg Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O ZKAOJVJQGVUIIU-GUBZILKMSA-N 0.000 description 2
- GWWSUMLEWKQHLR-NUMRIWBASA-N Asp-Thr-Glu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O GWWSUMLEWKQHLR-NUMRIWBASA-N 0.000 description 2
- JJQGZGOEDSSHTE-FOHZUACHSA-N Asp-Thr-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O JJQGZGOEDSSHTE-FOHZUACHSA-N 0.000 description 2
- JSNWZMFSLIWAHS-HJGDQZAQSA-N Asp-Thr-Leu Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)O)NC(=O)[C@H](CC(=O)O)N)O JSNWZMFSLIWAHS-HJGDQZAQSA-N 0.000 description 2
- NWAHPBGBDIFUFD-KKUMJFAQSA-N Asp-Tyr-Leu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(C)C)C(O)=O NWAHPBGBDIFUFD-KKUMJFAQSA-N 0.000 description 2
- UXRVDHVARNBOIO-QSFUFRPTSA-N Asp-Val-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CC(=O)O)N UXRVDHVARNBOIO-QSFUFRPTSA-N 0.000 description 2
- 101150026507 CDA3 gene Proteins 0.000 description 2
- VEXZGXHMUGYJMC-UHFFFAOYSA-M Chloride anion Chemical compound [Cl-] VEXZGXHMUGYJMC-UHFFFAOYSA-M 0.000 description 2
- DZLQXIFVQFTFJY-BYPYZUCNSA-N Cys-Gly-Gly Chemical compound SC[C@H](N)C(=O)NCC(=O)NCC(O)=O DZLQXIFVQFTFJY-BYPYZUCNSA-N 0.000 description 2
- ZXCAQANTQWBICD-DCAQKATOSA-N Cys-Lys-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CS)N ZXCAQANTQWBICD-DCAQKATOSA-N 0.000 description 2
- RESAHOSBQHMOKH-KKUMJFAQSA-N Cys-Phe-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)NC(=O)[C@H](CS)N RESAHOSBQHMOKH-KKUMJFAQSA-N 0.000 description 2
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 2
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 2
- 241000196324 Embryophyta Species 0.000 description 2
- NNQHEEQNPQYPGL-FXQIFTODSA-N Gln-Ala-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O NNQHEEQNPQYPGL-FXQIFTODSA-N 0.000 description 2
- KVYVOGYEMPEXBT-GUBZILKMSA-N Gln-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O KVYVOGYEMPEXBT-GUBZILKMSA-N 0.000 description 2
- MLZRSFQRBDNJON-GUBZILKMSA-N Gln-Ala-Lys Chemical compound C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCC(=O)N)N MLZRSFQRBDNJON-GUBZILKMSA-N 0.000 description 2
- ZFBBMCKQSNJZSN-AUTRQRHGSA-N Gln-Val-Gln Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O ZFBBMCKQSNJZSN-AUTRQRHGSA-N 0.000 description 2
- LKDIBBOKUAASNP-FXQIFTODSA-N Glu-Ala-Glu Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O LKDIBBOKUAASNP-FXQIFTODSA-N 0.000 description 2
- BPDVTFBJZNBHEU-HGNGGELXSA-N Glu-Ala-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 BPDVTFBJZNBHEU-HGNGGELXSA-N 0.000 description 2
- NLKVNZUFDPWPNL-YUMQZZPRSA-N Glu-Arg-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O NLKVNZUFDPWPNL-YUMQZZPRSA-N 0.000 description 2
- NADWTMLCUDMDQI-ACZMJKKPSA-N Glu-Asp-Cys Chemical compound C(CC(=O)O)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CS)C(=O)O)N NADWTMLCUDMDQI-ACZMJKKPSA-N 0.000 description 2
- GMAGZGCAYLQBKF-NHCYSSNCSA-N Glu-Met-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C(C)C)C(O)=O GMAGZGCAYLQBKF-NHCYSSNCSA-N 0.000 description 2
- KIEICAOUSNYOLM-NRPADANISA-N Glu-Val-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O KIEICAOUSNYOLM-NRPADANISA-N 0.000 description 2
- NTNUEBVGKMVANB-NHCYSSNCSA-N Glu-Val-Met Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O NTNUEBVGKMVANB-NHCYSSNCSA-N 0.000 description 2
- VSVZIEVNUYDAFR-YUMQZZPRSA-N Gly-Ala-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN VSVZIEVNUYDAFR-YUMQZZPRSA-N 0.000 description 2
- LJPIRKICOISLKN-WHFBIAKZSA-N Gly-Ala-Ser Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O LJPIRKICOISLKN-WHFBIAKZSA-N 0.000 description 2
- DTPOVRRYXPJJAZ-FJXKBIBVSA-N Gly-Arg-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N DTPOVRRYXPJJAZ-FJXKBIBVSA-N 0.000 description 2
- PMNHJLASAAWELO-FOHZUACHSA-N Gly-Asp-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PMNHJLASAAWELO-FOHZUACHSA-N 0.000 description 2
- CCQOOWAONKGYKQ-BYPYZUCNSA-N Gly-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)CN CCQOOWAONKGYKQ-BYPYZUCNSA-N 0.000 description 2
- IDOGEHIWMJMAHT-BYPYZUCNSA-N Gly-Gly-Cys Chemical compound NCC(=O)NCC(=O)N[C@@H](CS)C(O)=O IDOGEHIWMJMAHT-BYPYZUCNSA-N 0.000 description 2
- CCBIBMKQNXHNIN-ZETCQYMHSA-N Gly-Leu-Gly Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O CCBIBMKQNXHNIN-ZETCQYMHSA-N 0.000 description 2
- LHYJCVCQPWRMKZ-WEDXCCLWSA-N Gly-Leu-Thr Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LHYJCVCQPWRMKZ-WEDXCCLWSA-N 0.000 description 2
- JQFILXICXLDTRR-FBCQKBJTSA-N Gly-Thr-Gly Chemical compound NCC(=O)N[C@@H]([C@H](O)C)C(=O)NCC(O)=O JQFILXICXLDTRR-FBCQKBJTSA-N 0.000 description 2
- MYXNLWDWWOTERK-BHNWBGBOSA-N Gly-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN)O MYXNLWDWWOTERK-BHNWBGBOSA-N 0.000 description 2
- UMBDRSMLCUYIRI-DVJZZOLTSA-N Gly-Trp-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)CN)O UMBDRSMLCUYIRI-DVJZZOLTSA-N 0.000 description 2
- WRFOZIJRODPLIA-QWRGUYRKSA-N Gly-Tyr-Cys Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)CN)O WRFOZIJRODPLIA-QWRGUYRKSA-N 0.000 description 2
- GJHWILMUOANXTG-WPRPVWTQSA-N Gly-Val-Arg Chemical compound [H]NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GJHWILMUOANXTG-WPRPVWTQSA-N 0.000 description 2
- FNXSYBOHALPRHV-ONGXEEELSA-N Gly-Val-Lys Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN FNXSYBOHALPRHV-ONGXEEELSA-N 0.000 description 2
- OSZUPUINVNPCOE-SDDRHHMPSA-N His-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC2=CN=CN2)N)C(=O)O OSZUPUINVNPCOE-SDDRHHMPSA-N 0.000 description 2
- VTMLJMNQHKBPON-QWRGUYRKSA-N His-Gly-His Chemical compound C([C@H](N)C(=O)NCC(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 VTMLJMNQHKBPON-QWRGUYRKSA-N 0.000 description 2
- QCBYAHHNOHBXIH-UWVGGRQHSA-N His-Pro-Gly Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)NCC(O)=O)C1=CN=CN1 QCBYAHHNOHBXIH-UWVGGRQHSA-N 0.000 description 2
- PBVQWNDMFFCPIZ-ULQDDVLXSA-N His-Pro-Phe Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CN=CN1 PBVQWNDMFFCPIZ-ULQDDVLXSA-N 0.000 description 2
- WYKXJGWSJUULSL-AVGNSLFASA-N His-Val-Arg Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)Cc1cnc[nH]1)C(=O)N[C@@H](CCCNC(=N)N)C(=O)O WYKXJGWSJUULSL-AVGNSLFASA-N 0.000 description 2
- 241000282412 Homo Species 0.000 description 2
- VSZALHITQINTGC-GHCJXIJMSA-N Ile-Ala-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(=O)O)C(=O)O)N VSZALHITQINTGC-GHCJXIJMSA-N 0.000 description 2
- GYAFMRQGWHXMII-IUKAMOBKSA-N Ile-Asp-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N GYAFMRQGWHXMII-IUKAMOBKSA-N 0.000 description 2
- FUOYNOXRWPJPAN-QEWYBTABSA-N Ile-Glu-Phe Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N FUOYNOXRWPJPAN-QEWYBTABSA-N 0.000 description 2
- PDTMWFVVNZYWTR-NHCYSSNCSA-N Ile-Gly-Lys Chemical compound CC[C@H](C)[C@H](N)C(=O)NCC(=O)N[C@@H](CCCCN)C(O)=O PDTMWFVVNZYWTR-NHCYSSNCSA-N 0.000 description 2
- CKRFDMPBSWYOBT-PPCPHDFISA-N Ile-Lys-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N CKRFDMPBSWYOBT-PPCPHDFISA-N 0.000 description 2
- HQEPKOFULQTSFV-JURCDPSOSA-N Ile-Phe-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)O)N HQEPKOFULQTSFV-JURCDPSOSA-N 0.000 description 2
- ANTFEOSJMAUGIB-KNZXXDILSA-N Ile-Thr-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@@H]1C(=O)O)N ANTFEOSJMAUGIB-KNZXXDILSA-N 0.000 description 2
- KXUKTDGKLAOCQK-LSJOCFKGSA-N Ile-Val-Gly Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)NCC(O)=O KXUKTDGKLAOCQK-LSJOCFKGSA-N 0.000 description 2
- SITWEMZOJNKJCH-UHFFFAOYSA-N L-alanine-L-arginine Natural products CC(N)C(=O)NC(C(O)=O)CCCNC(N)=N SITWEMZOJNKJCH-UHFFFAOYSA-N 0.000 description 2
- RCFDOSNHHZGBOY-UHFFFAOYSA-N L-isoleucyl-L-alanine Natural products CCC(C)C(N)C(=O)NC(C)C(O)=O RCFDOSNHHZGBOY-UHFFFAOYSA-N 0.000 description 2
- BQSLGJHIAGOZCD-CIUDSAMLSA-N Leu-Ala-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O BQSLGJHIAGOZCD-CIUDSAMLSA-N 0.000 description 2
- QLQHWWCSCLZUMA-KKUMJFAQSA-N Leu-Asp-Tyr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 QLQHWWCSCLZUMA-KKUMJFAQSA-N 0.000 description 2
- DZQMXBALGUHGJT-GUBZILKMSA-N Leu-Glu-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O DZQMXBALGUHGJT-GUBZILKMSA-N 0.000 description 2
- HYIFFZAQXPUEAU-QWRGUYRKSA-N Leu-Gly-Leu Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC(C)C HYIFFZAQXPUEAU-QWRGUYRKSA-N 0.000 description 2
- QPXBPQUGXHURGP-UWVGGRQHSA-N Leu-Gly-Met Chemical compound CC(C)C[C@@H](C(=O)NCC(=O)N[C@@H](CCSC)C(=O)O)N QPXBPQUGXHURGP-UWVGGRQHSA-N 0.000 description 2
- UCDHVOALNXENLC-KBPBESRZSA-N Leu-Gly-Tyr Chemical compound CC(C)C[C@H]([NH3+])C(=O)NCC(=O)N[C@H](C([O-])=O)CC1=CC=C(O)C=C1 UCDHVOALNXENLC-KBPBESRZSA-N 0.000 description 2
- OHZIZVWQXJPBJS-IXOXFDKPSA-N Leu-His-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OHZIZVWQXJPBJS-IXOXFDKPSA-N 0.000 description 2
- JKSIBWITFMQTOA-XUXIUFHCSA-N Leu-Ile-Val Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O JKSIBWITFMQTOA-XUXIUFHCSA-N 0.000 description 2
- JDBQSGMJBMPNFT-AVGNSLFASA-N Leu-Pro-Val Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O JDBQSGMJBMPNFT-AVGNSLFASA-N 0.000 description 2
- MVHXGBZUJLWZOH-BJDJZHNGSA-N Leu-Ser-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O MVHXGBZUJLWZOH-BJDJZHNGSA-N 0.000 description 2
- BRTVHXHCUSXYRI-CIUDSAMLSA-N Leu-Ser-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O BRTVHXHCUSXYRI-CIUDSAMLSA-N 0.000 description 2
- PPGBXYKMUMHFBF-KATARQTJSA-N Leu-Ser-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PPGBXYKMUMHFBF-KATARQTJSA-N 0.000 description 2
- FBNPMTNBFFAMMH-AVGNSLFASA-N Leu-Val-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-AVGNSLFASA-N 0.000 description 2
- LMDVGHQPPPLYAR-IHRRRGAJSA-N Leu-Val-His Chemical compound N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)O LMDVGHQPPPLYAR-IHRRRGAJSA-N 0.000 description 2
- KNKHAVVBVXKOGX-JXUBOQSCSA-N Lys-Ala-Thr Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O KNKHAVVBVXKOGX-JXUBOQSCSA-N 0.000 description 2
- FLCMXEFCTLXBTL-DCAQKATOSA-N Lys-Asp-Arg Chemical compound C(CCN)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N FLCMXEFCTLXBTL-DCAQKATOSA-N 0.000 description 2
- AIRZWUMAHCDDHR-KKUMJFAQSA-N Lys-Leu-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O AIRZWUMAHCDDHR-KKUMJFAQSA-N 0.000 description 2
- ORVFEGYUJITPGI-IHRRRGAJSA-N Lys-Leu-Met Chemical compound CSCC[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)CCCCN ORVFEGYUJITPGI-IHRRRGAJSA-N 0.000 description 2
- URGPVYGVWLIRGT-DCAQKATOSA-N Lys-Met-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O URGPVYGVWLIRGT-DCAQKATOSA-N 0.000 description 2
- YCUSPBPZVJDMII-YUMQZZPRSA-N Met-Gly-Glu Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O YCUSPBPZVJDMII-YUMQZZPRSA-N 0.000 description 2
- GWADARYJIJDYRC-XGEHTFHBSA-N Met-Thr-Ser Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O GWADARYJIJDYRC-XGEHTFHBSA-N 0.000 description 2
- QYIGOFGUOVTAHK-ZJDVBMNYSA-N Met-Thr-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QYIGOFGUOVTAHK-ZJDVBMNYSA-N 0.000 description 2
- 102000016943 Muramidase Human genes 0.000 description 2
- 108010014251 Muramidase Proteins 0.000 description 2
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 2
- YBAFDPFAUTYYRW-UHFFFAOYSA-N N-L-alpha-glutamyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCC(O)=O YBAFDPFAUTYYRW-UHFFFAOYSA-N 0.000 description 2
- 108010079364 N-glycylalanine Proteins 0.000 description 2
- JVTMTFMMMHAPCR-UBHSHLNASA-N Phe-Ala-Arg Chemical compound NC(=N)NCCC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 JVTMTFMMMHAPCR-UBHSHLNASA-N 0.000 description 2
- FRPVPGRXUKFEQE-YDHLFZDLSA-N Phe-Asp-Val Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O FRPVPGRXUKFEQE-YDHLFZDLSA-N 0.000 description 2
- HQVPQHLNOVTLDD-IHRRRGAJSA-N Phe-Cys-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CC1=CC=CC=C1)N HQVPQHLNOVTLDD-IHRRRGAJSA-N 0.000 description 2
- HTXVATDVCRFORF-MGHWNKPDSA-N Phe-Ile-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CC=CC=C2)N HTXVATDVCRFORF-MGHWNKPDSA-N 0.000 description 2
- BSKMOCNNLNDIMU-CDMKHQONSA-N Phe-Thr-Gly Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O BSKMOCNNLNDIMU-CDMKHQONSA-N 0.000 description 2
- 229920001213 Polysorbate 20 Polymers 0.000 description 2
- FZHBZMDRDASUHN-NAKRPEOUSA-N Pro-Ala-Ile Chemical compound CC[C@H](C)[C@H](NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1)C(O)=O FZHBZMDRDASUHN-NAKRPEOUSA-N 0.000 description 2
- JXVXYRZQIUPYSA-NHCYSSNCSA-N Pro-Val-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O JXVXYRZQIUPYSA-NHCYSSNCSA-N 0.000 description 2
- 108010025216 RVF peptide Proteins 0.000 description 2
- 108010083644 Ribonucleases Proteins 0.000 description 2
- 102000006382 Ribonucleases Human genes 0.000 description 2
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 2
- 235000014680 Saccharomyces cerevisiae Nutrition 0.000 description 2
- WDXYVIIVDIDOSX-DCAQKATOSA-N Ser-Arg-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@@H](N)CO)CCCN=C(N)N WDXYVIIVDIDOSX-DCAQKATOSA-N 0.000 description 2
- KNZQGAUEYZJUSQ-ZLUOBGJFSA-N Ser-Asp-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](CO)N KNZQGAUEYZJUSQ-ZLUOBGJFSA-N 0.000 description 2
- FMDHKPRACUXATF-ACZMJKKPSA-N Ser-Gln-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(O)=O FMDHKPRACUXATF-ACZMJKKPSA-N 0.000 description 2
- UIGMAMGZOJVTDN-WHFBIAKZSA-N Ser-Gly-Ser Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O UIGMAMGZOJVTDN-WHFBIAKZSA-N 0.000 description 2
- OQPNSDWGAMFJNU-QWRGUYRKSA-N Ser-Gly-Tyr Chemical compound OC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 OQPNSDWGAMFJNU-QWRGUYRKSA-N 0.000 description 2
- MLSQXWSRHURDMF-GARJFASQSA-N Ser-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CO)N)C(=O)O MLSQXWSRHURDMF-GARJFASQSA-N 0.000 description 2
- ASGYVPAVFNDZMA-GUBZILKMSA-N Ser-Met-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCSC)NC(=O)[C@H](CO)N ASGYVPAVFNDZMA-GUBZILKMSA-N 0.000 description 2
- BSXKBOUZDAZXHE-CIUDSAMLSA-N Ser-Pro-Glu Chemical compound [H]N[C@@H](CO)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O BSXKBOUZDAZXHE-CIUDSAMLSA-N 0.000 description 2
- RXUOAOOZIWABBW-XGEHTFHBSA-N Ser-Thr-Arg Chemical compound OC[C@H](N)C(=O)N[C@@H]([C@H](O)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N RXUOAOOZIWABBW-XGEHTFHBSA-N 0.000 description 2
- HXPNJVLVHKABMJ-KKUMJFAQSA-N Ser-Tyr-His Chemical compound C1=CC(=CC=C1C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)NC(=O)[C@H](CO)N)O HXPNJVLVHKABMJ-KKUMJFAQSA-N 0.000 description 2
- ANOQEBQWIAYIMV-AEJSXWLSSA-N Ser-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CO)N ANOQEBQWIAYIMV-AEJSXWLSSA-N 0.000 description 2
- VMHLLURERBWHNL-UHFFFAOYSA-M Sodium acetate Chemical compound [Na+].CC([O-])=O VMHLLURERBWHNL-UHFFFAOYSA-M 0.000 description 2
- 229930006000 Sucrose Natural products 0.000 description 2
- CZMRCDWAGMRECN-UGDNZRGBSA-N Sucrose Chemical compound O[C@H]1[C@H](O)[C@@H](CO)O[C@@]1(CO)O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 CZMRCDWAGMRECN-UGDNZRGBSA-N 0.000 description 2
- TYVAWPFQYFPSBR-BFHQHQDPSA-N Thr-Ala-Gly Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)NCC(O)=O TYVAWPFQYFPSBR-BFHQHQDPSA-N 0.000 description 2
- LHUBVKCLOVALIA-HJGDQZAQSA-N Thr-Arg-Gln Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(N)=O)C(O)=O LHUBVKCLOVALIA-HJGDQZAQSA-N 0.000 description 2
- CEXFELBFVHLYDZ-XGEHTFHBSA-N Thr-Arg-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O CEXFELBFVHLYDZ-XGEHTFHBSA-N 0.000 description 2
- LMMDEZPNUTZJAY-GCJQMDKQSA-N Thr-Asp-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O LMMDEZPNUTZJAY-GCJQMDKQSA-N 0.000 description 2
- OYTNZCBFDXGQGE-XQXXSGGOSA-N Thr-Gln-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](C)C(=O)O)N)O OYTNZCBFDXGQGE-XQXXSGGOSA-N 0.000 description 2
- GUZGCDIZVGODML-NKIYYHGXSA-N Thr-Gln-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N)O GUZGCDIZVGODML-NKIYYHGXSA-N 0.000 description 2
- KBBRNEDOYWMIJP-KYNKHSRBSA-N Thr-Gly-Thr Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)O)N)O KBBRNEDOYWMIJP-KYNKHSRBSA-N 0.000 description 2
- SIMKLINEDYOTKL-MBLNEYKQSA-N Thr-His-Ala Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)N[C@@H](C)C(=O)O)N)O SIMKLINEDYOTKL-MBLNEYKQSA-N 0.000 description 2
- BVOVIGCHYNFJBZ-JXUBOQSCSA-N Thr-Leu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O BVOVIGCHYNFJBZ-JXUBOQSCSA-N 0.000 description 2
- VTVVYQOXJCZVEB-WDCWCFNPSA-N Thr-Leu-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VTVVYQOXJCZVEB-WDCWCFNPSA-N 0.000 description 2
- MEJHFIOYJHTWMK-VOAKCMCISA-N Thr-Leu-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H](N)[C@@H](C)O MEJHFIOYJHTWMK-VOAKCMCISA-N 0.000 description 2
- WTMPKZWHRCMMMT-KZVJFYERSA-N Thr-Pro-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C)C(O)=O WTMPKZWHRCMMMT-KZVJFYERSA-N 0.000 description 2
- MUAFDCVOHYAFNG-RCWTZXSCSA-N Thr-Pro-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MUAFDCVOHYAFNG-RCWTZXSCSA-N 0.000 description 2
- BDENGIGFTNYZSJ-RCWTZXSCSA-N Thr-Pro-Met Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCSC)C(O)=O BDENGIGFTNYZSJ-RCWTZXSCSA-N 0.000 description 2
- 239000007984 Tris EDTA buffer Substances 0.000 description 2
- QPZMOUMNTGTEFR-ZKWXMUAHSA-N Val-Asn-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)N)NC(=O)[C@H](C(C)C)N QPZMOUMNTGTEFR-ZKWXMUAHSA-N 0.000 description 2
- HGJRMXOWUWVUOA-GVXVVHGQSA-N Val-Leu-Gln Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](C(C)C)N HGJRMXOWUWVUOA-GVXVVHGQSA-N 0.000 description 2
- JSOXWWFKRJKTMT-WOPDTQHZSA-N Val-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N JSOXWWFKRJKTMT-WOPDTQHZSA-N 0.000 description 2
- 108010076324 alanyl-glycyl-glycine Proteins 0.000 description 2
- 108010070944 alanylhistidine Proteins 0.000 description 2
- 108010011559 alanylphenylalanine Proteins 0.000 description 2
- KOSRFJWDECSPRO-UHFFFAOYSA-N alpha-L-glutamyl-L-glutamic acid Natural products OC(=O)CCC(N)C(=O)NC(CCC(O)=O)C(O)=O KOSRFJWDECSPRO-UHFFFAOYSA-N 0.000 description 2
- 230000003698 anagen phase Effects 0.000 description 2
- 238000000137 annealing Methods 0.000 description 2
- 108010009111 arginyl-glycyl-glutamic acid Proteins 0.000 description 2
- 108010038633 aspartylglutamate Proteins 0.000 description 2
- 150000001875 compounds Chemical class 0.000 description 2
- 239000012531 culture fluid Substances 0.000 description 2
- 108010004073 cysteinylcysteine Proteins 0.000 description 2
- 108010060199 cysteinylproline Proteins 0.000 description 2
- 230000002950 deficient Effects 0.000 description 2
- 230000003544 deproteinization Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 238000011049 filling Methods 0.000 description 2
- 108010063718 gamma-glutamylaspartic acid Proteins 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 108010055341 glutamyl-glutamic acid Proteins 0.000 description 2
- 108010008237 glutamyl-valyl-glycine Proteins 0.000 description 2
- XBGGUPMXALFZOT-UHFFFAOYSA-N glycyl-L-tyrosine hemihydrate Natural products NCC(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 XBGGUPMXALFZOT-UHFFFAOYSA-N 0.000 description 2
- 108010000434 glycyl-alanyl-leucine Proteins 0.000 description 2
- 108010027668 glycyl-alanyl-valine Proteins 0.000 description 2
- 108010015792 glycyllysine Proteins 0.000 description 2
- 108010028295 histidylhistidine Proteins 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 108010091871 leucylmethionine Proteins 0.000 description 2
- 108010057821 leucylproline Proteins 0.000 description 2
- 229960000274 lysozyme Drugs 0.000 description 2
- 239000004325 lysozyme Substances 0.000 description 2
- 235000010335 lysozyme Nutrition 0.000 description 2
- 108010064235 lysylglycine Proteins 0.000 description 2
- 108010063431 methionyl-aspartyl-glycine Proteins 0.000 description 2
- 108010005942 methionylglycine Proteins 0.000 description 2
- 150000002989 phenols Chemical class 0.000 description 2
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 2
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 2
- 238000001556 precipitation Methods 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 108010077112 prolyl-proline Proteins 0.000 description 2
- 108010004914 prolylarginine Proteins 0.000 description 2
- 108010070643 prolylglutamic acid Proteins 0.000 description 2
- 239000006152 selective media Substances 0.000 description 2
- 108010026333 seryl-proline Proteins 0.000 description 2
- 229910052708 sodium Inorganic materials 0.000 description 2
- 239000011734 sodium Substances 0.000 description 2
- 239000001632 sodium acetate Substances 0.000 description 2
- 235000017281 sodium acetate Nutrition 0.000 description 2
- 239000008223 sterile water Substances 0.000 description 2
- 239000005720 sucrose Substances 0.000 description 2
- 229960000344 thiamine hydrochloride Drugs 0.000 description 2
- 235000019190 thiamine hydrochloride Nutrition 0.000 description 2
- 239000011747 thiamine hydrochloride Substances 0.000 description 2
- DPJRMOMPQZCRJU-UHFFFAOYSA-M thiamine hydrochloride Chemical compound Cl.[Cl-].CC1=C(CCO)SC=[N+]1CC1=CN=C(C)N=C1N DPJRMOMPQZCRJU-UHFFFAOYSA-M 0.000 description 2
- 108010038745 tryptophylglycine Proteins 0.000 description 2
- 241001515965 unidentified phage Species 0.000 description 2
- 108010073969 valyllysine Proteins 0.000 description 2
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Chemical compound O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 2
- 238000004804 winding Methods 0.000 description 2
- CWFMWBHMIMNZLN-NAKRPEOUSA-N (2s)-1-[(2s)-2-[[(2s,3s)-2-amino-3-methylpentanoyl]amino]propanoyl]pyrrolidine-2-carboxylic acid Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CWFMWBHMIMNZLN-NAKRPEOUSA-N 0.000 description 1
- PJODNTCLCCRNCO-UHFFFAOYSA-N 2,2-diaminononanoic acid Chemical compound CCCCCCCC(N)(N)C(O)=O PJODNTCLCCRNCO-UHFFFAOYSA-N 0.000 description 1
- HXUVTXPOZRFMOY-NSHDSACASA-N 2-[[(2s)-2-[[2-[(2-aminoacetyl)amino]acetyl]amino]-3-phenylpropanoyl]amino]acetic acid Chemical compound NCC(=O)NCC(=O)N[C@H](C(=O)NCC(O)=O)CC1=CC=CC=C1 HXUVTXPOZRFMOY-NSHDSACASA-N 0.000 description 1
- PEZMQPADLFXCJJ-ZETCQYMHSA-N 2-[[2-[[(2s)-1-(2-aminoacetyl)pyrrolidine-2-carbonyl]amino]acetyl]amino]acetic acid Chemical compound NCC(=O)N1CCC[C@H]1C(=O)NCC(=O)NCC(O)=O PEZMQPADLFXCJJ-ZETCQYMHSA-N 0.000 description 1
- 108010044087 AS-I toxin Proteins 0.000 description 1
- 241000186361 Actinobacteria <class> Species 0.000 description 1
- SBGXWWCLHIOABR-UHFFFAOYSA-N Ala Ala Gly Ala Chemical compound CC(N)C(=O)NC(C)C(=O)NCC(=O)NC(C)C(O)=O SBGXWWCLHIOABR-UHFFFAOYSA-N 0.000 description 1
- DKJPOZOEBONHFS-ZLUOBGJFSA-N Ala-Ala-Asp Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC(O)=O DKJPOZOEBONHFS-ZLUOBGJFSA-N 0.000 description 1
- BUANFPRKJKJSRR-ACZMJKKPSA-N Ala-Ala-Gln Chemical compound C[C@H]([NH3+])C(=O)N[C@@H](C)C(=O)N[C@H](C([O-])=O)CCC(N)=O BUANFPRKJKJSRR-ACZMJKKPSA-N 0.000 description 1
- CXRCVCURMBFFOL-FXQIFTODSA-N Ala-Ala-Pro Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O CXRCVCURMBFFOL-FXQIFTODSA-N 0.000 description 1
- YYSWCHMLFJLLBJ-ZLUOBGJFSA-N Ala-Ala-Ser Chemical compound C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YYSWCHMLFJLLBJ-ZLUOBGJFSA-N 0.000 description 1
- JBGSZRYCXBPWGX-BQBZGAKWSA-N Ala-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CCCN=C(N)N JBGSZRYCXBPWGX-BQBZGAKWSA-N 0.000 description 1
- JAMAWBXXKFGFGX-KZVJFYERSA-N Ala-Arg-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O JAMAWBXXKFGFGX-KZVJFYERSA-N 0.000 description 1
- XCVRVWZTXPCYJT-BIIVOSGPSA-N Ala-Asn-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N1CCC[C@@H]1C(=O)O)N XCVRVWZTXPCYJT-BIIVOSGPSA-N 0.000 description 1
- ZIBWKCRKNFYTPT-ZKWXMUAHSA-N Ala-Asn-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O ZIBWKCRKNFYTPT-ZKWXMUAHSA-N 0.000 description 1
- NHCPCLJZRSIDHS-ZLUOBGJFSA-N Ala-Asp-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O NHCPCLJZRSIDHS-ZLUOBGJFSA-N 0.000 description 1
- MCKSLROAGSDNFC-ACZMJKKPSA-N Ala-Asp-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(O)=O MCKSLROAGSDNFC-ACZMJKKPSA-N 0.000 description 1
- GWFSQQNGMPGBEF-GHCJXIJMSA-N Ala-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H](C)N GWFSQQNGMPGBEF-GHCJXIJMSA-N 0.000 description 1
- WJRXVTCKASUIFF-FXQIFTODSA-N Ala-Cys-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WJRXVTCKASUIFF-FXQIFTODSA-N 0.000 description 1
- DAEFQZCYZKRTLR-ZLUOBGJFSA-N Ala-Cys-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CC(O)=O)C(O)=O DAEFQZCYZKRTLR-ZLUOBGJFSA-N 0.000 description 1
- WCBVQNZTOKJWJS-ACZMJKKPSA-N Ala-Cys-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CS)C(=O)N[C@@H](CCC(O)=O)C(O)=O WCBVQNZTOKJWJS-ACZMJKKPSA-N 0.000 description 1
- IFTVANMRTIHKML-WDSKDSINSA-N Ala-Gln-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(O)=O IFTVANMRTIHKML-WDSKDSINSA-N 0.000 description 1
- SFNFGFDRYJKZKN-XQXXSGGOSA-N Ala-Gln-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](C)N)O SFNFGFDRYJKZKN-XQXXSGGOSA-N 0.000 description 1
- FUSPCLTUKXQREV-ACZMJKKPSA-N Ala-Glu-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O FUSPCLTUKXQREV-ACZMJKKPSA-N 0.000 description 1
- NWVVKQZOVSTDBQ-CIUDSAMLSA-N Ala-Glu-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O NWVVKQZOVSTDBQ-CIUDSAMLSA-N 0.000 description 1
- NJPMYXWVWQWCSR-ACZMJKKPSA-N Ala-Glu-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O NJPMYXWVWQWCSR-ACZMJKKPSA-N 0.000 description 1
- GGNHBHYDMUDXQB-KBIXCLLPSA-N Ala-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)N GGNHBHYDMUDXQB-KBIXCLLPSA-N 0.000 description 1
- YEVZMOUUZINZCK-LKTVYLICSA-N Ala-Glu-Trp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O YEVZMOUUZINZCK-LKTVYLICSA-N 0.000 description 1
- OMMDTNGURYRDAC-NRPADANISA-N Ala-Glu-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OMMDTNGURYRDAC-NRPADANISA-N 0.000 description 1
- WMYJZJRILUVVRG-WDSKDSINSA-N Ala-Gly-Gln Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(N)=O WMYJZJRILUVVRG-WDSKDSINSA-N 0.000 description 1
- BEMGNWZECGIJOI-WDSKDSINSA-N Ala-Gly-Glu Chemical compound [H]N[C@@H](C)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O BEMGNWZECGIJOI-WDSKDSINSA-N 0.000 description 1
- BTBUEVAGZCKULD-XPUUQOCRSA-N Ala-Gly-His Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CN=CN1 BTBUEVAGZCKULD-XPUUQOCRSA-N 0.000 description 1
- NBTGEURICRTMGL-WHFBIAKZSA-N Ala-Gly-Ser Chemical compound C[C@H](N)C(=O)NCC(=O)N[C@@H](CO)C(O)=O NBTGEURICRTMGL-WHFBIAKZSA-N 0.000 description 1
- OBVSBEYOMDWLRJ-BFHQHQDPSA-N Ala-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@H](C)N OBVSBEYOMDWLRJ-BFHQHQDPSA-N 0.000 description 1
- NYDBKUNVSALYPX-NAKRPEOUSA-N Ala-Ile-Arg Chemical compound C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@H](C(O)=O)CCCN=C(N)N NYDBKUNVSALYPX-NAKRPEOUSA-N 0.000 description 1
- FOHXUHGZZKETFI-JBDRJPRFSA-N Ala-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](C)N FOHXUHGZZKETFI-JBDRJPRFSA-N 0.000 description 1
- CKLDHDOIYBVUNP-KBIXCLLPSA-N Ala-Ile-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O CKLDHDOIYBVUNP-KBIXCLLPSA-N 0.000 description 1
- DVJSJDDYCYSMFR-ZKWXMUAHSA-N Ala-Ile-Gly Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O DVJSJDDYCYSMFR-ZKWXMUAHSA-N 0.000 description 1
- TZDNWXDLYFIFPT-BJDJZHNGSA-N Ala-Ile-Leu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(C)C)C(O)=O TZDNWXDLYFIFPT-BJDJZHNGSA-N 0.000 description 1
- LBYMZCVBOKYZNS-CIUDSAMLSA-N Ala-Leu-Asp Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O LBYMZCVBOKYZNS-CIUDSAMLSA-N 0.000 description 1
- SUMYEVXWCAYLLJ-GUBZILKMSA-N Ala-Leu-Gln Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O SUMYEVXWCAYLLJ-GUBZILKMSA-N 0.000 description 1
- OPZJWMJPCNNZNT-DCAQKATOSA-N Ala-Leu-Met Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCSC)C(=O)O)N OPZJWMJPCNNZNT-DCAQKATOSA-N 0.000 description 1
- OYJCVIGKMXUVKB-GARJFASQSA-N Ala-Leu-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@@H]1C(=O)O)N OYJCVIGKMXUVKB-GARJFASQSA-N 0.000 description 1
- MEFILNJXAVSUTO-JXUBOQSCSA-N Ala-Leu-Thr Chemical compound C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MEFILNJXAVSUTO-JXUBOQSCSA-N 0.000 description 1
- PMQXMXAASGFUDX-SRVKXCTJSA-N Ala-Lys-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)[C@H](C)N)CCCCN PMQXMXAASGFUDX-SRVKXCTJSA-N 0.000 description 1
- XUCHENWTTBFODJ-FXQIFTODSA-N Ala-Met-Ala Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](C)C(O)=O XUCHENWTTBFODJ-FXQIFTODSA-N 0.000 description 1
- DGLQWAFPIXDKRL-UBHSHLNASA-N Ala-Met-Phe Chemical compound C[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)O)N DGLQWAFPIXDKRL-UBHSHLNASA-N 0.000 description 1
- DHBKYZYFEXXUAK-ONGXEEELSA-N Ala-Phe-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](N)C)CC1=CC=CC=C1 DHBKYZYFEXXUAK-ONGXEEELSA-N 0.000 description 1
- RUXQNKVQSKOOBS-JURCDPSOSA-N Ala-Phe-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RUXQNKVQSKOOBS-JURCDPSOSA-N 0.000 description 1
- JAQNUEWEJWBVAY-WBAXXEDZSA-N Ala-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)C)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 JAQNUEWEJWBVAY-WBAXXEDZSA-N 0.000 description 1
- MAZZQZWCCYJQGZ-GUBZILKMSA-N Ala-Pro-Arg Chemical compound [H]N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MAZZQZWCCYJQGZ-GUBZILKMSA-N 0.000 description 1
- IORKCNUBHNIMKY-CIUDSAMLSA-N Ala-Pro-Glu Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O IORKCNUBHNIMKY-CIUDSAMLSA-N 0.000 description 1
- GMGWOTQMUKYZIE-UBHSHLNASA-N Ala-Pro-Phe Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 GMGWOTQMUKYZIE-UBHSHLNASA-N 0.000 description 1
- XWFWAXPOLRTDFZ-FXQIFTODSA-N Ala-Pro-Ser Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O XWFWAXPOLRTDFZ-FXQIFTODSA-N 0.000 description 1
- KLALXKYLOMZDQT-ZLUOBGJFSA-N Ala-Ser-Asn Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC(N)=O KLALXKYLOMZDQT-ZLUOBGJFSA-N 0.000 description 1
- MSWSRLGNLKHDEI-ACZMJKKPSA-N Ala-Ser-Glu Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O MSWSRLGNLKHDEI-ACZMJKKPSA-N 0.000 description 1
- RTZCUEHYUQZIDE-WHFBIAKZSA-N Ala-Ser-Gly Chemical compound C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RTZCUEHYUQZIDE-WHFBIAKZSA-N 0.000 description 1
- NHWYNIZWLJYZAG-XVYDVKMFSA-N Ala-Ser-His Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N NHWYNIZWLJYZAG-XVYDVKMFSA-N 0.000 description 1
- HOVPGJUNRLMIOZ-CIUDSAMLSA-N Ala-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@H](C)N HOVPGJUNRLMIOZ-CIUDSAMLSA-N 0.000 description 1
- OMCKWYSDUQBYCN-FXQIFTODSA-N Ala-Ser-Met Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O OMCKWYSDUQBYCN-FXQIFTODSA-N 0.000 description 1
- VRTOMXFZHGWHIJ-KZVJFYERSA-N Ala-Thr-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VRTOMXFZHGWHIJ-KZVJFYERSA-N 0.000 description 1
- AAWLEICNDUHIJM-MBLNEYKQSA-N Ala-Thr-His Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](C)N)O AAWLEICNDUHIJM-MBLNEYKQSA-N 0.000 description 1
- VNFSAYFQLXPHPY-CIQUZCHMSA-N Ala-Thr-Ile Chemical compound [H]N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O VNFSAYFQLXPHPY-CIQUZCHMSA-N 0.000 description 1
- AETQNIIFKCMVHP-UVBJJODRSA-N Ala-Trp-Arg Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AETQNIIFKCMVHP-UVBJJODRSA-N 0.000 description 1
- YXXPVUOMPSZURS-ZLIFDBKOSA-N Ala-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)[C@H](C)N)=CNC2=C1 YXXPVUOMPSZURS-ZLIFDBKOSA-N 0.000 description 1
- SFPRJVVDZNLUTG-OWLDWWDNSA-N Ala-Trp-Thr Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H]([C@@H](C)O)C(O)=O SFPRJVVDZNLUTG-OWLDWWDNSA-N 0.000 description 1
- MTDDMSUUXNQMKK-BPNCWPANSA-N Ala-Tyr-Arg Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N MTDDMSUUXNQMKK-BPNCWPANSA-N 0.000 description 1
- ZXKNLCPUNZPFGY-LEWSCRJBSA-N Ala-Tyr-Pro Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)N2CCC[C@@H]2C(=O)O)N ZXKNLCPUNZPFGY-LEWSCRJBSA-N 0.000 description 1
- YEBZNKPPOHFZJM-BPNCWPANSA-N Ala-Tyr-Val Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](C(C)C)C(O)=O YEBZNKPPOHFZJM-BPNCWPANSA-N 0.000 description 1
- ZCUFMRIQCPNOHZ-NRPADANISA-N Ala-Val-Gln Chemical compound C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(=O)N)C(=O)O)N ZCUFMRIQCPNOHZ-NRPADANISA-N 0.000 description 1
- REWSWYIDQIELBE-FXQIFTODSA-N Ala-Val-Ser Chemical compound [H]N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O REWSWYIDQIELBE-FXQIFTODSA-N 0.000 description 1
- MCYJBCKCAPERSE-FXQIFTODSA-N Arg-Ala-Asp Chemical compound OC(=O)C[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCCN=C(N)N MCYJBCKCAPERSE-FXQIFTODSA-N 0.000 description 1
- KWKQGHSSNHPGOW-BQBZGAKWSA-N Arg-Ala-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)NCC(O)=O KWKQGHSSNHPGOW-BQBZGAKWSA-N 0.000 description 1
- PEFFAAKJGBZBKL-NAKRPEOUSA-N Arg-Ala-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O PEFFAAKJGBZBKL-NAKRPEOUSA-N 0.000 description 1
- SBVJJNJLFWSJOV-UBHSHLNASA-N Arg-Ala-Phe Chemical compound NC(=N)NCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 SBVJJNJLFWSJOV-UBHSHLNASA-N 0.000 description 1
- JGDGLDNAQJJGJI-AVGNSLFASA-N Arg-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCCN=C(N)N)N JGDGLDNAQJJGJI-AVGNSLFASA-N 0.000 description 1
- OTCJMMRQBVDQRK-DCAQKATOSA-N Arg-Asp-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O OTCJMMRQBVDQRK-DCAQKATOSA-N 0.000 description 1
- RRGPUNYIPJXJBU-GUBZILKMSA-N Arg-Asp-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCSC)C(O)=O RRGPUNYIPJXJBU-GUBZILKMSA-N 0.000 description 1
- XTGGTAWGUFXJSV-NAKRPEOUSA-N Arg-Cys-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CS)NC(=O)[C@H](CCCN=C(N)N)N XTGGTAWGUFXJSV-NAKRPEOUSA-N 0.000 description 1
- ZEAYJGRKRUBDOB-GARJFASQSA-N Arg-Gln-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCCN=C(N)N)N)C(=O)O ZEAYJGRKRUBDOB-GARJFASQSA-N 0.000 description 1
- GOWZVQXTHUCNSQ-NHCYSSNCSA-N Arg-Glu-Val Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O GOWZVQXTHUCNSQ-NHCYSSNCSA-N 0.000 description 1
- AQPVUEJJARLJHB-BQBZGAKWSA-N Arg-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCN=C(N)N AQPVUEJJARLJHB-BQBZGAKWSA-N 0.000 description 1
- YNSGXDWWPCGGQS-YUMQZZPRSA-N Arg-Gly-Gln Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)NCC(=O)N[C@@H](CCC(N)=O)C(O)=O YNSGXDWWPCGGQS-YUMQZZPRSA-N 0.000 description 1
- RFXXUWGNVRJTNQ-QXEWZRGKSA-N Arg-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N RFXXUWGNVRJTNQ-QXEWZRGKSA-N 0.000 description 1
- SYAUZLVLXCDRSH-IUCAKERBSA-N Arg-Gly-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CCCN=C(N)N)N SYAUZLVLXCDRSH-IUCAKERBSA-N 0.000 description 1
- YBIAYFFIVAZXPK-AVGNSLFASA-N Arg-His-Arg Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O YBIAYFFIVAZXPK-AVGNSLFASA-N 0.000 description 1
- QEHMMRSQJMOYNO-DCAQKATOSA-N Arg-His-Asn Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N QEHMMRSQJMOYNO-DCAQKATOSA-N 0.000 description 1
- YQGZIRIYGHNSQO-ZPFDUUQYSA-N Arg-Ile-Gln Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N YQGZIRIYGHNSQO-ZPFDUUQYSA-N 0.000 description 1
- LKDHUGLXOHYINY-XUXIUFHCSA-N Arg-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LKDHUGLXOHYINY-XUXIUFHCSA-N 0.000 description 1
- GXXWTNKNFFKTJB-NAKRPEOUSA-N Arg-Ile-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CO)C(O)=O GXXWTNKNFFKTJB-NAKRPEOUSA-N 0.000 description 1
- CFGHCPUPFHWMCM-FDARSICLSA-N Arg-Ile-Trp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N CFGHCPUPFHWMCM-FDARSICLSA-N 0.000 description 1
- NMRHDSAOIURTNT-RWMBFGLXSA-N Arg-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N NMRHDSAOIURTNT-RWMBFGLXSA-N 0.000 description 1
- LCBSSOCDWUTQQV-SDDRHHMPSA-N Arg-Met-Pro Chemical compound CSCC[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N LCBSSOCDWUTQQV-SDDRHHMPSA-N 0.000 description 1
- CZUHPNLXLWMYMG-UBHSHLNASA-N Arg-Phe-Ala Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 CZUHPNLXLWMYMG-UBHSHLNASA-N 0.000 description 1
- BSGSDLYGGHGMND-IHRRRGAJSA-N Arg-Phe-Cys Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N BSGSDLYGGHGMND-IHRRRGAJSA-N 0.000 description 1
- WKPXXXUSUHAXDE-SRVKXCTJSA-N Arg-Pro-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCN=C(N)N)C(O)=O WKPXXXUSUHAXDE-SRVKXCTJSA-N 0.000 description 1
- UULLJGQFCDXVTQ-CYDGBPFRSA-N Arg-Pro-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)CC)C(O)=O UULLJGQFCDXVTQ-CYDGBPFRSA-N 0.000 description 1
- NGYHSXDNNOFHNE-AVGNSLFASA-N Arg-Pro-Leu Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O NGYHSXDNNOFHNE-AVGNSLFASA-N 0.000 description 1
- YCYXHLZRUSJITQ-SRVKXCTJSA-N Arg-Pro-Pro Chemical compound NC(=N)NCCC[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 YCYXHLZRUSJITQ-SRVKXCTJSA-N 0.000 description 1
- VUGWHBXPMAHEGZ-SRVKXCTJSA-N Arg-Pro-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CCCN=C(N)N VUGWHBXPMAHEGZ-SRVKXCTJSA-N 0.000 description 1
- VENMDXUVHSKEIN-GUBZILKMSA-N Arg-Ser-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O VENMDXUVHSKEIN-GUBZILKMSA-N 0.000 description 1
- UZSQXCMNUPKLCC-FJXKBIBVSA-N Arg-Thr-Gly Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(O)=O UZSQXCMNUPKLCC-FJXKBIBVSA-N 0.000 description 1
- YNSUUAOAFCVINY-OSUNSFLBSA-N Arg-Thr-Ile Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O YNSUUAOAFCVINY-OSUNSFLBSA-N 0.000 description 1
- JKRPBTQDPJSQIT-RCWTZXSCSA-N Arg-Thr-Met Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCCN=C(N)N)N)O JKRPBTQDPJSQIT-RCWTZXSCSA-N 0.000 description 1
- WTUZDHWWGUQEKN-SRVKXCTJSA-N Arg-Val-Met Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCSC)C(O)=O WTUZDHWWGUQEKN-SRVKXCTJSA-N 0.000 description 1
- CPTXATAOUQJQRO-GUBZILKMSA-N Arg-Val-Ser Chemical compound [H]N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O CPTXATAOUQJQRO-GUBZILKMSA-N 0.000 description 1
- LXTGAOAXPSJWOU-DCAQKATOSA-N Asn-Arg-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)N)N LXTGAOAXPSJWOU-DCAQKATOSA-N 0.000 description 1
- KXEGPPNPXOKKHK-ZLUOBGJFSA-N Asn-Asp-Ala Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(O)=O KXEGPPNPXOKKHK-ZLUOBGJFSA-N 0.000 description 1
- WPOLSNAQGVHROR-GUBZILKMSA-N Asn-Gln-Leu Chemical compound CC(C)C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)N)N WPOLSNAQGVHROR-GUBZILKMSA-N 0.000 description 1
- PLVAAIPKSGUXDV-WHFBIAKZSA-N Asn-Gly-Cys Chemical compound C([C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N)C(=O)N PLVAAIPKSGUXDV-WHFBIAKZSA-N 0.000 description 1
- AITGTTNYKAWKDR-CIUDSAMLSA-N Asn-His-Ser Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CO)C(O)=O AITGTTNYKAWKDR-CIUDSAMLSA-N 0.000 description 1
- ANPFQTJEPONRPL-UGYAYLCHSA-N Asn-Ile-Asp Chemical compound NC(=O)C[C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O ANPFQTJEPONRPL-UGYAYLCHSA-N 0.000 description 1
- PNHQRQTVBRDIEF-CIUDSAMLSA-N Asn-Leu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(=O)N)N PNHQRQTVBRDIEF-CIUDSAMLSA-N 0.000 description 1
- VHQSGALUSWIYOD-QXEWZRGKSA-N Asn-Pro-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O VHQSGALUSWIYOD-QXEWZRGKSA-N 0.000 description 1
- BCADFFUQHIMQAA-KKHAAJSZSA-N Asn-Thr-Val Chemical compound [H]N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(O)=O BCADFFUQHIMQAA-KKHAAJSZSA-N 0.000 description 1
- XEDQMTWEYFBOIK-ACZMJKKPSA-N Asp-Ala-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XEDQMTWEYFBOIK-ACZMJKKPSA-N 0.000 description 1
- XBQSLMACWDXWLJ-GHCJXIJMSA-N Asp-Ala-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O XBQSLMACWDXWLJ-GHCJXIJMSA-N 0.000 description 1
- OERMIMJQPQUIPK-FXQIFTODSA-N Asp-Arg-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O OERMIMJQPQUIPK-FXQIFTODSA-N 0.000 description 1
- DBWYWXNMZZYIRY-LPEHRKFASA-N Asp-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O DBWYWXNMZZYIRY-LPEHRKFASA-N 0.000 description 1
- XYBJLTKSGFBLCS-QXEWZRGKSA-N Asp-Arg-Val Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C(C)C)C(O)=O)NC(=O)[C@@H](N)CC(O)=O XYBJLTKSGFBLCS-QXEWZRGKSA-N 0.000 description 1
- GWTLRDMPMJCNMH-WHFBIAKZSA-N Asp-Asn-Gly Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(N)=O)C(=O)NCC(O)=O GWTLRDMPMJCNMH-WHFBIAKZSA-N 0.000 description 1
- UGIBTKGQVWFTGX-BIIVOSGPSA-N Asp-Asn-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)N)NC(=O)[C@H](CC(=O)O)N)C(=O)O UGIBTKGQVWFTGX-BIIVOSGPSA-N 0.000 description 1
- NYQHSUGFEWDWPD-ACZMJKKPSA-N Asp-Gln-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N NYQHSUGFEWDWPD-ACZMJKKPSA-N 0.000 description 1
- ZSJFGGSPCCHMNE-LAEOZQHASA-N Asp-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CC(=O)O)N ZSJFGGSPCCHMNE-LAEOZQHASA-N 0.000 description 1
- GHODABZPVZMWCE-FXQIFTODSA-N Asp-Glu-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O GHODABZPVZMWCE-FXQIFTODSA-N 0.000 description 1
- ZEDBMCPXPIYJLW-XHNCKOQMSA-N Asp-Glu-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC(=O)O)N)C(=O)O ZEDBMCPXPIYJLW-XHNCKOQMSA-N 0.000 description 1
- YNCHFVRXEQFPBY-BQBZGAKWSA-N Asp-Gly-Arg Chemical compound OC(=O)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N YNCHFVRXEQFPBY-BQBZGAKWSA-N 0.000 description 1
- VIRHEUMYXXLCBF-WDSKDSINSA-N Asp-Gly-Glu Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](CCC(O)=O)C(O)=O VIRHEUMYXXLCBF-WDSKDSINSA-N 0.000 description 1
- PZXPWHFYZXTFBI-YUMQZZPRSA-N Asp-Gly-Lys Chemical compound NCCCC[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CC(O)=O PZXPWHFYZXTFBI-YUMQZZPRSA-N 0.000 description 1
- WSGVTKZFVJSJOG-RCOVLWMOSA-N Asp-Gly-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O WSGVTKZFVJSJOG-RCOVLWMOSA-N 0.000 description 1
- TZOZNVLBTAFJRW-UGYAYLCHSA-N Asp-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC(=O)O)N TZOZNVLBTAFJRW-UGYAYLCHSA-N 0.000 description 1
- SPKCGKRUYKMDHP-GUDRVLHUSA-N Asp-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N SPKCGKRUYKMDHP-GUDRVLHUSA-N 0.000 description 1
- JNNVNVRBYUJYGS-CIUDSAMLSA-N Asp-Leu-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O JNNVNVRBYUJYGS-CIUDSAMLSA-N 0.000 description 1
- PAYPSKIBMDHZPI-CIUDSAMLSA-N Asp-Leu-Asp Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O PAYPSKIBMDHZPI-CIUDSAMLSA-N 0.000 description 1
- RQHLMGCXCZUOGT-ZPFDUUQYSA-N Asp-Leu-Ile Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O RQHLMGCXCZUOGT-ZPFDUUQYSA-N 0.000 description 1
- QNMKWNONJGKJJC-NHCYSSNCSA-N Asp-Leu-Val Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O QNMKWNONJGKJJC-NHCYSSNCSA-N 0.000 description 1
- AHWRSSLYSGLBGD-CIUDSAMLSA-N Asp-Pro-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O AHWRSSLYSGLBGD-CIUDSAMLSA-N 0.000 description 1
- QTIZKMMLNUMHHU-DCAQKATOSA-N Asp-Pro-His Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CC(=O)O)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QTIZKMMLNUMHHU-DCAQKATOSA-N 0.000 description 1
- BRRPVTUFESPTCP-ACZMJKKPSA-N Asp-Ser-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCC(O)=O BRRPVTUFESPTCP-ACZMJKKPSA-N 0.000 description 1
- HRVQDZOWMLFAOD-BIIVOSGPSA-N Asp-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CC(=O)O)N)C(=O)O HRVQDZOWMLFAOD-BIIVOSGPSA-N 0.000 description 1
- MNQMTYSEKZHIDF-GCJQMDKQSA-N Asp-Thr-Ala Chemical compound [H]N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O MNQMTYSEKZHIDF-GCJQMDKQSA-N 0.000 description 1
- PDIYGFYAMZZFCW-JIOCBJNQSA-N Asp-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CC(=O)O)N)O PDIYGFYAMZZFCW-JIOCBJNQSA-N 0.000 description 1
- USENATHVGFXRNO-SRVKXCTJSA-N Asp-Tyr-Asp Chemical compound OC(=O)C[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](CC(O)=O)C(O)=O)CC1=CC=C(O)C=C1 USENATHVGFXRNO-SRVKXCTJSA-N 0.000 description 1
- XMKXONRMGJXCJV-LAEOZQHASA-N Asp-Val-Glu Chemical compound OC(=O)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O XMKXONRMGJXCJV-LAEOZQHASA-N 0.000 description 1
- 241000193830 Bacillus <bacterium> Species 0.000 description 1
- 241000194110 Bacillus sp. (in: Bacteria) Species 0.000 description 1
- 244000063299 Bacillus subtilis Species 0.000 description 1
- 235000014469 Bacillus subtilis Nutrition 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 108020004705 Codon Proteins 0.000 description 1
- TVYMKYUSZSVOAG-ZLUOBGJFSA-N Cys-Ala-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O TVYMKYUSZSVOAG-ZLUOBGJFSA-N 0.000 description 1
- SZQCDCKIGWQAQN-FXQIFTODSA-N Cys-Arg-Ala Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O SZQCDCKIGWQAQN-FXQIFTODSA-N 0.000 description 1
- GEEXORWTBTUOHC-FXQIFTODSA-N Cys-Arg-Ser Chemical compound C(C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N)CN=C(N)N GEEXORWTBTUOHC-FXQIFTODSA-N 0.000 description 1
- DZSICRGTVPDCRN-YUMQZZPRSA-N Cys-Gly-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)CNC(=O)[C@H](CS)N DZSICRGTVPDCRN-YUMQZZPRSA-N 0.000 description 1
- OXOQBEVULIBOSH-ZDLURKLDSA-N Cys-Gly-Thr Chemical compound [H]N[C@@H](CS)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O OXOQBEVULIBOSH-ZDLURKLDSA-N 0.000 description 1
- YKKHFPGOZXQAGK-QWRGUYRKSA-N Cys-Gly-Tyr Chemical compound SC[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CC1=CC=C(O)C=C1 YKKHFPGOZXQAGK-QWRGUYRKSA-N 0.000 description 1
- XIZWKXATMJODQW-KKUMJFAQSA-N Cys-His-Phe Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)O)NC(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CS)N XIZWKXATMJODQW-KKUMJFAQSA-N 0.000 description 1
- VFGADOJXRLWTBU-JBDRJPRFSA-N Cys-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CS)N VFGADOJXRLWTBU-JBDRJPRFSA-N 0.000 description 1
- SRIRHERUAMYIOQ-CIUDSAMLSA-N Cys-Leu-Ser Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O SRIRHERUAMYIOQ-CIUDSAMLSA-N 0.000 description 1
- TXCCRYAZQBUCOV-CIUDSAMLSA-N Cys-Pro-Gln Chemical compound [H]N[C@@H](CS)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(O)=O TXCCRYAZQBUCOV-CIUDSAMLSA-N 0.000 description 1
- ZGERHCJBLPQPGV-ACZMJKKPSA-N Cys-Ser-Gln Chemical compound C(CC(=O)N)[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)[C@H](CS)N ZGERHCJBLPQPGV-ACZMJKKPSA-N 0.000 description 1
- NXQCSPVUPLUTJH-WHFBIAKZSA-N Cys-Ser-Gly Chemical compound SC[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O NXQCSPVUPLUTJH-WHFBIAKZSA-N 0.000 description 1
- ABLQPNMKLMFDQU-BIIVOSGPSA-N Cys-Ser-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CO)NC(=O)[C@H](CS)N)C(=O)O ABLQPNMKLMFDQU-BIIVOSGPSA-N 0.000 description 1
- VIOQRFNAZDMVLO-NRPADANISA-N Cys-Val-Glu Chemical compound [H]N[C@@H](CS)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O VIOQRFNAZDMVLO-NRPADANISA-N 0.000 description 1
- 102000053602 DNA Human genes 0.000 description 1
- 238000007400 DNA extraction Methods 0.000 description 1
- 206010059866 Drug resistance Diseases 0.000 description 1
- 241000588722 Escherichia Species 0.000 description 1
- 241001200922 Gagata Species 0.000 description 1
- YJIUYQKQBBQYHZ-ACZMJKKPSA-N Gln-Ala-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](C)C(O)=O YJIUYQKQBBQYHZ-ACZMJKKPSA-N 0.000 description 1
- HHWQMFIGMMOVFK-WDSKDSINSA-N Gln-Ala-Gly Chemical compound OC(=O)CNC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O HHWQMFIGMMOVFK-WDSKDSINSA-N 0.000 description 1
- UWZLBXOBVKRUFE-HGNGGELXSA-N Gln-Ala-His Chemical compound C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CCC(=O)N)N UWZLBXOBVKRUFE-HGNGGELXSA-N 0.000 description 1
- NUMFTVCBONFQIQ-DRZSPHRISA-N Gln-Ala-Phe Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O NUMFTVCBONFQIQ-DRZSPHRISA-N 0.000 description 1
- SHERTACNJPYHAR-ACZMJKKPSA-N Gln-Ala-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(N)=O SHERTACNJPYHAR-ACZMJKKPSA-N 0.000 description 1
- XXLBHPPXDUWYAG-XQXXSGGOSA-N Gln-Ala-Thr Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XXLBHPPXDUWYAG-XQXXSGGOSA-N 0.000 description 1
- KWUSGAIFNHQCBY-DCAQKATOSA-N Gln-Arg-Arg Chemical compound NC(=O)CC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O KWUSGAIFNHQCBY-DCAQKATOSA-N 0.000 description 1
- PGPJSRSLQNXBDT-YUMQZZPRSA-N Gln-Arg-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O PGPJSRSLQNXBDT-YUMQZZPRSA-N 0.000 description 1
- UFNSPPFJOHNXRE-AUTRQRHGSA-N Gln-Gln-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O UFNSPPFJOHNXRE-AUTRQRHGSA-N 0.000 description 1
- HWEINOMSWQSJDC-SRVKXCTJSA-N Gln-Leu-Arg Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O HWEINOMSWQSJDC-SRVKXCTJSA-N 0.000 description 1
- XFAUJGNLHIGXET-AVGNSLFASA-N Gln-Leu-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(O)=O XFAUJGNLHIGXET-AVGNSLFASA-N 0.000 description 1
- YPMDZWPZFOZYFG-GUBZILKMSA-N Gln-Leu-Ser Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O YPMDZWPZFOZYFG-GUBZILKMSA-N 0.000 description 1
- FKXCBKCOSVIGCT-AVGNSLFASA-N Gln-Lys-Leu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O FKXCBKCOSVIGCT-AVGNSLFASA-N 0.000 description 1
- HMIXCETWRYDVMO-GUBZILKMSA-N Gln-Pro-Glu Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(O)=O)C(O)=O HMIXCETWRYDVMO-GUBZILKMSA-N 0.000 description 1
- DCWNCMRZIZSZBL-KKUMJFAQSA-N Gln-Pro-Tyr Chemical compound C1C[C@H](N(C1)C(=O)[C@H](CCC(=O)N)N)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)O DCWNCMRZIZSZBL-KKUMJFAQSA-N 0.000 description 1
- OSCLNNWLKKIQJM-WDSKDSINSA-N Gln-Ser-Gly Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)NCC(O)=O OSCLNNWLKKIQJM-WDSKDSINSA-N 0.000 description 1
- OTQSTOXRUBVWAP-NRPADANISA-N Gln-Ser-Val Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O OTQSTOXRUBVWAP-NRPADANISA-N 0.000 description 1
- PAOHIZNRJNIXQY-XQXXSGGOSA-N Gln-Thr-Ala Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O PAOHIZNRJNIXQY-XQXXSGGOSA-N 0.000 description 1
- VDMABHYXBULDGN-LAEOZQHASA-N Gln-Val-Asp Chemical compound [H]N[C@@H](CCC(N)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O VDMABHYXBULDGN-LAEOZQHASA-N 0.000 description 1
- GJLXZITZLUUXMJ-NHCYSSNCSA-N Gln-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CCC(=O)N)N GJLXZITZLUUXMJ-NHCYSSNCSA-N 0.000 description 1
- RUFHOVYUYSNDNY-ACZMJKKPSA-N Glu-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O RUFHOVYUYSNDNY-ACZMJKKPSA-N 0.000 description 1
- OGMQXTXGLDNBSS-FXQIFTODSA-N Glu-Ala-Gln Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O OGMQXTXGLDNBSS-FXQIFTODSA-N 0.000 description 1
- MXOODARRORARSU-ACZMJKKPSA-N Glu-Ala-Ser Chemical compound C[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](CCC(=O)O)N MXOODARRORARSU-ACZMJKKPSA-N 0.000 description 1
- KBKGRMNVKPSQIF-XDTLVQLUSA-N Glu-Ala-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O KBKGRMNVKPSQIF-XDTLVQLUSA-N 0.000 description 1
- WOMUDRVDJMHTCV-DCAQKATOSA-N Glu-Arg-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O WOMUDRVDJMHTCV-DCAQKATOSA-N 0.000 description 1
- DIXKFOPPGWKZLY-CIUDSAMLSA-N Glu-Arg-Asp Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O DIXKFOPPGWKZLY-CIUDSAMLSA-N 0.000 description 1
- VPKBCVUDBNINAH-GARJFASQSA-N Glu-Arg-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VPKBCVUDBNINAH-GARJFASQSA-N 0.000 description 1
- GCYFUZJHAXJKKE-KKUMJFAQSA-N Glu-Arg-Tyr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O GCYFUZJHAXJKKE-KKUMJFAQSA-N 0.000 description 1
- OXEMJGCAJFFREE-FXQIFTODSA-N Glu-Gln-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O OXEMJGCAJFFREE-FXQIFTODSA-N 0.000 description 1
- GFLQTABMFBXRIY-GUBZILKMSA-N Glu-Gln-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O GFLQTABMFBXRIY-GUBZILKMSA-N 0.000 description 1
- PVBBEKPHARMPHX-DCAQKATOSA-N Glu-Gln-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCC(O)=O PVBBEKPHARMPHX-DCAQKATOSA-N 0.000 description 1
- HTTSBEBKVNEDFE-AUTRQRHGSA-N Glu-Gln-Val Chemical compound CC(C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)N)NC(=O)[C@H](CCC(=O)O)N HTTSBEBKVNEDFE-AUTRQRHGSA-N 0.000 description 1
- NKLRYVLERDYDBI-FXQIFTODSA-N Glu-Glu-Asp Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O NKLRYVLERDYDBI-FXQIFTODSA-N 0.000 description 1
- QJCKNLPMTPXXEM-AUTRQRHGSA-N Glu-Glu-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)CCC(O)=O QJCKNLPMTPXXEM-AUTRQRHGSA-N 0.000 description 1
- HILMIYALTUQTRC-XVKPBYJWSA-N Glu-Gly-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O HILMIYALTUQTRC-XVKPBYJWSA-N 0.000 description 1
- JGHNIWVNCAOVRO-DCAQKATOSA-N Glu-His-Glu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCC(O)=O)C(O)=O JGHNIWVNCAOVRO-DCAQKATOSA-N 0.000 description 1
- VGOFRWOTSXVPAU-SDDRHHMPSA-N Glu-His-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC2=CN=CN2)NC(=O)[C@H](CCC(=O)O)N)C(=O)O VGOFRWOTSXVPAU-SDDRHHMPSA-N 0.000 description 1
- ITBHUUMCJJQUSC-LAEOZQHASA-N Glu-Ile-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)NCC(O)=O ITBHUUMCJJQUSC-LAEOZQHASA-N 0.000 description 1
- ATVYZJGOZLVXDK-IUCAKERBSA-N Glu-Leu-Gly Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)NCC(O)=O ATVYZJGOZLVXDK-IUCAKERBSA-N 0.000 description 1
- NJCALAAIGREHDR-WDCWCFNPSA-N Glu-Leu-Thr Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NJCALAAIGREHDR-WDCWCFNPSA-N 0.000 description 1
- ILWHFUZZCFYSKT-AVGNSLFASA-N Glu-Lys-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O ILWHFUZZCFYSKT-AVGNSLFASA-N 0.000 description 1
- BPLNJYHNAJVLRT-ACZMJKKPSA-N Glu-Ser-Ala Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O BPLNJYHNAJVLRT-ACZMJKKPSA-N 0.000 description 1
- VNCNWQPIQYAMAK-ACZMJKKPSA-N Glu-Ser-Ser Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O VNCNWQPIQYAMAK-ACZMJKKPSA-N 0.000 description 1
- DMYACXMQUABZIQ-NRPADANISA-N Glu-Ser-Val Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O DMYACXMQUABZIQ-NRPADANISA-N 0.000 description 1
- TWYSSILQABLLME-HJGDQZAQSA-N Glu-Thr-Arg Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O TWYSSILQABLLME-HJGDQZAQSA-N 0.000 description 1
- YQAQQKPWFOBSMU-WDCWCFNPSA-N Glu-Thr-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O YQAQQKPWFOBSMU-WDCWCFNPSA-N 0.000 description 1
- HGJREIGJLUQBTJ-SZMVWBNQSA-N Glu-Trp-Leu Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CC(C)C)C(O)=O HGJREIGJLUQBTJ-SZMVWBNQSA-N 0.000 description 1
- HQTDNEZTGZUWSY-XVKPBYJWSA-N Glu-Val-Gly Chemical compound CC(C)[C@H](NC(=O)[C@@H](N)CCC(O)=O)C(=O)NCC(O)=O HQTDNEZTGZUWSY-XVKPBYJWSA-N 0.000 description 1
- FGGKGJHCVMYGCD-UKJIMTQDSA-N Glu-Val-Ile Chemical compound [H]N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O FGGKGJHCVMYGCD-UKJIMTQDSA-N 0.000 description 1
- SOYWRINXUSUWEQ-DLOVCJGASA-N Glu-Val-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)CCC(O)=O SOYWRINXUSUWEQ-DLOVCJGASA-N 0.000 description 1
- PUUYVMYCMIWHFE-BQBZGAKWSA-N Gly-Ala-Arg Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N PUUYVMYCMIWHFE-BQBZGAKWSA-N 0.000 description 1
- MFVQGXGQRIXBPK-WDSKDSINSA-N Gly-Ala-Glu Chemical compound NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O MFVQGXGQRIXBPK-WDSKDSINSA-N 0.000 description 1
- UGVQELHRNUDMAA-BYPYZUCNSA-N Gly-Ala-Gly Chemical compound [NH3+]CC(=O)N[C@@H](C)C(=O)NCC([O-])=O UGVQELHRNUDMAA-BYPYZUCNSA-N 0.000 description 1
- JRDYDYXZKFNNRQ-XPUUQOCRSA-N Gly-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)CN JRDYDYXZKFNNRQ-XPUUQOCRSA-N 0.000 description 1
- RJIVPOXLQFJRTG-LURJTMIESA-N Gly-Arg-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N RJIVPOXLQFJRTG-LURJTMIESA-N 0.000 description 1
- OCQUNKSFDYDXBG-QXEWZRGKSA-N Gly-Arg-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N OCQUNKSFDYDXBG-QXEWZRGKSA-N 0.000 description 1
- KKBWDNZXYLGJEY-UHFFFAOYSA-N Gly-Arg-Pro Natural products NCC(=O)NC(CCNC(=N)N)C(=O)N1CCCC1C(=O)O KKBWDNZXYLGJEY-UHFFFAOYSA-N 0.000 description 1
- WKJKBELXHCTHIJ-WPRPVWTQSA-N Gly-Arg-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CCCN=C(N)N WKJKBELXHCTHIJ-WPRPVWTQSA-N 0.000 description 1
- IWAXHBCACVWNHT-BQBZGAKWSA-N Gly-Asp-Arg Chemical compound NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IWAXHBCACVWNHT-BQBZGAKWSA-N 0.000 description 1
- FUTAPPOITCCWTH-WHFBIAKZSA-N Gly-Asp-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O FUTAPPOITCCWTH-WHFBIAKZSA-N 0.000 description 1
- FZQLXNIMCPJVJE-YUMQZZPRSA-N Gly-Asp-Leu Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O FZQLXNIMCPJVJE-YUMQZZPRSA-N 0.000 description 1
- MHHUEAIBJZWDBH-YUMQZZPRSA-N Gly-Asp-Lys Chemical compound C(CCN)C[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN MHHUEAIBJZWDBH-YUMQZZPRSA-N 0.000 description 1
- ZRZILYKEJBMFHY-BQBZGAKWSA-N Gly-Asp-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)CN ZRZILYKEJBMFHY-BQBZGAKWSA-N 0.000 description 1
- RPLLQZBOVIVGMX-QWRGUYRKSA-N Gly-Asp-Phe Chemical compound [H]NCC(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O RPLLQZBOVIVGMX-QWRGUYRKSA-N 0.000 description 1
- LXXLEUBUOMCAMR-NKWVEPMBSA-N Gly-Asp-Pro Chemical compound C1C[C@@H](N(C1)C(=O)[C@H](CC(=O)O)NC(=O)CN)C(=O)O LXXLEUBUOMCAMR-NKWVEPMBSA-N 0.000 description 1
- JUGQPPOVWXSPKJ-RYUDHWBXSA-N Gly-Gln-Phe Chemical compound [H]NCC(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O JUGQPPOVWXSPKJ-RYUDHWBXSA-N 0.000 description 1
- QPDUVFSVVAOUHE-XVKPBYJWSA-N Gly-Gln-Val Chemical compound CC(C)[C@H](NC(=O)[C@H](CCC(N)=O)NC(=O)CN)C(O)=O QPDUVFSVVAOUHE-XVKPBYJWSA-N 0.000 description 1
- MOJKRXIRAZPZLW-WDSKDSINSA-N Gly-Glu-Ala Chemical compound [H]NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O MOJKRXIRAZPZLW-WDSKDSINSA-N 0.000 description 1
- DHDOADIPGZTAHT-YUMQZZPRSA-N Gly-Glu-Arg Chemical compound NCC(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DHDOADIPGZTAHT-YUMQZZPRSA-N 0.000 description 1
- HQRHFUYMGCHHJS-LURJTMIESA-N Gly-Gly-Arg Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CCCN=C(N)N HQRHFUYMGCHHJS-LURJTMIESA-N 0.000 description 1
- UFPXDFOYHVEIPI-BYPYZUCNSA-N Gly-Gly-Asp Chemical compound NCC(=O)NCC(=O)N[C@H](C(O)=O)CC(O)=O UFPXDFOYHVEIPI-BYPYZUCNSA-N 0.000 description 1
- XMPXVJIDADUOQB-RCOVLWMOSA-N Gly-Gly-Ile Chemical compound CC[C@H](C)[C@@H](C([O-])=O)NC(=O)CNC(=O)C[NH3+] XMPXVJIDADUOQB-RCOVLWMOSA-N 0.000 description 1
- FQKKPCWTZZEDIC-XPUUQOCRSA-N Gly-His-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 FQKKPCWTZZEDIC-XPUUQOCRSA-N 0.000 description 1
- ORXZVPZCPMKHNR-IUCAKERBSA-N Gly-His-Glu Chemical compound OC(=O)CC[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 ORXZVPZCPMKHNR-IUCAKERBSA-N 0.000 description 1
- ADZGCWWDPFDHCY-ZETCQYMHSA-N Gly-His-Gly Chemical compound OC(=O)CNC(=O)[C@@H](NC(=O)CN)CC1=CN=CN1 ADZGCWWDPFDHCY-ZETCQYMHSA-N 0.000 description 1
- FSPVILZGHUJOHS-QWRGUYRKSA-N Gly-His-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CNC=N1 FSPVILZGHUJOHS-QWRGUYRKSA-N 0.000 description 1
- UTYGDAHJBBDPBA-BYULHYEWSA-N Gly-Ile-Asp Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)CN UTYGDAHJBBDPBA-BYULHYEWSA-N 0.000 description 1
- COVXELOAORHTND-LSJOCFKGSA-N Gly-Ile-Val Chemical compound NCC(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(O)=O COVXELOAORHTND-LSJOCFKGSA-N 0.000 description 1
- PAWIVEIWWYGBAM-YUMQZZPRSA-N Gly-Leu-Ala Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O PAWIVEIWWYGBAM-YUMQZZPRSA-N 0.000 description 1
- YTSVAIMKVLZUDU-YUMQZZPRSA-N Gly-Leu-Asp Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O YTSVAIMKVLZUDU-YUMQZZPRSA-N 0.000 description 1
- TWTPDFFBLQEBOE-IUCAKERBSA-N Gly-Leu-Gln Chemical compound [H]NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O TWTPDFFBLQEBOE-IUCAKERBSA-N 0.000 description 1
- ULZCYBYDTUMHNF-IUCAKERBSA-N Gly-Leu-Glu Chemical compound NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O ULZCYBYDTUMHNF-IUCAKERBSA-N 0.000 description 1
- UUYBFNKHOCJCHT-VHSXEESVSA-N Gly-Leu-Pro Chemical compound CC(C)C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)CN UUYBFNKHOCJCHT-VHSXEESVSA-N 0.000 description 1
- MHXKHKWHPNETGG-QWRGUYRKSA-N Gly-Lys-Leu Chemical compound [H]NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O MHXKHKWHPNETGG-QWRGUYRKSA-N 0.000 description 1
- HHRODZSXDXMUHS-LURJTMIESA-N Gly-Met-Gly Chemical compound CSCC[C@H](NC(=O)C[NH3+])C(=O)NCC([O-])=O HHRODZSXDXMUHS-LURJTMIESA-N 0.000 description 1
- IBYOLNARKHMLBG-WHOFXGATSA-N Gly-Phe-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 IBYOLNARKHMLBG-WHOFXGATSA-N 0.000 description 1
- JPVGHHQGKPQYIL-KBPBESRZSA-N Gly-Phe-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H](NC(=O)CN)CC1=CC=CC=C1 JPVGHHQGKPQYIL-KBPBESRZSA-N 0.000 description 1
- VDCRBJACQKOSMS-JSGCOSHPSA-N Gly-Phe-Val Chemical compound [H]NCC(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C(C)C)C(O)=O VDCRBJACQKOSMS-JSGCOSHPSA-N 0.000 description 1
- JYPCXBJRLBHWME-IUCAKERBSA-N Gly-Pro-Arg Chemical compound NCC(=O)N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(O)=O JYPCXBJRLBHWME-IUCAKERBSA-N 0.000 description 1
- IRJWAYCXIYUHQE-WHFBIAKZSA-N Gly-Ser-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)CN IRJWAYCXIYUHQE-WHFBIAKZSA-N 0.000 description 1
- MKIAPEZXQDILRR-YUMQZZPRSA-N Gly-Ser-His Chemical compound C1=C(NC=N1)C[C@@H](C(=O)O)NC(=O)[C@H](CO)NC(=O)CN MKIAPEZXQDILRR-YUMQZZPRSA-N 0.000 description 1
- JSLVAHYTAJJEQH-QWRGUYRKSA-N Gly-Ser-Phe Chemical compound NCC(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 JSLVAHYTAJJEQH-QWRGUYRKSA-N 0.000 description 1
- LCRDMSSAKLTKBU-ZDLURKLDSA-N Gly-Ser-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN LCRDMSSAKLTKBU-ZDLURKLDSA-N 0.000 description 1
- FFALDIDGPLUDKV-ZDLURKLDSA-N Gly-Thr-Ser Chemical compound [H]NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(O)=O FFALDIDGPLUDKV-ZDLURKLDSA-N 0.000 description 1
- RIUZKUJUPVFAGY-HOTGVXAUSA-N Gly-Trp-His Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CN=CN3)C(=O)O)NC(=O)CN RIUZKUJUPVFAGY-HOTGVXAUSA-N 0.000 description 1
- SFOXOSKVTLDEDM-HOTGVXAUSA-N Gly-Trp-Leu Chemical compound C1=CC=C2C(C[C@@H](C(=O)N[C@@H](CC(C)C)C(O)=O)NC(=O)CN)=CNC2=C1 SFOXOSKVTLDEDM-HOTGVXAUSA-N 0.000 description 1
- GWCJMBNBFYBQCV-XPUUQOCRSA-N Gly-Val-Ala Chemical compound NCC(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O GWCJMBNBFYBQCV-XPUUQOCRSA-N 0.000 description 1
- BAYQNCWLXIDLHX-ONGXEEELSA-N Gly-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)CN BAYQNCWLXIDLHX-ONGXEEELSA-N 0.000 description 1
- MUGLKCQHTUFLGF-WPRPVWTQSA-N Gly-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)CN MUGLKCQHTUFLGF-WPRPVWTQSA-N 0.000 description 1
- KZTLOHBDLMIFSH-XVYDVKMFSA-N His-Ala-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(O)=O)C(O)=O KZTLOHBDLMIFSH-XVYDVKMFSA-N 0.000 description 1
- AFPFGFUGETYOSY-HGNGGELXSA-N His-Ala-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O AFPFGFUGETYOSY-HGNGGELXSA-N 0.000 description 1
- AWASVTXPTOLPPP-MBLNEYKQSA-N His-Ala-Thr Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O AWASVTXPTOLPPP-MBLNEYKQSA-N 0.000 description 1
- HXKZJLWGSWQKEA-LSJOCFKGSA-N His-Ala-Val Chemical compound CC(C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CN=CN1 HXKZJLWGSWQKEA-LSJOCFKGSA-N 0.000 description 1
- TVQGUFGDVODUIF-LSJOCFKGSA-N His-Arg-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCN=C(N)N)NC(=O)[C@H](CC1=CN=CN1)N TVQGUFGDVODUIF-LSJOCFKGSA-N 0.000 description 1
- SVHKVHBPTOMLTO-DCAQKATOSA-N His-Arg-Asp Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(O)=O)C(O)=O SVHKVHBPTOMLTO-DCAQKATOSA-N 0.000 description 1
- JBJNKUOMNZGQIM-PYJNHQTQSA-N His-Arg-Ile Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O JBJNKUOMNZGQIM-PYJNHQTQSA-N 0.000 description 1
- CJGDTAHEMXLRMB-ULQDDVLXSA-N His-Arg-Phe Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O CJGDTAHEMXLRMB-ULQDDVLXSA-N 0.000 description 1
- KYMUEAZVLPRVAE-GUBZILKMSA-N His-Asn-Glu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O KYMUEAZVLPRVAE-GUBZILKMSA-N 0.000 description 1
- LYSMQLXUCAKELQ-DCAQKATOSA-N His-Asp-Arg Chemical compound C1=C(NC=N1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N LYSMQLXUCAKELQ-DCAQKATOSA-N 0.000 description 1
- ZZLWLWSUIBSMNP-CIUDSAMLSA-N His-Asp-Ser Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CO)C(O)=O ZZLWLWSUIBSMNP-CIUDSAMLSA-N 0.000 description 1
- PQKCQZHAGILVIM-NKIYYHGXSA-N His-Glu-Thr Chemical compound C[C@@H](O)[C@H](NC(=O)[C@H](CCC(O)=O)NC(=O)[C@@H](N)Cc1cnc[nH]1)C(O)=O PQKCQZHAGILVIM-NKIYYHGXSA-N 0.000 description 1
- VYUXYMRNGALHEA-DLOVCJGASA-N His-Leu-Ala Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O VYUXYMRNGALHEA-DLOVCJGASA-N 0.000 description 1
- AYUOWUNWZGTNKB-ULQDDVLXSA-N His-Phe-Arg Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AYUOWUNWZGTNKB-ULQDDVLXSA-N 0.000 description 1
- VCBWXASUBZIFLQ-IHRRRGAJSA-N His-Pro-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O VCBWXASUBZIFLQ-IHRRRGAJSA-N 0.000 description 1
- VIJMRAIWYWRXSR-CIUDSAMLSA-N His-Ser-Ser Chemical compound OC[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC1=CN=CN1 VIJMRAIWYWRXSR-CIUDSAMLSA-N 0.000 description 1
- CCUSLCQWVMWTIS-IXOXFDKPSA-N His-Thr-Leu Chemical compound [H]N[C@@H](CC1=CNC=N1)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O CCUSLCQWVMWTIS-IXOXFDKPSA-N 0.000 description 1
- JXUGDUWBMKIJDC-NAKRPEOUSA-N Ile-Ala-Arg Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O JXUGDUWBMKIJDC-NAKRPEOUSA-N 0.000 description 1
- CISBRYJZMFWOHJ-JBDRJPRFSA-N Ile-Ala-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CS)C(=O)O)N CISBRYJZMFWOHJ-JBDRJPRFSA-N 0.000 description 1
- VAXBXNPRXPHGHG-BJDJZHNGSA-N Ile-Ala-Leu Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(=O)O)N VAXBXNPRXPHGHG-BJDJZHNGSA-N 0.000 description 1
- DMHGKBGOUAJRHU-UHFFFAOYSA-N Ile-Arg-Pro Natural products CCC(C)C(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O DMHGKBGOUAJRHU-UHFFFAOYSA-N 0.000 description 1
- IIXDMJNYALIKGP-DJFWLOJKSA-N Ile-Asn-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N IIXDMJNYALIKGP-DJFWLOJKSA-N 0.000 description 1
- QIHJTGSVGIPHIW-QSFUFRPTSA-N Ile-Asn-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](C(C)C)C(=O)O)N QIHJTGSVGIPHIW-QSFUFRPTSA-N 0.000 description 1
- NKRJALPCDNXULF-BYULHYEWSA-N Ile-Asp-Gly Chemical compound [H]N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O NKRJALPCDNXULF-BYULHYEWSA-N 0.000 description 1
- JQLFYZMEXFNRFS-DJFWLOJKSA-N Ile-Asp-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N JQLFYZMEXFNRFS-DJFWLOJKSA-N 0.000 description 1
- AQTWDZDISVGCAC-CFMVVWHZSA-N Ile-Asp-Tyr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CC=C(C=C1)O)C(=O)O)N AQTWDZDISVGCAC-CFMVVWHZSA-N 0.000 description 1
- TVSPLSZTKTUYLV-ZPFDUUQYSA-N Ile-Glu-Met Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CCSC)C(=O)O TVSPLSZTKTUYLV-ZPFDUUQYSA-N 0.000 description 1
- JLWLMGADIQFKRD-QSFUFRPTSA-N Ile-His-Ala Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CN=CN1 JLWLMGADIQFKRD-QSFUFRPTSA-N 0.000 description 1
- QZZIBQZLWBOOJH-PEDHHIEDSA-N Ile-Ile-Val Chemical compound N[C@@H]([C@@H](C)CC)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](C(C)C)C(=O)O QZZIBQZLWBOOJH-PEDHHIEDSA-N 0.000 description 1
- KLBVGHCGHUNHEA-BJDJZHNGSA-N Ile-Leu-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(=O)O)N KLBVGHCGHUNHEA-BJDJZHNGSA-N 0.000 description 1
- FZWVCYCYWCLQDH-NHCYSSNCSA-N Ile-Leu-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)O)N FZWVCYCYWCLQDH-NHCYSSNCSA-N 0.000 description 1
- DBXXASNNDTXOLU-MXAVVETBSA-N Ile-Leu-His Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N DBXXASNNDTXOLU-MXAVVETBSA-N 0.000 description 1
- PMMMQRVUMVURGJ-XUXIUFHCSA-N Ile-Leu-Pro Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(O)=O PMMMQRVUMVURGJ-XUXIUFHCSA-N 0.000 description 1
- PHRWFSFCNJPWRO-PPCPHDFISA-N Ile-Leu-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N PHRWFSFCNJPWRO-PPCPHDFISA-N 0.000 description 1
- DSDPLOODKXISDT-XUXIUFHCSA-N Ile-Leu-Val Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O DSDPLOODKXISDT-XUXIUFHCSA-N 0.000 description 1
- GLYJPWIRLBAIJH-FQUUOJAGSA-N Ile-Lys-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N1CCC[C@@H]1C(=O)O)N GLYJPWIRLBAIJH-FQUUOJAGSA-N 0.000 description 1
- GLYJPWIRLBAIJH-UHFFFAOYSA-N Ile-Lys-Pro Natural products CCC(C)C(N)C(=O)NC(CCCCN)C(=O)N1CCCC1C(O)=O GLYJPWIRLBAIJH-UHFFFAOYSA-N 0.000 description 1
- VEPIBPGLTLPBDW-URLPEUOOSA-N Ile-Phe-Thr Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H]([C@@H](C)O)C(=O)O)N VEPIBPGLTLPBDW-URLPEUOOSA-N 0.000 description 1
- ZLFNNVATRMCAKN-ZKWXMUAHSA-N Ile-Ser-Gly Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)NCC(=O)O)N ZLFNNVATRMCAKN-ZKWXMUAHSA-N 0.000 description 1
- NURNJECQNNCRBK-FLBSBUHZSA-N Ile-Thr-Thr Chemical compound CC[C@H](C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O NURNJECQNNCRBK-FLBSBUHZSA-N 0.000 description 1
- YJRSIJZUIUANHO-NAKRPEOUSA-N Ile-Val-Ala Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(=O)O)N YJRSIJZUIUANHO-NAKRPEOUSA-N 0.000 description 1
- JZBVBOKASHNXAD-NAKRPEOUSA-N Ile-Val-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(=O)O)N JZBVBOKASHNXAD-NAKRPEOUSA-N 0.000 description 1
- YHFPHRUWZMEOIX-CYDGBPFRSA-N Ile-Val-Val Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)O)N YHFPHRUWZMEOIX-CYDGBPFRSA-N 0.000 description 1
- PMGDADKJMCOXHX-UHFFFAOYSA-N L-Arginyl-L-glutamin-acetat Natural products NC(=N)NCCCC(N)C(=O)NC(CCC(N)=O)C(O)=O PMGDADKJMCOXHX-UHFFFAOYSA-N 0.000 description 1
- IBMVEYRWAWIOTN-UHFFFAOYSA-N L-Leucyl-L-Arginyl-L-Proline Natural products CC(C)CC(N)C(=O)NC(CCCN=C(N)N)C(=O)N1CCCC1C(O)=O IBMVEYRWAWIOTN-UHFFFAOYSA-N 0.000 description 1
- UGTHTQWIQKEDEH-BQBZGAKWSA-N L-alanyl-L-prolylglycine zwitterion Chemical compound C[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O UGTHTQWIQKEDEH-BQBZGAKWSA-N 0.000 description 1
- LHSGPCFBGJHPCY-UHFFFAOYSA-N L-leucine-L-tyrosine Natural products CC(C)CC(N)C(=O)NC(C(O)=O)CC1=CC=C(O)C=C1 LHSGPCFBGJHPCY-UHFFFAOYSA-N 0.000 description 1
- LZDNBBYBDGBADK-UHFFFAOYSA-N L-valyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)C(N)C(C)C)C(O)=O)=CNC2=C1 LZDNBBYBDGBADK-UHFFFAOYSA-N 0.000 description 1
- 241000880493 Leptailurus serval Species 0.000 description 1
- MJOZZTKJZQFKDK-GUBZILKMSA-N Leu-Ala-Gln Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(N)=O MJOZZTKJZQFKDK-GUBZILKMSA-N 0.000 description 1
- PBCHMHROGNUXMK-DLOVCJGASA-N Leu-Ala-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 PBCHMHROGNUXMK-DLOVCJGASA-N 0.000 description 1
- XIRYQRLFHWWWTC-QEJZJMRPSA-N Leu-Ala-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XIRYQRLFHWWWTC-QEJZJMRPSA-N 0.000 description 1
- WSGXUIQTEZDVHJ-GARJFASQSA-N Leu-Ala-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@@H]1C(O)=O WSGXUIQTEZDVHJ-GARJFASQSA-N 0.000 description 1
- KSZCCRIGNVSHFH-UWVGGRQHSA-N Leu-Arg-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)NCC(O)=O KSZCCRIGNVSHFH-UWVGGRQHSA-N 0.000 description 1
- UCOCBWDBHCUPQP-DCAQKATOSA-N Leu-Arg-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CO)C(O)=O UCOCBWDBHCUPQP-DCAQKATOSA-N 0.000 description 1
- STAVRDQLZOTNKJ-RHYQMDGZSA-N Leu-Arg-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(O)=O STAVRDQLZOTNKJ-RHYQMDGZSA-N 0.000 description 1
- IGUOAYLTQJLPPD-DCAQKATOSA-N Leu-Asn-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IGUOAYLTQJLPPD-DCAQKATOSA-N 0.000 description 1
- VPKIQULSKFVCSM-SRVKXCTJSA-N Leu-Gln-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VPKIQULSKFVCSM-SRVKXCTJSA-N 0.000 description 1
- WIDZHJTYKYBLSR-DCAQKATOSA-N Leu-Glu-Glu Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(O)=O WIDZHJTYKYBLSR-DCAQKATOSA-N 0.000 description 1
- NEEOBPIXKWSBRF-IUCAKERBSA-N Leu-Glu-Gly Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(O)=O NEEOBPIXKWSBRF-IUCAKERBSA-N 0.000 description 1
- IWTBYNQNAPECCS-AVGNSLFASA-N Leu-Glu-His Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 IWTBYNQNAPECCS-AVGNSLFASA-N 0.000 description 1
- HPBCTWSUJOGJSH-MNXVOIDGSA-N Leu-Glu-Ile Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(O)=O HPBCTWSUJOGJSH-MNXVOIDGSA-N 0.000 description 1
- HQUXQAMSWFIRET-AVGNSLFASA-N Leu-Glu-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@H](C(O)=O)CCCCN HQUXQAMSWFIRET-AVGNSLFASA-N 0.000 description 1
- BABSVXFGKFLIGW-UWVGGRQHSA-N Leu-Gly-Arg Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCCNC(N)=N BABSVXFGKFLIGW-UWVGGRQHSA-N 0.000 description 1
- YFBBUHJJUXXZOF-UWVGGRQHSA-N Leu-Gly-Pro Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N1CCC[C@H]1C(O)=O YFBBUHJJUXXZOF-UWVGGRQHSA-N 0.000 description 1
- HYMLKESRWLZDBR-WEDXCCLWSA-N Leu-Gly-Thr Chemical compound CC(C)C[C@H](N)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O HYMLKESRWLZDBR-WEDXCCLWSA-N 0.000 description 1
- PBGDOSARRIJMEV-DLOVCJGASA-N Leu-His-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](C)C(O)=O PBGDOSARRIJMEV-DLOVCJGASA-N 0.000 description 1
- VZBIUJURDLFFOE-IHRRRGAJSA-N Leu-His-Arg Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CC1=CNC=N1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VZBIUJURDLFFOE-IHRRRGAJSA-N 0.000 description 1
- KOSWSHVQIVTVQF-ZPFDUUQYSA-N Leu-Ile-Asp Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC(O)=O)C(O)=O KOSWSHVQIVTVQF-ZPFDUUQYSA-N 0.000 description 1
- DSFYPIUSAMSERP-IHRRRGAJSA-N Leu-Leu-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCCN=C(N)N DSFYPIUSAMSERP-IHRRRGAJSA-N 0.000 description 1
- IAJFFZORSWOZPQ-SRVKXCTJSA-N Leu-Leu-Asn Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IAJFFZORSWOZPQ-SRVKXCTJSA-N 0.000 description 1
- ZRHDPZAAWLXXIR-SRVKXCTJSA-N Leu-Lys-Ala Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(O)=O ZRHDPZAAWLXXIR-SRVKXCTJSA-N 0.000 description 1
- CPONGMJGVIAWEH-DCAQKATOSA-N Leu-Met-Ala Chemical compound CSCC[C@H](NC(=O)[C@@H](N)CC(C)C)C(=O)N[C@@H](C)C(O)=O CPONGMJGVIAWEH-DCAQKATOSA-N 0.000 description 1
- KWLWZYMNUZJKMZ-IHRRRGAJSA-N Leu-Pro-Leu Chemical compound CC(C)C[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(O)=O KWLWZYMNUZJKMZ-IHRRRGAJSA-N 0.000 description 1
- CHJKEDSZNSONPS-DCAQKATOSA-N Leu-Pro-Ser Chemical compound [H]N[C@@H](CC(C)C)C(=O)N1CCC[C@H]1C(=O)N[C@@H](CO)C(O)=O CHJKEDSZNSONPS-DCAQKATOSA-N 0.000 description 1
- IDGZVZJLYFTXSL-DCAQKATOSA-N Leu-Ser-Arg Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@H](C(O)=O)CCCN=C(N)N IDGZVZJLYFTXSL-DCAQKATOSA-N 0.000 description 1
- RGUXWMDNCPMQFB-YUMQZZPRSA-N Leu-Ser-Gly Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(=O)NCC(O)=O RGUXWMDNCPMQFB-YUMQZZPRSA-N 0.000 description 1
- AMSSKPUHBUQBOQ-SRVKXCTJSA-N Leu-Ser-Lys Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)O)N AMSSKPUHBUQBOQ-SRVKXCTJSA-N 0.000 description 1
- SBANPBVRHYIMRR-UHFFFAOYSA-N Leu-Ser-Pro Natural products CC(C)CC(N)C(=O)NC(CO)C(=O)N1CCCC1C(O)=O SBANPBVRHYIMRR-UHFFFAOYSA-N 0.000 description 1
- ZJZNLRVCZWUONM-JXUBOQSCSA-N Leu-Thr-Ala Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O ZJZNLRVCZWUONM-JXUBOQSCSA-N 0.000 description 1
- KLSUAWUZBMAZCL-RHYQMDGZSA-N Leu-Thr-Pro Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1CCC[C@H]1C(O)=O KLSUAWUZBMAZCL-RHYQMDGZSA-N 0.000 description 1
- GZRABTMNWJXFMH-UVOCVTCTSA-N Leu-Thr-Thr Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O GZRABTMNWJXFMH-UVOCVTCTSA-N 0.000 description 1
- FBNPMTNBFFAMMH-UHFFFAOYSA-N Leu-Val-Arg Natural products CC(C)CC(N)C(=O)NC(C(C)C)C(=O)NC(C(O)=O)CCCN=C(N)N FBNPMTNBFFAMMH-UHFFFAOYSA-N 0.000 description 1
- XZNJZXJZBMBGGS-NHCYSSNCSA-N Leu-Val-Asn Chemical compound [H]N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O XZNJZXJZBMBGGS-NHCYSSNCSA-N 0.000 description 1
- YQFZRHYZLARWDY-IHRRRGAJSA-N Leu-Val-Lys Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCCCN YQFZRHYZLARWDY-IHRRRGAJSA-N 0.000 description 1
- XOEDPXDZJHBQIX-ULQDDVLXSA-N Leu-Val-Phe Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 XOEDPXDZJHBQIX-ULQDDVLXSA-N 0.000 description 1
- QESXLSQLQHHTIX-RHYQMDGZSA-N Leu-Val-Thr Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O QESXLSQLQHHTIX-RHYQMDGZSA-N 0.000 description 1
- 239000006137 Luria-Bertani broth Substances 0.000 description 1
- KCXUCYYZNZFGLL-SRVKXCTJSA-N Lys-Ala-Leu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O KCXUCYYZNZFGLL-SRVKXCTJSA-N 0.000 description 1
- YIBOAHAOAWACDK-QEJZJMRPSA-N Lys-Ala-Phe Chemical compound NCCCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 YIBOAHAOAWACDK-QEJZJMRPSA-N 0.000 description 1
- QIJVAFLRMVBHMU-KKUMJFAQSA-N Lys-Asp-Phe Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC1=CC=CC=C1)C(O)=O QIJVAFLRMVBHMU-KKUMJFAQSA-N 0.000 description 1
- DFXQCCBKGUNYGG-GUBZILKMSA-N Lys-Gln-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@@H](N)CCCCN DFXQCCBKGUNYGG-GUBZILKMSA-N 0.000 description 1
- ITWQLSZTLBKWJM-YUMQZZPRSA-N Lys-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CCCCN ITWQLSZTLBKWJM-YUMQZZPRSA-N 0.000 description 1
- DKTNGXVSCZULPO-YUMQZZPRSA-N Lys-Gly-Cys Chemical compound NCCCC[C@H](N)C(=O)NCC(=O)N[C@@H](CS)C(O)=O DKTNGXVSCZULPO-YUMQZZPRSA-N 0.000 description 1
- SKRGVGLIRUGANF-AVGNSLFASA-N Lys-Leu-Glu Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SKRGVGLIRUGANF-AVGNSLFASA-N 0.000 description 1
- BDFHWFUAQLIMJO-KXNHARMFSA-N Lys-Thr-Pro Chemical compound C[C@H]([C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](CCCCN)N)O BDFHWFUAQLIMJO-KXNHARMFSA-N 0.000 description 1
- VWPJQIHBBOJWDN-DCAQKATOSA-N Lys-Val-Ala Chemical compound [H]N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C)C(O)=O VWPJQIHBBOJWDN-DCAQKATOSA-N 0.000 description 1
- RIPJMCFGQHGHNP-RHYQMDGZSA-N Lys-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](CCCCN)N)O RIPJMCFGQHGHNP-RHYQMDGZSA-N 0.000 description 1
- ONGCSGVHCSAATF-CIUDSAMLSA-N Met-Ala-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@H](C(O)=O)CCC(O)=O ONGCSGVHCSAATF-CIUDSAMLSA-N 0.000 description 1
- GAELMDJMQDUDLJ-BQBZGAKWSA-N Met-Ala-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C)C(=O)NCC(O)=O GAELMDJMQDUDLJ-BQBZGAKWSA-N 0.000 description 1
- QGQGAIBGTUJRBR-NAKRPEOUSA-N Met-Ala-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](C)NC(=O)[C@@H](N)CCSC QGQGAIBGTUJRBR-NAKRPEOUSA-N 0.000 description 1
- ZAJNRWKGHWGPDQ-SDDRHHMPSA-N Met-Arg-Pro Chemical compound CSCC[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N1CCC[C@@H]1C(=O)O)N ZAJNRWKGHWGPDQ-SDDRHHMPSA-N 0.000 description 1
- DCHHUGLTVLJYKA-FXQIFTODSA-N Met-Asn-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(O)=O DCHHUGLTVLJYKA-FXQIFTODSA-N 0.000 description 1
- FJVJLMZUIGMFFU-BQBZGAKWSA-N Met-Asp-Gly Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(O)=O)C(=O)NCC(O)=O FJVJLMZUIGMFFU-BQBZGAKWSA-N 0.000 description 1
- UJDMTKHGWSBHBX-IHRRRGAJSA-N Met-Cys-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 UJDMTKHGWSBHBX-IHRRRGAJSA-N 0.000 description 1
- AETNZPKUUYYYEK-CIUDSAMLSA-N Met-Glu-Asn Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(N)=O)C(O)=O AETNZPKUUYYYEK-CIUDSAMLSA-N 0.000 description 1
- MTBVQFFQMXHCPC-CIUDSAMLSA-N Met-Glu-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O MTBVQFFQMXHCPC-CIUDSAMLSA-N 0.000 description 1
- OOSPRDCGTLQLBP-NHCYSSNCSA-N Met-Glu-Val Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C(C)C)C(O)=O OOSPRDCGTLQLBP-NHCYSSNCSA-N 0.000 description 1
- IUYCGMNKIZDRQI-BQBZGAKWSA-N Met-Gly-Ala Chemical compound CSCC[C@H](N)C(=O)NCC(=O)N[C@@H](C)C(O)=O IUYCGMNKIZDRQI-BQBZGAKWSA-N 0.000 description 1
- CUICVBQQHMKBRJ-LSJOCFKGSA-N Met-His-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O CUICVBQQHMKBRJ-LSJOCFKGSA-N 0.000 description 1
- HGAJNEWOUHDUMZ-SRVKXCTJSA-N Met-Leu-Glu Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O HGAJNEWOUHDUMZ-SRVKXCTJSA-N 0.000 description 1
- CHDYFPCQVUOJEB-ULQDDVLXSA-N Met-Leu-Phe Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CC=CC=C1 CHDYFPCQVUOJEB-ULQDDVLXSA-N 0.000 description 1
- USBFEVBHEQBWDD-AVGNSLFASA-N Met-Leu-Val Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(O)=O USBFEVBHEQBWDD-AVGNSLFASA-N 0.000 description 1
- WNJXJJSGUXAIQU-UFYCRDLUSA-N Met-Phe-Phe Chemical compound C([C@H](NC(=O)[C@@H](N)CCSC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 WNJXJJSGUXAIQU-UFYCRDLUSA-N 0.000 description 1
- BQHLZUMZOXUWNU-DCAQKATOSA-N Met-Pro-Glu Chemical compound CSCC[C@@H](C(=O)N1CCC[C@H]1C(=O)N[C@@H](CCC(=O)O)C(=O)O)N BQHLZUMZOXUWNU-DCAQKATOSA-N 0.000 description 1
- CIDICGYKRUTYLE-FXQIFTODSA-N Met-Ser-Ala Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O CIDICGYKRUTYLE-FXQIFTODSA-N 0.000 description 1
- RDLSEGZJMYGFNS-FXQIFTODSA-N Met-Ser-Asp Chemical compound [H]N[C@@H](CCSC)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(O)=O RDLSEGZJMYGFNS-FXQIFTODSA-N 0.000 description 1
- KYXDADPHSNFWQX-VEVYYDQMSA-N Met-Thr-Asp Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CC(O)=O KYXDADPHSNFWQX-VEVYYDQMSA-N 0.000 description 1
- KYJHWKAMFISDJE-RCWTZXSCSA-N Met-Thr-Met Chemical compound CSCC[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCSC KYJHWKAMFISDJE-RCWTZXSCSA-N 0.000 description 1
- IIHMNTBFPMRJCN-RCWTZXSCSA-N Met-Val-Thr Chemical compound CSCC[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O IIHMNTBFPMRJCN-RCWTZXSCSA-N 0.000 description 1
- 241000187708 Micromonospora Species 0.000 description 1
- WYBVBIHNJWOLCJ-UHFFFAOYSA-N N-L-arginyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CCCN=C(N)N WYBVBIHNJWOLCJ-UHFFFAOYSA-N 0.000 description 1
- AUEJLPRZGVVDNU-UHFFFAOYSA-N N-L-tyrosyl-L-leucine Natural products CC(C)CC(C(O)=O)NC(=O)C(N)CC1=CC=C(O)C=C1 AUEJLPRZGVVDNU-UHFFFAOYSA-N 0.000 description 1
- KZNQNBZMBZJQJO-UHFFFAOYSA-N N-glycyl-L-proline Natural products NCC(=O)N1CCCC1C(O)=O KZNQNBZMBZJQJO-UHFFFAOYSA-N 0.000 description 1
- AJHCSUXXECOXOY-UHFFFAOYSA-N N-glycyl-L-tryptophan Natural products C1=CC=C2C(CC(NC(=O)CN)C(O)=O)=CNC2=C1 AJHCSUXXECOXOY-UHFFFAOYSA-N 0.000 description 1
- 108010066427 N-valyltryptophan Proteins 0.000 description 1
- 108010047562 NGR peptide Proteins 0.000 description 1
- 108091028043 Nucleic acid sequence Proteins 0.000 description 1
- BJEYSVHMGIJORT-NHCYSSNCSA-N Phe-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=CC=C1 BJEYSVHMGIJORT-NHCYSSNCSA-N 0.000 description 1
- SEPNOAFMZLLCEW-UBHSHLNASA-N Phe-Ala-Val Chemical compound N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(=O)O SEPNOAFMZLLCEW-UBHSHLNASA-N 0.000 description 1
- DPUOLKQSMYLRDR-UBHSHLNASA-N Phe-Arg-Ala Chemical compound NC(N)=NCCC[C@@H](C(=O)N[C@@H](C)C(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 DPUOLKQSMYLRDR-UBHSHLNASA-N 0.000 description 1
- MPGJIHFJCXTVEX-KKUMJFAQSA-N Phe-Arg-Glu Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O MPGJIHFJCXTVEX-KKUMJFAQSA-N 0.000 description 1
- JEGFCFLCRSJCMA-IHRRRGAJSA-N Phe-Arg-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CO)C(=O)O)N JEGFCFLCRSJCMA-IHRRRGAJSA-N 0.000 description 1
- XMPUYNHKEPFERE-IHRRRGAJSA-N Phe-Asp-Arg Chemical compound NC(N)=NCCC[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=CC=C1 XMPUYNHKEPFERE-IHRRRGAJSA-N 0.000 description 1
- BIYWZVCPZIFGPY-QWRGUYRKSA-N Phe-Gly-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)NCC(=O)N[C@@H](CO)C(O)=O BIYWZVCPZIFGPY-QWRGUYRKSA-N 0.000 description 1
- ZKSLXIGKRJMALF-MGHWNKPDSA-N Phe-His-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CN=CN1)NC(=O)[C@H](CC2=CC=CC=C2)N ZKSLXIGKRJMALF-MGHWNKPDSA-N 0.000 description 1
- MYQCCQSMKNCNKY-KKUMJFAQSA-N Phe-His-Ser Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC2=CN=CN2)C(=O)N[C@@H](CO)C(=O)O)N MYQCCQSMKNCNKY-KKUMJFAQSA-N 0.000 description 1
- BWTKUQPNOMMKMA-FIRPJDEBSA-N Phe-Ile-Phe Chemical compound C([C@H](N)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC=1C=CC=CC=1)C(O)=O)C1=CC=CC=C1 BWTKUQPNOMMKMA-FIRPJDEBSA-N 0.000 description 1
- CMHTUJQZQXFNTQ-OEAJRASXSA-N Phe-Leu-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC1=CC=CC=C1)N)O CMHTUJQZQXFNTQ-OEAJRASXSA-N 0.000 description 1
- ZIQQNOXKEFDPBE-BZSNNMDCSA-N Phe-Lys-His Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O)N ZIQQNOXKEFDPBE-BZSNNMDCSA-N 0.000 description 1
- IEOHQGFKHXUALJ-JYJNAYRXSA-N Phe-Met-Arg Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O IEOHQGFKHXUALJ-JYJNAYRXSA-N 0.000 description 1
- OXKJSGGTHFMGDT-UFYCRDLUSA-N Phe-Phe-Arg Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O)C1=CC=CC=C1 OXKJSGGTHFMGDT-UFYCRDLUSA-N 0.000 description 1
- GKRCCTYAGQPMMP-IHRRRGAJSA-N Phe-Ser-Met Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCSC)C(O)=O GKRCCTYAGQPMMP-IHRRRGAJSA-N 0.000 description 1
- MCIXMYKSPQUMJG-SRVKXCTJSA-N Phe-Ser-Ser Chemical compound [H]N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H](CO)C(O)=O MCIXMYKSPQUMJG-SRVKXCTJSA-N 0.000 description 1
- YFXXRYFWJFQAFW-JHYOHUSXSA-N Phe-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](CC1=CC=CC=C1)N)O YFXXRYFWJFQAFW-JHYOHUSXSA-N 0.000 description 1
- DZZCICYRSZASNF-FXQIFTODSA-N Pro-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H]1CCCN1 DZZCICYRSZASNF-FXQIFTODSA-N 0.000 description 1
- AJLVKXCNXIJHDV-CIUDSAMLSA-N Pro-Ala-Gln Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(N)=O)C(O)=O AJLVKXCNXIJHDV-CIUDSAMLSA-N 0.000 description 1
- IHCXPSYCHXFXKT-DCAQKATOSA-N Pro-Arg-Glu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(O)=O IHCXPSYCHXFXKT-DCAQKATOSA-N 0.000 description 1
- QBFONMUYNSNKIX-AVGNSLFASA-N Pro-Arg-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O QBFONMUYNSNKIX-AVGNSLFASA-N 0.000 description 1
- VCYJKOLZYPYGJV-AVGNSLFASA-N Pro-Arg-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(O)=O VCYJKOLZYPYGJV-AVGNSLFASA-N 0.000 description 1
- ICTZKEXYDDZZFP-SRVKXCTJSA-N Pro-Arg-Pro Chemical compound N([C@@H](CCCN=C(N)N)C(=O)N1[C@@H](CCC1)C(O)=O)C(=O)[C@@H]1CCCN1 ICTZKEXYDDZZFP-SRVKXCTJSA-N 0.000 description 1
- FUVBEZJCRMHWEM-FXQIFTODSA-N Pro-Asn-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CO)C(O)=O FUVBEZJCRMHWEM-FXQIFTODSA-N 0.000 description 1
- SGCZFWSQERRKBD-BQBZGAKWSA-N Pro-Asp-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@@H]1CCCN1 SGCZFWSQERRKBD-BQBZGAKWSA-N 0.000 description 1
- KIGGUSRFHJCIEJ-DCAQKATOSA-N Pro-Asp-His Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC2=CN=CN2)C(=O)O KIGGUSRFHJCIEJ-DCAQKATOSA-N 0.000 description 1
- HQVPQXMCQKXARZ-FXQIFTODSA-N Pro-Cys-Ser Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CS)C(=O)N[C@@H](CO)C(=O)O HQVPQXMCQKXARZ-FXQIFTODSA-N 0.000 description 1
- LSIWVWRUTKPXDS-DCAQKATOSA-N Pro-Gln-Arg Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O LSIWVWRUTKPXDS-DCAQKATOSA-N 0.000 description 1
- AFXCXDQNRXTSBD-FJXKBIBVSA-N Pro-Gly-Thr Chemical compound [H]N1CCC[C@H]1C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(O)=O AFXCXDQNRXTSBD-FJXKBIBVSA-N 0.000 description 1
- STASJMBVVHNWCG-IHRRRGAJSA-N Pro-His-Leu Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C([O-])=O)NC(=O)[C@H]1[NH2+]CCC1)C1=CN=CN1 STASJMBVVHNWCG-IHRRRGAJSA-N 0.000 description 1
- XQHGISDMVBTGAL-ULQDDVLXSA-N Pro-His-Phe Chemical compound C([C@@H](C(=O)[O-])NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H]1[NH2+]CCC1)C1=CC=CC=C1 XQHGISDMVBTGAL-ULQDDVLXSA-N 0.000 description 1
- IBGCFJDLCYTKPW-NAKRPEOUSA-N Pro-Ile-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]1CCCN1 IBGCFJDLCYTKPW-NAKRPEOUSA-N 0.000 description 1
- FKVNLUZHSFCNGY-RVMXOQNASA-N Pro-Ile-Pro Chemical compound CC[C@H](C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 FKVNLUZHSFCNGY-RVMXOQNASA-N 0.000 description 1
- FXGIMYRVJJEIIM-UWVGGRQHSA-N Pro-Leu-Gly Chemical compound OC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@@H]1CCCN1 FXGIMYRVJJEIIM-UWVGGRQHSA-N 0.000 description 1
- FKYKZHOKDOPHSA-DCAQKATOSA-N Pro-Leu-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(O)=O FKYKZHOKDOPHSA-DCAQKATOSA-N 0.000 description 1
- ZLXKLMHAMDENIO-DCAQKATOSA-N Pro-Lys-Asp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLXKLMHAMDENIO-DCAQKATOSA-N 0.000 description 1
- RMODQFBNDDENCP-IHRRRGAJSA-N Pro-Lys-Leu Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O RMODQFBNDDENCP-IHRRRGAJSA-N 0.000 description 1
- WIPAMEKBSHNFQE-IUCAKERBSA-N Pro-Met-Gly Chemical compound CSCC[C@@H](C(=O)NCC(=O)O)NC(=O)[C@@H]1CCCN1 WIPAMEKBSHNFQE-IUCAKERBSA-N 0.000 description 1
- AWQGDZBKQTYNMN-IHRRRGAJSA-N Pro-Phe-Asp Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N[C@@H](CC(=O)O)C(=O)O AWQGDZBKQTYNMN-IHRRRGAJSA-N 0.000 description 1
- ZVEQWRWMRFIVSD-HRCADAONSA-N Pro-Phe-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=CC=C2)C(=O)N3CCC[C@@H]3C(=O)O ZVEQWRWMRFIVSD-HRCADAONSA-N 0.000 description 1
- LEIKGVHQTKHOLM-IUCAKERBSA-N Pro-Pro-Gly Chemical compound OC(=O)CNC(=O)[C@@H]1CCCN1C(=O)[C@H]1NCCC1 LEIKGVHQTKHOLM-IUCAKERBSA-N 0.000 description 1
- RCYUBVHMVUHEBM-RCWTZXSCSA-N Pro-Pro-Thr Chemical compound [H]N1CCC[C@H]1C(=O)N1CCC[C@H]1C(=O)N[C@@H]([C@@H](C)O)C(O)=O RCYUBVHMVUHEBM-RCWTZXSCSA-N 0.000 description 1
- AJNGQVUFQUVRQT-JYJNAYRXSA-N Pro-Pro-Tyr Chemical compound C([C@@H](C(=O)O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H]1NCCC1)C1=CC=C(O)C=C1 AJNGQVUFQUVRQT-JYJNAYRXSA-N 0.000 description 1
- IALSFJSONJZBKB-HRCADAONSA-N Pro-Tyr-Pro Chemical compound C1C[C@H](NC1)C(=O)N[C@@H](CC2=CC=C(C=C2)O)C(=O)N3CCC[C@@H]3C(=O)O IALSFJSONJZBKB-HRCADAONSA-N 0.000 description 1
- VEUACYMXJKXALX-IHRRRGAJSA-N Pro-Tyr-Ser Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(O)=O VEUACYMXJKXALX-IHRRRGAJSA-N 0.000 description 1
- ZAUHSLVPDLNTRZ-QXEWZRGKSA-N Pro-Val-Asn Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O ZAUHSLVPDLNTRZ-QXEWZRGKSA-N 0.000 description 1
- FUOGXAQMNJMBFG-WPRPVWTQSA-N Pro-Val-Gly Chemical compound OC(=O)CNC(=O)[C@H](C(C)C)NC(=O)[C@@H]1CCCN1 FUOGXAQMNJMBFG-WPRPVWTQSA-N 0.000 description 1
- ZMLRZBWCXPQADC-TUAOUCFPSA-N Pro-Val-Pro Chemical compound CC(C)[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@@H]2CCCN2 ZMLRZBWCXPQADC-TUAOUCFPSA-N 0.000 description 1
- MTMJNKFZDQEVSY-BZSNNMDCSA-N Pro-Val-Trp Chemical compound [H]N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O MTMJNKFZDQEVSY-BZSNNMDCSA-N 0.000 description 1
- 241000589516 Pseudomonas Species 0.000 description 1
- 101000702488 Rattus norvegicus High affinity cationic amino acid transporter 1 Proteins 0.000 description 1
- 241000235070 Saccharomyces Species 0.000 description 1
- SRTCFKGBYBZRHA-ACZMJKKPSA-N Ser-Ala-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCC(O)=O)C(O)=O SRTCFKGBYBZRHA-ACZMJKKPSA-N 0.000 description 1
- YQHZVYJAGWMHES-ZLUOBGJFSA-N Ser-Ala-Ser Chemical compound OC[C@H](N)C(=O)N[C@@H](C)C(=O)N[C@@H](CO)C(O)=O YQHZVYJAGWMHES-ZLUOBGJFSA-N 0.000 description 1
- GXXTUIUYTWGPMV-FXQIFTODSA-N Ser-Arg-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](C)C(O)=O GXXTUIUYTWGPMV-FXQIFTODSA-N 0.000 description 1
- TYYBJUYSTWJHGO-ZKWXMUAHSA-N Ser-Asn-Val Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C(C)C)C(O)=O TYYBJUYSTWJHGO-ZKWXMUAHSA-N 0.000 description 1
- BGOWRLSWJCVYAQ-CIUDSAMLSA-N Ser-Asp-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O BGOWRLSWJCVYAQ-CIUDSAMLSA-N 0.000 description 1
- HJEBZBMOTCQYDN-ACZMJKKPSA-N Ser-Glu-Asp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(O)=O)C(O)=O HJEBZBMOTCQYDN-ACZMJKKPSA-N 0.000 description 1
- GRSLLFZTTLBOQX-CIUDSAMLSA-N Ser-Glu-Met Chemical compound CSCC[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CO)N GRSLLFZTTLBOQX-CIUDSAMLSA-N 0.000 description 1
- UQFYNFTYDHUIMI-WHFBIAKZSA-N Ser-Gly-Ala Chemical compound OC(=O)[C@H](C)NC(=O)CNC(=O)[C@@H](N)CO UQFYNFTYDHUIMI-WHFBIAKZSA-N 0.000 description 1
- YMTLKLXDFCSCNX-BYPYZUCNSA-N Ser-Gly-Gly Chemical compound OC[C@H](N)C(=O)NCC(=O)NCC(O)=O YMTLKLXDFCSCNX-BYPYZUCNSA-N 0.000 description 1
- SFTZWNJFZYOLBD-ZDLURKLDSA-N Ser-Gly-Thr Chemical compound C[C@@H](O)[C@@H](C(O)=O)NC(=O)CNC(=O)[C@@H](N)CO SFTZWNJFZYOLBD-ZDLURKLDSA-N 0.000 description 1
- WEQAYODCJHZSJZ-KKUMJFAQSA-N Ser-His-Tyr Chemical compound C([C@H](NC(=O)[C@H](CO)N)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(O)=O)C1=CN=CN1 WEQAYODCJHZSJZ-KKUMJFAQSA-N 0.000 description 1
- DJACUBDEDBZKLQ-KBIXCLLPSA-N Ser-Ile-Glu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CCC(O)=O)C(O)=O DJACUBDEDBZKLQ-KBIXCLLPSA-N 0.000 description 1
- JIPVNVNKXJLFJF-BJDJZHNGSA-N Ser-Ile-Lys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CCCCN)C(=O)O)NC(=O)[C@H](CO)N JIPVNVNKXJLFJF-BJDJZHNGSA-N 0.000 description 1
- MQQBBLVOUUJKLH-HJPIBITLSA-N Ser-Ile-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O MQQBBLVOUUJKLH-HJPIBITLSA-N 0.000 description 1
- FUMGHWDRRFCKEP-CIUDSAMLSA-N Ser-Leu-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C)C(O)=O FUMGHWDRRFCKEP-CIUDSAMLSA-N 0.000 description 1
- VMLONWHIORGALA-SRVKXCTJSA-N Ser-Leu-Leu Chemical compound CC(C)C[C@@H](C([O-])=O)NC(=O)[C@H](CC(C)C)NC(=O)[C@@H]([NH3+])CO VMLONWHIORGALA-SRVKXCTJSA-N 0.000 description 1
- MUJQWSAWLLRJCE-KATARQTJSA-N Ser-Leu-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O MUJQWSAWLLRJCE-KATARQTJSA-N 0.000 description 1
- BYCVMHKULKRVPV-GUBZILKMSA-N Ser-Lys-Gln Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CCC(N)=O)C(O)=O BYCVMHKULKRVPV-GUBZILKMSA-N 0.000 description 1
- FOOZNBRFRWGBNU-DCAQKATOSA-N Ser-Met-His Chemical compound CSCC[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CO)N FOOZNBRFRWGBNU-DCAQKATOSA-N 0.000 description 1
- RHAPJNVNWDBFQI-BQBZGAKWSA-N Ser-Pro-Gly Chemical compound OC[C@H](N)C(=O)N1CCC[C@H]1C(=O)NCC(O)=O RHAPJNVNWDBFQI-BQBZGAKWSA-N 0.000 description 1
- FKYWFUYPVKLJLP-DCAQKATOSA-N Ser-Pro-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)CO FKYWFUYPVKLJLP-DCAQKATOSA-N 0.000 description 1
- BMKNXTJLHFIAAH-CIUDSAMLSA-N Ser-Ser-Leu Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(C)C)C(O)=O BMKNXTJLHFIAAH-CIUDSAMLSA-N 0.000 description 1
- DKGRNFUXVTYRAS-UBHSHLNASA-N Ser-Ser-Trp Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CNC2=C1C=CC=C2)C(O)=O DKGRNFUXVTYRAS-UBHSHLNASA-N 0.000 description 1
- OLKICIBQRVSQMA-SRVKXCTJSA-N Ser-Ser-Tyr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(O)=O OLKICIBQRVSQMA-SRVKXCTJSA-N 0.000 description 1
- XJDMUQCLVSCRSJ-VZFHVOOUSA-N Ser-Thr-Ala Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(O)=O XJDMUQCLVSCRSJ-VZFHVOOUSA-N 0.000 description 1
- TYIHBQYLIPJSIV-NYVOZVTQSA-N Ser-Trp-Trp Chemical compound C1=CC=C2C(=C1)C(=CN2)C[C@@H](C(=O)N[C@@H](CC3=CNC4=CC=CC=C43)C(=O)O)NC(=O)[C@H](CO)N TYIHBQYLIPJSIV-NYVOZVTQSA-N 0.000 description 1
- OQSQCUWQOIHECT-YJRXYDGGSA-N Ser-Tyr-Thr Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H]([C@@H](C)O)C(O)=O OQSQCUWQOIHECT-YJRXYDGGSA-N 0.000 description 1
- JGUWRQWULDWNCM-FXQIFTODSA-N Ser-Val-Ser Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CO)C(O)=O JGUWRQWULDWNCM-FXQIFTODSA-N 0.000 description 1
- LSHUNRICNSEEAN-BPUTZDHNSA-N Ser-Val-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)NC(=O)[C@H](CO)N LSHUNRICNSEEAN-BPUTZDHNSA-N 0.000 description 1
- 241000607715 Serratia marcescens Species 0.000 description 1
- 108050007496 Shikimate kinase 2 Proteins 0.000 description 1
- 108020004682 Single-Stranded DNA Proteins 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- MQCPGOZXFSYJPS-KZVJFYERSA-N Thr-Ala-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O MQCPGOZXFSYJPS-KZVJFYERSA-N 0.000 description 1
- DFTCYYILCSQGIZ-GCJQMDKQSA-N Thr-Ala-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(N)=O)C(O)=O DFTCYYILCSQGIZ-GCJQMDKQSA-N 0.000 description 1
- BSNZTJXVDOINSR-JXUBOQSCSA-N Thr-Ala-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](CC(C)C)C(O)=O BSNZTJXVDOINSR-JXUBOQSCSA-N 0.000 description 1
- NFMPFBCXABPALN-OWLDWWDNSA-N Thr-Ala-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](C)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O NFMPFBCXABPALN-OWLDWWDNSA-N 0.000 description 1
- XSLXHSYIVPGEER-KZVJFYERSA-N Thr-Ala-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C)C(=O)N[C@@H](C(C)C)C(O)=O XSLXHSYIVPGEER-KZVJFYERSA-N 0.000 description 1
- WFUAUEQXPVNAEF-ZJDVBMNYSA-N Thr-Arg-Thr Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CCCN=C(N)N WFUAUEQXPVNAEF-ZJDVBMNYSA-N 0.000 description 1
- VXMHQKHDKCATDV-VEVYYDQMSA-N Thr-Asp-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O VXMHQKHDKCATDV-VEVYYDQMSA-N 0.000 description 1
- GKMYGVQDGVYCPC-IUKAMOBKSA-N Thr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC(=O)O)NC(=O)[C@H]([C@@H](C)O)N GKMYGVQDGVYCPC-IUKAMOBKSA-N 0.000 description 1
- NLSNVZAREYQMGR-HJGDQZAQSA-N Thr-Asp-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H](CC(C)C)C(O)=O NLSNVZAREYQMGR-HJGDQZAQSA-N 0.000 description 1
- XDARBNMYXKUFOJ-GSSVUCPTSA-N Thr-Asp-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(O)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O XDARBNMYXKUFOJ-GSSVUCPTSA-N 0.000 description 1
- VUKVQVNKIIZBPO-HOUAVDHOSA-N Thr-Asp-Trp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O VUKVQVNKIIZBPO-HOUAVDHOSA-N 0.000 description 1
- UCCNDUPVIFOOQX-CUJWVEQBSA-N Thr-Cys-His Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CS)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 UCCNDUPVIFOOQX-CUJWVEQBSA-N 0.000 description 1
- VGYBYGQXZJDZJU-XQXXSGGOSA-N Thr-Glu-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O VGYBYGQXZJDZJU-XQXXSGGOSA-N 0.000 description 1
- XPNSAQMEAVSQRD-FBCQKBJTSA-N Thr-Gly-Gly Chemical compound C[C@@H](O)[C@H](N)C(=O)NCC(=O)NCC(O)=O XPNSAQMEAVSQRD-FBCQKBJTSA-N 0.000 description 1
- YSXYEJWDHBCTDJ-DVJZZOLTSA-N Thr-Gly-Trp Chemical compound C[C@H]([C@@H](C(=O)NCC(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N)O YSXYEJWDHBCTDJ-DVJZZOLTSA-N 0.000 description 1
- JKGGPMOUIAAJAA-YEPSODPASA-N Thr-Gly-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](C(C)C)C(O)=O JKGGPMOUIAAJAA-YEPSODPASA-N 0.000 description 1
- DDDLIMCZFKOERC-SVSWQMSJSA-N Thr-Ile-Cys Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H]([C@@H](C)O)N DDDLIMCZFKOERC-SVSWQMSJSA-N 0.000 description 1
- AMXMBCAXAZUCFA-RHYQMDGZSA-N Thr-Leu-Arg Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O AMXMBCAXAZUCFA-RHYQMDGZSA-N 0.000 description 1
- IMDMLDSVUSMAEJ-HJGDQZAQSA-N Thr-Leu-Asn Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(N)=O)C(O)=O IMDMLDSVUSMAEJ-HJGDQZAQSA-N 0.000 description 1
- RRRRCRYTLZVCEN-HJGDQZAQSA-N Thr-Leu-Asp Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O RRRRCRYTLZVCEN-HJGDQZAQSA-N 0.000 description 1
- SPVHQURZJCUDQC-VOAKCMCISA-N Thr-Lys-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(O)=O SPVHQURZJCUDQC-VOAKCMCISA-N 0.000 description 1
- YJVJPJPHHFOVMG-VEVYYDQMSA-N Thr-Met-Asp Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(=O)O)C(=O)O)N)O YJVJPJPHHFOVMG-VEVYYDQMSA-N 0.000 description 1
- XNTVWRJTUIOGQO-RHYQMDGZSA-N Thr-Met-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CC(C)C)C(O)=O XNTVWRJTUIOGQO-RHYQMDGZSA-N 0.000 description 1
- WVVOFCVMHAXGLE-LFSVMHDDSA-N Thr-Phe-Ala Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](C)C(O)=O WVVOFCVMHAXGLE-LFSVMHDDSA-N 0.000 description 1
- ABWNZPOIUJMNKT-IXOXFDKPSA-N Thr-Phe-Ser Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O ABWNZPOIUJMNKT-IXOXFDKPSA-N 0.000 description 1
- VUXIQSUQQYNLJP-XAVMHZPKSA-N Thr-Ser-Pro Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CO)C(=O)N1CCC[C@@H]1C(=O)O)N)O VUXIQSUQQYNLJP-XAVMHZPKSA-N 0.000 description 1
- IEZVHOULSUULHD-XGEHTFHBSA-N Thr-Ser-Val Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CO)C(=O)N[C@@H](C(C)C)C(O)=O IEZVHOULSUULHD-XGEHTFHBSA-N 0.000 description 1
- BBPCSGKKPJUYRB-UVOCVTCTSA-N Thr-Thr-Leu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(C)C)C(O)=O BBPCSGKKPJUYRB-UVOCVTCTSA-N 0.000 description 1
- GJOBRAHDRIDAPT-NGTWOADLSA-N Thr-Trp-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)NC(=O)[C@H]([C@@H](C)O)N GJOBRAHDRIDAPT-NGTWOADLSA-N 0.000 description 1
- REJRKTOJTCPDPO-IRIUXVKKSA-N Thr-Tyr-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(O)=O REJRKTOJTCPDPO-IRIUXVKKSA-N 0.000 description 1
- KPMIQCXJDVKWKO-IFFSRLJSSA-N Thr-Val-Glu Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CCC(O)=O)C(O)=O KPMIQCXJDVKWKO-IFFSRLJSSA-N 0.000 description 1
- BKVICMPZWRNWOC-RHYQMDGZSA-N Thr-Val-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](C(C)C)NC(=O)[C@@H](N)[C@@H](C)O BKVICMPZWRNWOC-RHYQMDGZSA-N 0.000 description 1
- 102000004357 Transferases Human genes 0.000 description 1
- 108090000992 Transferases Proteins 0.000 description 1
- CZSMNLQMRWPGQF-XEGUGMAKSA-N Trp-Gln-Ala Chemical compound [H]N[C@@H](CC1=CNC2=C1C=CC=C2)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](C)C(O)=O CZSMNLQMRWPGQF-XEGUGMAKSA-N 0.000 description 1
- NXJZCPKZIKTYLX-XEGUGMAKSA-N Trp-Glu-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N NXJZCPKZIKTYLX-XEGUGMAKSA-N 0.000 description 1
- OJCSQAWRJKPKFM-TUSQITKMSA-N Trp-His-Trp Chemical compound N[C@@H](Cc1c[nH]c2ccccc12)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](Cc1c[nH]c2ccccc12)C(O)=O OJCSQAWRJKPKFM-TUSQITKMSA-N 0.000 description 1
- IQXWAJUIAQLZNX-IHPCNDPISA-N Trp-Leu-His Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N IQXWAJUIAQLZNX-IHPCNDPISA-N 0.000 description 1
- WMBFONUKQXGLMU-WDSOQIARSA-N Trp-Leu-Val Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](C(C)C)C(=O)O)NC(=O)[C@H](CC1=CNC2=CC=CC=C21)N WMBFONUKQXGLMU-WDSOQIARSA-N 0.000 description 1
- GQEXFCQNAJHJTI-IHPCNDPISA-N Trp-Phe-Asp Chemical compound C1=CC=C(C=C1)C[C@@H](C(=O)N[C@@H](CC(=O)O)C(=O)O)NC(=O)[C@H](CC2=CNC3=CC=CC=C32)N GQEXFCQNAJHJTI-IHPCNDPISA-N 0.000 description 1
- UIRPULWLRODAEQ-QEJZJMRPSA-N Trp-Ser-Glu Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(O)=O)=CNC2=C1 UIRPULWLRODAEQ-QEJZJMRPSA-N 0.000 description 1
- RWTFCAMQLFNPTK-UMPQAUOISA-N Trp-Val-Thr Chemical compound C1=CC=C2C(C[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)=CNC2=C1 RWTFCAMQLFNPTK-UMPQAUOISA-N 0.000 description 1
- VCXWRWYFJLXITF-AUTRQRHGSA-N Tyr-Ala-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@H](C)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 VCXWRWYFJLXITF-AUTRQRHGSA-N 0.000 description 1
- NLMXVDDEQFKQQU-CFMVVWHZSA-N Tyr-Asp-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](N)CC1=CC=C(O)C=C1 NLMXVDDEQFKQQU-CFMVVWHZSA-N 0.000 description 1
- NQJDICVXXIMMMB-XDTLVQLUSA-N Tyr-Glu-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](C)C(O)=O NQJDICVXXIMMMB-XDTLVQLUSA-N 0.000 description 1
- CNLKDWSAORJEMW-KWQFWETISA-N Tyr-Gly-Ala Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)NCC(=O)N[C@@H](C)C(O)=O CNLKDWSAORJEMW-KWQFWETISA-N 0.000 description 1
- NVZVJIUDICCMHZ-BZSNNMDCSA-N Tyr-Phe-Ser Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(O)=O NVZVJIUDICCMHZ-BZSNNMDCSA-N 0.000 description 1
- LUMQYLVYUIRHHU-YJRXYDGGSA-N Tyr-Ser-Thr Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(O)=O LUMQYLVYUIRHHU-YJRXYDGGSA-N 0.000 description 1
- LVILBTSHPTWDGE-PMVMPFDFSA-N Tyr-Trp-Lys Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1C2=CC=CC=C2NC=1)C(=O)N[C@@H](CCCCN)C(O)=O)C1=CC=C(O)C=C1 LVILBTSHPTWDGE-PMVMPFDFSA-N 0.000 description 1
- GZWPQZDVTBZVEP-BZSNNMDCSA-N Tyr-Tyr-Asn Chemical compound [H]N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC1=CC=C(O)C=C1)C(=O)N[C@@H](CC(N)=O)C(O)=O GZWPQZDVTBZVEP-BZSNNMDCSA-N 0.000 description 1
- SMUWZUSWMWVOSL-JYJNAYRXSA-N Tyr-Val-Met Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)O)NC(=O)[C@H](CC1=CC=C(C=C1)O)N SMUWZUSWMWVOSL-JYJNAYRXSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-AEJSXWLSSA-N Val-Ala-Pro Chemical compound C[C@@H](C(=O)N1CCC[C@@H]1C(=O)O)NC(=O)[C@H](C(C)C)N ZLFHAAGHGQBQQN-AEJSXWLSSA-N 0.000 description 1
- ZLFHAAGHGQBQQN-GUBZILKMSA-N Val-Ala-Pro Natural products CC(C)[C@H](N)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(O)=O ZLFHAAGHGQBQQN-GUBZILKMSA-N 0.000 description 1
- KKHRWGYHBZORMQ-NHCYSSNCSA-N Val-Arg-Glu Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(=O)O)C(=O)O)N KKHRWGYHBZORMQ-NHCYSSNCSA-N 0.000 description 1
- UDLYXGYWTVOIKU-QXEWZRGKSA-N Val-Asn-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC(=O)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N UDLYXGYWTVOIKU-QXEWZRGKSA-N 0.000 description 1
- OXVPMZVGCAPFIG-BQFCYCMXSA-N Val-Gln-Trp Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)N)C(=O)N[C@@H](CC1=CNC2=CC=CC=C21)C(=O)O)N OXVPMZVGCAPFIG-BQFCYCMXSA-N 0.000 description 1
- YDPFWRVQHFWBKI-GVXVVHGQSA-N Val-Glu-His Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)N[C@@H](CC1=CN=CN1)C(=O)O)N YDPFWRVQHFWBKI-GVXVVHGQSA-N 0.000 description 1
- VCAWFLIWYNMHQP-UKJIMTQDSA-N Val-Glu-Ile Chemical compound CC[C@H](C)[C@@H](C(=O)O)NC(=O)[C@H](CCC(=O)O)NC(=O)[C@H](C(C)C)N VCAWFLIWYNMHQP-UKJIMTQDSA-N 0.000 description 1
- OXGVAUFVTOPFFA-XPUUQOCRSA-N Val-Gly-Cys Chemical compound CC(C)[C@@H](C(=O)NCC(=O)N[C@@H](CS)C(=O)O)N OXGVAUFVTOPFFA-XPUUQOCRSA-N 0.000 description 1
- WFENBJPLZMPVAX-XVKPBYJWSA-N Val-Gly-Glu Chemical compound CC(C)[C@H](N)C(=O)NCC(=O)N[C@H](C(O)=O)CCC(O)=O WFENBJPLZMPVAX-XVKPBYJWSA-N 0.000 description 1
- FEFZWCSXEMVSPO-LSJOCFKGSA-N Val-His-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](Cc1cnc[nH]1)C(=O)N[C@@H](C)C(O)=O FEFZWCSXEMVSPO-LSJOCFKGSA-N 0.000 description 1
- OVBMCNDKCWAXMZ-NAKRPEOUSA-N Val-Ile-Ser Chemical compound CC[C@H](C)[C@@H](C(=O)N[C@@H](CO)C(=O)O)NC(=O)[C@H](C(C)C)N OVBMCNDKCWAXMZ-NAKRPEOUSA-N 0.000 description 1
- OTJMMKPMLUNTQT-AVGNSLFASA-N Val-Leu-Arg Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)NC(=O)[C@H](C(C)C)N OTJMMKPMLUNTQT-AVGNSLFASA-N 0.000 description 1
- LYERIXUFCYVFFX-GVXVVHGQSA-N Val-Leu-Glu Chemical compound CC(C)C[C@@H](C(=O)N[C@@H](CCC(=O)O)C(=O)O)NC(=O)[C@H](C(C)C)N LYERIXUFCYVFFX-GVXVVHGQSA-N 0.000 description 1
- ZZGPVSZDZQRJQY-ULQDDVLXSA-N Val-Leu-Phe Chemical compound CC(C)C[C@H](NC(=O)[C@@H](N)C(C)C)C(=O)N[C@@H](Cc1ccccc1)C(O)=O ZZGPVSZDZQRJQY-ULQDDVLXSA-N 0.000 description 1
- RWOGENDAOGMHLX-DCAQKATOSA-N Val-Lys-Ala Chemical compound C[C@@H](C(=O)O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C(C)C)N RWOGENDAOGMHLX-DCAQKATOSA-N 0.000 description 1
- OJPRSVJGNCAKQX-SRVKXCTJSA-N Val-Met-Arg Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCN=C(N)N)C(=O)O)N OJPRSVJGNCAKQX-SRVKXCTJSA-N 0.000 description 1
- JVGHIFMSFBZDHH-WPRPVWTQSA-N Val-Met-Gly Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)NCC(=O)O)N JVGHIFMSFBZDHH-WPRPVWTQSA-N 0.000 description 1
- WSUWDIVCPOJFCX-TUAOUCFPSA-N Val-Met-Pro Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CCSC)C(=O)N1CCC[C@@H]1C(=O)O)N WSUWDIVCPOJFCX-TUAOUCFPSA-N 0.000 description 1
- LJSZPMSUYKKKCP-UBHSHLNASA-N Val-Phe-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H](C)C(O)=O)CC1=CC=CC=C1 LJSZPMSUYKKKCP-UBHSHLNASA-N 0.000 description 1
- NZGOVKLVQNOEKP-YDHLFZDLSA-N Val-Phe-Asn Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CC(=O)N)C(=O)O)N NZGOVKLVQNOEKP-YDHLFZDLSA-N 0.000 description 1
- KISFXYYRKKNLOP-IHRRRGAJSA-N Val-Phe-Ser Chemical compound CC(C)[C@@H](C(=O)N[C@@H](CC1=CC=CC=C1)C(=O)N[C@@H](CO)C(=O)O)N KISFXYYRKKNLOP-IHRRRGAJSA-N 0.000 description 1
- YKNOJPJWNVHORX-UNQGMJICSA-N Val-Phe-Thr Chemical compound CC(C)[C@H](N)C(=O)N[C@H](C(=O)N[C@@H]([C@@H](C)O)C(O)=O)CC1=CC=CC=C1 YKNOJPJWNVHORX-UNQGMJICSA-N 0.000 description 1
- USLVEJAHTBLSIL-CYDGBPFRSA-N Val-Pro-Ile Chemical compound CC[C@H](C)[C@@H](C(O)=O)NC(=O)[C@@H]1CCCN1C(=O)[C@@H](N)C(C)C USLVEJAHTBLSIL-CYDGBPFRSA-N 0.000 description 1
- DOFAQXCYFQKSHT-SRVKXCTJSA-N Val-Pro-Pro Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N1[C@H](C(O)=O)CCC1 DOFAQXCYFQKSHT-SRVKXCTJSA-N 0.000 description 1
- QSPOLEBZTMESFY-SRVKXCTJSA-N Val-Pro-Val Chemical compound CC(C)[C@H](N)C(=O)N1CCC[C@H]1C(=O)N[C@@H](C(C)C)C(O)=O QSPOLEBZTMESFY-SRVKXCTJSA-N 0.000 description 1
- DEGUERSKQBRZMZ-FXQIFTODSA-N Val-Ser-Ala Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](C)C(O)=O DEGUERSKQBRZMZ-FXQIFTODSA-N 0.000 description 1
- MNSSBIHFEUUXNW-RCWTZXSCSA-N Val-Thr-Arg Chemical compound CC(C)[C@H](N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@H](C(O)=O)CCCN=C(N)N MNSSBIHFEUUXNW-RCWTZXSCSA-N 0.000 description 1
- JAIZPWVHPQRYOU-ZJDVBMNYSA-N Val-Thr-Thr Chemical compound C[C@H]([C@@H](C(=O)N[C@@H]([C@@H](C)O)C(=O)O)NC(=O)[C@H](C(C)C)N)O JAIZPWVHPQRYOU-ZJDVBMNYSA-N 0.000 description 1
- ZLNYBMWGPOKSLW-LSJOCFKGSA-N Val-Val-Asp Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](CC(O)=O)C(O)=O ZLNYBMWGPOKSLW-LSJOCFKGSA-N 0.000 description 1
- NLNCNKIVJPEFBC-DLOVCJGASA-N Val-Val-Glu Chemical compound CC(C)[C@H](N)C(=O)N[C@@H](C(C)C)C(=O)N[C@H](C(O)=O)CCC(O)=O NLNCNKIVJPEFBC-DLOVCJGASA-N 0.000 description 1
- JVGDAEKKZKKZFO-RCWTZXSCSA-N Val-Val-Thr Chemical compound C[C@H]([C@@H](C(=O)O)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)N)O JVGDAEKKZKKZFO-RCWTZXSCSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 108010008685 alanyl-glutamyl-aspartic acid Proteins 0.000 description 1
- 108010024078 alanyl-glycyl-serine Proteins 0.000 description 1
- 108010069020 alanyl-prolyl-glycine Proteins 0.000 description 1
- 108010041407 alanylaspartic acid Proteins 0.000 description 1
- 108010008355 arginyl-glutamine Proteins 0.000 description 1
- 108010029539 arginyl-prolyl-proline Proteins 0.000 description 1
- 108010084758 arginyl-tyrosyl-aspartic acid Proteins 0.000 description 1
- 108010062796 arginyllysine Proteins 0.000 description 1
- 108010060035 arginylproline Proteins 0.000 description 1
- 108010077245 asparaginyl-proline Proteins 0.000 description 1
- 108010069205 aspartyl-phenylalanine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 108010068265 aspartyltyrosine Proteins 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 239000012141 concentrate Substances 0.000 description 1
- 238000000432 density-gradient centrifugation Methods 0.000 description 1
- 230000030609 dephosphorylation Effects 0.000 description 1
- 238000006209 dephosphorylation reaction Methods 0.000 description 1
- 108010054813 diprotin B Proteins 0.000 description 1
- 239000012154 double-distilled water Substances 0.000 description 1
- 230000002255 enzymatic effect Effects 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 238000000855 fermentation Methods 0.000 description 1
- 230000004151 fermentation Effects 0.000 description 1
- 235000013373 food additive Nutrition 0.000 description 1
- 239000002778 food additive Substances 0.000 description 1
- 108010006664 gamma-glutamyl-glycyl-glycine Proteins 0.000 description 1
- 108091008053 gene clusters Proteins 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010078144 glutaminyl-glycine Proteins 0.000 description 1
- 108010085059 glutamyl-arginyl-proline Proteins 0.000 description 1
- 108010079547 glutamylmethionine Proteins 0.000 description 1
- JYPCXBJRLBHWME-UHFFFAOYSA-N glycyl-L-prolyl-L-arginine Natural products NCC(=O)N1CCCC1C(=O)NC(CCCN=C(N)N)C(O)=O JYPCXBJRLBHWME-UHFFFAOYSA-N 0.000 description 1
- 108010072405 glycyl-aspartyl-glycine Proteins 0.000 description 1
- 108010062266 glycyl-glycyl-argininal Proteins 0.000 description 1
- 108010085109 glycyl-histidyl-arginyl-proline Proteins 0.000 description 1
- 108010033719 glycyl-histidyl-glycine Proteins 0.000 description 1
- 108010082286 glycyl-seryl-alanine Proteins 0.000 description 1
- 108010045126 glycyl-tyrosyl-glycine Proteins 0.000 description 1
- 108010010147 glycylglutamine Proteins 0.000 description 1
- 108010087823 glycyltyrosine Proteins 0.000 description 1
- 108010025306 histidylleucine Proteins 0.000 description 1
- 108010092114 histidylphenylalanine Proteins 0.000 description 1
- 108010085325 histidylproline Proteins 0.000 description 1
- 238000009396 hybridization Methods 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 108010031424 isoleucyl-prolyl-proline Proteins 0.000 description 1
- 108010027338 isoleucylcysteine Proteins 0.000 description 1
- 108010078274 isoleucylvaline Proteins 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 108010076756 leucyl-alanyl-phenylalanine Proteins 0.000 description 1
- 108010044311 leucyl-glycyl-glycine Proteins 0.000 description 1
- 108010012058 leucyltyrosine Proteins 0.000 description 1
- 108010025153 lysyl-alanyl-alanine Proteins 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 108700023046 methionyl-leucyl-phenylalanine Proteins 0.000 description 1
- 108010024654 phenylalanyl-prolyl-alanine Proteins 0.000 description 1
- 108010018625 phenylalanylarginine Proteins 0.000 description 1
- 108010012581 phenylalanylglutamate Proteins 0.000 description 1
- 108010051242 phenylalanylserine Proteins 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 108700042769 prolyl-leucyl-glycine Proteins 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 108010015796 prolylisoleucine Proteins 0.000 description 1
- 108010090894 prolylleucine Proteins 0.000 description 1
- 238000000746 purification Methods 0.000 description 1
- 238000009790 rate-determining step (RDS) Methods 0.000 description 1
- 230000014493 regulation of gene expression Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 230000000717 retained effect Effects 0.000 description 1
- 239000000523 sample Substances 0.000 description 1
- 108010071207 serylmethionine Proteins 0.000 description 1
- 241000894007 species Species 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000002194 synthesizing effect Effects 0.000 description 1
- 238000013518 transcription Methods 0.000 description 1
- 230000035897 transcription Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
- 230000014616 translation Effects 0.000 description 1
- 108010058119 tryptophyl-glycyl-glycine Proteins 0.000 description 1
- 108010045269 tryptophyltryptophan Proteins 0.000 description 1
- 241001624918 unidentified bacterium Species 0.000 description 1
- 108010003885 valyl-prolyl-glycyl-glycine Proteins 0.000 description 1
- 108010015385 valyl-prolyl-proline Proteins 0.000 description 1
- 108010009962 valyltyrosine Proteins 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 150000003722 vitamin derivatives Chemical class 0.000 description 1
Images
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/11—DNA or RNA fragments; Modified forms thereof; Non-coding nucleic acids having a biological activity
- C12N15/52—Genes encoding for enzymes or proenzymes
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N1/00—Microorganisms, e.g. protozoa; Compositions thereof; Processes of propagating, maintaining or preserving microorganisms or compositions thereof; Processes of preparing or isolating a composition containing a microorganism; Culture media therefor
- C12N1/20—Bacteria; Culture media therefor
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
- C12N15/09—Recombinant DNA-technology
- C12N15/63—Introduction of foreign genetic material using vectors; Vectors; Use of hosts therefor; Regulation of expression
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12P—FERMENTATION OR ENZYME-USING PROCESSES TO SYNTHESISE A DESIRED CHEMICAL COMPOUND OR COMPOSITION OR TO SEPARATE OPTICAL ISOMERS FROM A RACEMIC MIXTURE
- C12P17/00—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms
- C12P17/18—Preparation of heterocyclic carbon compounds with only O, N, S, Se or Te as ring hetero atoms containing at least two hetero rings condensed among themselves or condensed with a common carbocyclic ring system, e.g. rifamycin
Landscapes
- Health & Medical Sciences (AREA)
- Genetics & Genomics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Engineering & Computer Science (AREA)
- Chemical & Material Sciences (AREA)
- Organic Chemistry (AREA)
- Zoology (AREA)
- Wood Science & Technology (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Microbiology (AREA)
- Molecular Biology (AREA)
- Biochemistry (AREA)
- General Health & Medical Sciences (AREA)
- Plant Pathology (AREA)
- Biophysics (AREA)
- Physics & Mathematics (AREA)
- Chemical Kinetics & Catalysis (AREA)
- General Chemical & Material Sciences (AREA)
- Medicinal Chemistry (AREA)
- Tropical Medicine & Parasitology (AREA)
- Virology (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Enzymes And Modification Thereof (AREA)
Abstract
含来自鞘氨醇单胞菌属微生物的在生物素生物合成中涉及的基因的DNA片段,包含该DNA片段的质粒,包含该质粒的生物素合成转化子。提供了利用来自鞘氨醇单胞菌属微生物的在生物素生物合成中涉及的基因,用基因工程方法培育生物素合成微生物的技术。
Description
本申请是1998年3月2日申请的中国发明专利申请“含生物素生物合成基因的DNA片段及其利用”(申请号988002272)的分案申请。
技术领域
本发明涉及含至少一个生物素生物合成中有关基因的DNA片段及其利用。
背景知识
生物素是人类、动物、植物和一些微生物的必需维生素,并且被用作人及动物的食物添加剂。至于利用微生物生产生物素的方法,现已知利用链霉菌及其小单孢菌(JP-B-41-21756)的方法,利用掷孢酵母属(JP-B-42-3074)的方法,利用杆菌、放线菌或假单孢菌(JP-A-56-160998)的方法,利用鞘氨醇单胞菌属(JP-A-133790)的方法等。也有人提议通过基因工程技术将在生物素生物合成中涉及的从能够合成生物素的微生物中分离的基因导入另一种微生物,以增加生物素生物合成中有关基因的表达的培养微生物的方法,由此,能催化生物素生物合成反应的酶的活性提高了,从而改进了生物素的产量(JP-A-61-202686,JP-A-2-27980,JP-A-7-231789,等)。
在微生物细胞内的生物素生物合成中涉及的基因,现已知的有来自大肠杆菌(大肠杆菌)的bioA,bioB,bioF,bioD,bioC,和bioH基因〔生物化学杂志,Vol.263,19577-19585(1988)〕。bioA基因编码具有7,8-二氨基壬酸氨酸转移酶活性的酶。bioB基因编码具有生物素合成酶活性的酶。bioF基因编码具有7-酮-8-氨基壬酸合成酶活性的酶。bioD基因编码具有脱硫生物素合成酶活性的酶。bioC基因参于生物素生物合成途径中庚二酰辅酶A上游的生物合成阶段。bioH基因的功能不清楚。在大肠杆菌的生物合成途径中,细胞内的庚二酰辅酶A由7-酮-8-氨基壬酸合成酶催化生成7-酮-8-氨基壬酸,然后由7,8-二氨基壬酸氨基转移酶催化7-酮-8-氨基壬酸生成7,8-二氨基壬酸,脱硫生物素合成酶再催化二氨基壬酸生成脱硫生物素,最后脱硫生物素由生物素合成酶催化生成生成素,由此生物素得到合成。当bioC基因缺失时,生物素的产量下降,因此,人们认为bioC基因编码一种酶,该酶具有催化庚二酰辅酶A上游生物合成阶段中某一反应的活性〔“发酵和工业”,46,102-111(1988)〕。来自大肠杆菌的bioA,bioB,bioF,bioD,bioC,bioH基因的碱基序列已得到说明。现已知bioA,bioB,bioF,bioD,bioC基因形成一个操纵子,其转录由操纵基因控制。
关于来自除埃希氏菌属以外其它属的微生物的生物素生物合成中涉及的基因,已有报道来自粘质沙雷氏菌的基因(基因库资料,录入号:D17468)及来自枯草杆菌的基因(JP-A-7-231789)。每一个这些基因的碱基序列已得到说明。虽然这些基因的碱基序列与来自大肠杆菌的基因不同,但前述基因的基因产物的功能和生物素生物合成途径与后述基因(大肠杆菌)基本相同。另一方面,来自球状杆菌的生物素生物合成中涉及的基因也已有报道〔Ohsawa等,基因80,39-48(1989),Gloeckler等,基因87,63-70(1990)〕。球状杆菌的基因在以下几个方面与大肠杆菌的不同:在庚二酰辅酶A上游生物合成阶段中涉及的基因,bio基因簇的形成及基因顺序等〔Gloeckler等,基因87,63-70(1990)〕。
可是,关于来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因,它们的碱基序列和基因产物的结构或功能完全不为人们所知。因此,还没有利用来自鞘氨醇单胞菌属的生物素生物合成中涉及的基因,通过基因工程培育生产生物素的微生物这方面的技术。
在这种情况下,本发明的发明者们认真地研究并随后发现:通过分离包含来自鞘氨醇单胞菌属微生物的生物素生物合成中牵涉的基因的DNA片段,将该DNA片段插入到载体中,然后将载体导入宿主细胞来制备用于生产生物素或生物素前体的转化子。至此,本发明得到完成。
发明内容
本发明有下列内容
1)包含来自鞘氨醇单胞菌属微生物的生物素生物合成中牵涉的基因的DNA片段。
2)根据上述第1项所述的DNA片段,其中的基因选自于一组基因,这组基因由7-酮-8-氨基壬酸合成酶基因、7,8-二氨基壬酸氨基转移酶基因,脱硫生物素合成酶基因,生物素合成酶基因,以及编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应活性的酶的基因组成。
3)根据上述第1项所述的DNA片段,其中的基因是编码7-酮-8-氨基壬酸合成酶的基因。
4)包含编码一种蛋白的基因的DNA片段,该蛋白具有7-酮-8-氨基壬酸合成酸活性,并具有如SEQ ID NO:1所示的氨基酸序列,或者在如SEQ ID NO:1所示氨基酸序列中经过缺失,取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列。
5)包含编码一种蛋白的基因的DNA片段,该蛋白具有7-酮-8-氨基壬酸合成酶活性,并具有如SEQ ID NO:2所示的氨基酸序列,或者在如SEQ ID NO:2所示氨基酸序列中经过缺失,取代,修饰、或插入一个或多个氨基酸而得到的氨基酸序列。
6)包含具有如SEQ ID NO:3,4或5所示的碱基序列的基因的DNA片段。
7)根据上述第1项所述的DNA片段,其中的基因是编码7,8-二氨基壬酸氨基转移酶的基因。
8)包含编码一种蛋白的基因的DNA片段,该蛋白具有7,8-二氨基壬酸氨基转移酶活性,其氨基酸序列如SEQ ID NO:6所示,或者是在SEQ ID NO:6所示氨基酸序列中经过缺失、取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列。
9)包含编码一种蛋白的基因的DNA片段,该蛋白具有7,8-二氨基壬酸氨基转移酶活性,其氨基酸序列如SEQ ID NO:7所示,或者是在SEQ ID NO:7所示氨基酸序列中经过缺失,取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列。
10)包含具有如SEQ ID NO:8或9所示碱基序列的基因的DNA片段。
11)根据上述第1项所述的DNA片段,其中的基因是编码脱硫生物素合成酶的基因。
12)包含编码一种蛋白的基因的DNA片段,该蛋白具有脱硫生物素合成酶活性,其氨基酸序列如SEQ ID NO:10所示,或者是在SEQ ID
NO:10所示氨基酸序列中经过缺失、取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列。
13)包含编码一种蛋白的基因的DNA片段,该蛋白具有脱硫生物素合成酶活性,其氨基酸的序列如SEQ ID NO:11所示,或者是在如SEQ
ID NO:11所示氨基酸序列中经过缺失,取代,修饰或插入一个或多个氨基酸而得到的氨基酸序列。
14)包含具有SEQ ID NO:12或13所示碱基序列的基因的DNA片段。
15)根据上述第1项所述的DNA片段,其中的基因是编码生物素合成酶的基因。
16)包含编码一种蛋白的基因的DNA片段,该蛋白具有生物素全成酶活性,其氨基酸序列如SEQ ID NO:14所示,或者是在如SEQ IDNO:14所示氨基酸序列中经过缺失,取代,修饰或插入一个或多个氨基酸而得到的氨基酸序列。
17)包含编码一种蛋白的基因的DNA片段,该蛋白具有生物素合成酶活性,其氨基酸序列如SEQ ID NO:15所示,或者是在SEQ ID NO:15所示的氨基酸序列中经过缺失、取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列。
18)包含编码一种蛋白的基因的DNA片段,该蛋白具有生物素合成酶活性,其氨基酸序列如SEQ ID NO:18所示。
19)包含具有如SEQ ID NO:16,17或19所示碱基序列的基因的DNA片段。
20)根据上述第1项所述的DNA片段,其中的基因编码具有催化生物素生物合成途径中庚二酰辅酶A上游生物合成阶段某一反应的活性的酶。
21)包含编码一种蛋白的基因的DNA片段,该蛋白具有催化生物素生物合成途径中庚二酰辅酶A上游生物合成阶段某一反应的活性,其氨基酸序列如SEQ ID NO:20所示,或者是在SEQ ID NO:20所示氨基酸序列中经过缺失,取代,修饰或插入一个或多个氨基酸而得到的氨基酸序列。
22)包含编码一种蛋白的基因的DNA片段,该蛋白具有催化生物素生物合成途径中庚二酰辅酶A上游生物合成阶段某一反应的活性。其氨基酸序列如SEQ ID NO:21所示,或者是在SEQ ID NO:21所示氨基酸序列中经过缺失,取代,修饰或插入一个或多个氨基酸而得到的氨基酸序列。
23)包含具有如SEQ ID NO:22或23所示碱基序列的基因的DNA片段。
24)包含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的部分碱基序列的DNA片段。
25)根据上述第24项所述的DNA片段,其中的基因是:7-酮-8-氨基壬酸合成酶基因,7,8-二氨基壬酸氨基转移酶基因,脱硫生物素合成酶基因,生物素合成酶基因,或编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应活性的酶的基因。
26)包含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的蛋白编码区的DNA片段。
27)根据上述第26项所述的DNA片段,其中的基因是:7-酮-8-氨基壬酸合成酶基因,7,8-二氨基壬酸氨基转移酶基因,脱硫生物素合成酶基因,生物素合成酶基因,或编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应的活性的酶的基因。
28)包含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的基因表达调控区域的DNA片段,该区域位于蛋白编码区的上游。
29)根据上述第28项所述的DNA片段,其中所述的基因是7-酮-8-氨基壬酸合成酶基因,7,8-二氨基壬酸氨基转移酶基因,脱硫生物素合成酶基因,生物素合成酶基因,或编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应的活性的酶的基因。
30)具有如SEQ ID NO:24或25所述碱基序列的DNA片段。
31)根据上述第1、2、3、7、11、15、20、24、25、26、27、28和29项中任何一项所述的DNA片段,其中属于鞘氨醇单胞菌属的微生物是少动鞘氨醇单胞菌JCM 7511或鞘氨醇单胞菌属种类SC42405。
32)包含根据上述第1到31项中任何一项所述的DNA片段的载体。
33)一种构建载体的方法,包括将根据第1到31项中任何一项所述的DNA片段插入能在宿主细胞中复制的载体。
34)根据第32项所述的载体,其中的基因表达调控区域连入蛋白编码区的上游。
35)含有至少一个导入宿主细胞的根据上述第1到31项中任何一项所述的DNA片段或根据上述第32或34项所述的载体的转化子。
36)根据上述第35项所述的转化子,其中的宿主细胞是微生物。
37)一种构建转化子的方法,包括将根据第32或34项所述的载体导入宿主细胞。
附图简述
图1表示质粒pJAB2的结构及限制性图谱
图2表示质粒pJAW的结构及限制性图谱
图3表示质粒pJA41的结构及限制性图谱
图4表示质粒pSP302的结构及限制性图谱
图5表示质粒pSP304的结构及限制性图谱
图6表示质粒pSS301的结构及限制性图谱
图7表示质粒pSS305的结构及限制性图谱
图8表示质粒pSS304的结构及限制性图谱
图9表示质粒pSS306的结构及限制性图谱
本发明优选的实施方案
在本说明书中,来自鞘氨醇单胞菌属微生物的在生物素生物合成中涉及的基因是指编码在鞘氨醇单胞菌属微生物的细胞中生物素生物合成中涉及的酶的基因。所述的基因包括,例如,7-酮-8-氨基壬酸合成酶基因(下文称为“本发明bioF基因”),7,8-二氨基壬酸氨基转移酶基因(下文称为“本发明bioA基因”),脱硫生物素合成酶基因(下文称为“本发明bioD基因”),生物素合成酶基因(下文称为“本发明bioB基因”),以及编码在生物素生物合成途径中具有催化庚二酰辅酶A上游生物合成阶段某一反应的活性的酶的基因(下文称为“本发明bioC基因”)。
本发明bioF基因包括,例如,来自鞘氨醇单胞菌属微生物,含编码7-酮-8-氨基壬酸合成酶的约1.1-1.2kbp区域的基因。其更特定的例子是编码具有7-酮-8-氨基壬酸合成酶活性,并具有如SEQ ID NO:1或2所示氨基酸序列或由SEQ ID NO:1或2所示氨基酸序列经过缺失,取代、修饰或插入一个或多个氨基酸而得到的氨基酸序列的蛋白的基因;具有如SEQ ID NO:3,4或5所述的碱基序列的7-酮-8-氨基壬酸酶合成酶基因。
本发明bioA基因包括,例如来自鞘氨醇单胞菌属微生物,含编码7,8-二氨基壬酸氨基转移酶的约1.2-1.3kbp区域的基因。其更特定的例子是编码具有7,8-二氨基壬酸氨基转移酶活性,并具有氨基酸序列如SEQ ID NO:6或7所示氨基酸序列或由SEQ ID NO:6或7所示氨基酸序列经过缺失,取代,修饰或插入一个或多个氨基酸后得到的氨基酸序列的蛋白的基因;具有如SEQ ID NO:8或9所示碱基序列的7,8-二氨基壬酸氨基转移酶基因。
本发明bioD基因包括,例如,来自鞘氨醇单胞菌属微生物,含编码脱硫生物素合成酶的约0.6kbp区域的基因。更特定的例子是编码下述蛋白的基因,该蛋白具有脱硫生物素合成酶活性,其氨基酸序列如SEQID NO:10或11所示或由SEQ ID NO:10或11所示氨基酸序列经过缺失,取代,修饰或插入一个或多个氨基酸后得到的氨基酸序列;和具有SEQ ID NO:如12或13所示碱基序列的脱硫生物素合成酶基因。
本发明bioB基因包括,例如,来自鞘氨醇单胞菌属微生物,含编码生物素合成酶的约1.0-1.1kbp区域的基因。更特定的例子是编码下述蛋白的基因,该蛋白具有生物素合成酶活性,其氨基酸序列如SEQ ID NO:14,15或18所示或由SEQ ID NO:14,15,18所示氨基酸序列经过缺失,取代,修饰或插入一个或多个氨基酸后得到的氨基酸序列;和具有如SEQ ID NO:16,17或19的所示碱基序列的生物素合成酶基因。
本发明bioC基因包括,例如,来自鞘氨醇单胞菌属微生物,含编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应活性的酶的约0.8-0.9kbp的基因。更特定的例子是编码下述蛋白的基因,该蛋白具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应的活性,其氨基酸序列如SEQ ID NO:20或21所示或由SEQ ID NO:20或21所示氨基酸序列经过缺失、取代、修饰或插入一个或多个氨基酸后得到的氨基酸序列;和具有如SEQ ID NO:22或23所示碱基序列的编码具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应活性的酶的基因。
用于从中分离任何本发明bioF,bioA,bioD,bioB和bioC基因的微生物可以是从自然界分离的菌株或通过将突变导入上述分离菌株得到的菌株,只要它是鞘氨醇单胞菌属生物素合成菌,即,它有在生物素生物合成中涉及的基因。上述微生物包括,例如,在JP-A-133790中描述过的少动鞘氨醇单胞菌JCM 7511菌株和鞘氨醇单胞菌属种类SC42405菌株,少动鞘氨醇单胞菌JCM 7511菌株以可分装的状态贮存于生理和生化研究所的生物株系贮存设备中。鞘氨醇单胞菌种类SC42405株系在布达佩斯条约的规定下贮存在国际工业贸易部,工业科学和技术社,国立生命科学和人体技术研究所(1-3,Higashi-1-Chome,Tsukuba-shi,Ibaraki,Japan),贮存录入号为FERM-BP3995,并已在美国专利号5,432,067中公开。
以下给出从鞘氨醇单胞菌属生物素合成细菌制备含生物素生物合成中涉及基因的DNA片段的方法的例子。
首先,分离鞘氨醇单胞菌属微生物的基因组DNA,例如,通过在Biochemica.Biophysica.Acta.,Vol.72,619-629(1963),等中描述的常规基因组DNA提取方法。用合适的限制性酶如Sau 3AI部分酶切分离的基因组DNA,接着将所得的DNA片段插入合适的载体以制备基因组DNA文库。至于此实验中所用的载体,只要能在基因组DNA文库要导入的菌株中复制和繁殖的任何载体都可使用。载体可包括,例如,质粒,噬菌体,和粘粒。
关于从这样制备的基因组DNA文库中分离和鉴定生物素生物合成中涉及基因的方法,可提及的一种方法是,将基因组DNA文库导入缺少生物素生物合成中涉及基因并需要生物素的基因缺失突变株,然后从得到的转化子中筛选能够重新合成生物素的菌株。至于该方法中用到的生物素依赖型突变株,任何菌株,只要鞘氨醇单胞菌属微生物的DNA片段导入此菌株的细胞后能够表达,都可使用。这样的突变株可以通过,例如,在Proceeding of the National Academy of Sciences U.S.A.,Vol.69,2219(1972),细菌学杂志,Vol.115,662(1973),等中描述的常规方法来制备。关于将基因组DNA文库导入生物素依赖型突变株的方法,可以提到多种常规方法,如用CaCl2处理细胞的方法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕,电穿孔法〔分子生物学常用方法,Vol.1,John Wiley&Sons.Inc.ISBNO-471-50338-X(1987)〕,等。
将所获得的生物素依赖型突变株的转化子培养在不含生物素的合适的选择性培养基中,筛选能够生长的转化子。这样选择出的转化子可能是带有含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因的DNA插入片段的载体的菌株。
例如,当载体是质粒或粘粒时,重组载体通过常规方法如碱裂解法或煮沸法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕从上述转化子中提取。当载体是噬菌体时,重组载体通过常规方法如密度梯度离心或离子交换层析〔分子生物学常用方法,Vol.1,John Wiley&Sons.Inc.ISBNO-471-50338-X(1987)〕从转化子中提取。提出的重组载体的碱基序列通过Sanger双脱氧链终止法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕来分析。这样就可确定插入到载体中DNA片段的碱基序列。
当与下述每个已知基因有高度同源性的编码蛋白的区域从已确定的碱基序列中找到的500bp或更多的开放阅读框中筛选出来时,其可以列为鞘氨醇单胞菌属细菌生物素生物合成涉及的基因:当DNA片段通过使用bioF缺失突变株而获得时,该基因即为已知的bioF基因;当DNA片段通过使用bioA缺失突变株而获得时,该基因即为已知的bioA基因;当DNA片段通过使用bioD缺失突变株而获得时,该基因即为已知的bioD基因;当DNA片段通过使用bioB缺失突变株而获得时,该基因即为已知的bioB基因;当DNA片段通过使用bioC缺失突变株而获得时,该基因即为已知的bioC基因。下面情况也是可能的。每一个编码区域用任何的限制性酶切割并制备含每一个编码区域的亚克隆。每一个亚克隆按下述方法导入基因缺失突变株中:当亚克隆是通过使用bioF缺失突变株得到的DNA片段,将其导入bioF缺失突变株;当亚克隆是通过使用bioA缺失突变株得到的DNA片段,将其导入bioA缺失突变株;当亚克隆是通过使用bioD缺失突变株得到的DNA片段,将其导入bioD缺失突变株;当亚克隆是通过使用bioB缺失突变株得到的DNA片段,将其导入bioB缺失突变株;当亚克隆是通过使用bioC缺失突变株得到的DNA片段,其被导入bioC缺失突变株。研究含有这样导入的基因的突变株在合适的不含生物素的选择性培养基上的生长情况。由此保留在可生长的突变株中的亚克隆上所含的编码区使可确定为鞘氨醇单胞菌属细菌生物素生物合成中涉及的基因。
此外,这样获得的目的基因的功能可被改进,例如,通过在分子克隆,第二版,冷泉港实验室出版社(1989)等中所述的常规突变引入方法。突变可导入任何参与生物素生物合成的基因。为了提高生物素的产量,在编码生物素合成酶的bioB基因中引入突变尤其有效。因为bioB编码生物素合成酶,催化脱硫生物素转变为生物素,而这一步常常是生物素生物合成途径中的限速步骤。导入的突变可以是编码增加酶活性或提高蛋白稳定性的蛋白的编码区,或在促进基因表达的基因表达调控区中的突变。引入突变的基因如上所述通过导入生物素依赖型基因缺失突变株中表达,比较其与野生型基因补充基因缺失的能力。这样,就有可能筛选出有助于提高生物素产量的突变基因。有助于提高生物素产量的突变基因也可通过在微生物中表达引入突变的基因,与使用野生型基因的实验比较导入基因所编码酶的产量,酶活性,以及该酶催化的反应合成的化合物的量等等来筛选。具有在生物素生物合成途径中催化庚二酰辅酶A上游生物合成阶段某一反应活性的酶,7-酮-8-氨基壬酸合成酶,7,8-二氨基壬酸氨基转移酶,及脱硫生物素合成酶的活性可通过,例如,在Y.Izumi等,酶学方法,Vol.62,326-338(1979),等中描述的方法来测量。生物素合成酶的活性可通过,例如在Sanyal等,Archives of Biochemistry and Biophysics.,Vol.326,48-56(1996);A.Mejean等,生物化学和生物物理研究通讯,Vol.217,1231-1237(1995),等中描述的方法来测量。
上述导入了突变的基因的具体例子是具有如SEQ ID NO:19所示碱基序列的生物素合成酶基因。当与具有如SEQ ID NO:16所示碱基序列的生物素合成酶基因相比,如果以起始密码子(ATG)的A为第1位碱基,该基因中第-57位碱基和第706位碱基都是取代的。作为引入了突变的基因可以以具有如SEQ ID NO:5所述碱基序列的7-酮-8-氨基壬酸合成酶基因为例。当与具有如SEQ ID NO:4所示碱基序列的7-酮-8-氨基壬酸合成酶基因相比,如果以起始密码子(ATG)的A为第1位碱基,该基因中的第-11位碱基是取代的。
在本说明书中,术语“含至少一种来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的DNA片段”意思是该DNA片段中至少含一个在鞘氨醇单胞菌属微生物的细胞中编码生物素生物合成中涉及的酶的基因。该DNA片段可以是按前述方法从微生物中分离的DNA片段,或通过连接编码来自任何不同微生物菌株的参与生物素生物合成的酶的基因而制备的DNA片段,或者通过在前述基因引入突变而得到的基因。
本发明也提供含有来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因的部分碱基序列的DNA片段。该DNA片段用作,例如,在杂交方法中使用的探针或PCR方法中使用的引物。当该DNA片段用作引物在PCR方法中使用时,其碱基数一般优选大的以保证退火特异性。但是,随着碱基数目的增加,在PCR反应中引物自身易形成更高级结构,因而在某些情况下会降低退火的效率。因此,需要非常繁琐的操作来进行合成后纯化。考虑到这一缺点时,碱基数目优选不要太大。一般地,优选由含不超过50和不少于15个碱基的单链DNA组成的基因片段。这样的DNA片段具体例子是具有表1所示引物BF,BR,BF1,BR1,C1和C6的碱基序列中任何一种碱基序列的DNA片段。
表1PCR引物
引物 | 碱基序列 |
BF | 5′-ATTCTAGAACAGGACTATCAGGCACTCT-3′ |
引物 | 碱基序列 |
BR | 5′-TTTCTAGATTCCCCGCGATTGGCGATCA-3′ |
BF1 | 5′-AGCGGCCGAGGATGTGCTTAGGCTGCT-3′ |
BR1 | 5′-CCGTGCCCTTGACCGACACCAGCGCGT-3′ |
C1 | 5′-GCAAGCTTTGTCGCTGCCGCTCGTCATGCTGT-3′ |
C6 | 5′-CGCTCGAGATTCGCGCTTCCTGTTCCTGAC-3′ |
表2PCR引物
引物 | 碱基序列 |
BF4 | 5′-CGTGATGCTGCGCCTGCTCGGCCACAACAT-3′ |
BR4 | 5′-GCTCTAGACCTCATCGTCCCCCTGAACTTGTT-3′ |
F2 | 5′-GGACTAGTACCGGAATGACAGGCGGACA-3′ |
F3 | 5′-GCCTGCAGCAGAACGTGTGGTCGAAGCC-3′ |
CDA1 | 5′-ATCTGCAGTTGCGCGATGAGGAGGCCACCTTGC-3′ |
CDA6 | 5′-GCAAGCTTATGACGCCGCCTGCGCCTTCGACCA-3′ |
CDA3 | 5′-CTAAGCTTCGAGATCGACGGGGTGGAAATCGAT-3′ |
CDA7 | 5′-CGCTCGAGGGGAGAAGTCCTGGGGGATGATCCC-3′ |
来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因包含蛋白编码区和上游或下游的基因表达调控区。
含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的蛋白编码区的DNA片段可用于,例如,将上述区域连接到能在宿主细胞中作用的启动子上,及构建在宿主细胞中表达此基因的载体的步骤中。
基因表达调控区域是指位于蛋白编码区上游或下游的几十到几百个碱基对的区域,该区域对蛋白的基因表达调控有重要影响。特别地,位于蛋白编码区上游的启动子是重要的基因表达调控区。基因表达调控区的具体例子是以起始密码子(ATG)的A为第+1位碱基,含在SEQ IDNO:16中第-222位碱基到第-1位碱基序列的区域,以起始密码子(ATG)的A为第+1位碱基,含在SEQ ID NO:4中第-201位碱基到第-1位碱基序列的区域。
基因表达调控区可通过下面步骤来确定,首先引入突变比如点突度,缺失或插入到来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因蛋白编码区的上游或下游,然后在微生物中表达该基因,测量该蛋白编码酶的产量,酶的活性,此酶所催化反应产生的化合物量等等,最后找出引入突变后显著改变上述测量值的区域,也可以用编码易于测量上述值或酶活性的蛋白的基因取代上述基因的蛋白编码区来进行同样的实验。基因表达调控区可通过用上述方法来确定而得到。
通过例如用如前所述的PCR方法引入突变如取代,缺失或插入,所获得的基因表达调控区可以修饰成允许高水平基因表达的基因表达调控区。也可以通过从由于突变等原因提高了所需蛋白产量或酶活性的突变株分离基因来获得允许高水平基因表达的基因表达调控区。象这样的基因表达调控区,可以提及的是具有如SEQ ID NO:24和25所示碱基序列的基因表达调控区。
含上述来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因的基因表达调控区的DNA片段可用来,例如,构建在鞘氨醇单胞菌属微生物中表达这种基因的载体。
将这样获得的含来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因或该基因的部分碱基序列的DNA片段插入在宿主细胞中能复制的载体,由此得到用于将该基因或其一部分导入宿主细胞的载体。而且,基因表达调控区连到上述基因蛋白编码区的上游,接着插入载体中,由此在宿主细胞中表达该基因的载体得到构建。
这种情况下所用的DNA片段,可提及的有通过事先切割含按前述方法获得的来自鞘氨醇单胞菌属微生物的生物素合成中涉及基因的DNA片段而得到合适大小的具有方便将DNA片段连接到载体上的合适限制性酶位点的DNA片段;在没有合适的限制性酶切位点存在的情况下,通过PCR方法在生物素生物合成中涉及基因两端引入任意酶切位点,由此得到的DNA片段。
关于插入载体的基因,使用选自由上述本发明的bioF,bioA,bioD,bioB,bioC基因组成的一组基因中的至少一个基因就足够了。例如,可以将上述所有基因或一些基因插入载体。另外,可以插入多数特定基因以增加特定酶活性。如果需要,选择性标记基因,如对在下文所述的筛选转化子有用的药物抗性基因,用于与宿主基因组同源重组的基因组DNA可以与上述基因一起插入同样一个载体中。
至于连到蛋白编码区上游的基因表达调控区,只要具有在宿主细胞中调控基因表达功能的任何碱基序列都可使用。例如,当宿主细胞是鞘氨醇单胞菌属微生物细胞时,可提及的有如上述的来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及基因的基因表达调控区。当宿主细胞是大肠杆菌细胞时,可以利用商业上可得到的在大肠杆菌中具有基因表达调控活性的启动子。
至于DNA片段要插入其中的载体,只有能在宿主细胞例如,微生物细胞中复制的任何载体都可使用。例如,当宿主细胞是鞘氨醇单胞菌属微生物细胞时,可使用的有,分离地位在不相容质粒P组的RK2和由RK2衍生的质粒载体〔质粒,Vol.13,149-153(1985),细菌学杂志,Vol.167,604-610(1986)〕,分类地位在不相容质粒Q组的RSF1010,和由RSF1010衍生的质粒载体〔基因,Vol.16,237-247(1981)〕,等。当宿主细胞是大肠杆菌细胞时,可以使用的有商业上可得到的在细胞中能复制的质粒,噬菌体等。
将这样构建的含生物素生物合成中涉及基因的载体导入宿主细胞,例如,宿主微生物细胞制备转化子。
含生物素生物合成中涉及基因的DNA片段要导入的宿主细胞没有特别地限制,只要在该细胞中,导入的DNA片段能够由该细胞稳定地保存并且基因在该细胞中能够表达。关于宿主细胞,可提及的有鞘氨醇单胞菌属微生物细胞,大肠杆菌等。
至于将载体导入宿主细胞的方法,可用常规基因工程的方法。例如,关于将载体导入微生物宿主的方法,可提及的有常规方法如氯化钙处理细胞的方法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕,电穿孔法〔分子生物学常规方法,Vol.1,John Wiley & Sons.Inc,ISBNO-471-50338-X(1987)〕,等。也可使用通过利用同源重组将所需DNA片段导入宿主细胞基因组的基因导入方法。例如,将含生物素生物合成中涉及的基因的DNA片段两端连上来自宿主细胞的基因组DNA片段,然后将连好的片段插入载体再将载体导入宿主细胞。当载体上的基因组DNA和宿主细胞的基因组DNA之间发生同源重组时,含生物素生物合成中涉及基因的DNA片段就被导入宿主细胞基因组中形成转化子。
按上述方法通过导入含生物素生物合成中涉及基因的DNA片段得到的转化微生物,可在选择性标记基因的表型的基础上有效选择,该选择性标记基因包含于载体中且与基因一起转入宿主细胞。例如,如果选择性标记基因是氨苄青霉素抗生基因,在基因转化后,细胞可在含氨苄青霉素的合适培养基上划线培养,然后可用钓菌法(hooking upmethod)分离长出的克隆,由此可获得转化子。这样,可获得导入了含生物素生物合成中涉及基因的DNA片段的转化子。该转化子可用于生产生物素,和作为生物系生物合成前体的7-酮-8-氨基壬酸,和7,8-二氨基壬酸和脱硫生物素。
下面参考实施例进一步解释了本发明的细节,但本发明并不受限于所举的实施例。
实施例1.分离生物素生物合成中涉及的基因(1-A)制备少动鞘氨醇单胞菌的基因组DNA
将-环少动鞘氨醇单胞菌JCM 7511接种于200毫升LB培养液(1%蛋白胨,0.5%酵母提取物,1%氯化钠)中,拿去在30℃下摇瓶培养15小时,离心(8000rpm,10分钟)收集对数生长期细菌细胞。收集的细胞重悬于20ml缓冲液A〔25%蔗糖,50mM Tris-HCl(pH 8,0)〕中,再加入2.5ml溶菌酶氯化物溶液(50mg/ml),37℃温育30分钟。然后,再加入2.5ml SDS溶液〔10%(v/v)和0.25ml EDTA溶液(0.5M)〕,得到的混合物在37℃温育16小时,温育的混合物中加入等体积TE饱和的酚,缓慢搅拌得到的混合物,然后离心(10,000rom,10分钟),然后回收上清来去除蛋白。上述去蛋白过程再重复5次。在回收的上清中加两倍于上清体积的无水乙醇以沉淀DNA,沉淀的DNA通过缠在玻璃棒上回收,用70%乙醇洗,风干后溶于20毫升TE缓冲液中,加入20微升RNA酶(10mg/ml),然后在37℃温育16小时。这样,得到了含大约21毫克少动鞘氨醇单胞菌JCM 7511基因组DNA的DNA溶液。(1-B)制备基因组DNA文库
在(1-A)中得到的43mg基因组DNA用15单位限制性酶I在37℃下处理2分钟以使其部分消化。经过部分消化得到的基因组DNA片段与通过用限制性酶BamH I切割pUC19质粒载体并去磷酸得到的DNA片段(可从TAKARA SHUZO有限公司得到)混合。通过使用连接反应试剂盒(可从TAKARA SHUZO有限公司得到),按照所附的操作手册,将通过部分消化得到的基因组DNA片段与pUC19质粒载体连接,以制备含不同DNA片段的重组质粒。
(1-C)筛选含生物素生物合成中涉及DNA片段的重组质粒
通过电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验室公司制造)将(1-B)中获得的重组质粒分别导入bioF缺失大肠杆菌突变株(R874),bioA缺失大肠杆菌突变株(R879),bioB缺失大肠杆菌突变株(R875),bioC缺失大肠杆菌突变株(R876)〔细菌学杂志,Vol.94,2065-2066(1972),细菌学杂志,Vol.112,830-839(1972)〕。由此处理后的菌株在不含生物素的选择性琼脂平板(1.48%Na2HPO4-7H2O,0.3%KH2PO4,0.05NaCl,0.1%NH4Cl,0.005%氨苄青霉素,1.5%琼脂)上划线,琼脂平板于37℃温育两天。挑取在培养基上生长并形成菌落的菌株,在37℃下LB培养基中温育16小时。用碱裂解法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕从这些菌株提取质粒,用限制性酶切割后观察琼脂糖凝胶电泳结果发现,导入每个缺失突变株的重组质粒包含了如表3所示大小的插入片段。
表3与生长缺陷型菌株互补的重组质粒
(1-D)含生物素生物合成中涉及基因的DNA片段的碱基序列分析
对(1-C)中获得的重组质粒pBC01,pBC02,pBC03,pBC04,pBC05,用下述方法通过使用千碱基序列缺失试剂盒(可从TAKARASHUZO有限公司得到)制备含不同大小插入片段的缺失克隆。
重组质粒pBC01,pBC02,pBC03,pBC04和pBC05各20毫克用以下酶切割:pBC01:Sma I和Kpn I,pBC02:pst I和Xba I,pBC03。pBC04和pBC05:Xba I和Sse 8387I。可用酚抽去酶,然后用乙醇沉淀DNA。所得的DNA溶于100μl外切核酸酶III缓冲液〔50mM Tris-HCl(pH 8.0),100mM NaCl,5mM MgCl2,10mM 2-巯基乙醇〕中,再加入180单位外切核酸酶III,所得的混合物混匀后在37℃温育。每隔1分钟取出10微升溶液与预冷的100微升绿豆核酸酶缓冲液〔40mM醋酸钠(pH 4.5),100mM NaCl,2mM ZnCl2,10%甘油〕混合,在65℃处理5分钟后使外切核酸酶III失活。这样所得的溶液冷却到37℃后再加入50单位绿豆核酸酶,温育60分钟。用酚抽提去酶,然后用乙醇沉淀DNA。所得的DNA溶于50微升klenow缓冲液〔7mM Tris-HCl(pH 7.5),0.1mM EDTA,20mM NaCl,7mM MgCl2,每种dNTP各0.1mM〕,加2单位klenow片段,接着在37℃温育15分钟,每10微升所得的溶液中加入100微升连接溶液A和12微升连接溶液B,然后在16℃温育1小时。然后用乙醇通过沉淀浓缩DNA,溶于5微升无菌水中,得到的溶液导入大肠杆菌JM 109,然后处理后的大肠杆菌JM 109在含氨苄青霉素(0.005%)的LB培养基琼脂平板(1%蛋白胨,0.5%酵母提取物,1%氯化钠,1.5%琼脂)上划线,平板在37℃温育16小时。从长出的菌落提取质粒并检测其插入DNA片段的大小。对于缺失克隆pBC01,有8个含插入DNA片段的克隆分别被筛选出来,大小从250bp到1.8kb,以约250bp间隔变动。对于缺失克隆pBC02,有12个含插入片段的克隆被挑出,大小从250bp到2.8kb之间,以约250bp间隔变动。对于缺失克隆pBC03,有6个含插入片段的克隆分别被筛选出来,大小从250bp到1.4kb之间,以约250bp间隔变动。对于缺失克隆pBC04,有6个含插入片段的克隆分别被挑出,大小从250bp到1.4kb之间变动,相差约为250bp。对于缺失克隆BC05,有15个含插入片段的克隆分别被挑出,大小在250bp到3.7kb之间,相差约为250bp。
每个缺失克隆通过碱裂解法提取质粒〔分子克隆,第二版,冷泉港实验室出版社(1989)〕。用30纳克提取物作模板和M13引物M4(可从TAKARA SHUZO有限公司得到)或M13引物RV(可从TAKARASHUZO有限公司得到)作引物,通过使用ABI棱晶体染料终止循环测序备好反应试剂盒(由Perkin-Elmer公司制造)进行测序反应,用自动碱基序列分析仪373A(由Perkin-Elmer公司制造)分析碱基序列。
为从重组质粒的插入片段的碱基序列中确定生物素生物合成中涉及的基因,从在片段中存在的500bp或更大的开放阅读框架中选出分别与已知bioF基因,bioA基因,bioD基因,bioB基因和bioC基因在碱基序列中有高度同源性的蛋白编码区,并被确定为分别对应于上述基因的鞘氨醇单胞菌属细菌的基因。pBC01含bioF(SEQ ID No:3),pBC02含bioD(SEQ ID NO:12)和bioA(SEQ ID NO:8),pBC03含bioD(SEQ ID NO:16),pBC04和pBC05各含bioD(SEQ ID No:22)。比较独立得到的pBC01和pBC02的插入片段的碱基序列,发现pBC01、pBC02插入片段的组合是基因组中连续的DNA。因此,很明显bioF,bioD和bioA形成一个操纵子。
实施例2分离生物素生物合成中涉及的基因
(2-A)制备鞘氨醇单胞菌属种类SC42405基因组DNA
接种一环鞘氨醇单胞菌属种类SC42405到200毫升LB培养液(1%蛋白胨,0.5%酵母提取物,1%氯化钠)中,拿去在30℃下摇瓶培养15小时,离心(8000rpm,10分钟)收集对数生长期的细菌细胞。收集的细胞重悬于20毫升缓冲液A〔25%蔗糖,50mM Tris-HCl(pH8.0)〕中,再加入2.5ml溶菌酶氯化物(50mg/ml)溶液,37℃温育30分钟。然后,再加入2.5ml SDS溶液〔10%(v/v)和0.25ml EDTA溶液(0.5M)〕。得到的混合物在37℃温育16小时。温育混合物中加入等体积的Tris饱和酚,轻轻混匀得到的混合物,然后离心(10,000rpm,10分钟),然后回收上清来去除蛋白。上述去蛋白过程再重复5次,在回收的上清中加两倍体积的乙醇以沉淀DNA。通过缠到玻璃棒上回收DNA,用70%乙醇洗,风干后溶于20ml TE缓冲液中,加入20微升RNA酶(10mg/ml),然后37℃温育16小时。这样,得到了含大约21mg鞘氨醇单胞菌属种类SC42405基因组DNA的DNA溶液。
(2-B)制备基因组DNA文库
(2-A)中获得的40毫克基因组DNA用15单位限制性酶Sau3AI在37℃下处理2.5分钟以部分消化。经过部分消化得到的基因组DNA片段与用限制性酶BamH I切割并脱磷酸的pUC19载体(可以TAKARASHUZO有限公司得到)混合。通过使用连接反应试剂盒(可从TAKARASHUZO有限公司得到),按所附的操作手册,将通过部分消化得到的基因组DNA片段与pUC19质粒载体连接,以制备含不同DNA片段的重组质粒。
(2-C)筛选含生物素生物合成中涉及的DNA片段的重组质粒
通过电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验室公司制造)将(2-B)中获得的重组质粒分别导入bioF缺失大肠杆菌突变株(R874),bioA缺失大肠杆菌突变株(R879),缺失bioB的大肠杆菌突变株(R875),bioC缺失大肠杆菌突变株(R876)〔细菌学杂志,Vol.94,2065-2066(1972),细菌学杂志,Vol.112,830-839(1972)〕。由此处理后的菌株在不含生物素的选择性琼脂平板平板(1.71%Na2HPO4-7H2O,0.3%KH2PO4,0.05NaCl,0.1%NH4Cl,0.005%氨苄青霉素,0.2mM IPTG,1.5%琼脂)上划线,琼脂平板在37℃中温育两天。挑取在培养基上生长并形成菌落的菌株,在37℃下LB培养基中温育16小时。用碱裂解法〔分子克隆,第二版,冷泉港实验室出版社(1989)〕从这些菌株中提取质粒,用限制性酶切割后观察琼脂糖凝胶电泳结果,发现导入每个缺失突变株的重组质粒包含了如表4所示大小的插入片段
表4与生长缺陷型菌株互补的重组质粒
(2-D)含生物素生物合成中涉及基因的DNA片段的碱基序列分析
对(2-C)中获得的重组质粒pBC11,pBC12,pBC13,pBC14和pBC15,用下述方法通过使用千碱基序列缺失试剂盒(可从TAKARASHUZO有限公司得到)制备含不同大小插入片段的缺失克隆。
20毫克重组质粒pBC11,pBC12,pBC13,pBC14,pBC15用以下酶切割;pBC11:XbaI和Sse8387 I,pBC12和pBC13:PstI和XbaI,pBC14和pBC15:XbaI和KpnI。可用酚抽提去酶,然后用乙醇沉淀DNA。所得的DNA溶于100微升外切核酸酶III缓冲液〔50mM Tris-HCl(pH 8.0),100mM NaCl,5mM MgCl2,10mM2-巯基乙醇〕中,再加入180单位外切核酸酶III,得到的混合物在37℃温育。每隔1分钟取出10微升溶液与预冷的100微升绿豆核酸酶缓冲液〔40mM醋酸钠(pH4.5),100mM NaCl,2mM ZnCl2,10%甘油〕混合,在65℃处理5分钟后使外切核酸酶III失活。这样所得的溶液冷却到37℃后再加入50单位绿豆核酸酶,温育60分钟。用酚抽提去酶,然后用乙醇沉淀DNA。所得的DNA溶于50微升klenow缓冲液〔7mM Tris-HCl(pH 7.5),0.1mMEDTA,20mM NaCl,7mM MgCl2,各0.1mM每种dNTP〕,加2单位klenow片段,接着在37℃温育15分钟,每10微升所得的溶液中加入100微升连接溶液A和12微升连接溶液B,然后在16℃温育1小时。然后用乙醇通过沉淀浓缩DNA,溶于5微升无菌水中,得到的溶液导入大肠杆菌JM109中。然后处理后的大肠杆菌JM109在含氨苄青霉素(0.005%)的LB培养基琼脂平板(1%蛋白胨,0.5%酵母提取物,1%氯化钠,1.5%琼脂)上划线,平板在37℃温育16小时。从长出的菌落中提取质粒并检测其插入DNA片段的大小。对于缺失克隆pBC11,有9个含插入片段的克隆分别被筛选出来。大小从250bp到2.3kb之间,以约250bp间隔变动。对于缺失克隆pBC12,有8个含插入片段的克隆分别被筛选出来,大小从250bp到2.2kbp之间,以约250bp间隔变动。对于缺失克隆pBC13,有10个含插入片段的克隆分别被筛选出来,大小从250bp到2.6kbp之间,以约250bp间隔变动。对于缺失克隆pBC14有8个含插入片段的克隆分别被筛选出来,大小在250bp到1.9kbp之间,以约250bp间隔变动。对于缺失克隆pBC15有8个含插入片段的克隆分别被筛选出来,大小从250bp到2.0kb之间,以约250bp间隔变动。
每个缺失克隆通过碱裂解法提取质粒〔分子克隆,第二版,冷泉港实验室出版社(1989)〕,用30纳克提取物作模板和M13引物M4(可从TAKARA SHUZO有限公司得到)或M13引物RV(从TAKARASHUZO有限公司得到)作引物,通过使用ABI棱晶体染料终止循环备好反应试剂盒(由Perkin-Elmer公司制造)进行测序反应,碱基序列用自动碱基序列分析仪373A(由Perkin-Elmer公司制造)来分析。
为从重组质粒的插入片段的碱基序列中确定生物素生物合成中涉及基因,从在片段中存在的500bp或更大的开放阅读框架中选出分别与已知的bioF基因,bioA基因,bioD基因,bioB基因和bioC基因在碱基序列中有高度同源性的蛋白编码区,并被确定为分别对应于上述基因的鞘氨醇单胞菌属细菌的基因。pBC11含bioF(SEQ ID NO:4),pBC12和pBC13都含bioD(SEQ ID NO:13)和bioA(SEQ IDNO:9),pBC14含bioB(SEQ ID NO:17),pBC15含bioC(SEQ ID NO:23)。比较独立得到的pBC12,pBC13和pBC15的插入片段的碱基序列,很明显,pBC12,pBC13,pBC15插入片段的组合在基因组中是连续的DNA,bioC基因的终止密码子TGA的TG和bioD基因的起始密码子ATG在碱基序列TG处重叠,bioC,bioD和bioA基因形成一个操纵子。
实施例3制备重组质粒pJAW
(E)制备质粒载体pJAβ2
1mg来源于质粒载体RK2〔细菌学杂志,Vol.162,604-614(1986)〕的质粒载体pJAJ7用限制性酶PstI和BamHI切割,所得片段的两端通过使用DNA补平试剂盒(可从TAKARA SHUZO公司得到)按所附的操作手册补平。这样处理后的片段通过琼脂糖凝胶电泳分开以分离大小约10kb的DNA片段。另外,2微克质粒载体pBlue sciipt Sk(+)(可从Stratagene Cloning Systems得到)用限制性酶Hae III切割。然后所得片段的两端通过使用DNA补平试剂盒(可从TAKARA SHUZO公司得到)按所附的操作手册补平。这样处理后的片段通过琼脂糖凝胶电泳分开以分离大小约0.7kb的DNA片段。混合这样得到的全部约10kb的DNA片段和全部约0.7kb的片段,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)根据所附的操作手册连接两种片段,所得的质粒命名为pJAβ2(如图1)。
(F)制备质粒pJAW
用实施例1,(1-A)中获得的基因组DNA作为模板和表5中
所示的BF和BR引物进行PCR反应〔反应组合物:10mM Tris-HCl(pH8.8),10mM KCl,0.002(v/v)%Tween 20,1.5mM MgCl2,40μM每种dNTP,20pmol每种引物,0.5到100纳克基因组DNA,3单位ULTTMDNA聚合酶(可从Perkin-Elmer公司得到)/100微升;反应循环:于97℃,2分钟的反应循环一次,分别于97℃,1分钟,于55℃,1分钟,然后72℃,1.5分钟的反应循环30次,并且于72℃,7分钟的反应循环一次〕。这样,得到全长含1325bp的DNA片段,该DNA片段从bioB编码区上游145bp到编码区下游154bp,并在1356bp序列两端各引入一个XbaI位点。用限制性酶XbaI切割该DNA片段,得到的DNA片段与通过用限制性酶XbaI切开质粒载体并脱磷的处理切开的质粒载体而得到的DNA片段混合,通过使用连接试剂盒(可从TAKARASHUZO公司得到),按所附的操作手册连接。这样获得的质粒命名为pJW(图2)。
表5PCR引物
引物 | 碱基序列 |
BF | 5′-ATTCTAGAACAGGACTATCAGGCACTCT-3′ |
BR | 5′-TTTCTAGATTCCCCGCGATTGGCGATCA-3′ |
BF1 | 5′-AGCGGCCGAGGATGTGCTTAGGCTGCT-3′ |
BR1 | 5′-CCGTGCCCTTGACCGACACCAGCGCGT-3′ |
C1 | 5′-GCAAGCTTTGTCGCTGCCGCTCGTCATGCTGT-3′ |
C6 | 5′-CGCTCGAGATTCGCGCTTCCTGTTCCTGAC-3′ |
实施例4制备导入了pJAW的转化子
通过按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验室公司制造)将实施例3中获得的质粒pJAW导入少动鞘氨醇单胞菌JCM 7511,得到JCM 7511/pJAW转化子。
实施例5制备重组质粒pJA41
用限制性内切酶Eco 52I和Bsp 1286I部分消化实施例3中获得的质粒pJAW后得到的DNA片段通过琼脂糖凝胶电泳分离以回收约11.8kbp的DNA片段。此片段由pJAW从第-72位碱在到第718位碱基缺失形成(在以bioB基因起始密码子ATG的A为第1位碱基的情况下)。另一方面,以实施例1,(1-A)中获得的基因组DNA为模板和表5所示引物BF1和BR1为引物进行PCR反应〔反应组合物:10mMTris-HCl(pH 8.8),10mM KCl,0.002(v/v)%Tween 20,1.5mM MgCl2,40μM每种dNTP,20pmol每种引物,0.5到100ng基因组DNA,3单位ULTmaTM DNA聚合酶(可从Perkin-Elmer公司得到)/100μl;反应循环:于97℃,1分钟的反应循环1次,分别于97℃,0.5分钟、60℃,1分钟、然后72℃1.5分钟的反应循环30次,于72℃7分钟的反应循环1次〕。这样,在以bioB基因起始密码子ATG的A为第1位碱基的情况下,含从第-75位碱基到721位碱基的碱基序列的DNA片段得到制备。然后,用限制性酶Eco 52I和Bsp 1286I部分消化该DNA片段,所获得的DNA片段通过琼脂糖凝胶电泳分离以回收约0.8kbp的DNA片段。分析回收的DNA片段的碱基序列以确定在SEQ ID NO:19中所示的第1到1336碱基的碱基序列。这样得到的DNA片段通过使用连接试剂盒(可从TAKARA SHOZO公司得到)按所附操作手册使它们互相连接。这样所获得的质粒命名为pJA41(图3)。
实施例6制备导入了pJA41的转化子
通过按电穿孔法(工作电压18kv/cm,电容25μF,电阻400C2)用基因脉冲发生器(由Bio-Rad公司制造)将实施例5中获得的质粒pJA41导入少动鞘氨醇单胞菌JCM7511,得到少动鞘氨醇单胞菌JCM7511/pJA41转化子。
实施例7少动鞘氨醇单胞菌JCM 7511/pJAW和JCM7511/pJA41的生
物素产量
分别接种一环少动鞘氨醇单胞菌JCM 7511/pJAW和少动鞘氨醇单胞菌JCM 7511/pJA41到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,0.005%四环素(pH 7.2)〕的小试管(18×50mm)中。作为对照,接种一环没有导入基因的少动鞘氨醇单胞菌JCM 7511到装有3毫升培养基〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O(pH 7.2)〕的小试管中。上述三种细菌在30℃培养2天(250rpm)以获得预培养菌液。然后,接种200微升这样得到的少动鞘氨醇单胞菌JCM 7511/pJAW和少动鞘氨醇单胞菌JCM 7511/pJA41预培养菌液到装有10毫升培养液〔4%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4-7H2O,0.001%FeSO4-7H2O,0.001%MnSO4-4~6H2O,0.000002%盐酸硫胺素,0.005%四环素(pH 7.0)〕的大试管(22×220mm)中。作为对照,接种200微升少动鞘氨醇单胞菌JCM 7511的预培养菌液到装有10毫升培养基〔4%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%Kcl,0.05%MgSO4-7H2O,0.001%FeSO4-7H2O,0.001%MnSO4-4-6H2O 0.000002%盐酸硫胺素(pH 7.0)〕的大试管(22×220mm)中。上述三种细菌在30℃培养4天(250rpm)。通过使用植物乳杆菌IFO 3070菌株的微生物学定量方法(Izumi和Yamada“维生素学实验方法II.水溶维生素”,p.481-499,日本维生素学学会,Tokyo Kagaku Dojin,1985)来确定每种培养菌液中合成和积累的生物素的浓度,并且得到的合成的生物素浓度如表6所示。
表6
菌株 | 生物素产量<sup>*</sup> | 生物素浓度(毫克/升) |
少动鞘氨醇单胞菌JCM7511 | 1 | 0.037 |
少动鞘氨醇单胞菌JCM7511/pJAW | 2.1 | 0.078 |
少动鞘氨醇单胞菌JCM7511/pJA41 | 6.9 | 0.260 |
*相对于未导入基因的菌株的生物素产量的数值
实施例8制备重组质粒pSP302
用限制性酶Pst I和BamH I切割实施例1中获得的质粒pBC01,所得的DNA片段通过琼脂糖凝胶电泳分离以获得含bioF基因大部分编码区和位于bioF基因上游控制bioF、bioD和bioA基因表达的区域的约1.4kbp DNA片段。这样所得的DNA片段和用限制性酶BamHI和PstI消化后的质粒载体pBluscript SKII(+)(可从Stratagene CloningSystems得到),混合,然后通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附的操作手册将它们相互连接,这样获得的质粒命名为pBC06。
另外,用限制性酶Pst I和EcoR I切割实施例1中获得的质粒pBC02,所得的DNA片段通过琼脂糖凝胶电泳分离以获得合bioF部分编码区,bioD和bioA编码区和位于bioA基因下游bioF,bioD和bioA的3′-非编码区的约2.2kbp DNA片段。这样所得的DNA片段与限制性酶Pst I和EcoR I消化的质粒pBC06混合,然后通过使用连接试剂盒(可从TAKARA SHOZO公司得到)按所附的操作手册将它们相互连接。这样获得的质粒命名为pSP105。
此外,用限制性酶BamH I和Hind III切割质粒pSP105,所得的DNA片段通过琼脂糖凝胶电泳分离以获得含bioF,bioD和bioA的编码区和bioF,bioD和bioA表达调控区的约3.6kbp DNA片段。这样所得的DNA片段与限制性酶BamH I和Hind III切割的质粒pJA41混合,然后通过使用连接试剂盒(可从TAKARA SHUZO公司得到),按所附的操作手册将它们相互连接。这样获得的质粒命名为pSP302(图4)。
实施例9制备导入了pSP302的转化子。
通过按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验室公司制造)将实施例8中获得的质粒pSP302导入少动鞘氨醇单胞菌JCM7511得到少动鞘氨醇胞菌JCM7511/pSP302转化子。
实施例10制备重组质粒pSP304
用实施例1,(1-A),中获得的基因组DNA作模板,表5中所示的C1和C6引物进行PCR反应,以制备全长含1435bp的DNA片段,该片段从bioC基因编码子上游387bp到编码区下游196bp,并且在1435bp序列的上游末端引入了Hind III位点,下游末端引入Xho I位点。用限制性酶Hind III和Xho I切割该DNA片段得到的片段和通过用限制性酶Hind III和Xho I切割质粒pSP302得到的DNA片段混合,然后通过使用连接试剂盒(可从TAKARA SHUZO公司得到),按所附操作手册将它们相互连接。这样获得的质粒命名为pSP304(图5)。
实施例11制备导入了pSP304的转化子
通过按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验室公司制造)将实施例10中获得的质粒pSP304导入少动鞘氨醇单胞菌JCM7511得到少动鞘氨醇单胞菌JCM7511/pSP304转化子。
实施例12少动鞘氨醇单胞菌JCM7511/pSP302和JCM7511/pSP304的生物素产量
分别接种一环少动鞘氨醇单胞菌JCM7511/pSP302和少动鞘氨醇单胞菌JCM7511/pSP304到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,0.005%四环素(pH 7.2)〕的小试管中(18×150mm)。作为对照,接种一环没有导入基因的少动鞘氨醇单胞菌JCM7511到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O(pH 7.2)〕的小试管(18×150mm)中。上述三种细菌在30℃培养2天(250rpm)以获得预培养菌液。然后各接种200微升这样得到的少动鞘氨醇单胞菌JCM7511/pSP302和少动鞘氨醇单胞菌JCM7511/pSP304预培养菌液到装有10毫升培养液〔4%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4-7H2O,0.001%FeSO4-7H2O,0.001%MnSO4-4-6H2O,0.005%四环素(pH 7.0)〕的大试管(22×220mm)中。作为对照,接种200微升少动鞘氨醇单胞菌JCM7511的预培养菌液到装有10毫升培养基〔4%甘油。2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4-7H2O,0.001%FeSO4-7H2O,0.001%MnSO4-4-6H2O,(pH 7.0)〕的大试管(22×220mm)中。上述三种细菌在30℃培养4天(250rpm)。通过使用植物乳杆菌IFO 3070菌株的微生物学定量方法(Izumi和Yamada“维生素学实验方法II。水溶维生素”,P481-499,日本维生素学学会,Tokyo kagaku Dojin,1985)来确定每种培养菌液中合成和积累的生物素的浓度,并得到合成的生物素浓度如表7所示。
表7少动鞘氨醇单胞菌JCM7511/pSP305的生物素产量
菌株 | 生物素产量<sup>*</sup> | 生物素浓度(毫克/升) |
少动鞘氨醇单胞菌JCM7511 | 1 | 0.12 |
少动鞘氨醇单胞菌JCM7511/pSP302 | 2.8 | 0.33 |
少动鞘氨醇单胞菌JCM7511/pSP304 | 8.6 | 1.0 |
*相对于没有导入基因的菌株的生物素产量的数值
实施例13制备重组质粒pSS301
用实施例2,(2-A),中获得的基因组DNA作模板和表8所示的BF4和BR4引物进行PCR反应〔反应组合物:1×Expand HF缓冲液,1.5mM MgCl2,300mM每种dNTP,300nM每种引物,0.5到100ng基因组DNA,2.6单位ExpandTM高可信度PCR系统酶混合物(可从Perkin-Elmer公司得到)/50微升体系;反应循环:于97℃,2分钟的反应循环1次,分别于97℃,15秒、60℃,30秒、然后72℃,1.5分钟的反应循环10次,97℃,15秒、60℃,30秒、然后72℃。1.5分钟(72℃1.5分钟的反应时间在一个循环中增加20秒)的反应循环15次,于72℃,7分钟的反应循环1次〕。这样,全长含1358bp的DNA片段得到制备,该DNA片段从bioB基因编码区上游151bp到编码区下游151bp,并在1358bp序列的每一端各引入一个Xba I位点。用限制性酶Xba I酶切该DNA片段所得的DNA片段与通过使用Xba I酶切质粒载体pJAβ2和脱磷酸处理质粒载体得到的DNA片段混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将它们互相连接,这样获得的质粒命名为pSS301(图6)。
表8
引物 | 碱基序列 |
BF4 | 5′-CGTGATGCTGCGCCTGCTCGGCCACAACAT-3′ |
BR4 | 5′-GCTCTAGACCTCATCGTCCCCCTGAACTTGTT-3′ |
F2 | 5′-GGACTAGTACCGGAATGACAGGCGGACA-3′ |
F3 | 5′-GCCTGCAGCAGAACGTGTGGTCGAAGCC-3′ |
CDA1 | 5′-ATCTGCAGTTGCGCGATGAGGAGGCCACCTTGC-3′ |
CDA6 | 5′-GCAAGCTTATGACGCCGCCTGCGCCTTCGACCA-3′ |
CDA3 | 5′-CTAAGCTTCGAGATCGACGGGGTGGAAATCGAT-3′ |
CDA7 | 5′-CGCTCGAGGGGAGAAGTCCTGGGGGATGATCCC-3′ |
R1 | 5′-CCCTGCCCGTATGGCAAGCG-3′ |
例14制备导入pSS301的转化子
通过按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由bio-Rad实验室公司制造)将实施例13中获得的质粒pSS301导入鞘氨醇单胞菌属种类SC42405,便可获得鞘氨醇单胞菌属种类SC4404/pSS301。
实施例15鞘氨醇单胞菌属种类SC42405/pSS301的生物素产量
接种一环鞘氨醇单胞菌属种类SC42405/pSS301到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,0.005%四环素(pH 7.2)〕的小试管中(18×150mm)中。作为对照,接种一环没有导入基因的鞘氨醇单胞菌属种类SC42405到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,(pH 7.2)〕的小试管(18×150mm)中。上述二种细菌在30℃培养2天(250rpm)以获得预培养菌液。然后,接种160微升鞘氨醇单胞菌属种类SC42405/pSS301的预培养菌液到装有8毫升培养液〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O,0.005%四环素(pH7.0)〕的大试管(22×220mm)中。作为对照,接种160微升鞘氨醇单胞菌属种类SC42405的预培养菌液到装有8毫升培养液〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O(pH 7.0)〕的大试管(22×220mm)中。上述三种细菌在30℃培养4天(250rpm)。
通过使用植物乳杆菌IFO 3070株的微生物学定量方法(Izumi and
Yamada“维生素学实验方法II。水溶维生素”,P481-499,日本维生素学学会,Tokyo Kayaku Dojin,1985)来确定每种培养菌液中生物素合成和积累的浓度,以得到如表9所示的生物合成的浓度。
表9
菌株 | 生物素产量<sup>*</sup> | 生物素浓度(毫克/升) |
鞘氨醇单胞菌属种类SC42405 | 1 | 4.1 |
鞘氨醇单胞菌属种类SC42405/pSS301 | 4.0 | 17 |
*相对于没有导入基因的菌株的生物素产量的数值
实施例16制备重组质粒pSS305
用实施例2,(2-A),中获得的基因组DNA作模板和表8所示的引物F2和F3进行PCR反应〔反应组合物:1×Expand HF缓冲液,1.5mM MgCl2,200μM每种dNTP,300nm每种引物,0.5到100ng基因组DNA,2.6单位ExpandTM高可信度PCR系统酶混合物(可从Boehringer Mancheim公司得到)/50μl;反应循环:于97℃2分钟的反应循环1次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环10次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环15次(72℃1.5分钟的反应时间每循环1次时间延长20秒),于72℃7分钟的反应循环1次〕。这样,全长含1408bp的DNA片段得到制备,该DNA片段从bioF基因编码区上游201bp到编码区下游46bp,并在此1408序列的上游末端引入一个Spe I位点,下游末端引入一个Pst I位点。用限制性酶Spe I和Pst I切割该DNA片段所得的DNA片段与通过用限制性酶Spe I和PstI切割质粒载体pBluscript SK(+)(可从Stratagene Cloning Systems得到)得到的DNA片段混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将它们互相连接。这样获得的质粒命名为pSS302。
另外,用实施例2,(2-A),中获得的基因组DNA作模板和表8所示的引物CDA1和CDA6进行PCR反应〔反应组合物:1×Expand HF缓冲液,1.5mM MgCl2,200μM每种dNTP,300nm每种引物,0.5到100ng基因组DNA,2.6单位ExpandTM高可信度PCR系统酶混合物(可从Boehringer Mannheim公司得到)/50微升;反应循环:于97℃2分钟的反应循环1次,分别于97℃15秒,60℃30秒,然后72℃1.5分钟的反应循环10次,分别于97℃15秒、60℃30秒、72℃1.5分钟的反应循环15次(72℃1.5分钟的反应时间每循环一次延长20秒),于72℃7分钟的反应循环1次〕。这样,含bioC编码区上游209bp,bioC编码区726bp,和bioD基因编码区前半部分大约300bp的全长1287bp的DNA片段得到制备,并且在1287bp序列的上游末端引入Pst I位点,下游末端引入了Hind III位点。同限制性酶Pst I和HindIII切割该DNA片段,所得的DNA片段与通过用限制性酶Pst I和HindIII切割质粒载体pBluescript SK(+)(可从TAKARA SHUZO公司得到)得到的DNA片段混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将它们互相连接。这样获得的质粒命名为pSS3205。
进一步用实施例2,(2-A),中获得的基因组DNA作模板和表8所示引物CNAS和CDA7进行PCR反应〔反应组合物:1×ExpandHF缓冲液,1.5mM MgCl2,200μM每种dNTP,300nM每种引物,0.5到100ng基因组DNA,2.6单位ExpandTM高可信度PCR系统酶混合物(可从Boehringer Mannheim公司得到)/50微升;反应循环:于97℃2分钟的反应循环1次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环10次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环10次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环15次(于72℃1.5分钟的反应时间每循环一次延长20秒),于72℃7分钟的反应循环1次〕。这样,含bioD基因后半部分编码区约400bp,bioA基因编码区1251bp,以及bioA基因编码区下游208bp序列的1653bp(全长)的DNA片段得到制备,并且在1653bp序列上游末端引入Hind III位点,下游末端引入Xbo I位点。用限制酶Hind III和Xho I切割该DNA片段,所得的DNA片段与通过使用限制性酶Hind III和XhoI切割质粒载体pBluscript SK(+)(可从Stratagene Cloning Systems公司得到)得到的DNA片段混合,通过使用连接试剂盒(可从TAKARASHUZO公司得到)按所附操作手册将它们相互连接。这样所得到的质粒命名为pSS206。
用限制性酶和Pst I和Hind III切割按上述方法获得的质粒pSS205,所得的DNA片段通过琼脂糖凝胶电泳分离以制备含1287bp(全长)的DNA片段,该DNA片段包含bioC基因编码区上游209bp序列,bioC基因编码区762bp以及bioD基因编码区前半部分的300bp序列,并且在1287序列上游末端引入了Pst I位点,下游末端引入了Hind III位点。这样获得的DNA片段与用限制性酶Pst I和Hind III切割的质粒pSS206混合,通过使用连接试剂盒(可从TAKARA SHOZO公司得到)按所附操作手册将它们连接。这样获得的质粒命名为pSS2071。
下一步,用限制性酶Cla I切割pSS2071,所得的DNA片段用来通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册自连。这样获得的质粒命名为pSS 207。
另外,用限制性酶Spe I和Pst I切割质粒pSS 202,通过琼脂糖凝胶电泳分离所得的片段以制备含从bioF编码区上游201bp到编码区下游46bp全长1408bp的DNA片段,并且该1406bp序列的上游末端导入了Spe I位点。下游末端导入了Pst I位点。这样获得的DNA片段与经过限制性酶Spe I和Pst I切割的pSS207混合,通过使用连接反应试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将它们相互连接,这样获得的质粒命名为pSS209。
用限制性酶Spe I和Xho I切割质粒pSS204,所得的DNA片段通过琼脂糖凝胶电泳分离以制备含bioF,bioC,bioD和bioA基因的DNA片段,并且该片段的上游末端含引入的Spe I位点,下游末端含引入Xho I位点。这样所获得的DNA片段与经过限制性酶Spe I和Xho I切割的pJAβ2混合,通过使用连接反应试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将它们相互连接。这样所获得的质粒命名为pSS305(图7)。
实施例17制备重组质粒pSS304
用实施例16中获得的质粒pSS305作为模板,M13引物RV(可从TAKARA SHUZO公司得到)和表8所示的引物R1的组合及M13引物M4(可从TAKARA SHUZO公司得到)和MUTB1引物(可从TAKARA SHUZO公司得到)的组合之一作为引物进行PCR反应〔反应组合物:1×Expand HF缓冲液,1.5mM MgCl2,200μM每种dNTP,300nM每种引物,0.5到100纳克模板DNA,2.6单位ExpandTM高可信度PCR系统酶混合物(可从Boehringer Mannheim公司得到)/50微升;反应循环:于97℃2分钟的反应循环1次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环10次,分别于97℃15秒,60℃30秒,72℃1.5分钟的反应循环15次(于72℃7分钟的反应时间每一循环延长20秒),于72℃7分钟的反应循环1次〕。PCR后,用Centricon-100(由Amico公司制造)的方法从反应溶液中去除过剩的引物和过剩dNTP,然后向残留液中加TF缓冲液使总体积为50微升。然后,这样得到溶液各0.5μl混合。向所得的混合液中加入50微升10×Expand HF缓冲液,4微升2.5mM每种dNTP,38.62微升灭菌双蒸水和0.38微升ExpandTM高可信度PCR系统酶混合物(3.5单位/微升)(可从Boehringer Mannheim公司得到)。这样得到的混合液于94℃加热10分钟,在超过60分钟的时间范围冷至37℃,然后于37℃温育15分钟。向此反应溶液中加入0.5微升20pmol M13引物RV(可从TAKARASHUZO公司得到)和0.5微升20pmol M13引物M4(可从TAKARASHUZO公司得到),然后进行PCR〔反应循环:于97℃2分钟的反应循环1次,分别于97℃15秒,60℃30秒,然后72℃1.5分钟的反应循环10次,分别于97℃15秒,60℃30秒、然后72℃1.5分钟的反应循环15次,(72℃1.5分钟的反应时间每循环1次延长20秒),于72℃7分钟的反应循环1次〕。用限制性酶NotI和Hind III切割所得的DNA片段,与经过限制性酶Not I和Hind III切割的质粒载体pBluescriptSK(+)(可从Stratagene Cloning Systems得到)混合,然后通过使用连接反应试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册连接,从获得的克隆中,通过碱基序列分析筛选出含bioF突变体的碱基序列(SEQ ID NO:5)的克隆,在以bioF基因的起始密码子(ATG)中的A为第1位碱基的情况下,该突变体第-11位的碱基C被G取代。所得的质粒命名为pSS201。然后,用限制性酶Spe I和Pst I切割质粒pSS201,所得的DNA片段通过琼脂糖凝胶电泳分离以制备含从bioF基因编码区上游201到编码区下游46bp的全长408bp的DNA片段,并且在该片段上游末端引入一个Spe I位点,下游末端引入一个Pst I位点。所得的DNA片段与经过限制性酶Spe I和Pst I切割的pSS207混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册使它们相互连接。这样所得的质粒命名为pSS208。
另外,用限制性酶Spe I和Xho I切割质粒pSS208,所得的DNA片段通过琼脂糖凝胶电泳分离以制备含BioF突变体,bioC,bioD和bioA的DNA片段,并且该片段的上游末端有引入的Spe I位点,下游末端有引入的Xho I位点。所得的DNA片段与经过限制性酶Spe I和Xho I切割的质粒pJAβ2混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册将这些DNA片段连接。这样所获得的质粒命名为pSS304(图8)。
实施例18制备导入pSS304的转化子和导入pSS305的转化子
按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器(由Bio-Rad实验制造)将实施例17中所得的质粒pSS304和质粒pSS305分别导入鞘氨醇单胞菌属种类SC42405,便可获得鞘氨醇单胞菌属种类SC42405/pSS304和鞘氨醇单胞菌属种类SC42405/pSS305转化子。
实施例19鞘氨醇单胞菌属种类SC42405/pSS304和鞘氨醇单胞菌属种类SC420405/pSS305的生物素产量和生物素相关产物的产量
分别接种一环鞘氨醇单胞菌属种类SC42405/pSS304和鞘氨醇单胞菌属种类SC42405/pSS305到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,0.005%四环素(pH7.2)〕的小试管(18×150mm)中。作为对照,接种一环没有基因导入的鞘氨醇单胞菌属种类SC42405到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,(pH 7.2)〕的小试管(18×150mm)中。上述三种菌在30℃下培养2天(250rpm)以获得预培养菌液。然后,接种160微升鞘氨醇单胞菌属种类SC42405/pSS304和鞘氨醇单胞菌属种类SC42405/pSS305的预培养菌液到装有8毫升培养液〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O,0.005%四环素(pH 7.0)〕的大试管(22×220mm)中。作为对照,接种160微升鞘氨醇单胞菌属种类SC42405到装有8毫升培养液〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O,(pH 7.0)〕的大试管(22×220mm)中。上述三种细菌在30℃下培养4天(250rpm)。通过分别使用植物乳杆菌IFO 3070和酿酒酵母的微生物学定量方法(Izumi and Yamada “维生素学实验方法II。水溶维生素”,p.481-499,日本维生素学学会,TokyoKayaku Dojin,1985)来确定每种培养菌液中合成和积累的生物素及生物素相关产物的浓度。结果,生物素和生物素生物合成中前体,例如,7-酮-8-氨基壬酸,7,8-二氨基壬酸和脱硫生物素(下文称为“生物素-同效维生素”)的浓度如表10所示。
表10鞘氨醇单胞菌属种类SC42405/pSS304和鞘氨醇单胞菌属种类SC42405/pSS305的生物素产量和与生物素一同效维生素产量
菌株 | 生物素产量<sup>*</sup> | 生物素浓度(毫克/升) | 生物素-同效维生素产量<sup>*</sup> | 生物素-同效维生素浓度 |
鞘氨醇单胞菌属种类 | 1 | 4.1 | 1 | 25.2 |
鞘氨醇单胞菌属种类SC42405/pSS304 | 0.9 | 3.5 | 11 | 271 |
鞘氨醇单胞菌属种类SC42405/pSS305 | 0.8 | 3.5 | 5.9 | 149 |
*相对于没有导入基因的菌株的生物素产量的数值
实施例20制备重组质粒pSS306
用限制性酶Spe I和Xho I切割pSS209,所得的DNA片段通过琼脂糖凝胶电泳分离以制备含bioF,bioC,bioD和bioA的DNA片段,并且该片段的上游末端有导入的Spe I位点,下游末端有导入的Xho I位点。所得的DNA片段与经过限制性酶Spe I和Xho I酶切的pSS301混合,通过使用连接试剂盒(可从TAKARA SHUZO公司得到)按所附操作手册使它们相互连接。这样所获得的质粒命名为pSS306(图9)。
实施例21制备导入了pSS306的转化子
按电穿孔法(工作电压18kv/cm,电容25μF,电阻400Ω)用基因脉冲发生器将实施例20中得到的质粒pSS306导入鞘氨醇单胞菌属种类SC42405中,便可获得鞘氨醇单胞菌属种类SC42405/pSS转化子。
实施例22鞘氨醇单胞菌属种类SC42405/pSS306的生物素产量
接种一环鞘氨醇单胞菌属种类SC42405/pSS306到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,0.005%四环素(pH 7.2)〕的小试管(18×150mm)中。作为对照,接种一环没有导入基因的鞘氨醇单胞菌属种类SC42405到装有3毫升培养液〔1%甘油,2%蛋白胨,0.15%K2HPO4,0.15%MgSO4·7H2O,(pH 7.2)〕的小试管(18×150mm)中。上述两种细菌在30℃下培养2天(250rpm)以获得预培养菌液。然后,接种160微升鞘氨醇单胞菌属种类SC42405/pSS306的预培养菌液到装有8毫升培养液〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05%MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O,0.005%四环素(pH 7.0)〕的大试管(22×220mm)中。作为对照,接种160微升鞘氨醇单胞菌属种类SC42405的预培养菌液到装有8毫升培养基〔6%甘油,2%酵母提取物,0.5%酪蛋白氨基酸,0.1%K2HPO4,0.05%KCl,0.05MgSO4·7H2O,0.01%FeSO4·7H2O,0.1%MnSO4·4~6H2O(pH 7.0)〕的大试管(22×220mm)中。上述两种细菌在30℃下培养2天(250rpm)。通过分别使用植物乳杆菌IFO 3070菌株和酿酒酵母的微生物学定量方法(Izumi and Yamada“维生素学实验方法II。水溶维生素”,p.481-499,日本维生素学学会,Tokyo Kayaka Dojin,1985)来确定每种培养菌液中合成和积累的生物及生物素相关产物的浓度。结果,得到合成的生物素的浓度如表11所示。
表11鞘氨醇单胞菌属种类SC42405/pSS306的生物素产量
菌株 | 生物素产量<sup>*</sup> | 生物素浓度 |
鞘氨醇单胞菌属种类SC42405 | 1 | 4.1 |
鞘氨醇单胞菌属种类SC42405/pSS306 | 3.9 | 16 |
*相对于没有导入基因的菌株的生物素产量的数值
工业应用
本发明提供含至少一个来自鞘氨醇单胞菌属微生物的生物素生物合成中涉及的基因的DNA片段,和利用该DNA片段得到的生物素合成转化子,并且本发明在提高生物素(动物,植物及一些微生物的必需维生素)的产量中有用。
序列表
SEQ ID NO:1
序列长度:369
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Leu Asp Phe His Arg Ala Asp Leu Ala Arg Leu Ala Ala Arg Asp
5 10 15
Arg Leu Arg Val Leu Ala Pro Gln Arg Gly Lys Asp Phe Ala Ser Asn
20 25 30
Asp Tyr Leu Gly Leu Ala Asn Ser Pro Arg Leu Ala Ala Ala Ile Ala
35 40 45
Ala Ala Val Glu Glu Gly Val Pro Val Gly Ser Gly Gly Ser Arg Leu
50 55 60
Leu Arg Gly Asn His Pro Glu His Glu Ala Leu Glu Ala Asp Ala Ala
65 70 75 80
Ala Phe Phe Gly Ala Glu Ala Ser Leu Tyr Phe Ser Ser Gly Tyr Gly
85 90 95
Ala Asn Val Ala Ile Leu Ala Thr Leu Pro Gln Arg Gly Asp Leu Ile
100 105 110
al His Asp Ser Leu Val His Ala Ser Met Arg Leu Val His His Gln
115 120 125
His Arg Ile Val Pro Ile Gly Gly Arg Leu Glu Ile Gly Glu Arg Arg
130 135 140
Gly Val Ala Val His Ala Val Lys Ala Phe Asp Arg Asp Pro His Gly
145 150 155 160
Ala Leu Ala Ala Leu Val Ala Pro Cys Pro Asp Arg Ile Leu Glu Gly
165 170 175
Arg Cys Ile Val Met Arg Arg Arg His Gly Leu Gly Thr Arg Gln Ala
180 185 190
His Pro Leu Met His Ala Thr Gly Val Phe Gly Glu Arg Gly Gln Gly
195 200 205
Leu Ser Ile Ala Gly Glu Arg Val Val Thr Leu His Thr Cys Gly Lys
210 215 220
Ala Met Gly Cys Glu Gly Ala Leu Val Ala Gly Pro Thr Ile Val Arg
225 230 235 240
Asp Tyr Leu Val Asn Arg Gly Arg Gly Phe Ile Phe Ser Thr Ala Pro
245 250 255
Ser Pro Leu Met Ala Arg Gly Val Arg Glu Ala Leu Arg Ile Leu Ala
260 265 270
Asp Glu Pro Glu Arg Arg Thr Ala Leu His Asp Arg Ile Ala Leu Ala
275 280 285
Gly Ala Arg Leu Gly Arg Arg Gly Ala Leu Ala Gln Gly Thr Pro Ile
290 295 300
Leu Pro Leu Ile Leu His Asp Asn Gly Arg Thr Met Arg Ala Ala Glu
305 310 315 320
Ala Leu Gln Ala Leu Gly Tyr Asp Ile Arg Gly Ile Arg Pro Pro Thr
325 330 335
Val Pro Val Gly Ser Ala Arg Leu Arg Leu Ser Ile Thr Leu Asn Val
340 345 350
Glu Ala Ala Asp Ile Leu Ala Leu Asp Gln Ala Leu Gln Glu Val Leu
355 360 365
Ala
369
SEQ ID NO:2
序列长度:387
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
序列描述
Met Ser Arg Leu Asp Ser Phe Phe Ala Ala Ala Leu Asp Arg Ile Asp
5 10 15
Arg Ala Gly Gln Arg Arg Thr Leu Arg Pro Ala Ala Leu Glu Lys Gly
20 25 30
Gly Arg Val His Arg Asp Gly His Glu Leu Ile Asp Phe Ser Ser Asn
35 40 45
Asp Tyr Leu Gly Leu Ala Arg His Pro Leu Leu Ile Glu Arg Ala Arg
50 55 60
Ala Trp Thr Glu Ala His Gly Thr Gly Ser Gly Ala Ser Arg Leu Val
65 70 75 80
Thr Gly Thr Ser Ala Thr His Leu Ala Ile Glu Ala Arg Ile Ala Arg
85 90 95
Phe Lys His Ala Glu Ala Ala Leu Val Phe Ala Ser Gly Trp Gln Ala
100 105 110
Asn Ala Ala Val Ile Pro Ala Leu Leu Ala Ala Val Pro Gly Ser Ala
115 120 125
Val Phe Thr Asp Arg Leu Ile His Ala Ser Met His Ala Gly Leu Ala
130 135 140
Ile Ser Gly Thr Arg Gln His Arg Phe Arg His Asn Asp Leu Asp His
145 150 155 160
Leu Glu Glu Leu Leu Ala Ser Lys Gly Ala Glu Ala Ser Ala Arg Leu
165 170 175
Ile Leu Thr Glu Ser Val Phe Ser Met Asp Gly Asp Arg Ala Asp Ile
180 185 190
Ala Arg Leu Ala Glu Ile Ala Ala Arg His Asp Ala Phe Leu Phe Val
195 200 205
Asp Glu Ala His Ala Thr Gly Val Leu Gly Pro Gly Gly Ala Gly Leu
210 215 220
Ser Ala Glu Val Pro Gly Gly Ile Asp Leu Val Met Gly Thr Phe Ser
225 230 235 240
Lys Ala Leu Gly Gly Phe Gly Ala Tyr Val Ala Gly Ser Gln Val Met
245 250 255
Ile Asp Tyr Leu Val Asn Ala Ala Ser Gly Phe Ile Phe Thr Thr Ala
260 265 270
Pro Pro Pro Ala Val Leu Gly Ala Ile Asp Ala Ala Leu Asp Leu Val
275 280 285
Pro Gly Met Asp Ala Glu Arg Ala His Leu Ala Ala Leu Gly Gln Gln
290 295 300
Leu Arg Ser Gly Leu Ala Ala Leu Gly Ile Asp His Gly Ala Ser Ser
305 310 315 320
Thr Gln Ile Val Pro Ala Val Ile Gly Ala Glu Val Ala Ala Leu Asp
325 330 335
Leu Ser Arg Lys Leu Glu Glu Arg Gly Leu Leu Ala Ser Ala Ile Arg
340 345 350
Pro Pro Thr Val Pro Pro Gly Thr Ser Arg Leu Arg Leu Ala Leu Arg
355 360 365
Ala Thr His Ala Pro Ser Asp Ile Asp Ala Leu Leu Asn Ala Ile Glu
370 375 380
Ala Cys Arg
385 387
SEQ ID NO:3
序列长度:1536
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:427..1536
决定特征的方法:S
序列描述
GATCCTGATC GCGGTCCCGG CGCATCAATG GATGTGGTCG GCGCATGACG TGGTGAACCA 60
TCACCATCGT CGGTATTCGA AGACGACCTT GGGGTCCGCG ATCGAGAAGG CGGGCCTGAA 120
ACCCCGCAAG CTCGGCTATT TCAACTCGCT GCTCTTCCCG CTCGCCGCGG CCGCGCGGAT 180
CGCCGGACGG ATCACGGGGC GCGACGACAG CGACGACTCG CCACCGCCCG CGCCGCTCAA 240
CAAAACGTTC GAGGCGATCT TCCGGTTGGA GCGGCATCTG GTCGGCCGTG TGCCGATGAC 300
CCCGGGGGTT TCGATCGTGA CCTTGGCGGA GCCTGCCTGA CGGCGGGTGG AGCGAAGTCG 360
AAGGCCACGG GAATCCCTAA CCTTTCCGGG TTCCGCTCTC CTGCCTAGCT GGGTAGGGAA 420
GCCCCC ATG CTG GAC TTT CAT CGC GCC GAT CTG GCC CGA CTG GCC GCG 468
CGG GAC CGA TTG CGG GTG CTG GCC CCG CAG CGT GGC AAG GAT TTC GCG 516
TCC AAC GAT TAT CTG GGC TTG GCG AAC AGC CCC CGC CTC GCC GCC GCC 564
ATC GCC GCC GCG GTC GAG GAG GGC GTC CCC GTT GGG TCG GGC GGA TCG 612
CGA TTG CTG CGC GGC AAT CAC CCC GAA CAT GAG GCG CTG GAG GCG GAC 660
GCC GCC GCG TTC TTC GGG GCG GAG GCG AGC CTG TAT TTC TCC TCG GGC 708
TAC GGT GCC AAT GTC GCG ATC CTG GCG ACG CTG CCA CAG CGC GGC GAC 756
CTG ATC GTC CAC GAC TCG CTC GTC CAT GCC AGC ATG CGC CTC GTC CAC 804
CAC CAG CAT CGC ATC GTG CCG ATC GGC GGC CGC CTG GAG ATC GGC GAG 852
CGG CGC GGT GTC GCC GTC CAT GCT GTA AAG GCT TTC GAC CGC GAT CCA 900
CAC GGT GCC CTT GCC GCC CTG GTC GCG CCA TGC CCG GAT CGC ATC CTC 948
GAA GGC CGA TGC ATC GTT ATG CGC CGC CGC CAC GGC CTC GGC ACG CGA 996
CAG GCG CAT CCC CTC ATG CAT GCC ACC GGC GTC TTC GGC GAG CGG GGA 1044
CAG GGG CTG AGC ATC GCA GGC GAG CGG GTG GTG ACG CTC CAC ACC TGT 1092
GGC AAG GCG ATG GGC TGC GAG GGT GCG CTG GTC GCC GGG CCG ACG ATC 1140
GTG CGC GAC TAT CTG GTC AAT CGC GGG AGG GGC TTC ATC TTC TCG ACC 1188
GCG CCC TCG CCG CTG ATG GCA CGC GGG GTG CGC GAG GCG CTT CGC ATC 1236
CTG GCC GAC GAG CCC GAG CGG CGC ACC GCG CTG CAC GAC CGG ATC GCG 1284
CTG GCG GGC GCG CGG CTG GGC CGC CGC GGT GCG CTG GCG CAG GGC ACG 1332
CCG ATC CTG CCG CTG ATC CTG CAC GAC AAT GGC CGC ACC ATG CGC GCC 1380
GCT GAG GCG CTG CAG GCG CTT GGC TAT GAC ATA CGC GGC ATC CGC CCG 1428
CCG ACC GTG CCC GTG GGC TCG GCG CGG CTG CGG CTG TCG ATC ACT TTG 1476
AAT GTC GAG GCG GCG GAC ATC CTC GCC CTC GAC CAA GCA TTG CAA GAG 1524
GTT CTG GCA TGA 1536
SEQ ID NO:4
序列长度:1408
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:202..1365
决定特征的方法:S
序列描述
ACCGGAATGA CAGGCGGACA GCAGCAATAG GGCGGCAAGA GAGAGCGGCA GGGATCGCAT 60
CAGACGGGCA TCCTTCGGTT TTTCCTTTGC CGTTCCAACG CGCGAGGAAG GCGGCGGCTT 120
CACGTCCCGC CGCGAAATCG ATGCCCCTCC CGGCCAGCCA AGCATTGTGC CGGACGCCCG 180
CTTGCCATAC CGGCAGGGGC G ATG AGC AGG CTC GAT TCC TTC TTC GCA GCG 231
GCG CTC GAC CGG ATC GAC CGC GCC GGA CAA CGC CGC ACC TTG CGC CCC 279
GCC GCA CTC GAA AAG GGT GGC CGC GTC CAC CGC GAC GGG CAC GAA CTG 327
ATA GAT TTC TCC AGC AAC GAC TAT CTC GGC CTC GCC CGC CAC CCG CTG 375
CTG ATC GAG CGC GCC CGC GCC TGG ACG GAA GCC CAC GGC ACC GGC TCC 423
GGC GCC TCG CGA CTG GTG ACG GGA ACC AGC GCC ACC CAT CTC GCG ATC 471
GAG GCC CGC ATC GCC CGG TTC AAG CAT GCC GAA GCC GCG CTG GTC TTC 519
GCC AGC GGC TGG CAG GCC AAT GCC GCG GTG ATC CCC GCC CTG CTC GCC 567
GCC GTA CCC GGT TCA GCA GTC TTC ACC GAC CGG CTG ATC CAT GCC TCG 615
ATG CAC GCG GGC CTC GCG ATC TCG GGC ACC CGC CAG CAC CGC TTC CGC 663
CAT AAC GAC CTC GAT CAT CTG GAG GAA CTG CTG GCG AGC AAG GGC GCC 711
GAA GCC TCC GCC CGC CTG ATC CTC ACC GAG AGC GTG TTC TCG ATG GAC 759
GGC GAC CGC GCC GAC ATT GCC CGC CTG GCC GAG ATC GCC GCC CGC CAC 807
GAC GCA TTC CTG TTC GTG GAC GAA GCC CAT GCC ACC GGC GTG CTC GGC 855
CCC GGC GGC GCG GGC CTC TCG GCG GAA GTG CCC GGC GGG ATC GAC CTC 903
GTC ATG GGC ACC TTC AGC AAG GCG CTC GGC GGT TTC GGC GCC TAT GTC 951
GCC GGG TCA CAA GTG ATG ATC GAC TAC CTC GTC AAC GCG GCG AGC GGC 999
TTC ATC TTC ACC ACC GCC CCG CCG CCT GCC GTG CTG GGC GCC ATC GAC 1047
GCC GCG CTC GAC CTC GTG CCG GGC ATG GAT GCC GAG CGC GCC CAT CTT 1095
GCC GCG CTG GGT CAG CAG CTG CGC TCC GGC CTC GCC GCG CTC GGC ATC 1143
GAT CAC GGC GCA TCG AGC ACG CAG ATC GTC CCC GCC GTG ATC GGC GCG 1191
GAG GTC GCC GCG CTC GAC CTC TCC CGC AAG CTG GAA GAG CGC GGA CTG 1239
CTC GCT TCC GCG ATC CGC CCG CCC ACG GTG CCG CCC GGC ACC AGC CGC 1287
CTG CGC CTG GCG CTG CGC GCG ACC CAT GCG CCA AGC GAT ATC GAT GCC 1335
CTG CTG AAC GCG ATC GAG GCC TGC CGG TGA AGCTGCTTTT CGCCCATGGC 1385
TGGGGCTTCG ACCACACGTT CTG 1408
SEQ ID NO:5
序列长度:1408
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:202..1365
决定特征的方法:S
序列描述
ACCGGAATGA CAGGCGGACA GCAGCAATAG GGCGGCAAGA GAGAGCGGCA GGGATCGCAT 60
CAGACGGGCA TCCTTCGGTT TTTCCTTTGC CGTTCCAACG CGCGAGGAAG GCGGCGGCTT 120
CACGTCCCGC CGCGAAATCG ATGCCCCTCC CGGCCAGCCA AGCATTGTGC CGGACGCCCG 180
CTTGCCATAC GGGCAGGGGC G ATG AGC AGG CTC GAT TCC TTC TTC GCA GCG 231
GCG CTC GAC CGG ATC GAC CGC GCC GGA CAA CGC CGC ACC TTG CGC CCC 279
GCC GCA CTC GAA AAG GGT GGC CGC GTC CAC CGC GAC GGG CAC GAA CTG 327
ATA GAT TTC TCC AGC AAC GAC TAT CTC GGC CTC GCC CGC CAC CCG CTG 375
CTG ATC GAG CGC GCC CGC GCC TGG ACG GAA GCC CAC GGC ACC GGC TCC 423
GGC GCC TCG CGA CTG GTG ACG GGA ACC AGC GCC ACC CAT CTC GCG ATC 471
GAG GCC CGC ATC GCC CGG TTC AAG CAT GCC GAA GCC GCG CTG GTC TTC 519
GCC AGC GGC TGG CAG GCC AAT GCC GCG GTG ATC CCC GCC CTG CTC GCC 567
GCC GTA CCC GGT TCA GCA GTC TTC ACC GAC CGG CTG ATC CAT GCC TCG 615
ATG CAC GCG GGC CTC GCG ATC TCG GGC ACC CGC CAG CAC CGC TTC CGC 663
CAT AAC GAC CTC GAT CAT CTG GAG GAA CTG CTG GCG AGC AAG GGC GCC 711
GAA GCC TCC GCC CGC CTG ATC CTC ACC GAG AGC GTG TTC TCG ATG GAC 759
GGC GAC CGC GCC GAC ATT GCC CGC CTG GCC GAG ATC GCC GCC CGC CAC 807
GAC GCA TTC CTG TTC GTG GAC GAA GCC CAT GCC ACC GGC GTG CTC GGC 855
CCC GGC GGC GCG GGC CTC TCG GCG GAA GTG CCC GGC GGG ATC GAC CTC 903
GTC ATG GGC ACC TTC AGC AAG GCG CTC GGC GGT TTC GGC GCC TAT GTC 951
GCC GGG TCA CAA GTG ATG ATC GAC TAC CTC GTC AAC GCG GCG AGC GGC 999
TTC ATC TTC ACC ACC GCC CCG CCG CCT GCC GTG CTG GGC GCC ATC GAC 1047
GCC GCG CTC GAC CTC GTG CCG GGC ATG GAT GCC GAG CGC GCC CAT CTT 1095
GCC GCG CTG GGT CAG CAG CTG CGC TCC GGC CTC GCC GCG CTC GGC ATC 1143
GAT CAC GGC GCA TCG AGC ACG CAG ATC GTC CCC GCC GTG ATC GGC GCG 1191
GAG GTC GCC GCG CTC GAC CTC TCC CGC AAG CTG GAA GAG CGC GGA CTG 1239
CTC GCT TCC GCG ATC CGC CCG CCC ACG GTG CCG CCC GGC ACC AGC CGC 1287
CTG CGC CTG GCG CTG CGC GCG ACC CAT GCG CCA AGC GAT ATC GAT GCC 1335
CTG CTG AAC GCG ATC GAG GCC TGC CGG TGA AGCTGCTTTT CGCCCATGGC 1385
TGGGGCTTCG ACCACACGTT CTG 1408
SEQ ID NO:6
序列长度:415
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Thr Ser Pro Val Trp His Pro Phe Thr Gln His Gly Leu Gly Glu
5 10 15
Pro Ile Pro Lys Val Ala Ser Ala Ser Gly Ala Val Leu Thr Thr Val
20 25 30
Asp Gly Arg Glu Val Ile Asp Ala Ile Ser Ser Trp Trp Val Thr Thr
35 40 45
His Gly His Asn His Pro Arg Ile Ser Ala Ala Ile Ala Glu Gln Ala
50 55 60
Gly Lys Leu Asp Gln Ile Ile Phe Ala Gly Trp Thr His Glu Pro Ala
65 70 75 80
Glu Glu Val Ala Ala Glu Leu Val Arg Ile Thr Pro Pro Lys Leu Thr
85 90 95
Arg Val Phe Phe Ser Asp Ser Gly Ser Thr Ala Val Glu Val Ala Leu
100 105 110
Lys Met Ala Leu Gly Tyr Trp Leu His Arg Gly Glu Pro Arg His Arg
115 120 125
Ile Leu Val Leu Glu His Ser Tyr His Gly Asp Thr Ile Gly Ala Met
130 135 140
Ser Val Gly Ala Arg Gly Val Tyr Asn Gln Ala Tyr Ala Pro Leu Leu
145 150 155 160
Phe Asp Val Gly Thr Ile Pro Tyr Pro Thr Asp Ile Gln Ala Thr Leu
165 170 175
Asp Thr Leu Glu Ala Glu Cys Arg Ala Gly Ala Ala Ala Phe Ile Val
180 185 190
Glu Pro Leu Val Leu Gly Ala Gly Gly Met Leu Phe Tyr Ala Ala Glu
195 200 205
Thr Leu Ala Ala Met Arg Glu Ile Cys Ala Ala His Gly Val Leu Phe
210 215 220
Ile Ala Asp Glu Val Met Thr Gly Trp Gly Arg Thr Gly Thr Ile Phe
225 230 235 240
Ala Cys Asp Gln Ala Gly Val Val Pro Asp Ile Leu Cys Leu Ser Lys
245 250 255
Gly Leu Thr Gly Gly Ala Val Pro Leu Ala Val Thr Leu Ala Thr Glu
260 265 270
Ala Ile Phe Gln Ala His Trp Ser Glu Thr Asp Arg Ser Lys Gln Phe
275 280 285
Phe His Ser Ser Ser Tyr Thr Ala Asn Pro Ile Ala Cys Ala Ala Ala
290 295 300
Ala Ala Asn Leu Ala Ile Trp Arg Glu Glu Pro Val Gln Ala Arg Ile
305 310 315 320
Asp Ala Leu Ala Glu Arg Gln Arg Ala His Leu Ala Thr Ile Ala Gly
325 330 335
Arg Asp Ala Val Arg Asn Pro Arg Ala Leu Gly Thr Ile Ala Ala Phe
340 345 350
Glu Leu Gly Ala Gly Gln Asp Tyr Leu Ser Asp Leu Gly Pro Arg Leu
355 360 365
Leu Ala His Phe Arg Glu Arg Asp Leu Leu Val Arg Pro Met Gly Asn
370 375 380
Ser Ile Tyr Val Met Pro Pro Tyr Ser Ile Thr Pro Glu Gln Leu Ala
385 390 395 400
Arg Ile Trp Gly Gly Ile Asp Glu Ala Ile Ala Arg Phe Gly Ser
405 410 415
SEQ ID NO:7
序列长度:417
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:鞘氨醇单胞菌属种类
菌株:SC42405
序列描述
Met Thr Ser Ser Val Trp His Pro Phe Thr Gln His Gly Leu Gln Glu
5 10 15
Pro Val Pro Leu Val Thr His Ala Glu Gly Ala Leu Leu His Thr Ala
20 25 30
Asp Gly Lys Ala Val Val Asp Ala Val Ser Ser Trp Trp Val Thr Thr
35 40 45
His Gly His Ser His Pro Arg Ile Lys Ala Ala Ile Ala Glu Gln Ala
50 55 60
Gln Lys Leu Asp Gln Ile Ile Phe Ala Gly Trp Thr His Glu Pro Ala
65 70 75 80
Glu Gln Val Ala Ala Gly Leu Arg Ala Ile Met Pro Glu Ser Leu Thr
85 90 95
Arg Val Phe Phe Ser Asp Ser Gly Ser Thr Ser Val Glu Val Ala Leu
100 105 110
Lys Met Ala Leu Gly Tyr Trp His Trp Arg Gly Glu Asn Arg His Arg
115 120 125
Ile Val Val Met Glu Asn Ser Tyr His Gly Asp Thr Ile Gly Ala Met
130 135 140
Ser Val Gly Glu Arg Gly Val Phe Asn Gln Pro Tyr Glu Pro Leu Leu
145 150 155 160
Phe Asp Val Gly Arg Ile Pro Phe Pro Ala Ala Gly Ala Glu Gln Ala
165 170 175
Thr Leu Asp Ala Leu Glu Ala Ile Cys Arg Gln Pro Asp Thr Ala Ala
180 185 190
Leu Ile Val Glu Pro Leu Ile Leu Gly Ala Gly Gly Met Leu Val Tyr
195 200 205
Ser Ser Glu Thr Leu Ala Ala Met Gln Ala Ile Cys Ala Arg His Gly
210 215 220
Val Leu Phe Ile Ala Asp Glu Val Met Thr Ala Trp Gly Arg Thr Gly
225 230 235 240
Thr Leu Leu Ala Cys Glu Gln Ala Ser Val Val Pro Asp Ile Leu Cys
245 250 255
Leu Ser Lys Gly Leu Thr Gly Gly Ala Val Pro Leu Ala Val Thr Met
260 265 270
Ala Ser Glu Ala Ile Phe Glu Ala His Tyr Ser Thr Asp Arg Ala Arg
275 280 285
Met Phe Phe His Ser Ser Ser Tyr Thr Ala Asn Pro Ile Ala Cys Ala
290 295 300
Ala Ala Ala Ala Asn Leu Ala Ile Trp Arg Glu Glu Pro Val Leu Glu
305 310 315 320
Arg Ile Ala Ala Leu Ala Gly Lys Gln Ala Thr Trp Ile Glu Lys Leu
325 330 335
Gly Gln Phe Cys His Phe Asp Asn Pro Arg Thr Ile Gly Thr Ile Ala
340 345 350
Ala Leu Asp Leu Arg Thr Ser Gly Thr Ser Gly Tyr Met Ser Asp Leu
355 360 365
Ala Pro Arg Leu Met Ala Phe Phe Arg Glu Arg Asp Val Leu Leu Arg
370 375 380
Pro Leu Gly Asn Thr Val Tyr Val Met Pro Pro Tyr Cys Ile Ser Asp
385 390 395 400
Asn Gln Leu Gly Gln Val Trp Glu Ala Val Gly Glu Ala Val Ile Ser
405 410 415
Phe
417
SEQ ID NO:8
序列长度:1448
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:1..1248
决定特征的方法:S
序列描述
ATG ACC TCG CCG GTC TGG CAT CCC TTC ACC CAG CAT GGT CTG GGC GAG 48
CCG ATT CCT AAG GTG GCT TCC GCC TCT GGC GCG GTG CTG ACC ACC GTC 96
GAT GGC CGC GAG GTG ATC GAT GCC ATC TCT AGC TGG TGG GTG ACC ACG 144
CAC GGG CAC AAC CAT CCC CGC ATC AGC GCC GCC ATC GCC GAG CAG GCA 192
GGC AAG CTC GAC CAG ATC ATC TTC GCC GGC TGG ACC CAT GAG CCG GCC 240
GAG GAG GTT GCC GCC GAG CTG GTA CGG ATC ACG CCG CCC AAG CTG ACG 288
CGG GTG TTC TTT TCC GAT TCT GGT TCG ACG GCG GTC GAG GTC GCG CTG 336
AAG ATG GCG CTG GGC TAC TGG CTC CAC CGG GGC GAG CCG CGC CAC CGC 384
ATC CTC GTC CTC GAA CAC AGC TAT CAT GGC GAC ACG ATC GGC GCG ATG 432
TCG GTC GGC GCG CGG GGG GTA TAC AAC CAG GCT TAT GCG CCG TTG CTG 480
TTC GAT GTC GGC ACC ATC CCC TAT CCG ACC GAC ATA CAG GCG ACG CTC 528
GAC ACG CTG GAG GCG GAG TGC CGG GCG GGC GCG GCG GCG TTC ATC GTC 576
GAG CCG CTG GTG CTG GGG GCG GGG GGC ATG CTC TTC TAC GCC GCC GAA 624
ACG CTG GCC GCG ATG CGT GAG ATA TGC GCG GCG CAT GGC GTG CTG TTC 672
ATC GCT GAT GAG GTG ATG ACC GGA TGG GGG CGC ACC GGC ACG ATC TTC 720
GCC TGT GAC CAG GCG GGC GTG GTC CCC GAT ATC CTC TGC CTG TCC AAG 768
GGG CTG ACC GGC GGT GCG GTA CCG CTG GCG GTG ACA CTG GCG ACC GAG 816
GCG ATC TTC CAG GCG CAC TGG TCG GAA ACC GAT CGG TCG AAG CAG TTC 864
TTC CAC TCG TCC AGC TAC ACC GCC AAC CCG ATC GCC TGC GCG GCG GCG 912
GCC GCC AAT CTG GCG ATC TGG CGC GAG GAG CCG GTG CAG GCG CGG ATC 960
GAC GCG CTC GCC GAG CGG CAG CGG GCG CAT CTG GCG ACG ATC GCG GGG 1008
CGG GAT GCG GTG CGA AAC CCG CGC GCG CTC GGC ACC ATC GCG GCG TTC 1056
GAA CTG GGG GCG GGG CAG GAT TAT CTC TCC GAT CTG GGA CCC CGG TTG 1104
CTG GCC CAT TTC CGG GAG CGC GAT CTG CTC GTC CGG CCG ATG GGC AAT 1152
AGC ATC TAT GTC ATG CCG CCC TAT TCC ATT ACG CCC GAG CAA CTG GCG 1200
CGC ATT TGG GGC GGC ATC GAT GAG GCG ATT GCC CGC TTC GGG AGT 1245
TGA GACGGGCCGG GGCCTTTGAC TTTACGGCAT TTCATTTGCT TTATCCGGCG 1298
ACGATCGAAA AGGGAGCGGG CATGGGCGTG GCGAAGACTG GGGCGATGGG GGCTCTGGCA 1358
TCGGTGACGG CGCTGATGTG GGGCCTGGCC GCCACCGCGC AGACGACCCC GCCCGCCGCC 1418
AACCCGGCCA CCCCGCCGCT GGGCCCGATC 1448
SEQ ID NO:9
序列长度:1459
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:1..1253
决定特征的方法:S
序列描述
ATG ACG TCA TCG GTC TGG CAC CCC TTC ACC CAG CAC GGC CTG CAA GAG 48
CCG GTC CCG CTG GTC ACC CAT GCC GAG GGC GCG CTG CTC CAC ACG GCT 96
GAC GGC AAG GCA GTG GTG GAC GCG GTG TCC TCG TGG TGG GTG ACG ACC 144
CAC GGC CAC TCC CAT CCG CGC ATC AAG GCC GCC ATC GCG GAG CAG GCG 192
CAG AAG CTC GAC CAG ATC ATC TTC GCC GGA TGG ACC CAC GAA CCC GCC 240
GAG CAA GTC GCA GCA GGC CTG CGC GCG ATC ATG CCG GAA AGC CTG ACG 288
CGG GTG TTC TTC TCC GAT TCG GGT TCG ACC AGC GTG GAA GTC GCG CTG 336
AAG ATG GCG CTC GGC TAC TGG CAC TGG CGC GGC GAG AAC CGC CAC CGC 384
ATC GTC GTG ATG GAA AAC TCC TAC CAC GGC GAC ACC ATC GGC GCG ATG 432
TCG GTG GGC GAG CGC GGC GTG TTC AAC CAG CCC TAC GAA CCG CTG CTG 480
TTC GAC GTG GGC CGC ATT CCC TTC CCC GCC GCC GGG GCC GAG CAG GCA 528
ACG CTG GAC GCA CTC GAA GCG ATC TGC CGC CAG CCG GAC ACC GCC GCG 576
CTG ATC GTC GAG CCG CTG ATC CTC GGC GCC GGC GGC ATG CTG GTC TAT 624
TCG TCC GAG ACG CTC GCC GCG ATG CAG GCG ATC TGC GCC CGC CAC GGC 672
GTG CTC TTC ATC GCC GAC GAA GTG ATG ACC GCC TGG GGC CGC ACC GGC 720
ACC CTC CTC GCC TGC GAA CAG GCA AGC GTG GTC CCG GAC ATC CTC TGC 768
CTC TCC AAG GGC CTG ACC GGC GGT GCC GTC CCG CTC GCT GTC ACG ATG 816
GCC AGC GAA GCG ATC TTC GAG GCG CAC TAC TCC ACC GAC CGC GCG CGG 864
ATG TTC TTC CAC TCC TCC AGC TAC ACC GCG AAC CCG ATC GCC TGC GCC 912
GCC GCC GCC GCC AAC CTG GCT ATC TGG CGC GAG GAA CCG GTG CTG GAA 960
CGC ATC GCC GCG CTG GCC GGG AAA CAG GCG ACG TGG ATC GAG AAG CTC 1008
GGC CAG TTC TGC CAC TTC GAC AAT CCC CGC ACG ATC GGC ACC ATC GCC 1056
GCG CTC GAC CTC AGG ACC TCA GGC ACC AGC GGC TAC ATG AGC GAC CTC 1104
GCC CCG CGC CTG ATG GCG TTC TTC CGC GAG CGG GAC GTG CTG TTG CGG 1152
CCG CTG GGG AAC ACC GTC TAC GTC ATG CCG CCT TAC TGC ATT TCC GAT 1200
AAT CAG CTT GGG CAG GTT TGG GAG GCT GTC GGG GAA GCG GTG ATT TCG 1248
TTT TAA GAACGATTTT AAGATGAAGG ATGAAGAGCA GGGGTCAAAA CCCCTGCACC 1304
CCATTACTGT CGAGGTCAGG TACGACCTAT CCGTCTTGCG CCCATGGCAG CGTCGGGAGG 1364
CATATTGGCT GCGCCGCAAG GAGCGCACTA CGCATAGGGC GCGCGCGACG TCGACATTCC 1424
GAGGGTCTGG GGGATCATCC CCCAGGACTT CTCCC 1459
SEQ ID NO:10
序列长度:206
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Ser Ala Ile Ile Val Thr Gly Thr Asp Thr Glu Ile Gly Lys Thr
5 10 15
Val Phe Ser Ala Ala Leu Thr Gly Ala Leu Gly Ala Ser Tyr Trp Lys
20 25 30
Pro Val Gln Ala Gly Thr Asp Glu Glu Gly His Gly Asp Ala Glu Thr
35 40 45
Val Ser Ala Leu Ser Gly Arg Pro Val Leu Pro Ser Ala Tyr Arg Leu
50 55 60
Lys Thr Pro Cys Ser Pro His Leu Ala Ala Glu Ile Asp Gly Val Thr
65 70 75 80
Ile Glu Ile Asp Arg Leu Val Leu Pro Gln Val Asp Gly Pro Leu Val
85 90 95
Ala Glu Gly Ala Gly Gly Val Leu Val Pro Val Thr Arg Gln Leu Leu
100 105 110
Phe Ala Asp Leu Phe Ala Arg Trp Gly Arg Pro Val Val Leu Val Ala
115 120 125
Arg Thr Gly Leu Gly Thr Ile Asn His Ser Leu Leu Ser Ile Glu Ala
130 135 140
Leu Arg Ala Arg Gly Val Asp Val Leu Gly Val Ala Phe Val Gly Asp
145 150 155 160
Ala Val Glu Asp Ser Glu Ala Thr Ile Ala Ala Ile Gly Gly Val Lys
165 170 175
Arg Leu Gly Arg Leu Pro Arg Leu Ala Thr Leu Asn Arg Glu Thr Leu
180 185 190
Thr Glu Ala Phe Ala Ala His Phe Arg Ser Glu Asp Phe Arg
195 200 205 206
SEQ ID NO:11
序列长度:209
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:鞘氨醇单胞菌属种类
菌株:SC42405
序列描述
Met Arg Pro Leu Ile Val Thr Gly Thr Asp Thr Glu Ile Gly Lys Thr
5 10 15
Val Phe Ala Ala Ala Leu Ala Gly Ala Leu Gly Ser His Tyr Trp Lys
20 25 30
Pro Val Gln Ala Gly Leu Glu Glu Asp Gly Gly Asp Gly Asp Arg Val
35 40 45
Ala Arg Leu Ser Gly Leu Pro Ala Ser His Ile Leu Pro Glu Ala Tyr
50 55 60
Arg Leu Ala Thr Pro Cys Ser Pro His Leu Ala Ala Glu Ile Asp Gly
65 70 75 80
Val Glu Ile Asp Pro Glu Arg Leu Ala Leu Pro Gln Val Asp Gly Pro
85 90 95
Leu Val Val Glu Gly Ala Gly Gly Val Met Val Pro Leu Thr Arg Thr
100 105 110
Thr Thr Tyr Ala Asp Gln Phe Ala Arg Trp Asn Ala Pro Val Val Leu
115 120 125
Val Ala Arg Thr Met Leu Gly Thr Ile Asn HisSer Leu Leu Ser Ile
130 135 140
Glu Ala Leu Arg Ala Arg Gly Val Glu Val Leu Gly Val Ala Phe Val
145 150 155 160
Gly Asp Pro Met Glu Asp Ser Glu Ala Thr Ile Cys Ala Met Ala Asn
165 170 175
Val Arg Arg Leu Gly Arg Leu Pro Arg Leu Ala Ser Leu Thr Pro Glu
180 185 190
Asn Leu Ala Lys Ala Phe Ala Glu Asn Phe His Ile Gly Asp Phe Thr
195 200 205
Gln
209
SEQ ID NO:12
序列长度:621
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:1..621
决定特征的方法:S
序列描述
ATG AGC GCC ATC ATC GTC ACC GGC ACT GAT ACC GAG ATC GGC AAG ACC 48
GTC TTC TCC GCC GCG CTG ACC GGC GCG TTG GGG GCG AGC TAT TGG AAG 96
CCG GTC CAG GCG GGA ACC GAC GAG GAA GGG CAT GGC GAT GCC GAG ACG 144
GTG TCG GCC CTG AGC GGA CGT CCG GTC CTG CCC TCC GCC TAT CGG TTG 192
AAG ACG CCC TGC TCG CCG CAT CTG GCC GCC GAG ATC GAC GGG GTG ACG 240
ATC GAG ATC GAT CGG CTG GTG CTG CCG CAG GTG GAC GGG CCG CTG GTC 288
GCC GAG GGG GCG GGC GGC GTG CTG GTG CCG GTG ACG CGG CAG TTG CTG 336
TTC GCC GAT CTC TTC GCC CGC TGG GGC CGG CCG GTG GTG CTG GTC GCG 384
CGG ACC GGG CTG GGG ACG ATC AAC CAC AGC CTG TTG TCG ATC GAG GCG 432
TTG CGC GCG CGC GGC GTG GAC GTG CTG GGG GTC GCG TTC GTC GGT GAC 480
GCA GTC GAG GAT AGC GAG GCC ACC ATC GCC GCG ATC GGC GGG GTG AAG 528
CGA CTC GGC CGC CTG CCG CGT CTG GCC ACG CTA AAT CGC GAG ACA CTG 576
ACC GAG GCG TTC GCG GCG CAT TTC CGG AGC GAG GAT TTC CGA TGA 621
SEQ ID NO:13
序列长度:627
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:1..627
决定特征的方法:S
序列描述
ATG AGA CCG CTT ATC GTC ACC GGA ACC GAT ACC GAG ATC GGC AAG ACC 48
GTC TTC GCC GCC GCG CTC GCG GGC GCC CTC GGC TCA CAT TAC TGG AAG 96
CCG GTG CAG GCA GGC CTC GAA GAA GAC GGC GGC GAC GGC GAC CGC GTG 144
GCG CGC CTC TCC GGC CTG CCT GCC AGC CAT ATT CTG CCC GAA GCC TAT 192
CGC CTC GCC ACC CCC TGC TCG CCG CAC CTC GCC GCC GAG ATC GAC GGG 240
GTG GAA ATC GAT CCC GAG CGC CTC GCC TTG CCG CAA GTG GAC GGT CCG 288
CTG GTG GTC GAA GGC GCA GGC GGC GTC ATG GTC CCG CTC ACC CGG ACC 336
ACG ACT TAT GCC GAC CAG TTC GCG CGG TGG AAC GCC CCG GTC GTG CTG 384
GTG GCG CGC ACG ATG CTC GGC ACG ATC AAC CAT TCG CTG CTC TCC ATC 432
GAG GCC CTG CGC GCG CGC GGC GTC GAA GTG CTG GGC GTG GCC TTC GTC 480
GGC GAT CCG ATG GAA GAC AGC GAG GCG ACG ATC TGC GCC ATG GCC AAT 528
GTC CGC CGC CTC GGC CGC CTG CCC CGC CTC GCC TCG CTG ACC CCG GAG 576
AAC CTC GCC AAG GCC TTC GCC GAA AAC TTC CAT ATC GGA GAT TTC ACG 624
CAA 627
SEQ ID NO:14
序列长度:341
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Thr Thr Thr Pro Ala Leu Ser Ser Glu Ala Thr Pro Arg Thr Asp
5 10 15
Trp Thr Arg Ala Glu Ile Ala Ala Leu Phe Asp Leu Pro Phe Thr Glu
20 25 30
Leu Leu Phe Arg Ala Ala Glu Val His Arg Ala His His Ala Ala Asp
35 40 45
Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro
50 55 60
Glu Asp Cys Gly Tyr Cys Ser Gln Ser Thr His Ala Asp Thr Gly Leu
65 70 75 80
Lys Ala Thr Lys Leu Met Asp Pro Arg Ala Val Leu Gln Ala Ala Ala
85 90 95
Gln Ala Lys Asp His Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp
100 105 110
Arg Asn Pro Lys Asp Arg Asp Met Pro Ala Ile Val Glu Met Val Lys
115 120 125
Gly Val Arg Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly Met Leu
130 135 140
Thr Asp Ala Gln Ala Gln Thr Leu Ala Glu Ala Gly Leu Asp Tyr Tyr
145 150 155 160
Asn His Asn Ile Asp Thr Ser Pro Glu Arg Tyr Gly Asp Val Ile Thr
165 170 175
Thr Arg Ser Phe Gly Glu Arg Leu Glu Thr Leu Glu His Val Arg Asp
180 185 190
Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Thr
195 200 205
Arg Gly Asp Arg Val Gly Phe Ile His Ala Leu Ala Thr Leu Pro Val
210 215 220
His Pro Gly Ser Val Pro Val Asn Ala Leu Val Pro Val Lys Gly Thr
225 230 235 240
Val Leu Gly Asp Met Leu Ala Asp Thr Pro Leu Ala Lys Ile Asp Asp
245 250 255
Ile Glu Phe Val Arg Thr Val Ala Val Ala Arg Ile Thr Met Pro His
260 265 270
Ser Met Val Arg Leu Ser Ala Gly Arg Glu Ser Met Ser Asp Ala Thr
275 280 285
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Thr Gly Asp
290 295 300
Lys Leu Leu Thr Ala Gly Asn Ala Gly Asp Asp Lys Asp Ala Ala Leu
305 310 315 320
Phe Ala Arg Leu Gly Leu Thr Pro Met Ala Ala Glu Cys Lys Val Glu
325 330 335
Leu Glu Ala Ala Glu
340 341
SEQ ID NO:15
序列长度:352
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:鞘氨醇单胞菌属种类
菌株:SC42405
序列描述
Met Thr Met Thr Asp Thr Pro Ala Ile Thr Ala Arg Thr Asp Trp Thr
5 10 15
Arg Glu Glu Ile Ala Ala Leu Phe Asp Leu Pro Phe Thr Glu Leu Val
20 25 30
Phe Arg Ala Ala Glu Val His Arg Ala Ser His Pro His Asn Glu Val
35 40 45
Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Val Glu Asp
50 55 60
Cys Gly Tyr Cys Ser Gln Ser Val Ser Ala Asn Ser Gly Val Lys Ala
65 70 75 80
Thr Lys Leu Met Glu Val Gln Gln Val Leu Gln Arg Ala Ala Gln Ala
85 90 95
Ala Asp Gln Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp Arg Asn
100 105 110
Pro Lys Asp Arg Asp Met Pro Ala Ile Ile Glu Met Val Lys Gly Val
115 120 125
Arg Ala Met Gly Met Glu Thr Cys Met Thr Arg Gly Met Leu Thr Pro
130 135 140
Asp Gln Ala Asp Met Leu Ser Glu Ala Gly Leu Asp Tyr Tyr Asn His
145 150 155 160
Asn Ile Asp Thr Ser Pro Glu Arg Tyr Asp Gln Val Ile Thr Thr Arg
165 170 175
Thr Met Asp Asp Arg Leu Asp Thr Leu Ser Asn Val Arg Met Ala Gly
180 185 190
Ile Asn Val Cys Ser Gly Gly Ile Val Gly Met Gly Glu Thr Arg Ala
195 200 205
Asp Arg Val Gly Phe Val His Thr Leu Ala Thr Leu Pro Asp His Pro
210 215 220
Gln Ser Val Pro Val Asn Ala Leu Val Pro Val Lys Gly Thr Val Leu
225 230 235 240
Gly Asp Met Leu Ala Asp Thr Pro Leu Ala Lys Ile Asp Asp Val Glu
245 250 255
Phe Val Arg Thr Val Ala Val Ala Arg Ile Thr Met Pro Leu Ser Met
260 265 270
Val Arg Leu Ser Ala Gly Arg Glu Ser Met Ser Glu Met Thr Gln Ala
275 280 285
Met Cys Phe Met Ala Gly Ala Asn Ser Ile Phe Thr Gly Asp Lys Leu
290 295 300
Leu Thr Ala Pro Asn Ser Gly Asp Asp Asn Asp Ala Ala Met Phe Ala
305 310 315 320
Arg Leu Gly Ile Lys Pro Met Ala Ile Glu Leu Thr Pro Ala Gln Val
325 330 335
Glu Ala Gln Arg Met Pro Lys Gly Cys Ala Lys Leu Glu Ala Ala Glu
340 345 350 352
SEQ ID NO:16
序列长度:1420
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:223..1248
决定特征的方法:S
序列描述
GATCCCCGAG CTGATCGGCC ATCTGCGCGA GGCGGGCCGC GCGGATATCA AGGTCATCGC 60
GGGTGGCGTT ATTCCCGCAC AGGACTATCA GGCACTCTAC GATGCCGGGG TACAGGCGAT 120
TTTCGGTCCC GGCACCAATC TTGTGAAAGC GGCCGAGGAT GTGCTGAGGC TGCTGGGACA 180
TAATATGCCG CCCGAGGCGG GCGAATGACA GGACGACACG TG ATG ACG ACG ACA 234
CCC GCG CTG AGC TCC GAG GCG ACC CCG CGC ACC GAC TGG ACC CGC GCC 282
GAG ATC GCC GCG CTG TTC GAC CTG CCC TTC ACC GAG CTG TTG TTC CGC 330
GCG GCC GAG GTG CAC CGC GCG CAT CAC GCC GCC GAT CAG GTT CAG CTG 378
TCG ACG CTG TTG TCG ATC AAG ACG GGC GGC TGC CCC GAG GAT TGC GGC 426
TAT TGC AGC CAG TCG ACC CAT GCC GAT ACC GGG CTG AAG GCG ACC AAG 474
CTG ATG GAC CCG CGC GCC GTG CTG CAG GCG GCG GCG CAG GCC AAG GAT 522
CAC GGC TCG ACG CGC TTC TGC ATG GGC GCG GCC TGG CGC AAC CCC AAG 570
GAT CGC GAC ATG CCC GCC ATC GTG GAG ATG GTG AAG GGC GTG CGC GCC 618
ATG GGC ATG GAA ACC TGC ATG ACG CTG GGC ATG CTG ACC GAT GCA CAG 666
GCG CAG ACG CTC GCC GAG GCG GGG CTG GAC TAT TAC AAT CAC AAT ATC 714
GAC ACG TCG CCC GAG CGT TAT GGC GAC GTC ATC ACC ACG CGC AGC TTC 762
GGC GAG CGG TTG GAG ACG TTG GAG CAT GTC CGC GAT GCC GGC ATC AAT 810
GTA TGC TGT GGC GGT ATT GTC GGC ATG GGT GAG ACG CGC GGC GAC CGG 858
GTC GGC TTC ATC CAT GCG CTT GCC ACC CTG CCG GTC CAT CCG GGC AGC 906
GTG CCG GTG AAC GCG CTG GTG CCG GTC AAG GGC ACG GTA TTG GGC GAT 954
ATG TTG GCC GAC ACG CCG CTG GCC AAG ATC GAC GAT ATC GAA TTC GTC 1002
CGC ACC GTC GCG GTT GCG CGC ATC ACC ATG CCG CAT TCG ATG GTC CGC 1050
CTG TCG GCG GGG CGC GAG AGC ATG TCG GAT GCC ACC CAG GCT TTG TGC 1098
TTC CTG GCG GGC GCG AAC TCG ATC TTC ACC GGC GAC AAG CTG CTG ACT 1146
GCG GGC AAT GCG GGC GAC GAC AAG GAC GCA GCG CTC TTC GCC CGG CTG 1194
GGG CTC ACG CCC ATG GCG GCG GAG TGC AAG GTG GAA TTG GAA GCG GCG 1242
GAG TAA ACAGGCTTCG CCGGTTGTCC CCGGCGAAAG CCGGAGCCCA GTTGCGGTGA 1298
AGTAGGGGTG GTGCGCCACC CGAATGGCAT TCGACACGGA CCAACGACAT AATAGGAGAG 1358
GTATCCCCGT GTTCCAGAAA ATCCTGATCG CCAATCGCGG GGAAATCGCG TGCCGGGTGA 1418
TC 1420
SEQ ID NO:17
序列长度:1358
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:152..1210
决定特征的方法:S
序列描述
TGCTGCGCCT GCTCGGCCAC AACATGCCGC CGCTCGGTTC TTCGCTGGAA GCGGCGGAAT 60
AAGGATGGCC ACGCTGGATC GACGCCGCGC TTGCCCCTAT GACCGGGGTT TGGCCGCGCG 120
TCATCCCGCG CGCAGACCGG CCGCCTGAGG A ATG ACT ATG ACT GAC ACC CCC 172
GCC ATC ACT GCA CGT ACC GAC TGG ACC CGT GAG GAA ATC GCG GCG CTG 220
TTC GAC CTG CCG TTC ACC GAA CTG GTG TTC CGC GCA GCC GAA GTC CAT 268
CGC GCC AGC CAT CCG CAC AAC GAA GTG CAG CTT TCC ACG CTG CTT TCG 316
ATC AAG ACC GGC GGC TGC GTG GAA GAC TGC GGC TAT TGC TCA CAG TCG 364
GTT TCG GCC AAC AGC GGC GTC AAG GCG ACC AAG CTG ATG GAA GTG CAG 412
CAG GTG CTG CAG CGC GCG GCG CAG GCG GCG GAT CAG GGC TCT ACC CGC 460
TTC TGC ATG GGC GCC GCC TGG CGC AAC CCC AAG GAC CGC GAC ATG CCC 508
GCC ATC ATC GAG ATG GTG AAG GGC GTG CGC GCC ATG GGC ATG GAA ACC 556
TGC ATG ACG CGG GGC ATG CTG ACG CCC GAT CAG GCG GAC ATG CTC TCC 604
GAA GCG GGT CTC GAT TAC TAC AAC CAC AAC ATC GAC ACC TCG CCC GAG 652
CGT TAC GAT CAG GTG ATC ACC ACG CGC ACG ATG GAT GAC CGC CTC GAT 700
ACG CTG TCG AAC GTG CGT ATG GCG GGC ATC AAC GTC TGC TCC GGC GGC 748
ATC GTC GGC ATG GGT GAG ACG CGC GCC GAC CGC GTG GGC TTC GTT CAC 796
ACG CTG GCG ACG CTG CCC GAT CAC CCG CAG TCG GTG CCG GTC AAC GCG 844
CTG GTT CCT GTG AAG GGC ACC GTG CTG GGC GAC ATG CTG GCC GAT ACC 892
CCG CTT GCC AAG ATC GAC GAT GTG GAA TTC GTG CGC ACC GTC GCG GTG 940
GCG CGC ATC ACC ATG CCG CTG TCG ATG GTG CGC CTC TCG GCC GGC CGC 988
GAA TCG ATG TCC GAA ATG ACG CAG GCG ATG TGC TTC ATG GCC GGC GCG 1036
AAC TCG ATC TTC ACC GGC GAC AAG CTG CTG ACC GCA CCG AAC TCC GGC 1084
GAC GAC AAC GAC GCG GCG ATG TTC GCC CGT CTC GGC ATC AAG CCG ATG 1132
GCC ATC GAA CTG ACC CCG GCG CAA GTC GAA GCC CAG CGC ATG CCC AAG 1180
GGC TGC GCC AAG CTG GAA GCT GCG GAA TAA CGAATGGGGC ACCGCGCACC 1230
CTTCCATCCC CGTCATGCTG AACTTGTTTC AGCATCCATT TCGCCGTTCG GACCGATGGC 1290
CTGTGCGGCG CGATGGACCC TGAGCCGTCA GGCCAGCGGA GCTAAACAAG TTCAGGGGGA 1350
CGATGAGG 1358
SEQ ID NO:18
序列长度:341
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Thr Thr Thr Pro Ala Leu Ser Ser Glu Ala Thr Pro Arg Thr Asp
5 10 15
Trp Thr Arg Ala Glu Ile Ala Ala Leu Phe Asp Leu Pro Phe Thr Glu
20 25 30
Leu Leu Phe Arg Ala Ala Glu Val His Arg Ala His His Ala Ala Asp
35 40 45
Gln Val Gln Leu Ser Thr Leu Leu Ser Ile Lys Thr Gly Gly Cys Pro
50 55 60
Glu Asp Cys Gly Tyr Cys Ser Gln Ser Thr His Ala Asp Thr Gly Leu
65 70 75 80
Lys Ala Thr Lys Leu Met Asp Pro Arg Ala Val Leu Gln Ala Ala Ala
85 90 95
Gln Ala Lys Asp His Gly Ser Thr Arg Phe Cys Met Gly Ala Ala Trp
100 105 110
Arg Asn Pro Lys Asp Arg Asp Met Pro Ala Ile Val Glu Met Val Lys
115 120 125
Gly Val Arg Ala Met Gly Met Glu Thr Cys Met Thr Leu Gly Met Leu
130 135 140
Thr Asp Ala Gln Ala Gln Thr Leu Ala Glu Ala Gly Leu Asp Tyr Tyr
145 150 155 160
Asn His Asn Ile Asp Thr Ser Pro Glu Arg Tyr Gly Asp Val Ile Thr
165 170 175
Thr Arg Ser Phe Gly Glu Arg Leu Glu Thr Leu Glu His Val Arg Asp
180 185 190
Ala Gly Ile Asn Val Cys Cys Gly Gly Ile Val Gly Met Gly Glu Thr
195 200 205
Arg Gly Asp Arg Val Gly Phe Ile His Ala Leu Ala Thr Leu Pro Val
210 215 220
His Pro Gly Ser Val Pro Val Asn Ala Leu Val Leu Val Lys Gly Thr
225 230 235 240
Val Leu Gly Asp Met Leu Ala Asp Thr Pro Leu Ala Lys Ile Asp Asp
245 250 255
Ile Glu Phe Val Arg Thr Val Ala Val Ala Arg Ile Thr Met Pro His
260 265 270
Ser Met Val Arg Leu Ser Ala Gly Arg Glu Ser Met Ser Asp Ala Thr
275 280 285
Gln Ala Leu Cys Phe Leu Ala Gly Ala Asn Ser Ile Phe Thr Gly Asp
290 295 300
Lys Leu Leu Thr Ala Gly Asn Ala Gly Asp Asp Lys Asp Ala Ala Leu
305 310 315 320
Phe Ala Arg Leu Gly Leu Thr Pro Met Ala Ala Glu Cys Lys Val Glu
325 330 335
Leu Glu Ala Ala Glu
340 341
SEQ ID NO:19
序列长度:1336
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:151..1176
决定特征的方法:S
序列描述
TCTAGAACAG GACTATCAGG CACTCTACGA TGCCGGGGTA CAGGCGATTT TCGGTCCCGG 60
CACCAATCTT GTGAAAGCGG CCGAGGATGT GCTAAGGCTG CTGGGACATA ATATGCCGCC 120
CGAGGCGGGC GAATGACAGG ACGACACGTG ATG ACG ACG ACA CCC GCG CTG AGC 174
TCC GAG GCG ACC CCG CGC ACC GAC TGG ACC CGC GCC GAG ATC GCC GCG 222
CTG TTC GAC CTG CCC TTC ACC GAG CTG TTG TTC CGC GCG GCC GAG GTG 270
CAC CGC GCG CAT CAC GCC GCC GAT CAG GTT CAG CTG TCG ACG CTG TTG 318
TCG ATC AAG ACG GGC GGC TGC CCC GAG GAT TGC GGC TAT TGC AGC CAG 366
TCG ACC CAT GCC GAT ACC GGG CTG AAG GCG ACC AAG CTG ATG GAC CCG 414
CGC GCC GTG CTG CAG GCG GCG GCG CAG GCC AAG GAT CAC GGC TCG ACG 462
CGC TTC TGC ATG GGC GCG GCC TGG CGC AAC CCC AAG GAT CGC GAC ATG 510
CCC GCC ATC GTG GAG ATG GTG AAG GGC GTG CGC GCC ATG GGC ATG GAA 558
ACC TGC ATG ACG CTG GGC ATG CTG ACC GAT GCA CAG GCG CAG ACG CTC 606
GCC GAG GCG GGG CTG GAC TAT TAC AAT CAC AAT ATC GAC ACG TCG CCC 654
GAG CGT TAT GGC GAC GTC ATC ACC ACG CGC AGC TTC GGC GAG CGG TTG 702
GAG ACG TTG GAG CAT GTC CGC GAT GCC GGC ATC AAT GTA TGC TGT GGC 750
GGT ATT GTC GGC ATG GGT GAG ACG CGC GGC GAC CGG GTC GGC TTC ATC 798
CAT GCG CTT GCC ACC CTG CCG GTC CAT CCG GGC AGC GTG CCG GTG AAC 846
GCG CTG GTG CTG GTC AAG GGC ACG GTA TTG GGC GAT ATG TTG GCC GAC 894
ACG CCG CTG GCC AAG ATC GAC GAT ATC GAA TTC GTC CGC ACC GTC GCG 942
GTT GCG CGC ATC ACC ATG CCG CAT TCG ATG GTC CGC CTG TCG GCG GGG 990
CGC GAG AGC ATG TCG GAT GCC ACC CAG GCT TTG TGC TTC CTG GCG GGC 1038
GCG AAC TCG ATC TTC ACC GGC GAC AAG CTG CTG ACT GCG GGC AAT GCG 1086
GGC GAC GAC AAG GAC GCA GCG CTC TTC GCC CGG CTG GGG CTC ACG CCC 1134
ATG GCG GCG GAG TGC AAG GTG GAA TTG GAA GCG GCG GAG TAA 1176
ACAGGCTTCG CCGGTTGTCC CCGGCGAAAG CCGGAGCCCA GTTGCGGTGA AGTAGGGGTG 1236
GTGCGCCACC CGAATGGCAT TCGACACGGA CCAACGACAT AATAGGAGAG GTATCCCCGT 1296
GTTCCAGAAA ATCCTGATCG CCAATCGCGG GGAATCTAGA 1336
SEQ ID NO:20
序列长度:283
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCW7511
序列描述
Met Ala Glu Asp Ser Pro Ser Arg Ala Arg Ile Ala Gln Ala Phe Asp
5 10 15
Ala Ala Ala Ala Tyr Asp Ala Tyr Ala Val Val Gln Arg Gln Val Ala
20 25 30
Ala Trp Leu Ala Glu Arg Ile Val Ala Val Ala Pro Pro Arg Pro Arg
35 40 45
Val Leu Glu Val Gly Cys Gly Thr Gly Phe Leu Thr Gln Ala Ala Trp
50 55 60
Pro Arg Leu Asp Arg Pro Glu Trp Leu Met Thr Asp Ile Ala Pro Glu
65 70 75 80
Met Leu Ala Arg Gly Arg Ala Gln Met Pro Asp Leu Cys Ala Arg Val
85 90 95
Met Asp Gly Glu Arg Pro Asp Leu Ala Gly Glu Ala Pro Phe Asp Leu
100 105 110
Ile Val Ser Ser Leu Ala Val Gln Trp Phe Ser Asp Leu Glu Gly Gly
115 120 125
Leu Gln Arg Leu Ala Ala Leu Leu Ala Pro Gly Gly Arg Met Leu Val
130 135 140
Thr Thr Leu Ala Gln Gly Thr Phe Ala Gly Trp His Ala Ala His Arg
145 150 155 160
Ala Glu Gly Tyr Glu Ala Gly Ser His Ala Tyr Pro Thr Val Glu Ala
165 170 175
Leu Ala Ala Met Ala Leu Pro Gly Leu Gly Val Ala Thr Arg Arg Phe
180 185 190
Glu Gln Arg His Glu Thr Ala Ala Asp Phe Met Arg Ala Leu Arg Ala
195 200 205
Ile Gly Ala Gly Thr Pro Arg Val Gly His Arg Pro Ile Pro Pro Gly
210 215 220
Ala Met Arg Arg Ile Ala Lys Arg Phe Glu Val Gly Gly Ala Val Ala
225 230 235 240
Thr Tyr Glu Val Ala Leu Met Asp Ile Pro Asn Pro Val Gln Pro Glu
245 250 255
Arg Ser Arg Arg Pro Arg Ala Thr Arg Glu Ala Gly Arg Val Leu Arg
260 265 270
Phe Arg Ser Ala Arg Thr Glu Val Gly Gly Lys
275 280 283
SEQ ID NO:21
序列长度:254
序列类型:氨基酸
拓扑结构:线型
分子类型:蛋白质
原始来源
生物:鞘氨醇单胞菌属种类
菌株:SC42405
序列描述
Met Asn Ala Pro Arg Glu Arg Val Ser Arg Ala Phe Ala Ala Ala Pro
5 10 15
Asp Tyr Asp Gly His Ala Arg Ile Gln Arg Glu Val Ala Gln Thr Leu
20 25 30
Ala Ala Arg Ile Ala Ala Leu Asp Leu Pro Pro Asn Pro Arg Val Leu
35 40 45
Glu Ile Gly Cys Gly Thr Gly Phe Leu Thr Gln Ala Leu Ala Gly Leu
50 55 60
Asp Gly Asp Trp Leu Val Thr Asp Leu Ala Pro Glu Met Leu Glu Arg
65 70 75 80
Cys Arg Ser Arg Leu Gly Glu Ser Ala Arg His Arg Phe Ala Val Leu
85 90 95
Asp Gly Glu Tyr Gly Ala Pro Asp Gly Ala Pro Phe Asp Leu Ile Cys
100 105 110
Ser Ser Leu Ala Val Gln Trp Phe Asp Asp Thr Pro Ala Ala Leu Ala
115 120 125
Arg Met Ala Gly Trp Leu Ala Pro Gly Gly His Leu Met Val Thr Thr
130 135 140
Leu Gly Pro Gly Ser Phe Ala Glu Trp Arg Ala Ala His Glu Ala Glu
145 150 155 160
Gly Leu Glu Pro Gly Thr Pro His Phe Ala Asp Ile Ala Ala Phe Gly
165 170 175
Asp Leu Val His Ala Val Glu His Pro Val Glu His His Ala Asp Pro
180 185 190
Leu Ala Phe Leu His Ala Leu Lys Ala Ile Gly Ala Gln Thr Ala Glu
195 200 205
Ala Gly His Arg Pro Leu Ser Pro Gly Gln Leu Arg Arg Val Met Ala
210 215 220
Arg Phe Ala Gln Ser Gly Cys Pro Gln Asn Gly Cys Lys Val Thr Tyr
225 230 235 240
Glu Val Val Thr Cys His Leu His Arg Glu Ser Ser Leu Ser
245 250 254
SEQ ID NO:22
序列长度:1582
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞菌
菌株:JCM7511
特征:
特征键:CDS
位置:489..1340
决定特征的方法:S
序列描述
GATCTGGGTC GCGGCGCTGT TGGCGGCGTT GTTGCCGGTC GATCGGCTGG TCGCGCCCGA 60
TTGCGAGTCG GGCTGGTTCG ATCAGGTCGT GGTGCGCGGG GTGTCGCTGC CGCTCGTCAT 120
GCTGTTGCGG ATCGTCGCGC ATTGGCTGGC CTTTGCGCCG CCGCTGATGC TGGCGGCGAT 180
GGTCGCAGGC GGTTTGTTCG GGCTGGATGG CGCCGCGTTG GTGAGGGTCG AGACCGGATT 240
GCTGCTCGGT ACGCCGGGGC TCGCCGCGCT GGCGGTGGCG ACGGGGGCGC TGACGGCGGG 300
CTTGCGCGGT GCGGGAGCGG TGGCGGGGTT GCTGCTGTTA CCGCTCGCCC TGCCGCTGCT 360
GATCGATCTT CGGGGCTAGC GATGACGGCA TGGGCGGGGC CAAGCTGCTC GCCGCCGTGT 420
CGCTGTTGCT GGTCGCGGGT GCGCCCTGGC TGGCGGCGGC GGCGATCCGG TCGGTGCGCG 480
ACTGAGCC ATG GCC GAA GAC AGT CCA TCG CGC GCG CGG ATC GCG CAG GCC 530
TTT GAC GCG GCG GCG GCC TAT GAC GCC TAT GCG GTG GTG CAG CGC CAA 578
GTG GCC GCG TGG CTA GCC GAA CGA ATC GTC GCG GTC GCC CCG CCG AGG 626
CCC CGC GTG CTG GAG GTC GGG TGC GGC ACA GGC TTC CTG ACA CAG GCG 674
GCA TGG CCC CGG CTT GAT CGC CCC GAA TGG TTG ATG ACC GAT ATC GCA 722
CCC GAG ATG CTG GCC CGG GGC AGG GCG CAG ATG CCG GAT CTG TGT GCG 770
CGG GTG ATG GAT GGC GAG CGC CCC GAT CTG GCG GGC GAA GCG CCG TTC 818
GAC CTG ATC GTC AGC AGC CTG GCG GTG CAG TGG TTT TCC GAT CTG GAG 866
GGC GGC CTG CAG CGG CTG GCG GCG CTG CTC GCC CCT GGC GGG CGG ATG 914
CTG GTG ACG ACT CTG GCG CAA GGG ACA TTC GCC GGC TGG CAT GCC GCG 962
CAT CGG GCG GAG GGA TAT GAG GCG GGG AGT CAC GCC TAT CCA ACG GTC 1010
GAG GCG CTC GCG GCC ATG GCG TTG CCG GGG CTT GGG GTC GCC ACG CGA 1058
CGC TTC GAG CAG CGG CAC GAG ACG GCG GCG GAC TTC ATG CGC GCA CTA 1106
CGG GCG ATC GGG GCG GGG ACA CCG CGT GTT GGG CAC CGC CCG ATC CCG 1154
CCG GGC GCG ATG CGG CGG ATC GCG AAG CGC TTT GAG GTA GGC GGG GCG 1202
GTG GCG ACC TAT GAG GTC GCG TTG ATG GAC ATT CCC AAC CCC GTT CAG 1250
CCT GAG CGA AGT CGA AGG CCA CGC GCG ACG CGA GAG GCG GGG CGT GTG 1298
CTT CGA TTT CGC TCA GCA CGA ACG GAG GTT GGA GGT AAG TAG 1340
GCTTGGGTTG GCGTTTATCG GCTCCACCGC GCCCAGATGG CCGTGGGCAG GATCAGCCCG 1400
CGTGCTTCCT CGCGCACACC GAGTTCGCCG CATTCGACCT TGCCCGGCAG GTCGCCCATC 1460
ACTTGGCGGA GCATCTCGCC GATCGCCAGC GCCGACATGC GCACCGCATA GACGGTCAGG 1520
AACAGGAAGC GCGAATTCGC GTCGAGCAGC TTGCGGCAAT CGGCGATCAG GCCGGGCAGA 1580
TC 1582
SEQ ID NO:23
序列长度:971
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
特征:
特征键:CDS
位置:210..971
决定特征的方法:S
序列描述
TTGCGCGATG AGGAGGCCAC CTTGCCCGCC GTCCCCATCA TCTCGCTTCA AGGCGCGCGC 60
GACCCGCTTC TGCCCGAAGC GATGCGCGCA CATGTCTTCC GGAACGCCGC CGTGCGCCGG 120
ATCGAATGCG AGACCGGAGG GCACCTCCTC CCGCTCGAAG TGCCGGAATT CTGCGCGCAA 180
GCCGTGCGCG ACATGATCGA GACGCTGGC ATG AAC GCC CCC CGC GAG CGC GTC 233
AGC CGC GCC TTT GCC GCC GCG CCC GAC TAC GAC GGC CAT GCC CGC ATC 281
CAG CGT GAG GTC GCA CAA ACA CTC GCC GCC CGG ATC GCC GCG CTC GAC 329
CTG CCT CCA AAC CCG CGC GTG CTG GAG ATC GGC TGC GGC ACC GGT TTT 377
CTC ACG CAG GCG CTG GCC GGG CTG GAT GGC GAC TGG CTC GTC ACC GAT 425
CTT GCG CCC GAA ATG CTG GAG CGC TGT CGC AGC CGC CTG GGC GAA AGC 473
GCC CGG CAC CGC TTT GCC GTG CTC GAT GGC GAA TAT GGC GCA CCG GAC 521
GGC GCA CCG TTC GAC CTG ATC TGC TCC AGC CTC GCC GTG CAA TGG TTC 569
GAC GAT ACC CCG GCC GCC CTC GCC CGC ATG GCA GGC TGG CTG GCA CCG 617
GGC GGG CAC CTC ATG GTG ACG ACA CTC GGC CCC GGC AGC TTC GCC GAA 665
TGG CGC GCC GCG CAT GAA GCG GAG GGG CTG GAA CCC GGC ACG CCC CAC 713
TTC GCG GAC ATC GCC GCC TTC GGC GAC CTC GTC CAC GCG GTC GAG CAC 761
CCC GTC GAG CAT CAC GCC GAT CCG CTG GCC TTC CTC CAC GCC CTC AAG 809
GCC ATC GGC GCG CAG ACC GCC GAA GCC GGA CAC CGC CCC CTT TCC CCC 857
GGC CAG CTT CGC CGC GTC ATG GCA CGT TTC GCC CAA AGC GGA TGC CCC 905
CAA AAC GGA TGC AAA GTG ACT TAC GAA GTC GTG ACC TGC CAC CTA CAC 953
CGA GAA TCG AGC CTT TCA 971
SEQ ID NO:24
序列长度:150
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:少动鞘氨醇单胞
菌株:JCM7511
序列描述
TCTAGAACAG GACTATCAGG CACTCTACGA TGCCGGGGTA CAGGCGATTT TCGGTCCCGG 60
CACCAATCTT GTGAAAGCGG CCGAGGATGT GCTAAGGCTG CTGGGACATA ATATGCCGCC 120
CGAGGCGGGC GAATGACAGG ACGACACGTG 150
SEQ ID NO:25
序列长度:201
序列类型:核酸
链型:双链
拓扑结构:线型
分子类型:基因组DNA
原始来源
生物:鞘氨醇单胞属种类
菌株:SC42405
序列描述
ACCGGAATGA CAGGCGGACA GCAGCAATAG GGCGGCAAGA GAGAGCGGCA GGGATCGCAT 60
CAGACGGGCA TCCTTCGGTT TTTCCTTTGC CGTTCCAACG CGCGAGGAAG GCGGCGGCTT 120
CACGTCCCGC CGCGAAATCG ATGCCCCTCC CGGCCAGCCA AGCATTGTGC CGGACGCCCG 180
CTTGCCATAC GGGCAGGGGC G 201
Claims (16)
1.一种DNA片段,该片段具有编码其氨基酸序列如SEQ ID NO:14、15或18所示的蛋白质的基因,能获自少动鞘氨醇单胞菌(Sphingomonas paucimobilis)或鞘氨醇单胞属(Sphingomonas Sp.)微生物。
2.根据权利要求1所述的DNA片段,其中所述基因是生物素合酶基因。
3.根据权利要求1所述的DNA片段,其中的基因是编码生物素合酶的基因,并且所述基因的碱基序列如SEQ ID NO:16、17或19所示。
4.具有编码一种蛋白的基因的DNA片段,该蛋白的氨基酸序列如SEQ ID NO:14所示,并且该蛋白具有生物素合酶活性。
5.具有编码一种蛋白的基因的DNA片段,该蛋白的氨基酸序列如SEQ ID NO:15所示,并且该蛋白具有生物素合酶活性。
6.具有编码一种蛋白的基因的DNA片段,该蛋白的氨基酸序列如SEQ ID NO:18所示,并且该蛋白具有生物素合成酶活性。
7.权利要求1的DNA片段,其中所述片段具有基因的基因表达调控区。
8.根据权利要求7所述的DNA片段,其中所述的基因是生物素合酶基因。
9.根据权利要求1、2、3和7-8中任何一项所述的DNA片段,其中的微生物是少动鞘氨醇单胞菌JCM 7511或鞘氨醇单胞菌属种类SC42405。
10.包含根据权利要求1到9中任何一项所述的DNA片段的载体。
11.一种构建载体的方法,包括将根据权利要求1到9中任何一项所述的DNA片段插入能在宿主细胞中复制的载体。
12.根据权利要求10所述的载体,其中基因表达调控区域被连入蛋白编码区的上游。
13.含有导入宿主细胞的至少一个根据权利要求1到9中任何一项所述的DNA片段或至少一个根据权利要求10或12所述的载体的转化子。
14.根据权利要求13所述的转化子,其中的宿主细胞是微生物。
15.一种构建转化子的方法,包括将根据权利要求10或12所述的载体导入宿主细胞。
16.由权利要求3到6的任何一项的DNA片段编码的蛋白,该蛋白具有生物素合酶活性。
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP047838/1997 | 1997-03-03 | ||
JP4783897 | 1997-03-03 | ||
JP047838/97 | 1997-03-03 |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB988002272A Division CN1166778C (zh) | 1997-03-03 | 1998-03-02 | 含生物素生物合成基因的dna片段及其利用 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1590546A CN1590546A (zh) | 2005-03-09 |
CN1590546B true CN1590546B (zh) | 2010-08-18 |
Family
ID=12786519
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2004100634785A Expired - Fee Related CN1590546B (zh) | 1997-03-03 | 1998-03-02 | 含生物素生物合成基因的dna片段及其利用 |
CNB988002272A Expired - Fee Related CN1166778C (zh) | 1997-03-03 | 1998-03-02 | 含生物素生物合成基因的dna片段及其利用 |
Family Applications After (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB988002272A Expired - Fee Related CN1166778C (zh) | 1997-03-03 | 1998-03-02 | 含生物素生物合成基因的dna片段及其利用 |
Country Status (12)
Country | Link |
---|---|
US (1) | US6410293B1 (zh) |
EP (2) | EP1577394B1 (zh) |
JP (1) | JP4329129B2 (zh) |
KR (1) | KR100567612B1 (zh) |
CN (2) | CN1590546B (zh) |
AT (1) | ATE432989T1 (zh) |
CA (1) | CA2252927C (zh) |
DE (2) | DE69840862D1 (zh) |
DK (2) | DK0913475T3 (zh) |
ES (2) | ES2326535T3 (zh) |
HU (1) | HUP0000718A2 (zh) |
WO (1) | WO1998039452A1 (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR102688631B1 (ko) * | 2021-11-15 | 2024-07-24 | 씨제이제일제당 (주) | 클래스 I 타입의 BirA를 포함하는 미생물 및 이를 이용한 바이오틴 생산방법 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS56160998A (en) | 1980-05-15 | 1981-12-11 | Nippon Zeon Co Ltd | Production of biotin active substance biotin vitamer |
US4446759A (en) | 1981-10-02 | 1984-05-08 | Ford Motor Company | Clutch stroke control metering valve for an automatic transmission |
JPH0740922B2 (ja) | 1985-03-05 | 1995-05-10 | 株式会社資生堂 | ビオチン生産性微生物 |
DE3786578T2 (de) * | 1986-03-25 | 1994-02-03 | Nippon Zeon Co | Biotin-Synthetase kodierendes Gen und dessen Verwendung. |
GB2216530B (en) * | 1988-03-22 | 1992-07-08 | Mini Agriculture & Fisheries | Genetic material for expression of biotin synthetase enzymes |
JP2722504B2 (ja) | 1988-07-14 | 1998-03-04 | 田辺製薬株式会社 | 新規微生物及びそれを用いるd−ビオチンの製法 |
JP3428078B2 (ja) | 1992-09-10 | 2003-07-22 | 住友化学工業株式会社 | ビオチンの製造方法および使用される微生物 |
EP0635572A3 (en) * | 1993-06-25 | 1995-03-08 | Hoffmann La Roche | Biotin biosynthesis in Bacillus subtilis. |
JP4121756B2 (ja) | 2002-03-15 | 2008-07-23 | フクビ化学工業株式会社 | コンクリート打設用型枠の設置構造 |
-
1998
- 1998-02-27 JP JP04726398A patent/JP4329129B2/ja not_active Expired - Fee Related
- 1998-03-02 CN CN2004100634785A patent/CN1590546B/zh not_active Expired - Fee Related
- 1998-03-02 US US09/180,109 patent/US6410293B1/en not_active Expired - Fee Related
- 1998-03-02 DE DE69840862T patent/DE69840862D1/de not_active Expired - Lifetime
- 1998-03-02 ES ES05003876T patent/ES2326535T3/es not_active Expired - Lifetime
- 1998-03-02 DE DE69840986T patent/DE69840986D1/de not_active Expired - Lifetime
- 1998-03-02 ES ES98905729T patent/ES2326289T3/es not_active Expired - Lifetime
- 1998-03-02 DK DK98905729T patent/DK0913475T3/da active
- 1998-03-02 WO PCT/JP1998/000858 patent/WO1998039452A1/ja active IP Right Grant
- 1998-03-02 KR KR1019980708838A patent/KR100567612B1/ko not_active IP Right Cessation
- 1998-03-02 DK DK05003876T patent/DK1577394T3/da active
- 1998-03-02 HU HU0000718A patent/HUP0000718A2/hu unknown
- 1998-03-02 CN CNB988002272A patent/CN1166778C/zh not_active Expired - Fee Related
- 1998-03-02 EP EP05003876A patent/EP1577394B1/en not_active Expired - Lifetime
- 1998-03-02 EP EP98905729A patent/EP0913475B1/en not_active Expired - Lifetime
- 1998-03-02 AT AT98905729T patent/ATE432989T1/de active
- 1998-03-02 CA CA2252927A patent/CA2252927C/en not_active Expired - Fee Related
Non-Patent Citations (4)
Title |
---|
A.J.OTSUKA 等.THE ESCHERICHIA COLI BIOTIN BIOSYNTHETICENZYME SEQUENCES PREDICTED FROM THENUCLEOTIDE SEQUENCE OF THE BIO OPERON.J.BIOL.CHEM 263.1988,(263),19577-19585. * |
A.J.OTSUKA等.THE ESCHERICHIA COLI BIOTIN BIOSYNTHETICENZYME SEQUENCES PREDICTED FROM THENUCLEOTIDE SEQUENCE OF THE BIO OPERON.J.BIOL.CHEM 263.1988,(263),19577-19585. * |
I.OHSAWA 等.CLONING OF THE BIOTIN SYNTHETASE GENE FROMBACILLUS SPHAERICUS AND EXPRESSION INESCHERICHIA COLI AND BACILLI.GENE 80.1989,(80),39-48. * |
I.OHSAWA等.CLONING OF THE BIOTIN SYNTHETASE GENE FROMBACILLUS SPHAERICUS AND EXPRESSION INESCHERICHIA COLI AND BACILLI.GENE 80.1989,(80),39-48. * |
Also Published As
Publication number | Publication date |
---|---|
CN1217749A (zh) | 1999-05-26 |
ES2326535T3 (es) | 2009-10-14 |
JP4329129B2 (ja) | 2009-09-09 |
DE69840862D1 (de) | 2009-07-16 |
EP0913475A4 (en) | 2003-04-02 |
CN1166778C (zh) | 2004-09-15 |
DK1577394T3 (da) | 2009-08-24 |
HUP0000718A2 (hu) | 2000-06-28 |
KR20000065186A (ko) | 2000-11-06 |
EP1577394A1 (en) | 2005-09-21 |
EP0913475B1 (en) | 2009-06-03 |
ES2326289T3 (es) | 2009-10-06 |
KR100567612B1 (ko) | 2006-07-25 |
CA2252927A1 (en) | 1998-09-11 |
US6410293B1 (en) | 2002-06-25 |
DE69840986D1 (de) | 2009-08-27 |
CA2252927C (en) | 2011-04-05 |
JPH10304886A (ja) | 1998-11-17 |
EP0913475A1 (en) | 1999-05-06 |
CN1590546A (zh) | 2005-03-09 |
EP1577394B1 (en) | 2009-07-15 |
DK0913475T3 (da) | 2009-08-24 |
ATE432989T1 (de) | 2009-06-15 |
WO1998039452A1 (fr) | 1998-09-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP0405370B1 (en) | Riboflavinoverproducing strains of bacteria | |
KR101558622B1 (ko) | mRNA의 안정화를 통한 표적 생성물의 증가된 제조 | |
Khudyakov et al. | Evidence that the hanA gene coding for HU protein is essential for heterocyst differentiation in, and cyanophage A-4 (L) sensitivity of, Anabaena sp. strain PCC 7120 | |
CN111117942B (zh) | 一种产林可霉素的基因工程菌及其构建方法和应用 | |
KR20030052242A (ko) | 로도콕쿠스 클로닝 및 발현 벡터 | |
HU214244B (hu) | Eljárás másodlagos metabolitok termelésének fokozására bioszintetikus géncsoporttal | |
JPH08501694A (ja) | ビオチンの生物工学的製造方法 | |
SK43196A3 (en) | Fragment dna, vector and microorganism containing genes for butyrobetaine/crotonobetaine-l-carnitine metabolism and process for producing of l-carnitine | |
JPH02167078A (ja) | 二次代謝産物生産増加のための生合成遺伝子又は制御遺伝子の単離法及び使用法 | |
CN101336295A (zh) | 用于mRNA稳定的方法 | |
AU616380B2 (en) | Cloning of the bioA, bioD, bioF, bioC, and bioH genes of Bacillus sphaericus, vectors and transformed cells and method for preparing biotin | |
CA2083407C (en) | Microorganisms for the stabilization of plasmids | |
CN1590546B (zh) | 含生物素生物合成基因的dna片段及其利用 | |
JPH08505286A (ja) | ベタインの異化に関して改変された細胞、それらの調製、および特に、代謝産物または酵素の生産のための使用 | |
EP0537553B1 (en) | Farnesyl pyrophosphate synthetase and DNA sequence encoding the same | |
EP0213898B1 (en) | A host-vector system | |
JP2722504B2 (ja) | 新規微生物及びそれを用いるd−ビオチンの製法 | |
US7049097B2 (en) | Antibiotics-independent vector for constant high-expression and method for gene expression using the same | |
Goosen et al. | Genes involved in the biosynthesis of PQQ from Acinetobacter calcoaceticus | |
AU659237B2 (en) | Gene coding for esterase and novel microorganism containing said gene | |
EP1024194A1 (en) | Plasmid vectors | |
US20020072062A1 (en) | Chloramphenicol biosynthetic pathway and gene cluster | |
CA2405326A1 (en) | Plasmid to be used in chromosome recombination of escherichia coli | |
WO2017037629A1 (en) | Improved microorganisms for the biosynthesis of pamamycins | |
HU209941B (en) | Method for identifying and using biosynthetic or regulatory genes for enhanced production of beta-lactam antibiotics, these comprising vector, and transformation of fungi cells |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20100818 Termination date: 20130302 |