Protein solubility: sequence based prediction and experimental verification
Smialowski et al., 2007
- Document ID
- 7502944370731247000
- Author
- Smialowski P
- Martin-Galiano A
- Mikolajka A
- Girschick T
- Holak T
- Frishman D
- Publication year
- 2007
- Publication venue
- Bioinformatics
Snippet
Motivation: Obtaining soluble proteins in sufficient concentrations is a recurring limiting factor in various experimental studies. Solubility is an individual trait of proteins which, under a given set of experimental conditions, is determined by their amino acid sequence …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/16—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for molecular structure, e.g. structure alignment, structural or functional relations, protein folding, domain topologies, drug targeting using structure data, involving two-dimensional or three-dimensional structures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/22—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for sequence comparison involving nucleotides or amino acids, e.g. homology search, motif or SNP [Single-Nucleotide Polymorphism] discovery or sequence alignment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/28—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for programming tools or database systems, e.g. ontologies, heterogeneous data integration, data warehousing or computing architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G06F17/30864—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems
- G06F17/30867—Retrieval from the Internet, e.g. browsers by querying, e.g. search engines or meta-search engines, crawling techniques, push systems with filtering and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/18—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for functional genomics or proteomics, e.g. genotype-phenotype associations, linkage disequilibrium, population genetics, binding site identification, mutagenesis, genotyping or genome annotation, protein-protein interactions or protein-nucleic acid interactions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/24—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for machine learning, data mining or biostatistics, e.g. pattern finding, knowledge discovery, rule extraction, correlation, clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/10—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology
- G06F19/12—Bioinformatics, i.e. methods or systems for genetic or protein-related data processing in computational molecular biology for modelling or simulation in systems biology, e.g. probabilistic or dynamic models, gene-regulatory networks, protein interaction networks or metabolic networks
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01N—INVESTIGATING OR ANALYSING MATERIALS BY DETERMINING THEIR CHEMICAL OR PHYSICAL PROPERTIES
- G01N33/00—Investigating or analysing materials by specific methods not covered by the preceding groups
- G01N33/48—Investigating or analysing materials by specific methods not covered by the preceding groups biological material, e.g. blood, urine; Haemocytometers
- G01N33/50—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing
- G01N33/68—Chemical analysis of biological material, e.g. blood, urine; Testing involving biospecific ligand binding methods; Immunological testing involving proteins, peptides or amino acids
- G01N33/6803—General methods of protein analysis not limited to specific proteins or families of proteins
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/70—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds
- G06F19/706—Chemoinformatics, i.e. data processing methods or systems for the retrieval, analysis, visualisation, or storage of physicochemical or structural data of chemical compounds for drug design with the emphasis on a therapeutic agent, e.g. ligand-biological target interactions, pharmacophore generation
Similar Documents
Publication | Title |
---|---|
Smialowski et al. | Protein solubility: sequence based prediction and experimental verification |
Erickson et al. | Sourcing thermotolerant poly(ethylene terephthalate) hydrolase scaffolds from natural diversity |
Smialowski et al. | PROSO II–a new method for protein solubility prediction |
Habibi et al. | A review of machine learning methods to predict the solubility of overexpressed recombinant proteins in Escherichia coli |
Magnan et al. | SOLpro: accurate sequence-based prediction of protein solubility |
Idicula-Thomas et al. | A support vector machine-based method for predicting the propensity of a protein to be soluble or to form inclusion body on overexpression in Escherichia coli |
Lee et al. | Predicting protein function from sequence and structure |
Sammut et al. | Pfam 10 years on: 10 000 families and still growing |
Bressin et al. | TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs |
Gardy et al. | Methods for predicting bacterial protein subcellular localization |
Disfani et al. | MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins |
Radivojac et al. | Intrinsic disorder and functional proteomics |
Mizianty et al. | Sequence-based prediction of protein crystallization, purification and production propensity |
Hirose et al. | ESPRESSO: a system for estimating protein expression and solubility in protein expression systems |
Brylinski et al. | eFindSite: Improved prediction of ligand binding sites in protein models using meta-threading, machine learning and auxiliary ligands |
Yang et al. | High-accuracy prediction of transmembrane inter-helix contacts and application to GPCR 3D structure modeling |
Hu et al. | A new supervised over-sampling algorithm with application to protein-nucleotide binding residue prediction |
Chen et al. | ProAcePred: prokaryote lysine acetylation sites prediction based on elastic net feature optimization |
Dosztányi et al. | Prediction of protein disorder |
Patino-Lopez et al. | Myosin 1G is an abundant class I myosin in lymphocytes whose localization at the plasma membrane depends on its ancient divergent pleckstrin homology (PH) domain (Myo1PH) |
Li et al. | An efficient support vector machine approach for identifying protein S-nitrosylation sites |
Waight et al. | A machine learning strategy for the identification of key in silico descriptors and prediction models for IgG monoclonal antibody developability properties |
Xiao et al. | iMem-Seq: a multi-label learning classifier for predicting membrane proteins types |
Babnigg et al. | Predicting protein crystallization propensity from protein sequence |
Chan et al. | Learning to predict expression efficacy of vectors in recombinant protein production |