Articles
378 result(s) for 'PubChem' within Journal of Cheminformatics
Page 1 of 8
-
Citation: Journal of Cheminformatics 2013 5(Suppl 1):P38
-
Classification of CYP450 1A2 inhibitors using PubChem data
Citation: Journal of Cheminformatics 2010 2(Suppl 1):P40 -
Taking the PubChem web sketcher to the next level
Citation: Journal of Cheminformatics 2013 5(Suppl 1):P20 -
Literature information in PubChem: associations between PubChem records and scientific articles
PubChem is an open archive consisting of a ... . Currently (as of Nov. 2015), PubChem contains more than 150 million depositor-provided...
Citation: Journal of Cheminformatics 2016 8:32 -
The PubChem chemical structure sketcher
PubChem is an important public, Web-based information ... JavaScript functions, and image sequence streaming. The PubChem structure editor does not require the presence...
Citation: Journal of Cheminformatics 2009 1:20 -
PubChemRDF: towards the semantic annotation of PubChem compound and substance databases
PubChem is an open repository for chemical structures, ... to distribute and integrate scientific data. Exposing PubChem data to Semantic Web services may help...
Citation: Journal of Cheminformatics 2015 7:34 -
PubChem chemical structure standardization
PubChem is a chemical information repository, consisting of ... structure standardization. The present study describes the PubChem standardization approaches and analyzes them for their ... structures during the ...
Citation: Journal of Cheminformatics 2018 10:36 -
Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets
PubChem is a public repository for biological activities ... of its vast amount of chemical information, PubChem performs 2-dimensional (2-D) and ... precompute “neighbor” relationships between molecules in the PubChem
Citation: Journal of Cheminformatics 2016 8:62 -
PubChem atom environments
For our analysis, after hydrogen suppression, atoms were characterized by atomic number, formal charge, implicit hydrogen count, explicit degree (number of neighbors), valence (bond order sum), and aromaticity. B...
Citation: Journal of Cheminformatics 2015 7:41 -
PUG-View: programmatic access to chemical annotations integrated in PubChem
PubChem is a chemical data repository that provides ... with new opportunities for data-intensive research. PubChem provides several programmatic access routes. One of ... interface specialized for accessing anno...
Citation: Journal of Cheminformatics 2019 11:56 -
Cheminformatics analysis of the AR agonist and antagonist datasets in PubChem
...As one of the largest publicly accessible databases for hosting chemical structures and biological activities, PubChem has been processing bioassay submissions from the ... increase in volume for the deposited...
Citation: Journal of Cheminformatics 2016 8:37 -
Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization
In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound databas...
Citation: Journal of Cheminformatics 2016 8:74 -
PubChem structure–activity relationship (SAR) clusters
Research discussed in the present paper employed a bioactivity-centered clustering approach to group 843,845 non-inactive compounds stored in PubChem according to both structural similarity and bioactivity ... wi...
Citation: Journal of Cheminformatics 2015 7:33 -
PubChem3D: a new resource for scientists
PubChem is an open repository for small molecules and their experimental biological activity. PubChem integrates and provides search, retrieval, visualization, ... with similar biological efficacies against targe...
Citation: Journal of Cheminformatics 2011 3:32 -
Quantitative assessment of the expanding complementarity between public and commercial databases of bioactive compounds
Where they could be calculated, extracted compounds-per-journal article were in the range of 12 to 19 but compound-per-protein counts increased with document numbers. Chemical structure filtration to facilitate s...
Citation: Journal of Cheminformatics 2009 1:10 -
IDSM ChemWebRDF: SPARQLing small-molecule datasets
The Resource Description Framework (RDF), together with well-defined ontologies, significantly increases data interoperability and usability. The SPARQL query language was introduced to retrieve requested RDF dat...
Citation: Journal of Cheminformatics 2021 13:38 -
PubChem3D: Biologically relevant 3-D similarity
The use of 3-D similarity techniques in the analysis of biological data and virtual screening is pervasive, but what is a biologically meaningful 3-D similarity value? Can one find statistically significant separ...
Citation: Journal of Cheminformatics 2011 3:26 -
PubChem3D: Similar conformers
PubChem is a free and open public resource ... both chemical structures and biological test results, PubChem is a sizeable system with an uneven ... of available information. Some chemical structures in PubChem i...
Citation: Journal of Cheminformatics 2011 3:13 -
Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag
Compound (or chemical) databases are an invaluable resource for many scientific disciplines. Exposomics researchers need to find and identify relevant chemicals that cover the entirety of potential (chemical and ...
Citation: Journal of Cheminformatics 2021 13:19 -
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis
To improve the utility of PubChem, a public repository containing biological activities of ... to the small-molecule records contained in the PubChem Compound database and provides various search and...
Citation: Journal of Cheminformatics 2012 4:28 -
canSAR chemistry registration and standardization pipeline
Integration of medicinal chemistry data from numerous public resources is an increasingly important part of academic drug discovery and translational research because it can bring a wealth of important knowledge ...
Citation: Journal of Cheminformatics 2022 14:28 -
Extracting and connecting chemical structures from text sources using chemicalize.org
Exploring bioactive chemistry requires navigating between structures and data from a variety of text-based sources. While PubChem currently includes approximately 16 million document-extracted...
Citation: Journal of Cheminformatics 2013 5:20 -
PubChem3D: conformer ensemble accuracy
PubChem is a free and publicly available resource ... activity information. PubChem3D is an extension to PubChem containing computationally-derived three-dimensional (3-D...
Citation: Journal of Cheminformatics 2013 5:1 -
Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach
Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual scree...
Citation: Journal of Cheminformatics 2021 13:27 -
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles
PubChem provides a 3-D neighboring relationship, which ... be a 3-D neighbor. As such, PubChem employs a series of pre-filters, based ... leads one to wonder: can the existing PubChem 3-D neighboring relationship...
Citation: Journal of Cheminformatics 2011 3:25 -
MetFrag relaunched: incorporating strategies beyond in silico fragmentation
MetFrag has gone through algorithmic and scoring refinements. New features include the retrieval of reference, data source and patent information via ChemSpider and PubChem web services, as well as InChIKey filte...
Citation: Journal of Cheminformatics 2016 8:3 -
Consistency of systematic chemical identifiers within and between small-molecule databases
Correctness of structures and associated metadata within public and commercial chemical databases greatly impacts drug discovery research activities such as quantitative structure–property relationships modell...
Citation: Journal of Cheminformatics 2012 4:35 -
InChI in the wild: an assessment of InChIKey searching in Google
While chemical databases can be queried using the InChI string and InChIKey (IK) the latter was designed for open-web searching. It is becoming increasingly effective for this since more sources enhance crawli...
Citation: Journal of Cheminformatics 2013 5:10 -
Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2
An important step in the reconstruction of a metabolic network is annotation of metabolites. Metabolites are generally annotated with various database or structure based identifiers. Metabolite annotations in ...
Citation: Journal of Cheminformatics 2014 6:2 -
FP-ADMET: a compendium of fingerprint-based ADMET prediction models
The absorption, distribution, metabolism, excretion, and toxicity (ADMET) of drugs plays a key role in determining which among the potential candidates are to be prioritized. In silico approaches based on mach...
Citation: Journal of Cheminformatics 2021 13:75 -
InChI version 1.06: now more than 99.99% reliable
The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integr...
Citation: Journal of Cheminformatics 2021 13:40 -
WENDI: A tool for finding non-obvious relationships between compounds and biological properties, genes, diseases and scholarly publications
In recent years, there has been a huge increase in the amount of publicly-available and proprietary information pertinent to drug discovery. However, there is a distinct lack of data mining tools available to ...
Citation: Journal of Cheminformatics 2010 2:6 -
LeadMine: a grammar and dictionary driven approach to entity recognition
Our system uses a mixture of expertly curated grammars and dictionaries, as well as dictionaries automatically derived from public resources. We show that the heuristics developed to filter our dictionary of triv...
Citation: Journal of Cheminformatics 2015 7(Suppl 1):S5 -
HD_BPMDS: a curated binary pattern multitarget dataset of Huntington’s disease–targeting agents
The discovery of both distinctive lead molecules and novel drug targets is a great challenge in drug discovery, which particularly accounts for orphan diseases. Huntington’s disease (HD) is an orphan, neurodegene...
Citation: Journal of Cheminformatics 2023 15:109 -
Towards a Universal SMILES representation - A standard method to generate canonical SMILES based on the InChI
I describe how to use the InChI canonicalisation to derive a canonical SMILES string in a straightforward way, either incorporating the InChI normalisations (Inchified SMILES) or not (Universal SMILES). This is t...
Citation: Journal of Cheminformatics 2012 4:22 -
Expanding the fragrance chemical space for virtual screening
The properties of fragrance molecules in the public databases SuperScent and Flavornet were analyzed to define a “fragrance-like” (FL) property range (Heavy Atom Count ≤ 21, only C, H, O, S, (O + S) ≤ 3, Hydrogen...
Citation: Journal of Cheminformatics 2014 6:27 -
PubChem3D: Diversity of shape
The shape diversity of 16.4 million biologically relevant molecules from the PubChem Compound database and their 1.46 billion...
Citation: Journal of Cheminformatics 2011 3:9 -
The CompTox Chemistry Dashboard: a community data resource for environmental chemistry
Despite an abundance of online databases providing access to chemical data, there is increasing demand for high-quality, structure-curated, open data to meet the various needs of the environmental sciences and co...
Citation: Journal of Cheminformatics 2017 9:61 -
ExCAPE-DB: an integrated large scale dataset facilitating Big Data analysis in chemogenomics
Chemogenomics data generally refers to the activity data of chemical compounds on an array of protein targets and represents an important source of information for building in silico...target prediction models. T...
Citation: Journal of Cheminformatics 2017 9:17 -
Fast rule-based bioactivity prediction using associative classification mining
Relating chemical features to bioactivities is critical in molecular design and is used extensively in the lead discovery and optimization process. A variety of techniques from statistics, data mining and mach...
Citation: Journal of Cheminformatics 2012 4:29 -
A ligand-based computational drug repurposing pipeline using KNIME and Programmatic Data Access: case studies for rare diseases and COVID-19
Biomedical information mining is increasingly recognized as a promising technique to accelerate drug discovery and development. Especially, integrative approaches which mine data from several (open) data sourc...
Citation: Journal of Cheminformatics 2020 12:71 -
Machine learning for identification of silylated derivatives from mass spectra
Compound structure identification is using increasingly more sophisticated computational tools, among which machine learning tools are a recent addition that quickly gains in importance. These tools, of which ...
Citation: Journal of Cheminformatics 2022 14:62 -
A workflow for deriving chemical entities from crystallographic data and its application to the Crystallography Open Database
Knowledge about the 3-dimensional structure, orientation and interaction of chemical compounds is important in many areas of science and technology. X-ray crystallography is one of the experimental techniques ...
Citation: Journal of Cheminformatics 2023 15:123 -
An open source chemical structure curation pipeline using RDKit
The ChEMBL database is one of a number of public databases that contain bioactivity data on small molecule compounds curated from diverse sources. Incoming compounds are typically not standardised according to...
Citation: Journal of Cheminformatics 2020 12:51 -
The chemfp project
The chemfp project has had four main goals: (1) promote the FPS format as a text-based exchange format for dense binary cheminformatics fingerprints, (2) develop a high-performance implementation of the BitBound ...
Citation: Journal of Cheminformatics 2019 11:76 -
Combining structural and bioactivity-based fingerprints improves prediction performance and scaffold hopping capability
This study aims at improving upon existing activity predictions methods by augmenting chemical structure fingerprints with bio-activity based fingerprints derived from high-throughput screening (HTS) data (HTSFPs...
Citation: Journal of Cheminformatics 2019 11:54 -
Ambiguity of non-systematic chemical identifiers within and between small-molecule databases
A wide range of chemical compound databases are currently available for pharmaceutical research. To retrieve compound information, including structures, researchers can query these chemical databases using non...
Citation: Journal of Cheminformatics 2015 7:54 -
rBAN: retro-biosynthetic analysis of nonribosomal peptides
Proteinogenic and non-proteinogenic amino acids, fatty acids or glycans are some of the main building blocks of nonribsosomal peptides (NRPs) and as such may give insight into the origin, biosynthesis and bioacti...
Citation: Journal of Cheminformatics 2019 11:13 -
Target prediction utilising negative bioactivity data covering large chemical space
In silico analyses are increasingly being used to support mode-of-action investigations; however many such approaches do not utilise the large amounts of inactive data held in chemogenomic repositories. The objec...
Citation: Journal of Cheminformatics 2015 7:51 -
MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics
Here we present Metabolic In silico Network Expansions (MINEs), an extension of known metabolite databases to include molecules that have not been observed, but are likely to occur based on known metabolites and ...
Citation: Journal of Cheminformatics 2015 7:44