Articles

378 result(s) for 'PubChem' within Journal of Cheminformatics

Page 1 of 8

PubChem: atom environments for molecule standardization

Authors: Volker Hähnke, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2013 5(Suppl 1):P38

Content type: Poster presentation Published on: 22 March 2013

This article is part of a Supplement: Volume 5 Supplement 1
- View Full Text
- View PDF
Classification of CYP450 1A2 inhibitors using PubChem data

Authors: Sergii Novotarskyi, Iurii Sushko, R Körner, AK Pandey and Igor Tetko

Citation: Journal of Cheminformatics 2010 2(Suppl 1):P40

Content type: Poster presentation Published on: 4 May 2010

This article is part of a Supplement: Volume 2 Supplement 1
- View Full Text
- View PDF
Taking the PubChem web sketcher to the next level

Authors: Wolf D Ihlenfeldt

Citation: Journal of Cheminformatics 2013 5(Suppl 1):P20

Content type: Poster presentation Published on: 22 March 2013

This article is part of a Supplement: Volume 5 Supplement 1
- View Full Text
- View PDF
Literature information in PubChem: associations between PubChem records and scientific articles

PubChem is an open archive consisting of a ... . Currently (as of Nov. 2015), PubChem contains more than 150 million depositor-provided...

Authors: Sunghwan Kim, Paul A. Thiessen, Tiejun Cheng, Bo Yu, Benjamin A. Shoemaker, Jiyao Wang, Evan E. Bolton, Yanli Wang and Stephen H. Bryant

Citation: Journal of Cheminformatics 2016 8:32

Content type: Database Published on: 10 June 2016
- View Full Text
- View PDF
The PubChem chemical structure sketcher

PubChem is an important public, Web-based information ... JavaScript functions, and image sequence streaming. The PubChem structure editor does not require the presence...

Authors: Wolf D Ihlenfeldt, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2009 1:20

Content type: Software Published on: 17 December 2009
- View Full Text
- View PDF
PubChemRDF: towards the semantic annotation of PubChem compound and substance databases

PubChem is an open repository for chemical structures, ... to distribute and integrate scientific data. Exposing PubChem data to Semantic Web services may help...

Authors: Gang Fu, Colin Batchelor, Michel Dumontier, Janna Hastings, Egon Willighagen and Evan Bolton

Citation: Journal of Cheminformatics 2015 7:34

Content type: Database Published on: 14 July 2015
- View Full Text
- View PDF
PubChem chemical structure standardization

PubChem is a chemical information repository, consisting of ... structure standardization. The present study describes the PubChem standardization approaches and analyzes them for their ... structures during the ...

Authors: Volker D. Hähnke, Sunghwan Kim and Evan E. Bolton

Citation: Journal of Cheminformatics 2018 10:36

Content type: Research article Published on: 10 August 2018
- View Full Text
- View PDF
Similar compounds versus similar conformers: complementarity between PubChem 2-D and 3-D neighboring sets

PubChem is a public repository for biological activities ... of its vast amount of chemical information, PubChem performs 2-dimensional (2-D) and ... precompute “neighbor” relationships between molecules in the PubChem

Authors: Sunghwan Kim, Evan E. Bolton and Stephen H. Bryant

Citation: Journal of Cheminformatics 2016 8:62

Content type: Research article Published on: 4 November 2016
- View Full Text
- View PDF
PubChem atom environments

For our analysis, after hydrogen suppression, atoms were characterized by atomic number, formal charge, implicit hydrogen count, explicit degree (number of neighbors), valence (bond order sum), and aromaticity. B...

Authors: Volker D Hähnke, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2015 7:41

Content type: Research article Published on: 19 August 2015
- View Full Text
- View PDF
PUG-View: programmatic access to chemical annotations integrated in PubChem

PubChem is a chemical data repository that provides ... with new opportunities for data-intensive research. PubChem provides several programmatic access routes. One of ... interface specialized for accessing anno...

Authors: Sunghwan Kim, Paul A. Thiessen, Tiejun Cheng, Jian Zhang, Asta Gindulyte and Evan E. Bolton

Citation: Journal of Cheminformatics 2019 11:56

Content type: Database Published on: 9 August 2019
- View Full Text
- View PDF
Cheminformatics analysis of the AR agonist and antagonist datasets in PubChem

...As one of the largest publicly accessible databases for hosting chemical structures and biological activities, PubChem has been processing bioassay submissions from the ... increase in volume for the deposited...

Authors: Ming Hao, Stephen H. Bryant and Yanli Wang

Citation: Journal of Cheminformatics 2016 8:37

Content type: Research article Published on: 8 July 2016
- View Full Text
- View PDF
Scaffold analysis of PubChem database as background for hierarchical scaffold-based visualization

In this paper, we propose a scaffold hierarchy as a result of large-scale analysis of the PubChem Compound database. The analysis not only provided insights into scaffold diversity of the PubChem Compound databas...

Authors: Jakub Velkoborsky and David Hoksza

Citation: Journal of Cheminformatics 2016 8:74

Content type: Methodology Published on: 29 December 2016
- View Full Text
- View PDF
PubChem structure–activity relationship (SAR) clusters

Research discussed in the present paper employed a bioactivity-centered clustering approach to group 843,845 non-inactive compounds stored in PubChem according to both structural similarity and bioactivity ... wi...

Authors: Sunghwan Kim, Lianyi Han, Bo Yu, Volker D Hähnke, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2015 7:33

Content type: Research article Published on: 7 July 2015
- View Full Text
- View PDF
PubChem3D: a new resource for scientists

PubChem is an open repository for small molecules and their experimental biological activity. PubChem integrates and provides search, retrieval, visualization, ... with similar biological efficacies against targe...

Authors: Evan E Bolton, Jie Chen, Sunghwan Kim, Lianyi Han, Siqian He, Wenyao Shi, Vahan Simonyan, Yan Sun, Paul A Thiessen, Jiyao Wang, Bo Yu, Jian Zhang and Stephen H Bryant

Citation: Journal of Cheminformatics 2011 3:32

Content type: Software Published on: 20 September 2011
- View Full Text
- View PDF
Quantitative assessment of the expanding complementarity between public and commercial databases of bioactive compounds

Where they could be calculated, extracted compounds-per-journal article were in the range of 12 to 19 but compound-per-protein counts increased with document numbers. Chemical structure filtration to facilitate s...

Authors: Christopher Southan, Péter Várkonyi and Sorel Muresan

Citation: Journal of Cheminformatics 2009 1:10

Content type: Research article Published on: 6 July 2009
- View Full Text
- View PDF
IDSM ChemWebRDF: SPARQLing small-molecule datasets

The Resource Description Framework (RDF), together with well-defined ontologies, significantly increases data interoperability and usability. The SPARQL query language was introduced to retrieve requested RDF dat...

Authors: Jakub Galgonek and Jiří Vondrášek

Citation: Journal of Cheminformatics 2021 13:38

Content type: Software Published on: 12 May 2021
- View Full Text
- View PDF
PubChem3D: Biologically relevant 3-D similarity

The use of 3-D similarity techniques in the analysis of biological data and virtual screening is pervasive, but what is a biologically meaningful 3-D similarity value? Can one find statistically significant separ...

Authors: Sunghwan Kim, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2011 3:26

Content type: Research article Published on: 22 July 2011
- View Full Text
- View PDF
PubChem3D: Similar conformers

PubChem is a free and open public resource ... both chemical structures and biological test results, PubChem is a sizeable system with an uneven ... of available information. Some chemical structures in PubChem i...

Authors: Evan E Bolton, Sunghwan Kim and Stephen H Bryant

Citation: Journal of Cheminformatics 2011 3:13

Content type: Research article Published on: 9 May 2011
- View Full Text
- View PDF
Empowering large chemical knowledge bases for exposomics: PubChemLite meets MetFrag

Compound (or chemical) databases are an invaluable resource for many scientific disciplines. Exposomics researchers need to find and identify relevant chemicals that cover the entirety of potential (chemical and ...

Authors: Emma L. Schymanski, Todor Kondić, Steffen Neumann, Paul A. Thiessen, Jian Zhang and Evan E. Bolton

Citation: Journal of Cheminformatics 2021 13:19

Content type: Research article Published on: 8 March 2021
- View Full Text
- View PDF
Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis

To improve the utility of PubChem, a public repository containing biological activities of ... to the small-molecule records contained in the PubChem Compound database and provides various search and...

Authors: Sunghwan Kim, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2012 4:28

Content type: Research article Published on: 7 November 2012
- View Full Text
- View PDF
canSAR chemistry registration and standardization pipeline

Integration of medicinal chemistry data from numerous public resources is an increasingly important part of academic drug discovery and translational research because it can bring a wealth of important knowledge ...

Authors: Daniela Dolciami, Eloy Villasclaras-Fernandez, Christos Kannas, Mirco Meniconi, Bissan Al-Lazikani and Albert A. Antolin

Citation: Journal of Cheminformatics 2022 14:28

Content type: Methodology Published on: 28 May 2022
- View Full Text
- View PDF
Extracting and connecting chemical structures from text sources using chemicalize.org

Exploring bioactive chemistry requires navigating between structures and data from a variety of text-based sources. While PubChem currently includes approximately 16 million document-extracted...

Authors: Christopher Southan and Andras Stracz

Citation: Journal of Cheminformatics 2013 5:20

Content type: Software Published on: 23 April 2013
- View Full Text
- View PDF
PubChem3D: conformer ensemble accuracy

PubChem is a free and publicly available resource ... activity information. PubChem3D is an extension to PubChem containing computationally-derived three-dimensional (3-D...

Authors: Sunghwan Kim, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2013 5:1

Content type: Research article Published on: 7 January 2013
- View Full Text
- View PDF
Analysis of the effects of related fingerprints on molecular similarity using an eigenvalue entropy approach

Two-dimensional (2D) chemical fingerprints are widely used as binary features for the quantification of structural similarity of chemical compounds, which is an important step in similarity-based virtual scree...

Authors: Hiroyuki Kuwahara and Xin Gao

Citation: Journal of Cheminformatics 2021 13:27

Content type: Research article Published on: 23 March 2021
- View Full Text
- View PDF
PubChem3D: Shape compatibility filtering using molecular shape quadrupoles

PubChem provides a 3-D neighboring relationship, which ... be a 3-D neighbor. As such, PubChem employs a series of pre-filters, based ... leads one to wonder: can the existing PubChem 3-D neighboring relationship...

Authors: Sunghwan Kim, Evan E Bolton and Stephen H Bryant

Citation: Journal of Cheminformatics 2011 3:25

Content type: Research article Published on: 20 July 2011
- View Full Text
- View PDF
MetFrag relaunched: incorporating strategies beyond in silico fragmentation

MetFrag has gone through algorithmic and scoring refinements. New features include the retrieval of reference, data source and patent information via ChemSpider and PubChem web services, as well as InChIKey filte...

Authors: Christoph Ruttkies, Emma L. Schymanski, Sebastian Wolf, Juliane Hollender and Steffen Neumann

Citation: Journal of Cheminformatics 2016 8:3

Content type: Software Published on: 29 January 2016
- View Full Text
- View PDF
Consistency of systematic chemical identifiers within and between small-molecule databases

Correctness of structures and associated metadata within public and commercial chemical databases greatly impacts drug discovery research activities such as quantitative structure–property relationships modell...

Authors: Saber A Akhondi, Jan A Kors and Sorel Muresan

Citation: Journal of Cheminformatics 2012 4:35

Content type: Research article Published on: 13 December 2012
- View Full Text
- View PDF
InChI in the wild: an assessment of InChIKey searching in Google

While chemical databases can be queried using the InChI string and InChIKey (IK) the latter was designed for open-web searching. It is becoming increasingly effective for this since more sources enhance crawli...

Authors: Christopher Southan

Citation: Journal of Cheminformatics 2013 5:10

Content type: Review Published on: 11 February 2013
- View Full Text
- View PDF
Comparative evaluation of open source software for mapping between metabolite identifiers in metabolic network reconstructions: application to Recon 2

An important step in the reconstruction of a metabolic network is annotation of metabolites. Metabolites are generally annotated with various database or structure based identifiers. Metabolite annotations in ...

Authors: Hulda S Haraldsdóttir, Ines Thiele and Ronan MT Fleming

Citation: Journal of Cheminformatics 2014 6:2

Content type: Research article Published on: 27 January 2014
- View Full Text
- View PDF
FP-ADMET: a compendium of fingerprint-based ADMET prediction models

The absorption, distribution, metabolism, excretion, and toxicity (ADMET) of drugs plays a key role in determining which among the potential candidates are to be prioritized. In silico approaches based on mach...

Authors: Vishwesh Venkatraman

Citation: Journal of Cheminformatics 2021 13:75

Content type: Research article Published on: 28 September 2021
- View Full Text
- View PDF
InChI version 1.06: now more than 99.99% reliable

The software for the IUPAC Chemical Identifier, InChI, is extraordinarily reliable. It has been tested on large databases around the world, and has proved itself to be an essential tool in the handling and integr...

Authors: Jonathan M. Goodman, Igor Pletnev, Paul Thiessen, Evan Bolton and Stephen R. Heller

Citation: Journal of Cheminformatics 2021 13:40

Content type: Research article Published on: 24 May 2021
- View Full Text
- View PDF
WENDI: A tool for finding non-obvious relationships between compounds and biological properties, genes, diseases and scholarly publications

In recent years, there has been a huge increase in the amount of publicly-available and proprietary information pertinent to drug discovery. However, there is a distinct lack of data mining tools available to ...

Authors: Qian Zhu, Michael S Lajiness, Ying Ding and David J Wild

Citation: Journal of Cheminformatics 2010 2:6

Content type: Software Published on: 20 August 2010
- View Full Text
- View PDF
LeadMine: a grammar and dictionary driven approach to entity recognition

Our system uses a mixture of expertly curated grammars and dictionaries, as well as dictionaries automatically derived from public resources. We show that the heuristics developed to filter our dictionary of triv...

Authors: Daniel M Lowe and Roger A Sayle

Citation: Journal of Cheminformatics 2015 7(Suppl 1):S5

Content type: Research Published on: 19 January 2015

This article is part of a Supplement: Volume 7 Supplement 1
- View Full Text
- View PDF
HD_BPMDS: a curated binary pattern multitarget dataset of Huntington’s disease–targeting agents

The discovery of both distinctive lead molecules and novel drug targets is a great challenge in drug discovery, which particularly accounts for orphan diseases. Huntington’s disease (HD) is an orphan, neurodegene...

Authors: Sven Marcel Stefan, Jens Pahnke and Vigneshwaran Namasivayam

Citation: Journal of Cheminformatics 2023 15:109

Content type: Data Note Published on: 17 November 2023
- View Full Text
- View PDF
Towards a Universal SMILES representation - A standard method to generate canonical SMILES based on the InChI

I describe how to use the InChI canonicalisation to derive a canonical SMILES string in a straightforward way, either incorporating the InChI normalisations (Inchified SMILES) or not (Universal SMILES). This is t...

Authors: Noel M O’Boyle

Citation: Journal of Cheminformatics 2012 4:22

Content type: Research article Published on: 18 September 2012
- View Full Text
- View PDF
Expanding the fragrance chemical space for virtual screening

The properties of fragrance molecules in the public databases SuperScent and Flavornet were analyzed to define a “fragrance-like” (FL) property range (Heavy Atom Count ≤ 21, only C, H, O, S, (O + S) ≤ 3, Hydrogen...

Authors: Lars Ruddigkeit, Mahendra Awale and Jean-Louis Reymond

Citation: Journal of Cheminformatics 2014 6:27

Content type: Research article Published on: 22 May 2014
- View Full Text
- View PDF
PubChem3D: Diversity of shape

The shape diversity of 16.4 million biologically relevant molecules from the PubChem Compound database and their 1.46 billion...

Authors: Evan E Bolton, Sunghwan Kim and Stephen H Bryant

Citation: Journal of Cheminformatics 2011 3:9

Content type: Research article Published on: 21 March 2011
- View Full Text
- View PDF
The CompTox Chemistry Dashboard: a community data resource for environmental chemistry

Despite an abundance of online databases providing access to chemical data, there is increasing demand for high-quality, structure-curated, open data to meet the various needs of the environmental sciences and co...

Authors: Antony J. Williams, Christopher M. Grulke, Jeff Edwards, Andrew D. McEachran, Kamel Mansouri, Nancy C. Baker, Grace Patlewicz, Imran Shah, John F. Wambaugh, Richard S. Judson and Ann M. Richard

Citation: Journal of Cheminformatics 2017 9:61

Content type: Database Published on: 28 November 2017
- View Full Text
- View PDF
ExCAPE-DB: an integrated large scale dataset facilitating Big Data analysis in chemogenomics

Chemogenomics data generally refers to the activity data of chemical compounds on an array of protein targets and represents an important source of information for building in silico...target prediction models. T...

Authors: Jiangming Sun, Nina Jeliazkova, Vladimir Chupakhin, Jose-Felipe Golib-Dzib, Ola Engkvist, Lars Carlsson, Jörg Wegner, Hugo Ceulemans, Ivan Georgiev, Vedrin Jeliazkov, Nikolay Kochev, Thomas J. Ashby and Hongming Chen

Citation: Journal of Cheminformatics 2017 9:17

Content type: Database Published on: 7 March 2017

The Erratum to this article has been published in Journal of Cheminformatics 2017 9:41
- View Full Text
- View PDF
Fast rule-based bioactivity prediction using associative classification mining

Relating chemical features to bioactivities is critical in molecular design and is used extensively in the lead discovery and optimization process. A variety of techniques from statistics, data mining and mach...

Authors: Pulan Yu and David J Wild

Citation: Journal of Cheminformatics 2012 4:29

Content type: Methodology Published on: 23 November 2012
- View Full Text
- View PDF
A ligand-based computational drug repurposing pipeline using KNIME and Programmatic Data Access: case studies for rare diseases and COVID-19

Biomedical information mining is increasingly recognized as a promising technique to accelerate drug discovery and development. Especially, integrative approaches which mine data from several (open) data sourc...

Authors: Alzbeta Tuerkova and Barbara Zdrazil

Citation: Journal of Cheminformatics 2020 12:71

Content type: Educational Published on: 25 November 2020
- View Full Text
- View PDF
Machine learning for identification of silylated derivatives from mass spectra

Compound structure identification is using increasingly more sophisticated computational tools, among which machine learning tools are a recent addition that quickly gains in importance. These tools, of which ...

Authors: Milka Ljoncheva, Tomaž Stepišnik, Tina Kosjek and Sašo Džeroski

Citation: Journal of Cheminformatics 2022 14:62

Content type: Research Published on: 15 September 2022
- View Full Text
- View PDF
A workflow for deriving chemical entities from crystallographic data and its application to the Crystallography Open Database

Knowledge about the 3-dimensional structure, orientation and interaction of chemical compounds is important in many areas of science and technology. X-ray crystallography is one of the experimental techniques ...

Authors: Antanas Vaitkus, Andrius Merkys, Thomas Sander, Miguel Quirós, Paul A. Thiessen, Evan E. Bolton and Saulius Gražulis

Citation: Journal of Cheminformatics 2023 15:123

Content type: Research Published on: 19 December 2023
- View Full Text
- View PDF
An open source chemical structure curation pipeline using RDKit

The ChEMBL database is one of a number of public databases that contain bioactivity data on small molecule compounds curated from diverse sources. Incoming compounds are typically not standardised according to...

Authors: A. Patrícia Bento, Anne Hersey, Eloy Félix, Greg Landrum, Anna Gaulton, Francis Atkinson, Louisa J. Bellis, Marleen De Veij and Andrew R. Leach

Citation: Journal of Cheminformatics 2020 12:51

Content type: Methodology Published on: 1 September 2020
- View Full Text
- View PDF
The chemfp project

The chemfp project has had four main goals: (1) promote the FPS format as a text-based exchange format for dense binary cheminformatics fingerprints, (2) develop a high-performance implementation of the BitBound ...

Authors: Andrew Dalke

Citation: Journal of Cheminformatics 2019 11:76

Content type: Methodology Published on: 5 December 2019

The Correction to this article has been published in Journal of Cheminformatics 2020 12:59
- View Full Text
- View PDF
Combining structural and bioactivity-based fingerprints improves prediction performance and scaffold hopping capability

This study aims at improving upon existing activity predictions methods by augmenting chemical structure fingerprints with bio-activity based fingerprints derived from high-throughput screening (HTS) data (HTSFPs...

Authors: Oliver Laufkötter, Noé Sturm, Jürgen Bajorath, Hongming Chen and Ola Engkvist

Citation: Journal of Cheminformatics 2019 11:54

Content type: Research article Published on: 8 August 2019
- View Full Text
- View PDF
Ambiguity of non-systematic chemical identifiers within and between small-molecule databases

A wide range of chemical compound databases are currently available for pharmaceutical research. To retrieve compound information, including structures, researchers can query these chemical databases using non...

Authors: Saber A. Akhondi, Sorel Muresan, Antony J. Williams and Jan A. Kors

Citation: Journal of Cheminformatics 2015 7:54

Content type: Research article Published on: 16 November 2015
- View Full Text
- View PDF
rBAN: retro-biosynthetic analysis of nonribosomal peptides

Proteinogenic and non-proteinogenic amino acids, fatty acids or glycans are some of the main building blocks of nonribsosomal peptides (NRPs) and as such may give insight into the origin, biosynthesis and bioacti...

Authors: Emma Ricart, Valérie Leclère, Areski Flissi, Markus Mueller, Maude Pupin and Frédérique Lisacek

Citation: Journal of Cheminformatics 2019 11:13

Content type: Research article Published on: 8 February 2019
- View Full Text
- View PDF
Target prediction utilising negative bioactivity data covering large chemical space

In silico analyses are increasingly being used to support mode-of-action investigations; however many such approaches do not utilise the large amounts of inactive data held in chemogenomic repositories. The objec...

Authors: Lewis H. Mervin, Avid M. Afzal, Georgios Drakakis, Richard Lewis, Ola Engkvist and Andreas Bender

Citation: Journal of Cheminformatics 2015 7:51

Content type: Research article Published on: 24 October 2015
- View Full Text
- View PDF
MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics

Here we present Metabolic In silico Network Expansions (MINEs), an extension of known metabolite databases to include molecules that have not been observed, but are likely to occur based on known metabolites and ...

Authors: James G Jeffryes, Ricardo L Colastani, Mona Elbadawi-Sidhu, Tobias Kind, Thomas D Niehaus, Linda J Broadbelt, Andrew D Hanson, Oliver Fiehn, Keith E J Tyo and Christopher S Henry

Citation: Journal of Cheminformatics 2015 7:44

Content type: Database Published on: 28 August 2015
- View Full Text
- View PDF