ChemSpider
|
manual and automated data curation, crowd-sourcing community supported
|
claimed to be the richest source of structure-based chemistry
|
an aggregator of nearly 400 different data sources
|
includes data sources metadata, provenance metadata (e.g. owner and time-stamp of data creation, curation and update) and user meta-data
|
can be accessed by web GUI and web services via PC and mobile devices; query results can be downloaded as a set
|
publicly available, no multi-level user access supported
|
CEBS
|
manual data curation, collaboration between data de-positors and internal curation staffs
|
contains 132 chemicals and their response in 34 detailed studies
|
permits users to integrate various data types and studies, a database schema is well designed by the support of controlled vocabularies
|
includes domain-specific metadata (e.g. owner, study details) and provenance metadata (e.g. time-stamp of the study start, curation updates)
|
allows users to retrieve and combine customised information and export to various formats for downloading. Able to support up to 100 concurrent users
|
publicly available, but also provides private data access mode to protect sensitive user data
|
CTD
|
manual data curation, sup-ported by the scientific com-munity
|
includes 1.4 million chemical-gene-disease data connections and has been widely recognised
|
employs community-accepted vocabularies and ontologies to capture data and is integrated with external resources
|
includes domain-specific, provenance metadata and supporting literature sources are recorded
|
access to entire database (downloadable as a dump file) and individual data sources; query results can be customised and exported to different formats
|
publicly available, no multi-level user access provided
|
DSSTox
|
manual and automated data curation, quality assurance log files recorded; but no curation for external data
|
contains over 8000 chemicals and have been incorporated into several external sources
|
integrates molecular structures and toxicity data into standardised DSSTox SDF
|
includes domain-specific and provenance metadata
|
data sources and associated documents can be downloaded individually and included data is searchable via many options
|
publicly available, no user registration is required
|
ToxCast
|
manual data curation, includes internal and external review
|
covers various chemical classes and diverse mechanism of action, 320 chemicals have been collected in Phase I and 1000 more is currently being screened
|
well integrated into many other EPA databases
|
only domain-specific metadata is included, no recorded track of provenance metadata (e.g. curator and time-stamp)
|
data sources are available for download individually in the ToxCast website; included data can be browsed and queried via ToxCast DB web GUI
|
publicly available, no user registration is required
|
ACToR
|
does not provide any data curation itself, data quality totally depends on the original data sources
|
contains over 500,000 chemicals and associated toxicity data from nearly 500 sources
|
itself is a chemical toxicity data aggregator and employs a clear and flexible database schema for data integration
|
metadata of data source and domain-specific information is well recorded, no recorded track of provenance meta-data
|
open-source implementation and the entire database can be downloaded
|
publicly available, no user registration is required
|
OpenTox
|
allows for automated, man-ual and global data quality validations
|
contains different categories of public data sources which supporting predictive toxicology
|
employs ontology to support efficient integration of data coming from different sources into a unifying structure
|
metadata of data source and domain-specific information is well recorded, no recorded track of provenance meta-data
|
provides APIs and REST-ful web services for included data, algorithms, models, ontologies and reports
|
publicly available, but employs OpenSSO for initial implementation of multi-level user access
|