Skip to main content

Table 2 Data sources and statistics collected in CSgator

From: CSgator: an integrated web platform for compound set analysis

  Number of entries Number of compounds Sources Number of relations Standard ID
Compound database
Compound 89,602,599 PubChem, ChEMBL, ChEBI, DrugBank InChIKey
Compound-target & disease & bioassay
Target 252,498 852,375 15 Public DBs 6,027,120 Entrez Gene ID & UniProtKB
Disease 5680 10,975 CTD 1,575,457 MeSH & OMIM
Bioassay 1,218,658 2,253,835 PubChem, ChEMBL 229,842,265 PubChem AID & ChEMBL
Classification
Protein family 575 833,590 ChEMBL 21 1,691,879 ChEMBL protein class
GO term 19,234 851,359 Gene Ontology 68,331,986 GO term
Disease ontology 1824 5429 Disease Ontology 46,053 DO term
MeSH disease 6351 6909 NIH 143,277 MeSH
Approval status 9 3765 DrugBank
ChEMBL
NCGC
12,820 InChIKey