Skip to main content

Table 2 Data sources and statistics collected in CSgator

From: CSgator: an integrated web platform for compound set analysis

 

Number of entries

Number of compounds

Sources

Number of relations

Standard ID

Compound database

Compound

89,602,599

PubChem, ChEMBL, ChEBI, DrugBank

InChIKey

Compound-target & disease & bioassay

Target

252,498

852,375

15 Public DBs

6,027,120

Entrez Gene ID & UniProtKB

Disease

5680

10,975

CTD

1,575,457

MeSH & OMIM

Bioassay

1,218,658

2,253,835

PubChem, ChEMBL

229,842,265

PubChem AID & ChEMBL

Classification

Protein family

575

833,590

ChEMBL 21

1,691,879

ChEMBL protein class

GO term

19,234

851,359

Gene Ontology

68,331,986

GO term

Disease ontology

1824

5429

Disease Ontology

46,053

DO term

MeSH disease

6351

6909

NIH

143,277

MeSH

Approval status

9

3765

DrugBank

ChEMBL

NCGC

12,820

InChIKey