Skip to main content

From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials

Abstract

Adverse Outcome Pathways (AOPs) have been proposed to facilitate mechanistic understanding of interactions of chemicals/materials with biological systems. Each AOP starts with a molecular initiating event (MIE) and possibly ends with adverse outcome(s) (AOs) via a series of key events (KEs). So far, the interaction of engineered nanomaterials (ENMs) with biomolecules, biomembranes, cells, and biological structures, in general, is not yet fully elucidated. There is also a huge lack of information on which AOPs are ENMs-relevant or -specific, despite numerous published data on toxicological endpoints they trigger, such as oxidative stress and inflammation. We propose to integrate related data and knowledge recently collected. Our approach combines the annotation of nanomaterials and their MIEs with ontology annotation to demonstrate how we can then query AOPs and biological pathway information for these materials. We conclude that a FAIR (Findable, Accessible, Interoperable, Reusable) representation of the ENM-MIE knowledge simplifies integration with other knowledge.

Scientific contribution

This study introduces a new database linking nanomaterial stressors to the first known MIE or KE. Second, it presents a reproducible workflow to analyze and summarize this knowledge. Third, this work extends the use of semantic web technologies to the field of nanoinformatics and nanosafety.

Introduction

With engineered nanomaterials (ENMs) playing an increasingly important role in our lives, the need for adequate risk assessments (RA) for ENMs is urgent [1,2,3]. The broad diversity of ENMs and the wide range of their applications, from paints or coatings to medicine, food and cosmetics, introduce both opportunities and challenges. New applications and nano-enabled products are regularly introduced on the market, solving problems such as antimicrobial resistance [4,5,6,7], safer drug formulations and delivery [8,9,10] and cost-efficient sensor technologies [11, 12], to name a few. However, the diversity in ENM design and applications has been raising many concerns related to hazard and exposure assessment whenever detailed quality and safety data are lacking. The concern is even bigger due to a lack of integration of cumulative hazards and aggregate exposure assessment in the current RA framework that underpins existing regulation. Moreover, differences in legislated procedures for different ENMs classes (e.g., pharma-grade, industrial, advanced materials) [13] further complicate matters.

Currently, the main methodologies for RA of chemicals/materials mostly rely on in vivo (animal) testing. The limited in vivo toxicity data, coupled with the recent paradigm shift in toxicity testing towards the development and use of pathway-based testing strategies rather than traditional animal–based methods, triggered interest in the development of New Approach Methodologies (NAMs). These methodologies are based on in chemico, in vitro, in silico, and alternative species test methods. This reinforces the implementation of the 3R’s (Replacement, Reduction and Refinement of animals) concept [14]. Although NAMs are not envisioned as a direct replacement for in vivo testing, it offers an opportunity for a fit-for-purpose testing strategy in the context of an Integrated Approach to Testing and Assessment (IATA) framework to inform decision-making bodies. It is envisioned that the NAMs should strategically be accompanied by the Adverse Outcome Pathway (AOP) concept [15, 16], which has been formalized by the OECD in 2012 as the basis for organizing data and developing an IATA [17, 18].

An AOP starts with a molecular initiating event (MIE), usually followed by a series of sequential and causal key events (KEs) at cellular, tissue or organ levels of biological organisation and ends with adverse outcome(s) (AOs) defined as the disease or the pathological outcomes, which are targeted for regulatory decisions. These pathways allow systematic collection, organization, and simpler presentation of the information collected from in vitro, in vivo or in silico toxicological studies. As AOPs are not chemical-specific, they can be used in the identification of key toxicological data gaps and prioritization of research efforts and resources, thus supporting IATA [19]. When biological interaction and exposure for fully characterized ENM-relevant stressors are identified, existing AOPs may thus facilitate regulatory-relevant RA.

On a handful of toxicological endpoints triggered by ENMs, such as oxidative stress and inflammation, there is an abundance of published data. However, for the many other endpoints, information on the identification of which AOPs are relevant or specific to ENMs, and understanding the ENM-related molecular processes underlying the KEs is sparse or missing. This might be due to the lack of clear and centralized data and knowledge on the identification of ENMs, their characteristics and their interaction with biological systems. The management of these (toxicological) data needs to operate under FAIR (findability, accessibility, interoperability, and reusability) principles for effective RA and hazard assessment [20,21,22].

One way to achieve this would be to make the data more machine-readable. The use of Semantic Web Technologies (SWTs), such as the Web Ontology Language and Resource Description Framework (RDF), is a possible method. RDF uses ontologies to describe data, define properties and introduce relations between them (for example the characteristics of an ENM). The standardized vocabulary introduced by ontologies enables the communication between databases, in line with the FAIR principles, and makes the information machine-readable. RDF enriched with ontologies can be used to combine information from different resources and answer complex biological questions.

RDF is applied in various resources for the storage of data and information relevant for RA and hazard assessment. For example, the AOP-Wiki RDF stores mechanistic toxicological knowledge and provides additional molecular descriptors. Similarly, WikiPathways contains molecular pathways and defines all molecular entities involved [23], neXtProt provides protein-centered knowledge [24, 25], while eNanoMapper enables the storage, searching and sharing of toxicological data for ENMs [26].

Developing RDF for ENMs and their molecular and biological interactions as described in literature, would allow the exploration of available scientific and professional databases for potential risks of ENM exposure. For example, the AOP-Wiki can extrapolate downstream adverse effects from molecular interactions captured in MIE(s) and KE(s). Furthermore, by implementing a standard format, it will be possible to perform correlation analyses on physicochemical properties and biological effects of ENMs. The use of SWTs, including RDF, ontologies, and linked data, plays a pivotal role in addressing communication gaps, achieving semantic interoperability, and facilitating data integration. The SWTs-based approach not only integrates data across different sources and provides an interoperable format for secondary use but also demonstrates the power of ontological modeling in capturing complex relationships within the modeled domain, which would be challenging to achieve through traditional methods. SWTs have been used for a long time in diverse domains such as biomedical informatics and cheminformatics but rarely applied in the field of nanosafety or nanoinformatics.

Here, we present a proof of principle of implementing SWTs for reporting ENMs, their physicochemical characteristics and their interactions at different levels of biological organization. It expands the work initiated by Murugadoss et al. (2021), where a selection of scientific papers was analyzed for ENMs’ connections to AOPs [27]. Twenty-one of the papers investigated in that study were further analyzed to extract detailed information about the 83 unique ENMs, their physicochemical properties, and their potential effects. All materials were registered in the European Registry of Materials (ERM) [28]. The registry allows the creation of a persistent, unique identifier for ENMs to enable correct and precise documentation.

The RDF has been made publicly available through a SPARQL endpoint and a corresponding SNORQL User Interface to facilitate the exploration of the data with SPARQL queries (nanosafety.rdf.bigcat-bioinformatics.org/). This allows us to perform federated SPARQL queries, expanding the developed RDF with information from other sources. Federated SPARQL queries can be used, e.g., to explore the downstream effects of the proposed MIEs in more detail or group nanomaterials based on their ontological annotations.

Here, we present the collected data, the RDF structure and the SPARQL endpoint, demonstrating their applicative nature in answering relevant biological questions.

Methods

Literature review and dataset creation

Murugadoss et al. (2021) identified relevant papers using the search terms “adverse outcome pathway’’ OR “AOP” AND “nano*,” resulting in 960 papers up to 2020-12-01. After refining for duplicates, reviews, AOPs/AOs reports, and the type of organisms studied, 32 papers covering both in vitro and in vivo studies on mammals were selected, as detailed in Additional file 1: Table S1 [27]. From this collection, our focus narrowed to a subset of 21 papers that include measures of the physicochemical properties of reported ENMs. Each ENM described in the analyzed papers was assigned a new ERM identifier to facilitate information extraction and later enable cross-referencing with other data resources [28]. The RDF Schema specification was used to store information and the ENM types were set to chemical substances using the CHEBI_59999 term from the ChEBI ontology [29]. The new identifier, label, creator information and GitHub page were added to the registry Turtle as in Additional file 1: Fig S1.

The ERM identifiers correspond to the ENMs in the specific study reported in the paper. Some ENMs, e.g., standard materials, can later be mapped to other ERM identifiers that describe exactly the same material. More detailed metadata is available at https://h2020-riskgone.github.io/riskgone-materials/RiskGONE_Literature_NM.html.

To enable grouping within the created dataset, or linking with other datasets, the material types were described by introducing semantic annotations of materials. This was done using the NanoParticle Ontology (NPO) [30, 31], eNanoMapper (eNM) ontology [32] and Chemical Entities of Biological Interest (ChEBI) ontology [29] by exploring BioPortal for the fitting material types, aiming for the most precise descriptions of material types.

The physicochemical properties of each ENM described in these scientific papers were extracted and stored in a collection spreadsheet Additional file 2. These properties include the primary particle size, hydrodynamic diameter, shape, diameter, bundle diameter, surface area and zeta potential amongst others see Additional file 2. Additionally, the characterisation method, and producer and model of the measuring device were reported in the same sheet. The data in the spreadsheet were extended with the reported molecular effects of the ENMs, which were then linked to MIE and KE identifiers from the AOP-Wiki. This association was established through a Jupyter Notebook executing SPARQL queries against the AOP-Wiki RDF and involved manual curation [33].

Finally, the metadata of the papers was manually added to the sheet by accessing the papers using their DOI.

Creating the nanosafety RDF

Based on the spreadsheet Additional file 2 that resulted from the literature review, an RDF schema was developed and annotated using standard metadata vocabularies and specialized ontologies Fig. 1. Table 1 shows the ontologies used in developing the RDF along with their prefixes.

Fig. 1
figure 1

Graphical representation of the Resource Description Framework Schema for engineered nanomaterials (ENMs). Recognize subjects (labeled boxes) and predicates (labeled arrows). Objects can be literals (white boxes) or IRIs (blue boxes). The abbreviations of prefixes for the ontologies are defined in Table 1

Table 1 Ontologies used in the NanoSafety RDF model

The transformation from the spreadsheet to RDF was performed using the RDF Mapping Language (RML). RML is a scalable, general mapping language designed to map heterogeneous data structures onto RDF models [34]. The process starts with creating a text document that contains rules on how each element in the spreadsheet can be mapped to a corresponding semantic expression. That is done by defining a subject map specifying the IRI structure for the subject and a set of predicate-object pairs to create the triples semantifying each cell value in the spreadsheet. The mapping document can be created using either RML rules directly or by using YARRRML as an intermediate representation which is then converted to RML using the YARRRML Parser [35]. YARRRML is a human-readable text-based format derived from YAML, allowing researchers to extend and maintain the mapping documents with less effort. It also requires less experience in semantic web formats to deal with, and thus it was chosen to express the mapping rules. Using both the spreadsheet and the mapping document, the RMLMapper [36] tool was used to produce a set of triples, which were then loaded into a triple store for exploring and querying.

Deployment of the SPARQL webservice

In order to make the data publicly accessible, a SPARQL endpoint was loaded with the RDF and made available through nanosafety.rdf.bigcat-bioinformatics.org/ . The SPARQL endpoint is running on top of a Virtuoso triple-store deployed using Docker. The developed Docker image runs both Virtuoso and an Apache web server in one container, and it facilitates data import into Virtuoso using a custom-made bash script.

Furthermore, a SNORQL User Interface was developed to easily use SPARQL queries and share results (nanosafety.rdf.bigcat-bioinformatics.org/). This webpage includes an example query panel that is auto-populated with SPARQL queries stored in GitHub (github.com/h2020-riskgone/SPARQLQueries).

Use cases of the RDF

To test the applicability of the NanoSafety RDF, a Jupyter Notebook was created. The data was queried using SPARQL queries to explore the contents, generate statistics, create graphical overviews, and highlight potential uses of the NanoSafety RDF. The Jupyter Notebook containing the SPARQL queries and code to make all figures are available on GitHub (https://github.com/h2020-riskgone/Nanomaterial-RDF/blob/main/nanoMIE.ipynb).

Various SPARQL queries were executed to explore parts of the data and utilize the interoperability of RDF by performing federated SPARQL queries with the AOP-Wiki, which contains more extensive information about the AOPs that are initiated by some of the ENMs. First, general information is queried about the presence of ENM types, their physicochemical characteristics, and the MIEs that were annotated for all ENMs. One of the MIEs, DNA damage (https://aopwiki.org/events/1194), was selected as a use-case for further exploration because the RDF contains sufficient information for this case, but DNA damage as MIE or KE is a rather specific effect and not a side product of various mechanisms of action. Using the RDF, the MIE of DNA damage was linked to material types and physicochemical characteristics, and potential AOs that can be caused by DNA damage were explored by including information contained in the AOP-Wiki RDF [33].

Results

The RDF schema and SPARQL endpoint

The Nanosafety RDF aims to follow the data models provided by the NPO [30] and the BioAssay Ontology (BAO) [37], two ontologies also reused in the eNM ontology [32]. Other ontologies used are the ChEBI ontology [29] and Vocabulary of Interlinked Datasets (VoID) [46]. Properties were added from the BAO, Chemical Information Ontology [47], NPO [30, 31], and Semanticscience Integration Ontology [44].

This activity resulted in a model where the following entities were modeled as typed resources (i.e., have an rdf:type triple):

  • materials are typed as chemical substance (CHEBI: obo:CHEBI_59999)

  • material types are typed with the ChEBI, NPO or eNM ontology

  • assay readouts are typed as a measurement group (BAO: obo:BAO_0000040)

  • individual endpoints are typed as an endpoint (BAO: bao:BAO_0000179)

  • the procedure methods are typed as a protocol (obo:OBI_0000272)

  • data sets are typed as data set (VoID: void:Dataset).

As can be seen in Fig. 1, each ENM is linked to a reference with dcterms:source. A reference contains the DOI of the article, article title, journal, publication year and author information. The ERM identifier is linked through dcterms:identifier and the semantic annotations of material type through dcterms:type. The dcterms:title corresponds to a label we provided, while rdfs:label is the name of the ENM used in the article. The MIEs mentioned in the paper were linked to MIE and KE identifiers of the AOP-Wiki (see Section "Literature review and dataset creation"). While sio:SIO_000001 (which stand for “is related to”) links the ENM to all KE identifiers from AOP-wiki, sio:SIO_000636 (which means “is trigger for”) only results in those KE ID’s that are labeled as MIE in AOP-wiki. The name used for the KE reported in the paper is linked with nci:C25492 (“effect”). The ENMs participate in (BFO_0000056) a measurement group (bao:BAO_0000040), linking to the measured endpoint (bao:BAO_0000179) and assay (bao:BAO_0000015). Starting from the measured endpoints (which are the physicochemical properties of ENMs such as primary size, hydrodynamic diameter, shape, zeta potential, etc.), their values and units can be explored with the predicates sio:has-value and sio:has-unit, respectively. For each assay, the protocol (obo:OBI_0000272) can be found with the bao:BAO_0002846 predicate. For six physicochemical properties, the data was normalized and added to the RDF using the predicate bao:BAO_0095000 (which means “is derived from”).

A screenshot of the SPARQL endpoint is shown in Fig. 2. The SNORQL User Interface consists of a query box and an example query panel that is auto-populated with SPARQL queries stored in GitHub. The data can be queried through this interface as exports in various formats (.csv, .json, .xml, etc.) or to return tables in an HTML page.

Fig. 2
figure 2

Screenshot of the SNORQL User Interface, consisting of a query box (1) and an example query panel (2) that is auto-populated with SPARQL queries stored in Github. The data can be queried through this interface as exports in various formats (.csv, .json, .xml, etc.) (3) or to return tables in a HTML page (4)

Exploring the contents of the nanosafety RDF

Using the SPARQL endpoint Fig. 2, SPARQL queries were used to extract data from the RDF. In total, the generated RDF consists of 6,355,652 triples. Based on the 21 scientific papers, 87 unique ERMs were identified and included in the RDF, although 4 of these were micro or macro materials and were therefore excluded for the analyses in the rest of this paper. The ENMs were annotated according to their type using terms from the NPO, ChEBI, and eNM ontologies, resulting in a total of 35 types of ENMs. This number was obtained with the SPARQL query “Count-material-types.rq” from the example query panel. For the annotations, NPO was used 29% of the time, ChEBI 23%, and eNM 49%. The most reported ENMs in the selected literature include multi-walled carbon nanotubes, silver nanoparticles, and titanium dioxide nanoparticles see Fig. 3 that were reported by 6, 5 and 4 papers, respectively.

Fig. 3
figure 3

Distribution of the nanomaterials in the dataset, for details see Additional file 1: Table S1

For Fig. 3, six groups are created based on the dcterms:title label using a Jupyter Notebook. These groups will be used throughout the following sections see Additional file 1: Table S1, but as can be seen in Section "Using ontology annotations for grouping of nanomaterials", the ENMs can also be grouped based on their ontology type. The group carbon nanotubes (CNTs) [23] consists of single-walled CNTs [9], multi-walled CNTs [8] and some specific (multi-walled) CNTs. The other carbon-based ENMs [7] contain graphene oxide [3] and graphene [2] among others. There are three types of silica particles [3]. The RDF contains information about nine titanium dioxide nanoparticles [9]. The silver-containing ENMs [11] contain standard materials like silver NM300K [2], generic materials like sphere-like silver nanoparticles [3] and specific particles like polyvinylpyrrolidone-coated silver nanoparticles [2]. Other metals and metal oxide ENMs [30] consist of about 10 rare earth metal ENMs, but also ENMs with iron oxide (3), cerium oxide (5), zinc oxide (5), copper (2), gold (1), cobalt (1), manganese oxide (1) and 27%- and 78%- zirconium oxide-doped cerium oxide nanoparticle (1–1).

Assessment of potential MIE activation showed that the majority of ENMs from the selected papers were described to cause either ROS production (33%), activation of Nrf2 (27%), and DNA damage (24%), as presented in Fig. 4. The AOP-Wiki describes 238 MIEs in total [48], of which 11 could be linked to the ENMs described in the literature, though 3 of these were induced by the same 23 ENMs (namely “ROS generation from photoactivated chemicals”, “Chronic reactive oxygen species” and “Increase, Reactive oxygen species production”) which were therefore only shown once in Fig. 4. In the papers, 43 molecular effects of the ENMs were mentioned, 31 of which could be linked to 64 KEs in AOP-Wiki of which 11 are MIEs see Table 2.

Fig. 4
figure 4

Number of unique ENMs (by counting unique ERM identifiers) coupled to each molecular initiating event (MIE). Each section indicates the proportion of ENMs (n = 53) that have been described to activate each individual MIE (n = 8)

Table 2 Molecular effects of the ENMs mentioned in the analysed papers (first column), linked to 64 KEs in the AOP-Wiki (AOP-Wiki IDs of the KEs are reported in third column) of which 11 are MIEs (AOP-Wiki IDs of KEs that are MIEs are reported in the second column)

During the literature review phase, sixteen different physicochemical properties were extracted, if reported in the paper. These properties were collected in the spreadsheet Additional file 2, and the most documented properties were converted to RDF, including the primary size, hydrodynamic diameter, shape, diameter, specific surface area, bundle diameter and zeta potential. Additional file 1: Table S2 shows the methods used to obtain each physicochemical property. In addition to the original data, normalized data were derived from the original data and added to the RDF.

Based on the reviewed literature, the most frequently reported physicochemical property was the shape, reported for 65 out of the 83 ENMs (78%), followed by the hydrodynamic diameter, primary size and zeta potential at 75%, 70% and 70%, respectively see Table 3. The shapes are distributed as follows: of the ENMs in the RDF 37 are spherical, twenty-one are tube-shaped, five are sheets and the RDF contains also one plate- and one cube-shaped ENM.

Table 3 Overview of physicochemical data with the number of ENMs (extracted by counting the unique ERM identifiers) that report a property and the min, max and mean value of these properties. When applicable, the medium is indicated

Application of the nanosafety RDF: Use case on DNA damage

Next, the application of the NanoSafety RDF was explored with a specific focus on one of the MIEs, DNA damage, which is a widely studied adverse effect of ENMs. By utilizing the NanoSafety RDF framework, we aim to get insights into the grouping of ENMs based on their ontological annotations, followed by exploring the effects that can be related to specified physicochemical properties, and finally extend our knowledge of relevant AOPs that involve DNA damage.

Using ontology annotations for grouping of nanomaterials

One of the advantages of using RDF in combination with ontology annotation is that the ontological tree structure can be used to group the materials using their annotation of material type. For example, MWCNTs and SWCNTs can be grouped by the parent term CNTs, or information related to all metal oxides can be queried at once. This is a useful feature that makes exploring groups of related nanomaterials possible. For example, to get all silver nanoparticles in the RDF, the following query takes the ontology term we assigned to each nanomaterial (dcterms:type ?t) and queries for the parent of this term (rdfs:subClassOf* ?parent), filtering it to be a silver particle.

We can also return all literature sources in the RDF that describe any type of nanoparticle. For example for all CNTs, this amounts to nine papers for the 23 different CNTs in the dataset.

Analyzing physicochemical properties of nanomaterials

As illustrated, SPARQL queries enable the extraction of information, such as the physicochemical properties of ENMs, facilitating the plotting of primary size values, as shown in Fig. 5. The shape of the point marker indicates the impact of each ENM on DNA damage, and the nanomaterial type is color coded similar to Fig. 3 (with the addition of zinc and cerium oxides colored orange and pink, respectively). The obtained data suggest that for, e.g., silver nanoparticles (shown in yellow) the ENMs with a small primary size value are more likely to cause DNA damage (below 26 nm, 4/6 cause DNA damage, while 30 nm and above 0/4), while, titanium dioxide nanoparticles seem to cause DNA damage regardless of their size.

Fig. 5
figure 5

Overview of primary size values in RDF. Marker color based on material type with between brackets (i) the number of ENMs of this type that cause DNA damage, (ii) the number of this type with a primary size value: red titanium oxides (6/7), yellow silver ENMs (4/10), gray carbon ENMs (1/2), orange zinc oxides (2/4), pink cerium oxides (4/7), green metal oxides (0/13), blue CNTs (0/15), marker shape based on results of DNA damage positive (X) or negative (O)

When looking at the fractions in the Fig. 5 caption in more detail, we see that, according to the data in our RDF, no CNTs cause DNA damage (0/15), while for titanium dioxide particles 6 out of the 7 titanium dioxide particles, for which a primary size value was reported in literature, do cause DNA damage. Similarly, a large fraction of the cerium oxides, zinc oxides and silver-containing nanomaterials in our dataset cause DNA damage, while our RDF does not contain data on DNA damage for, e.g., iron oxides or other metal oxides (0/13).

Exploring the biological effects of nanomaterials and potential adverse outcomes

The SPARQL endpoint can also be used to explore the potential AOs of ENMs as a result of the MIEs described in literature, by extending the information with content from the AOP-Wiki. This was done in the query “AOs-from-MIEs.rq” from the example query panel, which explores the downstream effects of the proposed MIEs in more detail.

The query returns 280 results for 26 unique ENMs, with 13 MIE-AO combinations Table 4. Figure 6 shows the association of each specific AO with its corresponding ENM, irrespective of the MIEs outlined in Table 4. Grouping the ENMs, as described in Section. "Exploring the contents of the NanoSafety RDF" and Additional file 1: Table S1, resulted in seven CNTs, two other carbon-based ENMs, five silver-containing ENMs, six titanium dioxide nanoparticles and six other metal (oxide) ENMs.

Table 4 Overview of DNA-related MIEs and the AOs they cause. ERM counts correspond to the number of ENMs in the RDF that cause the MIE-AO combination
Fig. 6
figure 6

Number of ENMs in the RDF that causes a DNA-related MIE per resulting AO (on x-axis), split on type (CNTs (total number from this query n = 7), other carbon-based ENMs (n = 2), metal (oxide) ENMs (n = 6), Silver containing ENMs (n = 5), titanium dioxide (n = 6)

As an example, Table 4 highlights that 18 ENMs could cause an AOP (https://aopwiki.org/aops/444) that starts with an “increase in DNA damage” (MIE) which ultimately leads to a “decrease in population growth rate” (AO). Figure 6 indicates that this effect could be caused by all metal (oxide), silver-containing and titanium dioxide ENMs in this dataset, while only one out of the seven multi-walled carbon nanotubes in the dataset might lead to this specific AO.

Discussion

From the experimental data presented in literature, it is often difficult to fully assimilate the interaction between ENMs and biological structures or systems, which limits insights into the MIEs/KEs they can trigger and inhibits the development of nano-related AOPs. Here, we present a proof of principle of implementing semantic web approaches for reporting ENMs, their physicochemical characteristics, and their interactions with biomolecules and biological systems. It expands on the work initiated by Murugadoss et al. (2021) where a scientific literature analysis was performed to connect ENMs with existing AOPs [27]. That resulted in the identification of ENM-relevant MIEs and KEs, which were used to explore and repurpose chemical-based AOPs and KEs for ENMs to set up in vitro testing strategies, which was demonstrated with two case studies.

Although the approach used here to create the dataset and the RDF-generated form has made the information more FAIR, it has limitations to consider. First, the number of papers analyzed is limited when compared to the increasing number of scientific papers on nanotoxicology that are published every year, which was close to 6000 publications in 2020 [49]. As a proof of principle 21 papers were investigated which restricts the scope of the findings. To obtain a more comprehensive understanding of the relationship between ENMs and their properties with specific AOPs, additional papers could be included for data extraction. However, this goes beyond the scope outlined in this proof-of-principle paper. Second, data quality poses a significant challenge in the curation process. The adequacy, completeness, accuracy, reliability, relevance, and variability of the reported data in the literature all contribute to its overall quality. It is crucial to evaluate these parameters before proceeding with data analysis [50]. Different scoring or weighing systems, such as the ToxRTool [51], the GUIDEnano tool [52] or CRED [53] evaluation methods, can be utilized for qualitative or quantitative data quality assessments. The third limitation relates to the quality of contents in the AOP-Wiki, where the same key events often have different identifiers. AOPs and their KEs provide simplified descriptions of biological processes, but inconsistencies can arise due to overlapping or conflicting definitions across different ontologies. Therefore, careful consideration is required when utilizing AOP-Wiki for data analysis. The limitations that result from inconsistencies and duplications of AOPs pose a limitation to their utility, underscoring the need for ongoing efforts to enhance their quality and comprehensiveness to further refine and strengthen future investigations in the field. Additionally, exploring potential linkages with other databases, such as molecular pathway databases, could offer valuable and broader insights. Fourth, limitations regarding data accessibility present a challenge. Not all data dealing with ENMs toxicity testing, especially "no effect" data, are published or easily accessible. Even publishing the chemicals discussed in an article in a standardized manner is rare [54, 55]. Additionally, industry-held data are hardly accessible due to confidentiality. Although efforts are being made to promote open access and adherence to the FAIR guiding principles [20], depending on the type of data, repositories are not yet fully established and, when available, are not always completed. Finally, the labor-intensive nature of data curation from scientific literature presents difficulties in scaling up the process and covering a large volume of relevant papers. This challenge is not unique to extracting information from journal articles and is also observed in other initiatives focusing on scientific reporting of nanomaterials and characteristics [56]. To overcome this limitation, the scientific community has explored automated methods, such as machine learning and text mining to extract information from unstructured text [57,58,59]. These techniques aim to improve efficiency and reduce inconsistencies in data extraction. However, manual curation can also introduce discrepancies in reporting and interpretation due to the involvement of multiple individuals with different interpretations and terminologies. Efforts have been made to mitigate these inconsistencies by addressing synonyms and rephrasings. Overall, the automatic extraction of information about papers remains a task that requires further development and improvement.

The NanoSafety RDF offers various applications and potential for future development. It demonstrates its utility through a use-case focused on the physicochemical properties of nanomaterials and their potential to cause DNA damage. The use of SPARQL queries on the developed RDF allows for the exploration of the relationships between nanomaterial properties and biological effects. However, the size of the dataset limits the avail of the conclusions that can be drawn from it. For example in the analysis of the effect of primary size on the ability to activate particular types of MIEs, key information (like dissolution rate, agglomeration) which is not included in the dataset are also required to refine the analysis [60]. The availability of a SPARQL endpoint facilitates the usability of the data, allowing users to interact with it through user-friendly web interfaces or computational means. The presented RDF schema can serve as a starting point for reporting nanomaterials and their properties. Another valuable application is the ability to provide feedback to the AOP-Wiki. By linking KEs and MIEs, gaps in existing AOPs can be identified, guiding additional data and research. Moreover, the identification of KEs associated with nanomaterials, but not explicitly described as MIEs in AOP-Wiki, presents an opportunity to propose new AOPs, highlighting areas where the understanding of nanomaterial adverse effects needs improvement. The NanoSafety RDF also allows for similarity searching, where researchers can explore patterns and correlations between physicochemical and ontological characteristics of nanomaterials. This capability could overcome major challenges in nanomaterial similarity assessment and therefore aids in further research and analysis in the field [61, 62].

In future directions, process automation is a key area of focus. The current manual approach to extracting nanomaterial information from scientific papers hinders scalability. However, advancements in text-mining approaches and APIs can automate or assist in the extraction process, significantly improving efficiency and enabling the extraction of a larger volume of data. Maintenance of the NanoSafety RDF is vital as the literature on nanomaterial characterization and potentially harmful effects continues to expand. Employing more comprehensive literature search and selection methods can ensure the inclusion of a broader range of nanomaterials and their characteristics in the RDF. Expanding the RDF content is another important aspect. Currently, the RDF schema includes only six properties due to the restricted reporting of physicochemical characteristics in scientific papers. To capture a more exhaustive understanding of nanomaterials, it is necessary to expand the range of properties included in the schema, facilitating the exploration of correlations between physicochemical properties and molecular effects.

One of the main take-home messages resulting from the described approach is the need for mutual agreement on minimal reporting requirements of physicochemical characteristics of ENMs and a standardized format for these data in future studies. Although it is well known that the physicochemical characteristics of ENMs determine their toxicokinetics and toxicodynamics [63,64,65,66], many studies on AO of ENMs provide not even basic data on ENMs size, shape, surface charge, surface functionalization or core composition. It is also crucial to consider the changes of ENMs properties such as aggregation, agglomeration, dissolution [67, 68] in different exposure conditions for the successful implementation of computational tools and models in risk assessment. Data reporting using templates and the use of identifiers for ENMs [28] are recommended to improve data consistency. Additionally, comprehensive characterization of ENMs’ physicochemical properties is essential for accurate risk assessment in nanotechnological applications [56].

Conclusions

This work presented a proof of principle of implementing semantic web approaches for reporting ENMs and their interactions with biological events and resulted in an open dataset and a publically available SPARQL endpoint for exploration and query answering. The work has shed light on various aspects of nanosafety research, specifically in the context of extracting and analyzing information from the scientific literature. We explored the challenges posed by manual work in data curation and the limitations it imposes on scalability and consistency. Furthermore, we discussed the application of the NanoSafety RDF and its utility in investigating the physicochemical properties of nanomaterials and their potential effects. The RDF's accessibility through SPARQL endpoints and user-friendly interfaces offers researchers the opportunity to explore relationships between properties and biological effects, contributing to the advancement of nanosafety research in a safe(r) and sustainable manner. Looking ahead, we identified future directions and recommendations and emphasized the importance of standardized data reporting and the inclusion of key identifiers in nanosafety studies. Overall, this work has provided insights into the challenges, opportunities, and recommendations for advancing nanosafety research, showcasing the potential of semantic web approaches and computational tools in supporting the understanding of nanomaterials and their effects on human and environmental health.

Availability of data and materials

The data on ENM characteristics and their toxicological effects come from peer-reviewed literature, of which metadata is stored in the created RDF and in the collection spreadsheet. The collection spreadsheet, containing all data that was converted into RDF is available as Additional file 2. The RDF has been archived in Zenodo (https://doi.org/10.5281/zenodo.8076364). The annotations with MIEs and KEs were done with AOP-Wiki (https://aopwiki.org/). The federated SPARQL queries with the AOP-Wiki RDF utilized the AOP-Wiki SNORQL endpoint (https://aopwiki.rdf.bigcat-bioinformatics.org/sparql). The Jupyter notebook that produced the majority of the figures and tables has been made available on GitHub (https://github.com/h2020-riskgone/Nanomaterial-RDF). The documentation on the deployment and loading of the RDF in a SPARQL endpoint is available in GitHub (https://github.com/h2020-riskgone/nanosafety-virtuoso-httpd-docker). The code used to create the RDF including the RML mapping and the output RDF is available on GitHub (https://github.com/h2020-riskgone/Nanomaterials-MIE-interactions-RDF) and a release is archived on Zenodo with DOI (https://doi.org/10.5281/zenodo.8075705).

Abbreviations

AO:

Adverse Outcome

AOP:

Adverse outcome pathway

CNT:

Carbon Nanotube

ENM:

Engineered nanomaterial

IATA :

Integrated Approach to Testing and Assessment

KE:

Key Event

MIE:

Molecular initiating event

NAM:

New Approach Methodology

PhysChem:

Physicochemical

RA:

Risk assessment

RDF:

Resource description framework

RML:

RDF mapping language

SPARQL:

SPARQL protocol and RDF query language

SWNT:

Single-Walled Carbon Nanotube

References

  1. Savolainen K, Alenius H, Norppa H, Pylkkänen L, Tuomi T, Kasper G (2010) Risk assessment of engineered nanomaterials and nanotechnologies—a review. Toxicology 269(2–3):92–104

    Article  CAS  PubMed  Google Scholar 

  2. Hristozov DR, Gottardo S, Critto A, Marcomini A (2012) Risk assessment of engineered nanomaterials: a review of available data and approaches from a regulatory perspective. Nanotoxicology 6(8):880–898

    Article  CAS  PubMed  Google Scholar 

  3. Laux P, Tentschert J, Riebeling C, Braeuning A, Creutzenberg O, Epp A et al (2018) Nanomaterials: certain aspects of application, risk assessment and risk communication. Arch Toxicol 92(1):121–141

    Article  CAS  PubMed  Google Scholar 

  4. Ndayishimiye J, Kumeria T, Popat A, Falconer JR, Blaskovich MAT (2022) Nanomaterials: the new antimicrobial magic bullet. ACS Infect Dis 8(4):693–712

    Article  CAS  PubMed  Google Scholar 

  5. Xie M, Gao M, Yun Y, Malmsten M, Rotello VM, Zboril R et al (2023) Antibacterial nanomaterials: mechanisms, impacts on antimicrobial resistance and design principles. Angew Chem Int Ed 62(17):e202217345

    Article  CAS  Google Scholar 

  6. Hemeg H (2017) Nanomaterials for alternative antibacterial therapy. Int J Nanomed 12:8211–8225

    Article  CAS  Google Scholar 

  7. Makvandi P, Wang C, Zare EN, Borzacchiello A, Niu L, Tay FR (2020) Metal-based nanomaterials in biomedical applications: antimicrobial activity and cytotoxicity aspects. Adv Funct Mater 30(22):1910021

    Article  CAS  Google Scholar 

  8. Mitchell MJ, Billingsley MM, Haley RM, Wechsler ME, Peppas NA, Langer R (2021) Engineering precision nanoparticles for drug delivery. Nat Rev Drug Discov 20(2):101–124

    Article  CAS  PubMed  Google Scholar 

  9. Goenka S, Sant V, Sant S (2014) Graphene-based nanomaterials for drug delivery and tissue engineering. J Controll Relea 10(173):75–88

    Article  Google Scholar 

  10. Hubbell JA, Chilkoti A (2012) Nanomaterials for drug delivery. Science 337(6092):303–305

    Article  PubMed  Google Scholar 

  11. Baptista FR, Belhout SA, Giordani S, Quinn SJ (2015) Recent developments in carbon nanomaterial sensors. Chem Soc Rev 44(13):4433–4453

    Article  CAS  PubMed  Google Scholar 

  12. Bai J, Zhou B (2014) Titanium dioxide nanomaterials for sensor applications. Chem Rev 114(19):10131–10176

    Article  CAS  PubMed  Google Scholar 

  13. Ray PC, Yu H, Fu pp. (2009) Toxicity and environmental risks of nanomaterials: challenges and future needs. J Environ Sci Health Part C 27(1):1–35

    Article  CAS  Google Scholar 

  14. Burden N, Aschberger K, Chaudhry Q, Clift MJD, Doak SH, Fowler P et al (2017) The 3Rs as a framework to support a 21st century approach for nanosafety assessment. Nano Today 12:10–13

    Article  CAS  Google Scholar 

  15. Ankley GT, Bennett RS, Erickson RJ, Hoff DJ, Hornung MW, Johnson RD et al (2010) Adverse outcome pathways: a conceptual framework to support ecotoxicology research and risk assessment. Environ Toxicol Chem 29(3):730–741

    Article  CAS  PubMed  Google Scholar 

  16. Organization for Economic Cooperation and Development (OECD). The Adverse Outcome Pathways development programme workplan—OECD. https://www.oecd.org/chemicalsafety/testing/projects-adverse-outcome-pathways.htm. Accessed 25 May 2023.

  17. Organization for Economic Cooperation and Development (OECD). Integrated Approaches to Testing and Assessment (IATA)—OECD. https://www.oecd.org/chemicalsafety/risk-assessment/iata/. Accessed 25 May 2023.

  18. Patlewicz G, Kuseva C, Kesova A, Popova I, Zhechev T, Pavlov T et al (2014) Towards AOP application—Implementation of an integrated approach to testing and assessment (IATA) into a pipeline tool for skin sensitization. Regul Toxicol Pharmacol 69(3):529–545

    Article  CAS  PubMed  Google Scholar 

  19. Halappanavar S, Nymark P, Krug HF, Clift MJD, Rothen-Rutishauser B, Vogel U (2021) Non-animal strategies for toxicity assessment of nanoscale materials: role of adverse outcome pathways in the selection of endpoints. Small 17(15):2007628

    Article  CAS  Google Scholar 

  20. Wilkinson MD, Dumontier M, IjJ A, Appleton G, Axton M, Baak A et al (2016) The FAIR guiding principles for scientific data management and stewardship. Sci Data 3(1):160018

    Article  PubMed  PubMed Central  Google Scholar 

  21. Jeliazkova N, Apostolova MD, Andreoli C, Barone F, Barrick A, Battistelli C et al (2021) Towards FAIR nanosafety data. Nat Nanotechnol 16(6):644–654 [cito:sharesAuthorWith]

    Article  CAS  PubMed  Google Scholar 

  22. Making nano data FAIR enough. Nat Nanotechnol. 2021 Jun;16(6):607–607 https://doi.org/10.1038/s41565-021-00911-6

  23. Martens M, Ammar A, Riutta A, Waagmeester A, Slenter DN, Hanspers K et al (2021) WikiPathways: connecting communities. Nucleic Acids Res 49(D1):D613–D621 [cito:sharesAuthorWith]

    Article  CAS  PubMed  Google Scholar 

  24. Lane L, Argoud-Puy G, Britan A, Cusin I, Duek PD, Evalet O et al (2012) neXtProt: a knowledge platform for human proteins. Nucleic Acids Res 40(D1):D76–83

    Article  CAS  PubMed  Google Scholar 

  25. Zahn-Zabal M, Michel PA, Gateau A, Nikitin F, Schaeffer M, Audot E et al (2019) The neXtProt knowledgebase in 2020: data, tools and usability improvements. Nucleic Acids Res. https://doi.org/10.1093/nar/gkz995

    Article  PubMed Central  Google Scholar 

  26. Jeliazkova N, Chomenidis C, Doganis P, Fadeel B, Grafström R, Hardy B et al (2015) The eNanoMapper database for nanomaterial safety information. Beilstein J Nanotechnol 6:1609–1634 [cito:sharesAuthorWith]

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Murugadoss S, Vrček IV, Pem B, Jagiello K, Judzinska B, Sosnowska A et al (2021) A strategy towards the generation of testable adverse outcome pathways for nanomaterials. ALTEX—Alternatives to animal experimentation 38(4):580–594 [cito:usesDataFrom] [cito:sharesAuthorWith]

    Article  Google Scholar 

  28. Van Rijn J, Afantitis A, Culha M, Dusinska M, Exner T, Jeliazkova N et al (2022) European registry of materials: global, unique identifiers for (undisclosed) nanomaterials. J Cheminform. https://doi.org/10.1186/s13321-022-00614-7. [cito:sharesAuthorWith]

  29. Degtyarenko K, De Matos P, Ennis M, Hastings J, Zbinden M, McNaught A et al (2007) ChEBI: a database and ontology for chemical entities of biological interest. Nucleic Acids Res 36:D344–D350 [cito:usesMethodIn]

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Thomas D, Thomas D, Pappu R, Baker N (2009) NPO: ontology for cancer nanotechnology research. Nat Preced. https://doi.org/10.1038/npre.2009.3515. [cito:usesMethodIn]

    Article  Google Scholar 

  31. Thomas DG, Pappu RV, Baker NA (2011) NanoParticle ontology for cancer nanotechnology research. J Biomed Inform 44(1):59–74 [cito:usesMethodIn]

    Article  PubMed  Google Scholar 

  32. Hastings J, Jeliazkova N, Owen G, Tsiliki G, Munteanu CR, Steinbeck C et al (2015) eNanoMapper: harnessing ontologies to enable data integration for nanomaterial risk assessment. J Biomed Semant 6(1):10 [cito:usesDataFrom] [cito:sharesAuthorWith] [cito:usesMethodIn]

    Article  Google Scholar 

  33. Martens M, Evelo CT, Willighagen EL (2022) Providing adverse outcome pathways from the AOP-Wiki in a semantic web format to increase usability and accessibility of the content. Appl Vitro Toxicol 8(1):2–13 [cito:usesDataFrom] [cito:sharesAuthorWith]

    Article  Google Scholar 

  34. Dimou A, Sande MV, Colpaert P. RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data. Proceedings of the Workshop LDOW 2014, Linked Data on the Web. Seoul, Korea (2014) https://ceur-ws.org/Vol-1184/

  35. Heyvaert P, De Meester B, Dimou A, Verborgh R (2018) Declarative rules for linked data generation at your fingertips! In: Gangemi A, Gentile AL, Nuzzolese AG, Rudolph S, Maleshkova M, Paulheim H et al (eds) The Semantic Web: ESWC 2018 Satellite Events. Springer International Publishing, Cham, pp 213–217 [cito:usesMethodIn]

  36. Heyvaert P, Chaves-Fraga D, Priyatna F, Corcho O, Mannens E, Verborgh R et al (2019) Conformance test cases for the RDF mapping language (RML). In: Villazón-Terrazas B, Hidalgo-Delgado Y (eds) Knowledge Graphs and Semantic Web. Springer International Publishing, Cham, pp 162–173 [cito:usesMethodIn]

  37. Visser U, Abeyruwan S, Vempati U, Smith RP, Lemmon V, Schürer SC (2011) BioAssay Ontology (BAO): a semantic description of bioassays and high-throughput screening results. BMC Bioinformatics 12(1):257 [cito:usesDataFrom]

    Article  PubMed  PubMed Central  Google Scholar 

  38. DCMI Metadata Terms. https://www.dublincore.org/specifications/dublin-core/dcmi-terms/. Accessed 14 Jun 2023 [cito:usesDataFrom]

  39. Malone J, Holloway E, Adamusiak T, Kapushesky M, Zheng J, Kolesnikov N et al (2010) Modeling sample variables with an experimental factor ontology. Bioinformatics 26(8):1112–1118 [cito:usesDataFrom]

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Sioutos N, de Coronado S, Haber MW, Hartel FW, Shaiu WL, Wright LW (2007) NCI thesaurus: a semantic model integrating cancer-related clinical and molecular information. J Biomed Inform 40(1):30–43 [cito:usesDataFrom]

    Article  CAS  PubMed  Google Scholar 

  41. Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W et al (2007) The OBO foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol 25(11):1251–1255 [cito:usesDataFrom]

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  42. OWL Web Ontology Language Reference. https://www.w3.org/TR/2004/REC-owl-ref-20040210/. Accessed 14 Jun 2023 [cito:usesDataFrom]

  43. RDF 1.2 Schema. Available from: https://www.w3.org/TR/rdf12-schema/. Accessed 14 Jun 2023

  44. Dumontier M, Baker CJ, Baran J, Callahan A, Chepelev L, Cruz-Toledo J et al (2014) The Semanticscience Integrated Ontology (SIO) for biomedical research and knowledge discovery. J Biomed Semant 5(1):14 [cito:usesDataFrom]

    Article  Google Scholar 

  45. Turki H, Shafee T, Hadj Taieb MA, Ben Aouicha M, Vrandečić D, Das D et al (2019) Wikidata: a large-scale collaborative ontological medical database. J Biomed Inform 99:103292 [cito:usesDataFrom]

    Article  PubMed  Google Scholar 

  46. Alexander K, Cyganiak R, Hausenblas M, Zhao J. Describing Linked Datasets with the VoID Vocabulary. https://www.w3.org/TR/void/. Accessed 25 May 2023 [cito:usesDataFrom]

  47. Hastings J, Chepelev L, Willighagen E, Adams N, Steinbeck C, Dumontier M (2011) The chemical information ontology: provenance and disambiguation for chemical data on the biological semantic web. PLoS ONE 6(10):e25513 [cito:usesDataFrom] [cito:sharesAuthorWith]

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. AOP-Wiki SPARQL query: number of MIEs. https://aopwiki.rdf.bigcat-bioinformatics.org/sparql/?default-graph-uri=aopwiki.org&query=select+%28count+%28distinct+%3Fmie%29+as+%3Fnmie%29++where+%7B%0D%0A++%3Faop+a+aopo%3AAdverseOutcomePathway+%3B+aopo%3Ahas_molecular_initiating_event+%3Fmie%7D&should-sponge=&format=text%2Fhtml&timeout=0&debug=on&run=+Run+Query+. Accessed 25 May 2023

  49. Krug HF (2022) Collection of controlled nanosafety data—the cocon-database, a tool to assess nanomaterial hazard. Nanomaterials 12(3):441

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. European Chemicals Agency. Guidance on information requirements and chemical safety assessment. 2011. https://echa.europa.eu/documents/10162/13643/information_requirements_r4_en.pdf/d6395ad2-1596-4708-ba86-0136686d205e. Accessed 14 Jun 2023

  51. Schneider K, Schwarz M, Burkholder I, Kopp-Schneider A, Edler L, Kinsner-Ovaskainen A et al (2009) “ToxRTool”, a new tool to assess the reliability of toxicological data. Toxicol Lett 189(2):138–144

    Article  CAS  PubMed  Google Scholar 

  52. Fernández-Cruz ML, Hernández-Moreno D, Catalán J, Cross RK, Stockmann-Juvala H, Cabellos J et al (2018) Quality evaluation of human and environmental toxicity studies performed with nanomaterials—the GUIDEnano approach. Environ Sci Nano 5(2):381–397

    Article  Google Scholar 

  53. Kase R, Korkaric M, Werner I, Ågerstrand M (2016) Criteria for reporting and evaluating ecotoxicity data (CRED): comparison and perception of the Klimisch and CRED methods for evaluating reliability and relevance of ecotoxicity studies. Environ Sci Eur 28(1):7

    Article  PubMed  PubMed Central  Google Scholar 

  54. Schymanski EL, Bolton EE (2021) FAIR chemical structures in the Journal of Cheminformatics. J Cheminformatics 13(1):50

    Article  Google Scholar 

  55. Schymanski EL, Bolton EE (2022) FAIRifying the exposome journal: templates for chemical structures and transformations. Exposome. https://doi.org/10.1093/exposome/osab006

    Article  Google Scholar 

  56. Giusti A, Atluri R, Tsekovska R, Gajewicz A, Apostolova MD, Battistelli CL et al (2019) Nanomaterial grouping: existing approaches and future recommendations. NanoImpact 16:100182

    Article  Google Scholar 

  57. Lewinski NA, McInnes BT (2015) Using natural language processing techniques to inform research on nanotechnology. Beilstein J Nanotechnol 1(6):1439–1449

    Article  Google Scholar 

  58. De Miranda SM, Coelho GM, Dos Santos DM, Filho LF (2006) Text mining as a valuable tool in foresight exercises: a study on nanotechnology. Technol Forecast Soc Change 73(8):1013–1027

    Article  Google Scholar 

  59. Boekel RC, Cunningham SW, Wang X. 2023 Exploring the Nanotechnology Landscape for Competitive Advantage Using SAO-Mining. In: 2017 Portland International Conference on Management of Engineering and Technology (PICMET). Portland, OR: IEEE; 2017. p. 1–11. http://ieeexplore.ieee.org/document/8125267/. Accessed 24 May 2023.

  60. Misra SK, Dybowska A, Berhanu D, Luoma SN, Valsami-Jones E (2012) The complexity of nanoparticle dissolution and its importance in nanotoxicological studies. Sci Total Environ 438:225–232

    Article  CAS  PubMed  Google Scholar 

  61. Park MV, Catalán J, Ferraz N, Cabellos J, Vanhauten R, Vázquez-Campos S et al (2018) Development of a systematic method to assess similarity between nanomaterials for human hazard evaluation purposes—lessons learnt. Nanotoxicology 12(7):652–676

    Article  CAS  PubMed  Google Scholar 

  62. Comandella D, Gottardo S, Rio-Echevarria IM, Rauscher H (2020) Quality of physicochemical data on nanomaterials: an assessment of data completeness and variability. Nanoscale 12(7):4695–4708

    Article  CAS  PubMed  Google Scholar 

  63. Dekkers S, Oomen AG, Bleeker EAJ, Vandebriel RJ, Micheletti C, Cabellos J et al (2016) Towards a nanospecific approach for risk assessment. Regul Toxicol Pharmacol 1(80):46–59

    Article  Google Scholar 

  64. Hagens WI, Oomen AG, de Jong WH, Cassee FR, Sips AJAM (2007) What do we (need to) know about the kinetic properties of nanoparticles in the body? Regul Toxicol Pharmacol 49(3):217–229

    Article  CAS  PubMed  Google Scholar 

  65. Tsuji JS, Maynard AD, Howard PC, James JT, Lam C, wing, Warheit DB, et al (2006) Research strategies for safety evaluation of nanomaterials, part IV: risk assessment of nanoparticles. Toxicol Sci 89(1):42–50

    Article  CAS  PubMed  Google Scholar 

  66. Scientific Committee EFSA, More S, Bampidis V, Benford D, Bragard C, Halldorsson T et al (2021) Guidance on risk assessment of nanomaterials to be applied in the food and feed chain: human and animal health. EFSA J 19(8):e06768

    Google Scholar 

  67. Feliu N, Docter D, Heine M, Del Pino P, Ashraf S, Kolosnjaj-Tabi J et al (2016) In vivo degeneration and the fate of inorganic nanoparticles. Chem Soc Rev 45(9):2440–2457

    Article  CAS  PubMed  Google Scholar 

  68. Sharma VK, Siskova KM, Zboril R, Gardea-Torresdey JL (2014) Organic-coated silver nanoparticles in biological and environmental conditions: fate, stability and toxicity. Adv Colloid Interface Sci 204:15–34

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

We thank GitHub for providing their open innovation platform.

Funding

This research was funded by European Union Horizon 2020 Programme (H2020) under Grant agreement nº 814425 (RiskGONE), grant agreement nº 814572 (NanoSolveIT) and Grant agreement nº 862195 (SbD4Nano).

Author information

Authors and Affiliations

Authors

Contributions

Conceptualization and methodology, E.W., M.M., J.v.R.; writing—original draft preparation, J.v.R., M.M., A.A., E.W.; writing—review and editing, M-R.C., V.F., S.M., I.V.V., N.J., P.H.; data extraction, J.v.R., M.M. M-R.C., S.M., I.V.V.; data curation, J.v.R.; RDF creation, A.A.; SPARQL service setup, M.M., J.v.R; visualization, J.v.R., M.M., A.A.; supervision, E.W.; project administration, J.v.R., M.M.. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Egon L. Willighagen.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

All authors give consent for publication in the Journal of Cheminformatics.

Competing interests

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

Example of an ERM identifier as added to the registry Turtle. Table S1. Overview of nanoparticles (ENMs) in the NanoSafety RDF and how they are divided into the six groups shown in Figure 3 and 6. Names from the papers are abbreviated using CNTs for carbon nanotubes and NP for nanoparticles. Table S2. Overview of the methods and instruments used to obtain each physicochemical property.

Additional file 2:

Spreadsheet with raw data. The main sheet with the final data is the seventh sheet.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

van Rijn, J.P.M., Martens, M., Ammar, A. et al. From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials. J Cheminform 16, 49 (2024). https://doi.org/10.1186/s13321-024-00833-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13321-024-00833-0

Keywords