Skip to main content

Helping to improve the practice of cheminformatics

As we take over the editorial reigns of J. Cheminform. we would like to present our goals for the journal and our views on where cheminformatics as a field is headed and how the journal may influence it. As editors our overarching goal is to disseminate cheminformatics research and practice that is impactful and useful within the community. Open Access publishing remains central to this, as well as continued innovation in simplifying reuse of published work. This includes encouraging FAIR [1] outputs and clear, open licensing, resulting in research products that can be reused and reproduced. Recent updates to the journal’s website exemplify our attempts to encourage reuse of data from articles (e.g. integration with FigShare).

In parallel with good publishing practice, we want to focus the journal on innovative research in, and the practice of, cheminformatics. This leads to an obvious question: what do we consider cheminformatics? An answer to this allows us to judge where innovation lies. Very broadly, cheminformatics addresses the generation, representation and manipulation of chemical information, with the chemical structure at its foundation. And thus innovation can occur at multiple levels, starting from the chemical structure itself (non-graph representations, learned representations) to novel ways to integrate chemical information with other data types (such as sequence data or protein structure data). At the same time, it is important to understand the bottlenecks and challenges in dealing with chemical information. While not necessarily novel, discussion of how organizations deal with chemical information problems and practice cheminformatics, which tools work and which do not, are all valuable contributions to the field.

Cheminformatics practice

Given the broad definition, it is useful to focus on a few areas that we feel are relevant to the future of the field. This aspect was touched upon by Wild [2] and more recently by others [3, 4], who focused on specific challenges, many of which remain open. Taking a broad view, we believe that the future of cheminformatics lies in addressing large scale chemical information, across different domains. Particularly, the increased size of chemical space that can be explored, either explicitly [5] or implicitly [6] leads to opportunities in many areas of cheminformatics including property prediction, searching and representing such massive chemical spaces. Given the interdisciplinary nature of the field [7], many of the problems associated with this vision will require a combination of expertise in computer science, statistics, chemistry and biology and we hope to encourage scientists, who are not necessarily chemists, to contribute their expertise via collaborations. While a vision provides us direction, it is useful to point out specific areas that we are actively looking to publish in. Examples include:

  • New methodology and work that address theoretical underpinnings of cheminformatics methods, such as new representations to complement QM and chemical graphs and methods to derive them;

  • Approaches to describe and model new data types such as compound combination responses;

  • New topical areas, such as approaches to modeling and screening inorganic materials or informatics approaches to QM/MM problems that enables application to large datasets; and

  • New perspectives on how chemical information can and should be employed in multi-disciplinary settings such as drug discovery and development pipelines, clinical environments or regulatory settings.

Importantly, this partial list is driven by our scientific interests and vision. It is not comprehensive and we encourage authors to contact us regarding topics not listed here.

Publishing practice

While aiming for high quality, relevant and innovative topical content is a primary goal, it is equally important that we ensure the usability and accessibility of the content published by the journal. Our goal is to publish material that can be reused, modified, and redistributed as far as possible. For studies that involve proprietary datasets, we encourage authors to include at least one non-proprietary dataset so that users can reproduce the study. Similar requirements for software usage are described elsewhere (https://jcheminf.springeropen.com/submission-guidelines/preparing-your-manuscript/software).

We aim to follow, and encourage authors to follow, FAIR principles, which outline how research output is findable, accessible, interoperable, and reuseable (e.g. reflected by the participation in the Initiative for Open Citations). As an Open Access journal, open licensing of content (including data and code) is a foundational aspect and coupled with waivers, we hope to further remove barriers to adoption of material published in the journal and encourage authors from around the world to consider J. Cheminform. as a venue for their work.

At the journal level, we will continue to work on improving dissemination of research published in J. Cheminform. using a variety of mechanisms such as videos, blog posts and Twitter (https://twitter.com/jcheminf). We are actively reviewing article and reviewer guidelines to ensure clarity in terms of what is or isn’t acceptable for different article types. More generally, as a platform for the dissemination of computational research we hope to contribute to the solution of challenges described by Neylon et al. [8] and List et al. [9]

Conclusion

While pontificating on the future is always a fun task, we are committed to the more challenging task of bringing the future to the present. To that end, we are currently designing a series of thematic issues and actively soliciting articles in a number of the areas listed above. In these efforts, the Editorial Board has provided useful inputs and we would like to acknowledge their participation in the journal’s activities. Simultaneously, we are working with the publisher to make our content more accessible in more ways.

We believe that cheminformatics has a bright and broad future and that by focusing on emerging topics, cheminformatics practice and ensuring Open publishing policies, J. Cheminform. will play a key role in pushing the field forward.

References

  1. Wilkinson MD et al (2016) The FAIR Guiding Principles for scientific data management and stewardship. Sci Data 3:160018+

    Article  Google Scholar 

  2. Wild DJ (2009) Grand challenges for cheminformatics. J Cheminform 1:1–2

    Article  Google Scholar 

  3. Gasteiger J (2014) Solved and unsolved problems of chemoinformatics. Mol Inf 33:454–457

    Article  CAS  Google Scholar 

  4. Cherkasov A et al (2014) QSAR modeling: where have you been? where are you going to? J Med Chem 57:4977–5010

    Article  CAS  Google Scholar 

  5. Reymond J-L (2015) The chemical space project. Acc Chem Res 48:722–730

    Article  CAS  Google Scholar 

  6. Peng Z (2013) Very large virtual compound spaces: construction, storage and utility in drug discovery. Drug Discov Today 10:e387–e394

    Article  Google Scholar 

  7. Wegner JK, Sterling A, Guha R, Bender A, Faulon J-L, Hastings J, O’Boyle N, Overington J, Van Vlijmen H, Willighagen E (2012) Cheminformatics. Commun ACM 55:65–75

    Article  Google Scholar 

  8. Neylon C, Aerts J, Brown C, Lemire D, Millman J, Murray-Rust P, Perez F, Saunders N, Smith A, Varoquaux G, Willighagen E (2012) Changing computational research. The challenges ahead. Sour Code Biol Med 7:2+

    Article  Google Scholar 

  9. List M, Ebert P, Albrecht F (2017) Ten simple rules for developing usable software in computational biology. PloS Comput Biol 13:e1005265+

    Article  Google Scholar 

Download references

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Rajarshi Guha or Egon Willighagen.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guha, R., Willighagen, E. Helping to improve the practice of cheminformatics. J Cheminform 9, 40 (2017). https://doi.org/10.1186/s13321-017-0217-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s13321-017-0217-z