In recent years, interoperability has become an important concept in the field of research data management. Discussions about interoperability have taken place in published articles and numerous presentations [1] [2], and several interoperability standards have been developed [3], [4], [5]. The term has been applied, among other things, to data, metadata and research tools.
This post focuses on interoperability between research tools. Thus far this topic has been primarily explored through looking at interoperability between different tools which fall into the same tool category. The most prominent examples are generalist data repositories and data management planning tools. Some work has also been done in the electronic lab notebook interoperability sphere. After a brief summary of these three examples of ‘horizontal’ interoperability, we explore an important emerging area of interest which we call ‘vertical’ interoperability: interoperability between tools that belong to different tool categories. This post builds on work done as part of an NSF grant [6] we are carrying out in collaboration with the California Digital Library, and a presentation and discussion of vertical interoperability at the annual meeting of the Generalist Repository Ecosystem Initiative (GREI) project which took place in Chicago in September 2024 [7].
Horizontal tool interoperability
For data repositories, the RO-Crate format has been designed to be compatible with all generalist repositories and provides well-described data objects and metadata through a ZIP bundle [8] [9]. Additionally, GREI is enabling collaboration and the development of a shared standard for the major generalist repositories [10].
With regards to data management planning tools, much effort has been put into developing machine-actionable DMPs [11] and defining a common metadata standard to enable interoperability both between DMP tools themselves, and tools that consume their data [12]. What is more, a shared codebase in the form of DMP Roadmap, that is extended on by several other DMP tools, enables a base of shared features to be reused across similar tools.
When it comes to electronic lab notebooks, the ELN format [13], an extension of RO-Crate that is designed for files generated by ELNs, provides a format for the exchange of experimental research materials and data through a common archive format. Custom-built import/export capabilities using open formats such as CSV are sometimes available for ELNs but are often designed around the conversion of data from one tool’s schema into another tool’s schema, rather than through utilizing a shared metadata format.
Vertical tool interoperability
As the number and categories of research tools continue to expand at a rapid pace, so does their use in research. Research often takes the form of complex chains of interaction across various specialist and generalist tools, and addressing the challenge of enabling researchers to gracefully use multiple tools together in their workflows has taken on increasing importance. We now look at vertical interoperability in more detail, by putting it in context first.
The research process: from linear progression to lifecycle
Until relatively recently, research was generally thought of as a linear process: an experiment was designed, which produced data, which was processed and analyzed, and the results were written up in a publication. Along with the increasing focus on data and the FAIR principles [14], another element was added to what was still conceived of as a linear process: the deposit of research into a data repository. This would generally happen at the end of the process, and often in conjunction with publication. In the past ten years or so, another element was added to the process: preparation of a data management plan at the start of the process.
The advent of data repositories and data management planning tools, the proliferation of different kinds of research tools, and the rise of the FAIR principles, along with other trends such as the development of regional and national research infrastructures have contributed to a shift from thinking about research as a linear process to conceptualising it as a continuous lifecycle. This change in thinking helps stimulate a rethinking of data and their flows. The repository is no longer the final resting place of data produced in the research process, but rather a staging platform on an ongoing journey where the data will continue to be used and modified by further research. Importantly, this modification and re-use will mostly take place using the same categories of research tools (though not necessarily the same exact tools) that were used across the pre-deposit stages of the research lifecycle.
Vertical interoperability in the context of the research lifecycle
In this way of thinking, vertical interoperability between research tools becomes a critical enabling factor for the FAIRification of data. Without interoperability between tools, it’s impossible for data to pass through different stages of the lifecycle without requiring time-consuming manual reformatting before ingestion, and runs the risk of losing its integrity due to the differences in focus and metadata formats between tools. Continuous use, re-use and modification of research data cannot take place in a sustainable manner.
Currently, vertical interoperability between research tools is a rarity and the exception rather than the rule. Examples include integrations between the Argos DMP tool and the Zenodo repository, which enables deposit of DMPs from Argos into Zenodo, and between the protocols-sharing app protocols.io and Lab Archives, which enables two way transfer of protocols between the two applications.
The benefits of these vertical integrations are many:
- Ability to trigger a transfer of metadata and data from one tool to the next reduces manual effort and potential mistakes in transferring the data
- Automatic links can be made to preserve the relationship between the information contained in both services, greatly enhancing research traceability
- Streamlining researcher workflows, by enabling powerful actions to be performed from within a familiar, regularly used tool
As far as we are aware, our research data platform RSpace is the only tool that has systematically developed integrations with multiple tools from different stages of the research lifecycle. We present our ecosystem of integrations in the following graphic:
Challenges in achieving vertical interoperability
We outline below some of the challenges we’ve encountered when building vertical interoperability:
User experience methodology
Extensive time and effort are needed to ensure the proposed integration solution is not overly specific for one specific use case nor too generic and is flexible enough to be adjusted based on actual researcher workflows. User research is essential to validate the proposed integration design, but requires experience in user interviewing, mockup generation and high-level technical specification writing to be performed effectively. The development of these intermediary materials has the added benefit of creating a shared language between tool developers and institutions, both between and among themselves. This enables broader conversations and comparisons on a constantly growing topic.
API-driven development
To successfully design powerful integrations, all actions that are achievable through the tool’s user interface should ideally be available through an API. However, differences in API maturity, documentation, availability, and differences in design approaches can result in bottlenecks that greatly limit the scope of an integration, especially if there is direct interaction between more than two tools. Integrations are also costly to maintain since they need regular maintenance and testing to ensure workflows are working as intended.
From monolith tools to tool ecosystem thinking
Adopting vertical interoperability means adopting modular tool design, where each tool in the ecosystem is responsible for achieving its concrete purpose well. This means resisting the temptation to build a tool that reinvents the wheel with the hopes that it will be better for a specific research context. The amount of design, development and planning resources required to build a tool from scratch can be greatly reduced by exploring how existing and proven tools could be expanded upon to provide additional flexibility.
Of course, it is essential that the many tools in use are presented to researchers as a unified experience that directly enables their workflows, thus there is a need for a new kind of tool—a connector, or front-end—that provides a unified interface. As an example, RSpace integrates data repositories into the document export flow, and provides access to chemistry tools from within a document, where these tools would naturally be relevant to access from.
It must be noted that for all three of the challenges listed above, the ability to work closely with members of each team is invaluable and necessary for the success of an interoperability project. This is especially the case for vertical interoperability projects, as they might involve co-creation with people across domains who do not share the same core assumptions and knowledge.
Overcoming the challenges
A conceptual scaffolding: The MaLDReTH Map
The MaLDReTH Map of the Digital Research Tools Landscape [15], a recent output of the Research Data Alliance, provides a conceptual scaffolding for thinking about and implementing vertical interoperability. MaLDReTH is centered on a harmonized model of the research data lifecycle: for each stage of the cycle, three representative categories of tools have been identified, with each tool category listing three concrete examples of tools used in that stage of the lifecycle.
MaLDReTH lifecycle and structure diagrams
A detailed view of the first three stages in MaLDReTH, with their respective tool category and tool example listings
For the first time, MaLDReTH provides a visual representation of the research data lifecycle which includes concrete examples of research tools used in each stage of the lifecycle, with supporting explanation and documentation. As such, MaLDReTH has the potential to act as a significant stimulant in the development of more widespread vertical interoperability between research tools, which is a core prerequisite for enabling streamlined flows of data and metadata throughout the research lifecycle.
Three kinds of actors can take advantage of MaLDReTH and help drive the development of research infrastructures which are built around vertical interoperability.
Developing organizational examples and use cases
MaLDReTH is already being used by research organizations as a reference for assessing and building research infrastructures and has been adopted for this purpose by Oxford University, University College London, JISC and the California Digital Library. In the coming year, the RDA MaLDReTH II Working Group plans to host a series of workshops to introduce MaLDReTH more widely, which should lead to more widespread adoption. The use cases this will result in will highlight the benefits of existing vertical interoperability, as well as identify the gaps remaining. The development of concrete vertical interoperability examples that all utilise the MaLDReTH model as a base will enable shared understanding of core interoperability concepts, and easier comparison of the benefits and drawbacks of implementation approaches.
A role for funders
Most research tools utilised in academic research benefit from at least some public funding, and in many cases are the result of dedicated public funding. Thus, funders have the ability to play a key role in driving vertical interoperability. Firstly, they can do so by requiring a demonstration of awareness of adjacent tools that are already in use elsewhere, that could be built upon to fit within current workflows and ecosystems, to discourage the projects that aim to create new, all-encompassing and highly specific tools from scratch. Similarly, approaches that consider multi-tool workflows could be encouraged. Secondly, they can require functional APIs to be developed alongside the product and treated as core functionality, rather than, as is often the case, treating APIs as an optional and incomplete afterthought.
A change in mentality for tools developers
Direction from adopters of research tools, i.e. research organizations and funders, can drive a change in approach by the developers of new research tools as well as existing tools. With a change in focus to supporting research workflows, which in most cases require application of multiple tools, collaborations and partnerships between tools to provide a joint service come to the fore. A particular tool typically can’t address needs at every stage of the workflow, so even in the initial design of a tool, thought needs to be given to ‘adjacent’ tools and how integrations with them can best support the relevant workflow(s). Interoperability and the development of the relevant APIs then become a core part of tool design, rather than an afterthought.
References
- Lawrence, Rebecca. 2023-04-06. The Scholarly Kitchen. https://scholarlykitchen.sspnet.org/2023/04/06/guest-post-why-interoperability-matters-for-open-research-and-more-than-ever/
- Vasan, Akash. 2024-10-16. Interoperability in Open Solutions: The Key to Collaboration (and Why It's So Complicated) https://www.mysciencework.com/omniscience/interoperability-open-solutions
- Curtin, L., Feri, L., Gautier, J., Gonzales, S., Gueguen, G., Scherer, D., Scherle, R., Stathis, K., Van Gulick, A., & Wood, J. (2023). GREI Metadata and Search Subcommittee Recommendations_V01_2023-06-29. Zenodo. https://doi.org/10.5281/zenodo.8101957
- El-Gebali, S., Juty, N., Edmunds, R., Stathis, K., & Mejias, G. (2024). Harmonizing Metadata Across Disciplines – Bioschemas and the DataCite Metadata Schema. DataCite. https://doi.org/10.5438/VZQP-M504
- Scardaci Diego Orazio, Sciacca Eva, Hériché Jean-Karim, Van De Sanden Mark, Klaas Wierenga, Manghi Paolo, Tamburri Damian, Mazon Jose Norberto, López García Álvaro, Hugo Wim, Pansanel Jerome, & Krøl Andersen Lene. (2023). A landscape overview of the EOSC Interoperability Framework - Capabilities and Gaps (Version 1). Zenodo. https://doi.org/10.5281/zenodo.8399710
- Conference: CI PAOS: Advancing Research Data Management through Enhanced Vertical Interoperability, Award # 2433321. https://www.nsf.gov/awardsearch/showAward?AWD_ID=2433321
- Plankytė, V., & Macneil, R. (2024, September 20). Advancing Research Data Management through Enhanced Vertical Interoperability. GREI Annual Meeting 2024, Chicago, USA. RSpace. https://doi.org/10.5281/zenodo.13829209
- Soiland-Reyes, Stian et al. ‘Packaging Research Artefacts with RO-Crate’. 1 Jan. 2022: 97 – 138. https://doi.org/10.3233/DS-210053
- Sefton, P., & Soiland-Reyes, S. (2024, June 28). Five ways RO-Crate data packages are important for repositories. Open Repositories 2024 (OR2024), Gothenburg, Sweden. Zenodo. https://doi.org/10.5281/zenodo.12579432
- Generalist Repository Ecosystem Initiative. NIH. https://datascience.nih.gov/data-ecosystem/generalist-repository-ecosystem-initiative
- Miksa, T., Chodacki, J., Suchánek, M., Praetzellis, M., Papadopoulou, E., Jacquemot, M.-C., & Kevin, A. (2024). Salzburg Manifesto on machine actionable Data Management Plans. Zenodo. https://doi.org/10.5281/zenodo.10658522
- Tomasz Miksa, Paul Walk, Peter Neish. RDA DMP Common Standard for Machine-actionable Data Management Plans. http://doi.org/10.15497/rda00039
- The ELN Consortium. TheELNFileFormat GitHub Repository. https://github.com/TheELNConsortium/TheELNFileFormat
- Chue Hong, N. P., Katz, D. S., Barker, M., Lamprecht, A.-L., Martinez, C., Psomopoulos, F. E., Harrow, J., Castro, L. J., Gruenpeter, M., Martinez, P. A., Honeyman, T., Struck, A., Lee, A., Loewe, A., van Werkhoven, B., Jones, C., Garijo, D., Plomp, E., Genova, F., … RDA FAIR4RS WG. (2022). FAIR Principles for Research Software (FAIR4RS Principles) (1.0). Zenodo. https://doi.org/10.15497/RDA00068
- Vials Moore, A., Macneil, R., Clare, C., Garcia, M., Pitts, R., & RDA OfR Mapping the Landscape of Digital Research Tools WG. (2024). Mapping the Landscape of Digital Research Tools. Research Data Alliance. https://doi.org/10.15497/RDA00125
Copyright © 2025 Rory Macneil, Vaida Plankytė. Distributed under the terms of the Creative Commons Attribution 4.0 License.