Preprints

Filtering by Subject: Data Management Software

h5RDMtoolbox - A Python Toolbox for FAIR Data Management around HDF5

Matthias Probst, Balazs Pritz

2023-10-11   Data Management Software

Sustainable data management is fundamental to efficient and successful scientific research. The FAIR principles (Findable, Accessible, Interoperable and Reusable) have been proven to be successful guidelines to enable comprehensible analysis, discovery and re-use. Although the topic has recently gained increasing awareness in both academia and industry, the engineering sciences in particular are [...]


An Empirical Study of the State of Research Data Management in the Semiconductor Manufacturing Industry

Dirk Ortloff, Sabrina Anger, Martin Schellenberger

2023-06-07   Data Infrastructure, Data Management Software

The paper presents insights into the situation concerning research data management (RDM) in the high-tech manufacturing industry and respective research institutions. Besides standards and guidelines, data management and its degree of formalization play a decisive role in digital transformation in all organizations. The authors of this study benefited from the opportunity arising within the [...]


RDM Platform Coscine - FAIR play integrated right from the start

Ilona Lang, Marcel Nellesen, Marius Politze

2023-05-05   Data Infrastructure, Data Management Software

Nowadays, researchers often need to distribute their research data among a multitude of service providers with varying (if any) levels of maturity in terms of FAIR research data management (RDM). To provide researchers with a single point of access to their project data and to add a FAIR layer to already established services, the RDM platform Coscine was developed. Within Coscine different [...]


From Ontology to Metadata: A Crawler for Script-based Workflows

Giuseppe Chiapparino, Benjamin Farnbacher, Nils Hoppe, et al.

2023-04-27   Data Management Software

The present work introduces HOMER (HPMC tool for Ontology-based Metadata Extraction and Re-use), a python-written metadata crawler that allows to automatically retrieve relevant research metadata from script-based workflows on HPC systems. The tool offers a flexible approach to metadata collection, as the metadata scheme can be read out from an ontology file. Through minimal user input, the [...]


Agile Research Data Management with Open Source: LinkAhead

Daniel Hornung, Florian Spreckelsen, Thomas Weiß

2023-03-28   Data Management Software

Research data management (RDM) in academic scientific environments increasingly enters the focus as an important part of good scientific practice and as a topic with big potentials for saving time and money. Nevertheless, there is a shortage of appropriate tools, which fulfill the specific requirements in scientific research. We identified where the requirements in science deviate from other [...]


Betty’s (Re)Search Engine: A client-based search engine for research software stored in repositories.

Vasiliy Seibert, Andreas Rausch, Stefan Wittek

2023-03-09   Data Management Software

Promoting research, without providing the source code that was used to conduct the research, means a greater effort for every researcher down the line. Existing solutions that aim to make research software FAIR [1], fail to provide a wholesome solution, for they do not sufficiently consider already existing research software stored on platforms like GitHub or organizational GitLabs. We therefore [...]


Towards Improved Findability of Energy Research Software by Introducing a Metadata-based Registry

Stephan Ferenz, Astrid Nieße

2023-03-01   Data Infrastructure, Data Management Software

Research software in the energy domain becomes increasingly important for the analysis, simulation, and optimization of energy systems and supports design decisions in the required transition of energy systems to tackle the climate crisis. To make energy research software (ERS) more findable, it should be described with metadata following the FAIR (findable, accessible, interoperable, and [...]


Evaluation of tools for describing, reproducing and reusing scientific workflows

Philipp Diercks, Dennis Gläser, Ontje Lünsdorf, et al.

2022-12-06   Data Management Software

In the field of computational science and engineering, workflows often entail the application of various software, for instance, for simulation or pre- and postprocessing. Typically, these components have to be combined in arbitrarily complex workflows to address a specific research question. In order for peer researchers to understand, reproduce and (re)use the findings of a scientific [...]


plotID - a toolkit for connecting research data and visualization

Martin Hock, Hannes Mayr, Manuela Richter, et al.

2022-09-05   Data Management Software

The highest amount of published information on paper is contained in visualizations such as 2D and or 3D plots. Supporting a generic research workflow, plotID provides tools that can a) create and anchor a reference (ID code, URL,...) for and b) package figures, data, code and parameters used to create the figure. The code is provided as tools with small impact, that need to be used [...]