Preprints

There are 14 Preprints listed.

h5RDMtoolbox - A Python Toolbox for FAIR Data Management around HDF5

Matthias Probst, Balazs Pritz

2023-10-11   Data Management Software

Sustainable data management is fundamental to efficient and successful scientific research. The FAIR principles (Findable, Accessible, Interoperable and Reusable) have been proven to be successful guidelines to enable comprehensible analysis, discovery and re-use. Although the topic has recently gained increasing awareness in both academia and industry, the engineering sciences in particular are [...]


Job and Operation Entropy in Job Shop Scheduling: A Dataset

Marco Kemmerling, Maciej Combrzynski-Nogala, Aymen Gannouni, et al.

2023-08-28   Data Sets

The job shop problem is an NP-hard problem of high practical relevance, which has received and continues to receive considerable attention in the literature. Approaches to the problem are typically benchmarked on publicly available datasets containing sets of problem instances. These problem instances are usually generated by some mechanism involving randomisation of instance properties or by maximising [...]


Challenges in publishing research data – a Fraunhofer Case Study

Andrea Wuchner, Michèle Robrecht, Pierre Kehl, et al.

2023-08-14   Data Infrastructure

Sharing of research data is becoming more and more established as part of the scientific process, triggered by corresponding requirements of research funders. A large number of subject-specific and institutional research data repositories were created as publication agents for the research data. Nevertheless, the publication processes are not yet established and need to find a way to best [...]


An Empirical Study of the State of Research Data Management in the Semiconductor Manufacturing Industry

Dirk Ortloff, Sabrina Anger, Martin Schellenberger

2023-06-07   Data Infrastructure, Data Management Software

The paper presents insights into the situation concerning research data management (RDM) in the high-tech manufacturing industry and respective research institutions. Besides standards and guidelines, data management and its degree of formalization play a decisive role in digital transformation in all organizations. The authors of this study benefited from the opportunity arising within the [...]


Towards categorizing ethical questions in data literacy

Samira Khodaei, Anas Abdelrazeq, Ingrid Isenhardt

2023-05-12   Data Ethics, Data Literacy

Data Literacy is crucial for a sustainable engineering education [11]. In aiming to find solutions to solve future challenges, mechanical engineering has started to integrate data literacy into the higher education curriculum [13]. However, ethics are rarely considered in current frameworks. Ethics are seen as a side topic or are equated to data privacy issues [2]. Since literacy aims to [...]


PIA - A Concept for a Personal Information Assistant for Data Analysis and Machine Learning of Time-Continuous Data in Industrial Applications

Christopher Schnur, Tanja Dorst, Kapil Sajjan Deshmukh, et al.

2023-05-05   Data Governance, Data Literacy

A database with high-quality data must be available to fully use the potential of Artificial Intelligence (AI). Especially in small and medium-sized companies with little experience with AI, the underlying database quality is often insufficient. This results in an increased manual effort to process the data before using AI. In this contribution, the authors developed a concept to enable inexperienced [...]


RDM Platform Coscine - FAIR play integrated right from the start

Ilona Lang, Marcel Nellesen, Marius Politze

2023-05-05   Data Infrastructure, Data Management Software

Nowadays, researchers often need to distribute their research data among a multitude of service providers with varying (if any) levels of maturity in terms of FAIR research data management (RDM). To provide researchers with a single point of access to their project data and to add a FAIR layer to already established services, the RDM platform Coscine was developed. Within Coscine different [...]


From Ontology to Metadata: A Crawler for Script-based Workflows

Giuseppe Chiapparino, Benjamin Farnbacher, Nils Hoppe, et al.

2023-04-27   Data Management Software

The present work introduces HOMER (HPMC tool for Ontology-based Metadata Extraction and Re-use), a metadata crawler written in Python that automatically retrieves relevant research metadata from script-based workflows on HPC systems. The tool offers a flexible approach to metadata collection, as the metadata scheme can be read out from an ontology file. Through minimal user input, the [...]


Agile Research Data Management with Open Source: LinkAhead

Daniel Hornung, Florian Spreckelsen, Thomas Weiß

2023-03-28   Data Management Software

Research data management (RDM) in academic scientific environments is increasingly coming into focus as an important part of good scientific practice and as a topic with great potential for saving time and money. Nevertheless, there is a shortage of appropriate tools that fulfill the specific requirements of scientific research. We identified where the requirements in science deviate from other [...]


Betty’s (Re)Search Engine: A client-based search engine for research software stored in repositories.

Vasiliy Seibert, Andreas Rausch, Stefan Wittek

2023-03-09   Data Management Software

Promoting research without providing the source code that was used to conduct it means a greater effort for every researcher down the line. Existing solutions that aim to make research software FAIR [1] fail to provide a holistic solution, because they do not sufficiently consider existing research software stored on platforms like GitHub or organizational GitLab instances. We therefore [...]


Towards Improved Findability of Energy Research Software by Introducing a Metadata-based Registry

Stephan Ferenz, Astrid Nieße

2023-03-01   Data Infrastructure, Data Management Software

Research software in the energy domain is becoming increasingly important for the analysis, simulation, and optimization of energy systems and supports design decisions in the required transition of energy systems to tackle the climate crisis. To make energy research software (ERS) more findable, it should be described with metadata following the FAIR (findable, accessible, interoperable, and [...]


Beyond Data Literacy in Engineering Education

Samira Khodaei, Mihail Padev, Anas Abdelrazeq, et al.

2023-01-24   Data Ethics, Data Literacy

Data literacy is a key ingredient for engineering education [14]. Through digital transformation, more data are generated in different scientific fields that will be interpreted. As a highly applicable scientific field, mechanical engineering is predestined to integrate data literacy into the higher education curriculum [17]. However, current frameworks rarely consider ethical questions, [...]


Evaluation of tools for describing, reproducing and reusing scientific workflows

Philipp Diercks, Dennis Gläser, Ontje Lünsdorf, et al.

2022-12-06   Data Management Software

In the field of computational science and engineering, workflows often entail the application of various software, for instance, for simulation or pre- and postprocessing. Typically, these components have to be combined in arbitrarily complex workflows to address a specific research question. In order for peer researchers to understand, reproduce and (re)use the findings of a scientific [...]


plotID - a toolkit for connecting research data and visualization

Martin Hock, Hannes Mayr, Manuela Richter, et al.

2022-09-05   Data Management Software

The highest amount of published information on paper is contained in visualizations such as 2D and/or 3D plots. Supporting a generic research workflow, plotID provides tools that can a) create and anchor a reference (ID code, URL, ...) for and b) package figures, data, code and parameters used to create the figure. The code is provided as tools with small impact, that need to be used [...]