Dirk Riehle's Industry and Research Publications

A systematic analysis of problems in open collaborative data engineering [TSC Journal]

Abstract

Collaborative workflows are common in open-source software development. They reduce individual costs and improve the quality of work results. Open data shares many characteristics with open-source software as it can be used, modified, and redistributed by anyone, for free. However, in contrast to open-source software engineering, collaborative data engineering on open data lacks a shared understanding of processes, methods, and tools. This article presents a systematic literature review of collaboration processes, methods, and tools in data engineering as performed by open data users. An additional interview study with practitioners confirms and enhances the findings and strengthens the resulting insights. We find an ecosystem with heterogeneous participants and no standardized processes, methods, and tools. Participants face a variety of technical and social challenges during their work. Our work provides a structured overview of collaboration systems in open collaborative data engineering, enabling further research. Additionally, we contribute preliminary guidelines for successful open collaborative data engineering projects and recommendations to increase its adoption for open data ecosystems.

Categories

Human-centered computing → Collaborative and social computing systems and tools; Empirical studies in collaborative and social computing; General and reference → Surveys and overviews

Keywords

Collaboration, data engineering, open data

Reference

Heltweg, P. & Riehle, D. (2023). A Systematic Analysis of Problems in Open Collaborative Data Engineering. ACM Transactions on Social Computing, vol. 6, no. 3-4, article no. 8, pp. 1-30.

Download

Available in the ACM Digital Library (local copy).


