A Model of the Commit Size Distribution of Open Source [SOFSEM 2013]

Abstract: A fundamental unit of work in programming is the code contribution (“commit”) that a developer makes to the code base of the project they are working on. We use statistical methods to derive a model of the probability distribution of commit sizes in open source projects, and we show that the model applies to projects of different sizes. We use both graphical and statistical methods to validate the goodness of fit of our model. By measuring and modeling a fundamental dimension of programming, we help improve software development tools and our understanding of software development.

Keywords: Open source; commit sizes; commit size distribution; configuration management; code contributions.

Reference: Carsten Kolassa, Dirk Riehle, Michel A. Salim. “A Model of the Commit Size Distribution of Open Source.” In Proceedings of the 39th International Conference on Current Trends in Theory and Practice of Computer Science (SOFSEM 2013), LNCS 7741. Springer-Verlag, 2013. Pages 52-66.

The paper is available as a PDF file.


  1. Jianshen Ou

    very interested topics
