ABSTRACT
Free and open source software systems (FOSS) are distributed and made available to users under different software licenses, mentioned in FOSS code by means of licensing statements. Various factors, such as changes in the legal landscape, commercial code licensed as FOSS, or code reused from other FOSS systems, lead to evolution of licensing, which may affect the way a system or part thereof can be subsequently used. Therefore, it is crucial to monitor licensing evolution. However, manually tracking the licensing evolution of thousands of files is a daunting task.
After presenting several cases of the effects of licensing evolution, we propose an approach to automatically track changes occurring in the licensing terms of a system. Then, we report an empirical study of the licensing evolution of six different FOSS systems. Results show that licensing underwent frequent and substantial changes.
- American Bar Association. An overview of "open source" software licenses. www.abanet.org/intelprop/opensource.html. Accessed Sept. 2009.Google Scholar
- J. Cohen. Statistical power analysis for the behavioral sciences (2nd ed.). Lawrence Earlbaum Associates, Hillsdale, NJ, 1988.Google Scholar
- M. Di Penta and D. M. Germán. Who are source code contributors and how do they change? In 16th Working Conference on Reverse Engineering, WCRE 2009, 13--16 October 2009, Lille, France, pages 11--20. IEEE Computer Society, 2009. Google ScholarDigital Library
- Eclipse Foundation. Eclipse Public License (EPL) Frequently Asked Questions, 2007. Accessed Dec. 2007.Google Scholar
- B. Fluri, M. Würsch, and H. Gall. Do code and comments co-evolve? on the relation between source code and comment changes. In 14th Working Conference on Reverse Engineering (WCRE 2007), pages 70--79, 2007. Google ScholarDigital Library
- W. B. Frakes and R. Baeza-Yates. Information Retrieval: Data Structures and Algorithms. Prentice-Hall, Englewood Cliffs, NJ, 1992. Google ScholarDigital Library
- Free Software Foundation. Various Licenses and Comments about Them. www.gnu.org/philosophy/license-list.html, 2009.Google Scholar
- D. M. Germán, M. Di Penta, Y.-G. Guéhéneuc, and G. Antoniol. Code siblings: Technical and legal implications. In Proc. of the 2009 Working Conference on Mining Software Repositories, MSR 2009, pages 81--90, 2009. Google ScholarDigital Library
- D. M. Germán and J. M. González-Barahona. An empirical study of the reuse of software licensed under the GNU General Public License. In Proceedings of the International Open Source Systems Conference (OSS'09), pages 185--198. Springer, 2009.Google ScholarCross Ref
- D. M. Germán and A. E. Hassan. License integration patterns: Addressing license mismatches in component-based development. In 31st International Conference on Software Engineering, ICSE 2009, May 16--24, 2009, Vancouver, Canada, Proceedings, pages 188--198. IEEE, 2009. Google ScholarDigital Library
- R. Gobeille. The FOSSology project. In In Proc. Fith International Workshop on Mining Software Repositories, pages 47--50, 2008. Google ScholarDigital Library
- D. Hartmeier. Design and Performance of the OpenBSD Stateful Packet Filter. www.benzedrine.cx/pf-paper.html. Accessed Sept. 2009.Google Scholar
- A. Hindle, D. M. Germán, and R. Holt. What do large commits tell us? a taxonomical study of large commits. In MSR '08: Proceedings of the 2008 international working conference on Mining software repositories, pages 99--108, May 2008. Google ScholarDigital Library
- Z. M. Jiang and A. E. Hassan. Examining the evolution of code comments in PostgreSQL. In Proceedings of the 2006 International Workshop on Mining Software Repositories, MSR 2006, pages 179--180, 2006. Google ScholarDigital Library
- C. Knudsen. Troll Tech's QPL. Linux J., (61es):10, May 1999. Google ScholarDigital Library
- D. Lawrie, C. Morrell, H. Feild, and D. Binkley. What's in a name? a study of identifiers. In 14th Intl. Conference on Program Comprehension (ICPC 2006), pages 3--12, 2006. Google ScholarDigital Library
- T. B. Lee. Court: violating copyleft = copyright infringement. http://arstechnica.com/tech-policy/news/2008/08/court-violating-copyleft-copyright-infringement.ars. Accessed Sept. 2009.Google Scholar
- Novell Inc. FAQ: Licensing. www.mono-project.com/Licensing, 2009.Google Scholar
- L. Rosen. Open Source Licensing: Software Freedom and Intellectual Property Law. Prentice Hall, 2004. Google ScholarDigital Library
- S. Shankland. CNET News: Open-source spat spurs software change. http://news.cnet.com/2100-1001-266399.html. Accessed Sept. 2009.Google Scholar
- S. Shankland. CNET News: Ximian changes open-source license. http://news.cnet.com/Ximian-changes-open-source-license/2100-1016_3-823734.html. Accessed Sept. 2009.Google Scholar
- R. K. Yin. Case Study Research: Design and Methods - Third Edition. SAGE Publications, London, 2002.Google Scholar
- A. T. T. Ying, J. L. Wright, and S. Abrams. Source code that talks: an exploration of Eclipse task comments and their implication to repository mining. In Proceedings of the 2005 International Workshop on Mining Software Repositories, MSR 2005, Saint Louis, Missouri, USA, May 17, 2005. ACM, 2005. Google ScholarDigital Library
Recommendations
Open Source License Inconsistencies on GitHub
Almost all software, open or closed, builds on open source software and therefore needs to comply with the license obligations of the open source code. Not knowing which licenses to comply with poses a legal danger to anyone using open source software. ...
Finding software license violations through binary code clone detection
MSR '11: Proceedings of the 8th Working Conference on Mining Software RepositoriesSoftware released in binary form frequently uses third-party packages without respecting their licensing terms. For instance, many consumer devices have firmware containing the Linux kernel, without the suppliers following the requirements of the GNU ...
A Study of Potential Code Borrowing and License Violations in Java Projects on GitHub
MSR '20: Proceedings of the 17th International Conference on Mining Software RepositoriesWith an ever-increasing amount of open-source software, the popularity of services like GitHub that facilitate code reuse, and common misconceptions about the licensing of open-source software, the problem of license violations in the code is getting ...
Comments