Do Java Programmers Write Better Python? – Studying Off-Language Code Quality on GitHub
There are style guides and best practices for many programming languages. Their goal is to promote uniformity and readability of code, consequentially reducing the chance of errors.
While programmers who are frequently using the same programming language tend to internalize most of its best practices eventually, little is known about what happens when they casually switch languages and write code in a less familiar language. Insights into the factors that lead to coding convention violations could help to improve tutorials for programmers switching languages, make teachers aware of mistakes they might expect depending on what language students have been using before, or influence the order in which programming languages are taught.
To approach this question, we make use of a large-scale data set representing a major part of the open source development activity happening on GitHub. In this data set, we search for Java and C++ programmers that occasionally program Python and study their Python code quality using a lint tool.
Comparing their defect rates to those from Python programmers reveals significant effects in both directions: We observe that some of Python’s best practices have more widespread adoption among Java and C++ programmers than Python experts. At the same time, python-specific coding conventions, especially indentation, scoping, and the use of semicolons, are violated more frequently.
We conclude that programming off-language is not generally associated with better or worse code quality, but individual coding conventions are violated more or less frequently depending on whether they are more universal or language-specific. We intend to motivate a discussion and more research on what causes these effects, how we can mitigate or use them for good, and which related effects can be studied using the presented data set.
Tue 10 AprDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
14:30 - 16:00 | |||
14:30 30mTalk | Do Java Programmers Write Better Python? – Studying Off-Language Code Quality on GitHub PX/18 Siegfried Horschig , Toni Mattis Hasso Plattner Institute, Robert Hirschfeld HPI, University of Potsdam | ||
15:00 30mTalk | The Fun of Being Wrong, or: The Game of Programmer vs. IDE PX/18 Friedrich Steimann Fernuniversität | ||
15:30 30mTalk | Few Versatile vs. Many Specialized Collections PX/18 Pre-print Media Attached |