Getting everything wrong without doing anything right! On the perils of large-scale analysis of Github data
Github has a wealth of data, trying to mine those data for insights about the software development process is irresistible. This talk is a cautionary tale of what can go wrong if care and healthy skepticism are not applied to the results obtained from data torture.
I will tell you about a study that aimed to link the choice of programming language to software defect and how that study failed at more or less every juncture. This talk will touch on how reproduction studies can help us regain trust in the results we cite and on how to make your work reproducible.
Mon 15 JulDisplayed time zone: Belfast change
13:30 - 15:00
|Formal Methods and Computer Networks: A Match Made in Heaven?
Nate Foster Cornell University
|Getting everything wrong without doing anything right! On the perils of large-scale analysis of Github data
Jan Vitek Northeastern University