Getting everything wrong without doing anything right! On the perils of large-scale analysis of Github data
Github has a wealth of data, trying to mine those data for insights about the software development process is irresistible. This talk is a cautionary tale of what can go wrong if care and healthy skepticism are not applied to the results obtained from data torture.
I will tell you about a study that aimed to link the choice of programming language to software defect and how that study failed at more or less every juncture. This talk will touch on how reproduction studies can help us regain trust in the results we cite and on how to make your work reproducible.
Jan Vitek is a Professor of Computer Science at Northeastern University. Dr. Vitek was born in Czechoslovakia and educated in Switzerland. Over the years, he worked on topics related to programming languages, their design, use, and implementation. With Noble and Potter, he proposed the notion of flexible alias control which became know as Ownership Types. He led the Ovm project which produced the first real-time Java virtual machine to be flight tested on a ScanEagle drone (he claims no one was harmed). Outcomes of this project include the Schism real-time garbage collector and the FijiVM – a production VM for embedded systems. More recently, he worked on dynamic languages, trying to make sense of JavaScript and to design a new language called, Thorn. Nowadays, he spends his time with statisticians and data scientists. Jan believes that his 2012 election as Chair of SIGPLAN was an accident; since has been busy trying to rock the boat to ensure this does not happen again. In his spare time, Jan enjoys organizing conferences and sitting on PCs (over 25 in the last decade). He founded the MOS (mobile objects), IWACO (alias control), STOP (gradual typing), and TRANSACT (transactional memory) workshop series. He was the first program chair of VEE and chaired ESOP, ECOOP, Coordination and TOOLS. He was the general chair of PLDI (in Beijing!), ISMM and LCTES. He may still be sitting on the steering committees of ECOOP, JTRES, ICFP, OOPLSA, POPL, PLDI, LCTES, ESOP.
Mon 15 JulDisplayed time zone: Belfast change
13:30 - 15:00 | |||
13:30 45mTalk | Formal Methods and Computer Networks: A Match Made in Heaven? Curry On Nate Foster Cornell University | ||
14:15 45mTalk | Getting everything wrong without doing anything right! On the perils of large-scale analysis of Github data Curry On Jan Vitek Northeastern University |