A Central and Evolving Benchmark (BenchWork 2019 - (2nd edition))

Mon 15 - Fri 19 July 2019 Hammersmith, London, United Kingdom

Who

Abhishek Tiwari, Christian Hammer

Track

BenchWork 2019

Time Zone

The program is currently displayed in (GMT+01:00) Belfast.

Use conference time zone: (GMT+01:00) BelfastSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 16 Jul 2019 13:30 - 14:00 at Bouzy - Benchmark Creation

Abstract

DroidBench, DIALDroid-Bench, and ICC-Bench are a few micro-benchmarks that evaluate the effectiveness of program analyses for Android applications (apps). These benchmarks contain small test sets to evaluate various static analysis problems. However, these benchmarks are not maintained, and the test cases they contain do not reflect real world problems. Consequently, the majority of Android analyses have good precision and recall rates on such micro-benchmarks, but fail to analyze real world apps. Recent research has shown that the majority of the Android analyses fail to keep their promises.

Benchmarks should be designed independently of a tool. However, as is, most of the Android specific benchmarks are designed and contributed by individual tool owners. Hence, the tools are designed to test the benchmarks not the other way around. Additionally, these benchmarks are not centrally located and sometimes unknown to the research community. To avoid aforementioned problems, a central benchmark with constant updates is required. This is a difficult problem and cannot be achieved by a few individual groups.

In this talk we propose a central and evolving benchmark where the community as a whole contributes. This benchmark contains various areas contributed by experts in these areas. The idea is to bring the community together to have a periodically upgrading benchmark, independent of any analysis tools, and with specific branches to test specific functionalities. As an example, researchers who are expert in points-to analysis would regularly submit various test cases to the branch evaluating points-to analysis.

File attachments

A Central and Evolving Benchmark (benchwork.pdf)	4.48MiB

Abhishek Tiwari

University of Potsdam

Germany

Christian Hammer