In this work, we investigate the problem of revealing the functionality of a black-box agent. Notably, we are interested in the interpretable and formal description of the behavior of such an agent. Ideally, this description would take the form of a program written in a high-level language. This task is also known as reverse engineering and plays a pivotal role in software engineering, computer security, but also most recently in interpretability. In contrast to prior work, we do not rely on privileged information on the black box, but rather investigate the problem under a weaker assumption of having only access to inputs and outputs of the program. We approach this problem by iteratively refining a candidate set using a generative neural program synthesis approach until we arrive at a functionally equivalent program. We assess the performance of our approach on the Karel dataset. Our results show that the proposed approach outperforms the state-of-the-art on this challenge by finding an approximately functional equivalent program in 78% of cases -- even exceeding prior work that had privileged information on the black-box.
History
Preferred Citation
Hossein Hajipour, Mateusz Malinowski and Mario Fritz. IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis. In: International Workshop on Machine Learning in Software Engineering in conjuncture with ECML PKDD (). 2021.
Primary Research Area
Trustworthy Information Processing
Name of Conference
European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Database (ECML PKDD)
Legacy Posted Date
2021-12-14
Open Access Type
Unknown
BibTeX
@inproceedings{cispa_all_3541,
title = "IReEn: Iterative Reverse-Engineering of Black-Box Functions via Neural Program Synthesis",
author = "Hajipour, Hossein and Malinowski, Mateusz and Fritz, Mario",
booktitle="{International Workshop on Machine Learning in Software Engineering in conjuncture with ECML PKDD ()}",
year="2021",
}