End to End Software Engineering Research
2021-12-22Code Available0· sign in to hype
Idan Amit
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/evidencebp/e2eseOfficialIn papernone★ 4
Abstract
End to end learning is machine learning starting in raw data and predicting a desired concept, with all steps done automatically. In software engineering context, we see it as starting from the source code and predicting process metrics. This framework can be used for predicting defects, code quality, productivity and more. End-to-end improves over features based machine learning by not requiring domain experts and being able to extract new knowledge. We describe a dataset of 5M files from 15k projects constructed for this goal. The dataset is constructed in a way that enables not only predicting concepts but also investigating their causes.