The dataset of duplicate pull-requests in GitHub
The attached file contains a dataset of duplicate pull-requests collected from three popular projects hosted in GitHub.
** Please do not hesitate to point it out if someone find there is and mistake in the data. **