<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="6.x">Drupal-Biblio</source-app><ref-type>5</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Gregorio Robles</style></author><author><style face="normal" font="default" size="100%">Jesus M. Gonzalez-Barahona</style></author><author><style face="normal" font="default" size="100%">Daniel Izquierdo-Cortazar</style></author><author><style face="normal" font="default" size="100%">Israel Herraiz</style></author></authors><secondary-authors><author><style face="normal" font="default" size="100%">Stefan Koch</style></author></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">Tools and Datasets for Mining Libre Software Repositories</style></title><secondary-title><style face="normal" font="default" size="100%">Multi-Disciplinary Advancement in Open Source Software and Processes</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">data mining</style></keyword><keyword><style  face="normal" font="default" size="100%">open source</style></keyword><keyword><style  face="normal" font="default" size="100%">tools</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2011</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.igi-global.com/book/multi-disciplinary-advancement-open-source/46171</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">IGI Global</style></publisher><pub-location><style face="normal" font="default" size="100%"> Hershey, PA</style></pub-location><volume><style face="normal" font="default" size="100%">1</style></volume><pages><style face="normal" font="default" size="100%">24–42</style></pages><isbn><style face="normal" font="default" size="100%">9781609605148</style></isbn><language><style face="normal" font="default" size="100%">English</style></language><abstract><style face="normal" font="default" size="100%">Thanks to the open nature of libre (free, open source) software projects, researchers have gained access to a rich set of data related to various aspects of software development. Although it is usually publicly available on the Internet, obtaining and analyzing the data in a convenient way is not an easy task, and many considerations have to be taken into account. In this chapter we introduce the most relevant data sources that can be found in libre software projects and that are commonly studied by scholars: source code releases, source code management systems, mailing lists and issue (bug) tracking systems. The chapter also provides some advice on the problems that can be found when retrieving and preparing the data sources for a later analysis, as well as information about the tools and datasets that support these tasks.</style></abstract><section><style face="normal" font="default" size="100%">2</style></section></record></records></xml>