Libresoft publishes Wikipedia research dumps
Libresoft is proud to announce that our Wikipedia research dumps are publicly available for downloading.
Since just a few hours ago, Wikipedia research dumps have been published on a repository hosted by RedIRIS. These dumps will save time and effort to all Wikipedia researchers worldwide, eliminating the previous need to parse the huge XML dumps, published by Wikimedia Foundation on its download center, to extract all activity metadata corresponding to a certain language version.
The new repositories can be accessed either by HTTP or FTP.
Wikipedia research dumps have been created using WikiXRay, our tool to automate the analysis of any language version of Wikipedia. These are compressed mysqldump files, that can be easily loaded again on any local database for research purposes.
The first research dumps only include some of the biggest versions of Wikipedia. In the following days, the complete set of dumps for all available versions in Wikipedia will be published. The dumps will be updated on a regular based, depending on the availability of the original XML dumps provided by Wikimedia Foundation.
Document Actions


