Improving the extraction of Wikipedia data
Published by on
I am happy to share some recent performance results of a new parser for Wikipedia data dumps that I have developed over the past 2 months.
The new parser is also written in Python, as it was its predecessor included in WikiXRay. However, this new parser comes with notable improvements in speed and accuracy:
