WP2TXT extracts plain text data from Wikipedia dump file (encoded in XML/compressed with Bzip2) stripping all the MediaWiki markups and other metadata.
Yoichiro Hasebe
January 24, 2013 4:18am
N/A