Parser for the MediaWiki markup language for GHC Haskell.
This package only provides the library --- the development of executables that convert MediaWiki to another format is left to the dependee.
The only executable offered in this package is the mediawiki-links
utility which
accepts a MediaWiki XML dump file on stdin
and writes a set of link graph
edges to stdout
. The output format is a tab-separated text file with the
following columns,
- source node name
- link target node name
- link target namespace
- link anchor text
- Download and install Haskell Platform
- Run
cabal update
- Clone this repository,
git clone git://github.com/bgamari/mediawiki-parser
- Run
cabal install
from within the repository - The
597E mediawiki-import
executable can be found in~/.cabal/bin