Wednesday, September 24, 2014

Offline Wikipedia on Linux -- XOWA

1- Download XOWA for Linux (64-bit) from
http://sourceforge.net/projects/xowa/files/v1.9.4/xowa_app_linux_64_v1.9.4.1.zip/download

2- Download a Wikipedia dump (database or XML pages).
 Several kinds of dump are available, differing by
- language: English, Chinese, French, German, Japanese
- text only
- text with images
 Here we download the text-only XML dump (English):
http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2
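
Steps 1 and 2 can also be done from a terminal. The following is only a sketch, assuming wget is installed. The URLs are the ones quoted above; the local file names and the download_all helper are hypothetical names chosen for illustration.

```shell
#!/bin/sh
# URLs copied from steps 1 and 2.
XOWA_URL="http://sourceforge.net/projects/xowa/files/v1.9.4/xowa_app_linux_64_v1.9.4.1.zip/download"
DUMP_URL="http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-pages-articles.xml.bz2"

# Local file names (assumed; the SourceForge URL ends in "/download",
# so the zip name is given explicitly).
XOWA_ZIP="xowa_app_linux_64_v1.9.4.1.zip"
DUMP_FILE="enwiki-latest-pages-articles.xml.bz2"

# download_all: fetch both files into the current directory.
download_all() {
    # -O names the output file explicitly.
    wget -O "$XOWA_ZIP" "$XOWA_URL" || return 1
    # -c resumes a partial download, which helps with a multi-gigabyte dump.
    wget -c "$DUMP_URL"
}
```

Calling download_all from an empty working directory should leave both files there; nothing is downloaded until the function is called.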

3- Unzip the archive and run XOWA:
user@localhost$ sh xowa_linux_64.sh
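
Step 3 could be scripted roughly as follows. This is a sketch under assumptions: unzip is installed, and the archive places xowa_linux_64.sh at the top level of the target directory (not verified here). The function names unpack_xowa and launch_xowa are hypothetical.

```shell
#!/bin/sh
# unpack_xowa ZIP DIR: extract the XOWA archive into DIR.
unpack_xowa() {
    mkdir -p "$2" && unzip -q "$1" -d "$2"
}

# launch_xowa DIR: run the launcher script from step 3 inside DIR.
launch_xowa() {
    # cd inside a subshell so the caller's working directory is unchanged.
    ( cd "$1" && sh xowa_linux_64.sh )
}
```

For example, unpack_xowa xowa_app_linux_64_v1.9.4.1.zip xowa followed by launch_xowa xowa; the directory name xowa is arbitrary.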


4- Import the downloaded XML archive into XOWA.
Enter home/wiki/Help:Import/Script in the URL bar.
Choose "read from file", select the XML file we just downloaded, and choose "import now".
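
Because an import of this size can run for hours, it may be worth checking that the downloaded archive is intact before starting. A minimal sketch, assuming the bzip2 tool is installed; check_dump is a hypothetical helper and is not part of XOWA.

```shell
#!/bin/sh
# check_dump FILE: succeed only if FILE is a readable, intact bzip2 archive.
check_dump() {
    if [ ! -r "$1" ]; then
        echo "missing or unreadable: $1" >&2
        return 1
    fi
    # bzip2 -t tests integrity without writing any decompressed output.
    bzip2 -t "$1"
}
```

Usage would be check_dump enwiki-latest-pages-articles.xml.bz2, starting the import only if it returns success. On a dump this large the test itself can take a while.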

5- After 4-5 hours, depending on the computer's performance, XOWA will be ready for viewing. Type "simple.wikipedia.org" in the URL bar to open the main page; you can then search and read the entire Wikipedia offline.
