This presentation shows the experience of placing all the content of Wikipedia on a computer with 40 grams: the Raspberry Pi.
Slides 1-36 are introduction / demo. After that, there is a little "Do It Yourself".
14. Number of pages: 33,088,699
Number of articles: 4,530,955
Number of files: 831,598
Number of edits: 721,601,195
Number of users: 21,533,669
Number of admins: 1,410
As of the 7th of June, 2014:
22. pages-articles.xml.bz2 – Current
revisions only, no talk or user pages.
(This is probably the one you want.
The size of the 13 February 2014
dump is approximately 9.85 GB
compressed, 44 GB
uncompressed).
From http://en.wikipedia.org/wiki/Wikipedia:Database_download
45. Option 1: Evopedia
You can use the official
dump, just 9 GB on torrent!
But the reader is less
mature, and it is difficult to
place an external dump.
46. Option 2: MediaWiki (server)
Is what Wikipedia uses to
show pages.
A lot more work, a lot more
time processing, a lot less
performance on rpi.
47. Option 3: Kiwix
Very good in viewing
wikipedia pages and
making a simple server.
Downside: needs to
download own dump,
44 GB...
50. Option 1: only the server
It is already compiled for
ARM!
http://sourceforge.net/projects/kiwix/files/0.9_rc2/kiwix-server-0.9-rc2-
linux-armv5tejl.tar.bz2/download
kiwix-folder$ sudo make install
51. Option 2: everything
It is not compiled for
ARM...
http://www.kiwix.org/wiki/Compilation
kiwix-folder$ ./autogen.sh
Solve all the (damn) dependencies with sudo apt-get install
protip: read through the ./configure file to see the dependencies
Install xulrunner10-dev, not xulrunner-dev
kiwix-folder$ ./configure
kiwix-folder$ make
A LOT of hours later:
kiwix-folder$ make install
52. Finalizing:
$ kiwix
Open dump at SD/pen drive
Fix the index like this:
http://sourceforge.net/p/kiwix/bugs/275/
Profit!
53.
54.
55. Interesting additionals:
Make rpi conect
automatically on a wifi and
if it can’t make it create an
adhoc server
http://lcdev.dk/2012/11/18/raspberry-pi-tutorial-connect-to-wifi-or-create-an-encrypted-
dhcp-enabled-ad-hoc-network-as-fallback/
Or with a secure connection (what? WEP? no way!):
http://www.novitiate.co.uk/?p=183