Posts: 1,522 | Thanked: 391 times | Joined on Jul 2010 @ São Paulo, Brazil
#1
I have a bunch of pages, including a TOC page, that I downloaded with wget (don't worry, the content is licensed under Creative Commons) and that I want to convert into an ebook for offline reading (the material is better suited to browsing on an ereader). Do you have any suggestions on what I could use to convert the site dump into an ebook (with images, text styling etc. surviving, of course)?

Ideally something I can use straight on my N900 without needing a regular PC to do the work. I do have access to Windows and Ubuntu desktops if really necessary, but I would rather just do it straight on the N900.
 
Posts: 992 | Thanked: 738 times | Joined on Jun 2010 @ Low Earth Orbit
#2
If any of the ebook readers on the N900 supports the Plucker format, then you could use that.

http://www.plkr.org/

It comes with this tool:

Plucker's "distiller" is a series of scripts, currently written in Python, which are used to "spider" or crawl webpages you specify, fetching more links and pages as it goes, until it reaches your specified maximum depth.
Alternatively, with the right wget options (in particular --convert-links) you could download everything and read it offline in a web browser. Google "wget mirror website".
 
Posts: 168 | Thanked: 58 times | Joined on Aug 2010 @ Vienna
#3
Originally Posted by TiagoTiago
I have a bunch of pages, including a TOC page, that I downloaded with wget (don't worry, the content is licensed under Creative Commons) and that I want to convert into an ebook for offline reading (the material is better suited to browsing on an ereader). Do you have any suggestions on what I could use to convert the site dump into an ebook (with images, text styling etc. surviving, of course)?

Ideally something I can use straight on my N900 without needing a regular PC to do the work. I do have access to Windows and Ubuntu desktops if really necessary, but I would rather just do it straight on the N900.
If you got the material with wget, you can have wget convert the links so the pages can be read locally in your browser, with no internet connection and no ebook reader application.
Something like
wget -kKmp -np http://svnbook.red-bean.com/en/1.5/index.html
afair.
Then you just have to look for the correct index.htm* page or similar.
;-)

Edit: you have to open it in the browser as
file:///<directory where you saved it>/svnbook.red-bean.com/en/1.5/index.html

Last edited by ME2g; 2010-12-05 at 17:43.
 
Posts: 1,522 | Thanked: 391 times | Joined on Jul 2010 @ São Paulo, Brazil
#4
I want to convert what I already downloaded into an ebook. I already have the pages saved on disk, with all the links and images working etc., but like I said, the content is better suited to reading on an ebook reader.

I'll look into that distiller thing later (I spent a little while earlier today trying to find the download link for the scripts, but couldn't find it before some other things came up that I had to deal with).
 
Posts: 2,802 | Thanked: 4,490 times | Joined on Nov 2007
#5
Originally Posted by TiagoTiago View Post
do you have any suggestions on what i could use to convert the site dump into an ebook
Calibre should do it, but it probably fails the "run on the N900" test (unless you run it in Easy Debian or something).
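
A rough sketch of the Calibre route, assuming Calibre is installed on a desktop and its ebook-convert command-line tool is on the PATH. As far as I know, ebook-convert takes an input file and an output file and picks the formats from the extensions; pointed at the TOC page, it should follow the local links from there. The paths and the function names are made up for illustration.

```python
import shutil
import subprocess


def calibre_command(toc_page, out_file):
    """Build the ebook-convert command line: input file first, then output file."""
    return ["ebook-convert", str(toc_page), str(out_file)]


def convert(toc_page, out_file):
    """Run Calibre's converter on the TOC page of the site dump."""
    # Fail early with a readable message if Calibre is not installed.
    if shutil.which("ebook-convert") is None:
        raise RuntimeError("ebook-convert not found; is Calibre installed?")
    subprocess.run(calibre_command(toc_page, out_file), check=True)
```

Usage would be something like convert("dump/index.html", "book.epub"), where dump/index.html is wherever wget saved the TOC page.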

If you want to roll your own converter, EPUB is probably the easiest format to target (it's essentially XHTML files inside a zip archive).
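
To give an idea of what rolling your own involves, here is a minimal sketch in Python that packs a directory of saved pages into an EPUB 2 file using only the standard library. It is not a full converter. It assumes the saved pages are already well-formed XHTML (a wget dump of ordinary HTML would usually need cleaning up first), that you pass the reading order yourself, and that the dump contains no files named content.opf or toc.ncx. The function name build_epub and its parameters are made up for this example.

```python
import mimetypes
import uuid
import zipfile
from pathlib import Path
from urllib.parse import quote
from xml.sax.saxutils import escape, quoteattr

CONTAINER_XML = """<?xml version="1.0" encoding="UTF-8"?>
<container version="1.0" xmlns="urn:oasis:names:tc:opendocument:xmlns:container">
  <rootfiles>
    <rootfile full-path="OEBPS/content.opf" media-type="application/oebps-package+xml"/>
  </rootfiles>
</container>
"""


def build_epub(src_dir, out_path, title, pages):
    """Pack src_dir into an EPUB 2 file; `pages` lists the XHTML files in reading order."""
    src = Path(src_dir)
    book_id = "urn:uuid:" + str(uuid.uuid4())
    files = sorted(p.relative_to(src).as_posix() for p in src.rglob("*") if p.is_file())

    # Manifest: every file in the book must be listed with its media type.
    manifest = ['<item id="ncx" href="toc.ncx" media-type="application/x-dtbncx+xml"/>']
    item_ids = {}
    for n, href in enumerate(files):
        item_ids[href] = f"item{n}"
        if href.lower().endswith((".html", ".htm", ".xhtml")):
            media = "application/xhtml+xml"
        else:
            media = mimetypes.guess_type(href)[0] or "application/octet-stream"
        manifest.append(
            f'<item id="item{n}" href={quoteattr(quote(href))} media-type="{media}"/>'
        )

    # Spine: the linear reading order. NCX: the table of contents shown by readers.
    spine = [f'<itemref idref="{item_ids[p]}"/>' for p in pages]
    nav_points = [
        f'<navPoint id="nav{n}" playOrder="{n}"><navLabel><text>{escape(p)}</text>'
        f'</navLabel><content src={quoteattr(quote(p))}/></navPoint>'
        for n, p in enumerate(pages, start=1)
    ]

    opf = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<package xmlns="http://www.idpf.org/2007/opf" version="2.0" '
        'unique-identifier="bookid">\n'
        '<metadata xmlns:dc="http://purl.org/dc/elements/1.1/">\n'
        f'<dc:title>{escape(title)}</dc:title>\n'
        '<dc:language>en</dc:language>\n'
        f'<dc:identifier id="bookid">{book_id}</dc:identifier>\n'
        '</metadata>\n'
        '<manifest>\n' + "\n".join(manifest) + '\n</manifest>\n'
        '<spine toc="ncx">\n' + "\n".join(spine) + '\n</spine>\n'
        '</package>\n'
    )
    ncx = (
        '<?xml version="1.0" encoding="UTF-8"?>\n'
        '<ncx xmlns="http://www.daisy.org/z3986/2005/ncx/" version="2005-1">\n'
        f'<head><meta name="dtb:uid" content="{book_id}"/></head>\n'
        f'<docTitle><text>{escape(title)}</text></docTitle>\n'
        '<navMap>\n' + "\n".join(nav_points) + '\n</navMap>\n'
        '</ncx>\n'
    )

    with zipfile.ZipFile(out_path, "w", compression=zipfile.ZIP_DEFLATED) as z:
        # The mimetype entry must come first and must be stored uncompressed.
        z.writestr("mimetype", "application/epub+zip", compress_type=zipfile.ZIP_STORED)
        z.writestr("META-INF/container.xml", CONTAINER_XML)
        z.writestr("OEBPS/content.opf", opf)
        z.writestr("OEBPS/toc.ncx", ncx)
        for href in files:
            z.write(src / href, "OEBPS/" + href)
```

Usage would be along the lines of build_epub("dump", "book.epub", "My Book", ["index.html", "ch01.html"]). The table of contents here just uses the file names as labels; a real converter would pull the titles out of the TOC page instead.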
 