2015-06-18 05:36:06,356 DEBUG #9662 === Building rdf ===
2015-06-18 05:36:06,356 INFO #9662 Making pg9662.rdf
2015-06-18 05:36:06,525 INFO #9662 Done pg9662.rdf
2015-06-25 05:27:12,977 DEBUG #9662 === Building rdf ===
2015-06-25 05:27:12,977 INFO #9662 Making pg9662.rdf
2015-06-25 05:27:13,052 INFO #9662 Done pg9662.rdf
2015-07-02 05:38:43,547 DEBUG #9662 === Building rdf ===
2015-07-02 05:38:43,547 INFO #9662 Making pg9662.rdf
2015-07-02 05:38:43,643 INFO #9662 Done pg9662.rdf
2015-07-05 03:40:57,231 DEBUG #9662 === Building txt.utf-8 ===
2015-07-05 03:40:57,231 DEBUG #9662 Start of retrieval
2015-07-05 03:40:57,239 DEBUG #9662 ... got mediatype text/plain from guess_type
2015-07-05 03:40:57,239 DEBUG #9662 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/9662/9662-8.txt
2015-07-05 03:40:57,240 DEBUG #9662 GutenbergTextParser.pre_parse () ...
2015-07-05 03:40:57,240 DEBUG #9662 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/9662/9662-8.txt ...
2015-07-05 03:40:57,240 DEBUG #9662 End of retrieval
2015-07-05 03:40:57,241 INFO #9662 Creating plain text file: /public/vhost/g/gutenberg/html/cache/epub/9662/pg9662.txt.utf8
2015-07-05 03:40:57,241 DEBUG #9662 Fetching file:///public/vhost/g/gutenberg/html/files/9662/9662-8.txt ...
2015-07-05 03:40:57,304 INFO #9662 Got charset ISO-8859-1 from pg header
2015-07-05 03:40:57,304 DEBUG #9662 Trying to decode document with charset iso-8859-1 ...
2015-07-05 03:40:57,334 INFO #9662 Done plain text file: /public/vhost/g/gutenberg/html/cache/epub/9662/pg9662.txt.utf8
2015-07-05 03:40:57,335 INFO #9662 Creating Gzip file: /public/vhost/g/gutenberg/html/cache/epub/9662/pg9662.txt.utf8.gzip
2015-07-05 03:40:57,335 INFO #9662 Adding file: /public/vhost/g/gutenberg/html/cache/epub/9662/pg9662.txt.utf8
2015-07-05 03:40:57,493 INFO #9662 Done Zip file: /public/vhost/g/gutenberg/html/cache/epub/9662/pg9662.txt.utf8.gzip
2015-07-05 03:40:57,495 DEBUG #9662 === Building epub.images ===
2015-07-05 03:40:57,495 DEBUG #9662 Start of retrieval
2015-07-05 03:40:57,523 DEBUG #9662 ... got mediatype text/html from guess_type
2015-07-05 03:40:57,524 DEBUG #9662 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/9662/9662-h/9662-h.htm
2015-07-05 03:40:57,524 DEBUG #9662 HTMLParser.pre_parse () ...
2015-07-05 03:40:57,524 DEBUG #9662 Fetching file:///public/vhost/g/gutenberg/html/files/9662/9662-h/9662-h.htm ...
2015-07-05 03:40:57,611 DEBUG #9662 Got charset iso-8859-1 from html meta
2015-07-05 03:40:57,611 DEBUG #9662 Trying to decode document with charset iso-8859-1 ...
2015-07-05 03:40:57,623 INFO #9662 Running html thru tidy.
2015-07-05 03:40:57,750 WARNING #9662 tidy: line 1 column 1 - missing declaration
2015-07-05 03:40:57,752 WARNING #9662 tidy: line 1 column 1 - plain text isn't allowed in elements
2015-07-05 03:40:57,752 INFO #9662 tidy: line 1 column 1 - previously mentioned
2015-07-05 03:40:57,752 WARNING #9662 tidy: line 1 column 1 - inserting implicit
2015-07-05 03:40:57,753 WARNING #9662 tidy: line 1 column 1 - isn't allowed after elements
2015-07-05 03:40:57,753 WARNING #9662 tidy: line 3 column 1 - discarding unexpected
2015-07-05 03:40:57,753 WARNING #9662 tidy: line 4 column 3 - isn't allowed in elements
2015-07-05 03:40:57,753 INFO #9662 tidy: line 1 column 1 - previously mentioned
2015-07-05 03:40:57,753 WARNING #9662 tidy: line 5 column 5 - isn't allowed in elements
2015-07-05 03:40:57,754 INFO #9662 tidy: line 1 column 1 - previously mentioned
2015-07-05 03:40:57,754 WARNING #9662 tidy: line 6 column 5 -