2015-06-18 05:28:40,245 DEBUG #7811 === Building rdf === 2015-06-18 05:28:40,245 INFO #7811 Making pg7811.rdf 2015-06-18 05:28:40,323 INFO #7811 Done pg7811.rdf 2015-06-25 05:22:00,172 DEBUG #7811 === Building rdf === 2015-06-25 05:22:00,173 INFO #7811 Making pg7811.rdf 2015-06-25 05:22:00,312 INFO #7811 Done pg7811.rdf 2015-07-02 05:31:05,980 DEBUG #7811 === Building rdf === 2015-07-02 05:31:05,980 INFO #7811 Making pg7811.rdf 2015-07-02 05:31:06,093 INFO #7811 Done pg7811.rdf 2015-07-04 10:14:19,786 DEBUG #7811 === Building txt.utf-8 === 2015-07-04 10:14:19,787 DEBUG #7811 Start of retrieval 2015-07-04 10:14:19,793 DEBUG #7811 ... got mediatype text/plain from guess_type 2015-07-04 10:14:19,793 DEBUG #7811 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/7811/7811-8.txt 2015-07-04 10:14:19,794 DEBUG #7811 GutenbergTextParser.pre_parse () ... 2015-07-04 10:14:19,794 DEBUG #7811 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/7811/7811-8.txt ... 2015-07-04 10:14:19,794 DEBUG #7811 End of retrieval 2015-07-04 10:14:19,794 INFO #7811 Creating plain text file: /public/vhost/g/gutenberg/html/cache/epub/7811/pg7811.txt.utf8 2015-07-04 10:14:19,794 DEBUG #7811 Fetching file:///public/vhost/g/gutenberg/html/files/7811/7811-8.txt ... 2015-07-04 10:14:19,871 INFO #7811 Got charset ISO-8859-1 from pg header 2015-07-04 10:14:19,871 DEBUG #7811 Trying to decode document with charset iso-8859-1 ... 2015-07-04 10:14:19,913 INFO #7811 Done plain text file: /public/vhost/g/gutenberg/html/cache/epub/7811/pg7811.txt.utf8 2015-07-04 10:14:19,914 INFO #7811 Creating Gzip file: /public/vhost/g/gutenberg/html/cache/epub/7811/pg7811.txt.utf8.gzip 2015-07-04 10:14:19,914 INFO #7811 Adding file: /public/vhost/g/gutenberg/html/cache/epub/7811/pg7811.txt.utf8 2015-07-04 10:14:20,124 INFO #7811 Done Zip file: /public/vhost/g/gutenberg/html/cache/epub/7811/pg7811.txt.utf8.gzip 2015-07-04 10:14:20,126 DEBUG #7811 === Building epub.images === 2015-07-04 10:14:20,127 DEBUG #7811 Start of retrieval 2015-07-04 10:14:20,132 DEBUG #7811 ... got mediatype text/html from guess_type 2015-07-04 10:14:20,132 DEBUG #7811 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/7811/7811-h/7811-h.htm 2015-07-04 10:14:20,132 DEBUG #7811 HTMLParser.pre_parse () ... 2015-07-04 10:14:20,132 DEBUG #7811 Fetching file:///public/vhost/g/gutenberg/html/files/7811/7811-h/7811-h.htm ... 2015-07-04 10:14:20,255 DEBUG #7811 Got charset UTF-8 from xml declaration 2015-07-04 10:14:20,255 DEBUG #7811 Trying to decode document with charset utf_8_sig ... 2015-07-04 10:14:20,342 ERROR #7811 etree.fromstring says: Entity 'ldquo' not defined, line 79, column 22 2015-07-04 10:14:20,348 ERROR #7811 Line 79: