2015-06-18 05:21:38,465 DEBUG #5719 === Building rdf === 2015-06-18 05:21:38,466 INFO #5719 Making pg5719.rdf 2015-06-18 05:21:38,539 INFO #5719 Done pg5719.rdf 2015-06-25 05:16:33,273 DEBUG #5719 === Building rdf === 2015-06-25 05:16:33,273 INFO #5719 Making pg5719.rdf 2015-06-25 05:16:33,341 INFO #5719 Done pg5719.rdf 2015-07-02 05:22:48,777 DEBUG #5719 === Building rdf === 2015-07-02 05:22:48,777 INFO #5719 Making pg5719.rdf 2015-07-02 05:22:48,865 INFO #5719 Done pg5719.rdf 2015-07-03 04:16:14,578 DEBUG #5719 === Building txt.utf-8 === 2015-07-03 04:16:14,578 DEBUG #5719 Start of retrieval 2015-07-03 04:16:14,628 DEBUG #5719 ... got mediatype text/plain from guess_type 2015-07-03 04:16:14,628 DEBUG #5719 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/5719/5719.txt 2015-07-03 04:16:14,629 DEBUG #5719 GutenbergTextParser.pre_parse () ... 2015-07-03 04:16:14,629 DEBUG #5719 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/5719/5719.txt ... 2015-07-03 04:16:14,629 DEBUG #5719 End of retrieval 2015-07-03 04:16:14,629 INFO #5719 Creating plain text file: /public/vhost/g/gutenberg/html/cache/epub/5719/pg5719.txt.utf8 2015-07-03 04:16:14,630 DEBUG #5719 Fetching file:///public/vhost/g/gutenberg/html/files/5719/5719.txt ... 2015-07-03 04:16:14,744 INFO #5719 Got charset ASCII from pg header 2015-07-03 04:16:14,744 DEBUG #5719 Trying to decode document with charset ascii ... 2015-07-03 04:16:14,833 INFO #5719 Done plain text file: /public/vhost/g/gutenberg/html/cache/epub/5719/pg5719.txt.utf8 2015-07-03 04:16:14,841 INFO #5719 Creating Gzip file: /public/vhost/g/gutenberg/html/cache/epub/5719/pg5719.txt.utf8.gzip 2015-07-03 04:16:14,842 INFO #5719 Adding file: /public/vhost/g/gutenberg/html/cache/epub/5719/pg5719.txt.utf8 2015-07-03 04:16:15,297 INFO #5719 Done Zip file: /public/vhost/g/gutenberg/html/cache/epub/5719/pg5719.txt.utf8.gzip 2015-07-03 04:16:15,300 DEBUG #5719 === Building epub.images === 2015-07-03 04:16:15,300 DEBUG #5719 Start of retrieval 2015-07-03 04:16:15,342 DEBUG #5719 ... got mediatype text/html from guess_type 2015-07-03 04:16:15,342 DEBUG #5719 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/5719/5719-h/5719-h.htm 2015-07-03 04:16:15,342 DEBUG #5719 HTMLParser.pre_parse () ... 2015-07-03 04:16:15,343 DEBUG #5719 Fetching file:///public/vhost/g/gutenberg/html/files/5719/5719-h/5719-h.htm ... 2015-07-03 04:16:15,486 DEBUG #5719 Got charset utf-8 from html meta 2015-07-03 04:16:15,486 DEBUG #5719 Trying to decode document with charset utf_8_sig ... 2015-07-03 04:16:15,674 ERROR #5719 etree.fromstring says: Opening and ending tag mismatch: div line 129 and body, line 23732, column 8 2015-07-03 04:16:15,687 ERROR #5719 Line 23732:
2015-07-03 04:16:15,687 INFO #5719 Running html thru tidy. 2015-07-03 04:16:16,021 WARNING #5719 tidy: line 6 column 1 - too many title elements in
2015-07-03 04:16:16,023 INFO #5719 tidy: line 6 column 1 -
previously mentioned 2015-07-03 04:16:16,024 WARNING #5719 tidy: line 128 column 1 - missing 2015-07-03 04:16:16,024 WARNING #5719 tidy: line 5175 column 1 -