lacks "summary" attribute
2020-08-27 08:00:38,867 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 08:00:38,867 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 08:00:38,910 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:39,792 WARNING #63052 37 elements having class pb have been rewritten.
2020-08-27 08:00:40,530 INFO #63052 Creating Epub file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052-images.epub
2020-08-27 08:00:40,602 INFO #63052 Done Epub file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052-images.epub
2020-08-27 08:00:40,608 INFO #63052 epub.images made in 0:00:01.767890
2020-08-27 08:00:40,610 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:41,029 WARNING #63052 37 elements having class pb have been rewritten.
2020-08-27 08:00:41,761 INFO #63052 Creating Epub file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052.epub
2020-08-27 08:00:41,829 INFO #63052 Done Epub file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052.epub
2020-08-27 08:00:41,830 INFO #63052 epub.noimages made in 0:00:01.220755
2020-08-27 08:00:41,833 INFO #63052 Creating Kindle file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052-images.mobi
2020-08-27 08:00:41,833 INFO #63052 ... from: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052-images.epub
2020-08-27 08:00:43,047 WARNING #63052 kindlegen: W14019: Cover is too small : /tmp/mobi-WusZTZ/OEBPS/@public@vhost@g@gutenberg@html@files@63052@63052-h@images@cover.jpg
2020-08-27 08:00:43,048 INFO #63052 Done Kindle file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052-images.mobi
2020-08-27 08:00:43,048 INFO #63052 kindle.images made in 0:00:01.215644
2020-08-27 08:00:43,049 INFO #63052 Creating Kindle file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052.mobi
2020-08-27 08:00:43,049 INFO #63052 ... from: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052.epub
2020-08-27 08:00:43,630 WARNING #63052 kindlegen: W14019: Cover is too small : /tmp/mobi-zAOFH9/OEBPS/@public@vhost@g@gutenberg@html@files@63052@63052-h@images@cover.jpg
2020-08-27 08:00:43,630 INFO #63052 Done Kindle file: /export/sunsite/users/gutenbackend/cache/epub/63052/pg63052.mobi
2020-08-27 08:00:43,631 INFO #63052 kindle.noimages made in 0:00:00.581815
2020-08-27 08:00:43,633 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:43,646 INFO #63052 Making pg63052.cover.small.jpg
2020-08-27 08:00:43,658 INFO #63052 Found coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 08:00:43,658 INFO #63052 Done pg63052.cover.small.jpg
2020-08-27 08:00:43,658 INFO #63052 cover.small made in 0:00:00.026801
2020-08-27 08:00:43,660 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:43,672 INFO #63052 Making pg63052.cover.medium.jpg
2020-08-27 08:00:43,685 INFO #63052 Found coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 08:00:43,685 INFO #63052 Done pg63052.cover.medium.jpg
2020-08-27 08:00:43,685 INFO #63052 cover.medium made in 0:00:00.026267
2020-08-27 08:00:43,686 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:43,699 INFO #63052 Making pg63052.qrcode.png
2020-08-27 08:00:43,715 INFO #63052 Done pg63052.qrcode.png
2020-08-27 08:00:43,715 INFO #63052 qrcode made in 0:00:00.028930
2020-08-27 08:00:43,716 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 08:00:43,731 INFO #63052 Making pg63052.rdf
2020-08-27 08:00:43,751 INFO #63052 Done pg63052.rdf
2020-08-27 08:00:43,751 INFO #63052 rdf made in 0:00:00.035716
2020-08-27 08:20:08,853 DEBUG #63052 === Building facebook ===
2020-08-27 08:20:08,853 DEBUG #63052 Start of retrieval
2020-08-27 08:20:08,857 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 08:20:08,857 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 08:20:08,857 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 08:20:08,857 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 08:20:08,859 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 08:20:08,859 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 08:20:08,861 INFO #63052 Running html thru tidy.
2020-08-27 08:20:08,879 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 09:20:29,490 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 09:20:29,490 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 09:20:29,530 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 09:20:29,530 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 09:20:29,530 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 09:20:29,530 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,531 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 09:20:29,531 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,531 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 09:20:29,531 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 09:20:29,531 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,531 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,532 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,532 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,532 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,532 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,533 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,533 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,533 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,533 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,533 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,534 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,534 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,534 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,536 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,536 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,537 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,538 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,538 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,539 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,540 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,541 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,541 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,541 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,545 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 09:20:29,545 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 09:20:29,546 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 09:20:29,546 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 09:20:29,546 DEBUG #63052 End of retrieval
2020-08-27 09:20:29,555 DEBUG #63052 Connecting to database ...
2020-08-27 09:20:29,569 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 09:20:29,571 INFO #63052 already posted, no new Facebook post
2020-08-27 09:20:29,571 INFO #63052 facebook made in 0:00:00.106384
2020-08-27 09:20:29,572 DEBUG #63052 === Building twitter ===
2020-08-27 09:20:29,572 DEBUG #63052 Start of retrieval
2020-08-27 09:20:29,572 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 09:20:29,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,573 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 09:20:29,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,573 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 09:20:29,573 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 09:20:29,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,577 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,578 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,578 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,580 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,580 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,581 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,581 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,582 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,583 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,583 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,583 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 09:20:29,587 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 09:20:29,587 DEBUG #63052 End of retrieval
2020-08-27 09:20:29,587 DEBUG #63052 Connecting to database ...
2020-08-27 09:20:29,598 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 09:20:29,600 INFO #63052 twitter made in 0:00:00.028229
2020-08-27 10:20:12,435 DEBUG #63052 === Building facebook ===
2020-08-27 10:20:12,435 DEBUG #63052 Start of retrieval
2020-08-27 10:20:12,438 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 10:20:12,439 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 10:20:12,439 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 10:20:12,439 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 10:20:12,440 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 10:20:12,440 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 10:20:12,442 INFO #63052 Running html thru tidy.
2020-08-27 10:20:12,460 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 10:20:12,461 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 10:20:12,461 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 10:20:12,501 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 10:20:12,501 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 10:20:12,501 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 10:20:12,501 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,501 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 10:20:12,501 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,501 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 10:20:12,502 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 10:20:12,502 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,502 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,502 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,503 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,503 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,503 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,503 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,503 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,504 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,504 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,504 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,504 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,504 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,505 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,507 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,507 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,508 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,508 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,509 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,509 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,510 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,511 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,511 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,512 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,516 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 10:20:12,516 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 10:20:12,516 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 10:20:12,516 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 10:20:12,516 DEBUG #63052 End of retrieval
2020-08-27 10:20:12,525 DEBUG #63052 Connecting to database ...
2020-08-27 10:20:12,543 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 10:20:12,544 INFO #63052 already posted, no new Facebook post
2020-08-27 10:20:12,545 INFO #63052 facebook made in 0:00:00.109212
2020-08-27 10:20:12,545 DEBUG #63052 === Building twitter ===
2020-08-27 10:20:12,546 DEBUG #63052 Start of retrieval
2020-08-27 10:20:12,546 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 10:20:12,546 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,546 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 10:20:12,546 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,547 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 10:20:12,547 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 10:20:12,547 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,547 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,547 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,548 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,548 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,548 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,548 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,549 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,549 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,549 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,549 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,549 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,550 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,550 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,552 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,552 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,553 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,553 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,554 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,554 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,555 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,556 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,557 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,557 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 10:20:12,560 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 10:20:12,560 DEBUG #63052 End of retrieval
2020-08-27 10:20:12,561 DEBUG #63052 Connecting to database ...
2020-08-27 10:20:12,573 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 10:20:12,575 INFO #63052 twitter made in 0:00:00.029614
2020-08-27 11:20:12,759 DEBUG #63052 === Building facebook ===
2020-08-27 11:20:12,759 DEBUG #63052 Start of retrieval
2020-08-27 11:20:12,762 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 11:20:12,762 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 11:20:12,762 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 11:20:12,762 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 11:20:12,764 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 11:20:12,764 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 11:20:12,766 INFO #63052 Running html thru tidy.
2020-08-27 11:20:12,784 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 11:20:12,785 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 11:20:12,785 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 11:20:12,824 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 11:20:12,824 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 11:20:12,824 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 11:20:12,824 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,824 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 11:20:12,825 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,825 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 11:20:12,825 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 11:20:12,825 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,825 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,825 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,826 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,826 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,826 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,826 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,827 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,827 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,827 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,827 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,827 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,828 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,828 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,830 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,830 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,831 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,831 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,832 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,832 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,833 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,834 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,834 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,835 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,839 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 11:20:12,839 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 11:20:12,839 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 11:20:12,839 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 11:20:12,839 DEBUG #63052 End of retrieval
2020-08-27 11:20:12,848 DEBUG #63052 Connecting to database ...
2020-08-27 11:20:12,862 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 11:20:12,864 INFO #63052 already posted, no new Facebook post
2020-08-27 11:20:12,864 INFO #63052 facebook made in 0:00:00.105016
2020-08-27 11:20:12,865 DEBUG #63052 === Building twitter ===
2020-08-27 11:20:12,865 DEBUG #63052 Start of retrieval
2020-08-27 11:20:12,865 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 11:20:12,866 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,866 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 11:20:12,866 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,866 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 11:20:12,866 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 11:20:12,866 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,867 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,867 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,867 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,867 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,867 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,868 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,868 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,868 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,868 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,869 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,869 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,869 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,869 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,871 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,871 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,872 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,873 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,873 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,874 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,875 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,876 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,876 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,876 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 11:20:12,879 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 11:20:12,879 DEBUG #63052 End of retrieval
2020-08-27 11:20:12,880 DEBUG #63052 Connecting to database ...
2020-08-27 11:20:12,891 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 11:20:12,899 INFO #63052 twitter made in 0:00:00.034611
2020-08-27 12:20:14,161 DEBUG #63052 === Building facebook ===
2020-08-27 12:20:14,161 DEBUG #63052 Start of retrieval
2020-08-27 12:20:14,165 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 12:20:14,165 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 12:20:14,165 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 12:20:14,165 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 12:20:14,166 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 12:20:14,166 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 12:20:14,168 INFO #63052 Running html thru tidy.
2020-08-27 12:20:14,186 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 12:20:14,187 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 12:20:14,187 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 12:20:14,226 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 12:20:14,226 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 12:20:14,226 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 12:20:14,227 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,227 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 12:20:14,227 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,227 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 12:20:14,227 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 12:20:14,228 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,228 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,228 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,228 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,228 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,229 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,229 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,229 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,229 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,229 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,230 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,230 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,230 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,231 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,232 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,232 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,233 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,235 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,236 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,237 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,237 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,237 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,241 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 12:20:14,241 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 12:20:14,241 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 12:20:14,242 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 12:20:14,242 DEBUG #63052 End of retrieval
2020-08-27 12:20:14,250 DEBUG #63052 Connecting to database ...
2020-08-27 12:20:14,264 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 12:20:14,266 INFO #63052 already posted, no new Facebook post
2020-08-27 12:20:14,266 INFO #63052 facebook made in 0:00:00.105023
2020-08-27 12:20:14,267 DEBUG #63052 === Building twitter ===
2020-08-27 12:20:14,267 DEBUG #63052 Start of retrieval
2020-08-27 12:20:14,267 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 12:20:14,268 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,268 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 12:20:14,268 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,268 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 12:20:14,268 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 12:20:14,269 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,269 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,269 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,269 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,269 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,270 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,270 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,270 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,270 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,270 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,271 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,271 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,271 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,272 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,273 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,273 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,276 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,277 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,278 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,278 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,278 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 12:20:14,281 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 12:20:14,281 DEBUG #63052 End of retrieval
2020-08-27 12:20:14,282 DEBUG #63052 Connecting to database ...
2020-08-27 12:20:14,304 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 12:20:14,306 INFO #63052 twitter made in 0:00:00.039266
2020-08-27 13:20:12,630 DEBUG #63052 === Building facebook ===
2020-08-27 13:20:12,630 DEBUG #63052 Start of retrieval
2020-08-27 13:20:12,633 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 13:20:12,633 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 13:20:12,633 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 13:20:12,633 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 13:20:12,635 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 13:20:12,635 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 13:20:12,637 INFO #63052 Running html thru tidy.
2020-08-27 13:20:12,654 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 13:20:12,655 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 13:20:12,655 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 13:20:12,694 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 13:20:12,694 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 13:20:12,694 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 13:20:12,695 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,695 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 13:20:12,695 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,695 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 13:20:12,695 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 13:20:12,696 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,696 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,696 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,696 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,697 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,698 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,698 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,698 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,699 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,700 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,700 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,701 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,702 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,703 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,703 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,704 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,705 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,705 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,705 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,710 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 13:20:12,710 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 13:20:12,710 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 13:20:12,710 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 13:20:12,710 DEBUG #63052 End of retrieval
2020-08-27 13:20:12,719 DEBUG #63052 Connecting to database ...
2020-08-27 13:20:12,733 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 13:20:12,735 INFO #63052 already posted, no new Facebook post
2020-08-27 13:20:12,735 INFO #63052 facebook made in 0:00:00.104685
2020-08-27 13:20:12,736 DEBUG #63052 === Building twitter ===
2020-08-27 13:20:12,736 DEBUG #63052 Start of retrieval
2020-08-27 13:20:12,736 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 13:20:12,736 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,737 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 13:20:12,737 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,737 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 13:20:12,737 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 13:20:12,737 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,737 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,738 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,738 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,738 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,738 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,739 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,739 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,739 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,739 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,739 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,740 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,740 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,740 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,742 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,742 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,743 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,743 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,744 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,744 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,745 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,746 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,747 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,747 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 13:20:12,750 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 13:20:12,750 DEBUG #63052 End of retrieval
2020-08-27 13:20:12,751 DEBUG #63052 Connecting to database ...
2020-08-27 13:20:12,762 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 13:20:12,764 INFO #63052 twitter made in 0:00:00.028323
2020-08-27 14:20:28,217 DEBUG #63052 === Building facebook ===
2020-08-27 14:20:28,217 DEBUG #63052 Start of retrieval
2020-08-27 14:20:28,220 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 14:20:28,221 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 14:20:28,221 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 14:20:28,221 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 14:20:28,222 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 14:20:28,222 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 14:20:28,224 INFO #63052 Running html thru tidy.
2020-08-27 14:20:28,242 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 14:20:28,243 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 14:20:28,243 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 14:20:28,282 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 14:20:28,282 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 14:20:28,282 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 14:20:28,282 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,283 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 14:20:28,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,283 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 14:20:28,283 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 14:20:28,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,284 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,284 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,284 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,284 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,284 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,285 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,285 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,285 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,285 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,286 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,286 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,286 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,288 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,288 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,289 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,289 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,290 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,290 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,291 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,292 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,293 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,293 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,297 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 14:20:28,297 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 14:20:28,297 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 14:20:28,297 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 14:20:28,297 DEBUG #63052 End of retrieval
2020-08-27 14:20:28,307 DEBUG #63052 Connecting to database ...
2020-08-27 14:20:28,321 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 14:20:28,323 INFO #63052 already posted, no new Facebook post
2020-08-27 14:20:28,323 INFO #63052 facebook made in 0:00:00.105604
2020-08-27 14:20:28,324 DEBUG #63052 === Building twitter ===
2020-08-27 14:20:28,324 DEBUG #63052 Start of retrieval
2020-08-27 14:20:28,324 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 14:20:28,324 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,325 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 14:20:28,325 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,325 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 14:20:28,325 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 14:20:28,325 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,326 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,326 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,326 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,326 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,326 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,327 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,327 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,327 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,327 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,327 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,328 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,328 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,328 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,330 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,330 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,331 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,332 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,332 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,333 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,334 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,335 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,335 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,335 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 14:20:28,338 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 14:20:28,338 DEBUG #63052 End of retrieval
2020-08-27 14:20:28,339 DEBUG #63052 Connecting to database ...
2020-08-27 14:20:28,351 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 14:20:28,353 INFO #63052 twitter made in 0:00:00.029649
2020-08-27 15:20:16,952 DEBUG #63052 === Building facebook ===
2020-08-27 15:20:16,952 DEBUG #63052 Start of retrieval
2020-08-27 15:20:16,955 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 15:20:16,955 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 15:20:16,955 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 15:20:16,955 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 15:20:16,956 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 15:20:16,956 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 15:20:16,959 INFO #63052 Running html thru tidy.
2020-08-27 15:20:16,977 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 15:20:16,978 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 15:20:16,978 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 15:20:17,017 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 15:20:17,017 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 15:20:17,017 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 15:20:17,018 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,018 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 15:20:17,018 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,018 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 15:20:17,018 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 15:20:17,019 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,019 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,019 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,019 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,019 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,020 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,020 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,020 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,020 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,020 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,021 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,021 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,021 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,022 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,023 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,023 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,024 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,025 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,025 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,026 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,027 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,028 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,028 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,028 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,033 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 15:20:17,033 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 15:20:17,033 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 15:20:17,033 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 15:20:17,033 DEBUG #63052 End of retrieval
2020-08-27 15:20:17,043 DEBUG #63052 Connecting to database ...
2020-08-27 15:20:17,062 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 15:20:17,065 INFO #63052 already posted, no new Facebook post
2020-08-27 15:20:17,065 INFO #63052 facebook made in 0:00:00.113192
2020-08-27 15:20:17,066 DEBUG #63052 === Building twitter ===
2020-08-27 15:20:17,066 DEBUG #63052 Start of retrieval
2020-08-27 15:20:17,066 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 15:20:17,067 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,067 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 15:20:17,067 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,067 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 15:20:17,067 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 15:20:17,068 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,068 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,068 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,068 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,068 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,069 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,069 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,069 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,069 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,069 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,070 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,070 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,070 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,071 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,072 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,072 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,073 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,074 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,074 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,075 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,076 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,077 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,077 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,077 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 15:20:17,080 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 15:20:17,080 DEBUG #63052 End of retrieval
2020-08-27 15:20:17,081 DEBUG #63052 Connecting to database ...
2020-08-27 15:20:17,094 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 15:20:17,096 INFO #63052 twitter made in 0:00:00.030189
2020-08-27 16:20:26,683 DEBUG #63052 === Building facebook ===
2020-08-27 16:20:26,683 DEBUG #63052 Start of retrieval
2020-08-27 16:20:26,686 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 16:20:26,687 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 16:20:26,687 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 16:20:26,687 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 16:20:26,688 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 16:20:26,688 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 16:20:26,690 INFO #63052 Running html thru tidy.
2020-08-27 16:20:26,708 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 16:20:26,709 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 16:20:26,709 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 16:20:26,748 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 16:20:26,748 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 16:20:26,748 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 16:20:26,749 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,749 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 16:20:26,749 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,749 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 16:20:26,749 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 16:20:26,749 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,750 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,750 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,750 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,750 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,750 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,751 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,751 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,751 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,751 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,752 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,752 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,752 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,752 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,754 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,754 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,755 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,756 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,756 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,757 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,758 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,759 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,759 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,759 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,763 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 16:20:26,763 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 16:20:26,763 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 16:20:26,763 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 16:20:26,764 DEBUG #63052 End of retrieval
2020-08-27 16:20:26,772 DEBUG #63052 Connecting to database ...
2020-08-27 16:20:26,786 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 16:20:26,788 INFO #63052 already posted, no new Facebook post
2020-08-27 16:20:26,788 INFO #63052 facebook made in 0:00:00.104596
2020-08-27 16:20:26,789 DEBUG #63052 === Building twitter ===
2020-08-27 16:20:26,789 DEBUG #63052 Start of retrieval
2020-08-27 16:20:26,789 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 16:20:26,790 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,790 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 16:20:26,790 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,790 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 16:20:26,790 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 16:20:26,790 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,791 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,791 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,791 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,791 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,791 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,792 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,792 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,792 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,792 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,793 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,793 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,793 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,793 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,795 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,795 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,796 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,797 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,797 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,798 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,799 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,800 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,800 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,800 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 16:20:26,803 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 16:20:26,803 DEBUG #63052 End of retrieval
2020-08-27 16:20:26,804 DEBUG #63052 Connecting to database ...
2020-08-27 16:20:26,815 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 16:20:26,817 INFO #63052 twitter made in 0:00:00.027816
2020-08-27 17:20:18,282 DEBUG #63052 === Building facebook ===
2020-08-27 17:20:18,282 DEBUG #63052 Start of retrieval
2020-08-27 17:20:18,286 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 17:20:18,286 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 17:20:18,286 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 17:20:18,286 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 17:20:18,288 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 17:20:18,288 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 17:20:18,290 INFO #63052 Running html thru tidy.
2020-08-27 17:20:18,308 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 17:20:18,309 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 17:20:18,309 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 17:20:18,349 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 17:20:18,350 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 17:20:18,350 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 17:20:18,350 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,350 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 17:20:18,350 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,350 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 17:20:18,351 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 17:20:18,351 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,351 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,351 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,352 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,352 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,352 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,352 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,353 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,353 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,353 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,353 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,353 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,354 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,354 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,356 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,356 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,358 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,358 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,359 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,359 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,360 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,361 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,361 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,362 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,366 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 17:20:18,367 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 17:20:18,367 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 17:20:18,367 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 17:20:18,367 DEBUG #63052 End of retrieval
2020-08-27 17:20:18,377 DEBUG #63052 Connecting to database ...
2020-08-27 17:20:18,390 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 17:20:18,392 INFO #63052 already posted, no new Facebook post
2020-08-27 17:20:18,393 INFO #63052 facebook made in 0:00:00.110374
2020-08-27 17:20:18,393 DEBUG #63052 === Building twitter ===
2020-08-27 17:20:18,394 DEBUG #63052 Start of retrieval
2020-08-27 17:20:18,394 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 17:20:18,394 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,395 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 17:20:18,395 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,395 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 17:20:18,395 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 17:20:18,395 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,395 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,396 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,396 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,396 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,396 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,396 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,397 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,397 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,397 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,397 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,397 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,398 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,398 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,400 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,400 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,401 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,401 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,402 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,402 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,403 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 17:20:18,408 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 17:20:18,408 DEBUG #63052 End of retrieval
2020-08-27 17:20:18,409 DEBUG #63052 Connecting to database ...
2020-08-27 17:20:18,419 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 17:20:18,421 INFO #63052 twitter made in 0:00:00.027647
2020-08-27 18:20:18,507 DEBUG #63052 === Building facebook ===
2020-08-27 18:20:18,507 DEBUG #63052 Start of retrieval
2020-08-27 18:20:18,510 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 18:20:18,510 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 18:20:18,510 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 18:20:18,510 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 18:20:18,512 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 18:20:18,512 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 18:20:18,514 INFO #63052 Running html thru tidy.
2020-08-27 18:20:18,532 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 18:20:18,533 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 18:20:18,533 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 18:20:18,572 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 18:20:18,572 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 18:20:18,572 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 18:20:18,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,573 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 18:20:18,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,573 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 18:20:18,573 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 18:20:18,573 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,574 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,575 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,576 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,578 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,578 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,579 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,579 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,580 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,580 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,581 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,582 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,583 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,583 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,587 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 18:20:18,587 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 18:20:18,587 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 18:20:18,587 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 18:20:18,587 DEBUG #63052 End of retrieval
2020-08-27 18:20:18,596 DEBUG #63052 Connecting to database ...
2020-08-27 18:20:18,610 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 18:20:18,612 INFO #63052 already posted, no new Facebook post
2020-08-27 18:20:18,612 INFO #63052 facebook made in 0:00:00.105617
2020-08-27 18:20:18,613 DEBUG #63052 === Building twitter ===
2020-08-27 18:20:18,613 DEBUG #63052 Start of retrieval
2020-08-27 18:20:18,613 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 18:20:18,614 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,614 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 18:20:18,614 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,614 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 18:20:18,614 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 18:20:18,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,618 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,619 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,620 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,621 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,621 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,622 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,622 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,623 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,624 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,624 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,624 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 18:20:18,628 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 18:20:18,628 DEBUG #63052 End of retrieval
2020-08-27 18:20:18,628 DEBUG #63052 Connecting to database ...
2020-08-27 18:20:18,640 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 18:20:18,642 INFO #63052 twitter made in 0:00:00.028710
2020-08-27 19:20:18,064 DEBUG #63052 === Building facebook ===
2020-08-27 19:20:18,064 DEBUG #63052 Start of retrieval
2020-08-27 19:20:18,067 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 19:20:18,067 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 19:20:18,067 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 19:20:18,067 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 19:20:18,069 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 19:20:18,069 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 19:20:18,071 INFO #63052 Running html thru tidy.
2020-08-27 19:20:18,089 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 19:20:18,090 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 19:20:18,090 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 19:20:18,129 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 19:20:18,130 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 19:20:18,130 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 19:20:18,130 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,130 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 19:20:18,130 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,130 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 19:20:18,130 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 19:20:18,131 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,131 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,131 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,131 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,132 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,132 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,132 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,132 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,132 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,133 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,133 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,133 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,133 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,134 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,135 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,135 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,137 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,137 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,138 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,138 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,139 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,140 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,140 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,140 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,145 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 19:20:18,145 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 19:20:18,145 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 19:20:18,145 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 19:20:18,145 DEBUG #63052 End of retrieval
2020-08-27 19:20:18,154 DEBUG #63052 Connecting to database ...
2020-08-27 19:20:18,169 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 19:20:18,171 INFO #63052 already posted, no new Facebook post
2020-08-27 19:20:18,172 INFO #63052 facebook made in 0:00:00.107449
2020-08-27 19:20:18,172 DEBUG #63052 === Building twitter ===
2020-08-27 19:20:18,173 DEBUG #63052 Start of retrieval
2020-08-27 19:20:18,173 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 19:20:18,173 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,173 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 19:20:18,174 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,174 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 19:20:18,174 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 19:20:18,174 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,174 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,174 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,175 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,175 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,175 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,175 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,176 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,176 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,176 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,176 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,176 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,177 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,177 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,179 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,179 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,180 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,180 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,181 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,181 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,182 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,183 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,184 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,184 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 19:20:18,187 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 19:20:18,187 DEBUG #63052 End of retrieval
2020-08-27 19:20:18,188 DEBUG #63052 Connecting to database ...
2020-08-27 19:20:18,200 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 19:20:18,202 INFO #63052 twitter made in 0:00:00.029418
2020-08-27 20:20:18,167 DEBUG #63052 === Building facebook ===
2020-08-27 20:20:18,167 DEBUG #63052 Start of retrieval
2020-08-27 20:20:18,170 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 20:20:18,170 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 20:20:18,170 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 20:20:18,170 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 20:20:18,171 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 20:20:18,171 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 20:20:18,174 INFO #63052 Running html thru tidy.
2020-08-27 20:20:18,192 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 20:20:18,192 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 20:20:18,192 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 20:20:18,231 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 20:20:18,232 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 20:20:18,232 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 20:20:18,232 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,232 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 20:20:18,232 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,232 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 20:20:18,232 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 20:20:18,233 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,233 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,233 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,233 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,234 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,235 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,235 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,235 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,235 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,236 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,237 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,237 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,239 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,239 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,240 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,240 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,241 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,242 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,242 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,242 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,246 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 20:20:18,247 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 20:20:18,247 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 20:20:18,247 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 20:20:18,247 DEBUG #63052 End of retrieval
2020-08-27 20:20:18,256 DEBUG #63052 Connecting to database ...
2020-08-27 20:20:18,269 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 20:20:18,271 INFO #63052 already posted, no new Facebook post
2020-08-27 20:20:18,271 INFO #63052 facebook made in 0:00:00.104370
2020-08-27 20:20:18,272 DEBUG #63052 === Building twitter ===
2020-08-27 20:20:18,272 DEBUG #63052 Start of retrieval
2020-08-27 20:20:18,272 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 20:20:18,273 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,273 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 20:20:18,273 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,273 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 20:20:18,273 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 20:20:18,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,274 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,275 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,276 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,276 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,276 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,277 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,278 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,278 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,279 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,280 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,280 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,281 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,282 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,283 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 20:20:18,286 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 20:20:18,286 DEBUG #63052 End of retrieval
2020-08-27 20:20:18,287 DEBUG #63052 Connecting to database ...
2020-08-27 20:20:18,299 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 20:20:18,301 INFO #63052 twitter made in 0:00:00.028555
2020-08-27 21:20:35,087 DEBUG #63052 === Building facebook ===
2020-08-27 21:20:35,087 DEBUG #63052 Start of retrieval
2020-08-27 21:20:35,090 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 21:20:35,090 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 21:20:35,090 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 21:20:35,090 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 21:20:35,091 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 21:20:35,091 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 21:20:35,094 INFO #63052 Running html thru tidy.
2020-08-27 21:20:35,112 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 21:20:35,112 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 21:20:35,112 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 21:20:35,151 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 21:20:35,152 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 21:20:35,152 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 21:20:35,152 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,152 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 21:20:35,152 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,152 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 21:20:35,152 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 21:20:35,153 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,153 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,153 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,153 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,154 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,154 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,154 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,154 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,154 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,155 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,155 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,155 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,155 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,156 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,157 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,157 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,159 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,159 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,160 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,160 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,161 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,162 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,162 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,162 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,166 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 21:20:35,167 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 21:20:35,167 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 21:20:35,167 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 21:20:35,167 DEBUG #63052 End of retrieval
2020-08-27 21:20:35,175 DEBUG #63052 Connecting to database ...
2020-08-27 21:20:35,189 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 21:20:35,191 INFO #63052 already posted, no new Facebook post
2020-08-27 21:20:35,191 INFO #63052 facebook made in 0:00:00.104142
2020-08-27 21:20:35,192 DEBUG #63052 === Building twitter ===
2020-08-27 21:20:35,192 DEBUG #63052 Start of retrieval
2020-08-27 21:20:35,192 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 21:20:35,193 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,193 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 21:20:35,193 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,193 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 21:20:35,193 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 21:20:35,193 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,194 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,194 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,194 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,194 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,194 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,195 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,195 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,195 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,195 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,196 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,196 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,196 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,196 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,198 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,198 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,199 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,200 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,200 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,201 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,202 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,203 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,203 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,203 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 21:20:35,206 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 21:20:35,206 DEBUG #63052 End of retrieval
2020-08-27 21:20:35,207 DEBUG #63052 Connecting to database ...
2020-08-27 21:20:35,218 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 21:20:35,220 INFO #63052 twitter made in 0:00:00.027749
2020-08-27 22:20:20,699 DEBUG #63052 === Building facebook ===
2020-08-27 22:20:20,699 DEBUG #63052 Start of retrieval
2020-08-27 22:20:20,702 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 22:20:20,702 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 22:20:20,702 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 22:20:20,702 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 22:20:20,704 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 22:20:20,704 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 22:20:20,706 INFO #63052 Running html thru tidy.
2020-08-27 22:20:20,724 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 22:20:20,725 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 22:20:20,725 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 22:20:20,764 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 22:20:20,764 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 22:20:20,764 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 22:20:20,765 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,765 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 22:20:20,765 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,765 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 22:20:20,765 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 22:20:20,765 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,766 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,767 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,767 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,767 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,767 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,768 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,768 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,768 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,770 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,770 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,771 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,772 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,772 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,773 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,774 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,775 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,775 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,775 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,779 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 22:20:20,779 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 22:20:20,779 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 22:20:20,779 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 22:20:20,780 DEBUG #63052 End of retrieval
2020-08-27 22:20:20,788 DEBUG #63052 Connecting to database ...
2020-08-27 22:20:20,802 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 22:20:20,804 INFO #63052 already posted, no new Facebook post
2020-08-27 22:20:20,804 INFO #63052 facebook made in 0:00:00.104921
2020-08-27 22:20:20,805 DEBUG #63052 === Building twitter ===
2020-08-27 22:20:20,805 DEBUG #63052 Start of retrieval
2020-08-27 22:20:20,805 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 22:20:20,806 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,806 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 22:20:20,806 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,806 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 22:20:20,806 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 22:20:20,807 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,807 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,807 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,807 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,807 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,808 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,808 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,808 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,808 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,808 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,809 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,809 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,809 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,810 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,811 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,811 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,813 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,813 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,814 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,814 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,815 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,816 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,816 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,816 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 22:20:20,820 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 22:20:20,820 DEBUG #63052 End of retrieval
2020-08-27 22:20:20,820 DEBUG #63052 Connecting to database ...
2020-08-27 22:20:20,831 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 22:20:20,833 INFO #63052 twitter made in 0:00:00.028230
2020-08-27 23:20:21,493 DEBUG #63052 === Building facebook ===
2020-08-27 23:20:21,494 DEBUG #63052 Start of retrieval
2020-08-27 23:20:21,497 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-27 23:20:21,497 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 23:20:21,497 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-27 23:20:21,497 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 23:20:21,498 DEBUG #63052 Got charset utf-8 from html meta
2020-08-27 23:20:21,498 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-27 23:20:21,500 INFO #63052 Running html thru tidy.
2020-08-27 23:20:21,518 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-27 23:20:21,519 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-27 23:20:21,519 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-27 23:20:21,558 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-27 23:20:21,558 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-27 23:20:21,558 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 23:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,559 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 23:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,559 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 23:20:21,559 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 23:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,562 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,562 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,562 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,562 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,564 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,564 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,565 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,566 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,566 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,567 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,568 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,569 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,569 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,569 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,573 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-27 23:20:21,573 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-27 23:20:21,573 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 23:20:21,573 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 23:20:21,573 DEBUG #63052 End of retrieval
2020-08-27 23:20:21,588 DEBUG #63052 Connecting to database ...
2020-08-27 23:20:21,602 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 23:20:21,604 INFO #63052 already posted, no new Facebook post
2020-08-27 23:20:21,604 INFO #63052 facebook made in 0:00:00.110337
2020-08-27 23:20:21,605 DEBUG #63052 === Building twitter ===
2020-08-27 23:20:21,605 DEBUG #63052 Start of retrieval
2020-08-27 23:20:21,605 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-27 23:20:21,606 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,606 DEBUG #63052 Not dropping after all because of rel.
2020-08-27 23:20:21,606 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,606 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-27 23:20:21,606 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-27 23:20:21,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,611 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,611 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,613 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,613 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,614 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,614 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-27 23:20:21,620 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-27 23:20:21,620 DEBUG #63052 End of retrieval
2020-08-27 23:20:21,620 DEBUG #63052 Connecting to database ...
2020-08-27 23:20:21,631 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-27 23:20:21,633 INFO #63052 twitter made in 0:00:00.027825
2020-08-28 00:20:21,342 DEBUG #63052 === Building facebook ===
2020-08-28 00:20:21,342 DEBUG #63052 Start of retrieval
2020-08-28 00:20:21,345 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-28 00:20:21,345 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 00:20:21,345 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-28 00:20:21,345 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 00:20:21,346 DEBUG #63052 Got charset utf-8 from html meta
2020-08-28 00:20:21,347 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-28 00:20:21,349 INFO #63052 Running html thru tidy.
2020-08-28 00:20:21,367 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-28 00:20:21,368 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-28 00:20:21,368 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-28 00:20:21,407 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-28 00:20:21,407 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 00:20:21,407 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 00:20:21,408 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,408 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 00:20:21,408 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,408 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 00:20:21,408 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 00:20:21,409 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,409 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,409 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,409 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,409 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,411 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,411 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,411 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,412 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,413 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,413 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,415 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,415 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,416 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,416 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,417 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,418 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,418 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,418 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,423 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-28 00:20:21,423 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-28 00:20:21,423 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 00:20:21,423 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 00:20:21,423 DEBUG #63052 End of retrieval
2020-08-28 00:20:21,432 DEBUG #63052 Connecting to database ...
2020-08-28 00:20:21,446 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 00:20:21,448 INFO #63052 already posted, no new Facebook post
2020-08-28 00:20:21,448 INFO #63052 facebook made in 0:00:00.106554
2020-08-28 00:20:21,449 DEBUG #63052 === Building twitter ===
2020-08-28 00:20:21,449 DEBUG #63052 Start of retrieval
2020-08-28 00:20:21,450 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 00:20:21,450 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,450 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 00:20:21,450 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,450 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 00:20:21,451 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 00:20:21,451 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,451 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,451 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,452 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,452 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,452 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,452 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,452 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,453 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,453 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,453 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,453 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,453 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,454 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,456 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,456 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,459 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,460 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,461 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,461 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 00:20:21,464 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 00:20:21,464 DEBUG #63052 End of retrieval
2020-08-28 00:20:21,465 DEBUG #63052 Connecting to database ...
2020-08-28 00:20:21,476 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 00:20:21,478 INFO #63052 twitter made in 0:00:00.028697
2020-08-28 01:20:21,096 DEBUG #63052 === Building facebook ===
2020-08-28 01:20:21,096 DEBUG #63052 Start of retrieval
2020-08-28 01:20:21,100 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-28 01:20:21,100 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 01:20:21,100 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-28 01:20:21,100 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 01:20:21,101 DEBUG #63052 Got charset utf-8 from html meta
2020-08-28 01:20:21,101 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-28 01:20:21,103 INFO #63052 Running html thru tidy.
2020-08-28 01:20:21,121 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-28 01:20:21,122 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-28 01:20:21,122 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-28 01:20:21,162 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-28 01:20:21,162 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 01:20:21,162 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 01:20:21,162 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,162 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 01:20:21,163 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,163 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 01:20:21,163 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 01:20:21,163 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,163 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,163 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,164 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,164 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,164 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,164 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,165 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,165 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,165 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,165 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,165 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,166 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,166 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,168 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,168 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,169 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,169 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,170 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,170 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,172 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,172 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,173 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,173 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,177 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-28 01:20:21,177 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-28 01:20:21,177 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 01:20:21,178 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 01:20:21,178 DEBUG #63052 End of retrieval
2020-08-28 01:20:21,187 DEBUG #63052 Connecting to database ...
2020-08-28 01:20:21,201 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 01:20:21,203 INFO #63052 already posted, no new Facebook post
2020-08-28 01:20:21,203 INFO #63052 facebook made in 0:00:00.106613
2020-08-28 01:20:21,204 DEBUG #63052 === Building twitter ===
2020-08-28 01:20:21,204 DEBUG #63052 Start of retrieval
2020-08-28 01:20:21,204 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 01:20:21,205 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,205 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 01:20:21,205 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,205 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 01:20:21,205 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 01:20:21,205 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,206 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,206 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,206 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,206 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,206 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,207 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,207 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,207 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,207 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,208 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,208 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,208 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,209 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,210 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,210 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,211 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,212 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,212 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,213 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,214 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,215 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,215 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,215 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 01:20:21,219 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 01:20:21,219 DEBUG #63052 End of retrieval
2020-08-28 01:20:21,219 DEBUG #63052 Connecting to database ...
2020-08-28 01:20:21,230 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 01:20:21,232 INFO #63052 twitter made in 0:00:00.028011
2020-08-28 02:20:24,539 DEBUG #63052 === Building facebook ===
2020-08-28 02:20:24,539 DEBUG #63052 Start of retrieval
2020-08-28 02:20:24,542 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-28 02:20:24,543 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 02:20:24,543 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-28 02:20:24,543 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 02:20:24,544 DEBUG #63052 Got charset utf-8 from html meta
2020-08-28 02:20:24,544 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-28 02:20:24,546 INFO #63052 Running html thru tidy.
2020-08-28 02:20:24,564 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-28 02:20:24,565 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-28 02:20:24,565 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-28 02:20:24,606 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-28 02:20:24,606 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 02:20:24,606 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 02:20:24,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,607 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 02:20:24,607 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,607 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 02:20:24,607 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 02:20:24,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,608 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,609 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,610 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,611 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,613 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,613 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,614 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,615 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,616 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,617 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,618 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,618 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,618 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,623 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-28 02:20:24,623 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-28 02:20:24,623 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 02:20:24,624 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 02:20:24,624 DEBUG #63052 End of retrieval
2020-08-28 02:20:24,633 DEBUG #63052 Connecting to database ...
2020-08-28 02:20:24,650 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 02:20:24,652 INFO #63052 already posted, no new Facebook post
2020-08-28 02:20:24,652 INFO #63052 facebook made in 0:00:00.113056
2020-08-28 02:20:24,653 DEBUG #63052 === Building twitter ===
2020-08-28 02:20:24,653 DEBUG #63052 Start of retrieval
2020-08-28 02:20:24,653 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 02:20:24,654 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,654 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 02:20:24,654 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,654 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 02:20:24,655 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 02:20:24,655 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,655 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,655 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,656 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,656 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,656 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,656 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,657 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,657 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,657 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,657 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,657 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,658 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,658 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,660 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,660 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,661 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,662 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,663 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,663 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,664 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,665 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,665 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,666 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 02:20:24,669 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 02:20:24,669 DEBUG #63052 End of retrieval
2020-08-28 02:20:24,670 DEBUG #63052 Connecting to database ...
2020-08-28 02:20:24,683 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 02:20:24,685 INFO #63052 twitter made in 0:00:00.031613
2020-08-28 03:20:21,450 DEBUG #63052 === Building facebook ===
2020-08-28 03:20:21,450 DEBUG #63052 Start of retrieval
2020-08-28 03:20:21,454 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-28 03:20:21,455 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 03:20:21,455 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-28 03:20:21,455 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 03:20:21,456 DEBUG #63052 Got charset utf-8 from html meta
2020-08-28 03:20:21,456 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-28 03:20:21,458 INFO #63052 Running html thru tidy.
2020-08-28 03:20:21,476 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-28 03:20:21,477 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-28 03:20:21,477 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-28 03:20:21,516 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-28 03:20:21,516 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 03:20:21,517 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 03:20:21,517 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,517 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 03:20:21,517 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,517 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 03:20:21,517 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 03:20:21,518 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,518 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,518 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,518 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,519 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,520 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,520 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,520 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,521 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,522 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,522 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,524 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,524 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,524 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,525 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,526 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,527 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,527 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,527 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,531 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-28 03:20:21,531 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-28 03:20:21,532 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 03:20:21,532 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 03:20:21,532 DEBUG #63052 End of retrieval
2020-08-28 03:20:21,541 DEBUG #63052 Connecting to database ...
2020-08-28 03:20:21,554 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 03:20:21,556 INFO #63052 already posted, no new Facebook post
2020-08-28 03:20:21,556 INFO #63052 facebook made in 0:00:00.105751
2020-08-28 03:20:21,557 DEBUG #63052 === Building twitter ===
2020-08-28 03:20:21,557 DEBUG #63052 Start of retrieval
2020-08-28 03:20:21,557 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 03:20:21,558 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,558 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 03:20:21,558 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,558 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 03:20:21,558 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 03:20:21,558 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,559 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,560 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,561 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,563 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,563 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,564 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,565 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,565 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,566 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,567 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,568 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,568 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,568 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 03:20:21,571 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 03:20:21,571 DEBUG #63052 End of retrieval
2020-08-28 03:20:21,572 DEBUG #63052 Connecting to database ...
2020-08-28 03:20:21,583 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 03:20:21,585 INFO #63052 twitter made in 0:00:00.027671
2020-08-28 04:20:30,331 DEBUG #63052 === Building facebook ===
2020-08-28 04:20:30,331 DEBUG #63052 Start of retrieval
2020-08-28 04:20:30,334 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-28 04:20:30,334 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 04:20:30,334 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-28 04:20:30,335 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 04:20:30,338 DEBUG #63052 Got charset utf-8 from html meta
2020-08-28 04:20:30,338 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-28 04:20:30,340 INFO #63052 Running html thru tidy.
2020-08-28 04:20:30,363 WARNING #63052 tidy: line 525 column 1 -
lacks "summary" attribute
2020-08-28 04:20:30,364 INFO #63052 tidy: Doctype given is "-//W3C//DTD XHTML 1.0 Strict//EN"
2020-08-28 04:20:30,364 INFO #63052 tidy: Document content looks like XHTML 1.0 Strict
2020-08-28 04:20:30,404 DEBUG #63052 Found link to coverpage file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg.
2020-08-28 04:20:30,404 DEBUG #63052 Done parsing file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-28 04:20:30,404 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 04:20:30,404 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,405 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 04:20:30,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,405 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 04:20:30,405 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 04:20:30,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,405 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,406 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,406 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,406 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,406 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,406 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,407 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,407 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,407 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,407 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,407 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,408 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,408 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,410 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,411 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,411 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,412 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,412 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,413 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,414 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,415 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,415 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,421 DEBUG #63052 ... got mediatype image/jpeg from guess_type
2020-08-28 04:20:30,421 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg
2020-08-28 04:20:30,421 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 04:20:30,426 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 04:20:30,426 DEBUG #63052 End of retrieval
2020-08-28 04:20:30,436 DEBUG #63052 Connecting to database ...
2020-08-28 04:20:30,452 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 04:20:30,454 INFO #63052 already posted, no new Facebook post
2020-08-28 04:20:30,454 INFO #63052 facebook made in 0:00:00.122979
2020-08-28 04:20:30,455 DEBUG #63052 === Building twitter ===
2020-08-28 04:20:30,455 DEBUG #63052 Start of retrieval
2020-08-28 04:20:30,455 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-28 04:20:30,455 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,456 DEBUG #63052 Not dropping after all because of rel.
2020-08-28 04:20:30,456 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,456 DEBUG #63052 Dropping not included http://dublincore.org/documents/1998/09/dces/
2020-08-28 04:20:30,456 WARNING #63052 External link in file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm: http://dublincore.org/documents/1998/09/dces/
2020-08-28 04:20:30,456 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,456 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,457 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,458 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,459 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,459 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,461 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,461 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,462 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,462 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,463 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,463 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,465 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,465 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,466 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,466 DEBUG #63052 Dropping not included mediatype image/jpeg
2020-08-28 04:20:30,469 DEBUG #63052 Requesting iterlinks for: file:///public/vhost/g/gutenberg/html/files/63052/63052-h/images/cover.jpg ...
2020-08-28 04:20:30,469 DEBUG #63052 End of retrieval
2020-08-28 04:20:30,470 DEBUG #63052 Connecting to database ...
2020-08-28 04:20:30,483 DEBUG #63052 Connected to host gutenberg-pg1.int.ibiblio.org database gutenberg.
2020-08-28 04:20:30,485 INFO #63052 twitter made in 0:00:00.030493
2020-08-31 03:34:57,215 DEBUG #63052 === Building epub.images ===
2020-08-31 03:34:57,215 DEBUG #63052 Start of retrieval
2020-08-31 03:34:57,217 DEBUG #63052 ... got mediatype text/html from guess_type
2020-08-31 03:34:57,217 DEBUG #63052 ... creating new parser for file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm
2020-08-31 03:34:57,218 DEBUG #63052 HTMLParser.pre_parse () ...
2020-08-31 03:34:57,218 DEBUG #63052 Fetching file:///public/vhost/g/gutenberg/html/files/63052/63052-h/63052-h.htm ...
2020-08-31 03:34:57,219 DEBUG #63052 Got charset utf-8 from html meta
2020-08-31 03:34:57,219 DEBUG #63052 Trying to decode document with charset utf_8_sig ...
2020-08-31 03:34:57,221 INFO #63052 Running html thru tidy.
2020-08-31 03:34:57,241 WARNING #63052 tidy: line 525 column 1 -