Fixing error sentinel for pdftotext when the PDF has no text (scanned images). It was causing a crash previously.

This commit is contained in:
Matt 2018-02-01 10:08:57 -05:00
parent 5c59120c57
commit ce98019b49

View File

@ -235,6 +235,6 @@ def get_text_from_pdf(pdf_file):
try:
pdf = pdftotext.PDF(f)
except pdftotext.Error:
return False
return ""
return "\n".join(pdf)