pysummarization.readablewebpdf package

Submodules

pysummarization.readablewebpdf.web_pdf_reading module

class pysummarization.readablewebpdf.web_pdf_reading.WebPDFReading[source]

Bases: pysummarization.readable_web_pdf.ReadableWebPDF

Read the PDF.

is_pdf_url(url)[source]

Check PDF file format.

@TODO(chimera0): validation.

Parameters:url – URL
Returns:PDF, False: not PDF
Return type:True
path_to_text(path)[source]

Transform local PDF file to string.

Parameters:path – path to PDF file.
Returns:string.
url_to_text(url)[source]

Download PDF file and transform its document to string.

Parameters:url – PDF url.
Returns:string.

Module contents