Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
11 changes: 8 additions & 3 deletions README
Original file line number Diff line number Diff line change
Expand Up @@ -13,7 +13,7 @@ as a string of text:
>>> with open('example.pdf') as f:
... doc = slate.PDF(f)
...
>>> doc
>>> doc
[..., ..., ...]
>>> doc[1]
'Text from page 2...'
Expand All @@ -33,14 +33,19 @@ More complex operations
If you would like access to the images, font files and other
information, then take some time to learn the PDFMiner API.

Installation
------------

To install Slate, clone the repository and run

$ python setup.py install

What is wrong with PDFMiner?
----------------------------

1. Getting simple things done, like extracting the text
is quite complex. The program is not designed to return
Python objects, which makes interfacing things irritating.
2. It's an extremely complete set of tools, with multiple
2. It's an extremely complete set of tools, with multiple
and moderately steep learning curves.
3. It's not written with hackability in mind.

2 changes: 1 addition & 1 deletion src/slate/classes.py
Original file line number Diff line number Diff line change
Expand Up @@ -22,7 +22,7 @@
from pdfminer.pdfparser import PDFPage
except ImportError:
from pdfminer.pdfpage import PDFPage
import utils
from . import utils

__all__ = ['PDF']

Expand Down