Commit graph

7 commits

Author SHA1 Message Date
Marty Oehme 9e713193a8
refactor: Fix circular exception import 2024-06-14 15:18:22 +02:00
Marty Oehme 6b35b2f918
chore: Fix strict pyright analysis errors 2024-06-14 15:13:24 +02:00
Marty Oehme 8093259551
refactor: Remove pymupdf coupling in extraction
The library is only needed for pdf extraction which is taken care of
in its own extractor plugin. In the overall extraction routine we do not
need any knowledge of the existence of pymupdf.
2024-06-14 14:59:39 +02:00
Marty Oehme c53cd563b7
feat: Add pocketbook extraction 2024-01-24 08:56:21 +01:00
Marty Oehme ddb34fca7b
refactor: Move tagging by color to Annotation 2024-01-24 08:53:54 +01:00
Marty Oehme 11d570f9d8
refactor: Rename annotation content variables
Renamed the two variables describing an annotation's highlighted PDF-text and
its appended note if any exists. Previously called 'text' (for the in-PDF
highlighted content) and 'content' (for the additional supplied content).

Now they are called 'content' for the IN PDF words, highlighted.
and 'note' for the appended note given (or not) in an annotation.
2024-01-23 09:54:36 +01:00
Marty Oehme 3b4db7b6b8
refactor: Extract PDF extractor into class
Extractor is a general protocol with the PDF extraction routine now being
one implementation of the protocol. Preparation for adding multiple
extractors (epub,djvu, or specific progammes) in the future.
2024-01-20 18:02:51 +01:00