Migrate data to the hoarder bookmarking self-hosted application

Find a file

Marty Oehme 1017b876e9 Attempt to calculate annotation offsets There are clear issues remaining with this approach. The wallabag-given 'start' and 'end' fields do _not_ just point to the n-th paragraph all the time (like I thought) but actually represent a beautifulsoup4 like tree descent. So: `p_start_match = re.match(r"/p\[(\d+)\]", annot["ranges"][0]["start"])` will fail on any annotation not just at the n-th paragraph. Instead we should see how we can move this tree into the beautifulsoup4 parser and make use of wallabag already having done the work for us?		2025-03-12 20:29:11 +01:00
shaarli2hoarder	Add shaarli2hoarder	2025-03-12 20:24:46 +01:00
wallabag2hoarder	Attempt to calculate annotation offsets	2025-03-12 20:29:11 +01:00
.gitignore	initial scaffold	2025-03-12 20:24:45 +01:00
.python-version	initial scaffold	2025-03-12 20:24:45 +01:00
main.py	Add main.py	2025-03-12 20:24:46 +01:00
pyproject.toml	Attempt to calculate annotation offsets	2025-03-12 20:29:11 +01:00
README.md	Refactor wallabag conversion to have simple cli	2025-03-12 20:24:47 +01:00
uv.lock	Attempt to calculate annotation offsets	2025-03-12 20:29:11 +01:00

README.md

Hoarder migration scripts

Simple scripts which try to ease the migration to a self-hosted hoarder instance.

Wallabag

Use one of the flavours of transferring your wallabag entries into hoarder. Run it like: ./wallabag2hoarder/convert.py <input-file>

Where input file is a wallabag json export of all your saved entries. There are options to change the style of output (netscape HTML, native JSON) and the file to output to. By default converts to hoard JSON and prints to stdout.