Skip to content

Adding module pdf_processor

Guillaume ALLEON requested to merge create_toc into main

Adding module pdf_processor that uses rules to reconstruct toc using a combination of embedded toc & links defaulting to pdfstruct Collection in case of error

Adding code to retrieve column id using scikit-learn

Move to latest liriae-common v3.1.0

Move from 'if type(x) is y' to 'if isintance(x, y)'

Adding more tests

Linting with Ruff

Edited by Guillaume ALLEON

Merge request reports

Loading