Task description and link to the notebook:
automatic detection of document format: DOC, DOCX, PDF or any image format;
text extraction and its structuring;
saving the result to JSON file.
Notebook 1
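The three steps listed for this notebook can be sketched with the standard library alone. This is not Dedoc's API, which the notebook itself demonstrates. The suffix sets, the `detect_format`, `structure_lines` and `save_json` helpers, the rule that a line ending in a colon is a heading, and the `text`/`subparagraphs` keys are all assumptions made for illustration.

```python
import json
from pathlib import Path

# Hypothetical suffix groups for illustration; not Dedoc's internal tables.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".bmp", ".tiff"}
DOCUMENT_SUFFIXES = {".doc", ".docx", ".pdf"}


def detect_format(file_name: str) -> str:
    """Classify a file by its suffix (a stand-in for real format detection)."""
    suffix = Path(file_name).suffix.lower()
    if suffix in IMAGE_SUFFIXES:
        return "image"
    if suffix in DOCUMENT_SUFFIXES:
        return suffix.lstrip(".")
    raise ValueError(f"unsupported format: {file_name}")


def structure_lines(lines: list) -> dict:
    """Nest plain lines under the most recent heading.

    Assumption: a line ending with ':' is a heading. Real structure
    extraction relies on much richer features than this.
    """
    root = {"text": "", "subparagraphs": []}
    current = root
    for line in lines:
        node = {"text": line, "subparagraphs": []}
        if line.endswith(":"):
            root["subparagraphs"].append(node)
            current = node
        else:
            current["subparagraphs"].append(node)
    return root


def save_json(result: dict, out_path: str) -> None:
    """Write the structured result to a UTF-8 JSON file."""
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(result, f, ensure_ascii=False, indent=2)
```

A call such as `save_json(structure_lines(lines), "result.json")` would then write the nested structure to disk; the output file name is only an example.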
automatic detection of document format: PDF or any image format;
tables extraction including multi-paged tables;
grouping tables by document page where they are located;
saving each page to CSV file.
Notebook 2
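As a rough illustration of the grouping and saving steps in this row, the sketch below files already-extracted tables under their page and writes one CSV file per page. It does not call Dedoc. The `Table` class, its `page_id` and `rows` fields, the `page_<n>.csv` naming, and the choice to file a multi-page table under the page where it starts are hypothetical.

```python
import csv
from collections import defaultdict
from dataclasses import dataclass
from pathlib import Path


@dataclass
class Table:
    page_id: int  # page where the table starts (hypothetical field name)
    rows: list    # list of rows, each a list of cell strings


def group_by_page(tables):
    """Map each page number to the tables that start on that page."""
    pages = defaultdict(list)
    for table in tables:
        pages[table.page_id].append(table)
    return dict(pages)


def save_pages_to_csv(tables, out_dir):
    """Write one CSV file per page and return the written paths in page order."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written = []
    for page_id, page_tables in sorted(group_by_page(tables).items()):
        path = out / f"page_{page_id}.csv"
        with open(path, "w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f)
            for index, table in enumerate(page_tables):
                if index > 0:
                    writer.writerow([])  # blank row between tables on one page
                writer.writerows(table.rows)
        written.append(path)
    return written
```

Separating tables on the same page with a blank row keeps each file readable in a spreadsheet; writing one file per table would be an equally reasonable choice.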
automatic detection of image format;
text extraction from image;
text location visualization;
text recognition confidence visualization.
Notebook 3
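One way to picture the two visualization steps in this row is an SVG overlay in which each recognized word becomes a rectangle colored by its confidence. This is a minimal sketch, not what the notebook does. The `Word` class and its fields, the assumption that confidence lies in [0, 1], and the red-to-green color scale are all hypothetical.

```python
from dataclasses import dataclass
from xml.sax.saxutils import escape


@dataclass
class Word:
    text: str
    x: int
    y: int
    width: int
    height: int
    confidence: float  # assumed to lie in [0, 1]


def confidence_color(confidence: float) -> str:
    """Map confidence to a color between red (low) and green (high)."""
    c = min(max(confidence, 0.0), 1.0)  # clamp out-of-range values
    red = round(255 * (1 - c))
    green = round(255 * c)
    return f"rgb({red},{green},0)"


def words_to_svg(words, image_width: int, image_height: int) -> str:
    """Build an SVG overlay with one rectangle per recognized word."""
    parts = [
        f'<svg xmlns="http://www.w3.org/2000/svg" '
        f'width="{image_width}" height="{image_height}">'
    ]
    for w in words:
        color = confidence_color(w.confidence)
        parts.append(
            f'<rect x="{w.x}" y="{w.y}" width="{w.width}" height="{w.height}" '
            f'fill="{color}" fill-opacity="0.3" stroke="{color}">'
            f"<title>{escape(w.text)} ({w.confidence:.2f})</title></rect>"
        )
    parts.append("</svg>")
    return "\n".join(parts)
```

The resulting string can be saved as an `.svg` file and layered over the source image in a browser; hovering over a rectangle shows the word and its confidence.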