dedoc

Getting started

  • Dedoc installation
  • Dedoc usage tutorial
  • Parameters description

Tutorials

  • Notebooks with examples of Dedoc usage
  • Adding support for a new document format to Dedoc
  • Adding support for a new structure type to Dedoc
  • Adding support for a new language to Dedoc
  • Creating Dedoc Document from basic data structures in code
  • Configure structure extraction using patterns

Dedoc API usage

  • Using dedoc via API
  • API schema
  • Description of the API output format

Readers output

  • Text annotations
  • Types of textual lines

Structure types

  • Default document structure type
  • Law structure type
  • Technical specification structure type
  • Diploma structure type
  • Article structure type (GROBID)
  • FinTOC structure type

Package Reference

  • Dedoc pipeline
  • dedoc.data_structures
  • dedoc.converters
  • dedoc.readers
  • dedoc.attachments_extractors
  • dedoc.metadata_extractors
  • dedoc.structure_extractors
  • dedoc.structure_constructors
  • Auxiliary data structures for PDF and images parsing

Development Notes

  • Support and Contributing
  • Changelog

© Copyright 2023-2025, Dedoc team.
