Skip to content

XinsGithub/Main-Document-Extraction-and-Document-Attribute-Classification-Methods-Based-on-Deep-Learning

Repository files navigation

Introduction: The Internet has a huge amount of information and is growing at an exponential rate. Document information is particularly important for people's work and life. However, it is difficult to obtain meaningful data for people from massive document information, and document classification is a prerequisite for extracting information from massive documents, so document classification is particularly important. In numerous studies, it has been found that layout cutting is an efficient technique based on object detection. In this text classification task, using layout segmentation facilitates dividing a complete article into smaller sections and applying them to Faster R-CNN.

Methods: This project mainly studies the techniques for classifying articles based on their main content. On the basis of sufficient layout analysis techniques, the image is divided into small regions. We use the Faster R-CNN algorithm to extract and classify features for each region, and then integrate the features of multiple regions within an article to form a document's features, achieving the goal of classification and backbone extraction.

Usage: To run the prediction process, simply execute predict.py and provide the path to the labeled image to view the results. If you would like to use your own data, please label the image manually using the labelImg tool. You can select labels from the following categories: table, figure, message, title, and subtitle. To train the model with your data, modify the train_annotation_path and val_annotation_path variables in train.py.

Contact: If you have some questions while using the repository, please contact me at [email protected]

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages