Skip to content

Tools for extract figure, table, text, .. from a pdf document.

Notifications You must be signed in to change notification settings

Wild-Rift/Document-Layout-Analysis

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Document-Layout-Analysis

Tools for extract figure, table, text,... from a pdf document.

Table of Contents

About The Project

Product Name Screen Shot

Built With

Getting Started

Make sure you have python3 installed on your computer. Recommended to install on Ubuntu.

To get a local copy up and running follow these simple steps.

Prerequisites

This is an example of how to list things you need to use the software and how to install them.

  • Detectron2

Requirment

  • CUDA=10.1
  • Pytorch >= 1.7.0

How to install CUDA 10.1 can be found here: https://developer.nvidia.com/cuda-10.1-download-archive-base

How to install Pytorch can be found here: https://pytorch.org/

After installed above package, follow the instructions below to install detectron2:

$   pip install cython pyyaml==5.1
$   pip install detectron2 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cu100/index.html

After installed detectron2, run:

$   pip install -r requirments.txt

Installation

  1. Clone the repo
git clone https://github.com/Wild-Rift/Document-Layout-Analysis.git
  1. Run demo
streamlit run virtualize.py

About

Tools for extract figure, table, text, .. from a pdf document.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •