BookImageSplitter

Converts list of 2-page book images into 1-page book images.

Using CNNs.

Using transfer learning, CNN is trained to do pixel segmentation of book area vs the background. Notebook
Using transfer learning, CNN is trained to find out the mid line of the two pages. Mid line is the line which divides the left page from the right page. Notebook
Using weighted linear regression (with huber loss so as to ignore outliers), mid line is finetuned. This line is used both to rotate the image so that it becomes vertical in resulting image. Also, this is used to divide the image into left page and right page. Notebook
Using Dense CRF over pixel segmentation of book area output of CNN to improve book boundaries. KL Divergence has improved. Notebook

TODO

Handle rough page edges. Make them smoothe.

Using Traditional Computer Vision Techniques.

How to use

python run.py IMAGE_DIRECTORY

In IMAGE_DIRECTORY, all images which needs to be separated into two should be present.

TODO

Affine transformation on final output image so as to make the text horizontal.
Automatic detection of page numbers.
High resolution Pdf. Currently, low resolution pdf is getting formed.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
images		images
.gitignore		.gitignore
BookSegmentation.ipynb		BookSegmentation.ipynb
CRF_based_BookSegmentation.ipynb		CRF_based_BookSegmentation.ipynb
LICENSE		LICENSE
PageMidBoundaryPixelSegmentation.ipynb		PageMidBoundaryPixelSegmentation.ipynb
PagePixelSegmentation.ipynb		PagePixelSegmentation.ipynb
README.md		README.md
background_cleaning.py		background_cleaning.py
constants.py		constants.py
distortion_rectifier.py		distortion_rectifier.py
images_to_pdf.py		images_to_pdf.py
run.py		run.py
split_image.py		split_image.py
split_image_CNN.py		split_image_CNN.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BookImageSplitter

Using CNNs.

TODO

Using Traditional Computer Vision Techniques.

How to use

TODO

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

BookImageSplitter

Using CNNs.

TODO

Using Traditional Computer Vision Techniques.

How to use

TODO

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages