WarpDoc Dataset for Camera Images of Documents

Abstract

WarpDoc consists of 1,020 camera images of documents drawn from scientific papers, magazines, envelopes, etc., which vary in paper material, page layout, and content. The images were captured in different scenes (indoors, outdoors, etc.) under different illumination conditions.

Updated on June 8, 2022: Added digital document images with margins for evaluating image quality.

Paper

Fourier Document Restoration for Robust Document Dewarping and Recognition (CVPR 2022)

Date
March 2022