Document image dewarping is a crucial step in the digitization of physical documents, as it aims to remove the distortions induced by challenging environment settings and document sheet deformations often encountered when using smartphone cameras for image capture. Recently, deep learning-based methods were combined with knowledge about the expected document structure, also known as a template, at inference time to improve the dewarping results.
Our contributions in this work are threefold:
Our approach improves upon the state-of-the-art methods by 32.6% in Local Distortion and 40.2% in mnCER. Our code and models are available on this website.
@InProceedings{hertlein2025docmatcher,
author = {Hertlein, Felix and Naumann, Alexander and Sure-Vetter, York},
title = {DocMatcher: Document Image Dewarping via Structural and Textual Line Matching},
booktitle = {Proceedings of the Winter Conference on Applications of Computer Vision (WACV)},
month = {February},
year = {2025},
pages = {5771-5780}
}