Interactive degraded document binarization: An example (and case) for interactive computer vision

Zheng Lu, Zheng Wu, Michael S. Brown

Research output: Chapter in Book/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

This paper describes a user-assisted application to perform adaptive thresholding (i.e. binarization) on degraded handwritten documents. While existing adaptive thresholding techniques purport to be automatic, they in fact require the user to perform non-intuitive parameter tuning to obtain satisfactory results. In our work, we recast the problem into one where the user needs only to coarsely markup regions in the thresholded image that have unsatisfactory results. These regions are then segmented and processed locally - no parameter tuning is necessary. Our user study shows that not only do the majority of users prefer our application over parameter tuning, but our final results are better than existing algorithms due to the more targeted processing. While our main contribution is an effective user-assisted application for document binarization, we use this as an example to advocate the need to rethink how many computer vision solutions, notoriously reliant on parameter tuning, can be reworked to exploit meaningful user interaction.

Original languageEnglish
Title of host publication2009 Workshop on Applications of Computer Vision, WACV 2009
DOIs
Publication statusPublished - 2009
Externally publishedYes
Event2009 Workshop on Applications of Computer Vision, WACV 2009 - Snowbird, UT, United States
Duration: 7 Dec 20098 Dec 2009

Publication series

Name2009 Workshop on Applications of Computer Vision, WACV 2009

Conference

Conference2009 Workshop on Applications of Computer Vision, WACV 2009
Country/TerritoryUnited States
CitySnowbird, UT
Period7/12/098/12/09

ASJC Scopus subject areas

  • Computer Science Applications
  • Computer Vision and Pattern Recognition

Fingerprint

Dive into the research topics of 'Interactive degraded document binarization: An example (and case) for interactive computer vision'. Together they form a unique fingerprint.

Cite this