TY - JOUR
T1 - ALDII: Adaptive Learning-based Document Image Inpainting to enhance the handwritten Chinese character legibility of human and machine
AU - Mao, Qinglin
AU - Li, Jingjin
AU - Zhou, Hang
AU - Kar, Pushpendu
AU - Bellotti, Anthony Graham
PY - 2025/2
Y1 - 2025/2
N2 - Document Image Inpainting (DII) has been applied to degraded documents, including financial and historical documents, to enhance the legibility of images for: (1) human readers, by providing images of high visual quality; and (2) machine recognizers such as Optical Character Recognition (OCR), by reducing recognition errors. With the advent of Deep Learning (DL), DL-based DII methods have achieved remarkable improvements in either human or machine legibility. However, focusing on improving machine legibility degrades visual image quality, which affects human readability. To address this contradiction, we propose an adaptive learning-based DII method, namely ALDII, that applies a domain adaptation strategy; our approach acts as a plug-in module capable of constraining a total feature space before optimizing human and machine legibility, respectively. We evaluate ALDII on a Chinese handwritten character dataset that includes single-character and text-line images. Compared with other state-of-the-art approaches, experimental results demonstrate the superior performance of ALDII on metrics of both human and machine legibility.
AB - Document Image Inpainting (DII) has been applied to degraded documents, including financial and historical documents, to enhance the legibility of images for: (1) human readers, by providing images of high visual quality; and (2) machine recognizers such as Optical Character Recognition (OCR), by reducing recognition errors. With the advent of Deep Learning (DL), DL-based DII methods have achieved remarkable improvements in either human or machine legibility. However, focusing on improving machine legibility degrades visual image quality, which affects human readability. To address this contradiction, we propose an adaptive learning-based DII method, namely ALDII, that applies a domain adaptation strategy; our approach acts as a plug-in module capable of constraining a total feature space before optimizing human and machine legibility, respectively. We evaluate ALDII on a Chinese handwritten character dataset that includes single-character and text-line images. Compared with other state-of-the-art approaches, experimental results demonstrate the superior performance of ALDII on metrics of both human and machine legibility.
KW - Document image inpainting
KW - Domain adaptation
KW - Blind image inpainting
KW - Optical Character Recognition (OCR)
U2 - 10.1016/j.neucom.2024.128897
DO - 10.1016/j.neucom.2024.128897
M3 - Article
SN - 0925-2312
VL - 616
JO - Neurocomputing
JF - Neurocomputing
M1 - 128897
ER -