Abstract
Visible-infrared image fusion combines complementary information captured by different sensors. It can enhance human visual perception in video surveillance under low-light conditions and provide rich information for downstream tasks. Vision Transformer (ViT)-based fusion algorithms require input images to be standardized to a specific height and width so that they can be divided into a series of fixed-size patches. Consequently, a scaling operation must be performed on the original image, which often degrades the quality of the fusion results. This paper proposes a visible-infrared image fusion neural network that is insensitive to input size: a fixed-size image pre-fusion framework first generates lossless instructive fusion results (IFRs), and a size-insensitive enhancing framework then refines the preliminary fused images under the guidance of these IFRs. The approach is also potentially applicable to other image fusion tasks, such as multi-focus image fusion.
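The sketch below illustrates the general two-stage idea the abstract describes: a fixed-size fusion stage produces a guide image, and a fully convolutional stage refines the fusion at the native resolution under that guidance. It is a minimal PyTorch sketch under assumed design choices; the module names, layer choices, and the 224x224 working size are hypothetical and not the authors' actual architecture.

```python
# Minimal sketch (PyTorch) of a two-stage, size-insensitive fusion pipeline.
# All module names, layer choices, and sizes are illustrative assumptions.
import torch
import torch.nn as nn
import torch.nn.functional as F


class FixedSizePreFusion(nn.Module):
    """Stage 1: a fusion network that only accepts a fixed input size
    (e.g. 224x224, so the image splits into fixed-size ViT patches)."""

    def __init__(self, size=224):
        super().__init__()
        self.size = size
        # Stand-in for a ViT-based fusion backbone.
        self.fuse = nn.Sequential(
            nn.Conv2d(2, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 3, padding=1),
        )

    def forward(self, vis, ir):
        # Inputs must be rescaled to the fixed size before fusion.
        vis_s = F.interpolate(vis, size=(self.size, self.size),
                              mode="bilinear", align_corners=False)
        ir_s = F.interpolate(ir, size=(self.size, self.size),
                             mode="bilinear", align_corners=False)
        # Output plays the role of an instructive fusion result (IFR).
        return self.fuse(torch.cat([vis_s, ir_s], dim=1))


class SizeInsensitiveEnhancer(nn.Module):
    """Stage 2: a fully convolutional refiner that runs at the native
    resolution, guided by the IFR upsampled back to that resolution."""

    def __init__(self):
        super().__init__()
        self.refine = nn.Sequential(
            nn.Conv2d(3, 16, 3, padding=1), nn.ReLU(),
            nn.Conv2d(16, 1, 3, padding=1),
        )

    def forward(self, vis, ir, ifr):
        h, w = vis.shape[-2:]
        ifr_up = F.interpolate(ifr, size=(h, w),
                               mode="bilinear", align_corners=False)
        return self.refine(torch.cat([vis, ir, ifr_up], dim=1))


if __name__ == "__main__":
    # Arbitrary, non-square input: no cropping or padding is required.
    vis = torch.rand(1, 1, 371, 503)
    ir = torch.rand(1, 1, 371, 503)
    ifr = FixedSizePreFusion()(vis, ir)             # fixed-size fused guide
    fused = SizeInsensitiveEnhancer()(vis, ir, ifr)
    print(fused.shape)  # torch.Size([1, 1, 371, 503])
```

Because the second stage contains only convolutions, the final fused image keeps the original spatial dimensions, which is the size-insensitivity property the abstract emphasizes.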
Original language | English |
---|---|
Title of host publication | 2024 IEEE 48th Annual Computers, Software, and Applications Conference (COMPSAC) |
Publisher | IEEE |
Pages | 1524-1525 |
Number of pages | 2 |
ISBN (Electronic) | 9798350376968 |
ISBN (Print) | 9798350376975 |
DOIs | |
Publication status | Published - 2024 |
Keywords
- Image fusion
- Infrared image
- Size-insensitive network