Abstract
In this letter, we propose a novel lightweight X-ray image contraband segmentation network, XSNet, which integrates State Space Models (SSM) with Convolutional Neural Networks (CNNs) to achieve a significant trade-off between segmentation accuracy and lightweight design for computer-aided X-ray security check. The model is built based on the encoder-decoder framework. Specifically, we design an Multi-scale Convolution Fusion (MCF) block for multi-scale information extraction and a Dual-branch State Space Model (DSSM) block to relieve the bias caused by the imbalance of single branch structure in feature extraction and maintain the capabilities of SSM in modeling long range pixel dependencies. In addition, we present two versions of the model in two different sizes called XSNet-s and XSNet-l respectively. The quantitative and qualitative evaluations on the public PIDray and PIXray datasets both show the superiority of two models in terms of mean Intersection over Union (mIoU) and FLOPs.
| Original language | English |
|---|---|
| Pages (from-to) | 1351-1355 |
| Number of pages | 5 |
| Journal | IEEE Signal Processing Letters |
| Volume | 32 |
| DOIs | |
| State | Published - 2025 |
Keywords
- Contraband segmentation
- X-ray images
- deep learning
- state space models
Fingerprint
Dive into the research topics of 'XSNet: A Lightweight X-Ray Security Image Segmentation Model Combining State-Space Models and Convolutional Neural Networks'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver