Progressive Frequency-Aware Network for Laparoscopic Image Desmoking

December 19, 2023 Β· Entered Twilight Β· πŸ› Chinese Conference on Pattern Recognition and Computer Vision

πŸ’€ TWILIGHT: Eternal Rest
Repo abandoned since publication

Repo contents: .vscode, 1.png, 2.png, 3.png, Poster.pdf, Progressive Frequency-Aware Network for Laparoscopic Desmoking.pdf, README.md, data, docs, environment.yml, models, options, requirements.txt, scripts, test.py, train.py, util, wandb

Authors Jiale Zhang, Wenfeng Huang, Xiangyun Liao, Qiong Wang arXiv ID 2312.12023 Category eess.IV: Image & Video Processing Cross-listed cs.CV Citations 5 Venue Chinese Conference on Pattern Recognition and Computer Vision Repository https://github.com/jlzcode/PFAN ⭐ 7 Last Checked 1 month ago
Abstract
Laparoscopic surgery offers minimally invasive procedures with better patient outcomes, but smoke presence challenges visibility and safety. Existing learning-based methods demand large datasets and high computational resources. We propose the Progressive Frequency-Aware Network (PFAN), a lightweight GAN framework for laparoscopic image desmoking, combining the strengths of CNN and Transformer for progressive information extraction in the frequency domain. PFAN features CNN-based Multi-scale Bottleneck-Inverting (MBI) Blocks for capturing local high-frequency information and Locally-Enhanced Axial Attention Transformers (LAT) for efficiently handling global low-frequency information. PFAN efficiently desmokes laparoscopic images even with limited training data. Our method outperforms state-of-the-art approaches in PSNR, SSIM, CIEDE2000, and visual quality on the Cholec80 dataset and retains only 629K parameters. Our code and models are made publicly available at: https://github.com/jlzcode/PFAN.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” Image & Video Processing