基于外观流估计与Swin Transformer融合的图像畸变矫正方法
DOI:
作者:
作者单位:

1.广西科技大学机械与汽车工程学院;2.福建福耀科技大学智造与未来技术学院

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金(51765007)


Image Distortion Correction via the Fusion of Appearance Flow Estimation and Swin Transformer
Author:
Affiliation:

1.School of Mechanical and Automotive Engineering, Guangxi University of Science and Technology;2.School of Intelligent Manufacturing and Future Technology, Fujian Fuyao University of Science and Technology

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着广角镜头和全景相机的广泛应用,图像中常常出现桶形畸变等几何失真现象,严重影响后续图像理解与计算机视觉任务的准确性。针对该问题,本文提出了一种融合外观流估计模块与Swin Transformer的深度协同网络结构,用于实现图像桶形畸变的自动矫正。该方法以畸变图像为输入,通过统一的深度学习框架预测其对应的光流场,并利用反采样操作生成矫正后的无畸变图像。为提升模型的训练效果,本文构建了一个覆盖多类图像场景的全景图像数据集,并基于统一球体模型生成配对的无畸变图像和标有相应畸变参数的畸变图像。实验结果表明,所提出的方法取得了22.0140 dB的峰值信噪比(Peak Signal-to-Noise Ratio,PSNR)和0.7955的结构相似度(Structural Similarity,SSIM),平均像素位移误差(Mean Pixel Displacement Error,MPDE)为16.5980,直线弯曲度误差(Line Curvature Error,LCE)为1.8153,表明该方法在图像畸变校正和几何结构恢复方面具有良好的性能。同时在主观视觉质量上也显著优于现有方法,充分验证了该融合网络在图像几何畸变校正任务中的有效性。自建数据集可在以下网址公开获取:https://github.com/ssssssch/yuyan-dataset

    Abstract:

    With the widespread application of wide-angle lenses and panoramic cameras, geometric distortions such as barrel distortion frequently occur in images,which severely degrade image quality and adversely affect subsequent image understanding and computer vision tasks.To address this problem,a deep collaborative network integrating an appearance flow estimation module and Swin Transformer was proposed for automatic barrel distortion correction.Taking distorted images as input,the proposed method predicts the corresponding dense optical flow fields within a unified deep learning framework and generates corrected undistorted images through a resampling operation.To enhance training effectiveness,a panoramic image dataset covering diverse scene categories was constructed, and paired distorted and undistorted images with corresponding distortion parameters were generated based on the unified spherical camera model.Experimental results demonstrated that the proposed method achieved a peak signal-to-noise ratio (PSNR) of 22.0140 dB,a structural similarity index (SSIM) of 0.7955,a mean pixel displacement error (MPDE) of 16.5980,and a line curvature error (LCE) of 1.8153,demonstrating good performance in both image distortion correction and geometric structure recovery.Moreover,the proposed approach shows clear superiority over existing methods in terms of subjective visual quality,further validating its effectiveness for geometric distortion correction tasks.The constructed dataset is publicly available at: https://github.com/ssssssch/yuyan-dataset

    参考文献
    相似文献
    引证文献
引用本文

宋昌鸿,葛动元,庄其傲,等. 基于外观流估计与Swin Transformer融合的图像畸变矫正方法[J]. 科学技术与工程, , ():

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-01-05
  • 最后修改日期:2026-03-29
  • 录用日期:2026-04-14
  • 在线发布日期:
  • 出版日期:
×
2026年会通知 | “技术经济学驱动智能经济生态构建与治理变革”——中国技术经济学会第三十三届学术年会(2026)会议通知暨征文启事(第一轮)
亟待确认版面费归属稿件,敬请作者关注