融合扩散和小波残差网络的岩石薄片图像分类
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

黑龙江省科技创新基地项目“数智化油田信息感知与智能分析处理关键技术研究”(JD24A009);东北石油大学人才引进科研启动经费资助项目(13051202402)


Classification of rock thin slice images by Fusing Diffusion and Wavelet Residual Networks
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对岩石薄片图像因类内差异大、类间相似性高而导致的分类难题,本文提出一种融合局部频域特征与全局语义的双路径分类模型(Wavelet Domain-Denoising Diffusion Probabilistic Models,WD-DDPM)。该方法以ResNet-50为基础,设计了基于小波变换的多尺度下采样模块(wavelet transform multi-scale down-sampling block,WTMDB),以替代传统下采样操作,通过对图像进行小波分解,在保留低频轮廓信息的同时,利用注意力机制强化高频子带中的边缘与纹理细节,有效增强了模型对局部判别性特征的提取能力。其次,引入扩散模型的U-Net编码器作为并行路径,提取蕴含整体结构与长距离依赖关系的全局上下文特征。最终,通过融合局部细节与全局语义的双路径特征,形成互补的图像表示,送入分类器完成岩性识别。在包含108个岩石薄片子类的数据集上的实验结果表明,所提方法的准确率达到97.5%,召回率为94.7%,F1分数为96.0%,显著优于VGG16、MobileNetV3等主流卷积神经网络模型(在相同实验条件下),验证了其在复杂岩石图像分类任务中的优越性能与实用价值。

    Abstract:

    Aiming at the classification challenge of rock thin section images caused by large intra-class differences and high inter-class similarity, this paper proposes a dual-path classification model named WD-DDPM, which integrates local frequency-domain features and global semantics. Based on ResNet-50, our method first designs a Wavelet Transform based Multi-scale Downsampling Block (WTMDB) to replace the traditional downsampling operation. By performing wavelet decomposition on the image, it retains low-frequency contour information while simultaneously using an attention mechanism to enhance edges and textural details in high-frequency subbands, effectively improving the model's ability to extract local discriminative features. Secondly, the U-Net encoder of a diffusion model is introduced as a parallel path to extract global contextual features that encompass the overall structure and long-range dependencies. Finally, the features from both paths, representing local details and global semantics, are fused to form a complementary image representation, which is then fed into a classifier for lithology identification. Experiments on a rock thin section dataset containing 108 sub-categories show that our method achieves accuracy, recall, and F1-score of 97.5%, 94.7%, and 96%, respectively, significantly outperforming mainstream convolutional models such as VGG16 and MobileNetV3, verifying its effectiveness and superiority in complex rock image classification.

    参考文献
    相似文献
    引证文献
引用本文

杜睿山,穆文轩,孟令东. 融合扩散和小波残差网络的岩石薄片图像分类[J]. 科学技术与工程, 2026, 26(13): 5566-5573.
Du Ruishan, Mu Wenxuan, Meng Lingdong. Classification of rock thin slice images by Fusing Diffusion and Wavelet Residual Networks[J]. Science Technology and Engineering,2026,26(13):5566-5573.

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-04-30
  • 最后修改日期:2026-01-26
  • 录用日期:2025-11-19
  • 在线发布日期: 2026-05-18
  • 出版日期:
×
2026年会通知 | “技术经济学驱动智能经济生态构建与治理变革”——中国技术经济学会第三十三届学术年会(2026)会议通知暨征文启事(第一轮)
亟待确认版面费归属稿件,敬请作者关注