基于语义提取与图特征增强的非遗传承人视觉名片信息识别
DOI:
作者:
作者单位:

1.西安邮电大学;2.西安建筑科技大学 管理学院;3.西安建筑科技大学

作者简介:

通讯作者:

中图分类号:

TP393,G250

基金项目:

陕西省教育厅科学研究计划项目(25JK0233);国家社会科学基金项目(编号:24XJY008) ;2025年陕西省研究生教育综合改革研究与实践项目(YJSZG2025147)


Visual Business Card Information Recognition of Intangible Cultural Heritage Inheritors Based on Semantic Extraction and Graph Feature Enhancement
Author:
Affiliation:

西安邮电大学

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    对非遗传承人视觉名片信息识别可快速检索非遗文化的关键信息,推进非遗数字化建设。目前,各省市非遗传播平台多以列表形式展示非遗传承人的信息,视觉名片数据较为不足,且视觉信息准确识别需考虑语义特征与空间布局。为此,本研究提出一种融合语义提取与图特征增强方法(Semantic-Graph),以高效精准识别非遗传承人视觉名片信息。首先,通过整合公开网站的信息,构建陕西省非遗传承人视觉名片数据集,涵盖非遗名称、地区、内容等10种类别。其次,构建非遗知识词库与语义提取方法,准确获取视觉名片中的文本语义信息。最后,引入图特征增强策略,通过随机节点掩码、随机删边方法增强模型鲁棒性,通过位置注意力机制学习节点之间位置关系加强模型在视觉空间上的识别能力。对比实验中,Semantic-Graph的识别性能远高于其他基准模型,其宏观平均指标下的F1值可达到0.9281。消融实验与参数影响实验验证了图特征增强方法切实增加了模型性能,相对于基准模型的F1值提升11.6%。最后,Semantic-Graph在多个公开数据集的对比试验均达到最优。

    Abstract:

    The visual identification of intangible cultural heritage (ICH) inheritors enables rapid retrieval of key information related to intangible cultural heritage and promotes its digital preservation and dissemination. Currently, most provincial and municipal ICH dissemination platforms present inheritor information primarily in list form. Visual business card data remain relatively limited, and accurate recognition of visual information requires the joint consideration of semantic features and spatial layout. In this study, a visual information recognition method named Semantic-Graph, which integrates semantic extraction with graph feature enhancement, is proposed. First, by integrating information collected from public websites, a visual business card dataset of ICH inheritors in Shaanxi Province was constructed, covering ten categories, including ICH names, regions, and descriptive content. Second, an ICH-specific lexicon and a semantic extraction method were constructed to obtain textual information from visual business cards. Finally, a graph feature enhancement strategy was introduced. The robustness of the model was improved through random node masking and random edge deletion, while the positional relationships among nodes were learned through a positional attention mechanism to enhance the model’s recognition capability in visual space. Comparative experiments show that the recognition performance of Semantic-Graph is significantly higher than that of other benchmark models, achieving a macro-average F1 score of 0.9281. Ablation and parameter influence experiments further verify that the proposed graph feature enhancement method effectively improves model performance, with an 11.6% increase in F1 score compared with the baseline model. In addition, Semantic-Graph achieves superior performance in comparative experiments conducted on multiple publicly available datasets.

    参考文献
    相似文献
    引证文献
引用本文

王润周,张新生,苏锦旗,等. 基于语义提取与图特征增强的非遗传承人视觉名片信息识别[J]. 科学技术与工程, , ():

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2026-01-20
  • 最后修改日期:2026-03-26
  • 录用日期:2026-05-10
  • 在线发布日期:
  • 出版日期:
×
2026年会通知 | “技术经济学驱动智能经济生态构建与治理变革”——中国技术经济学会第三十三届学术年会(2026)会议通知暨征文启事(第一轮)
亟待确认版面费归属稿件,敬请作者关注