基于大语言模型Function-Calling架构的中医舌象辅助问诊系统
DOI:
作者:
作者单位:

作者简介:

通讯作者:

中图分类号:

TP391

基金项目:

国家自然科学基金(62001255);内蒙古自治区自然基金(2024MS06026)


The Auxiliary Tongue Diagnosis System for Traditional Chinese Medicine Based on Large Language Model Function-Calling Architecture
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    针对大语言模型缺乏视觉能力及现有多模态大模型在中医舌象分析中准确率不足的问题,本研究构建了一个中医舌象辅助问诊系统,实现舌象的自动分类与中医辅助诊疗方案的生成。在利用UNet进行舌象分割的基础上,使用TransNeXt主干网络构建舌象多标签分类系统,实现舌体分割与舌象多标签分类。进一步,结合舌象多标签分类系统与大语言模型,并基于微调后的Function-calling架构,实现了中医舌象辅助问诊系统。在舌象多标签分类系统的有效性验证实验中,UNet模型在舌象分割的平均精确度、平均召回率和平均交并比指标上表现较好,分别达到97.58%,98.61%和96.25%;基于TransNeXt主干网络开发的舌象多标签分类模型在舌象识别的子集准确率、精确度、召回率、F-1值方面表现更佳,分别达到75.69%,91.18%,91.41%,91.28%。在中医辅助诊疗方案的生成实验中,甄选出的最佳大语言模型相较于多模态大模型在Bleu-4,Rouge-1,Rouge-2,Rouge-L同样表现更佳,分别达到79.03%,82.46%,76.00%,86.46%。本研究结合舌象多标签分类系统以及大语言模型实现了中医舌象辅助问诊系统,该系统能够辅助中医进行舌诊,为中医辅助诊断方案的生成提供技术支持。

    Abstract:

    The lack of visual capability in large language models and the insufficient accuracy of existing multimodal models in Traditional Chinese Medicine (TCM) tongue-image analysis are identified as current challenges. To address these issues, a TCM tongue-image assisted consultation system was constructed, enabling automatic classification of tongue images and the generation of auxiliary diagnostic schemes. Based on UNet, tongue-image segmentation was performed, and a multi-label classification system was built using the TransNeXt backbone to achieve both tongue-body segmentation and multi-label classification. Furthermore, the classification system was integrated with a large language model through a fine-tuned function-calling framework, thereby realizing the TCM tongue-image assisted consultation system.In the validation experiments, the UNet model demonstrated favorable performance in tongue-image segmentation, with mean Intersection over Union, recall, and precision reaching 97.58%, 98.61%, and 96.25%, respectively. The multi-label classification model developed with the TransNeXt backbone exhibited superior performance in subset accuracy, precision, recall, and F1-score, achieving 75.69%, 91.18%, 91.41%, and 91.28%, respectively. In the generation of TCM auxiliary diagnostic schemes, the best-performing large language model outperforms the multimodal model in Bleu-4, Rouge-1, Rouge-2, and Rouge-L metrics, achieving 79.03%, 82.46%, 76.00%, and 86.46%, respectively. This study demonstrates that by combining the tongue-image multi-label classification system with a large language model, a TCM tongue-image assisted consultation system is realized. The system is capable of supporting TCM tongue diagnosis and provides technical support for the generation of auxiliary diagnostic schemes.

    参考文献
    相似文献
    引证文献
引用本文

施浩然,王月明,石磊,等. 基于大语言模型Function-Calling架构的中医舌象辅助问诊系统[J]. 科学技术与工程, 2026, 26(13): 5583-5593.
Shi Haoran, Wang Yueming, Shi Lei, et al. The Auxiliary Tongue Diagnosis System for Traditional Chinese Medicine Based on Large Language Model Function-Calling Architecture[J]. Science Technology and Engineering,2026,26(13):5583-5593.

复制
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2025-05-19
  • 最后修改日期:2026-02-03
  • 录用日期:2025-11-19
  • 在线发布日期: 2026-05-18
  • 出版日期:
×
2026年会通知 | “技术经济学驱动智能经济生态构建与治理变革”——中国技术经济学会第三十三届学术年会(2026)会议通知暨征文启事(第一轮)
亟待确认版面费归属稿件,敬请作者关注