Text classification based on dense-pooling connection and phrase attention

CLC number:

TP3-05

Fund Project:

The National Natural Science Foundation of China (General Program, Key Program, Major Program), Science and Technology Research Project of Jiangxi Provincial Department of Education

    Abstract:

    To address gradient vanishing, loss of feature information, and mismatched phrase dimensions in the attention mechanism when training neural networks for text classification, a text classification algorithm based on dense-pooling connection and a phrase attention mechanism is proposed. First, features are extracted through the residual network part of the dense-pooling connection, which alleviates gradient vanishing, and important features are reused through the pooling layers, which reduces the loss of feature information. Second, the conventional attention mechanism is modified into a phrase attention mechanism that flexibly captures the relations between phrases of different orders and resolves the phrase dimension mismatch of conventional attention. Experimental results show that the proposed model achieves the best performance among the compared models, reaching an accuracy of 92.7% on the AG news dataset. The convergence and classification accuracy of three comparison models are also analyzed under different hyperparameters. The results indicate that the improved model effectively alleviates gradient vanishing and resolves the mismatch in phrase dimension combination, thereby improving classification accuracy.
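    The abstract names three ingredients: residual feature extraction, reuse of pooled features from every layer, and attention between phrases of different orders. The pure-Python sketch below illustrates one possible reading of these ideas and is not the authors' implementation. All function names are hypothetical, a tanh of a windowed mean stands in for a learned convolution, and the attention step is ordinary scaled dot-product attention applied to phrase features that share one dimension.

```python
import math

def ngram_features(tokens, k):
    """Order-k phrase features: tanh of the mean of each window of k vectors.
    Assumption: this stands in for a learned convolution of width k.
    Left zero-padding keeps the output as long as the input."""
    d = len(tokens[0])
    padded = [[0.0] * d for _ in range(k - 1)] + tokens
    out = []
    for i in range(len(tokens)):
        window = padded[i:i + k]
        out.append([math.tanh(sum(v[j] for v in window) / k) for j in range(d)])
    return out

def residual_block(tokens, k):
    """Identity shortcut: output = input + F(input)."""
    f = ngram_features(tokens, k)
    return [[x + y for x, y in zip(t, r)] for t, r in zip(tokens, f)]

def max_pool(features):
    """Max over the sequence axis, one value per feature dimension."""
    return [max(col) for col in zip(*features)]

def dense_pooling(tokens, orders):
    """Stack residual blocks and keep the pooled vector of every block,
    so earlier features are reused instead of being discarded."""
    phrases, pooled, h = [], [], tokens
    for k in orders:
        h = residual_block(h, k)
        phrases.append(h)
        pooled.extend(max_pool(h))
    return phrases, pooled

def softmax(scores):
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def phrase_attention(queries, keys):
    """Scaled dot-product attention from one phrase order to another.
    Both orders share dimension d, so any pair of orders can be combined."""
    d = len(queries[0])
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, kv)) / math.sqrt(d) for kv in keys]
        w = softmax(scores)
        out.append([sum(wi * kv[j] for wi, kv in zip(w, keys)) for j in range(d)])
    return out

# Toy input: 4 tokens with 3-dimensional embeddings (made-up values).
tokens = [[0.1, -0.2, 0.3], [0.0, 0.5, -0.1], [0.4, 0.1, 0.2], [-0.3, 0.2, 0.0]]
phrases, pooled = dense_pooling(tokens, orders=[1, 2, 3])
attended = phrase_attention(phrases[0], phrases[2])
print(len(pooled), len(attended), len(attended[0]))  # prints: 9 4 3
```

    In this sketch the pooled vector grows by one block of features per layer, and the shortcut in `residual_block` gives gradients a direct path around each transformation, which is the usual argument for why residual connections ease gradient vanishing.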

Cite this article

黄卫春,陶自强,熊李艳. 密集池化连接和短语注意力下的文本分类算法[J]. 科学技术与工程, 2021, 21(17): 7193-7199.
Huang Weichun, Tao Ziqiang, Xiong Liyan. Text classification based on dense-pooling connection and phrase attention[J]. Science Technology and Engineering, 2021, 21(17): 7193-7199.

History
  • Received: 2020-10-06
  • Last revised: 2021-03-31
  • Accepted: 2021-02-21
  • Published online: 2021-07-02