首页期刊简介编委会征稿启事出版道德声明审稿流程读者订阅论文查重联系我们English
引用本文
  • 吴 进,安怡媛,代 巍.一种基于R3D网络的人体行为识别算法[J].电讯技术,2020,60(8): - .    [点击复制]
  • WU Jin,AN Yiyuan,DAI Wei.A Human Behavior Recognition Algorithm Based on R3D Network[J].,2020,60(8): - .   [点击复制]
【打印本页】 【下载PDF全文】 查看/发表评论下载PDF阅读器关闭

←前一篇|后一篇→

过刊浏览    高级检索

本文已被:浏览 1669次   下载 53 本文二维码信息
码上扫一扫!
一种基于R3D网络的人体行为识别算法
吴进,安怡媛,代巍
0
(西安邮电大学 电子工程学院,西安 710121)
摘要:
现有的行为识别算法不能充分地提取抽象的行为特征,为此提出了基于三维残差卷积神经网络(3D Residual Convolutional Neural Network,R3D)的人体行为识别算法。该网络在三维卷积神经网络(3D Convolutional Neural Network,3D-CNN)基础上加入了残差模块,可以更好地提取时空域的特征,然后通过改变步长大小进行特征图降维,提高网络效率,并加入批量归一化层和Softplus激活函数,提高网络的收敛速度和拟合能力;之后添加Dropout层,降低过拟合风险,并且使用全局平均池化层(Global Average Pooling,GAP)代替全连接层,克服了网络参数量过大的问题;最后,使用Softmax进行分类。实验结果表明,使用R3D网络在HMDB-51数据集上获得了62.3%的识别率。
关键词:  行为识别  三维残差卷积神经网络  批量归一化层  全局平均池化层
DOI:
基金项目:国家自然科学基金资助项目(61834005,61772417,61602377,61634004);陕西省重点研发计划项目(2017GY-060);陕西省自然科学基础研究计划项目(2018JM4018)
A Human Behavior Recognition Algorithm Based on R3D Network
WU Jin,AN Yiyuan,DAI Wei
(School of Electronic Engineering,Xi′an University of Posts and Telecommunications,Xi′an 710121,China)
Abstract:
In view of the problem that the existing behavior recognition algorithms can not extract abstract behavior features fully,a human behavior recognition algorithm based on 3D residual convolutional neural network(R3D) is proposed.Because residual module is added,this network based on 3D convolutional neural network(3D-CNN) can better extract the features of space-time domain,then reduces the dimension of feature map by changing the step size and improves the efficiency of the network.And batch normalization layer and Softplus activation function are added to improve the convergence speed and fitting ability of the network.Then Dropout layer is added to reduce the risk of over fitting,and global average pooling(GAP) instead of full connection layer is used to overcome the problem of too large network parameters.Finally,Softmax is used for classification.The experimental results show that the recognition rate of HMDB-51 dataset is 62.3% by using R3D network.
Key words:  behavior recognition  3D residual convolutional neural network  batch normalization layer  global average pooling layer
安全联盟站长平台