This work is licensed under the Creative Commons Attribution 4.0 International License.