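Research on Three-Dimensional Convolutional Neural Networks and Their Application in Video Understanding

BAI Jing①②  YANG Zhanyuan*①  PENG Bin①  LI Wenjing①
①(School of Computer Science and Engineering, North Minzu University, Yinchuan 750021)
②(National Ethnic Affairs Commission Image Graphics Intelligent Processing Laboratory, Yinchuan 750021)

Abstract: The 3D convolutional neural network (3D CNN) has been a hot topic in deep learning research in recent years and has achieved a great deal in computer vision. Although it has been studied for years and has produced abundant results, a comprehensive and detailed survey of the topic is still lacking. This paper therefore reviews it from the following aspects. First, the basic principles and model structure of the 3D CNN are described. Next, related improvements are summarized in terms of network structure, network internals, and optimization methods. Then, applications of the 3D CNN in video understanding are summarized. Finally, the paper is summarized and future directions are discussed. This paper systematically reviews the latest research progress on 3D CNNs and their applications in video understanding, which is of some positive significance for the research and development of 3D CNNs.

Keywords: Video understanding; Deep learning; 3D convolutional neural network; Network structure

CLC number: TP399    Document code: A    Article ID: 1009-5896(2023)06-2273-11
DOI: 10.11999/JEIT220596

Research on 3D Convolutional Neural Network and Its Application on Video Understanding

BAI Jing①②  YANG Zhanyuan①  PENG Bin①  LI Wenjing①
①(School of Computer Science and Engineering, North Minzu University, Yinchuan 750021, China)
②(National Ethnic Affairs Commission Image Graphics Intelligent Processing Laboratory, Yinchuan 750021, China)

Abstract: The 3D Convolutional Neural Network (3D CNN) has been a hot topic in deep learning research over the last few years and has made great achievements in computer vision. Despite years of research and abundant results, a comprehensive and detailed review of this topic is still lacking. In this paper, the 3D convolutional neural network is reviewed from the following aspects. Firstly, the basic principles and model structure of the 3D convolutional neural network are presented. Then the improvements to the 3D convolutional neural network are summarized in terms of network structure, network interior, and optimization methods. After that, the applications of the 3D convolutional neural network in the field of video understanding are summarized. Finally, the paper is summarized and future development directions are discussed. This paper provides a systematic review of the latest research progress of 3D convolutional neural networks and their applications in the field of video understanding, which is of positive significance to th...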