基于半监督学习的单视角点云三维人体姿态与形状估计

3D human pose and shape estimation from single-view point clouds with semi-supervised learning

在线阅读下载PDF

导出

摘要在有限标签样本的条件下,单视角点云的三维人体姿态和形状估计一直存在模型估计精度低、泛化能力弱等问题。现有的方法通常采用微调方法优化模型,但对新样本的微调步骤大大增加了运行复杂度,本质上没有提高模型的泛化能力。为解决以上问题,提出了一种基于半监督学习的三维人体姿态与形状估计方法,在有限的标签数据条件下,利用大量无标签人体点云数据提高模型估计精度和泛化能力。具体地,首先对无标签数据进行弱增强和强增强,同时估计2种增强样本的三维人体参数模型。然后对弱增强样本的预测结果进行伪标签准确性判断,并基于一致性正则化思想约束强增强样本的预测结果,以迭代方式逐步优化伪标签质量和增加用于训练的伪标签数量,进而提升模型的估计精度。该算法在多种公开数据集上做了充分的定量和定性实验,实验结果证明该算法在有限标签样本的条件下提高了三维人体姿态和形状的估计精度,并增强了模型的泛化性能。 Under the condition of limited labeled samples,estimating 3D human pose and shape from single-view point clouds has consistently encountered issues such as low model estimation accuracy and weak generalization capability.Existing methods typically use a fine-tuning step to optimize the models for limited labeled samples,but this fine-tuning process significantly increases computational complexity and without fundamentally enhancing model generalization.To address these issues,a semi-supervised learning-based method was proposed for 3D human pose and shape estimation.Under conditions of limited labeled data,the proposed method utilized a large amount of unlabeled human point clouds to improve model accuracy and generalization capability.Specifically,weak and strong augmentations were applied to the unlabeled data,and 3D human parameter models were estimated for both types of augmented samples.Then,the accuracy of pseudo-labels for weakly-augmented samples was evaluated,and the predictions of strongly augmented samples were constrained based on consistency regularization.The procedure above was applied iteratively to gradually refine the quality of pseudo-labels and increase the number of pseudo-labels for training,thereby enhancing the model’s estimation accuracy.Extensive quantitative and qualitative experiments on various public datasets demonstrate that the proposed method enhanced the accuracy of 3D human pose and shape estimation under conditions of limited labeled samples and enhanced model generalization performance.

作者方程浩王康侃 FANG Chenghao;WANG Kangkan(Key Laboratory of Intelligent Perception and Systems for High-Dimensional Information of Ministry of Education,Nanjing University of Science and Technology,Nanjing Jiangsu 210094,China)

机构地区南京理工大学高维信息智能感知与系统教育部重点实验室

出处《图学学报》北大核心 2025年第2期393-401,共9页 Journal of Graphics

基金国家自然科学基金(62472224) 中央高校基础研究基金(NJ2023032) 浙江大学计算机辅助设计与图形系统全国重点实验室开放课题(A2311) 南京大学计算机软件新技术全国重点实验室开放课题(KFKT2024B37)。

关键词三维人体姿态与形状估计单视角点云半监督学习伪标签点云数据增强 3D human pose and shape estimation single-view point clouds semi-supervised learning pseudo-label data augmentation of point cloud

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

1王默涵.墨子的政治伦理思想[J].今古文创,2024(10):72-74. 被引量：1
2尕玛央宗.个人主义与儒家“礼”思想之间的博弈[J].中文科技期刊数据库(全文版)社会科学,2020(2):00366-00367.
3李震宇,安浩,刘励韬,孙坤.基于改进麻雀搜索算法模型的阵列优化[J].建模与仿真,2025,14(1):982-996.

图学学报

2025年第2期

浏览历史

内容加载中请稍等...

基于半监督学习的单视角点云三维人体姿态与形状估计

相关作者

相关机构

相关主题

浏览历史