Please use this identifier to cite or link to this item: https://swslhd.intersearch.com.au/swslhdjspui/handle/1/14215
Title: GVT2RPM: An empirical study for general video transformer adaptation to remote physiological measurement
Author: Wang, H.
Ahn, E.
Joseph, A.
Pathan, F.
Negishi, K.
Kim, J.
SWSLHD Author: Negishi, Kazuaki
Issue Date: 2026
Journal: Biomedical Signal Processing and Control
Abstract:  Remote physiological measurement (RPM) is an essential tool for healthcare monitoring as it enables the measurement of physiological signs, e.g., heart rate, in a remote setting, via physical wearables. Recent advancements in facial video-based RPMs have leveraged video analysis to detect photoplethysmographic (PPG) changes by learning pixel variations across frames. Transformer architectures, known for their success in natural video understanding, have also been applied to facial video-based RPM. However, existing transformer-based RPM methods often rely on RPM-specific modules, such as temporal difference convolutions and handcrafted feature maps, to capture subtle physiological signals and enhance temporal feature extraction. While these customized modules can improve performance, they lack robustness across datasets and cannot be generalized to different transformer architectures due to their high degree of customization. In this study, we demonstrate that general video transformers (GVTs) can achieve state-of-the-art performance for RPM without the need of RPM-specific modules. This approach simplifies the design process and facilitates the rapid deployment of various GVT architectures for RPM tasks. We conducted an empirical investigation into how training designs, including data preprocessing and network configurations, influence the performance of GVTs in facial video-based RPM. Furthermore, we propose practical guidelines to adapt GVTs to RPM (GVT2RPM) without the need for RPM-specific modules. Our experiments, conducted on five datasets using both intra-dataset (training and testing on the same dataset) and cross-dataset (training and testing on different datasets) settings, demonstrate that the proposed GVT2RPM guidelines outperform existing RPM-specific counterparts in most of cases. In intra-dataset experiments, it reduced mean absolute error by 5.0% (UBFC-rPPG), 35.6% (MMPD-simple), and 38.2% (MMPD). In cross-dataset experiments, it achieved reductions of 4.3% (UBFC-Phys), 13.2% (MMPD-simple), 9.5% (MMPD), and 13.4% (RLAP). The results demonstrate that our guidelines can be applied across various GVT architectures and are robust to diverse datasets, making them a promising solution for advancing RPM methodologies. � 2025 The Author(s)
ISSN: 17468094 (ISSN)
Digital object identifier: 10.1016/j.bspc.2025.108886
URI: https://swslhd.intersearch.com.au/swslhdjspui/handle/1/14215
Department: Liverpool Hospital, Department of Cardiology
Appears in Collections:Liverpool Hospital

Files in This Item:
There are no files associated with this item.


Items in Prosentient are protected by copyright, with all rights reserved, unless otherwise indicated.

Google Media

Google ScholarTM

Who's citing