Figures
Abstract
In this work, we propose a non-contact video-based approach that estimates an individual’s blood pressure. The estimation of blood pressure is critical for monitoring hypertension and cardiovascular diseases such as coronary artery disease or stroke. Estimation of blood pressure is typically achieved using contact-based devices which apply pressure on the arm through a cuff. Such contact-based devices are cost-prohibitive as well as limited in their scalability due to the requirement of specialized equipment. The ubiquity of mobile phones and video-based capturing devices motivates the development of a non-contact blood pressure estimation method—Video-based Blood Pressure Estimation (V-BPE). We leverage the time difference of the blood pulse arrival at two different locations in the body (Pulse Transit Time) and the inverse relation between the blood pressure and the velocity of blood pressure pulse propagation in the artery to analytically estimate the blood pressure. Through statistical hypothesis testing, we demonstrate that Pulse Transit Time-based approaches to estimate blood pressure require knowledge of subject specific blood vessel parameters, such as the length of the blood vessel. We utilize a combination of computer vision techniques and demographic information (such as the height and the weight of the subject) to capture and incorporate the aforementioned subject specific blood vessel parameters into our estimation of blood pressure. We demonstrate the robustness of V-BPE by evaluating the efficacy of blood pressure estimation in demographically diverse, outside-the-lab conditions. V-BPE is advantageous in three ways; 1) it is non-contact-based, reducing the possibility of infection due to contact 2) it is scalable, given the ubiquity of video recording devices and 3) it is robust to diverse demographic scenarios due to the incorporation of subject specific information.
Citation: Dasari A, Jeni LA, Tucker CS (2025) Video-based estimation of blood pressure. PLoS ONE 20(1): e0311654. https://doi.org/10.1371/journal.pone.0311654
Editor: Nayan Ranjan Singha, Maulana Abul Kalam Azad University of Technology West Bengal, INDIA
Received: May 4, 2024; Accepted: September 23, 2024; Published: January 30, 2025
Copyright: © 2025 Dasari et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: Data Availability Statement: The data utilized for the study is available at the following Github repository: https://github.com/AiPEX-Lab/vbpe Code Availability Statement: The code utilized for the study is available at the following Github repository: https://github.com/AiPEX-Lab/vbpe.
Funding: A.D., L.A.J. and C.S.T. received the award from the Bill and Melinda Gates Foundation, grant number INV-002154. The website of the funding agency is: https://www.gatesfoundation.org The sponsors and funders did not play a role in the study design, data collection, analysis or manuscript preparation. The authors would like to thank Mr. Srihari Boregowda, President, AIRA Sociocare, from the India data collection team for their efforts in collecting Indian subject data. We would like to thank Mr. Nathaniel Houston and Mr. Sheku Kamara from the Sierra Leone data collection team for their efforts in collecting Sierra Leone subject data.
Competing interests: The authors have declared that no competing interests exist.
Introduction
The cardiovascular system, comprising primarily of the heart and the blood vessels, is responsible for regulating the flow of blood to all organs of the body through the periodic pumping action of the heart [1]. The pumping action of the heart causes the circulating blood to exert a force on the walls of the arteries, which is termed as the blood pressure [1–3]. Sustaining adequate blood pressure in a narrow range is vital to the transport of nutrients and oxygen through the blood to various organs of the body [2]. The narrow range of blood pressure is maintained through the interplay of various physiological processes such as the cardiac output, peripheral resistance and reflex mechanisms [3, 4]. Impairment in the aforementioned physiological processes due to disease causes either higher-than-adequate blood pressure (hypertension) or lower-than-adequate blood pressure (hypotension) [5, 6]. Hypertension increases the risk of heart attack and stroke, as well as cardiovascular and kidney diseases [3–5]. Hypotension impedes oxygen transport to the organs, causing fatigue and dizziness [3, 4, 6]. Blood pressure levels are influenced by multiple factors, such as cardiovascular health, age, lifestyle, environment and genetics highlighting the importance of blood pressure estimation [7].
Blood pressure is typically reported as two values—the systolic blood pressure (the maximum pressure exerted by the heart during a pulse) and the diastolic blood pressure (the pressure in the arteries during the time between consequent pulses) [8]. The exact values of systolic and diastolic blood pressure are measured in clinical settings through the utilization of an intra-arterial catheter invasively placed in various arteries and connected to a pressure transducer [9]. The invasive nature of the intra-arterial catheter increases the risk of infection during blood pressure monitoring and is not implementable in non-clinical settings [9]. The auscultatory method utilizes a non-invasive, contact-based mercury sphygmomanometer (blood pressure measurement device) and a stethoscope and has been the gold standard for blood pressure estimation [10, 11]. However, the auscultatory method also carries the risk of infection through contact and requires trained medical personnel to estimate blood pressure, reducing its practicality in non-clinical settings [11]. Also, the growing concern over mercury-based devices has reduced the utilization of the auscultatory method in blood pressure measurement [10]. The oscillometric method detects pressure oscillations through a cuff placed on the subject’s upper arm and estimates the blood pressure through an empirical algorithm [9, 10]. The oscillometric method is now accepted as the gold standard for the measurement of blood pressure [9, 10]. Such cuff-based contact-based devices are cost-prohibitive and carry the risk of infection, necessitating sanitizing procedures. Cuff-less blood pressure estimation devices utilize the electrocardiogram (ECG) or the photoplethysmograph (PPG) signals to calculate the signal correlation which provides an estimate of the blood pressure [12–14]. The ECG signal is measured using a Holter monitor which detects electrical disturbances under the skin corresponding to the pulsations of the heart [15]. PPG signals are detected using light intensity oscillations transmitted through the skin, which correspond to the heart beat [16]. Non-contact blood pressure estimation approaches have focused on analyzing the recorded ECG and PPG signals to calculate the blood pressure. Such methods reduce the requirement for contact-based sensors by utilizing remote photoplethysmography (rPPG) to estimate the PPG signal. However, the ECG signal still requires a contact-based lead to be affixed to the skin, limiting the scalability of such non-contact-based approaches.
In this study, we introduce V-BPE, a novel Video-based Blood Pressure Estimation method that leverages rPPG signals extracted from video recordings to estimate systolic and diastolic blood pressure. V-BPE specifically captures rPPG signals from two key anatomical locations—the face and the hand—both of which are concurrently visible within a single video frame. The face and hand are selected due to their suitability for unobstructed rPPG signal acquisition [17]. V-BPE utilizes a standard video recorded with a mobile phone/computer camera with a minimum video resolution requirement of 320 × 240 pixels and a frame rate requirement of 30 frames per second (fps).
V-BPE estimates the blood pressure based on the inverse relation between the blood pressure and the velocity of blood pressure pulse in the artery or Pulse Wave Velocity (PWV) [18]. To estimate PWV, we compute Pulse Transit Time (PTT) [19], defined as the temporal delay between rPPG signals obtained from two regions at different distances from the heart (such as the face and the hand). We demonstrate the critical role of demographic factors such as the subject’s height (measured by a medical grade height scale) in the estimation of PWV. Such demographic factors are necessary to estimate key physiological parameters such as blood vessel length and blood velocity, which are essential for blood pressure estimation. Existing video-based blood pressure estimation methods often bypass the inclusion of such demographic factors, thereby compromising robustness in diverse, real-world conditions. To this end, we create a dataset containing subject videos with ground truth blood pressure and associated demographic information. V-BPE incorporates the demographic information to compute key physiological parameters (such as blood vessel length and blood velocity) to perform robust estimation of systolic and diastolic blood pressure.
In summary, the key contribution of the current study is the development of V-BPE, a demographically robust non-contact blood pressure estimation approach. Our statistical analysis underscores the necessity of incorporating subject specific demographic data in blood pressure estimation. Additionally, we introduce a new dataset to rigorously evaluate the performance and robustness of existing video-based blood pressure estimation methods compared to V-BPE in real-world, outside-the-lab environments.
Research on the estimation of blood pressure can be classified into contact-based and non-contact-based approaches. The following sections review these approaches, highlighting the contributions of the current work.
Contact-based blood pressure estimation
Contact-based approaches for estimating blood pressure have focused on the detection of the ECG and the PPG signals from different locations on the body and analyzing properties such as signal correlation to estimate the blood pressure. Gesche et al. developed a linear regression model to relate the systolic blood pressure with the pulse wave velocity calculated from the time difference between the ECG and finger PPG signals [13]. The study is limited to deployment in clinical settings due to the requirement for chest electrodes to detect the ECG signal. Chen et al. utilized two separate contact-based PPG signals from the ear and the toe and developed empirical relations to calculate PWV for different levels of systolic blood pressure values [20]. The study shows that different PWV values can be utilized to compute the systolic and the diastolic blood pressure values separately. However, the empirical relations developed are not generalizable to a wide range of blood pressure values, limiting their practicality. Ma developed an ECG-PTT-based blood pressure estimation method and introduced subject specific calibration [12]. The approach however requires an initial blood pressure measurement to be provided as a reference. Buxi et al. demonstrated that frequent subject specific re-calibration is necessary to ensure the accuracy of the ECG-PTT based approach to estimate blood pressure [14]. Bang et al. proposed a bio-impedance and ECG based approach to estimate the electrical signals of the blood flowing through the radial artery in the wrist [21]. However, bio-impedance signal collection is limited by the necessity of a contact wrist band integrated with electrodes to measure electrical changes in the blood, restricting their utilization due to the risk of infection. Studies have also utilized the Penaz principle which clamps the arteries in the finger with a volume clamping device to continuously monitor blood pressure [22]. Such clamping devices introduce errors due to either insufficient or excess pressure leading to incorrect blood pressure estimates. Chandrasekhar et al. modified a smartphone to apply the pressure to clamp the finger to continuously monitor blood pressure [23]. Though the smartphone integration improves the accessibility of non-clinical blood pressure estimation, the modifications require the application of external pressure limiting their use in outside-the-lab environments.
In summary, contact-based approaches have employed cuff-based pressure application methods and cuff-less ECG-PPG-based methods. ECG-PPG methods attempt to estimate the blood pressure based on empirical estimates, which do not include subject specific demographic information. Moreover, the necessity of contact between the device and the skin introduces the risk of infection and person-person transmission. Additionally, the requirement of proprietary devices such as blood pressure cuffs and ECG detecting sensors reduces the ubiquity of contact-based approaches. Such limitations motivate the development of non-contact-based approaches to estimate the blood pressure.
Non-contact-based blood pressure estimation
Non-contact-based approaches to estimate blood pressure have explored video-based solutions due to the ubiquity of video recording devices such as computers and mobile phones. Sugita et al. explored the correlation between the PTT computed from different rPPG signals and the blood pressure [24]. Murakami et al. report a high negative correlation between PTT (obtained from PPG signals of the wrist and ankle) and blood pressure [25]. Both the studies [24, 25] establish the correlation between the PTT and the blood pressure. To estimate the value of the blood pressure, studies have utilized the Moens Korteweg equation, which relates the PWV with arterial distensibility [26]. However, the equation assumes isotropic arterial wall conditions, which are atypical of actual arterial wall properties. To account for the physical properties observed in the actual arterial wall, the Moens Korteweg equation is modified in studies by Bramwell and Hughes to include non-isotropic arterial wall conditions using data driven methods or finite element analysis [27, 28]. Though the Moens Korteweg equation is developed for a single blood vessel, it can relate the PTT (which is measured from two different blood vessels) to the blood pressure by computing the PWV as an intermediate [27–29]. Studies have shown that the modified Moens Korteweg relations are limited by the demographic variability among populations [30]. To reduce the dependency on demographic parameters, Ma et al. developed a PWV-Blood Pressure relation based on the Fung hyper-elastic model of the artery. The study relies on arterial studies performed in vitro, and does not provide an analysis of the effect of demographic variability on the estimated values of blood pressure in outside-the-lab settings. Wang et al. developed a cuff-less approach for estimating the blood pressure utilizing the finger PPG, ECG and the heart rate using a modified form of the Moens Korteweg equation [31]. Liu et al. employed multi-wavelength PPG signal detection to analyze the skin penetration depth of light at different wavelengths [32]. The wavelength dependent skin penetration depth of light computes the PTT which is utilized to estimate the mean blood pressure through an empirical relation. Hassan et al. utilized heart rate information in addition to the ECG signal and reported a strong correlation between the heart rate and the systolic blood pressure [33]. However, the study reports that the explicit determination of the heart rate to estimate the blood pressure could introduce errors due to incorrect heart rate estimation. Finnegan et al. explored a diverse set of features from ECG and PPG signals to inform blood pressure estimation [34]. The approach develops regression equations based on chemically induced blood pressure changes in the body, which is not representative of the natural blood pressure changes. Jeong et al. [35] explored the correlation between blood pressure and the PTT signal and reported that the level of correlation varied among subjects. Secerbegovic et al. [36] developed a regression-based approach to relate the variation of blood pressure with the PTT and demonstrated that regression-based approaches can inform blood pressure variation rather than the absolute value of blood pressure. Zhou et al. [37] developed a regression-based approach linking the blood pressure to the subject’s Body Mass Index (BMI) in various environemntal conditions. Hamoud et al. [38] developed a deep convolutional neural network approach to identify skin regions of interest and estimate the blood pressure from facial videos. The approach demonstrated a good correlation with additional vital signs such as respiration rate. Trirongjitmoah et al. [39] utilized a continuous wavelet transform of the PPG signal to learn the SBP and BP utilizing a deep convolutional neural network. Luo et al. [40] utilized transdermal optimal imaging to identify a blood haemoglobin signal from a video of a subject’s face. Feature engineering was employed to identify a set of features that correlate with the blood pressure. However, the approach is tested in laboratory settings and is constrained to a narrow range of blood pressure values. Such constraining is typically encountered in machine learning approaches and has been identified as a critical challenge in the estimation of blood pressure [41].
In summary, non-contact blood pressure estimation approaches have focused on the video-based identification of relevant signals such as the ECG and the PPG signals. Non-contact approaches have been limited by the demographic biases that affect the blood pressure estimation. Table 1 shows a summary of the existing studies in blood pressure estimation as compared to the current work.
In this work, we utilize an RGB camera that is a component of most mobile phones and computer-based systems. Such availability improves the accessibility of V-BPE due to the ubiquity of devices with video recording capabilities, such as smartphones or digital cameras. V-BPE utilizes a modified form of the Moens Korteweg equation to estimate the blood pressure [42, 43]. Subject specific parameters such as the lengths of blood vessels are estimated using demographic information such as the height of the subject and a full length image of the subject. Two locations—the face and the hand are selected to estimate rPPG signals [20, 44]. Dasari et al. reported that the CHROM [44] and BKF [45] algorithms demonstrate consistency of rPPG signal extraction in demographically diverse conditions [46]. Leveraging the consistency of rPPG signal extraction in demographically diverse settings, V-BPE utilizes the CHROM algorithm to extract the rPPG signal. The mean peak-peak time difference between the two signals is computed to obtain the face-to-hand PTT, which is critical for the calculation of the blood pressure using the modified Moens Korteweg equation. V-BPE computes different PTT values to independently estimate the systolic and diastolic blood pressure values. Studies by Hamoud et al. [38] Chen et al. [47], Cheng et al. [48] and Xing et al. [49] explore deep learning models such as Convolutional Neural Networks and Recurrent Neural Networks to estimate the blood pressure, utilizing information from facial videos and ECG signals. Mehta et al. have reported that deep learning approaches of estimating vital signs face challenges such as over-constraining the task (the distribution of ground truth is limited to a certain range) and data leakage (where data similar to the training data is present in the test data) [41]. Such challenges introduce bias in vital sign estimation approaches. V-BPE does not involve a data-driven training procedure, mitigating the effect of data leakage. We mitigate the effect of task over-constraining by creating the Face-Hand dataset which consists of subject videos in outside-the-lab settings with associated demographic and ground truth information. The Face-Hand dataset consists of unfiltered ground truth data from FDA-approved measurement devices and does not constrain the range of blood pressure values. V-BPE is rigorously tested on the Face-Hand dataset to assess its robustness and efficacy. Additionally, the Face-Hand dataset is employed to evaluate the robustness of existing blood pressure estimation methods, providing a comparative analysis against V-BPE. In summary, the novel contributions of this study are:
- The development and comparative evaluation of a non-contact-based approach to estimate human blood pressure, that is scalable to resource constrained environments
- The evaluation of the effect of demographic factors on the computation of pulse wave velocity for the estimation of blood pressure
- The creation of the Face-Hand dataset consisting of videos and demographic data of subjects with their face and hand in the same video frame, to facilitate PTT-based blood pressure estimation in outside-the-lab settings
Methods
Overview
V-BPE utilizes a pulse transit time (PTT) based approach to estimate the blood pressure rather than inducing external pressure on the artery. V-BPE utilizes a face and hand tracking algorithm to identify facial and hand landmarks of the subject in a video. The extracted facial and hand landmarks are processed by a remote-PPG algorithm to extract the PPG signal. The PTT is calculated utilizing the peak-peak difference between the two PPG signals (face PPG and hand PPG). V-BPE utilizes the calculated PTT to estimate the systolic and diastolic blood pressures. The Moens Korteweg equation [42, 43] relates the Pulse Wave Velocity (PWV) to the arterial distensibility and is given by
(1)
where PWV is the pulse wave velocity, E is the modulus of elasticity of the artery, h is the thickness of the artery wall, ρ is the density of blood and D is the arterial diameter. The modulus of elasticity, E can be expressed in terms of the blood pressure as follows [28].
(2)
where E0 is the zero pressure modulus of elasticity of the arterial wall, α is a constant and BP is the blood pressure. From Eqs 1 and 2, we obtain the following relation for the PWV.
(3)
The PWV can be computed from the PTT by utilizing the difference (L) in the lengths of the artery at the two locations [29, 31, 50].
(4)
From Eqs 3 and 4, we obtain the following relation between PWV and BP
(5)
Rearranging Eq 5 to compute the BP, we obtain
(6)
Applying the natural logarithm, we obtain
(7)
Simplifying Eq 7, we obtain the value of BP as
(8)
Eq 8 relates the blood pressure (BP) to the PTT and subject specific parameters (α, ρ, L, D, h and E0). Eq 8 can be utilized to calculate the systolic and diastolic blood pressures separately by using the systolic pulse transit time and the diastolic pulse transit time respectively [31] (Eqs 9 and 10).
(9)
(10)
where Ds and Dd are the systolic and diastolic arterial diameters respectively. The systolic and diastolic pulse transit time (PTTs and PTTd respectively) are computed as described in Section: Estimation of pulse transit time.
Estimation of pulse transit time
As depicted in Fig 1, we estimate the PTT by utilizing two rPPG signals from a single video of the face and the hand. We ensure that the subject’s face and open hand (palm facing the camera) are clearly visible in the frame of the video (Fig 2). In order to identify crucial regions of interest on the face and the hand that provide a strong PPG signal, we utilized a landmark tracking algorithm that identifies landmarks on the face and the hand throughout the duration of the video. For V-BPE, we utilize Google Mediapipe [51] as the landmark tracking approach to identify landmarks on the face and the hand. Google Mediapipe is a framework that performs real-time landmark detection and tracking on a video of a person and is robust to illumination and environmental conditions. We employed Google Mediapipe to simultaneously track landmarks on both the face and the hand given an input video (Fig 2). Google Mediapipe provides 468 landmarks on the face and 36 landmarks on the hand. We constructed regions of interest on the forehead and the palm utilizing the landmark information on the forehead and the palm, from Google Mediapipe. Moreover, the forehead and the palm are clearly visible throughout the video. The employment of a face tracking algorithm such as Google Mediapipe to construct regions of interest also reduces the influence of face and hand motion during the video. We note that V-BPE can utilize a generic face or hand tracking algorithm and is not limited to Google Mediapipe. Additionally, we note that despite differences in the arteries supplying blood to the face and hand, pulse transit time computation and the utilization of Eqs 9 and 10 is not affected [31].
Schematic diagram of V-BPE for estimating blood pressure demonstrating steps such as face/hand tracking and rPPG signal extraction.
Subject in a seated position with right hand beside the ear with the open palm facing the camera (left). Face and hand tracking demonstrating identified landmarks on the face and the hand (right).
A remote photoplethysmography algorithm is employed to extract the PPG signal from the constructed regions of interest. For V-BPE, we utilized the CHROM algorithm to analyze the forehead and palm regions of interest and extract the PPG signal [44]. CHROM is an open source chrominance-based method that extracts the PPG signal from a video of human skin. CHROM extracts two PPG signals (PPGFace and PPGHand). We filtered the obtained raw PPG signals using the butterworth bandpass filter (low cut-off frequency = 0.25 Hz, high cut-off frequency = 15 Hz) to remove low frequency noise and highlight the peaks of the signals. The peak detection algorithm [52, 53] identifies the locations of the peaks in PPGFace and PPGHand. We compute the time difference between each peak in PPGFace and the corresponding peak in PPGHand and utilize the mean peak-peak time difference as the PTT between PPGFace and PPGHand for systolic blood pressure estimation. For diastolic blood pressure estimation, we utilize the mean valley-valley time difference as the PTT between PPGFace and PPGHand [31]. We compute the mean valley-valley time difference by inverting the PPGFace and PPGHand signals and utilizing the peak detection algorithm (Fig 3).
Systolic pulse transit time (PTTs) is determined by computing the mean peak-peak distance between two PPG signals from different locations on the body. Diastolic pulse transit time (PTTd) is determined by computing the mean valley-valley distance between two PPG signals from different locations on the body.
Estimation of subject specific parameters
Having obtained the PTT for systolic and diastolic blood pressure, we utilize demographic information to compute the subject specific parameters in Eqs 9 and 10 (α, ρ, L, D, h and E0). The utilization of demographic information is critical due to the reduced correlation between the PTT estimated from the peak/valley of the PPG signal and the SBP/DBP due to wave reflections in the late systole. Demographic features such as the height of the subject are related to the subject specific parameters such as the blood vessel length difference [54, 55].
The blood vessel length difference between the face and the hand is estimated utilizing a full length image (Fig 4) of the subject (where the subject is standing with arms straight by the side of the body as shown in Fig 4) and the height of the subject. The height of the subject is proportional to their height in the full length image [54, 56]. This can be represented as
(11)
where Heightpixels is the height of the subject in the full length image in pixels, Heightcm is the actual height of the subject in centimeters (measured using a medical grade height scale) and p is the proportionality constant. The actual height of the subject is obtained from the subject specific demographic information (Fig 4). The proportionality constant p incorporates the distance of the subject from the camera since the Heightpixels is proportional to the distance of the subject from the camera. This ensures that the mapping in Eq 11 is independent of the distance of the subject from the camera. Such independence is critical in outside-the-lab settings, where the distance of the subject from the camera is variable.
Full length image of standing subject demonstrating the posture of the arms. The height in centimeters is extracted from the demographic data (right) and the height in pixels is identified from the full length image of the subject (left). The proportionality relation (Eq 11—with proportionality constant p) can be used to estimate the heart-to-hand and the heart-to-head distances.
The length of the blood vessel from the heart to the face/hand is proportional to the heart to face/hand distance in pixels in the full length image [54, 56].
(12)
(13)
where distFpixels and distFcm are the face to heart distances in pixels and centimeters respectively and distHpixels and distHcm are the hand to heart distances in pixels and centimeters respectively. To estimate the proportionality constant p, we compute Heightpixels by identifying the landmarks of the person on the full length image. Utilizing the pose estimation algorithm from Google Mediapipe, we compute Heightpixels, distFpixels and distHpixels (Fig 4). Having computed the Heightpixels, we utilize the height of the person (Heightcm) obtained from subject specific demographic information to calculate p. Having calculated p, we calculate distFcm and distHcm using
(14)
(15)
Subtracting Eq 15 from Eq 14, we estimate the blood vessel length difference (scalar difference (Fig 4) [57, 58]) between the face and the hand (L) as
(16)
Studies have demonstrated the relation between the parameters of the artery and the demographic parameters such as the height, age and weight of the subject [55, 59, 60] (Table 2). Kozakova et al. discovered a positive correlation between the inner radius of the artery with the weight of the subject [59]. Samijo et al. found that the age and gender of the subject are significant determinants of the artery radius [60]. Hwaung et al. established the following relation between the diameter of the artery (D) and demographic parameters such as age, height and weight through experimental validation (n = 231). We note that the diameter of the artery is assumed to be constant throughout the length of the artery.
(17)
Hwaung et al. established the relation between the artery wall thickness with demographic parameters such as age, height and weight of the subject [55]. Through experimental validation, the following correlation was developed linking the artery wall thickness (h) with the age, height and weight of the subject.
(18)
The density of the blood (ρ) and the zero pressure modulus of elasticity (E0) do not demonstrate significant variability with demographic parameters and can be regarded as constants [61, 62]. V-BPE assumes the density of blood to be 1060 kg/m3, the zero pressure modulus of elasticity to be 1005 and α to be 0.017 [29, 62].
Experimental protocol to develop the Face-Hand dataset
The experimental setup consists of a mobile phone (Google Pixel 6) affixed to a tripod stand placed at a distance of 0.7 m from a seated subject. The phone is oriented in the landscape mode to enable the capture of a larger field of view. The mobile phone records RGB video at a frame rate of 60 fps and a resolution of 1920 × 1080 pixels. The subject is instructed to remain still and place their open right palm beside their face with the open palm facing the camera and the thumb touching the ear (Fig 2). The ground truth blood pressure information is collected using an FDA-approved cuff-based blood pressure measurement device (OMRON-BP 7000). The ground truth consists of a single value for the entire video in the SBP/DBP format. Subject height and weight are determined by medical grade height and weight scales. Informed written consent is obtained from the subject in accordance with the relevant guidelines and regulations of The Carnegie Mellon University’s Institutional Review Board (IRB: STUDY2019_00000362). Ethics approval for the study was also granted by the Government of Sierra Leone, Office of the Sierra Leone Ethics and Scientific Review Committee. Ethics approval for the study was also granted by Yelahanka General Hospital, Yelahanka, Bengaluru, India. Recruitment and data collection for the study was performed from May 5 2022 to October 11 2022.
The experimental protocol to record the video of the subject to create the Face-Hand dataset consists of 1) ensure the subject is seated upright facing the camera with their right hand beside their ear and facing the camera with the palm open 2) ensure the cuff-based blood pressure measurement device is affixed to the subject’s left upper arm 3) record RGB video of the subject’s face and hand in the same video frame for a duration of 120 seconds while the cuff-based device captures the ground truth blood pressure data 3) instruct the subject to stand upright with hands by their sides and facing the camera 4) capture a full length image of the standing subject 5) collect demographic information such as height, weight and age utilizing medical grade height and weight scales. The collected videos are analyzed to ensure video quality in terms of resolution, frame rate and subject posture. We collected video data from 200 subjects in outside-the-lab conditions in Sierra Leone and 200 subjects in outside-the-lab conditions in India (N = 400). The recorded videos are collected into the Face-Hand dataset and are utilized to evaluate the performance of V-BPE in outside-the-lab conditions.
Inclusivity in global research
Additional information regarding the ethical, cultural, and scientific considerations specific to inclusivity in global research is included in the Supporting Information (S1 Questionnaire).
Results and discussion
Dataset
The dataset used in this study (Face-Hand dataset) consists of the video and ground truth blood pressure data of 200 subjects from India and 200 subjects from Sierra Leone. The sample size of 200 subjects from each country was obtained through statistical power analysis (Statistical power = 0.9 for effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, degrees of freedom = 1). The videos are recorded according to the experimental protocol described (Section: Experimental protocol to develop the Face-Hand dataset) ensuring the face and the hand are visible in the frame of the video (Fig 2). The videos in the Face-Hand dataset are recorded in an outside-the-lab environment where ambient light conditions or backgrounds are not controlled. Of the 200 subjects from India, 56 subjects are male and 144 subjects are female and of the 200 subjects from Sierra Leone, 59 subjects are male and 141 subjects are female. Additionally, the dataset contains ground truth demographic and vital sign information comprising of subject age (Mean ± Std: 40.35 ± 14.6 years; Min-Max: 17–82 years), subject height (Mean ± Std: 158.1 ± 15.1 cm; Min-Max: 68–200 cm), subject weight (Mean ± Std: 63.80 ± 14.7 kg; Min-Max: 34–160 kg), systolic blood pressure (Mean ± Std: 121.2 ± 19.7 mmHg; Min-Max: 85–202 mmHg) and diastolic blood pressure (Mean ± Std: 78.45 ± 13.3 mmHg; Min-Max: 28–126 mmHg) (Table 3). The distribution of subject skin tones is determined according to the Monk skin tone scale [64] (Fig 5). Performing the Shapiro-Wilk test, we obtain a p-value of 0.0475, rejecting the null hypothesis that the skin tone is normally distributed. Consequently, this result indicates that the distribution of skin tones in the Face-Hand dataset deviates from normality, underscoring the demographic diversity present within the dataset. We analyze the performance of V-BPE in outside-the-lab conditions utilizing the Face-Hand dataset, with the statistical package Scipy 1.13.0 running on Python 3.8. No data preprocessing was performed as normality of the data could not be assumed.
Distribution of skin tone of subjects in the Face-Hand dataset according to the Monk scale [64].
We ensure that the videos in the Face-Hand dataset consist of a single subject, with the face and hand clearly visible in the frame of the video. The face and the hand of the subject are simultaneously tracked using a face/hand tracker that can provide dense facial and hand landmarks. We process the video to remove the frames where the hand/face is incorrectly positioned or occluded, causing a loss of hand/face tracking. Studies have demonstrated that the forehead and the cheeks contain crucial information for estimating the PPG signal from the face [65, 66] and the palm contains crucial information for PPG signal extraction from the hand [66, 67]. V-BPE extracts a region of interest using specific landmarks bounding the forehead and the palm to ensure the consistent detection of a suitably sized region of interest that scales according to the distance of the subject from the camera.
Experimental results
V-BPE computes a single SBP/DBP value for each video in the Face-Hand dataset. We test the performance of V-BPE in challenging outside-the-lab conditions utilizing the videos from the Face-Hand dataset. We hypothesize that the estimation of the blood pressure utilizing subject specific demographic information such as age, height and weight (section: Estimation of subject specific parameters) is not significantly different from the estimation of blood pressure without using subject specific demographic information. To explore the influence of subject-specific demographic information on the performance of V-BPE, we specified fixed values for the blood vessel length difference (L), the radius of the artery (r) and the artery wall thickness (h) instead of utilizing the relations described in section: Estimation of subject specific parameters. According to results from existing studies [18, 68–70], L is assigned a value of 0.2 m, r is assigned a value of 0.005 m and h is assigned a value of 0.001 m. We estimated the blood pressure of all the subjects from the Face-Hand dataset using 1) V-BPE with L, r and h estimated from subject specific demographic information (denoted by Demographic-V-BPE), 2) V-BPE with fixed values for L, r and h (denoted by Fixed-V-BPE). We construct the null and alternate hypotheses as follows to test for the significance of including subject specific demographic information in the estimation of 1) systolic blood pressure, 2) diastolic blood pressure.
For systolic blood pressure, the null and alternate hypotheses are defined as
(19)
(20)
where SBPd is the distribution of computed systolic blood pressure values for the videos from the Face-Hand dataset utilizing Demographic-V-BPE and SBPf is the distribution of computed systolic blood pressure values for the videos from the Face-Hand dataset utilizing Fixed-V-BPE. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.00, rejecting the null hypothesis and demonstrating that there is a statistically significant difference in the SBP estimates for Demographic-V-BPE as compared to Fixed-V-BPE (Table 4).
Significant results are shaded (α = 0.05).
For diastolic blood pressure, the null and alternate hypotheses are defined as
(21)
(22)
where DBPd is the distribution of computed diastolic blood pressure values for the videos from the Face-Hand dataset utilizing Demographic-V-BPE and DBPf is the distribution of computed diastolic blood pressure values for the videos from the Face-Hand dataset utilizing Fixed-V-BPE. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.00, rejecting the null hypothesis and demonstrating that there is a statistically significant difference in the DBP estimates for Demographic-V-BPE as compared to Fixed-V-BPE (Table 4).
Having observed a statistically significant difference in the SBP and DBP estimates for Demographic-V-BPE as compared to Fixed-V-BPE, we hypothesize that the estimation of SBP and DBP using Demographic-V-BPE is not significantly different from the ground truth SBP and DBP respectively.
For SBP, we construct the null and alternate hypotheses as follows for the ground truth SBP and SBP estimated by Demographic-V-BPE.
(23)
(24)
where SBPd is the distribution of computed systolic blood pressure values for the videos from the Face-Hand dataset utilizing Demographic-V-BPE and SBPGT is the distribution of ground truth systolic blood pressure values for the videos from the Face-Hand dataset. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.20, failing to reject the null hypothesis and demonstrating that there is no statistically significant difference in the SBP estimates for Demographic-V-BPE as compared to the ground truth SBP (Table 4).
For DBP, we construct the null and alternate hypotheses as follows for the ground truth DBP and DBP estimated by Demographic-V-BPE.
(25)
(26)
where DBPd is the distribution of computed diastolic blood pressure values for the videos from the Face-Hand dataset utilizing Demographic-V-BPE and DBPGT is the distribution of ground truth diastolic blood pressure values for the videos from the Face-Hand dataset. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.30, failing to reject the null hypothesis and demonstrating that there is no statistically significant difference in the DBP estimates for Demographic-V-BPE as compared to the ground truth DBP (Table 4).
Having observed no statistically significant difference for Demographic-V-BPE estimates of SBP and DBP as compared to the ground truth SBP and DBP respectively, we hypothesize that the estimation of SBP and DBP using Fixed-V-BPE is not significantly different from the ground truth SBP and DBP respectively.
For SBP, we construct the null and alternate hypotheses as follows for the ground truth SBP and SBP estimated by Fixed-V-BPE.
(27)
(28)
where SBPf is the distribution of computed systolic blood pressure values for the videos from the Face-Hand dataset utilizing Fixed-V-BPE and SBPGT is the distribution of ground truth systolic blood pressure values for the videos from the Face-Hand dataset. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.00, rejecting the null hypothesis and demonstrating that there is a statistically significant difference in the SBP estimates for Fixed-V-BPE as compared to the ground truth SBP (Table 4).
For DBP, we construct the null and alternate hypotheses as follows for the ground truth DBP and DBP estimated by Fixed-V-BPE.
(29)
(30)
where DBPf is the distribution of computed diastolic blood pressure values for the videos from the Face-Hand dataset utilizing Fixed-V-BPE and DBPGT is the distribution of ground truth diastolic blood pressure values for the videos from the Face-Hand dataset. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 400, power = 0.9, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test, we obtain the a p-value of 0.00, rejecting the null hypothesis and demonstrating that there is a statistically significant difference in the DBP estimates for Fixed-V-BPE as compared to the ground truth DBP (Table 4).
Having observed the differences in the estimation of SBP with the inclusion of subject specific demographic information, we computed the Mean Absolute Error (MAE) of the SBP estimates with respect to the ground truth SBP. V-BPE obtains an MAE of 17.1 mmHg for Demographic-V-BPE and an MAE of 36.2 mmHg for Fixed-V-BPE. The reduction of MAE on the inclusion of subject specific demographic information demonstrates the sensitivity of the systolic blood pressure to demographic differences between subjects. The Bland Altman plots (Fig 6) compare the predicted systolic blood pressure estimation with Demographic-V-BPE and Fixed-V-BPE to the ground truth. Demographic-V-BPE estimates of SBP display a mean difference of 1 mmHg, demonstrating the agreement with the ground truth SBP. In contrast, Fixed-V-BPE estimates for SBP display a mean difference of 28 mmHg with the ground truth demonstrating that Fixed-V-BPE under-estimates the systolic blood pressure. Additionally, we compute the performance (for SBP) of existing approaches for video-based blood pressure estimation utilizing the Face-Hand dataset to evaluate their robustness to various demographic groups. Table 5 demonstrates the performance of regression-based approaches (such as Secerbegovic et al. [36] and Zhou et al. [37]) and deep learning-based approaches (such as Hamoud et al. [38] and Trirongjitmoah et al. [39]) as compared to Fixed-V-BPE and Demographic-V-BPE. Demographic-V-BPE demonstrates a reduction in the MAE for SBP over different demographic groups as compared to existing regression-based and deep learning-based approaches. Regression-based and deep learning-based approaches demonstrated limited robustness to changing demographic conditions. The Bland-Altman plots for the regression-based approaches (Secerbegovic et al. [36] and Zhou et al. [37]) (Figs 8 and 9) display mean differences of -40 mmHg and -24 mmHg respectively and demonstrate that regression-based approaches over-estimate SBP. The Bland-Altman plots for the deep learning-based approaches (Hamoud et al. [38] and Trirongjitmoah et al. [39]) (Figs 10 and 11) display mean differences of 10mmHg (demonstrating under-estimation of SBP) and -45 mmHg (demonstrating over-estimation of SBP).
Demographic-V-BPE estimates of SBP agree with the ground truth SBP (left), while there is significant difference in the SBP estimates of Fixed-V-BPE with the ground truth (right). Error bars and 95% confidence intervals are marked (in mmHg).
Having observed the differences in the estimation of DBP with the inclusion of subject specific demographic information, we computed the Mean Absolute Error (MAE) of the DBP estimates with respect to the ground truth DBP. V-BPE obtains an MAE of 13.2 mmHg for Demographic-V-BPE and an MAE of 29.1 mmHg for Fixed-V-BPE. The reduction of MAE on the inclusion of subject specific demographic information demonstrates the sensitivity of the diastolic blood pressure to demographic differences between subjects. The Bland Altman plots (Fig 7) compare the predicted diastolic blood pressure estimation with Demographic-V-BPE and Fixed-V-BPE to the ground truth. Demographic-V-BPE estimates of DBP display a mean difference of 5 mmHg, demonstrating the agreement with the ground truth DBP. In contrast, Fixed-V-BPE estimates for DBP display a mean difference of 22 mmHg with the ground truth demonstrating that Fixed-V-BPE under-estimates the diastolic blood pressure. Additionally, we compute the performance (for DBP) of existing approaches for video-based blood pressure estimation utilizing the Face-Hand dataset to evaluate their robustness to various demographic groups. Table 6 demonstrates the performance of regression-based approaches (such as Secerbegovic et al. [36] and Zhou et al. [37]) and deep learning-based approaches (such as Hamoud et al. [38] and Trirongjitmoah et al. [39]) as compared to Fixed-V-BPE and Demographic-V-BPE. Demographic-V-BPE demonstrates a reduction in the MAE for DBP over different demographic groups as compared to existing regression-based and deep learning-based approaches. Regression-based and deep learning-based approaches demonstrated limited robustness to changing demographic conditions. The Bland-Altman plots for the regression-based approaches (Secerbegovic et al. [36] and Zhou et al. [37]) (Figs 8 and 9) display mean differences of -30 mmHg (demonstrating over-estimation of DBP) and -1 mmHg (demonstrating good agreement). The Bland-Altman plots for the deep learning-based approaches (Hamoud et al. [38] and Trirongjitmoah et al. [39]) (Figs 10 and 11) display mean differences of 10mmHg (demonstrating under-estimation of DBP) and -28 mmHg (demonstrating over-estimation of DBP).
Demographic-V-BPE estimates of DBP agree with the ground truth DBP (left), while there is significant difference in the DBP estimates of Fixed-V-BPE with the ground truth (right). Error bars and 95% confidence intervals are marked (in mmHg).
SBP (left) and DBP (right) estimates utilizing Secerbegovic et al. [36] demonstrate mean differences of -40 mmHg and -30 mmHg respectively, with the ground truth. Error bars and 95% confidence intervals are marked (in mmHg).
SBP (left) and DBP (right) estimates utilizing Zhou et al. [37] demonstrate mean differences of -24 mmHg and -1 mmHg respectively, with the ground truth. Error bars and 95% confidence intervals are marked (in mmHg).
SBP (left) and DBP (right) estimates utilizing Hamoud et al. [38] demonstrate mean differences of 10 mmHg and 10 mmHg respectively, with the ground truth. Error bars and 95% confidence intervals are marked (in mmHg).
SBP (left) and DBP (right) estimates utilizing Trirongjitmoah et al. [39] demonstrate mean differences of -45 mmHg and -28 mmHg respectively, with the ground truth. Error bars and 95% confidence intervals are marked (in mmHg).
To summarize, we note that existing video-based blood pressure estimation approaches show limited robustness in diverse demographic conditions. Moreover, we note that the estimation of SBP is more challenging that the estimation of DBP (Tables 5 and 6). Demographic-V-BPE demonstrated improved performance for both SBP and DBP estimation across different demographic groups as compared to existing approaches.
The difference in the performance of Demographic-V-BPE and Fixed-V-BPE for the estimation of systolic and diastolic blood pressure demonstrates that subject specific demographic information is critical in the estimation of blood pressure. The Face-Hand dataset contains a wide variety of subject specific demographic data such as the height, the weight, the age, the gender and the country of origin of the subject. Demographic-V-BPE utilizes the height, the weight and the age of the subject to estimate the blood pressure according to existing studies which reported a correlation between the blood pressure and the height, weight and age of the subject. We investigate whether the selection of the height, the weight and the age of the subject for blood pressure estimation using Demographic-V-BPE introduces biases (that are different from the biases existing in the ground truth) with respect to other demographic parameters such as the country of origin or the gender of the subject. To investigate whether Demographic-V-BPE introduces biases due to differences in country of origin of the subject, we explore whether the ground truth SBP and DBP distributions of subjects from India and Sierra Leone show a statistically significant difference. We construct the null and alternate hypotheses for ground truth SBP and DBP for the subjects from India and Sierra Leone as follows.
For SBP,
(31)
(32)
and for DBP,
(33)
(34)
where
is the distribution of ground truth SBP for the subjects from India,
is the distribution of ground truth SBP for the subjects from Sierra Leone,
is the distribution of ground truth DBP for the subjects from India and
is the distribution of ground truth DBP for the subjects from Sierra Leone. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 200, power = 0.8, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test for SBP (hypothesis test 18), we obtain the a p-value of 0.00, rejecting the null hypothesis and demonstrating that there is significant difference in the ground truth SBP distributions for subjects from India and subjects from Sierra Leone. Performing the two sample Kolmogorov-Smirnov test for DBP (hypothesis test 20), we obtain a p-value of 0.03, rejecting the null hypothesis and demonstrating that there is significant difference in the ground truth DBP distributions for subjects from India and subjects from Sierra Leone (Table 4).
To investigate whether Demographic-V-BPE introduces biases due to differences in country of origin of the subject, we construct the null and alternate hypotheses for SBP and DBP as follows.
For SBP,
(35)
(36)
and for DBP,
(37)
(38)
where
is the distribution of computed SBP estimates using Demographic-V-BPE for the videos of the subjects from India,
is the distribution of computed SBP estimates using Demographic-V-BPE for the videos of the subjects from Sierra Leone,
is the distribution of computed DBP estimates using Demographic-V-BPE for the videos of the subjects from India and
is the distribution of computed DBP estimates using Demographic-V-BPE for the videos of the subjects from Sierra Leone. Using power analysis, we determine a significance level of α = 0.05. Performing the two sample Kolmogorov-Smirnov test for SBP (hypothesis test 22), we obtain the a p-value of 0.02 rejecting the null hypothesis and demonstrating that there is a significant difference in Demographic-V-BPE estimates of SBP for subjects from India and subjects from Sierra Leone. Performing the two sample Kolmogorov-Smirnov test for DBP (hypothesis test 24), we obtain a p-value of 0.01 rejecting the null hypothesis and demonstrating that there is a significant difference in Demographic-V-BPE estimates of DBP for subjects from India and subjects from Sierra Leone (Table 4). This shows that the inclusion of the height, the weight and the age in estimating blood pressure with Demographic-V-BPE does not introduce additional bias with respect to country of origin. The Bland Altman plots (Figs 12 and 13) display the agreement of the Demographic-V-BPE estimates for SBP and DBP with the ground truth SBP and DBP for India and Sierra Leone. The MAE and Inter Coder Coefficient (ICC) (Tables 5 and 6) for SBP and DBP estimates for subjects from India and Sierra Leone shows the robustness of Demographic-V-BPE in estimating blood pressure across countries.
Demographic-V-BPE estimates of SBP (left) as well as DBP (right) agree with the ground truth SBP and DBP respectively. Error bars and 95% confidence intervals are marked (in mmHg).
Demographic-V-BPE estimates of SBP (left) as well as DBP (right) agree with the ground truth SBP and DBP respectively. Error bars and 95% confidence intervals are marked (in mmHg).
Having explored the performance of Demographic-V-BPE across different countries, we investigate whether Demographic-V-BPE introduces biases due to differences in the gender of the subjects. We explore the distributions of the ground truth and the Demographic-V-BPE estimates for female and male subjects from the Face-Hand dataset. We construct the null and alternate hypotheses for ground truth SBP and DBP for female and male subjects as follows.
For SBP,
(39)
(40)
and for DBP,
(41)
(42)
where
is the distribution of ground truth SBP for the female subjects,
is the distribution of ground truth SBP for male subjects,
is the distribution of ground truth DBP for the female subjects and
is the distribution of ground truth DBP for male subjects. Using power analysis, we determine a significance level of α = 0.05 (effect size = 0.5 (Cohen’s d [63]), no. of samples = 200, power = 0.8, degrees of freedom = 1). Performing the two sample Kolmogorov-Smirnov test for SBP (hypothesis test 26), we obtain the a p-value of 0.00 rejecting the null hypothesis and demonstrating that there is a significant difference in the ground truth SBP distributions for female and male subjects. Performing the two sample Kolmogorov-Smirnov test for DBP (hypothesis test 28), we obtain a p-value of 0.64 failing to reject the null hypothesis and demonstrating that the ground truth DBP distributions for female and male subjects do not show a statistically significant difference (Table 4).
To investigate whether Demographic-V-BPE introduces biases due to differences in the gender of the subject, we construct the null and alternate hypotheses for SBP and DBP as follows.
For SBP,
(43)
(44)
and for DBP,
(45)
(46)
where
is the distribution of computed SBP estimates using Demographic-V-BPE for female subjects,
is the distribution of computed SBP estimates using Demographic-V-BPE male subjects,
is the distribution of computed DBP estimates using Demographic-V-BPE for female subjects and
is the distribution of computed DBP estimates using Demographic-V-BPE for male subjects. Using power analysis, we determine a significance level of α = 0.05. Performing the two sample Kolmogorov-Smirnov test for SBP (hypothesis test 30), we obtain the a p-value of 0.00 rejecting the null hypothesis and demonstrating that there is a significant difference in Demographic-V-BPE estimates of SBP for female and male subjects. Performing the two sample Kolmogorov-Smirnov test for DBP (hypothesis test 32), we obtain a p-value of 0.16 failing to reject the null hypothesis and demonstrating that Demographic-V-BPE estimates of DBP for female and male subjects do not show a statistically significant difference (Table 4). This shows that the inclusion of the height, the weight and the age in estimating blood pressure with Demographic-V-BPE does not introduce additional bias with respect to the gender of the subject. The Bland Altman plots (Figs 14 and 15) display the agreement of the Demographic-V-BPE estimates for SBP and DBP with the ground truth SBP and DBP for female and male subjects. The MAE and ICC (Tables 5 and 6) for SBP and DBP estimates for female and male subjects shows the robustness of Demographic-V-BPE in estimating blood pressure across genders.
Demographic-V-BPE estimates of SBP (left) as well as DBP (right) agree with the ground truth SBP and DBP respectively. Error bars and 95% confidence intervals are marked (in mmHg).
Bland Altman plots for female subjects. Demographic-V-BPE estimates of SBP (left) as well as DBP (right) agree with the ground truth SBP and DBP respectively. Error bars and 95% confidence intervals are marked (in mmHg).
The statistical analysis demonstrates that Demographic-V-BPE does not induce additional bias due to differences in the country of origin or the gender of subjects. In contrast, Fixed-V-BPE estimates displayed significant differences from the ground truth as well as from Demographic-V-BPE (Tables 5 and 6). The increased ICC demonstrates the agreement of SBP and DBP estimates with Demographic-V-BPE with the ground truth as compared to the SBP and DBP estimates of Fixed-V-BPE and other existing approaches with the ground truth (Tables 5 and 6). We note that the MAE for Demographic-V-BPE estimates of SBP and DBP is comparatively higher than the recommended ISO standard for BP estimation. The higher MAE is due to the demographic diversity and environmental variation in the dataset, which introduced environmental noise and demographic bias into the algorithm. However, we demonstrate that the inclusion of demographic information improves the performance of V-BPE, (Fixed-V-BPE as compared to Demographic-V-BPE) mitigating the effect of environmental noise and demographic bias. Demographic-V-BPE demonstrates improved performance as compared to other existing BP estimation approaches underscoring the importance of the inclusion of demographic information. Additionally, we note that the MAE for existing video-based BP estimation approaches is higher than the ISO standard when tested on the Face-Hand dataset highlighting the bias existing in existing approaches. We observe that the utilization of subject specific demographic information comprising of the height, the weight and the age of the subject is sufficient to inform robust blood pressure estimation. The MAE and the ICC obtained for blood pressure estimates for Demographic-V-BPE (Tables 5 and 6) demonstrates the robustness to diverse demographical distinctions.
Conclusions
In this study, we present a an algorithm—V-BPE to estimate the systolic and diastolic blood pressure of an individual utilizing a facial and hand video of the individual, captured with a standard mobile phone camera. We utilize a pulse transit time approach to compute the blood pressure. V-BPE utilizes subject specific demographic information such as the height, weight and age to reduce the influence of demographic bias on the estimation of blood pressure. Through statistical hypothesis testing we demonstrate the significance of utilizing subject specific demographic information for blood pressure estimation. Interestingly, though we obtain an MAE outside the ISO specified range for BP estimation, we discover that the inclusion of subject specific demographic information not only improves the performance of the algorithm, but also preserves the distribution of blood pressure and does not induce additional bias. Moreover, comparative analysis with existing video-based blood pressure estimation approaches demonstrates the robustness of V-BPE. Such a result highlights the importance of subject specific demographic information in the robust estimation of blood pressure in outside-the-lab environments. Another key contribution of the study is the development of the Face-Hand dataset to test blood pressure estimation approaches. The Face-Hand dataset is curated from outside-the-lab settings and is critical in addressing common challenges associated with vital sign estimation algorithms, such as data leakage and over-constraining. Finally, the utilization of a single RGB video increases the scalability of V-BPE in resource-constrained environments due to the ubiquity of mobile phones and computers that can record RGB videos. This work provides the motivation for the development and deployment of video-based blood pressure estimation algorithms, especially in areas with limited access to resources (such as contact-based blood pressure measurement devices). Future work involves the estimation of a continuous blood pressure signal as well as a classification approach to identify subjects with hypertension and hypotension.
Acknowledgments
The authors would like to thank Mr. Srihari Boregowda, President, AIRA Sociocare, from the India data collection team for their efforts in collecting Indian subject data. We would like to thank Mr. Nathaniel Houston and Mr. Sheku Kamara from the Sierra Leone data collection team for their efforts in collecting Sierra Leone subject data.
References
- 1.
Levick JR. An introduction to cardiovascular physiology. Butterworth-Heinemann; 2013.
- 2.
Shahoud JS, Sanvictores T, Aeddula NR. Physiology, arterial pressure regulation. 2019;.
- 3. Ohishi M. Hypertension with diabetes mellitus: physiology and pathology. Hypertension research. 2018;41(6):389–393. pmid:29556093
- 4. Guyton AC. The relationship of cardiac output and arterial pressure control. Circulation. 1981;64(6):1079–1088. pmid:6794930
- 5. Robertson D, Mosqueda-Garcia R, Robertson RM, Biaggioni I. Chronic hypotension: in the shadow of hypertension. American journal of hypertension. 1992;5(6_Pt_2):200S–275. pmid:1632940
- 6. MacDonald JR. Potential causes, mechanisms, and implications of post exercise hypotension. Journal of human hypertension. 2002;16(4):225–236. pmid:11967715
- 7. Floras JS, Hassan MO, Jones JV, Osikowska BA, Sever PS, Sleight P. Factors influencing blood pressure and heart rate variability in hypertensive humans. Hypertension. 1988;11(3):273–281. pmid:2895071
- 8. Flint AC, Conell C, Ren X, Banki NM, Chan SL, Rao VA, et al. Effect of systolic and diastolic blood pressure on cardiovascular outcomes. New England Journal of Medicine. 2019;381(3):243–251. pmid:31314968
- 9. Pickering TG, Shimbo D, Haas D. Ambulatory blood-pressure monitoring. New England Journal of Medicine. 2006;354(22):2368–2374. pmid:16738273
- 10. Ogedegbe G, Pickering T. Principles and techniques of blood pressure measurement. Cardiology clinics. 2010;28(4):571–586. pmid:20937442
- 11. Arabidze G, Petrov V, Staessen JA. Blood pressure by Korotkoff’s auscultatory method: end of an era or bright future?. Blood pressure monitoring. 1996;1(4):321–327. pmid:10226254
- 12. Ma HT. A blood pressure monitoring method for stroke management. BioMed research international. 2014;2014. pmid:25197651
- 13. Gesche H, Grosskurth D, Küchler G, Patzak A. Continuous blood pressure measurement by using the pulse transit time: comparison to a cuff-based method. European journal of applied physiology. 2012;112(1):309–315. pmid:21556814
- 14. Buxi D, Redoute JM, Yuce MR. A survey on signals and systems in ambulatory blood pressure monitoring using pulse transit time. Physiological measurement. 2015;36(3):R1. pmid:25694235
- 15. DiMarco JP, Philbrick JT. Use of ambulatory electrocardiographic (Holter) monitoring. Annals of internal medicine. 1990;113(1):53–68. pmid:2190517
- 16.
Fortino G, Giampà V. PPG-based methods for non invasive and continuous blood pressure measurement: An overview and development issues in body sensor networks. In: 2010 IEEE International Workshop on Medical Measurements and Applications. IEEE; 2010. p. 10–13.
- 17. Pedrana A, Comotti D, Re V, Traversi G. Development of a wearable in-ear PPG system for continuous monitoring. IEEE Sensors Journal. 2020;20(23):14482–14490.
- 18. Ma Y, Choi J, Hourlier-Fargette A, Xue Y, Chung HU, Lee JY, et al. Relation between blood pressure and pulse wave velocity for human arteries. Proceedings of the National Academy of Sciences. 2018;115(44):11144–11149. pmid:30322935
- 19. Mukkamala R, Hahn JO, Inan OT, Mestha LK, Kim CS, Töreyin H, et al. Toward ubiquitous blood pressure monitoring via pulse transit time: theory and practice. IEEE transactions on biomedical engineering. 2015;62(8):1879–1901. pmid:26057530
- 20. Chen Y, Wen C, Tao G, Bi M. Continuous and noninvasive measurement of systolic and diastolic blood pressure by one mathematical model with the same model parameters and two separate pulse wave velocities. Annals of biomedical engineering. 2012;40:871–882. pmid:22101758
- 21.
Bang S, Lee C, Park J, Cho MC, Yoon YG, Cho S. A pulse transit time measurement method based on electrocardiography and bioimpedance. In: 2009 IEEE Biomedical Circuits and Systems Conference. IEEE; 2009. p. 153–156.
- 22.
Penaz J. Photoelectric measurement of blood pressure, volume and flow in the finger. In: Digest of the 10th international conference on medical and biological engineering-Dresden, 1973. vol. 104; 1973.
- 23. Chandrasekhar A, Kim CS, Naji M, Natarajan K, Hahn JO, Mukkamala R. Smartphone-based blood pressure monitoring via the oscillometric finger-pressing method. Science translational medicine. 2018;10(431):eaap8674. pmid:29515001
- 24.
Sugita N, Obara K, Yoshizawa M, Abe M, Tanaka A, Homma N. Techniques for estimating blood pressure variation using video images. In: 2015 37th annual international conference of the IEEE engineering in medicine and biology society (EMBC). IEEE; 2015. p. 4218–4221.
- 25.
Murakami K, Yoshioka M, Ozawa J. Non-contact pulse transit time measurement using imaging camera, and its relation to blood pressure. In: 2015 14th IAPR International Conference on Machine Vision Applications (MVA). IEEE; 2015. p. 414–417.
- 26.
Vlachopoulos C, O’Rourke M, Nichols WW. McDonald’s blood flow in arteries: theoretical, experimental and clinical principles. CRC press; 2011.
- 27. Bramwell JC, Hill AV. The velocity of pulse wave in man. Proceedings of the Royal Society of London Series B, Containing Papers of a Biological Character. 1922;93(652):298–306.
- 28. Hughes D, Babbs CF, Geddes L, Bourland J. Measurements of Young’s modulus of elasticity of the canine aorta with ultrasound. Ultrasonic imaging. 1979;1(4):356–367. pmid:575833
- 29. Ding X, Zhang YT. Pulse transit time technique for cuffless unobtrusive blood pressure measurement: from theory to algorithm. Biomedical engineering letters. 2019;9:37–52. pmid:30956879
- 30. Laurent S, Cockcroft J, Van Bortel L, Boutouyrie P, Giannattasio C, Hayoz D, et al. Expert consensus document on arterial stiffness: methodological issues and clinical applications. European heart journal. 2006;27(21):2588–2605. pmid:17000623
- 31.
Wang R, Jia W, Mao ZH, Sclabassi RJ, Sun M. Cuff-free blood pressure estimation using pulse transit time and heart rate. In: 2014 12th international conference on signal processing (ICSP). IEEE; 2014. p. 115–118.
- 32. Liu J, Yan BP, Zhang YT, Ding XR, Su P, Zhao N. Multi-wavelength photoplethysmography enabling continuous blood pressure measurement with compact wearable electronics. IEEE Transactions on Biomedical Engineering. 2018;66(6):1514–1525. pmid:30307851
- 33.
Hassan MKBA, Mashor M, Mohd Nasir N, Mohamed S. Measuring of systolic blood pressure based on heart rate. In: 4th Kuala Lumpur International Conference on Biomedical Engineering 2008: BIOMED 2008 25–28 June 2008 Kuala Lumpur, Malaysia. Springer; 2008. p. 595–598.
- 34. Finnegan E, Davidson S, Harford M, Watkinson P, Tarassenko L, Villarroel M. Features from the photoplethysmogram and the electrocardiogram for estimating changes in blood pressure. Scientific Reports. 2023;13(1):986. pmid:36653426
- 35. Jeong IC, Finkelstein J. Introducing contactless blood pressure assessment using a high speed video camera. Journal of medical systems. 2016;40:1–10. pmid:26791993
- 36.
Secerbegovic A, Bergsland J, Halvorsen PS, Suljanovic N, Mujcic A, Balasingham I. Blood pressure estimation using video plethysmography. In: 2016 IEEE 13th international symposium on biomedical imaging (ISBI). IEEE; 2016. p. 461–464.
- 37. Zhou Y, Ni H, Zhang Q, Wu Q. The noninvasive blood pressure measurement based on facial images processing. IEEE Sensors Journal. 2019;19(22):10624–10634.
- 38. Hamoud B, Kashevnik A, Othman W, Shilov N. Neural network model combination for video-based blood pressure estimation: New approach and evaluation. Sensors. 2023;23(4):1753. pmid:36850349
- 39. Trirongjitmoah S, Promking A, Kaewdang K, Phansiri N, Treeprapin K. Assessing heart rate and blood pressure estimation from image photoplethysmography using a digital blood pressure meter. Heliyon. 2024;10(5). pmid:38439889
- 40. Luo H, Yang D, Barszczyk A, Vempala N, Wei J, Wu SJ, et al. Smartphone-based blood pressure measurement using transdermal optical imaging technology. Circulation: Cardiovascular Imaging. 2019;12(8):e008857. pmid:31382766
- 41.
Mehta S, Kwatra N, Jain M, McDuff D. “Can’t Take the Pressure?”: Examining the Challenges of Blood Pressure Estimation via Pulse Wave Analysis. arXiv preprint arXiv:230414916. 2023;.
- 42.
Moens AI. Die pulscurve. EJ Brill: Leiden, The Netherlands, 1878.
- 43. Korteweg vD. Ueber die Fortpflanzungsgeschwindigkeit des Schalles in elastischen Röhren. Annalen der Physik. 1878;241(12):525–542.
- 44. De Haan G, Jeanne V. Robust pulse rate from chrominance-based rPPG. IEEE Transactions on Biomedical Engineering. 2013;60(10):2878–2886. pmid:23744659
- 45. Prakash SKA, Tucker CS. Bounded Kalman filter method for motion-robust, non-contact heart rate estimation. Biomedical optics express. 2018;9(2):873–897. pmid:29552419
- 46. Dasari A, Prakash SKA, Jeni LA, Tucker CS. Evaluation of biases in remote photoplethysmography methods. NPJ digital medicine. 2021;4(1):91. pmid:34083724
- 47. Chen Y, Zhuang J, Li B, Zhang Y, Zheng X. Remote Blood Pressure Estimation via the Spatiotemporal Mapping of Facial Videos. Sensors. 2023;23(6):2963. pmid:36991677
- 48. Cheng H, Xiong J, Chen Z, Chen J. Deep Learning-Based Non-Contact IPPG Signal Blood Pressure Measurement Research. Sensors. 2023;23(12):5528. pmid:37420695
- 49. Xing W, Shi Y, Wu C, Wang Y, Wang X. Predicting blood pressure from face videos using face diagnosis theory and deep neural networks technique. Computers in Biology and Medicine. 2023; p. 107112. pmid:37481950
- 50. Kim HL, Kim SH. Pulse wave velocity in atherosclerosis. Frontiers in cardiovascular medicine. 2019;6:41. pmid:31024934
- 51.
Lugaresi C, Tang J, Nash H, McClanahan C, Uboweja E, Hays M, et al. Mediapipe: A framework for perceiving and processing reality. In: Third workshop on computer vision for AR/VR at IEEE computer vision and pattern recognition (CVPR). vol. 2019; 2019.
- 52. He X, Goubran RA, Liu XP. Secondary peak detection of PPG signal for continuous cuffless arterial blood pressure measurement. IEEE Transactions on Instrumentation and Measurement. 2014;63(6):1431–1439.
- 53.
Palshikar G, et al. Simple algorithms for peak detection in time-series. In: Proc. 1st Int. Conf. Advanced Data Analysis, Business Analytics and Intelligence. vol. 122; 2009.
- 54. Kato K, Higashiyama A. Estimation of height for persons in pictures. Perception & psychophysics. 1998;60(8):1318–1328. pmid:9865073
- 55. Hwaung P, Heo M, Bourgeois B, Kennedy S, Shepherd J, Heymsfield SB. Greater height is associated with a larger carotid lumen diameter. Medicines. 2019;6(2):57. pmid:31091706
- 56. Edelman G, Alberink I. Height measurements in images: how to deal with measurement uncertainty correlated to actual height. Law, Probability & Risk. 2010;9(2):91–102.
- 57.
Lidayova K, Frimmel H, Wang C, Bengtsson E, Smedby Ö. Skeleton-based fast, fully automated generation of vessel tree structure for clinical evaluation of blood vessel systems. In: Skeletonization. Elsevier; 2017. p. 345–382.
- 58. Wu J, Hu Z, Gu Y, Li L, Zhu H. A multi-segmented human bioheat model for cold and extremely cold exposures. International Journal of Thermal Sciences. 2022;173:107394.
- 59. Kozakova M, Palombo C, Morizzo C, Højlund K, Hatunic M, Balkau B, et al. Obesity and carotid artery remodeling. Nutrition & diabetes. 2015;5(8):e177–e177. pmid:26302064
- 60. Samijo S, Willigers J, Barkhuysen R, Kitslaar P, Reneman R, Brands P, et al. Wall shear stress in the human common carotid artery as function of age and gender. Cardiovascular research. 1998;39(2):515–522. pmid:9798536
- 61. Wen W, Luo R, Tang X, Tang L, Huang HX, Wen X, et al. Age-related progression of arterial stiffness and its elevated positive association with blood pressure in healthy people. Atherosclerosis. 2015;238(1):147–152. pmid:25485738
- 62. Simmonds MJ, Meiselman HJ, Baskurt OK. Blood rheology and aging. Journal of geriatric cardiology: JGC. 2013;10(3):291. pmid:24133519
- 63.
Cohen J. Statistical power analysis for the behavioral sciences. Academic press; 2013.
- 64.
Monk E. The Monk Skin Tone Scale. 2023; https://doi.org/10.31235/osf.io/pdf4c
- 65.
Moço A, Stuijk S, van Gastel M, de Haan G. Impairing factors in remote-ppg pulse transit time measurements on the face. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops; 2018. p. 1358–1366.
- 66. Elgendi M, Fletcher R, Liang Y, Howard N, Lovell NH, Abbott D, et al. The use of photoplethysmography for assessing hypertension. NPJ digital medicine. 2019;2(1):60. pmid:31388564
- 67.
Ding X, Wang W, Chen Y, Yang Y, Zhao Y, Kong D. Feasibility study of pulse width at half amplitude of camera PPG for contactless blood pressure estimation. In: 2021 43rd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE; 2021. p. 365–368.
- 68. Young CC, Mark JB, White W, DeBree A, Vender JS, Fleming A. Clinical evaluation of continuous noninvasive blood pressure monitoring: accuracy and tracking capabilities. Journal of clinical monitoring. 1995;11:245–252. pmid:7561998
- 69. Huynh TH, Jafari R, Chung WY. Noninvasive cuffless blood pressure estimation using pulse transit time and impedance plethysmography. IEEE Transactions on Biomedical Engineering. 2018;66(4):967–976. pmid:30130167
- 70. Kachuee M, Kiani MM, Mohammadzade H, Shabany M. Cuffless blood pressure estimation algorithms for continuous health-care monitoring. IEEE Transactions on Biomedical Engineering. 2016;64(4):859–869. pmid:27323356