Validity and intra-rater reliability of an Android phone application to measure cervical range-of-motion
Journal of NeuroEngineering and Rehabilitation volume 11, Article number: 65 (2014)
Concurrent validity and intra-rater reliability using a customized Android phone application to measure cervical-spine range-of-motion (ROM) has not been previously validated against a gold-standard three-dimensional motion analysis (3DMA) system.
Twenty-one healthy individuals (age:31 ± 9.1 years, male:11) participated, with 16 re-examined for intra-rater reliability 1–7 days later. An Android phone was fixed on a helmet, which was then securely fastened on the participant’s head. Cervical-spine ROM in flexion, extension, lateral flexion and rotation were performed in sitting with concurrent measurements obtained from both a 3DMA system and the phone.
The phone demonstrated moderate to excellent (ICC = 0.53-0.98, Spearman ρ = 0.52-0.98) concurrent validity for ROM measurements in cervical flexion, extension, lateral-flexion and rotation. However, cervical rotation demonstrated both proportional and fixed bias. Excellent intra-rater reliability was demonstrated for cervical flexion, extension and lateral flexion (ICC = 0.82-0.90), but poor for right- and left-rotation (ICC = 0.05-0.33) using the phone. Possible reasons for the outcome are that flexion, extension and lateral-flexion measurements are detected by gravity-dependent accelerometers while rotation measurements are detected by the magnetometer which can be adversely affected by surrounding magnetic fields.
The results of this study demonstrate that the tested Android phone application is valid and reliable to measure ROM of the cervical-spine in flexion, extension and lateral-flexion but not in rotation likely due to magnetic interference. The clinical implication of this study is that therapists should be mindful of the plane of measurement when using the Android phone to measure ROM of the cervical-spine.
Cervical range-of-motion (ROM) assessment forms an integral part of physiotherapy evaluation in people with neck-pain by quantifying an important physical impairment  and providing potentially useful diagnostic data . In this regard, the cervical range-of-motion device (CROM) [3, 4] and single inclinometer are considered the most appropriate clinical measurement instruments. However, the CROM is relatively expensive (US$395) and cumbersome, and the inclinometer although more affordable, has been reported to have inconsistent and inferior validity for cervical lateral-flexion and rotation measurements [5, 6].
Advances in smart phone sensor technology have resulted in inexpensive ROM measurement tools with clinical and research potential. Specifically, the smart phone uses an embedded-accelerometer and a magnetometer to detect motion using gravity and the earth’s magnetic field respectively. To our knowledge, only one published study  has examined the validity and reliability of the smartphone to measure cervical ROM. Although that study reported some promising findings, it did possess limitations including: a) the criterion reference used (i.e. CROM) did not allow for concurrent testing of the phone, and lacked the sensitivity and precision of a multi-camera three-dimensional motion analysis (3DMA) system, which may have negatively influenced the mostly moderate validity findings; b) no reported effort was made to ensure that movement was well-controlled and along the intended axis of head movement; and c) the examiner was not blinded to the results obtained from the phone and the CROM device, hence error due to reporting bias cannot be ruled out. This may potentially overestimate the validity results. Therefore the purpose of this study was to investigate the concurrent validity and test-retest reliability of an Android smart phone to assess cervical ROM. Our study extends prior research by (i) verifying the validity of the smart phone by concurrently assessing with a 3DMA system, the gold-standard for capturing motion analysis , (ii) adding a spirit-level type indicator to the phone application to ensure a pure axis of movement  and (iii) blinding the examiner to the results. We hypothesize that the phone will be valid and reliable.
Twenty-one healthy individuals (age:31 ± 9.1 years, height: 172.7 ± 8.9 cm, weight:68.5 ± 11.2 kg, male:11) with no reported neck-pain participated. Sixteen participants returned 1–7 days later to assess intra-rater reliability. All participants provided informed consent as outlined by the institution’s Human Research Ethics Committee and all procedures were conducted according to the Declaration of Helsinki.
Three reflective markers were located on the following anatomical landmarks: anterior to the tragus bilaterally and on the glabella (Figure 1) for 3DMA analysis. Markers were tracked using VICON Nexus V1.7.1 and a 9-camera VICON MX motion analysis system (VICON, UK). The angle of the head in the three planes was referenced to the laboratory axis, and normalized to the starting neutral position, and was deemed our benchmark reference kinematic data.
All measures were performed with the subject seated in the same high-back padded chair. To ensure minimal contribution from the thoracic spine, the participant was securely strapped across the shoulders to the chair using an inelastic belt (Figure 1: Mulligan Mobilization Belt). An Android 4.0 phone (Samsung Galaxy S3, GT-I9300T) was mounted on a helmet (Figure 1), and the helmet was fastened securely on the patient’s head using an internal adjustable head strap fixed within the helmet. This phone contains a LSM330DLC inertial monitoring unit combining tri-axial accelerometer and gyroscope sensors, and an AKM8975 tri-axial magnetometer.
The following cervical-spine ROM limit measurements were obtained in the same order in all subjects: (i)flexion, (ii)extension, (iii)right-lateral-flexion, (iv)left-lateral-flexion, (v)right-rotation and (vi)left-rotation. The flexion/extension, lateral flexion/extension and rotation axes were measured using the pitch, roll and azimuth angles respectively. Given that cervical rotation values are based on the magnetometer within the phone and the outcome may be influenced by the surrounding magnetic fields, a magnetic yoke was placed around the subject’s neck in an attempt to address this problem. This replicates the use of the CROM, which also uses magnetic fields to determine angles and requires the use of a magnetic yoke.
The patient was instructed to perform each test actively, with manual guidance provided by the examiner to ensure that the movement was along the pure axis of alignment if necessary. Specifically, the examiner determined the end of ROM when a firm resistance was felt. No pain was reported by any subject during the procedure. Three consecutive trials using concurrent measurements from the VICON and the phone were obtained for each movement. The mean value of the three measurements for the first testing day was used to calculate validity, and an inter-day comparison of these mean values was performed to determine intra-rater reliability.
All subjects were assessed by the same examiner (JQ) who has 12 years of clinical musculoskeletal physiotherapy experience. Noteworthy, because it is difficult for the examiner to visually detect when the subject deviates away from the pure movement plane, one of the advantages of this phone application over previous applications  was that it included a visual representation of a circular spirit device (Figures 2A and 2B). This enabled the examiner to guide subjects along the desired plane of movement using the real-time visual feedback. This program sampled data at 100Hz using a custom program designed by co-author RC using MIT App Inventor. The standard angle data parsed from the angle calculation performed within the Smartphones operating system was used, indicating that our results are likely to be applicable regardless of the software program used. Two separate examiners were assigned to each device (phone and 3DMA), hence they were blinded to the results of the other device.
Validity was determined from Spearman’s correlation and intra-class correlation coefficient (ICC) in combination with assessment of systematic bias. Bland and Altman plots were constructed to determine the 95% limits of agreement (LoA) between the 3DMA and phone measures [10, 11]. Ordinary least products (OLP) regression, which accounts for error in both devices, was used to determine fixed and proportional biases . All calculations were performed as described previously .
Intra-rater reliability was determined using intra-class coefficients (ICC [3,3]), and OLP regression to quantify the relationship between sequential measurements for both instruments. ICC was calculated in a 2-way analysis of variance based on absolute agreement. Point estimates of the ICC values >0.75 were considered excellent, 0.4-0.75 modest or <0.4 poor . To estimate measurement error, standard error of measurement (SEM), LOA, and minimal detectable change (MDC) were calculated. Statistical analyses were completed using PASW software V21.
The phone demonstrated excellent concurrent validity for flexion, extension, and lateral flexion ROM based on Spearman’s ρ-values >0.84 and ICC values >0.90, but only modest validity results for left-rotation (ICC =0.53, Spearman’s ρ =0.52) and right-rotation (ICC =0.53) (Table 1). Furthermore, for right- and left-rotation, both proportional and fixed biases were observed (see Table 1 and Additional file 1: Appendix A for the OLP and LOA plots).
Intra-rater reliability is presented in Tables 2 and 3. Excellent intra-rater reliability results were observed for both phone and 3DMA measurements in cervical flexion, extension and right- and left-lateral flexion (ICC = 0.82-0.90), but results were poor for the phone in right- and left-rotation (ICC = 0.05-0.33), whilst the 3DMA showed modest intra-rater reliability (ICC = 0.64-0.77). Percentage error values for the phone ranged from 7-40% and 6-9% for 3DMA (Tables 2 and 3). LOA plots are presented in the Additional file 1: Appendix B & C.
This study demonstrates that an Android phone can be a valid and reliable tool to measure ROM of cervical flexion, extension and lateral-flexion but not cervical rotation, consistent with previous results . Cervical rotation results cannot be seen as valid and reliable as, although the rotation measurements from the phone showed moderate validity values (ICC = 0.53), the reliability results were poor. Possible reasons for these results are that, in the position tested, both sagittal and frontal measurements rely on the gravity-dependent accelerometers within the phone but the movements in the transverse plane are detected by the magnetometer, which can be adversely affected by any surrounding magnetic fields. This includes equipment such as computers, speakers and some automatic doors, which were all present in the laboratory and may have caused the error observed in this axis. We attempted to overcome this issue using the magnet supplied with the CROM, however our results were still invalid in this axis. This is clinically relevant because strong magnetic fields are likely to be present in many clinical settings and thus rotation ROM assessment using devices that rely on data from the magnetometer cannot be recommended (i.e. rotation in sitting).
Potential reasons for the greater ICC values in the present study compared to previous work  are the concurrent measurements and the addition of the spirit level indicator to improve the accuracy of measurement. The latter is especially important because the cervical-spine is a multi-joint structure and susceptible to coupled movements. Furthermore, we minimized measurement errors by standard fixation of the phone on a helmet, compared to the phone being held by hand on the participant’s head in the previous study . This also implies that the phone ought to be mounted on a helmet when it is being used in the clinical setting, and may be considered a limitation of this study. Furthermore, we found that when measuring cervical extension, the combined weight of the helmet and the phone tended to cause the helmet to slip. The examiner overcame this problem by providing adequate support to ensure that the helmet was firmly fixed on the head during the movement.
This study has several other limitations. (i) We did not assess inter-rater reliability and this may potentially limit the applicability of our findings in clinical settings between observers. (ii) We did not include a rigorous warm-up regime to ensure consistent inter-day readiness to perform the movements. While this is unlikely to affect the concurrent validity data (i.e. an increase in range of motion intra-session would be detected by both devices if they are comparable), it may have negatively affected our reliability results. (iii) As a preliminary step to assess the validity and reliability of the Android phone application, all participants were healthy, therefore the results need to be replicated in populations of interest, such as those with neck-pain. (iv) The reliability data of the 3DMA system for the rotation axis was not particularly good, and it is not possible to determine whether this is due to intra-day subject variation (which would provide justification for the poor phone reliability results) or equipment-related measurement error (which would not have affected the phone reliability values).
In summary, this study aimed to establish the validity and intra-rater reliability of an Android phone application to measure cervical-spine ROM and found that cervical flexion, extension and lateral-flexion measurements are both valid and reliable in sitting and may be used in the clinical setting. In contrast, cervical rotation measurements in sitting are neither valid nor reliable likely due to magnetic field interference. We suggest further study to determine whether the phone is valid to measure cervical-rotation in supine, which would use the accelerometer derived angles and is therefore likely to provide more consistent results.
Dall'Alba PT, Sterling MM, Treleaven JM, Edwards SL, Jull GA: Cervical range of motion discriminates between asymptomatic persons and those with whiplash. Spine 2001,26(19):2090-2094. 10.1097/00007632-200110010-00009
O'Leary T, Sterling J: Whiplash, headache, and neck pain: research-based directions for physical therapies. Edinburgh: Churchill Livingstone; 2008.
Rheault W, Albright B, Byers C, Franta M, Johnson A, Skowronek M, Dougherty J: Intertester reliability of the cervical range of motion device. J Orthop Sports Phys Ther 1992,15(3):147-150. 10.2519/jospt.19126.96.36.199
Fletcher JP, Bandy WD: Intrarater reliability of CROM measurement of cervical spine active range of motion in persons with and without neck pain. J Orthop Sports Phys Ther 2008,38(10):640-5. 10.2519/jospt.2008.2680
Hole DE, Cook JM, Bolton JE: Reliability and concurrent validity of two instruments for measuring cervical range of motion: effects of age and gender. Man Ther 1995,1(1):36-42. 10.1054/math.1995.0248
Bush KW, Collins N, Portman L, Tillett N: Validity and intertester reliability of cervical range of motion using inclinometer measurements. J Manual Manipul Ther 2000,8(2):52-61. 10.1179/106698100790819546
Tousignant-Laflamme Y, Boutin N, Dion AM, Vallée CA: Reliability and criterion validity of two applications of the iPhone™ to measure cervical range of motion in healthy participants. J NeuroEngineering Rehab 2013,10(1):69. 10.1186/1743-0003-10-69
Goodvin C, Park EJ, Huang K, Sakaki K: Development of a real-time three-dimensional spinal motion measurement system for clinical practice. Med Biol Eng Comput 2006,44(12):1061-75. 10.1007/s11517-006-0132-3
Quek J, Pua YH, Clark RA, Bryant AL: Effects of thoracic kyphosis and forward head posture on cervical range of motion in older adults. Manual Therapy 2013, 18: 65-71. 10.1016/j.math.2012.07.005
Bland JM, Altman DG: Statistical Methods for Assessing Agreement between Two Methods of Clinical Measurement. Lancet 1986,1(8476):307-310.
Bland JM, Altman DG: Measuring agreement in method comparison studies. Stat Met Med Res 1999,8(2):135-160. 10.1191/096228099673819272
Ludbrook J: Statistical techniques for comparing measurers and methods of measurement: a critical review. Clin Exp Pharmacol Physiol 2002,29(7):527-536. 10.1046/j.1440-1681.2002.03686.x
Ludbrook J: Special article comparing methods of measurement. Clin Exp Pharmacol Physiol 1997,24(2):193-203. 10.1111/j.1440-1681.1997.tb01807.x
Fleiss JL: The design and analysis of clinical experiments, Wiley series in probability and mathematical statistics Applied probability and statistics. New York: Wiley; 1986:xiv-432.
JQ received a PhD scholarship funded by Singapore General Hospital.
The authors declare that they have no conflict of interest.
JQ was involved in the study design, coordination, data collection, statistical analysis and manuscript drafting. SB was involved in the study design, coordination, manuscript drafting and general supervision of the study. JT was involved in the study design, coordination, manuscript drafting and general supervision of the study. PYH was involved in statistical analysis and manuscript drafting. BM was involved in data collection. RC was involved in the study design, application creation, data collection, coordination, manuscript drafting and general supervision of the study. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 1: Appendix A. Validity assessment using OLP plots for measurements with proportional bias. Appendix B. Reliability (phone). Normal Bland Altman plots. Appendix C. Reliability (3DMA). Normal Bland Altman Plots. (DOCX 91 KB)
About this article
Cite this article
Quek, J., Brauer, S.G., Treleaven, J. et al. Validity and intra-rater reliability of an Android phone application to measure cervical range-of-motion. J NeuroEngineering Rehabil 11, 65 (2014). https://doi.org/10.1186/1743-0003-11-65