Objectives We compared the ability of physical activity and sitting time questionnaires (PAQ) for ranking individuals versus continuous volume calculations (physical activity level (PAL), metabolic equivalents of task (MET), sitting hours) against accelerometry measured physical activity as our criterion.
Methods Participants in a cohort from the Tromsø Study completed three questionnaires; (1) The Saltin-Grimby Physical Activity Level Scale (SGPALS) (n=4040); (2) The Physical Activity Frequency, Intensity and Duration (PAFID) questionnaire (n=5902)) calculated as MET-hours·week-1 and (3) The International Physical Activity questionnaire (IPAQ) short-form sitting question (n=4896). We validated the questionnaires against the following accelerometry (Actigraph wGT3X-BT) estimates: vector magnitude counts per minute, steps∙day-1, time (minutes·day-1) in sedentary behaviour, light physical activity, moderate and vigorous physical activity (MVPA) non-bouted and ≥10 min bouted MVPA.
Results Ranking of physical activity according to the SGPALS and quartiles (Q) of MET-hours∙week-1 from the PAFID were both positively associated with accelerometry estimates of physical activity (p<0.001) but correlations with accelerometry estimates were weak (SGPALS (PAL): r=0.11 to 0.26, p<0.001) and weak-to-moderate (PAFID: r=0.39 to 0.44, p<0.01). There was 1 hour of accelerometry measured sedentary time from Q1 to Q4 in the IPAQ sitting question (p<0.001) and also weak correlations (r=0.22, p<0.01).
Conclusion Ranking of physical activity levels measured with PAQs appears to have higher validity than energy expenditure calculations. Self-reported sedentary time poorly reflects accelerometry measured sedentary time. These two PAQs can be used for ranking individuals into different physical activity categories supporting previous studies using these instruments when assessing associations with health outcomes.
- physical activity
- sitting time
This is an open access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited, appropriate credit is given, any changes made indicated, and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/.
Statistics from Altmetric.com
Ranking of the two included physical activity questionnaires reduces information content but may be the optimal way of processing self-reported physical activity.
Volume calculations (physical activity level, metabolic equivalents of task hours) allow the biasses associated with self-reported physical activity to be more pronounced.
Self-reported sitting time shows low validity and does not reflect accelerometry measured sedentary time.
Physical activity surveillance at population level may support public health initiatives and allow researchers to track physical activity levels and patterns over time.1 Physical activity is traditionally measured using self-reported methods such as questionnaires.2 However, the validity of physical activity questionnaires (PAQ) is threatened by recall and social desirability bias, resulting in imprecise assessments.3–6 Nevertheless, PAQs have over the years led to valuable knowledge on the effect of physical activity on health outcomes and mortality.7–14
Validation of PAQs is crucial to guide researchers when interpreting associations between self-reported physical activity and health outcomes. Moreover, PAQs may inherit different measurement properties. For example, one of the first developed PAQs, by Saltin and Grimby15 named ‘Saltin-Grimby Physical Activity Level Scale’ (SGPALS),16 17 ranks individuals by physical activity levels. A more recent PAQ, the Physical Activity Frequency, Intensity and Duration (PAFID) questionnaire,18 allows the answers to be summed up as total physical activity volume (ie, energy expenditure, metabolic equivalents of task (MET)-hours per week). Finally, sedentary behaviour has been suggested as a risk factor for disease and mortality, which is also commonly assessed by PAQs,19 20 such as the International Physical Activity Questionnaire (IPAQ) short-form sitting question.21 Both PAQs (SGPALS,16 22–26 PAFID18 27) and the IPAQ short-form sitting question21 have previously been validated, however, the studies that compare these questionnaires against accelerometry are characterised by small sample sizes.18 21 23 As population samples are heterogeneous and consequently result in heterogeneous findings, validation studies based on small samples may have limited representability. Furthermore, considering that already established longitudinal population cohorts have implemented PAQs from inception allowing for long follow-up time (SGPALS: >45 years,28–30 PAFID: >35 years31), validation of PAQs and sitting questionnaires against accelerometry measured physical activity and sitting time from large heterogeneous samples will allow researchers to more accurately interpret results from longitudinal cohort studies where only questionnaires are the physical activity and sedentary time measure.
Moreover, although PAQs can inherit different measurement properties, the methods for processing the PAQs can result in similar expressions (eg, ranking of the SGPALS can be summarised as volume,25 volume calculations can be grouped as quartiles), and thus the processing of questionnaires may also influence the validity differently.
We aimed to assess the validity of two PAQs inheriting different measurement properties; ranking of physical activity levels (SGPALS), volume calculations (PAFID) and one sedentary time questionnaire (IPAQ sitting short-form), by using accelerometry as our criterion, in a large heterogeneous sample of adults and older adults. Additionally, we aimed to assess how ranking and volume calculations of the PAQs reflects accelerometry measured physical activity and sedentary time.
We used participants from the seventh wave of the population-based cohort study named The Tromsø Study, which is conducted in Tromsø, Northern Norway. The study includes seven waves of data collection (Tromsø 1: 1974, Tromsø 2: 1979 to 1980, Tromsø 3: 1986 to 1987, Tromsø 4: 1994 to 1995, Tromsø 5: 2001, Tromsø 6: 2007 to 2008, Tromsø 7: 2015 to 2016) (details described elsewhere32).
All inhabitants in Tromsø municipality aged 40 years and older were invited to Tromsø 7. A total of 21 083 (65% of 32 591 invited participants) participants attended a first visit including questionnaires, biological sampling and clinical examinations. A random selection of 8346 participants attended a second visit at a later time point (>7 days), where 6778 participants were invited to wear an accelerometer, of which 6332 (93%) participants accepted. Of those who provided valid accelerometry data, 4040 participants completed both the leisure time and occupational time SGPALS; 5902 participants completed the PAFID questionnaire, and 5186 and 5088 participants completed the sitting question from the IPAQ short-form for week and weekend, respectively, where 4896 completed both.
All participants gave written informed consent.
Patient and public involvement
The Tromsø study advisory board includes patient (University hospital of Northern Norway) and public (eg, Norwegian Health Association, Tromsø municipality) representatives. Some participants are invited as ambassadors when data collection is ongoing, where they actively contribute to recruitment of participants. We have together with the Norwegian Health Association provided individual feedback on levels of physical activity to participants in Tromsø 7. There was no public involvement when designing this study.
Height and weight were measured in light clothing without shoes. Body mass index (BMI) was calculated (kg/m2) and defined as normal and underweight (<25 kg/m2), overweight (25 to 29.9 kg/m2) and obese (≥30 kg/m2). Educational level was collected from questionnaires and categorised in; (1) primary school, (2) high school diploma, (3) university education <4 years and (4) university education ≥4 years.
The physical activity and sitting questionnaires
The Saltin-Grimby physical activity level scale
The SGPALS asks participants to rank their leisure time and occupational time physical activity level separately, choosing one of four options. Based on the idea of the original questionnaire by Saltin and Grimby15, the SGPALS used in Tromsø 7 is a slight modification of Saltin and Grimby15 according to Rödjer et al.17 The SGPALS is presented in online supplementary table 1.
We computed the SGPALS as combined leisure time and occupational time where individuals were categorised as (1) inactive, (2) moderately inactive, (3) moderately active and (4) active according to Wareham et al33 with some modifications. In order to calculate physical activity volume, we assigned a physical activity level (PAL) value from the combined leisure time and occupational time SGPALS, which we derived from a previous validation study that calculated PAL as energy expenditure obtained from doubly labelled water divided by the estimated basal metabolic rate.25 The classifications and the assigned PAL value are presented in table 1.
The physical activity frequency, intensity and duration questionnaire
The PAFID questionnaire (table 2) includes three questions referring to frequency, intensity and duration of physical activity. We generated an index to reflect METs by multiplying intensity (METs) by duration (minutes) by frequency (times per week), and the outcome was expressed as MET-hours per week.34 35 We also grouped MET-hours per week in quartiles in order to assess the validity of ranking physical activity in this PAQ.
The International physical activity questionnaire, sitting question
In this study, the IPAQ short-form sitting question21 was employed, asking participants to estimate their average amount of sitting hours on a typical week and weekend day during the last week. In addition to the reported volume, we also grouped sitting hours in quartiles to assess the validity of ranking sitting hours.
Accelerometry data processing
Accelerometry measured physical activity was measured with the triaxial (three planes; axial, coronal and sagittal) ActiGraph wGT3X-BT accelerometer (ActiGraph, LLC, Pensacola, USA), firmware 1.2.0 to 1.8.0. Trained technicians attached the accelerometer to the participants’ right hip and instructed them to wear the accelerometer for 24 hours a day on eight consecutive days (the rest of the day following the visit in the clinic and seven more days) and only to remove the accelerometer during water-based activities (eg, showering or swimming) and contact sports. The accelerometer was returned by mail in a prepaid envelope. The ActiLife software (ActiGraph, LLC, Pensacola, USA) was used for initialisation and downloading the data. The accelerometer was initialised for raw data mode with a sampling frequency of 100 Hertz and recordings started at 00:00 the day following the visit in the clinic.
The raw acceleration files were filtered to 10 s epochs using the normal (default) proprietary filter in the ActiLife software. The acceleration units are expressed in triaxial vector magnitude (VM) (the square root of the sum of squared activity counts) counts per minute (CPM). We also extracted the number of steps in the accelerometer, which derives from the axial plane in a proprietary algorithm by the manufacturer. The .agd-files (epoch files) were further converted to .csv-files and further analysed in the Quality Control & Analysis Tool software (a custom-made software developed in Matlab: The MathWorks, Inc, Natick, Massachusetts, USA).
The 10 s epochs were further aggregated to 60 s and an epoch was classified as wear time if two of the following three criteria were fulfilled: (1) an epoch >5 VM CPM, (2) if at least two epochs >5 VM CPM in the proceeding 20 min or (3) at least two epochs >5 VM CPM in the following 20 min. Otherwise the acceleration was considered to be noise and classified as non-wear time.36
The triaxial VM CPM cut-points for different intensities are <150 VM CPM for sedentary behaviour37 and ≥2690 VM CPM for moderate and vigorous physical activity (MVPA),38 where light physical activity is between 150 to 2689 VM CPM.
Extracted accelerometry measures were volume measures (steps per day and mean VM CPM per day) in addition to intensity measures (minutes per day in sedentary behaviour, light physical activity, MVPA and ≥10 min bouted MVPA).
We calculated Pearson correlation coefficients to assess the correlation between the PAQs volume outcomes (SGPALS: PAL score, PAFID: MET-hours·week-1, IPAQ sitting: hours spent sitting) and accelerometry outcomes (VM CPM, steps per day, minutes in sedentary behaviour, light physical activity, non-bouted and bouted MVPA) where a coefficient of 0.00 to 0.10, 0.10 to 0.39, 0.40 to 0.69 and ≥0.70 was considered a negligible, weak, moderate and strong correlation, respectively.39 Univariate analyses of variance (ANOVA) were performed to assess associations of accelerometry measures (VM CPM, steps, minutes in sedentary behaviour, light physical activity, non-bouted and bouted MVPA) with the SGPALS physical activity ranking, quartiles of MET-hours per week from the PAFID questionnaire and quartiles of reported sitting from the IPAQ. For the IPAQ sitting question, a Bland-Altman plot was created (online supplementary figure 1). The Alpha level was set to 0.05 and data are presented as mean±SEM unless otherwise is stated. All data were confirmed to follow normal distribution by visual inspection of residuals when performing the above-mentioned analyses. The analyses were performed overall and in strata of sex, age (10 year groups), BMI (<25, 25 to 29, ≥30 kg·m-2) and education (primary, high school, <4 years university, ≥4 years university). The Statistical Package for Social Sciences (V.25, International Business Machines Corporation, Armonk, New York, USA) was used to perform all statistical analyses.
Patient and public involvement
Patients and/or the public were involved in the design, or conduct, or reporting or dissemination plans of this research. Refer to the Methods section for further details.
The descriptive characteristics of the participants wearing the accelerometers and completing the PAQs are presented in table 3.
PAL scores calculated from the SGPALS correlated weakly with VM CPM (r=0.32), steps per day (r=0.27), sedentary behaviour (r=−0.20), light physical activity (r=0.22), non-bouted MVPA (r=0.25) and bouted MVPA (r=0.16) (all p<0.05), which was consistent across sex, age, BMI and educational level (all p<0.05) (online supplementary table 2). All accelerometry measures increased by increasing rank of self-reported physical activity (Ptrend <0.001) (table 4).
Calculated MET-hours per week from the PAFID questionnaire showed negligible correlation with accelerometry measured light physical activity (r=0.06), weak correlation with VM CPM (r=0.34), moderate correlation with steps per day (r=0.43) and weak and moderate correlation with non-bouted MVPA (r=0.39) and bouted MVPA (r=0.44), respectively (p<0.001). This was consistent across sex, age, BMI and educational level (p<0.05) except for light physical activity, which did not correlate with MET-hours per week in some age groups (40 to 49 years; p=0.19, 50 to 59 years; p=0.13, 60 to 69 years; p=0.79), BMI classifications (<25 kg/m2; p=0.54 and 25 to 29 kg/m2; p=0.31) and educational levels (high school; p=0.07 and university ≥4 years; p=0.051) (online supplementary table 3).
Quartiles of MET-hours per week from the PAFID questionnaire showed positive association with all accelerometry measures (Ptrend <0.001) (table 5).
Accelerometry measured sedentary hours per day correlated weakly with reported sitting hours from the IPAQ sitting question (week day; r=0.22, weekend day; r=0.15), combined (mean of week and weekend; r=0.22, all p<0.01), which was consistent across sex, age, BMI and educational level (p<0.01) (online supplementary table 4). There was a positive association between quartiles of reported sitting in the IPAQ and accelerometry measured sedentary time (Ptrend <0.001) (table 6).
We assessed the criterion validity of two PAQs inheriting different physical activity measurement properties (physical activity ranking, volume calculation) and one sedentary time questionnaire, processed as both ranking and volume calculations, against accelerometry as our criterion measure. We found positive associations between ranking of physical activity in both the SGPALS and the PAFID questionnaire, and accelerometry measured physical activity. When processed as calculated volume, we found at best moderate correlations between self-reported and accelerometry measured physical activity. The IPAQ sitting question showed weak correlations and a narrow range in mean accelerometry measured sedentary time between quartile 1 and 4 in the IPAQ (within 1 hour per day).
The validity of the questionnaires
We found positive associations between accelerometry measured physical activity and ranking in the SGPALS. For example, those who categorised themselves in the lowest rank in the combined SGPALS accumulated on average ~4900 steps and 23 min of MVPA per day, respectively, which is about half of the accumulated steps and MVPA per day in the highest rank (~8290 steps and 53 min MVPA). This illustrates the ability of the SGPALS to rank physical activity levels in a large cohort of adults and elderly. The findings of positive associations between SGPALS rankings and accelerometry measured physical activity are consistent with previous validation studies of the SGPALS.23 26
In contrast, when estimating PAL volume scores from the SGPALS, the correlations between PAL scores and accelerometry measured physical activity were weak, which accentuates the biasses associated with self-reported physical activity.2–4 6 These findings may suggest that the biases associated with self-reported physical activity are more pronounced when physical activity is processed as total volume (eg, PAL, MET-hours per week) compared with ranking individuals according to their self-reported physical activity.
We found positive associations between quartiles of MET-hours per week from the PAFID questionnaire and accelerometry estimates. However, correlations between MET-hours per week from the PAFID questionnaire and accelerometry estimates were weak and only moderate for bouted MVPA. Such correlations are consistent with a previous validation study of the PAFID questionnaire.18 As with the SGPALS, ranking by quartiles may be the preferred way of expressing self-reported physical activity.
Although we found a positive association between quartiles of reported sitting hours from the IPAQ and accelerometry measured sedentary time, the narrow 1 hour range between quartile 1 and 4 in the IPAQ suggests small differences in real sedentary time between quartiles in the IPAQ.
This study included one of the largest sample sizes in validation studies of PAQs, allowing us to assess the validity in a large heterogeneous sample with high participation rate, which may represent the heterogeneous population to a larger extent than smaller sample sizes. Consequently, the generalisability of the findings from this study is likely high, at least for adults >40 years in western high-income countries.
Validation of PAQs is challenging. First of all, in contrast to doubly labelled water, which is the gold standard for measuring free-living energy expenditure,40 41 there is no gold standard to measure all aspects (domain, context, intensity, duration, frequency and volume) of physical activity accurately.16 42 43
Second, we used specific cut-points to split intensity in the accelerometry data, which may not reflect the intended intensity by the participants when answering the PAQs. However, in general, accelerometry measured physical activity shows greater validity than self-reported methods when compared with energy expenditure estimated from doubly labelled water,44–46 thus, a criterion validation from accelerometry can be considered applicable.
Third, the time periods for self-reported physical activity and sedentary time were not aligned with the accelerometry assessment. However, most physical activity instruments are intended to assess habitual physical activity.47 Moreover, as all included questionnaires (SGPALS: Kappa: 0.69,16 PAFID: Spearman’s rho (ρ): 0.76 to 87),18 IPAQ: ρ: 0.50 to 0.9421) and a 7 day accelerometry recording with four valid days (intraclass correlation: 0.8)47 are found to provide acceptable reliability, we believe that the included instruments provide reasonable estimates of habitual physical activity and our comparison is justified.
Finally, the waist placement of accelerometers in our study does not assess sitting per se. Other placements, such as thigh-worn accelerometers, may be more suitable for validating self-reported sitting. Nevertheless, our results are consistent with a previous study that employed thigh-worn accelerometers,48 suggesting that hip-worn accelerometers are able to measure sedentary time more accurately than self-reported methods.
Ranking of physical activity seems to be the preferred method to process PAQs, exhibiting higher validity against accelerometry measures than volume calculations of self-reported physical activity. Self-reported sedentary time poorly reflects accelerometry measured sedentary time. The two PAQs can be used for ranking individuals into different physical activity categories supporting previous studies using these instruments when assessing associations with health outcomes.
The authors would like to acknowledge PhD Ola Løvsletten for his advice on statistical analyses.
Contributors Significant manuscript writer: EHS. Significant manuscript reviewer/reviser: BM, UE, LAH, SB, BHH, JJ and AH. Concept and design: EHS, BM, UE and LAH. Data acquisition, analysis and interpretation: EHS, BM, LAH, UE, SB, BHH, JJ and AH. Statistical expertise: Ola Løvsletten mentioned in acknowledgements.
Funding The article processing charges are funded by the publication fund at the University Library at UiT the Arctic University of Norway. The work of Edvard H Sagelv and Jonas Johansson are funded by the Population Studies in the High North (Befolkningsundersøkelser i Nord: BiN), an internally funded project by UiT The Arctic University of Norway. The work of Søren Brage is funded by the UK Medical Research Council (MC_UU_12015/3) and the NIHR Biomedical Research Centre in Cambridge (IS-BRC-1215-20014). The remaining authors were funded internally by their institutional tenures/positions.
Competing interests None declared.
Patient consent for publication Not required.
Ethics approval Tromsø 7 and this present study were approved by the Regional Ethics Committee for Medical Research (REC North ref. 2014/940 and 2016/758410, respectively) and the Norwegian Data Protection Authority.
Provenance and peer review Not commissioned; externally peer reviewed.
Data availability statement Data may be obtained from a third party and are not publicly available. The data that support the findings of this study are available from the Tromsø Study but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. The data can be made available from the Tromsø Study upon application to the Data and Publication Committee for the Tromsø Study.
If you wish to reuse any or all of this article please use the link below which will take you to the Copyright Clearance Center’s RightsLink service. You will be able to get a quick price and instant permission to reuse the content in many different ways.