The Multidimensional Fatigue Inventory (MFI-20) was developed in 1995. Since then, it has been widely used in cancer research and cancer-related illnesses but has never been validated in fatiguing illnesses or in a large US population-selected sample. In this study, we sought to examine the reliability and validity of the MFI-20 in the population of the state of Georgia, USA. Further, we assessed whether the MFI-20 could serve as a complementary diagnostic tool in chronically fatigued and unwell populations.
The data derive from a cross-sectional population-based study investigating the prevalence of chronic fatigue syndrome (CFS) in Georgia. The study sample was comprised of three diagnostic groups: CFS-like (292), chronically unwell (269), and well (222). Participants completed the MFI-20 along with several other measures of psychosocial functioning, including the Medical Outcomes Survey Short Form-36 (SF-36), the Zung Self-Rating Depression Scale (SDS), and the Spielberger State-Trait Anxiety Inventory (STAI). We assessed the five MFI-20 subscales using several criteria: inter-item correlations, corrected item-total correlations, internal consistency reliability (Cronbach's alpha coefficients), construct validity, discriminant (known-group) validity, floor/ceiling effects, and convergent validity through correlations with the SF-36, SDS, and STAI instruments.
Averaged inter-item correlations ranged from 0.38 to 0.61, indicating no item redundancy. Corrected item-total correlations for all MFI-20 subscales were greater than 0.30, and Cronbach's alpha coefficients achieved an acceptable level of 0.70. No significant floor/ceiling effect was observed. Factor analysis demonstrated factorial complexity. The MFI-20 also distinguished clearly between three diagnostic groups on all subscales. Furthermore, correlations with depression (SDS), anxiety (STAI), and functional impairment (SF-36) demonstrated strong convergent validity.
This study provides support for the MFI-20 as a valuable tool when used in chronically unwell and well populations. It also suggests that the MFI-20 could serve as a complementary diagnostic tool in fatiguing illnesses, such as CFS.
Fatigue is a common symptom associated with numerous acute and chronic illnesses. Fatigue is one of the most frequent symptoms reported to physicians; between 7% and 45% of primary care consultations involve fatigue [1,2]. High levels of fatigue negatively affect quality of life for patients with cancer, Parkinson's disease, multiple sclerosis, and persons with less well-understood illnesses such as chronic fatigue syndrome (CFS) and fibromyalgia [3-6]. The fatigue associated with various conditions is generally not alleviated by rest and precludes normal mental and physical activities. Fatigue also often accompanies affective disorders . Fatigue was significantly positively correlated with depression in patients with multiple sclerosis [8-10] and in patients with unexplained fatigue . The national Canadian Community Health Survey reported that 36% of individuals with CFS were depressed , whereas a population-based study of CFS reported that 22% of individuals with CFS in Georgia had major depressive disorder, and 46% had anxiety disorders . Roy-Byrne et al.  found that fatigued twins were more somatically preoccupied and anxious than non-fatigued twins. Optimal management of patients with fatiguing illnesses requires assessing the nature, frequency, severity, and duration of fatigue and evaluating effects of interventions on fatigue. Although several standardized instruments have been designed to evaluate fatigue, they have not been validated across illnesses in adults.
The Multidimensional Fatigue Inventory (MFI-20) was developed by a Dutch group in 1995 to measure fatigue severity . The MFI-20 was first evaluated in a group of people with CFS, cancer patients, a healthy control group comprised of psychology and medical students, and a group of army recruits . The MFI-20 showed good internal consistency (Cronbach's alpha > 0.80) for the general, physical, and mental fatigue dimensions, and adequate reliability for the reduced activity and motivation items (Cronbach's alpha > 0.65). Construct validity between the different test groups was significant at p < 0.001 for all five dimensions. Convergent validity between the MFI-20 and the Visual Analogue Scale (VAS) fatigue score for the group of cancer patients was significant for all subscales.
Validity and reliability of the MFI-20 have also been evaluated in several other non-US populations. These included patients with cancer [15-17], chronic fatigue [3,18], craniopharyngioma , myelodysplastic patients , thyroid disease, and a "not tired" control group .
Test-retest reliability of the MFI-20 has been reported in several European studies. Ericsson and Mannerkorpi validated the MFI-20 in 166 Swedish patients with fibromyalgia and chronic widespread pain . Hagelin et al.  validated the MFI-20 in four groups composed of 584 Swedish subjects: palliative cancer patients, cancer patients receiving radiation therapy, noncancer outpatients, and a group of hospital staff. Gentile et al.  validated the MFI-20 in three groups of French subjects: tired, (82 subjects), moderately tired (36), and not tired (107). Finally, Schwarz et al.  published the population norms for the five MFI-20 subscales in a sample of 2,037 adult Germans. The crucial result of the Schwarz study was the quantification of age and sex dependency in fatigue.
In the United States, Schneider validated the MFI-20 in 97 rural oncology outpatients and in 45 spouses or first-degree female caregivers of male hemodialysis patients in northern and eastern Iowa [22,23]. To our knowledge, the MFI-20 has not been validated in persons from the US with fatiguing illnesses nor in a large US population-selected sample. The aims of the present study were: 1) to investigate the reliability and validity of the MFI-20 in chronically unwell and well persons; 2) and to assess whether the MFI-20 could serve as a complementary diagnostic tool in populations with fatiguing illnesses.
The data came from a cross-sectional, population-based study investigating the prevalence of CFS in Georgia. Details of the source study have been previously published  but are summarized here. The Centers for Disease Control and Prevention (CDC) Institutional Review Board, as required by US Department of Health and Human Services regulations, approved the study. All participants provided informed consent.
Study design and sample
The study was carried out in two phases between September 2004 and July 2005. Phase 1 involved a random-digit-dialing telephone survey to screen 19,381 adult residents (96% response) ages 18 to 59 from metropolitan, urban, and rural Georgia populations. Based on the 19,381 people from the household screening interview, 8,910 adults were randomly selected for detailed telephone interviews: 5,623 individuals completed the detailed telephone interview; 1,874 refused to participate; 141 were further confirmed to be ineligible; and 1,272 were excluded due to physical or mental inability to participate, inability to be contacted, language barriers, or because they had died. This yielded an overall response rate of 75%. Based on the detailed telephone interviews, study participants were classified into three groups:
1) CFS-like, characterized by severe fatigue lasting six months or longer that was not alleviated by rest, that caused substantial reduction in occupational, educational, social, or personal activities, and that was accompanied by at least four of the CFS case-defining symptoms.
2) Chronically unwell, having chronic (≥ six months) unwellness with or without fatigue, but not meeting the criteria for CFS.
In Phase 2, all 469 people with CFS-like illness were invited for clinical evaluation, and 292 (62%) participated. Of randomly selected chronically unwell participants, 286 (53%) completed the clinical evaluation. Finally, 223 individuals classified as well in the telephone interview completed clinical evaluations. They were matched to the CFS-like group based on residence (metropolitan, urban, rural), sex, race/ethnicity, and age (within three years). Overall, about 50% of invited respondents from all three groups completed the one-day clinical evaluation.
Participants completed the MFI-20 and other questionnaires during the clinical evaluation. This study involves data from 783 participants who completed the MFI-20 along with several other measures of psychosocial functioning, including the Medical Outcomes Survey Short Form-36 (SF-36), the Zung Self-Rating Depression Scale (SDS), and the Spielberger State-Trait Anxiety Inventory (STAI).
The MFI-20 comprises five subscales: general fatigue, physical fatigue, mental fatigue, reduced activity, and reduced motivation . Each subscale includes four items with five-point Likert scales. General fatigue includes general statements about fatigue and decreased functioning and was designed to encompass both physical and psychological aspects of fatigue. Physical fatigue concerns physical sensations related to fatigue. Mental fatigue pertains to cognitive functioning, including difficulty concentrating. Reduced activity refers to the influence of physical and psychological factors on the level of activity. Reduced motivation relates to lack of motivation for starting any activity. Scores on each subscale range from 4 to 20, with higher scores indicating greater fatigue.
The SF-36 contains eight multi-item subscales: general health perceptions, physical functioning, role physical (role limitations due to physical problems), bodily pain, general mental health, vitality (vitality/energy/fatigue), role emotional (role limitations due to emotional problems), and social functioning. The number of response choices per item ranges from two to six. Each transformed subscale has a range from 0 to 100 (100 = optimal function) . The SF-36 also yields two summary scores that reflect the two-dimensional factor structure underlying the eight subscales: a physical component summary (PCS) score and a mental component summary (MCS) score. PCS and MCS are a linear combination of eight SF-36 subscales, but PCS is predominantly based on the subscales physical functioning, role physical, bodily pain, and general health perceptions, and MCS is predominantly based on the scales mental health, role emotional, social functioning, and vitality (range 0-100, 100 = optimal) .
The SDS  includes 20 questions that quantify the severity of depression symptoms. Each item ranges from 1 (none or a little of the time) to 4 (most or all of the time). The raw SDS score is the sum of all 20 items and ranges from 20 to 80. Following standard practice, we converted raw SDS scores to a 100-point scale (SDS index) in which < 50 = normal, 50-59 = mild depression, 60-69 = moderate to marked depression, and ≥ 70 = severe depression.
The STAI  includes 40 questions with four possible responses to each. It was constructed as two subscales: 20 items to assess state anxiety, and another 20 to assess trait anxiety. State anxiety is defined as a transient, momentary emotional status that results from situational stress. Trait anxiety represents a predisposition to react with anxiety in stressful situations. Each subscale ranges from 20 to 80, with higher scores indicating higher anxiety. These two parts differ in the item wording, in the response format (intensity versus frequency), and in the instructions for how to respond. The STAI clearly differentiates between the temporary condition of state anxiety and the more general and long-standing quality of trait anxiety.
All four questionnaires were self-reported and self-administered by participants. The mean time taken to complete each questionnaire was five, nine, three, and four minutes for the MFI-20, SF-36, SDS, and STAI, respectively. The Flesch Reading Ease formula and a Flesch abstraction formula were applied. The measures are generally shown to be useful for respondents with a sixth grade reading level or below. The reading level of each respondent was assessed by the Wide Range Achievement Test reading subtest , and only 45 (6%) of respondents were below a sixth grade reading level.
We used SAS version 9.1 (SAS Institute Inc, Cary, NC) for data analysis. Descriptive statistics (frequencies, percentages, means, standard deviations, and ranges) were generated to characterize the study sample in terms of socio-demographic parameters. We used several criteria to assess the subscale validity and reliability of the MFI-20.
Internal consistency of each of the five MFI-20 subscales was determined using three reliability tests: 1) inter-item correlation; 2) corrected-to-total (or item-total) subscale correlation; 3) and Standardized Cronbach's α coefficients (and item discrimination). The cutoff criteria for acceptance on reliability tests are as follows. First, item-total subscale correlations of not less than 0.30 and inter-item correlations of 0.30 to 0.70 were retained. Second, a fairly high reliability coefficient (Cronbach's α > 0.70) was required to assess the internal consistency reliability [30,31]. Floor/ceiling effects were considered significant if more than 15% of the subjects had either the lowest possible or highest possible score on the subscales . A significant floor effect was expected in the well group.
As an indication of discriminant (known-group) validity, group differences in the five MFI-20 subscales were calculated using analyses of variance to examine the ability of the MFI-20 instrument to distinguish three groups: CFS-like, chronically unwell, and well. Using a Tukey correction, the alpha per test for each subscale was 0.01, for an overall alpha of 0.05. Two-way analyses of variance were performed to test the age and sex effects on the five MFI-20 subscales. Post-hoc analysis with Tukey p-value adjustment was performed for multiple subgroup comparisons.
To further assess construct validity of the subscales, an exploratory factor analysis was performed. A principle component analysis was used to extract factors. The obtained factors were rotated oblique using the Varimax procedure. A minimum eigenvalue of 1 was specified as the extraction criterion . The desired criterion of factor loadings was set at 0.50 or above, slightly higher than the typical cutoff value of 0.40 .
Finally, the convergent validity of the MFI-20 was evaluated through comparisons of the MFI-20 with other instruments administered in the protocol. Pearson correlation coefficients were used to assess linear associations between the multi-item scales of SF-36, SDS, and STAI. We chose these instruments based on the association between fatigue and other measures on psychosocial functioning, such as health-related quality of life (measured by SF-36), depression (measured by SDS), and anxiety (measured by STAI) as well as the existing data from the source study.
The most valid SF-36 subscales for measuring physical health include the physical functioning, role physical, and bodily pain subscales and the physical component summary score . The most valid SF-36 subscales for measuring mental health include the mental health, role emotional, and social functioning subscales and the mental component summary score . For the concept of physical and mental health, we investigated correlations between MFI-20 subscales and physical and mental health as measured by the SF-36.
Data completeness was high, with only one missing response for the reduced activity subscale among all five subscales. This indicated that the MFI-20 was well-accepted in our study sample of chronically unwell and well people.
Table 1 summarizes subscale validity and reliability analyses for the 783 participants who completed the MFI-20 questionnaire. Of these, 37% had been classified as CFS-like based on the detailed telephone interview, 34% were chronically unwell, and 28% were considered well. The participants had a mean age of 43, were primarily female (76%), white (70%), and from rural or urban areas (83%). Nearly 95% had completed at least a high school education. Nearly 38% were unemployed, self-employed (not working for pay), retired, laid off, disabled, or students. More than 60% of participants were married or cohabitating. More than half of participants had a household income equal to or higher than the Georgia median income level of $42,679.
Table 1. Characteristics of the study sample
Associations of age and sex with MFI-20 subscale scores
Only the physical fatigue subscale score differed significantly by both age (p = 0.0024) and sex (p = 0.0015). Reduced activity (p = 0.0078) and reduced motivation (p = 0.0112) scores differed significantly between age groups. General fatigue (p = 0.0003) and mental fatigue (p = 0.0272) scores were significantly worse in females than in males. The interaction between age and sex was not significant in any of the MFI-20 subscales. Although only three of the five MFI-20 subscales were significantly different by sex, descriptive statistics of all the subscales were summarized for females and males (Table S1 and Table S2, Additional file 1).
For subscales with significant age or sex effects, we estimated partial correlations controlling for sex and age, respectively (Table 2). This had negligible effects on the correlations between the physical fatigue, reduced activity, and reduced motivation subscales.
Table 2. Correlations among MFI-20 subscales and their partial correlations controlled for age or sex.
Table 3 summarizes the results of three reliability tests for the five MFI-20 subscales. There was no item redundancy; inter-item correlations averaged 0.56 (range 0.46-0.69) for general fatigue, 0.52 (range 0.44-0.61) for physical fatigue, 0.53 (0.41-0.66) for reduced activity, 0.38 (0.17-0.56) for reduced motivation, and 0.61 (0.53-0.66) for mental fatigue. Corrected item-total correlations were higher than 0.30 for all five MFI-20 subscales. The values for standardized Cronbach's α for the five MFI-20 scales were: general fatigue: 0.83; physical fatigue: 0.81; reduced activity: 0.82; reduced motivation: 0.71; and mental fatigue: 0.86. These values were greater than the suggested criteria value of 0.70 for acceptable reliability.
Table 3. MFI-20 scale item characteristics and internal consistency reliabilities.
Relationships among five MFI-20 subscales
Pairwise correlations between the MFI-20 subscales ranged from 0.49 to 0.74. Although the subscales are strongly related to each other, it is unclear whether an overall summary component of the MFI-20 is appropriate. Factor analysis confirmed that overall summary components accounted for 70% of the reliable variance in the five subscales. The total scale with 20 items yielded a Cronbach's α coefficient of 0.93, which is consistent with the result from the Gentile study .
The factor analysis solution was complex, with multiple loadings of items having factor-loading values > 0.50 across five factors (Table 4). However, the first factor, which explained 20% of the variance in the 20 items of the MFI-20, was dominated by general fatigue and physical fatigue. Six items (four physical fatigue, one general fatigue, and one reduced activity) loaded on the first factor (loadings from 0.54 to 0.83). The second factor was comprised solely of all four mental fatigue items (loadings from 0.71 to 0.81), which explained 15% of the variance in the 20 items of the MFI-20. Three of the reduced activity items fell nicely (loading from 0.52 to 0.71) on the third factor. The fourth factor was loaded by three of the general fatigue items (loading from 0.52 to 0.71) and two of the reduced motivation items (loading 0.58 and 0.64). The remaining two reduced motivation items fell nicely on the fifth factor.
Table 4. Factor analysis of 20 MFI item responses.
Discriminant (known-group) validity: MFI-20 subscale differences between three groups
The CFS-like, chronically unwell, and well groups had significantly different mean values (p < 0.0001) for all the MFI-20 subscales (Table 5). All subscales appeared to discriminate between groups, but the degree to which they discriminated varied. The CFS-like group had higher scores in all the subscales compared to the chronically unwell group (average mean difference = 2.90; range of mean difference: 2.26 points (reduced activity) - 3.54 points (general fatigue). Compared to the well group, the CFS-like group had subscale scores that were, on average, 6.01 points higher. Mean differences between these groups ranged from 4.56 points (reduced activity) to 7.96 points (general fatigue), whereas the chronically unwell group scored, on average, 3.11 points higher in the five subscales than the well group.
Table 5. Descriptive statistics for the five MFI-20 scales by subgroups
We observed a floor/ceiling effect in all the MFI-20 subscales in the well group, except for general fatigue, as expected. No floor/ceiling effects were detected in the CFS-like and chronically unwell groups. There were no floor/ceiling effects in the whole study sample (Table 5).
Convergent validity: relationships to functional impairment, depression, and anxiety
We calculated correlations between fatigue subscales and subscales measuring functional impairment (SF-36), depression (SDS), and anxiety (STAI) to evaluate convergent validity in the overall sample (Table 6). The MFI-20 subscales were substantially correlated with the eight SF-36 subscales (average: r = -0.53; range of absolute values of correlations: |r| = 0.34 - 0.83). All MFI-20 subscales were most strongly correlated  with the SF-36 subscales measuring vitality (average: r = -0.68; range of |r|: 0.57 - 0.83), followed by general health perception (average: r = -0.59; range of |r|: 0.48 - 0.71), and social functioning (average: r = -0.54; range of |r|: 0.50 - 0.59).
Table 6. Convergent Validity: Pearson Correlation Coefficients between the MFI-20, SF-36, SDS, and STAI† in overall sample.
As expected, all five MFI-20 subscales were significantly correlated with depression, anxiety, and functional impairment. However, the correlations with depression and anxiety were generally lower (average: r = 0.50; range of r = 0.34-0.65) than correlations with functional impairment (the SF-36 subscales). The highest correlations were found between MFI-20 subscales and measurement of depression (SDS index) (average: r = 0.58; range of r = 0.50 - 0.65).
Conceptual relationship: Mental and Physical
The general fatigue subscale of the MFI-20 was associated with both physical and mental health, based on strong correlations (|r| >=0.5) with functional impairment as measured by the SF-36 subscales (except for physical functioning, bodily pain, and role emotional), and both the physical component summary score and mental component summary score. General fatigue was also highly associated with the SDS index and the STAI trait-anxiety subscale.
The physical fatigue subscale of the MFI-20 was highly correlated (|r| >=0.5) with several subscales of the SF-36 that measure predominantly physical health (physical functioning, role physical, bodily pain, social functioning, vitality, general health) and the physical component summary score but not the mental component summary score (Table 6). Physical fatigue was also highly correlated with the SDS index score measuring depression.
The mental fatigue subscale of the MFI-20 was highly correlated with several subscales of the SF-36 that measure predominantly mental health (social functioning, mental health, and vitality subscales), as well as the mental component summary score. The mental fatigue subscale was also associated with depression (SDS index) and trait anxiety (STAI).
The reduced activity subscale of the MFI-20 was highly correlated (|r| >=0.5) with several SF-36 subscales (physical functioning, social functioning, vitality, and general health perception). The reduced motivation subscale of the MFI-20 was highly correlated with many SF-36 subscales (role physical, social functioning, mental health, and vitality) as well as the mental component summary measure, but not the physical component score. Reduced activity was also correlated with depression (SDS index) and anxiety (STAI trait-anxiety).
We examined the total fatigue score of the MFI-20 in relation to other instruments. Total fatigue was highly correlated with all SF-36 subscales except for bodily pain, and was correlated with the physical component summary score and mental component summary score, as well as SDS index and state-anxiety and trait-anxiety subscales (STAI). The total fatigue score of the MFI-20 was highly consistent and demonstrated the highest correlations with other questionnaires.
Relationships to depression, anxiety, and functional impairment among classification groups
In the CFS-like group, the SF-36 subscale scores were highly correlated with the MFI-20 subscales for general fatigue, physical fatigue, reduced activity, and reduced motivation but not with mental fatigue. Also in this group, depression (SDS index) was highly correlated with reduced motivation (r = 0.50) but only moderately correlated with other subscales of the MFI-20. In general, the scores of the STAI correlated with all five MFI-20 subscale scores. The trait-anxiety score of the STAI had stronger correlations than state-anxiety with the MFI-20 subscale scores (Table S3, Additional file 1).
For the chronically unwell and well groups, depression and anxiety correlated with all five MFI-20 subscales. The bodily pain subscale and the physical component summary scores of the SF-36 did not correlate with the mental fatigue subscale of the MFI-20. Depression, as measured by the SDS index, correlated with all the MFI-20 subscales. The correlations between bodily pain of the SF-36 and activity fatigue (reduced activity or reduced motivation) are not statistically significant (Table S4 and Table S5, Additional file 1).
This study greatly extends previous research with the MFI-20 in several ways. The first objective of this study was to assess reliability and validity of the MFI-20 in chronically unwell and well groups identified from metropolitan, urban, and rural populations in the state of Georgia. The MFI-20 was well-accepted in our sample of unwell and well people. Low to moderate inter-item correlations indicated no item redundancy. Corrected item-total correlations for all MFI-20 subscales were all in an acceptable range. The MFI-20 item subscales exhibited adequate internal consistency reliability with Cronbach's α coefficients ranging from 0.72 to 0.86, which is consistent with results from previous studies [2,17,21,36]. We found no significant floor/ceiling effects in the whole study sample.
With respect to validity, the results of factor analysis of the MFI-20 in a sample of unwell and well people provide additional support for the five-factor structure of the MFI-20 . As previously noted, however, some factors are highly correlated, and several items would have loaded on more than one factor had the paths not been constrained. In addition to forming its own factor component, one of the general fatigue subscale items loaded on the same factor with items of the physical fatigue subscale because it provides information about physical fitness. This general fatigue subscale item may be considered along with the physical fatigue subscale to assess fatigue scores in populations with fatiguing illnesses. The results also showed that a total fatigue summary score is a valid summary score for people with fatiguing illnesses.
In a further examination of known-group comparison for construct validity, all five MFI-20 subscales distinguished clearly between our three study groups. The magnitude of the mean group differences in the MFI-20 subscales is greater than the generic minimal clinically important difference (MCID) of two points across the pre- and post-radiotherapy comparison and occupational productivity anchor . People with CFS-like illness had several higher fatigue and activity subscale mean scores that were both statistically and clinically significant (an average of three points higher) than those who were chronically unwell but did not have CFS-like illness. These differences were more exaggerated (six points higher, on average) when the CFS-like group was compared to the well group with respect to these subscales. As expected, those who were chronically unwell also had fatigue and activity subscale scores that were both statistically and clinically significant (three points higher than well people).
The MFI-20 subscales exhibited adequate convergent validity with other instruments. The general fatigue subscale of the MFI-20 is highly correlated with the functioning subscales of the SF-36, SDS depression, and the trait anxiety subscale of the STAI. This confirms that the general fatigue subscale represents both physical and psychological aspects of fatigue. Physical fatigue represents the physical sensation related to fatigue, which is validated by the substantial associations with physical functioning, role physical, bodily pain, social functioning, vitality, general health perception, and physical component summary measure. Reduced activity refers to the influence of both physical and psychological factors on the level of activity. Reduced motivation refers to the psychological experience of feeling unable to start an activity . Finally, mental fatigue, which originally measures cognitive functioning such as difficulty concentrating, reflects the "mental health" concept of fatigue, which is validated by the associations with social functioning, mental health, and vitality as well as the mental component summary measure.
Our study showed that sex and age exert effects on several MFI-20 subscales. Compared to males, females had slightly higher mean scores for subscales measuring general fatigue, physical fatigue, and mental fatigue. This confirms previous findings of sex differences in mean scores of fatigue scales [21,39], and age-associated increases in mean scores in physical fatigue, reduced activity, and reduced motivation .
We showed that the five MFI-20 subscales were highly correlated with functional impairment, depression, and anxiety in the overall sample. Breslin et al.  showed that depression correlated with the general fatigue and mental fatigue subscales of the MFI-20 but not with physical fatigue in patients with chronic obstructive pulmonary disease (COPD). Schwarz et al.  showed that fatigue is correlated with hospital anxiety and depression scale (HADS) and the global quality-of-life scale.
Our CFS-like group provides the opportunity to examine the convergent validity of the MFI-20 with other measurements among people with fatiguing illness. In the CFS-like group, additional support for the validity of the MFI-20 is provided by the insignificant-to-moderate correlations between the SF-36 subscales and mental fatigue of the MFI-20. This indicates that mental fatigue is only partly measured by the SF-36 among individuals with CFS-like illness. Depression is moderately correlated with several subscales of the MFI-20. We also showed low correlations between state-anxiety of the STAI and general fatigue, physical fatigue, and reduced activity of the MFI-20. Therefore, the additional information provided by the MFI-20 may deepen our insight into functional impairment, depression, and anxiety in fatiguing illnesses.
Strengths and limitations
The study's strengths include: a rigorous study design with a large, randomly selected sample from a cross-sectional, population-based study of fatiguing illness; and the careful clinical determination of groups, selection of comparison measures, report of reading levels of the instrument, and correction of p-values for multiple testing.
This study has several limitations. Our existing data did not allow us to conduct test-retest reliability of the MFI-20. Further studies might be needed to explore test-retest reliability of MFI-20 in fatiguing illness. Another limitation is external validity/generalizability. While the study employed random sampling, the population was limited to an adult population in Georgia and could therefore differ from results that might be obtained from implementing the same study design in other regions due to the effect of regional lifestyle. Nonetheless, previous studies on MFI-20 have not identified the effect of regional lifestyle in their study populations. Our cross-sectional data precluded us from examining responsiveness (ability of the MFI-20 to detect clinically important changes over time) and obviates the possibility of eventually examining responsiveness differences due to treatments. Longitudinal studies are needed to determine minimal clinically important differences (MCIDs) of the MFI-20 subscales in fatiguing illness.
In this study, we applied a 0.01 alpha level of statistical significance to adjust for multiple testing instead of the popular standard level of 0.05. This increases our confidence in the associations that were determined to be of statistical significance but also increases the risk of failing to reject a false null hypothesis (a Type II error), and so results in less statistical power. However, the statistically significant results observed in this study are of practical significance. For example, the group mean differences in our study are greater than the generic MCID of two points in the MFI-20 subscales in Purcell's study . The possibility of a Type II error should, however, be considered.
This study further demonstrates that the MFI-20 appears to be a valid and reliable measure of chronically unwell and well populations with a stable multidimensional factorial structure. It also suggests that the MFI-20 could indeed be a useful tool for further investigation of generic functional impairment and a complementary diagnostic tool to depression-specific and anxiety-specific instruments in fatiguing illnesses such as chronic fatigue syndrome.
The authors declare that they have no competing interests.
JML contributed to the conception of the manuscript, had primary responsibility for data processing, statistical analyses, and interpretation of the data, and wrote the manuscript. DJB contributed to intellectual input to data interpretation, streamlining the introduction, and revising the manuscript. EM contributed to data interpretation and critically revised the manuscript. EN contributed to tabulating the results and literature search, and revised the manuscript. RB contributed to intellectual input in the discussion section and revised the manuscript. WCR was Principal Investigator of the source study, collaborating with others in designing the study, writing the protocol, supervising field work, interpretation of the data, and critically revising the manuscript. All authors have read and approved the final manuscript.
This study was fully funded by the US Centers for Disease Control and Prevention. The authors would like to acknowledge Drs. James F. Jones and Roumiana S. Boneva of the CDC for their reviews of this manuscript.
Benedict RH, Wahlig E, Bakshi R, Fishman I, Munschauer F, Zivadinov R, Weinstock-Guttman B: Predicting quality of life in multiple sclerosis: accounting for physical disability, fatigue, cognition, mood disorder, personality, and behavior change.
J Neurol Sci 2005, 231(1-2):29-34.
Epub 2005 Jan 26.PubMed Abstract | Publisher Full Text
Journal of Neurological Sciences 2006, 243:39-45. Publisher Full Text
Fam Pract 2008, 25(6):414-22.
Epub 2008 Oct 3PubMed Abstract | Publisher Full Text
Psychosom Med 2009, 71(5):557-65.
Epub 2009 May 4.PubMed Abstract | Publisher Full Text
Brit J Haem 2003, 121:270-274. Publisher Full Text
Population Health Metrics 2007, 8(5):5. BioMed Central Full Text
Psychometrika 1951, 16:297-334. Publisher Full Text
J ClinEpidemiol 2007, 60(1):34-42.
Epub 2006 Aug 24.
Ann Oncol 2005, 16(3):372-82.
Epub 2005 Jan 27.PubMed Abstract | Publisher Full Text