Introduction

Dementia is highly prevalent in today’s society. It was estimated that worldwide 50 million people lived with dementia in 2018, and with the ageing population this number is expected to increase to 82 million by 2030 and further to 152 million by 2050 [1]. With these increasing numbers comes an increase in the magnitude of care required. Family members, who are most often elderly spouses, siblings or friends, are frequently the ones to provide (part of) this care [2]. These informal carers may be unprepared for the physical and emotional demands that caring entails and many carers experience considerable strain and well-being losses due to their caregiving tasks [3, 4].

At the same time, governments struggle with the rising demand for care of people with dementia, those with other (chronic) diseases, and their carers, and with limited health care resources. Regarding decisions for optimal spending, economic evaluation is a useful decision-making aid. It is increasingly applied in many Western healthcare systems. Usually in economic evaluations, health-related quality of life (HrQoL) is used to measure intervention benefits. However, in areas such as mental health and care of older people, improving health outcomes is not necessarily the main focus of care interventions [5, 6]. Broader outcome measures, which go ‘beyond health’ [7], capturing the effects of interventions in terms of well-being, for both patients and carers, may be more appropriate to capture relevant benefits of such interventions.

Several well-being measures have recently been developed, broadening the evaluative scope beyond health in different ways. Some of these have focused on context-specific elements of well-being beyond health, such as care-related quality of life in carers [8, 9] or social care-use-related well-being [10]. Analogous to measuring disease-specific quality of life rather than generic health-related quality of life, context-specific measures may be more sensitive to specific changes, but this comes at the expense of comparability across situations and populations. Generic measures of well-being in principle allow comparisons across interventions, diseases and situations. The ICECAP-O, a capability well-being measure for older people, is such a generic well-being measure [11], and it is increasingly used. Capability in this context refers to the extent to which a person is able to function in a particular way, whether or not he or she chooses to do so [11]. The ICECAP-O consists of five important general capability well-being dimensions (attachment, security, role, enjoyment and control) and has four answering levels per domain. Values for the states described with the instrument were derived from a sample of older people in England, using best–worst scaling [12]. ICECAP-O questionnaires and further information can be found on the website: www.birmingham.ac.uk/research/activity/mds/projects/HaPS/HE/ICECAP/ICECAP-O.

As a generic well-being measure, the ICECAP-O is well suited for economic evaluation of care interventions in elderly populations, especially those who are suffering from chronic diseases [13]. Given the generic nature of the instrument, it may be suitable to measure well-being not only in patients but also in informal carers. Using the ICECAP-O to measure well-being in both patients and their informal carers would facilitate comparisons and aggregation of outcomes within the same economic evaluation. This is also relevant for a scenario in which two interventions are being compared, and one of these interventions impacts informal carers. Without a comparable outcome between patients and informal carers, for example if the ICECAP-O were used for patients and the CarerQol for carers, it would be necessary to perform multiple-criteria decision analysis, which may be more costly and time-consuming.

Before the ICECAP-O can be used as a well-being measure in economic evaluations of care interventions, it needs to be validated in relevant populations. Validation is performed to assess the extent to which a measure evaluates what it sets out to represent. The most frequently used validity tests for health status measures in the literature are construct validity tests. These examine the extent to which the measure indeed captures the concept it intends to measure [14]. Construct validity consists of both convergent and discriminant validity. Convergent validity refers to the extent to which a measure correlates with related concepts [14], while discriminant validity refers to the extent to which relevant differences in (sub-) groups are adequately reflected by the measure [9].

So far, the ICECAP-O has been validated for various populations such as older people in England [12, 15,16,17,18], psycho-geriatric older people in nursing homes in the Netherlands [19], post-hospitalized older people in the Netherlands [20] and older people with dementia in Germany [5], mostly with favourable results. In these studies, sample sizes typically have been relatively small and taken from only one country. To date, to our knowledge, no study has validated the ICECAP-O as a well-being measure in informal carers.

In this study, we therefore add to the literature in a number of ways. This is the first study to validate the ICECAP-O in a sample of informal carers, using a rich dataset. We use data from a relatively large sample of carers obtained in eight European countries in the context of the Actifcare project [21], which aims to analyse the pathways to care for people with dementia and their families. This paper therefore considers the construct validity of the ICECAP-O in a sizeable international population of informal carers for people with dementia. Furthermore, validating the ICECAP-O in this kind of population allows those performing economic evaluations to consider the ICECAP-O when measuring carer well-being.

Methods

Data were collected in eight European countries: Germany, Ireland, Italy, the Netherlands, Norway, Portugal, Sweden and the UK. Care receivers adhering to the specified criteria ("Appendix 1") and their informal carers were invited to complete the questionnaires, available in seven different languages. For all measures, including the ICECAP-O, nationally validated versions were used, or, if not available, the measure was translated, back translated and pilot tested following a translation protocol [21,22,23]. The data collection consisted of different parts. People with dementia and carers were interviewed by trained interviewers about their socio-demographic characteristics and the comorbidities and health care resource use of the former. Carers completed questionnaires covering a variety of outcome measures and were interviewed about the caregiving situation, their resource use, and the person with dementia’s health. Finally, the interviewer completed questionnaires about the health, quality of life and care needs of the person with dementia [21].

Demographic characteristics included age, gender, nationality, ethnicity, marital status, level of education and state of employment. Before listing the health and well-being measures used in our analyses, it is worth describing the ICECAP-O in further detail. As mentioned above, the ICECAP-O is a general capability well-being measure, consisting of five dimensions: attachment, security, role, enjoyment and control. These dimensions are sometimes described in a little more detail as ‘love and friendship’, ‘thinking about the future’, ‘doing things that make you feel valued’, ‘enjoyment and pleasure’ and ‘independence’. The ICECAP-O has four answering levels per domain: no capability, a little capability, a lot of capability and full capability. Because each of the five domains is scored separately on one of four levels, there are 4^5 = 1024 different possible ‘capability states’. Attaching the designated utilities to the level chosen on each attribute yields a final ICECAP-O tariff score with a range between zero (no capability) and one (full capability) [15].
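The state count and the construction of a tariff score can be illustrated with a minimal sketch. The level values below are placeholders, equally spaced so that the score runs from 0 to 1; they are not the published value set [15], and the names `PLACEHOLDER_VALUES` and `tariff` are hypothetical. Adding up one value per dimension is likewise an assumption of the sketch.

```python
from itertools import product

DIMENSIONS = ("attachment", "security", "role", "enjoyment", "control")
LEVELS = (1, 2, 3, 4)  # 1 = no capability ... 4 = full capability

# Placeholder values, NOT the published tariff: each dimension contributes
# equally, from 0 at level 1 up to 0.2 at level 4.
PLACEHOLDER_VALUES = {
    dim: {level: 0.2 * (level - 1) / 3 for level in LEVELS}
    for dim in DIMENSIONS
}


def tariff(state):
    """Add up the value attached to the chosen level in each dimension."""
    return sum(PLACEHOLDER_VALUES[dim][state[dim]] for dim in DIMENSIONS)


# Every combination of four levels over five dimensions.
all_states = [
    dict(zip(DIMENSIONS, combo))
    for combo in product(LEVELS, repeat=len(DIMENSIONS))
]
```

With real data, the placeholder dictionary would be replaced by the value set actually used for scoring.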

Several health and well-being measures were used in our analysis to test the validity of the ICECAP-O. The first set of measures are those answered by the carer, about the carer and their environment. These are the CarerQol [9], EQ-5D-5L [24], Positive Affect Index (PAI) [25], Perseverance Time (PT) [26] and the Lubben Social Network Scale (LSNS) [27]. The second set are those answered by the carer and/or an interviewer as a proxy about the person with dementia. These are the Clinical Dementia Rating (CDR) [28], DemQoL-U [29, 30] (proxy-rated), Quality of Life in Alzheimer’s disease (QoL-AD) [31], Resource Utilization in Dementia (RUD) [32], and finally a subset of questions regarding unmet need from the Camberwell Assessment of Need for the Elderly (CANE) [33]. These measures are discussed and referenced in Table 1. Here it is important to note that ‘CANE Unmet Need’ refers to a measure taken from the CANE instrument, which in this case was collected by an interviewer talking with the carer and person with dementia. The measure is the number of times ‘unmet need’ is chosen out of the 24 questions asked. Summary statistics of all continuous variables used are shown in "Appendix 2".
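As a minimal sketch of the ‘CANE Unmet Need’ count described above: the response labels and the function name `cane_unmet_need` are hypothetical, and the sketch assumes all 24 items have been answered.

```python
def cane_unmet_need(responses):
    """Count the items rated 'unmet need' among the 24 CANE questions."""
    if len(responses) != 24:
        raise ValueError("expected 24 CANE item responses")
    return sum(1 for answer in responses if answer == "unmet need")
```

How missing items were handled in the study is not stated here, so the sketch simply rejects incomplete response lists.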

Table 1 Measures

Data analysis

To test whether the ICECAP-O is a valid measure of capability well-being, two main sections of analysis were performed: convergent validity and discriminant validity. A priori expected correlations and relationships between the ICECAP-O and other variables from the questionnaires, discussed below, were drawn from previous literature, if available. In the analyses, correlation strength levels were taken from Cohen’s Set Correlation and Contingency Tables [34]. Correlations are considered strong if the coefficient is above 0.5, moderate if the coefficient is between 0.3 and 0.5, and weak if the coefficient is below 0.3. A p value of 0.05 was taken to signify statistical significance.
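The thresholds above can be written down as a small helper. This is a sketch under two assumptions that the text does not spell out: the absolute value of the coefficient is classified, and the boundary values 0.3 and 0.5 count as moderate. The function names are hypothetical.

```python
def correlation_strength(rho):
    """Label a correlation coefficient as weak, moderate or strong."""
    size = abs(rho)  # assumption: direction is ignored when grading strength
    if size > 0.5:
        return "strong"
    if size >= 0.3:  # assumption: 0.3 and 0.5 themselves count as moderate
        return "moderate"
    return "weak"


def is_significant(p_value, alpha=0.05):
    """Assumption: a result is significant when p falls below alpha."""
    return p_value < alpha
```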

Convergent validity

To test convergent validity, Spearman correlation coefficients were calculated between the ICECAP-O (tariff scores and dimensions) and the EQ-5D-5L results (utility tariff, health problems index, and VAS) [35], the CarerQol-7D tariff scores and the CarerQol-VAS scores. It was anticipated that there would be a moderate positive correlation between the ICECAP-O scores and the EQ-5D-5L utility tariff scores and VAS scores of carers, a moderate negative correlation between the ICECAP-O scores and the EQ-5D-5L health problems index of carers, and a strong positive correlation between the ICECAP-O and the CarerQol scores.
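For readers unfamiliar with the statistic, a Spearman coefficient is the Pearson correlation of the ranks of two variables, with tied values sharing their average rank. The sketch below is a plain-Python illustration of that definition, not the Stata routine used in the study; all function names are hypothetical.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the block while the next sorted value is tied with this one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; undefined if either input has no variation."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation as the Pearson correlation of the ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Because only ranks enter the calculation, the coefficient is unaffected by the skewed distribution that tariff scores often show.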

Discriminant validity

For discriminant validity, sub-groups were defined based on characteristics that previously were shown to be related to informal carer outcomes. For measures that have no pre-defined cut-off points for high or low, in this case the EQ-5D-5L tariff and VAS scores, the cut-off points between sub-groups were primarily based on a face-valid classification into groups of relatively similar size. Education was split into three sub-groups: primary school only (low), up to high-school education (medium), and higher education (high).

Student’s t tests (for two sub-groups) or ANOVA (for more than two sub-groups) were performed to identify significant differences in ICECAP-O scores. Then, a multivariate regression model was estimated for the ICECAP-O tariff scores using all variables between whose sub-groups the ICECAP-O could discriminate at a p value of 0.1 or less, to gain insight into the magnitude and significance of the variables that were associated with the ICECAP-O scores. There are exceptions to this inclusion rule: the variables age, gender, education, relationship between the carer and person with dementia, and carer daily hours were included regardless of their bivariate p value. We include age, gender, education, and the type of relationship because these are basic demographic factors. It was pre-defined by the authors that carer daily hours would be included in the multivariate regression as it is a key variable in the caregiving context. A second model was estimated including country dummies, to account for country-level effects. In this regression, Germany was used as the reference country as it had the lowest mean ICECAP-O score among carers.
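The selection rule and the country dummies can be sketched as follows. The variable names and the p values used to exercise the code are invented for illustration only, and the helper names `select_regressors` and `country_dummies` are hypothetical.

```python
# Variables entered in the regression whatever their bivariate p value.
ALWAYS_INCLUDED = {"age", "gender", "education", "relationship", "carer_daily_hours"}


def select_regressors(bivariate_p_values, threshold=0.1, forced=ALWAYS_INCLUDED):
    """Keep variables with bivariate p <= threshold, plus the forced ones."""
    chosen = {name for name, p in bivariate_p_values.items() if p <= threshold}
    return sorted(chosen | set(forced))


def country_dummies(country, countries, reference="Germany"):
    """One 0/1 indicator per country, leaving out the reference country."""
    return {c: int(c == country) for c in countries if c != reference}
```

A carer from the reference country has all indicators equal to zero, so each country coefficient is read as a difference from Germany.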

Several hypotheses were generated regarding carer, care receiver and caregiving context variables and their relationship with the ICECAP-O. It is important to note that this literature did not necessarily refer to informal caregivers, or carers of people with dementia. Regarding carer variables, employed carers were expected to have a significantly higher ICECAP-O score than those unemployed [36], carers with higher health status (i.e. a higher EQ-5D-5L score) were expected to have significantly higher ICECAP-O scores than those with lower health status [37], and carers with a higher PAI score were expected to have a significantly higher ICECAP-O score than those with a lower PAI score [38]. Furthermore, there was insufficient evidence to form a hypothesis on the effect of carer age on the ICECAP-O [20, 39]. There was no expectation for the ICECAP-O to score differently for different levels of carer education [20]. Regarding care receiver variables, carers for a person with dementia with a higher health status (i.e. a higher EQ-5D-5L, DemQoL-U and QoL-AD score, or a lower CDR score) were expected to have a significantly higher ICECAP-O score than carers for persons with lower health status. Finally, regarding caregiving context variables, carers with a low care burden (i.e. fewer daily care hours, lower CANE unmet needs in the person with dementia, higher PT and/or higher RUD scores) were expected to have a significantly higher ICECAP-O score than those with a higher care burden [40], and carers with a higher LSNS score were expected to have a significantly higher ICECAP-O score than those with a lower LSNS score [41].

All tariff scores (for ICECAP-O, EQ-5D-5L and CarerQol-7D) were calculated using UK value sets because of both their availability and the need for consistency. The proxy ratings of the informal carers were used for the EQ-5D-5L, QoL-AD and DemQoL-U of people with dementia. All analyses were performed in Stata 14.

Results

Study sample

A total of 451 informal carers and home-dwelling people with mild to moderate dementia completed the questionnaires and were included in the analysis. The people with dementia were selected for this study based on their probability of needing formal care within 1 year. Table 2 presents sample characteristics of informal carers, the people with dementia (or care receivers) and the caregiving situation. The mean age of informal carers was 66.4 years old. Most of the carers were female and 28% of the carers were employed. The mean age of the care receivers was 77.7 years and approximately half of them were female.

Table 2 Sample characteristics and bivariate results

Figure 1 shows the scores of informal carers on the different dimensions of the ICECAP-O. The mean ICECAP-O tariff score of the informal carers was 0.78, with standard deviation 0.16. The minimum tariff score in the sample was 0 while the maximum score was 1. The mean ICECAP tariff scores varied per country, as displayed in Table 3.

Fig. 1 ICECAP-O response of informal carers

Table 3 ICECAP-O values per country, ranked by mean tariff*

Convergent validity

The Spearman’s correlation coefficients are presented in Table 4. There was a moderate positive correlation between the ICECAP-O tariff scores and the EQ-5D-5L utility tariff and EQ-VAS scores, a moderate negative correlation with the EQ-5D-5L health problems index, and a strong positive correlation with the CarerQol tariff and CarerQol-VAS scores.

Table 4 Spearman correlations

Looking at the dimensions of the ICECAP-O in Table 4, it is clear that the other measures hold the strongest correlations with the Security, Role and Enjoyment dimensions of the ICECAP-O. Country-specific correlations are provided in "Appendix 3". Overall country-specific correlations matched those of the aggregate results, with Sweden being somewhat of an exception. In the correlation results for Sweden, the EQ-5D-5L utility tariff score and health problems index were uncorrelated with the ICECAP-O.

Discriminant validity

Bivariate results regarding discriminant validity are shown in Table 2. The ICECAP-O significantly discriminated between old and young informal carers, between those who were employed and unemployed, between carers with low and high PAI, between carers who were and were not in danger of social isolation (LSNS) and carers who felt they could and could not continue caregiving for 2 years or more (PT). The ICECAP-O mean scores all differed in the expected direction. The ICECAP-O did not discriminate between carers who had daily care hours of less than 4 h or 4 h and over.

The ICECAP-O discriminated between carers of people who were 80 years of age or over, or below 80 years of age, between carers of those who received some home care services versus those who received no home care services, carers of people with dementia with high and low numbers of unmet needs (CANE), and carers for people with dementia who had or had not spent time in hospital in the past month (RUD). A significant difference in ICECAP-O scores between carers of care receivers with high, medium and low levels of both the EQ-5D-5L tariff score and the EQ-5D-5L health problems index was observed. The ICECAP-O mean score was lower for carers of those with a lower QoL-AD or a higher CDR, and for carers of those with a lower DemQoL-U score.

Multivariate analysis

The multivariate regression results are shown in Table 5. Due to missing data, only 389 observations were included in this analysis.

Table 5 Multivariate regression coefficients, confidence intervals and P values

Several results can be derived from the multivariate regression. The age of the person with dementia, the relationship with the person with dementia, CDR, social isolation (LSNS) of the carer, the Positive Affect Index of the carer, and Perseverance Time all had a significant relationship with the ICECAP-O tariff score. Age of the person with dementia had a non-linear relationship with the carer ICECAP-O score, suggesting that when people with dementia reach roughly age 79, carer ICECAP-O scores stop increasing and start decreasing. Spouses or partners who care for the person with dementia had significantly worse well-being than carers with other relationships with the recipient of care. A higher EQ-5D-5L health problems index had a significant negative relation with carer well-being, as did a higher CDR score for the person with dementia. Higher LSNS, PT and PAI scores each had a significant positive relation with carer well-being. In summary, a lower severity of dementia and fewer health problems in the person with dementia, a better relationship with the person with dementia, and more perseverance time and less loneliness of the carer were associated with better well-being in the carer. The regression analysis in which countries were included shows that Norway has higher levels of carer well-being than the other countries in the sample.
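A turning point such as the one at roughly age 79 is what a regression with a linear and a squared age term produces. Assuming that specification (the text does not state it explicitly), the predicted score b1·age + b2·age² stops rising where its derivative b1 + 2·b2·age equals zero, that is at age = −b1 / (2·b2). The sketch below uses made-up coefficients chosen only to show the arithmetic; they are not the estimates from Table 5, and `turning_point` is a hypothetical name.

```python
def turning_point(b_age, b_age_squared):
    """Age at which b_age*age + b_age_squared*age**2 changes direction."""
    if b_age_squared == 0:
        raise ValueError("no turning point without a squared term")
    return -b_age / (2 * b_age_squared)


# Made-up coefficients for illustration: a positive linear term and a
# negative squared term give a rise followed by a decline.
example_age = turning_point(0.04, -0.00025)
```

A positive linear coefficient combined with a negative squared coefficient gives the rise-then-decline pattern described above.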

Discussion

The aim of this paper was to determine the validity of the ICECAP-O in a relatively large, eight-country population sample of informal carers for people living with dementia. Validation was performed using convergent and discriminant validity tests, followed by multivariate analysis. As hypothesized, there were significant moderate-to-strong correlations in the expected directions between the ICECAP-O scores and carers’ EQ-5D-5L utility tariff score and health problems index, EQ-VAS scores, as well as the CarerQol-7D and the CarerQol-VAS scores. The multivariate regressions showed that the age of the person with dementia, the EQ-5D-5L health problems index of the person with dementia, carer–patient relationship, care recipient CDR, carer LSNS score, the carer PAI score, and Perseverance Time all had a significant relation with the carer ICECAP-O score. The fact that age of the person with dementia had a non-linear relationship with the carer’s ICECAP-O score may be explained by older people with dementia having more health and behavioural problems that were not captured in the multivariate regression. That the carer’s ICECAP-O score increases with the age of the person with dementia until age 79 is still a somewhat surprising result, perhaps explained by younger people wanting to take part in more activities or work than their older counterparts. Also somewhat surprisingly, the ICECAP-O did not have a significant relationship with the number of daily care hours, even though it was assumed these would have an impact on carer well-being. This may be due to the selection of the sample, as only people with mild to moderate dementia were included. The results also showed that living in certain countries may be of importance for the carer ICECAP-O scores.

Even though we presented the first validation of the ICECAP-O instrument in a sample of informal carers, our results were quite comparable to results from previous ICECAP-O validation studies [5, 12, 16, 19, 20]. While many addressed the specific dimensions of the ICECAP-O rather than the ICECAP-O tariff scores, several of the results found in our study were similar to those of previous studies. Almost all studies found that the ICECAP-O could discriminate effectively between groups of different ages [5, 12, 16, 20]. Additionally, all studies found moderate-to-strong convergent validity between the ICECAP-O and health (quite frequently using the EQ-5D as the measure), although not necessarily for every dimension of both measures. Makai et al. [20] also found that the ICECAP-O could discriminate between older people who had more or fewer opportunities for social interaction, which is in line with the significance of the LSNS score in our analysis. Most previous validation studies conclude that the ICECAP-O may be a promising patient outcome measure in economic evaluations, although it may not completely cover physical health [18]. Based on the results of the above analysis, this paper comes to the same conclusion for the validity of the ICECAP-O in carers (of persons with dementia). An interesting finding is that in Sweden the ICECAP-O was not correlated with the EQ-5D-5L utility tariff score and health problems index. One reason for this may be that in our sample, the lowest EQ-5D-5L utility tariff score in Sweden is approximately 0.37, which is far higher than the lowest score from the full sample (−0.1).

The main strength of this study is that it is the first to validate the ICECAP-O in carers, in a sample large in both size and country variety. While several validation studies of the ICECAP-O have been executed, they all used relatively small sample sizes and only focused on one country. The eight-country nature of this sample allowed a more comprehensive overview of the ICECAP-O’s validity in carers in Europe. Another strength is that extensive data were provided on both carers and care receivers. None of the previous studies looked at convergent validity between the ICECAP-O and the CarerQol, CANE, RUD, PAI, QoL-AD and Perseverance Time.

Some limitations of our study need to be mentioned as well. First is the lack of variation within the sample. Large percentages of both carers and care receivers were relatively healthy and most carers seemed to experience a relatively low care burden, which may be the result of selection bias, as carers who experience a high care burden may be less likely to participate in the study. Therefore, a detailed analysis of validity of the ICECAP-O in those carers who are less healthy or feel higher care burden is not possible here. Second, carer proxy scores were used for some of the outcome measures for people with dementia (i.e. EQ-5D-5L, QoL-AD and DemQoL-U). The correlation of ICECAP-O scores with the harder to observe variables for persons with dementia (such as the DemQoL-U items) may be less reliable than those with more easily observed variables (such as the EQ-5D-5L items). In addition, due to the neurodegenerative nature of dementia and the stress experienced by carers, proxies may give more negative answers regarding care receivers’ health and well-being [42]. While this most likely does not affect our regression results, as from the measures of health of persons with dementia only CDR was used, it is worth bearing in mind for future studies. Another limitation is the use of UK value sets for both the ICECAP-O and EQ-5D-5L-related measures. This was done for consistency, as value sets were not available for all countries in the sample; however, it may be partially responsible for Sweden-specific EQ-5D-5L results being uncorrelated with the ICECAP-O. Finally, we used the ICECAP-O in the complete sample of carers. However, as can be derived from Table 2, nearly half of carers were younger than 65. The ICECAP-O (Older) was designed to capture the capability well-being of people aged 65 and over. For people aged under 65, the ICECAP-A (Adults) [6], which covers the five capability well-being dimensions attachment, stability, achievement, enjoyment and autonomy, would in principle be more suitable. This was not feasible in the current study. It is unclear how accurately the ICECAP-O measures the well-being of people aged under 65.

It has been shown in previous validation studies that the ICECAP-O is a worthy contender as a patient outcome measure in economic evaluations regarding care of older people, due to its broad, well-being-focused nature. In this study, the ICECAP-O has shown good convergent and discriminant validity as a well-being outcome measure in carers of people with dementia. These findings suggest that the ICECAP-O is potentially a relevant and useful measure for economic evaluation in samples of elderly informal carers, especially when considering interventions that have impacts ‘beyond health’. If it is used in both carers and care receivers, this allows comparisons of outcomes across interventions and aggregation of outcomes within interventions. Before being able to recommend this, a number of important issues need to be resolved. First, the ICECAP-O needs to be further validated as an outcome measure among people with dementia and their carers. This would include linguistic validation of translations of the ICECAP-O, currently being analysed in Germany and Portugal as part of the Actifcare project [21]. It would be beneficial to conduct linguistic validations in other countries where psychometric validations have been conducted [48]. Second, future studies need to confirm our results and expand on them, to increase the evidence of the validity of the ICECAP-O. Third, a choice needs to be made whether the use of the generic ICECAP-O (which is aimed at older people) is to be preferred over the use of more carer-specific well-being measures, such as the CarerQol. While the results of the latter may be less easily aggregated with, for example, ICECAP outcomes in patients, they may provide more precise estimates of care-related quality of life and more detailed information. Finally, while the CarerQol is aimed at carers (regardless of age), the ICECAP measures would need to be tailored to age groups of carers, which raises questions of aggregation and comparison of ICECAP-A and ICECAP-O scores.

Further research on the ICECAP-O in samples of informal carers for people with different chronic illnesses would also be useful. It would allow investigation into whether the ICECAP-O is also a valid measure and shows similar relationships to other outcomes in the context of diverse chronic illnesses. If the ICECAP-O is to be used as a well-being measure in economic evaluations, it would also be of interest to conduct further research into its sensitivity to change and its minimal clinically important difference.

The ICECAP-O is a capability well-being measure that has been proven to be of use for economic evaluations of care of older people. This study adds that the ICECAP-O may be useful in economic evaluations of interventions considering elderly informal carers, where a broader measure of well-being is more relevant than a narrower health-related quality of life measure such as the EQ-5D-5L.