Figures
Abstract
The Consolidated Framework for Implementation Research (CFIR) is a well-established framework for systematically identifying key factors influencing the implementation of programs. To enhance the classification of existing CFIR definitions, as well as its +2/-2 scoring system, this study incorporated the views of relevant experts to: i) improve how themes are scored; and ii) utilise more information regarding the frequency with which themes are identified. This structured, frequency-based approach to the CFIR’s scoring process has been trialled as the CFIR-frequency (CFIR-f). Researchers thematically analysed semi-structured interview data from four groups of policy and program experts (N = 24) delivering two family-based therapies in New South Wales (NSW), Australia. Themes identified by less than 50% were excluded from further analysis. Themes identified by 50% or more of expert participants in the four groups were classified as enablers or barriers using clearly defined criteria. Each theme was allocated a score according to how many experts identified it as an enabler or barrier, and then mapped back onto the latest adaptation of the CFIR comprising 67 constructs. The CFIR-f successfully determined three enablers of, and six barriers to, implementation. Enablers included the family-based therapy programs, therapist training and participant monitoring systems. Barriers included referral, data collection and staffing difficulties, NSW adaptation issues and the suitability and safety of the programs for Aboriginal families. The same enablers and barriers were identified using both the CFIR-f and the original scoring approach, and the identified themes were successfully mapped to almost all CFIR constructs (65/67). This paper proposes a more frequency-based approach to CFIR’s scoring process (the CFIR-f). By specifically utilising the frequency with which these barriers and enablers are identified, the CFIR-f engenders a list of ranked themes that service providers and policymakers can use to inform their decisions about program modification and implementation.
Citation: Economidis G, Eades A-M, Shakeshaft A, Farnbach S (2023) Improving the pragmatic usefulness of the scoring matrix for the Consolidated Framework for Implementation Research (CFIR). A proposal for a more frequency-based approach: The CFIR-f. PLoS ONE 18(11): e0295204. https://doi.org/10.1371/journal.pone.0295204
Editor: Dorothy Lall, CMC Vellore, IPH India, INDIA
Received: July 6, 2023; Accepted: November 15, 2023; Published: November 30, 2023
Copyright: © 2023 Economidis et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the manuscript and its Supporting Information files.
Funding: GE was supported by an Australian Government Research Training Program (RTP) Scholarship via the University of New South Wales (UNSW), Sydney, Australia, and a Higher Degree Research scholarship from the National Drug and Alcohol Research Centre (NDARC), UNSW, Sydney. The evaluation of the MST-CAN program in New South Wales, Australia (2018-2020) was funded via a competitive tender process (FACS.17.266) by the NSW Government Department of Family and Community Services (now Department of Communities and Justice). Their Futures Matter and Department of Communities and Justice staff provided feedback on the evaluation plan, participated in interviews as key stakeholders in this evaluation, and reviewed the manuscript.
Competing interests: All authors have approved the manuscript for submission. One co-author, Anthony Shakeshaft, was the lead investigator on a competitive tender (FACS.17.266) awarded by the NSW Government Department of Family and Community Services (now Department of Communities and Justice) to evaluate the Functional Family Therapy – Child Welfare [FFT-CW®] and Multisystemic Therapy for Child Abuse and Neglect [MST-CAN®] programs in New South Wales, Australia (2018-2020). The lead author, George Economidis, was employed part-time at the National Drug and Alcohol Research Centre as a Project Coordinator for the FFT-CW® and MST-CAN program evaluation from May 2018 to August 2020. This does not alter our adherence to PLOS ONE policies on sharing data and materials.
Introduction
Programs that simultaneously respond to harms that emanate from the individual, community and societal level are increasingly being utilised, both to optimise their effectiveness and improve the sustainability with which they are integrated into real-world service delivery [1, 2]. The increased complexity of the methods required to evaluate these multi-level, complex programs means that the cost of designing and implementing evaluations has also increased [3]. Identifying the enablers of, and barriers to, the delivery of multi-level programs at an early stage of their implementation is especially important because it is a particularly volatile stage in the program delivery process [3, 4]. Moreover, identifying the factors that influence the early implementation of a program creates the opportunity for the enablers to be leveraged and the barriers to be modified, to optimally tailor the program to the context in which it is being delivered before the processes that support its implementation become entrenched [5]. Taxonomic frameworks have been developed to standardise the systematic identification of enablers of, and barriers to, implementation [6, 7]. One such framework is the Consolidated Framework for Implementation Research (CFIR) [6].
The CFIR is a conceptual, systematic and meta-theoretical approach that is used to assess the uptake of a program against key factors that have been shown to influence the implementation of complex programs [8–10]. The CFIR has been used to assess the implementation of health-related services, such as weight management [7], hospital care [11], chronic disease [12] and pharmacy services [13], and the implementation of social services, such as the introduction of intensive family-based therapy programs in the Australian state of Victoria [4].
The CFIR is described in detail elsewhere [6]. In brief, evaluators have used the CFIR to measure the impact of a wide range of variables on the implementation of a program by using data generated by both qualitative and quantitative research methods. From a qualitative perspective, evaluators have used the CFIR as a framework, or ‘checklist’, for coding information collected using standardised qualitative techniques, such as focus group discussions and semi-structured interviews [10]. From a quantitative and mixed-methods perspective, evaluators have applied ratings, or rankings, to the CFIR’s constructs based on the results of surveys or questionnaires [7, 14]. It is worth noting that most studies have only used the CFIR to inform their data analysis rather than their data collection [10].
The latest adaptation of the CFIR (CFIR 2.0) comprises 67 constructs that are organised into five domains: 1) innovation (e.g., the distinct features and components of the program); 2) outer setting (the external features of the program); 3) inner setting (the internal features of the program); 4) individuals (e.g., the personal traits, attributes and roles of staff responsible for program delivery); and 5) implementation (the change strategies adopted by the organisation) [15]. Evaluators then frequently apply a scoring system, whereby each theme is allocated a score from +2 to -2: themes rated +1 or +2 are classified as enablers; themes rated -1 or -2 are classified as barriers; and themes rated as 0 are classified as being neither an enabler nor a barrier to implementation [7, 16]. The more extreme scores (i.e., +2 and -2) are interpreted as more influential barriers and enablers, while +1 and -1 are interpreted as being generally influential. These categorised themes can then be mapped back onto the five CFIR domains.
Although this process is clearly defined and useful overall, its pragmatic value may be improved both in terms of how themes are scored and the frequency with which themes are identified. In relation to how themes are scored, for example, a theme would be allocated a score of -1 (and consequently interpreted as a barrier with a generally negative influence) if it was only discussed “in passing” or at a “high level without examples” [7]. A potential weakness in this process is that the definitions of these two phrases “in passing” and “high level” are potentially too vague: they could be interpreted differently by various independent raters (a challenge to inter-rater reliability), or interpreted differently by the same independent rater in various contexts or at different points in time (a challenge to intra-rater reliability). These challenges to inter- and intra-rater reliability could create artificial variation in the results of a CFIR evaluation.
In relation to assessing the frequency with which themes are identified, a theme would be allocated a score of -2 or +2 (and consequently interpreted as a barrier or an enabler) if it meets the threshold of being “…discussed by at least two interviewees using explicit examples…” [7]. A potential weakness in this process is that a theme discussed by two interviewees is treated in the same way as a theme discussed by five or 10 interviewees. This process may subsequently miss the opportunity to identify themes according to the frequency with which they are raised by expert participants, rather than the current dichotomous categorisation with a cut-off of two interviewees. The pragmatic value in improving how themes are scored is that service providers and funders would have greater confidence in the reliability with which themes are classified as barriers or enablers. The pragmatic value in utilising more information regarding the frequency with which experts identify themes is that service providers and funders would obtain more actionable information to inform their decisions about how to improve programs, or discontinue their investment in them, because implementation strategies are based specifically on the number of participants who identified themes as enablers or barriers to implementation.
Consequently, the aim of this study is to utilise the opportunity of two family-based therapy programs being integrated into the routine delivery of child protection services in New South Wales (NSW), Australia, to propose a more frequency-based approach to the CFIR’s classification and scoring process: the CFIR-frequency (CFIR-f).
Methods
Family-based therapy services in NSW
The CFIR framework was used in a mixed-methods process evaluation of the implementation of Functional Family Therapy (FFT-CW®) and Multisystemic Therapy for Child Abuse and Neglect (MST-CAN) into the routine delivery of child protection services in NSW, Australia [17]. These government-funded programs use a therapeutic, strengths-based and whole-of-family approach aimed at reducing the likelihood that children who are at high risk of maltreatment will be placed into out-of-home care. The CFIR framework was used to assess the fidelity and success with which FFT-CW® and MST-CAN were being implemented into the routine delivery of child protection services in NSW.
Participants in the process evaluation
Twenty-four policy and program experts (defined as those actively engaged in the adaptation, planning, implementation and/or monitoring of either MST-CAN or FFT-CW®) were purposively recruited in August 2018 and allocated into one of four groups based on their role: i) model purveyors (five clinicians and/or program experts who own the intellectual property and licensing rights of both models); ii) intermediaries (seven domestic or international implementation support organisations who provided adherence and fidelity support to service providers); iii) representatives of the NSW Government’s Their Futures Matter team responsible for the implementation of MST-CAN and FFT-CW® in NSW (six staffers who oversaw the implementation and contractual arrangements for service providers to deliver both programs in NSW); and iv) staff of the NSW Government’s Department of Communities and Justice (DCJ) (six staffers who commissioned and referred high-risk and eligible families to the programs). Considering both MST-CAN and FFT-CW® were implemented across NSW at the same time and for largely the same purpose, the CFIR was used to evaluate the implementation of both programs simultaneously, irrespective of the program in which participants were involved, noting that the program-specific enablers and barriers to the implementation of both programs in NSW have been reported elsewhere [18, 19]. Details about the roles of the policy and program experts, and the broader organisational network involved in the implementation of MST-CAN and FFT-CW® are provided elsewhere [17], as are details about the specific qualitative methods used to evaluate the uptake process for MST-CAN [18] and FFT-CW® [19].
Identifying themes
Twenty-four in-depth, semi-structured interviews were undertaken between August and September 2018 (see [18, 19] for further information regarding the sampling methodology utilised, and for the interview questions asked of policy and program experts). Responses were deductively and inductively thematically analysed using a reflexive approach [20–22], which identified the 16 themes summarised in Table 2 (further details on how this thematic analysis was undertaken are provided elsewhere [18, 19]). Authors did not have access to information that could identify individual participants during or after data collection.
Procedures for this analysis
A modified CFIR analysis process (the CFIR-f) is proposed, comprising four steps.
First, as with current practice using the CFIR, issues related to the implementation of a program are identified using an established qualitative method (e.g., semi-structured interviews), consisting of questions that align with the CFIR’s 67 constructs. These issues are then transposed into themes, using standard thematic analysis.
Second, themes were excluded as being too infrequently identified if they were mentioned by less than 50% of participants. Introducing a cut-off of 50% aimed to concentrate the focus of the analysis on the themes that emerged from the issues identified by the majority of group participants, rather than including all themes. The aim of this step is to identify the barriers and enablers with the most expert consensus so that service providers and policymakers can have more confidence in their decisions about program modification and implementation.
Third, for the themes identified by at least 50% of participants in each group, the number of participants who identified it as a barrier or an enabler was recorded by the researchers, and that number was allocated to each barrier (up to -5 for model purveyors because there were five participants in this group, -7 for intermediaries, -6 for Their Futures Matter staff and -6 for DCJ staff) and enabler (up to +5 for model purveyors, +7 for intermediaries, +6 for Their Futures Matter staff and +6 for DCJ staff). In addition, a theme was classified as neither an enabler nor barrier to implementation (termed ‘neutral’ and scored 0) if at least one of two criteria were met: 1) the majority of group participants discussed the theme in neutral terms (e.g., being ‘indifferent’ or ‘unclear’ about the impact of the theme) or specifically indicated that the theme neither enhanced nor inhibited program implementation; or 2) an equal number of group participants discussed the theme as an enabler and barrier to program implementation in NSW (or, for groups with odd numbers of participants, if the number of group participants who discussed a theme as both an enabler and barrier differed by one person). These two criteria for neutrality are strongly aligned with the original CFIR criteria for neutrality (that there is no evidence of positive or negative impact on implementation, or if a theme is discussed in both positive and negative terms) but differ by specifying the objective threshold of a majority of group participants (criterion 1) or an equal number of group participants (criterion 2).
Fourth, the score for each group was summed to give a total score for each theme, resulting in a ranked list of themes for relevant stakeholders to action. The logic is to provide services, funders and policymakers with a list of barriers and enablers determined by the frequency with which they are mentioned by participants across each of the four expert groups.
To illustrate the application of these four steps in the current study, the emergent theme outcome measures and data collection (theme 2 in Table 2) was identified by 14 interviewees across the four FFT-CW® and MST-CAN participant groups. The theme was scored as being of low relevance only in the model purveyor group (less than 50% of model purveyors identified this theme). Fifty percent or more of participants in both the intermediary and Their Futures Matter groups identified this theme and specified it as barrier, largely due to the administrative and logistical difficulties associated with obtaining and storing data. Considering four expert participants in the intermediary group and four expert participants in the Their Futures Matter group discussed this theme as a barrier, each group was allocated a score of -4 for this theme. Although more than 50% of participants in the DCJ group also identified this theme, it was rated as neutral (and scored 0) because equal numbers of group participants rated it as a potential enabler (e.g., noting the benefits of a meticulous and centralised database that is immediate and easily accessible) and a potential barrier (e.g., highlighting similar barriers to implementation discussed by intermediaries). As a result, the overall score for the theme, using the proposed modified for the CFIR-f, is -8.
The differences between the original CFIR approach and the CFIR-f are summarised in Table 1.
Mapping themes onto the 67-construct CFIR
To identify the extent to which the themes identified by the CFIR-f were able to be mapped to the CFIR constructs, themes were independently mapped onto the 67 CFIR constructs by two researchers: authors GE and AME. The mapped constructs were compared and discussed until consensus was reached. In addition, the themes identified by the CFIR and the themes identified by the CFIR-f were compared to identify similarities and differences in the enablers and barriers to the implementation of FFT-CW® and MST-CAN generated by these two different scoring methods (see Table 3).
Results
Identifying themes using the proposed adapted scoring method
Table 2 shows the 16 themes identified, ranked by both their CFIR score (ranging from -6 to +4) and their CFIR-f score (ranging from -11 to +7), and the number of participants in each of the four groups who identified each theme. Using the CFIR-f scoring method, 13 of the 16 emergent themes were identified by 50% or more of Their Futures Matter staff and could thus be scored, 11 were identified by 50% or more of the intermediaries, six were identified by 50% or more of the model purveyors, and five were identified by 50% or more of DCJ staff. This resulted in the identification of three enablers, six barriers and seven neutral themes, as depicted in Table 2.
Of these six barriers, technical, logistical and referral challenges (scored as -11) was identified by all experts, (except DCJ staff). Workforce, staffing, resourcing and funding issues (scored as -7) was identified as a barrier by model purveyors and intermediaries, and as neither an enabler nor barrier by Their Futures Matter staff, and was not identified at all by DCJ staff. Outcome measures and data collection issues (scored as -8) was highlighted as a barrier by intermediaries and Their Futures Matter staff, whereas the eligibility criteria of the programs (scored as -3) was identified as a barrier by only Their Futures Matter staff. The adaptation challenges to the NSW context (scored as -4), as well as the evidence base and appropriateness of the programs for Aboriginal families (scored -5), were perceived as barriers only by intermediary organisations.
The three enablers included: the nature and structure of the FFT-CW® program (scored as +7), model purveyors’ training and support of therapists (scored as +4), and the ability to monitor a family’s progress in the programs in real-time (scored as +3). Of these, one was rated as an enabler by Their Futures Matter staff and intermediaries (the nature and structure of the FFT-CW® program (scored as +7)), one was rated as an enabler by Their Futures Matter staff only (the ability to monitor a family’s progress (scored as +3)), and one was rated as an enabler by model purveyors only (role of model purveyors’ training and support of therapists (scored as +4)). A full list of the key ideas underpinning each of the themes in Table 2 used to generate them as either enablers, barriers or neither, as well as supporting interviewee quotes, is detailed in S1 Table.
Mapping themes onto the 67-construct CFIR framework
Table 3 shows that the themes generated by policy and program experts using the CFIR-f, as summarised in Table 2, were mapped to 65 of the 67 constructs encompassed in CFIR 2.0. There was 100% agreement between raters regarding which themes mapped to which of the 65 CFIR constructs. The two CFIR constructs to which no themes generated by the CFIR-f were able to be mapped were: 1) Innovation Trialability (i.e., piloting the program by delivering it on a much smaller scale prior to its wider implementation); and 2) Critical Incidents (i.e., significant delays in program implementation due to unforeseen, wide-scale interruptions) [15]. Definitions of each these CFIR constructs, and explanations for how FFT-CW® and MST-CAN align with each, is articulated in S2 Table.
In addition to summarising how the themes identified by the CFIR-f mapped back to the 67 CFIR 2.0 constructs, Table 3 shows three key features that highlight the potential of the CFIR-f scoring system proposed in this paper. First, it clearly shows that both the CFIR and CFIR-f scoring systems identify the same enabler themes (n = 3), the same barrier themes (n = 6) and the same neutral themes (n = 7). This replication of themes as being enablers, barriers or neutral provides reassurance of the concurrent validity of the CFIR-f scoring system: using it resulted in the themes being classified in exactly the same way as the original CFIR scoring, which can be regarded as the current gold standard CFIR scoring method [23].
Second, it highlights the practical value of the additional information that can be derived from the CFIR-f. Specifically, Table 3 shows that the original CFIR scores are only capable of generating an ordinal list of themes because the original scoring method only applies scores of -2 and +2 (where the theme was raised by two or more interviewees using explicit examples). For example, using the original scoring method, the theme workforce, staffing, resourcing and funding issues (theme 3 in Tables 2 & 3) was discussed with specific examples by at least two participants in the model purveyors group and at least two participants in the intermediaries group, meaning it scored -2 for each of these groups separately. This theme also scored a 0 for the Their Futures Matter and the DCJ groups because it was identified in neutral terms by participants in those groups. Consequently, this theme received a total CFIR score of -4. When applied to all themes, Table 3 shows this original CFIR scoring mechanism generates an ordinal list of themes that can only score 0 or +/- 2, 4, 6 and 8.
In contrast, the proposed CFIR-f scoring system generates a continuous range of scores for each theme (bounded by the number of participants). These scores can be interpreted as a list of ranked themes based on the frequency with which they were identified; it is reasonable to suggest themes with the higher scores were identified by the greatest number of expert participants as being influential. In turn, this provides service providers and funders with an indication about which issues most require modification to improve the implementation of a program, and they can allocate their resources accordingly. Table 3, for example, shows that of the barriers identified, theme 1 (technical, logistical and referral challenges) is easily the issue that is identified by the greatest number of experts as a barrier (scoring -11) and could therefore be addressed as an issue requiring immediate modification to streamline program implementation.
Third, in addition to generating a frequency ranked list of barriers and enablers, the data presented in Table 2 can also help service providers and policymakers determine the experts who are likely to be most effectively engaged in addressing barriers. Table 2, for example, shows that the intermediaries group identified five of the six barriers, while the DCJ group identified none, which suggests that intermediaries are most likely to be usefully engaged in co-designing solutions to the identified barriers.
Discussion
Summary of findings
Using the modified scoring system proposed by the CFIR-f, three enablers and six barriers to implementing FFT-CW® and MST-CAN were identified. These themes were exactly the same as the themes identified using the original CFIR scoring system, which provides some evidence of concurrent validity for the CFIR-f given the original CFIR scoring mechanism can be regarded as the current gold standard measure. In contrast to the original CFIR scoring method, the CFIR-f allocated a continuous score to each theme. This generated a list of themes, ranked by the frequency with which each theme was identified by a range of experts involved in the delivery of the program. Synthesising the enablers and barriers in this way provides policymakers with an objective method for prioritising their decisions about how the implementation of a program might be most effectively and efficiently improved. In addition, the CFIR-f was able to identify the specific group of experts who are most likely to be usefully engaged in co-designing solutions to the identified barriers. Finally, the themes were able to be mapped to 65 of the 67 constructs from the latest adaptation of the CFIR (CFIR 2.0) [15].
Comparisons to previous literature
The present study aims to enhance both the reliability with which themes are classified as enablers or barriers and make greater use of the implied consensus among expert participants, by allocating continuous scores to the themes that they identify. In terms of reliability, previous CFIR research relied on potentially subjective phrases, such as “in passing” or at a “high-level”, to allocate a score to each of the identified enablers and barriers [7, 24], whereas this study allocates a score to each identified enabler and barrier based on the number of expert participants who identified it, which obviates the reliance of the potentially subjective phrases. In terms of frequency, previous CFIR research allocated a score from +2 to -2 for all themes, whereas this study only considered themes that were identified by at least 50% of expert participants as an enabler or barrier (to focus on the most frequently identified themes), and allocated a score based on the number of expert participants who identified each enabler and barrier to generate ranked themes based on a continuous, rather than ordinal, scale. This proposed frequency-based scoring approach meant that barriers, in particular, could be ranked and subsequently actioned based on precisely the number of participants who viewed them as inhibiting the implementation of a program.
To date, there has only been one published study employing CFIR in the child maltreatment field [4]. That study focused on the implementation of one of the programs examined in the current study (FFT-CW®) in another Australian state (Victoria). The authors of that study followed the original CFIR approach and developed interview questions in accordance with the five CFIR domains, whereas the questions used in the current study were instead developed for the purposes of the broader research evaluation of the FFT-CW® and MST-CAN programs in NSW (see S1 Appendix). Despite the differences in how the CFIR was utilised between that previous study [4] and this one, many similar implementation barriers were identified. These encompassed therapists feeling de-skilled (indicated here in the barrier, “workforce, staffing and resourcing issues”), lack of organisational readiness (“technical, logistical and referral challenges”) and tensions between program and staff (“relationship with service providers and other stakeholders”). These similarities suggest that both approaches are appropriate for identifying implementation barriers.
Methodological considerations
Greater utilisation of the number of expert participants who identified barriers to the implementation of FFT-CW® and MST-CAN allows service providers and policymakers to objectively address them using the more frequency-based approach proposed in this study. For example, addressing technical, logistical and referral challenges (allocated a score of -11) prior to the clarification and amendment of the eligibility criteria of the programs (allocated a score of -3) is likely to achieve greater improvements because there is more expert agreement about its negative influence on program implementation. The proposed CFIR-f would likely improve the reliability with which enablers and barriers to implementation are identified because it has less vulnerability to inter- and intra-rater errors related to inconsistencies in how phrases such as “in passing’ and “high-level without examples” are interpreted by different evaluators or the same evaluators in different settings. In addition, the CFIR-f’s greater emphasis on the frequency with which all experts identify themes would protect against the possibility of researchers unintentionally giving undue influence to certain themes in which they have tacit interest or influence [6, 12].
The accuracy with which enablers and barriers were identified by the CFIR-f also seems reasonable: the same enablers and barriers were identified by both the CFIR-f and the original CFIR scoring methods, and the order to which they were allocated was similar. Among the barriers, for example, the technical, logistical and referral challenges was ranked as the highest barrier using both the original scoring system and the CFIR-f (i.e., -6 and -11, respectively).
Furthermore, the relative advantage of the CFIR-f is that it generates a frequency-based list of enablers and barriers for policymakers to action based on continuous (not ordinal) scores from the perspectives of key stakeholders who have been actively engaged in the delivery of a program. In this study, for example, it is reasonable to conclude that technical, logistical and referral challenges [score -11] were identified about three times more frequently as a barrier to implementation than the need for clarification and amendment of the eligibility criteria of the programs ([score -3]). Using the original CFIR framework, both these themes would be allocated the same score of -2 for each expert group who identified them as barriers, irrespective of whether two or more stakeholders in each group articulated these themes as such. Thus, the original CFIR framework does not provide policymakers with sufficiently actionable information, that is, information that would be useful in assisting their decision-making about which of these barriers to prioritise to achieve relatively greater improvement in the ongoing implementation of a program.
In the real-world context of policymakers and service providers having limited resources and time, the continuous scores generated by the CFIR-f enhances the pragmatic usefulness of the results of an implementation evaluation. Specifically, it generates information that originates directly from the experiences of key stakeholders who have been engaged in the implementation of a program, it is less susceptible to evaluator bias, and it presents a list of enablers and barriers that are ranked by the frequency with which they were identified. Generating and presenting information in this way aims to make that information more actionable for policymakers and service providers in determining the order in which barriers should be addressed, and enablers should be leveraged, to optimally improve implementation.
Despite using the 50% cut-off threshold for identifying themes (step 2), and the allocation of scores based on the number of participants who identified themes as enablers or barriers (step 3), the proposed CFIR-f scoring method could most likely be improved by further clarifying the criteria for how themes are classified and scored as neutral (0). As for the original CFIR, the CFIR-f approach still relies on the interpretation and judgement of researchers (e.g., to identify whether a theme is perceived by participants as ‘indifferent’ or ‘unclear’). However, specifying that most participants are required to identify relevant themes in neutral terms, or that an equal number of participants are required to identify a specific theme as either an enabler or barrier to implementation, may provide the CFIR-f with a greater level of objectivity and standardisation, relative to the CFIR.
Another consequence of using the 50% cut-off threshold for identifying themes was that none of the themes identified by the DCJ participants were classified as enablers or barriers (because less than 50% of the DCJ group identified these themes). Although this suggests that there was less consensus regarding implementation issues among DCJ participants, relative to the other three policy and program expert groups, it may also highlight that further consideration could be given to how best to determine the level at which this cut-off threshold is set, including the idea that it might vary according to the circumstances of each evaluation. Alternatively, the cut-off threshold for identifying themes could be determined by using different, or more than one, criteria to reduce the reliance on one frequency-based criterion. For example, the threshold could be defined as including all enablers and barriers that were identified by at least 50% of participants and the most frequently cited enabler and barrier from any group whose views are excluded by the 50% threshold.
In addition, this initial version of the CFIR-f only considers the extent to which each enabler and barrier influences the implementation of a program through the assumption that the more frequently identified themes may be more influential. Future research could test the validity of this assumption. While the CFIR does not use this frequency-based approach (e.g., it only allocates a score of +2 or -2 to any theme identified by at least two participants), it does attempt to capture how strongly different enablers and barriers are likely to have influenced implementation by using complementary indicators, such as the level of agreement among participants, strength of language and the use of concrete examples [7]. However, a limitation of these indicators, and a key reason for the development of the CFIR-f, is their reliance on the evaluation team to interpret concepts such as “level of agreement” and “strength of language”. Moreover, these indicators ultimately rely on the same type of assumption as the CFIR-f’s frequency-based method in interpreting their pragmatic meaning: the use of “concrete examples” may reflect the relative simplicity of articulating a particular enabler or barrier, for instance, rather than the perceived strength of its influence on the implementation of a program.
A potential solution that could be tested in future research is the adoption of a matrix approach that incorporates both the frequency-based process of the CFIR-f and a less subjective method of capturing the relative impact of different enablers and barriers on the implementation of a program. The nominal group technique (NGT) method [25], for example, is a consensus-based approach to identifying priorities that the authors used in the evaluation of the FFT-CW® and MST-CAN programs in NSW [18, 19]. It may be possible to combine the thematic analysis approach in this study with the NGT method in either a two-step process (e.g., semi-structured interviews and thematic analysis, followed by the ranking of themes using NGT in a subsequent session) or a one-step process if it becomes possible to harness artificial intelligence (AI) technology to perform thematic analysis with sufficient rapidity and accuracy.
Implications for policy and practice
Using a standardised scoring and conceptual framework to assess the implementation of programs, such as the CFIR or the CFIR-f, should be routinely incorporated in the evaluations of public health programs to ensure there is standardisation in the way program enablers and barriers are identified. Solutions to barriers identified by expert participants (e.g., model purveyors and intermediaries) can then be explored using focus groups, co-design workshops or forums with a range of relevant stakeholders. Although both the CFIR and the CFIR-f are appropriate, the key implication of the latter for policy and practice is that it engenders a list of barriers ranked by frequency, which service providers and policymakers can use to inform their decisions about issues for action. Making the results of program implementation more actionable makes it easier for policymakers and funders to justify their decisions based on the ratings of independent experts.
Despite the CFIR-f being developed and piloted in the child maltreatment space, it is readily applicable to other programs delivered in education, workplace, legal or social services. It can also accommodate for iterative changes made to the constructs encompassed by the CFIR, such as the latest adaptations of CFIR 2.0 [15] which were incorporated in the present study, or the proposition to include real and perceived implementation and program outcomes, as well as outcome moderators, in future [26]. With enough applications of the CFIR-f, it is likely that it could be further refined to improve its practical usefulness in ensuring that barriers to program implementation, in particular, are identified as early as possible and modified to ensure they are maximally tailored to meeting the needs of their target populations. Integrating the views of other key participants involved in, or impacted by, the delivery of these programs into the CFIR-f scoring system, for example, may be warranted in future iterations. If the current study were to be replicated, this may include interviews with families who have had varying levels of engagement with the FFT-CW® and MST-CAN programs (i.e., families who exited early from, or who completed either program in full).
Conclusions
This paper proposes a more frequency-based approach to CFIR’s scoring process (the CFIR-f). The CFIR-f identified the same barriers and enablers as the original CFIR scoring process. By specifically utilising the frequency with which these barriers and enablers are identified, the CFIR-f engenders a list of ranked themes that service providers and policymakers can use to inform their decisions about program modification and implementation.
Supporting information
S1 Table. Key ideas of themes identified as barriers, enablers or neither from semi-structured interviews.
https://doi.org/10.1371/journal.pone.0295204.s001
(PDF)
S2 Table. Definitions of CFIR domains and constructs.
https://doi.org/10.1371/journal.pone.0295204.s002
(PDF)
S1 Appendix. Evaluation questions to assess adaptation and implementation of FFT-CW® and MST-CAN in NSW, Australia.
https://doi.org/10.1371/journal.pone.0295204.s003
(PDF)
Acknowledgments
The research team acknowledges and thanks all policy experts and service providers who participated in data collection for this study. We acknowledge the contribution of Dr. Paula Jops and Dr. Hueiming Liu to conducting interviews with some of the policy experts in this study.
References
- 1. Johnston L, Doyle J, Morgan B, Atkinson-Briggs S, Firebrace B, Marika M, et al. A review of programs that targeted environmental determinants of Aboriginal and Torres Strait Islander health. International Journal of Environmental Research and Public Health. 2013;10(8): 3518–3542. pmid:23939388
- 2. Knight A, Shakeshaft A, Havard A, Maple M, Foley C, Shakeshaft B. The quality and effectiveness of interventions that target multiple risk factors among young people: A systematic review. Australian and New Zealand Journal of Public Health. 2017;41(1): 54–60. pmid:27624886
- 3. Akin BA, Strolin-Goltzman J, Collins-Camargo C. Successes and challenges in developing trauma-informed child welfare systems: A real-world case study of exploration and initial implementation. Children and Youth Services Review. 2017;82: 42–52.
- 4. Albers B, Hateley-Browne J, Steele T, Rose V, Shlonsky A, Mildon R. The early implementation of FFT-CW®, MST-Psychiatric®, and SafeCare® in Australia. Research on Social Work Practice. 2020;30(6): 658–77.
- 5. Ronto R, Rathi N, Worsley A, Sanders T, Lonsdale C, Wolfenden L. Enablers and barriers to implementation of and compliance with school-based healthy food and beverage policies: a systematic literature review and meta-synthesis. Public Health Nutrition. 2020;23(15): 2840–2855. pmid:32317047
- 6. Damschroder LJ, Aron DC, Keith RE, Kirsh SR, Alexander JA, Lowery JC. Fostering implementation of health services research findings into practice: a consolidated framework for advancing implementation science. Implementation Science. 2009;4(1): 1–5. pmid:19664226
- 7. Damschroder LJ, Lowery JC. Evaluation of a large-scale weight management program using the consolidated framework for implementation research (CFIR). Implementation Science. 2013;8: 1–7. pmid:23663819
- 8. Keith RE, Crosson JC, O’Malley AS, Cromp D, Taylor EF. Using the Consolidated Framework for Implementation Research (CFIR) to produce actionable findings: a rapid-cycle evaluation approach to improving implementation. Implementation Science. 2017;12(1): 1–2. pmid:28187747
- 9. Dekker M, Jongerden IP, de Bruijne MC, Jelsma JG, Vandenbroucke-Grauls CM, van Mansfeld R. Strategies to improve the implementation of infection control link nurse programmes in acute-care hospitals. Journal of Hospital Infection. 2022;128: 54–63. pmid:35820554
- 10. Kirk A, Kelley C, Yankey N, Birken SA, Abadie B, Damschroder L. A systematic review of the use of the Consolidated Framework for Implementation Research. Implementation Science. 2015;11(1): 1–13. pmid:27189233
- 11. English M. Designing a theory-informed, contextually appropriate intervention strategy to improve delivery of paediatric services in Kenyan hospitals. Implementation Science. 2013;8: 1–3. pmid:23537192
- 12. Manca DP, Aubrey-Bassler K, Kandola K, Aguilar C, Campbell-Scherer D, Sopcak N, et al. Implementing and evaluating a program to facilitate chronic disease prevention and screening in primary care: a mixed methods program evaluation. Implementation Science. 2014;9(1): 1–9. pmid:25293785
- 13. Cheng C, Walsh A, Jones S, Matthews S, Weerasooriya D, Fernandes RJ, et al. Development, implementation and evaluation of a seven-day clinical pharmacy service in a tertiary referral teaching hospital during surge-2 of the COVID-19 pandemic. International Journal of Clinical Pharmacy. 2023;45(2): 293–303. pmid:36367601
- 14. Kalkan A, Roback K, Hallert E, Carlsson P. Factors influencing rheumatologists’ prescription of biological treatment in rheumatoid arthritis: an interview study. Implementation Science. 2014;9(1): 1–12. pmid:25304517
- 15. Damschroder LJ, Reardon CM, Widerquist MAO, Lowery J. The updated Consolidated Framework for Implementation Research based on user feedback. Implementation Science. 2022;17(1): 1–16. pmid:36309746
- 16. Shade L, Ludden T, Dolor RJ, Halladay J, Reeves K, Rees J, et al. Using the Consolidated Framework for Implementation Research (CFIR) to evaluate implementation effectiveness of a facilitated approach to an asthma shared decision making intervention. Journal of Asthma. 2021;58(4): 554–63. pmid:31868043
- 16. Damschroder LJ, Reardon CM, Opra Widerquist MA, Lowery J. Conceptualizing outcomes for use with the Consolidated Framework for Implementation Research (CFIR): the CFIR Outcomes Addendum. Implementation Science. 2022;17(1): 1–10. pmid:35065675
- 17. Shakeshaft A, Economidis G, D’Este C, Oldmeadow C, Tran DA, Nalukwago S, et al. The application of Functional Family Therapy-Child Welfare (FFT-CW®) and Multisystemic Therapy for Child Abuse and Neglect (MST-CAN) to NSW: an early evaluation of processes, outcomes and economics. NSW Government’s Their Futures Matter. 2020 Jun 29 [Cited 2023 May 18]. Available from: https://disability.royalcommission.gov.au/system/files/exhibit/NSW.0030.0046.0001.pdf
- 18. Economidis G, Farnbach S, Falster K, Eades AM, Shakeshaft A. Enablers and barriers to the implementation of Multisystemic Therapy for Child Abuse and Neglect (MST-CAN) into the routine delivery of child protection services. Children and Youth Services Review. 2023;155:107297.
- 19. Economidis G, Farnbach S, Falster K, Eades AM, Shakeshaft A. Identifying enablers and barriers to the implementation of Functional Family Therapy–Child Welfare (FFT-CW®) into the routine delivery of child protection services in New South Wales, Australia. Children and Youth Services Review. 2023;149:106927.
- 20.
Bingham A, Witkowsky P. Deductive and inductive approaches to qualitative data analysis. In: Vanover C, Mihas P, & Saldana J, editors. Analyzing and Interpreting Qualitative Research: After the Interview. USA: Sage Publications; 2021. pp. 133–146.
https://doi.org/
10.1080/2159676X.2019.1628806
- 21. Braun V, Clarke V. Reflecting on reflexive thematic analysis. Qualitative Research in Sport, Exercise and Health. 2019;11(4): 589–597.
- 22. Etikan I, Musa SA, Alkassim RS. Comparison of convenience sampling and purposive sampling. American Journal of Theoretical and Applied Statistics. 2016;5(1): 1–4.
- 23.
Miller LA, Lovler RL. Foundations of psychological testing: A practical approach. 1st ed. USA: Sage Publications; 2018.
https://doi.org/
10.1186/1752-4458-8-46
- 24. Murphy AL, Gardner DM, Kutcher SP, Martin-Misener R. A theory-informed approach to mental health care capacity building for pharmacists. International Journal of Mental Health Systems. 2014;8(1): 1–1. pmid:25473416
- 25. Van de Ven AH, Delbecq AL. The nominal group as a research instrument for exploratory health studies. American Journal of Public Health. 1972;62(3): 337–42. pmid:5011164