Skip to main content
Advertisement
  • Loading metrics

Effects of digital chatbot on gender attitudes and exposure to intimate partner violence among young women in South Africa

  • Alexandra De Filippo,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing

    Affiliation Behavioral Insights Team, New York, New York, United States of America

  • Paloma Bellatin,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Validation, Writing – original draft, Writing – review & editing

    Affiliation Behavioral Insights Team, New York, New York, United States of America

  • Neville Tietz,

    Roles Data curation, Investigation, Methodology, Project administration, Resources, Software, Writing – review & editing

    Affiliation Praekelt.org, Johannesburg, South Africa

  • Eli Grant,

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Writing – review & editing

    Affiliation Independent consultant, Connecticut, United States of America

  • Alexander Whitefield,

    Roles Data curation, Formal analysis, Investigation, Methodology, Validation, Visualization, Writing – review & editing

    Affiliations Behavioural Insights Team, London, United Kingdom, Department of Economics, Duke University, Durham, North Carolina, United States of America

  • Puseletso Nkopane,

    Roles Conceptualization, Investigation, Project administration, Validation, Writing – review & editing

    Affiliation Praekelt.org, Johannesburg, South Africa

  • Camilla Devereux,

    Roles Conceptualization, Investigation, Methodology, Validation, Writing – review & editing

    Affiliation Behavioural Insights Team, London, United Kingdom

  • Kaitlyn Crawford,

    Roles Data curation, Investigation, Project administration, Writing – review & editing

    Affiliation Praekelt.org, Johannesburg, South Africa

  • Benjamin Vermeulen,

    Roles Conceptualization, Funding acquisition, Methodology, Project administration, Resources, Software, Validation

    Affiliation Praekelt.org, Johannesburg, South Africa

  • Abigail M. Hatcher

    Roles Investigation, Methodology, Project administration, Supervision, Visualization, Writing – original draft, Writing – review & editing

    abbeymae@email.unc.edu

    Affiliations Department of Health Behavior, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, North Carolina, United States of America, School of Public Health, Faculty of Health Sciences, University of the Witwatersrand, Johannesburg, South Africa

Abstract

Background

South Africa has among the highest rates of intimate partner violence (IPV) globally, with young women at heightened risk due to inequitable gender roles, limited relationship skills, and inadequate social support. Despite an urgent need for violence prevention in low- and middle-income settings, most efficacious approaches are time-intensive and costly to deliver. Digital, interactive chatbots may help young women navigate safer relationships and develop healthier gender beliefs and skills

Methods

Young women (18–24 years old) across South Africa were recruited via Facebook for participation in an individually randomised controlled trial (n = 19,643) during the period of June 2021-September 2021. Users were randomly allocated, using a pipeline algorithm, to one of four trial arms: Pure Control (PC) had no user engagement outside of study measures; Attention Treatment (T0) provided didactic information about sexual health through a text-based chatbot; Gamified Treatment (T1) was a behaviourally-informed gamified text-based chatbot; Narrative Treatment (T2) was a behaviourally-informed drama delivered through pre-recorded voice notes. All chatbots were delivered in WhatsApp, through which users were invited to complete brief “quizzes” comprising adapted versions of validated scales. Primary outcomes were short-form adaptations of scales for gender attitudes (Gender Relations Scale) and past-month IPV (WHO Multi-country Study Instrument). Secondary outcomes were identification of unhealthy relationship behaviours (Intimate Partner Violence Attitudes Scale) and brief screener for depressive symptoms (Patient Health Questionnaire). A direct chat link to a trained counsellor was a safety measure (accessed by 4.5% of the sample). We estimated treatment effects using ordinary least squares and heteroskedasticity robust standard errors

Findings

The trial retained 11,630 (59.2%) to the primary endpoint of gender attitudes. Compared to control, all treatments led to moderate and significant changes in attitudes towards greater gender equity (Cohen’s D = 0.10, 0.29, 0.20 for T0, T1, and T2, respectively). The gamified chatbot (T1) had modest but significant effects on IPV: 56% of young women reported past-month IPV, compared to 62% among those without treatment (marginal effects = -0.07, 95%CI = -0.09to-0.05). The narrative treatment (T2) had no effect on IPV exposure. T1 increased identification of unhealthy relationship behaviours at a moderate and significant level (Cohen’s D = 0.25). Neither T1 nor T2 had a measurable effect on depressive symptoms as measured by the brief screener. Interpretation: A behaviourally-informed, gamified chatbot increased gender equitable attitudes and was protective for IPV exposure among young women in South Africa. These effects, while modest in magnitude, could represent a meaningful impact given potential to scale the low-cost intervention.

Author summary

Intimate partner violence (IPV) affects one in three women worldwide and harms health and well-being. Yet, there are few low-cost ways proven to prevent IPV. We tested if digital, interactive chatbots could help young women across South Africa stay safer by changing their attitudes and skills. We assigned similar young women to one of two chatbots. The first was gamified (with emojis and small tasks through Whatsapp chat). The second was narrative (with a story pre-recorded using Whatsapp voice notes). Their responses were compared to young women randomly assigned to a control group that received information-only or no chatbot at all. Both gamified and narrative chatbots improved young women’s attitudes and increased skills in identifying unhealthy relationship. The gamified chatbot only reduced IPV after three months. This is one of the first digital approaches to decrease IPV exposure.

Introduction

Exposure to intimate partner violence [IPV] among South African women is higher than most settings globally. One in five women report lifetime IPV exposure [1], and rates of femicide are six times the global average [2]. Younger women report higher rates of IPV compared to older peers [3], and in one study 53% of young women aged 14–19 years reported physical and/or sexual IPV [4]. Subtler forms of unhealthy relationship behaviours are also prominent for young women, including age-disparate relationships, early unintended pregnancies, and relationships characterised by controlling partners [1,5,6].

Extant cross-sectional and qualitative literature points to factors associated with IPV exposure among young women. Inequitable gender attitudes [7,8], inadequate social support [9], and limited knowledge of harmful relationship behaviours [10], are associated with higher IPV exposure. Building skills around autonomy and agency in relationships is tied to lower IPV exposure among young women [11,12].

Experimental data from sub-Saharan Africa include SASA! in Uganda in which critical discussions around gender reduced IPV exposure among women [13], and IMAGE South Africa and Tanzania, where group-based gender workshops combined with microfinance reduced adult women’s IPV [14,15]. Yet, of 36 African trials reviewed in 2022, only one-third reduced IPV among women [16]. Similarly, few rigorously tested approaches have been shown to reduce men’s perpetration of IPV in African trials. Group-based gender training with younger men may decrease reported IPV perpetration [17], particularly if combined with economic livelihoods [18]. However, other trials have had null findings for men’s IPV perpetration [13,19,20].

Mobile devices offer a promising channel to deliver targeted edutainment at scale through mHealth, a strategy shown to effectively encourage health-seeking behaviours related to diabetes, weight loss, physical activity, smoking cessation, and medication adherence [21]. Chatbots are a type of mHealth strategy that builds interactivity into digital interventions. By enabling two-way communications, behaviourally-informed chatbots offer a dynamic format to deliver strategies known to encourage behavioural change more effectively. However, few digital interventions for violence prevention have been tested to date in African settings [16].

Edutainment is another promising approach to reduce young women’s violence exposure, though this has been understudied to date. Through relatable characters or influential messengers, edutainment can convey educational messages, relevant information [such as where to seek help], or promote healthful social norms [22].

The evidence for whether edutainment can reduce IPV exposure or improve gender attitudes is promising, but inconclusive. Soul City, a large-scale edutainment campaign in South Africa, led to knowledge about IPV, improved self-efficacy, and increased helpline calls [23]. The MyPlan app in Kenya improved women’s safety strategies but had null effects on violence exposure [24]. In Uganda, a media campaign in 112 villages featuring video vignettes during community film festivals significantly reduced IPV but had null effects on attitudes towards IPV [25]. In Egypt, short videos delivered via Whatsapp or Facebook detected no improvements in violence exposure or gender beliefs but did increase women’s knowledge and intention to access resources in future cases of IPV [26].

We conducted a trial testing whether two versions of a behaviourally-informed chatbot could improve gender attitudes in order to reduce IPV exposure among young women in South Africa.

Materials & methods

We conducted a digital, individually randomised controlled trial with 4 arms, consisting of:

  • Pure Control: quizzes measuring outcomes only
  • Attention Treatment: interactive chatbot featuring information about sexual health
  • T1: gamified interactive chatbot delivering behaviourally-informed content
  • T2: narrative-based interactive chatbot delivering behaviourally-informed content

The target audience for ChattyCuz was young women aged 18 to 24-years-old. Our primary hypotheses are as follows: If chatbots can be effective at improving attitudes and relationship safety among young South African women, compared to pure control T1 and T2 will demonstrate:

  1. higher mean score on gender attitudes scale, and;
  2. lower women’s past-month self-reported IPV exposure

We secondarily hypothesised that identification of unhealthy relationship behaviours would be higher and depressive symptoms would be lower for T1 and T2 compared to control.

Inclusion criteria

We targeted a sample of South African young women who responded to targeted advertising on Facebook, inviting them to engage with a WhatsApp chatbot. Advertising prompts were sent only to individuals who self-identify on their Facebook profiles as female between 18 and 24 years; and were living in South Africa (though they may be national or non-national residents).

Treatment arms

Pure control arm exposed eligible participants only to specific evaluation quizzes. The quiz pertaining to gender attitudes was delivered directly after eligibility screening. Follow-up quizzes three months later evaluated IPV exposure and the brief depressive symptoms screener.

Attention treatment arm (T0) was modelled on an existing chatbot, Big Sis. It featured the ChattyCuz brand and persona, but did not include any of the behaviourally-informed content modules. Instead, the Attention Treatment chatbot addressed the basics of sex, reproduction and puberty, and allowed users to browse topics by thematic areas, such as “love and relationships” (e.g. “Do relationships = sex?”) “sex basics” (e.g. “What is virginity?”, “Why might sex hurt?”) and “protecting yourself” (e.g. “How to use a condom”). The content comprised overlapping concepts and thematic areas, which enabled the delivery of the same evaluation quizzes as the treatment arms.

ChattyCuz-Gamified (T1) is a WhatsApp chatbot that aims to support young women in navigating intimate relationships. ChattyCuz is designed as a safe, private space. It offers simple, symbolic rewards to guide and motivate users throughout their interactions. T1 follows an interactive script that adapts depending on responses provided by the user. Regardless of whether the user answers a question correctly, ChattyCuz gives feedback on the response. T1 was designed to appear as a unique personality based on the characteristics of an older cousin (approachable, relatable, and supportive of the user). She takes users through a series of different topics, offering a critical reflection on power in relationships; skills practice to identify unhealthy relationship behaviours; learn healthy communication and coping mechanisms; affirming values to build self-efficacy; and planning for safety. Users are encouraged to collect six Squad Members to become a Relationship Superstar (Table 1).

thumbnail
Table 1. Module content for gamified chatbot and narrative chatbot.

https://doi.org/10.1371/journal.pdig.0000358.t001

ChattyCuz-Narrative (T2) uses storytelling to connect with audiences and delivers the themes through WhatsApp voice notes (short digital audio clips) and short sections of text. The narrative follows the life of Sandiswa, a young woman engaged in an embattled relationship with a man named Jabu. Each voice clip opens with a short, ambiance-rich scene (two characters arguing; a character at a family party with her sister-in-law; a character playing a song). The scene then fades and Sandiswa speaks directly to the user to summarise what happened, how she feels and the dilemma she faces. By ‘helping’ Sandiswa to make life choices (who she should trust, how to communicate with Jabu, and how to plan for safety) ChattyCuz-Narrative’s design aims to improve users’ knowledge and relationship skills in line with module aims. The aim of this format is to make the user feel as if she is a trusted person that the narrator wants to speak to during her daily life. To maximise accessibility of the narrative, the user is presented with the choice of hearing the story through voice clips, or reading it as text.

Additional information about each chatbot can be found in the Online Supplemental Appendix.

Measures

The outcome variables for this trial are adapted versions of existing validated scales translated into quizzes (mimicking those in teen magazines) through an iterative, user-centred process. We conducted cognitive interviews to test quiz questions with end users, gathering their feedback on question phrasing and level of comprehension. In October 2020, we validated multiple, short-form adapted surveys (n = 1,088 young women), and invited a sub-sample to retest in December 2020 (n = 231).

Gender attitudes were measured 4 days after allocation using an adapted Gender Relations Scale (aGRS) [27]. The aGRS is comprised of 6 items scored on Likert-type responses and had an acceptable internal consistency in the pre-trial validation with n=1,088 participants (Cronbach’s ⍺ = 0.75). An example item is “It’s natural for a guy to have girls on the side while he’s dating you.” We analysed aGRS as a continuous outcome with higher scores denoting more gender equitable beliefs.

IPV exposure was assessed 3 months after allocation using an adapted WHO Multi-country Study Instrument (aWHO) [28]. comprising two physical and one sexual violence measure shown to be sensitive and specific in the South African setting. An example item is “In the past month, did your bae ever slap, hit you or throw something at you?”. In addition, we added two technological violence items. An example is “If you sent your bae a sexy photo of yourself, are you afraid they might share it?” In the validation assessment (n = 67), internal consistency of the aWHO was acceptable (Cronbach’s ⍺ = 0.75).

Identification of unhealthy relationship behaviors was measured 3 days after allocation using an adapted Intimate Partner Violence Attitudes Scale (aVAS) [29]. This scale comprises abuse and control subscales to measure psychological violence identification. An example item is “My bae never hurts me but I understand why sometimes he threatens to do it so that I get in line.” Validation with 74 participants found that internal consistency was high (Cronbach’s ⍺ = 0.82).

Depressive symptoms were assessed using a brief self-reported symptom screener: the two-item Patient Health Questionnaire (PHQ-2). A score of 3 or greater is suggestive of probable depression.

Socio-demographic variables were ascertained in single item form at baseline. Age was self-completed. Gender was self-reported as either female, male, or non-binary. Sexual orientation was ascertained through a single item asking users whether they were attracted to men or women, “it is complicated” or “prefer not to say”. Relationship status was assessed through a single item asking whether users were seeing “someone special”. The state of participants’ mental distress was assessed through one item asking if, in the past month, they had been mostly “energised and hopeful”, “just getting along”, or “down and hopeless”. Food security level was assessed by an item asking if users had been hungry in the last two months because “there was not enough food in their house”. Household composition was measured using a single item asking how many people lived in the household with the user. Participants were also asked whether they had any children. Province was self-reported.

Procedures

Facebook advertisements targeted mobile devices (to be viewed on mobile news feeds only) and seamlessly directed interested participants to WhatsApp. We conducted pipeline randomisation using a custom-built randomisation algorithm implemented as an R application and run as a separate web service. After consenting to participate in the study, users were randomised into either Pure Control (20%), Attention Control (20%), ChattyCuz-Gamified (T1), or ChattyCuz-Narrative (T2).

The trial was set up on RapidPro (open source jointly owned by UNICEF and Nyaruka), an open-sourced user management system that records user states and manages inbound and outbound messages between users and the chatbot [30]. The trial was set up with content stored in the form of RapidPro “flows”, a series of steps with built-in logic to direct users through the experience based on their interactions with the chatbot.

When users were allocated into the different trial arms using the randomisation algorithm, users’ records were stored in a PostgreSQL database, along with user data and message interactions exchanged between users and ChattyCuz. To support chatbot users, careful monitoring of the database was required as it grew throughout the trial (possibly due to the design of the program, which involved a high ratio of outbound to inbound messages).

Users participating in the trial were remunerated with ZAR 30 (approximately $2.00) on completion of all components of the respective trial arms, and ZAR 10 (approximately $0.70) on completion of a follow-up quiz three months later. Remuneration was carried out by connecting RapidPro to a DT One account with pre-loaded airtime. This integration with DT One is built into RapidPro and requires only the DT One account and API key to be specified to connect the account. The airtime remunerations were defined based on local research standards and triggered automatically when users reached the end of specified RapidPro flows within the different trial arms.

Data privacy and protection

Personal data collected in the trial was covered by Praekelt.org’s data protection policy. Data handling, storage and transfers comply with national and international guidelines, standards and legal frameworks, notably the General Data Protection Regulation of the European Union. ChattyCuz complies with the record management in the Protection of Personal Information Act 4 of 2013.

User data recorded during the trial included age, gender and telephone number, and responses to quizzes, polls or surveys, or questions asked by the user. This data was stored in a secure PostgreSQL database. Users were exposed to Terms and Conditions as part of the consent procedure (S1 Table). The Terms and Conditions explained, in simple terms, that should they choose to participate, their data would be collected and used for statistical or research purposes. This data was visible to both Praekelt.org and BIT throughout the trial. All personally identifiable information was removed from the data (i.e. the data was anonymised) before we transferred it from Praekelt.org to BIT for analysis.

Consent procedures

During the onboarding process, users received information about the intervention and the research study. After users clicked on a Facebook advertisement on their device, they were directed to WhatsApp. On WhatsApp, users were exposed to a brief description of the trial with all the key information required for consent. The user then received a PDF explaining the nature of the trial in plain terms. The user was asked if they wanted to participate in the trial; if they agreed, they were allocated to a trial arm at random using a built-in algorithm. The user was automatically remunerated with preloaded airtime after the completion of the primary outcome quiz (S1 Fig). Users who could not receive automated airtime remunerations were sent money vouchers, redeemable as cash or airtime at any bank ATM, manually via SMS.

Analysis

For primary analysis 1, we estimated the following model:

Where:

  • γi is Primary Outcome 1; a score of attitudes and beliefs on power equity in relationships;
  • T1 is a dummy variable = 1 if individual i was allocated to T1 (Gamified bot);
  • T2 is a dummy variable = 1 if individual i was allocated to T2 (Narrative bot);
  • BAU is a dummy variable = 1 if individual i was allocated to Attention control;
  • Xi is vector of control variables, which includes:
    1. ○ A measure of Baseline Attitudes and Beliefs on Power equity in the relationships (taken at the beginning of the chatbot experience).
    2. ○ Region;
    3. ○ Gender;
    4. ○ Age;
    5. ○ Mental health score;
    6. ○ Socioeconomic status—food security;
    7. ○ Household composition;
    8. ○ Children;
    9. ○ Partnership status;
    10. ○ Province.
  • ui is the error term.

The coefficients of interest are β1, which measures the impact of being allocated to T1 compared to Control; β2, which measures the impact of being allocated to T2 compared to Control; and β3, which measures the impact of being allocated to Attention Control compared to Control, on the outcome measure of interest.

We estimated the model using ordinary least squares and used heteroskedasticity robust standard errors. We modelled the primary outcome with a series of three regressions: (1) an unadjusted treatment-only model, (2) a model adjusted for covariates gender and region of recruitment, (3) a model adjusted for a full set of collected covariates. The models will be compared using a goodness-of-fit statistic, AIC. If there are qualitative differences between the results of these models, we will focus our reporting on the results of the model with the lowest AIC.

We adopted a strategy of using multiple comparisons adjustments for the primary analysis of primary and secondary outcomes, employing the Benjamini-Hochberg step-up procedure. In brief, this uses the traditional p<0.05 threshold as a marker of statistical significance, but reduces it each time an additional comparison is made. The primary analysis adjusts for 2 comparisons: T1 vs. Pure Control; T2 vs. Pure Control. The primary analysis for secondary outcomes adjusted for 2 comparisons: T1 vs Attention, and T2 vs Attention.

Ethical considerations

Our trial received ethical approval by Pharma-Ethics (Protocol number 2019063; 21 September 2020). We pre-registered our trial on the Open Science Framework (16 March 2021).

Given the sensitive nature of this study, the safety for the study population informed the design of the trial and the platforms through which it was delivered. Participants in the trial were 18 or older, the age when young persons are able to consent on their own to research in South Africa. For participants requiring additional support, access to text-based individual counselling services was provided by an external counselling service provider, MobieG. This non-governmental organisation already offered text-based counselling, and our trial provided additional training specific to responding to IPV.

We established a platform for the trained MobieG counsellors to use WhatsApp for individual counselling. Reply messages from counsellors were channelled through RapidPro without interrupting the active conversations between users and counsellors. A benefit of this approach is that it allowed us to keep the interactions with the counsellors within WhatsApp, rather than directing users to an external platform that would have disrupted the user experience. A total of n = 631 users took up the counselling (4·5% of the sample who were followed to midline). The quality of the counselling experience and moderation of trial outcomes will be the subject of a separate paper.

Results

We recruited 19,643 women in 31 days. As expected, 20.0% were allocated to Pure Control, 20.0% to Attention Control, 30.0% to ChattyCuz-Gamified and 30.0% ChattyCuz-Narrative (Fig 1).

As Table 2 indicates, baseline characteristics are very similar for all these conditions. Across all arms, the average age was 21.2 and 70.7% were in the relationship. Overall, 31.7% of participants reported to have symptoms consistent with probable depression, with similar rates across arms.

Of the 19,643 participants we recruited, 11,630 (59.2%) completed the Adapted GRS quiz, our first primary outcome. We refer to these participants as midline participants. As expected, almost all (98%) of participants in the Pure Control completed the midline quiz as this was asked on the same day as recruitment in this trial arm. There were differences between trial arms regarding retention of participants, with T0 having notably high attrition. The baseline covariates are similar across all trial arms. However, in terms of statistical significance, the treatment groups are slightly different. Descriptive statistics for all groups and highlights statistically significant imbalances, where groups differ on attributes at the 10% level confidence level, in red (Table 2). These differences are due to different rates of attrition and perhaps different types of attrition across the treatment groups.

Primary outcome: Equitable attitudes

We found that all three chatbots improved beliefs in power equity in relationships compared to receiving no intervention (“pure control”). The effect sizes for the Attention Control, Narrative and Gamified bot were 0.10, 0.20 and 0.29 (Cohen’s D, 2.s.f.) respectively. While these effect sizes are modest, they are substantial in terms of real-world significance given the potential scale at which chatbots can be delivered.

Effect sizes for the experimental chatbots (Narrative and Gamified) were found to be relatively stable after adding important controls to the models (gender attitudes, age, mental health, and partnership status at baseline). Effect sizes for the Attention Control, however, decreased as more controls were included, confirming the suspicion that attrition and differences in observed variables could account for the treatment effects observed in this trial arm. As indicated in Table 3, we find the effect of all chatbots are significant at 5% level across specifications, after correcting for multiple comparisons.

thumbnail
Table 3. Primary trial outcomes, by randomization arm: Gender beliefs and partner violence.

https://doi.org/10.1371/journal.pdig.0000358.t003

Fig 2 presents the outcome for regression 3, the prespecified model.

thumbnail
Fig 2. Gender relations scale (T1, T2, and Attention Control compared to Control).

https://doi.org/10.1371/journal.pdig.0000358.g002

Primary outcome: IPV Exposure

We found small but significant effects for T1 on overall past-month IPV exposure at three-month follow-up (Table 3). The 1043 women who completed the follow-up quiz from the ChattyCuz-Gamified arm reported lower rates of IPV exposure (55%) than young women in the pure control group (62%, Fig 3).

thumbnail
Fig 3. Past-month IPV exposure (T1 and T2 compared to Control).

https://doi.org/10.1371/journal.pdig.0000358.g003

T2 demonstrated no effects on IPV exposure. Estimates for Attention Control were not robust to additional controls, suggesting differential attrition based on observed variables account fully for treatment effects observed in this arm. We therefore omit T0 in the presentation of data.

In supplemental analysis we examine IPV by type and see that all types of violence are reduced, with psychological and technological forms of IPV significantly reducing this exposure by 8% (S2 Table). Physical and/or sexual IPV was reduced by 2% and this point estimate did not reach statistical significance (S3 Table).

Secondary outcomes

Identification of unhealthy relationship behaviours.

We found that both T1 and T2 increased users’ ability to correctly identify unhealthy relationship behaviours compared to the Attention Control. On a scale of 6–18 the score of the Attention Control group was 14.80 while the score of the T1 and T2 chatbots were 15.51 and 15.22 respectively (Fig 4).

thumbnail
Fig 4. Identification of unhealthy relationship behaviors (T1 and T2 compared to Attention Control).

https://doi.org/10.1371/journal.pdig.0000358.g004

Mental health

T1 was slightly protective for depressive symptoms, but this was not statistically significant (S2 Fig).

The proportion of users who expressed interest in receiving counselling was much higher in the control conditions (above 17%), compared to 8.4% of users in the ChattyCuz-Narrative group and 6.9% in the ChattyCuz-Gamified group. Similarly, users in the control conditions accessed counselling at more than twice the rate of the treatment arms.

Discussion

In an experimental trial among young women in South Africa, a gamified digital chatbot improved equitable gender beliefs and made a small, but significant, reduction in past-month IPV exposure. This gamified chatbot is among the first digital interventions to effectively reduce IPV exposure among young women. The chatbot significantly improved skills around identifying unhealthy relationships, but did not alter depressive symptoms compared to the control. These primary results are robust across specifications and robustness checks, including comparison to an attention control condition.

Ours is the first study, to our knowledge, to compare two digital communication approaches in relation to actual IPV exposure that offers depth to the growing edutainment field. We hypothesise that the observed reduction in IPV is due to the gamified chatbot offering tailored, immediate feedback on gender beliefs, identifying unhealthy relationship behaviours, and implementing skills to stay safe–all shown to be precursors for staying safe in relationships [13,31,32]. The MyPlan app in Kenya had promising results around safety skills and intentions, but no effect on violence exposure [24].

While the narrative chatbot did improve gender equitable attitudes, it had no measurable effect on IPV exposure. Two hypotheses may frame these null findings. Firstly, the narrative chatbot was less directive, allowing users to reach their own conclusions from the experience of a fictional character. In India, “explicit” calls to action were more impactful than “implicit” narratives in terms of changing beliefs about IPV [33]. Similarly, the effective intervention using mass media in Uganda reduced IPV not by shifting attitudes [as these remained largely unchanged] but by increasing women’s willingness to report violence [25]. Secondly, since many young South Africans already have a high saturation of narrative programming through Soul City [23], which may offer a ceiling effect in this particular setting.

Overall, both behaviourally-informed chatbots were significantly better at encouraging user retention and engagement, compared to information-focused chatbots. Technological solutions may improve youth’s access to care and could complement existing in-person services if young people feel more engaged or less threatened with this medium of communication [34]. Only 24% of Attention Treatment participants were retained (compared to 56% and 60% for T1 and T0, respectively). This could indicate that the behaviourally-informed chatbots are more engaging than typical information-only digital approaches. On the other hand, the high attrition limits our ability to assess impact on IPV exposure for the Attention Treatment arm.

The study was not powered to detect distinctions by IPV type, but this sub-analysis is important for the field. The gamified chatbot most effectively decreased reported exposure to technological and psychological safety. This has relevance for young women’s wellbeing, since technological and psychological forms of IPV may be more common, and have demonstrated preliminary causal associations with mental and physical health [35,36]. Physical and/or sexual IPV trended towards decreased exposure (p=0.02), suggesting this may require more targeted or in-person approaches.

Prevalence of depressive symptoms among this relatively large sample of young women was high, with 32% reporting symptoms consistent with probable depression on a brief screener. This rate is similar to studies nationally, which find up to 41% of young women exhibit elevated depressive symptoms [37]. Neither experimental chatbot was effective at reducing depressive symptoms, nor were they associated with increased distress. This offers an important confirmatory finding for the IPV field, since one ethical consideration for working with younger populations or intervening through low-cost digital interventions is the risk that disclosure of IPV could cause additional distress or even psychological harm. Our findings refute this claim and suggest that—with appropriate links to online counselling services—technologies like chatbots may neither increase depressive symptoms, nor be protective.

Implications for research and practice

The WhatsApp application is a popular internet protocol messaging platform, providing an opportunity to deliver chatbots at greater scale as the market expands. There are, however, inherent risks of digital IPV interventions, particularly if phones are not private or if male partners use technological violence or other controlling behaviours. Our opt-in, live individual counselling delivered via WhatsApp was an important safety consideration of the trial. This safety enhancement was able to draw upon a South African organisation with expertise in chat-based counselling for young people–a partnership that may be harder to identify in other LMIC settings. Providing this counselling service at-scale could be cost-effective (if it serves to identify and refer roughly 4.5% of those at highest need for individualised care), but requires a more integrated approach to de-identifying user data.

Several potential avenues for future research exist. Primary prevention with younger adolescent women, male peers, and adolescent pregnant women seems most promising given the high exposure to violence victimisation and perpetration among these groups. ChattyCuz was designed to reach girls at a young age, though South African ethical constraints mandate that research working directly with participants occur over the age of 18. Future studies should adapt and pilot the intervention for adolescents aged 17 or younger, finding creative methods of working with ethics boards to waive the need for parental informed consent [which negate the chatbot potential to reach young persons directly and may itself lead to safety concerns]. It would be important to evaluate whether this intervention could have a similar, or even more significant, effect among younger girls if we reach them prior to sexual debut or while forming initial intimate relationships.

Limitations

Findings should be viewed in light of study limitations. We recruited young women who were already on Facebook with access to a mobile phone with internet connection. While Facebook is one of the most widely used platforms, we are unable to generalise findings to young women with no private phone access or who do not use WhatsApp or Facebook. This is an important sampling limitation, since those young people who are highly vulnerable to violence exposure may have limited access to digital technologies. The heteronormative framing of the intervention is imprecise and deserves increased attention in South Africa, where sexual and gender minorities are at higher risk of IPV exposure than heterosexual counterparts [38].

Several design characteristics of digital intervention research are worth noting as potential limitations. Low literacy may have inhibited the ability of some participants to take part in the intervention or quizzes. We chose not to measure education as a socio-demographic covariate, but future studies should include this. It is possible that participants completing the quizzes were different from those who filled out baseline surveys, but we chose not to ask for South African identity numbers or private details because of confidentiality risks. Attrition can only be partially accounted for in our analytical approach. While retention was better than for most digital interventions, it is plausible that those in the endline sample are systematically different across arms. Indeed, we find that in the endline sample, there are statistically significant differences in observable characteristics between subgroups. However, because our models are able to control for these measurable differences, differential attrition due to observed characteristics should not affect the overall results. However, we are unable to control for unobserved characteristics and our main analysis requires a missing at random assumption. We conduct additional “robustness checks” in response to concerns that data is missing not at random, including a sensitivity analysis.

Conclusion

Digital solutions represent an important approach to scale efforts to reduce IPV among young women. A behaviourally-informed chatbot improved gender equitable attitudes and reduce exposure to IPV in the short-term. When coupled with tailored support for a small number who would benefit from live online counsellors, behaviourally-informed chatbots have potential to reach a large subset of population at low-cost. Chatbots may offer a promising strategy to support healthy norms and safe relationships.

Supporting information

S2 Fig. Mental health by trial arm, n = 5,022.

https://doi.org/10.1371/journal.pdig.0000358.s002

(DOCX)

S2 Table. Gender equitable beliefs regression.

https://doi.org/10.1371/journal.pdig.0000358.s004

(DOCX)

S4 Table. Technological and/or psychological IPV.

https://doi.org/10.1371/journal.pdig.0000358.s006

(DOCX)

Acknowledgments

We are appreciative in the many young women who took part in user testing, cognitive interviews, the pilot, formative co-creation workshops, and the Community Advisory Board. We thank Nkeletseng Tsesane for supporting the Community Advisory Board recruitment and Lele van Eck for facilitating their group discussions. We thank the MobieG team for direct counselling support for participants who required additional help. Girl Effect was monumental to the design of the gamified chatbot and the attention condition was composed of their Big Sis chatbot. The research team at BIT supported design and analysis.

References

  1. 1. SA S. South Africa Demographic and Health Survey 2016: Key Indicator Report. Pretoria: Stats SA, 2018.
  2. 2. Abrahams N, Mathews S, et al. Intimate partner femicide in South Africa in 1999 and 2009. PLoS Med. 2013;10(4):e1001412. pmid:23565064
  3. 3. Groves AK, Gebrekristos LT, et al. Describing Relationship Characteristics and Postpartum HIV Risk Among Adolescent, Young Adult, and Adult Women in South Africa. Journal of Adolescent Health. 2020. pmid:31992490
  4. 4. Shamu S, Gevers A, et al. Prevalence and risk factors for intimate partner violence among Grade 8 learners in urban South Africa: baseline analysis from the Skhokho Supporting Success cluster randomised controlled trial. Int Health. 2016;8(1):18–26. pmid:26637828
  5. 5. Closson K, Hatcher A, et al. Gender role conflict and sexual health and relationship practices amongst young men living in urban informal settlements in South Africa. Culture, Health & Sexuality. 2020;22(1):31–47. pmid:30762491
  6. 6. Hatcher AM, Gibbs A, et al. Gendered syndemic of intimate partner violence, alcohol misuse, and HIV risk among peri-urban, heterosexual men in South Africa. Soc Sci Med. 2022;295:112637. pmid:31708236
  7. 7. Mpondo F, Ruiter RAC, et al. Intimate Partner Violence and Its Association With Self-Determination Needs and Gender-Power Constructs Among Rural South African Women. J Interpers Violence. 2019;34(14):2975–95. pmid:27543301
  8. 8. Stockl H, March L, et al. Intimate partner violence among adolescents and young women: prevalence and associated factors in nine countries: a cross-sectional study. BMC Public Health. 2014;14:751. pmid:25059423
  9. 9. Russell M, Cupp PK, et al. Intimate partner violence among adolescents in Cape Town, South Africa. Prev Sci. 2014;15(3):283–95. pmid:23743796
  10. 10. Makongoza M, Nduna M. Awareness and rejection accounts of intimate partner violence by young women in Soweto, Johannesburg, South Africa. Journal of interpersonal violence. 2021;36(1–2):7–32. pmid:29294878
  11. 11. Zembe YZ, Townsend L, et al. Intimate Partner Violence, Relationship Power Inequity and the Role of Sexual and Social Risk Factors in the Production of Violence among Young Women Who Have Multiple Sexual Partners in a Peri-Urban Setting in South Africa. PLoS One. 2015;10(11):e0139430. pmid:26599394
  12. 12. Willan S, Ntini N, et al. Exploring young women’s constructions of love and strategies to navigate violent relationships in South African informal settlements. Cult Health Sex. 2019;21(11):1225–39. pmid:30632915
  13. 13. Abramsky T, Devries K, et al. Findings from the SASA! Study: a cluster randomized controlled trial to assess the impact of a community mobilization intervention to prevent violence against women and reduce HIV risk in Kampala, Uganda. BMC medicine. 2014;12(1):122. pmid:25248996
  14. 14. Kapiga S, Harvey S, et al. A social empowerment intervention to prevent intimate partner violence against women in a microfinance scheme in Tanzania: findings from the MAISHA cluster randomised controlled trial. Lancet Glob Health. 2019;7(10):e1423–e34. pmid:31537372
  15. 15. Pronyk PM, Hargreaves JR, et al. Effect of a structural intervention for the prevention of intimate-partner violence and HIV in rural South Africa: a cluster randomised trial. Lancet. 2006;368(9551):1973–83. pmid:17141704
  16. 16. Keith T, Hyslop F, et al. A Systematic Review of Interventions to Reduce Gender-Based Violence Among Women and Girls in Sub-Saharan Africa. Trauma Violence Abuse. 2022:15248380211068136. pmid:35057674
  17. 17. Jewkes R, Nduna M, et al. Impact of stepping stones on incidence of HIV and HSV-2 and sexual behaviour in rural South Africa: cluster randomised controlled trial. BMJ. 2008;337:a506. pmid:18687720
  18. 18. Gibbs A, Washington L, et al. Stepping Stones and Creating Futures Intervention to Prevent Intimate Partner Violence Among Young People: Cluster Randomized Controlled Trial. J Adolesc Health. 2019. pmid:31784410
  19. 19. Hossain M, Zimmerman C, et al. Working with men to prevent intimate partner violence in a conflict-affected setting: a pilot cluster randomized controlled trial in rural Cote d’Ivoire. BMC public health. 2014;14:339.
  20. 20. Pettifor A, Lippman SA, et al. Community mobilization to modify harmful gender norms and reduce HIV risk: results from a community cluster randomized trial in South Africa. J Int AIDS Soc. 2018;21(7):e25134. pmid:29972287
  21. 21. Hall AK, Cole-Lewis H, et al. Mobile text messaging for health: a systematic review of reviews. Annu Rev Public Health. 2015;36:393–415. pmid:25785892
  22. 22. Paluck EL, Ball L, et al. Social norms marketing aimed at gender based violence: A literature review and critical assessment. New York: International Rescue Committee, 2010.
  23. 23. Scheepers E, Christofides NJ, et al. Evaluating health communication–a holistic overview of the impact of Soul City IV. Health Promotion Journal of Australia. 2004;15(2):121–33.
  24. 24. Decker MR, Wood SN, et al. Safety decision-making and planning mobile app for intimate partner violence prevention and response: randomised controlled trial in Kenya. BMJ Glob Health. 2020;5(7). pmid:32675229
  25. 25. Green DP, Wilke AM, et al. Countering violence against women by encouraging disclosure: A mass media experiment in rural Uganda. Comparative Political Studies. 2020;53(14):2283–320.
  26. 26. Fotini C, Larreguy H, et al. Can media campaigns empower women facing gender-based violence amid COVID-19? Toulouse School of Economics, 2022.
  27. 27. Stephenson R, Bartel D, et al. Constructs of power and equity and their association with contraceptive use among men and women in rural Ethiopia and Kenya. Global public health. 2012;7(6):618–34. pmid:22568536
  28. 28. Ellsberg M, Jansen HA, et al. Intimate partner violence and women’s physical and mental health in the WHO multi-country study on women’s health and domestic violence: an observational study. Lancet. 2008;371(9619):1165–72. pmid:18395577
  29. 29. Smith BA, Thompson S, et al. Development of the intimate partner violence attitude scales (IPVAS) with a predominantly Mexican American college sample. Hispanic Journal of Behavioral Sciences. 2005;27(4):442–54.
  30. 30. UNICEF. Case Study: RapidPro. 2019.
  31. 31. Parker EM, Gielen AC. Intimate partner violence and safety strategy use: Frequency of use and perceived effectiveness. Women’s health issues. 2014;24(6):584–93. pmid:25442703
  32. 32. Boyer C, Levy Paluck E, et al. Religious leaders can motivate men to cede power and reduce intimate partner violence: Experimental evidence from Uganda. Proceedings of the National Academy of Sciences. 2022;119(31):e2200262119. pmid:35905318
  33. 33. Donati D, Orozco-Olvera V, et al. Using Social Media to Change Gender Norms: An Experimental Evaluation Within Facebook Messenger in India. Available at SSRN 4158065. 2022.
  34. 34. Storer HL, Nyerges EX, et al. Technology “Feels Less Threatening”: The processes by which digital technologies facilitate youths’ access to services at intimate partner violence organizations. Children and youth services review. 2022;139:106573.
  35. 35. Henry N, Powell A. Technology-facilitated sexual violence: A literature review of empirical research. Trauma, violence, & abuse. 2018;19(2):195–208. pmid:27311818
  36. 36. Sanz-Barbero B, Barón N, et al. Prevalence, associated factors and health impact of intimate partner violence against women in different life stages. PLoS one. 2019;14(10):e0221049. pmid:31596869
  37. 37. Das-Munshi J, Lund C, et al. Mental health inequalities in adolescents growing up in post-apartheid South Africa: cross-sectional survey, SHaW study. PloS one. 2016;11(5):e0154478. pmid:27139456
  38. 38. Hatcher AM, de Kadt J, et al. Experiences of violence and safety. In: de Kadt J, Hamann C, Mkhize SP, Parker AE, editors. Quality of Life Survey 6 (2020/21):Overview Report. Johannesburg: Gauteng City-Region Observatory (GCRO); 2021.
  39. 39. Hamari J, Koivisto J, et al., editors. Does gamification work?—a literature review of empirical studies on gamification. 2014 47th Hawaii international conference on system sciences; 2014: Ieee.
  40. 40. Butler M, Pyzdrowski L, et al. The effects of feedback on online quizzes. International Journal for Technology in Mathematics Education. 2008;15(4).
  41. 41. Datta S, Burns J, et al. Risking it all for love? Resetting beliefs about HIV risk among low-income South African teens. Journal of Economic Behavior & Organization. 2015;118:184–98.
  42. 42. La Ferrara E, Chong A, et al. Soap operas and fertility: Evidence from Brazil. American Economic Journal: Applied Economics. 2012;4(4):1–31.
  43. 43. Banerjee A, La Ferrara E, et al. The entertaining way to behavioral change: Fighting HIV with MTV. National Bureau of Economic Research, 2019.
  44. 44. Pulerwitz J, Barker G, et al. Promoting more gender-equitable norms and behaviors among young men as an HIV/AIDS prevention strategy. 2006.