Acceptability of an Embodied Conversational Agent for Type 2 Diabetes Self-Management Education and Support via a Smartphone App: Mixed Methods Study

Original Paper

¹The University of Melbourne, Melbourne, Australia

²The Australian Centre for Behavioural Research in Diabetes, Melbourne, Australia

³The University of Queensland, Brisbane, Australia

⁴see Authors' Contributions

Corresponding Author:

Shaira Baptista, BSc, PGDipPsyc

The University of Melbourne

207 Bouverie Street

Carlton

Melbourne

Australia

Phone: 61 3 8344 4037

Email: shaira.baptista@unimelb.edu.au

Background: Embodied conversational agents (ECAs) are increasingly used in health care apps; however, their acceptability in type 2 diabetes (T2D) self-management apps has not yet been investigated.

Objective: This study aimed to evaluate the acceptability of the ECA (Laura) used to deliver diabetes self-management education and support in the My Diabetes Coach (MDC) app.

Methods: A sequential mixed methods design was applied. Adults with T2D allocated to the intervention arm of the MDC trial used the MDC app over a period of 12 months. At 6 months, they completed questions assessing their interaction with, and attitudes toward, the ECA. In-depth qualitative interviews were conducted with a subsample of the participants from the intervention arm to explore their experiences of using the ECA. The interview questions included the participants’ perceptions of Laura, including their initial impression of her (and how this changed over time), her personality, and human character. The quantitative and qualitative data were interpreted using integrated synthesis.

Results: Of the 93 intervention participants, 44 (47%) were women; the mean (SD) age of the participants was 55 (SD 10) years and the baseline glycated hemoglobin A1c level was 7.3% (SD 1.5%). Overall, 66 of the 93 participants (71%) provided survey responses. Of these, most described Laura as being helpful (57/66, 86%), friendly (57/66, 86%), competent (56/66, 85%), trustworthy (48/66, 73%), and likable (40/66, 61%). Some described Laura as not real (18/66, 27%), boring (26/66, 39%), and annoying (20/66, 30%). Participants reported that interacting with Laura made them feel more motivated (29/66, 44%), comfortable (24/66, 36%), confident (14/66, 21%), happy (11/66, 17%), and hopeful (8/66, 12%). Furthermore, 20% (13/66) of the participants were frustrated by their interaction with Laura, and 17% (11/66) of the participants reported that interacting with Laura made them feel guilty. A total of 4 themes emerged from the qualitative data (N=19): (1) perceived role: a friendly coach rather than a health professional; (2) perceived support: emotional and motivational support; (3) embodiment preference acceptability of a human-like character; and (4) room for improvement: need for greater congruence between Laura’s words and actions.

Conclusions: These findings suggest that an ECA is an acceptable means to deliver T2D self-management education and support. A human-like character providing ongoing, friendly, nonjudgmental, emotional, and motivational support is well received. Nevertheless, the ECA can be improved by increasing congruence between its verbal and nonverbal communication and accommodating user preferences.

Trial Registration: Australian New Zealand Clinical Trials Registry CTRN12614001229662; https://tinyurl.com/yxshn6pd

JMIR Mhealth Uhealth 2020;8(7):e17038

doi:10.2196/17038

Keywords

embodied conversational agent; type 2 diabetes; mobile apps; mHealth; smartphone; self-management; mobile phone

Diabetes will affect 693 million people worldwide by 2045, most of whom will have type 2 diabetes (T2D) [1,2]. People with T2D can prevent or delay the onset and progression of diabetes-related complications such as heart attack, stroke, kidney failure, vision loss, and nerve damage through intensive management of blood glucose levels [3]. However, effective self-management is complex and difficult to implement and sustain in daily life. Consequently, many people with T2D are not able to achieve their recommended self-management targets [4].

For several decades, diabetes self-management education and support have been provided in person (one-to-one and group-based), with many trials and real-world studies demonstrating improved diabetes outcomes [5]. However, the high cost and resource requirements limit the reach and scalability of in-person programs [6]. Furthermore, ongoing in-person support for sustaining the recommended diabetes care targets is not feasible for most health care systems [4].

Considerable advances in technology related to smartphone apps (including voice recognition, natural language processing, and artificial intelligence capabilities) have led to an increase in the feasibility of using embodied conversational agents (ECAs) to provide education and support for the self-management of chronic conditions, including T2D [7]. An ECA is an animated conversational human-like character that simulates person-to-person conversation with appropriate dialog and human-like physical properties, including facial expressions and body movements [8-10]. ECAs are increasingly being used in a wide range of apps, providing support for mental health, web-based information seeking, medication taking, behavior change, and prevention of suicide [7,10-13].

Research on the acceptability of ECAs in self-management of chronic conditions is still in its infancy, with a small number of studies reporting high levels of acceptability of ECA-based interventions [13-15]. Trust, empathy, and expertise have been cited as essential components of diabetes education and support [16]. Similar expectations may exist when the intervention is delivered by an ECA. ECAs use facial expressions, body movements, and speech and can offer a natural and accessible means of communication. These characteristics of ECAs potentially improve engagement compared with a static character image, a nonrelational agent, or a text-only display [9,17,18]. ECAs may be perceived to provide additional motivational and emotional support, which has previously been described by people with diabetes as being as important to them as practical support [19]. Preliminary evidence suggests that ECAs are perceived to be less judgmental, less intimidating, and more likable than a human counterpart, resulting in participants feeling less guilty and more motivated by the interaction [13,14,17,18]. Collectively, this evidence suggests that ECAs may be effective in providing support for chronic disease management as they help to engage users by building a social and emotional relationship over time [9,18,20].

Some of the characteristics of ECAs that may affect their acceptability include users being deterred by a monotonous voice and repetitive messages [13,18,20]. Although ECAs are more engaging if they have human-like characteristics and engage in social dialog, this effect is mitigated if there is an unnatural dissonance between a character’s speech and the expected facial expressions and body movements of the ECA [17,21-23]. This phenomenon, coined the uncanny valley by Masahiro Mori in 1970, was supported by research suggesting that people have unpleasant impressions of artificial characters, such as ECAs that have an almost, but not perfectly, realistic human appearance [24,25]. Previous studies have also emphasized that the visual characteristics of an ECA are important as these affect the perceptions of trustworthiness and credibility, which can affect acceptability. For example, a more playful, cartoon-like character is perceived as being more friendly, whereas a more serious human-like character, dressed professionally, is usually perceived to be more appropriate for serious apps, such as self-management of chronic conditions [26].

Research on the acceptability of ECAs to deliver self-management support for chronic conditions via self-management apps has been limited primarily to short-term feasibility or pilot studies and to interventions that address only a single behavior. Other studies use static images rather than animations or have been conducted using desktop or laptop computers in laboratory settings rather than with personal smartphones used in everyday settings or in the wild [27,28]. Thus, this study aimed to address these gaps by investigating the acceptability of an ECA delivering self-management education and support to people with T2D in their everyday lives.

Study Design

A convergent study design was used where quantitative and qualitative data were collected at similar time points [29]. This study was conducted within the context of a randomized controlled trial to test the effectiveness of a T2D self-management smartphone app, My Diabetes Coach (MDC) [30,31]. The MDC study was conducted from 2014 to 2018 (Australia New Zealand Clinical Trials Registry ID ACTRN12614001229662). The study was approved by the University of Melbourne’s Human Research Ethics Committee (ethics ID 1442433).

My Diabetes Coach

MDC used an ECA called Laura (Figure 1) to deliver self-management education and support to adults with T2D. When users logged in for the first time, they were prompted to set up a regular time to complete weekly interactive sessions with Laura. During these conversations, Laura provided education, feedback and motivational support for blood glucose level monitoring, taking medication, physical activity, healthy eating, and foot care. The conversations were personalized to the individual’s self-management targets, physical fitness, and foot health using recommendations provided by his or her general practitioner.

The MDC app used voice recognition, prescripted conversational elements, and a sophisticated script logic enabling the user to interact with Laura in several predetermined variations, mimicking natural conversations. Laura’s voice and conversation were produced by a proprietary dialog engine (by Clevertar). Nonverbal behaviors were either explicitly scripted for each dialog, or, if no behavior was specified, they were selected randomly from a finite set of animations based on whether the character was speaking and the dialog duration. User responses from previous sessions dictated the direction of future sessions, enabling a high degree of personalization. The ECA’s appearance, conversational elements, back story, and accent were refined through several rounds of expert and user testing. Users were able to respond to Laura by touching an option on the screen or by speaking out one of the options on the screen when prompted to do so. Users also had access to a web-based discussion board and website (with additional diabetes resources) that could be accessed via the app as well as technical support from the research team. An excerpt from a conversation with Laura can be found on YouTube [32].

Figure 1. Laura, the embodied conversational agent.

Participants

Recruitment methods for the MDC trial are reported in the main outcomes paper (under review). Briefly, participants were recruited to the MDC trial from the general population in Australia via several recruitment strategies. Adults with T2D registered on the National Diabetes Services Scheme (NDSS) database; willing to be contacted about research; and living in New South Wales, Queensland, Victoria, and Western Australia were invited to participate by the NDSS via mail and email. The invitation letters were supplemented with media releases and targeted advertising on social media by several organizations (Diabetes New South Wales; Diabetes Queensland; Diabetes Victoria; Diabetes Western Australia; Bupa Australia, a health insurance provider; and the Australian Diabetes Educators Association).

For this study, participants from the intervention arm of the MDC trial, who had access to the MDC app, completed a survey at 6 months postbaseline, assessing several clinical and behavioral outcomes, including their interaction with the ECA, and a purposive subsample participated in subsequent interviews. All participants received a plain language statement describing the study and provided written consent.

Data Collection

Demographic and Clinical Characteristics

The demographics and duration of diabetes (self-reported) were collected using web-based surveys at baseline. Glycated hemoglobin A_1c (HbA_1c) is a pathology test assessing average blood glucose levels over the past 2 to 3 months, providing an indication of risk for long-term complications [33]. It was obtained, with participants’ consent, from their general practitioner.

Acceptability: Quantitative Data

At the 6-month follow-up, the participants completed a web-based survey that included 2 questions assessing the acceptability of the ECA. The first question assessed the perceptions of the ECA: “How well do the words below describe Laura?” The respondents rated a range of positive and negative traits (helpful, boring, friendly, competent, annoying, likable, trustworthy, and real) using a 5-point Likert-type scale ranging from describes very well to describes very poorly. The second question asked, “How did interacting with Laura make you feel?” The respondents selected from a list of descriptive emotions (happy, confident, hopeful, motivated, worried, guilty, frustrated, and comfortable) and were asked to select all that applied. For both questions, positive and negative words were randomly sequenced to minimize response bias. The descriptive adjectives were chosen based on the literature on evaluating ECAs and on working alliances between ECAs and users [34].

Acceptability: Qualitative Data

In-depth, semistructured qualitative interviews were conducted from October 2017 to February 2018. Most participants had, at that stage, completed the 6-month survey but were still actively using the app. Purposive sampling of survey respondents was used to identify interviewees who varied in terms of the duration of diabetes, gender, age, and baseline familiarity with apps.

The interview guide was developed by the first author (SB) and used exploratory questions and probes, with feedback from other members of the research team (BO, GW, and JS) based on the research question and findings from the current literature [8,9,14,18,21-23,26,27,35-38]. The guide explored a variety of topics, including experience at diagnosis; self-care behavior before using the MDC app; users’ experiences with the MDC app, including when, where, and how it was used; changes to self-management practices as a result of using the MDC app; initial impression of the ECA Laura and changes over time; perceptions of her role in self-management, and her perceived personality characteristics. The data relating to the acceptability of Laura are presented here, with the other findings published separately.

The interviews were conducted by telephone (by SB) and recorded using a cloud architecture solution from the CTI Group using their SmartInteraction Suite of recording software. During each interview, SB used exploratory questions and probes (from the interview guide) and noted points of interest, using these as further probes. Immediately after each interview, SB prepared a written summary of the interview and any relevant observations. These were used to communicate interim findings to the research team. When appropriate, additional questions were added to the interview guide, enabling further exploration of the issues raised by participants that were relevant to the research aims. These notes were also used to aid in the meaningful interpretation of data during data analysis.

Data Analysis

The quantitative data were analyzed using IBM’s SPSS Statistics 25 package. Descriptive statistics were computed for demographic and clinical characteristics and 2 questions assessing the ECA. The qualitative data were transcribed, deidentified, and thematically analyzed using NVivo 11, following the first 5 steps of Braun and Clarke’s methodology [39,40]. The integration of the quantitative and qualitative data was achieved at the interpretation stage by comparing the findings from the surveys and the semistructured interviews. In practice, this involved referring to and using the qualitative data to help interpret, triangulate, and add meaning to the quantitative data. This process was iterative, with input from several researchers (SB, GW, BO, and JS). This integration of quantitative and qualitative data enabled further validation of the findings and increased their explanatory value [41]. The narrative of the results is blended with embedded quotes from several sources to make the results more readable while using as much evidence as possible. An anonymized coding system—participant identity number (IDX): sex (male, M; female, F): age (years)—was used to identify the source of each quote (in parentheses after each quote).

Sample Characteristics

Of the 93 MDC trial participants in the intervention arm, 66 (71%) participants provided responses at 6 months postbaseline, and 19 of these participated in the interviews. Table 1 details the characteristics of the 3 samples. Overall, 50% (33/66) of the survey respondents were women, and the mean age of the participants was 57 (SD 9) years and the mean baseline HbA_1c level was 7.1% (SD 1.4%) [33].

Those who completed the survey were significantly older (P=.03) and completed more interactions with Laura (P=.001) than those who did not complete the survey. No significant differences were observed between the interviewees and other participants in the intervention arm, except that the interviewees completed significantly more interactions with Laura (P=.001).

The mean duration of the interviews was 51 min (range 29-79 min).

Overall, participants found Laura to be acceptable and were positive in their appraisal of her and their interactions with her. Most respondents agreed that Laura was helpful (57/66, 86%), friendly (57/66, 86%), competent (56/66, 85%), trustworthy (48/66, 73%), and likable (40/66, 61%). Some participants described her as boring (26/66, 39%) and annoying (20/66, 30%; Multimedia Appendix 1). Participants were undecided about whether or not they thought Laura was realistic. Of the 66 participants, 26 (39%) participants agreed that Laura was real, 22 (33%) were undecided, and 18 (27%) disagreed. The participants’ responses to their interactions with Laura were positive overall, with many reporting that she made them feel motivated (29/66, 44%), comfortable (24/66, 36%), confident (14/66, 21%), happy (11/66, 17%), and hopeful (8/66, 12%). Notably, 20% (13/66) were frustrated by their interaction with Laura, and 17% (11/66) of the participants reported that interacting with Laura made them feel guilty. One participant reported feeling worried (Multimedia Appendix 2).

Overall, 4 themes were identified from the qualitative data: (1) perceived role—a friendly coach rather than a health professional; (2) perceived support—emotional and motivational support; (3) embodiment preference—acceptability of a human-like character; and (4) room for improvement—need for greater congruence between Laura’s words and actions. Table 2 provides an integrative synthesis of the findings, summarizing the 4 main themes emerging from the qualitative data, quantitative endorsement of the adjectives describing Laura and how the interaction made the participants feel, and exemplars of the qualitative data. The 4 themes are described in detail below.

Table 1. Demographic and clinical characteristics of the total sample and interviewed sample.

Participant Characteristics		My Diabetes Coach trial population (intervention arm; n=93)		Six-month follow-up sample (n=66)		Interviewed participants (n=19)
Gender (female), n (%)		44 (47)		33 (50)		8 (42)
Age (years), mean (SD)		55 (10)		57 (9)		60 (8)
Education (highest level), n (%)
	Year 10		10 (11)		9 (14)		5 (26)
	Year 12 or apprentice		42 (45)		31 (47)		2 (11)
	Graduate or post graduate		41 (44)		26 (39)		12 (63)
Employment status, n (%)
	Paid employment		59 (63)		41 (62)		7 (37)
	Retired		22 (24)		18 (27)		11 (58)
	Unemployed or other		12 (13)		7 (11)		1 (5)
Duration of diabetes (years), n (%)
	<5		43 (46)		25 (38)		8 (42)
	5-10		29 (31)		23 (35)		8 (42)
	10-20		7 (8)		4 (6)		3 (16)
	Unknown		14 (15)		14 (21)		0 (0)
Baseline glycated hemoglobin A_1c (%), mean (SD)		7.3 (1.5)		7.1 (1.4)		6.8 (0.9)
Baseline glycated hemoglobin A_1c (mmol/mol), mean (SD)		56 (44)		53 (30)		51 (20)
General app usage^a (reported at baseline), n (%)
	Multiple times per day		69 (74)		50 (76)		14 (74)
	Once a day		23 (25)		13 (20)		4 (21)
	Less than once a day		1 (1)		3 (5)		1 (5)
Total interactions with Laura, mean (SD)		18 (15)		23 (16)		36 (17)

^aGeneral app usage at baseline represents the use of any app before participating in the My Diabetes Coach trial.

Table 2. Integrated results matrix.

Themes	Quantitative data: endorsement of adjectives	Qualitative data: exemplar quotes
Perceived role: Laura is more acceptable as a friendly coach than as a health professional	Laura was likable, n=40 (61%), friendly, n=57 (86%), and helpful, n=57 (86%) ‎ Interacting with Laura made me feel comfortable, n=24 (36%) ‎ Interacting with Laura made me feel guilty, n=11 (17%), and worried, n=1 (1%) ‎	“A ‘neutral approach’ was ‘better’ because it ‘didn’t try and lean on any perceptions of authority.’” [ID04: M^a: 44 years] ‎ “I was worried about making sure that I was within [my limits] knowing that I had to report to Laura!” [ID11: F^b: 62 year] ‎
Perceived support: Laura provides emotional and motivational support	Laura was trustworthy, n=48 (73%). Interacting with Laura made me feel confident, n=14 (21%), hopeful, n=8 (12%), and happy, n=11 (17%) ‎ Laura was competent, n= 56 (85%). Interacting with Laura made me feel motivated, n=29(44%) ‎	“I needed somebody just to be there.” (ID15: F: 66 years) ‎ “(She) used to make me laugh...and that’s hard to do.” [ID18: M: 65 years] ‎ “She was keeping you on track and keeping you doing what you’re supposed to be doing.” [ID16: F: 57 years] ‎
Character preference: Laura is engaging and her human-like character is appropriate	Laura was helpful, n=57 (86%) ‎ Laura was competent, n=56 (85%), and trustworthy, n=48 (73%). Laura made me feel confident, n=14 (21%), and comfortable, n=24 (36%) ‎	“Instead of reading it, you’re hearing it and can read at the same time. Instead of just hearing some voice, you’re actually seeing [Laura] talk.” [ID05: M: 55 years] ‎ “I’m not sure I would have given the same level of credibility to, for example, a dog or a cat or something like that.” [ID04: M: 44 years] ‎
Room for improvement: dissonance between Laura’s words and actions	Laura was annoying, n=20 (30%), boring, n=26 (39%), and not real, n=18 (27%) ‎ Interacting with Laura made me feel frustrated, n=13 (20%) ‎	“She said something, but her hand gestures were exactly the opposite of what they should have been. Like, rather than a big gesture, where a big gesture is needed, there was a little gesture.” [ID08: F: 42 years] ‎

^aM: male.

^bF: female.

Theme 1: Perceived Role—Laura Is More Acceptable as a Friendly Coach Than as a Health Professional

When prompted about what role Laura was perceived to play in self-management support, some participants described Laura as “a ‘friendly’ coach” (ID11: F: 62 years) who was just “reminding me” of various diabetes self-management tasks. Furthermore, when asked about their perceptions of Laura, some participants described her with adjectives suggesting that she had a personality, such as “sassy” (ID15: F: 66 years), “friendly” (ID16: F: 57 years), “kind” (ID05: M: 55 years), and “intriguing” (ID06: M: 71 years). These findings may explain why most survey respondents described Laura as likable, friendly, and helpful and reported that interacting with Laura made them feel comfortable.

Conversely, other participants commented on how Laura reminded them of their health professional: “There were times when I would go and see my doctor, and I’d see Laura sitting there, because her gestures, her voices, and mannerisms are almost identical” (ID08: F: 42 years). Some participants “did not necessarily want to see an authority figure” (ID1: M: 63 years), saying, for example, that “I don’t need to be called into a doctor” (ID09: M: 71 years). Those who described Laura in similar terms to their health professionals did not warm up to Laura as they found her to be “patronizing,” “censorious,” and “authoritarian” (ID11: F: 62 years). For example, some described receiving her feedback as “having a mother-in-law in your pocket” (ID08: F: 42 years) and “feeling as though you’re getting a slap on the wrist” (ID02: F: 66 years) similar to “a recalcitrant child” (ID15: F: 66 years). Other negative descriptions of Laura were that she was “really young,” “super-skinny” (ID11: F: 62 years), and that she “talked at” people (ID13: M: 58 years).

Laura’s perceived role influenced the participants’ reactions to the support she offered. For example, participants who described Laura as being similar to a health professional reacted to this by “resisting” and “rebelling” against the “kind of authority” (ID11: F: 62 years) that Laura represented to them. One participant described how “feeling guilty” led him to “stop using” the app for a while (ID13: M: 58 years). Another participant commented on how she worried about negative feedback: “I was worried about making sure that I was within [my limits] knowing that I had to report to Laura!” (ID11: F: 62 years) Finally, one participant contemplated selecting her best readings to report to Laura to avoid “getting told off,” saying:

Do I record this one? It might be a bit high and she’s going to get upset with me.
[[ID15: F: 66 years]]

Conversely, those who perceived Laura to be less of an authority figure and more like a friendly coach as she “didn’t try and lean on any perceptions of authority, like for example, having a doctor in a white coat” were also more receptive to the support she offered. This is because they perceived her as having a more “neutral approach,” which was “better” because “a conversation between peers is more likely to be engaged with than one that references levels of authority” (ID04: M: 44 years).

The varied reactions of the participants to Laura may be linked to the inconsistencies between how Laura looked and how they expected her to act. For example, one participant said:

It's set up with this young, groovy woman who's going to help me, but she sounded like my GP who was telling me what to do. So, it's a kind of disconnect between how [Laura] looks and what she's actually saying.
[ID13: M: 58 years]

Finally, some participants described Laura’s role as an artificial entity as a positive trait, making them more receptive to receiving support from her. This is because they experienced judgment and blame for their condition from “real” people:

From the minute they meet you, just by the look of you, by the look of your appearance, they will judge you. That's one thing I don't like about real people because it happened to me.
[ID03: F: 62 years]

Theme 2: Perceived Support—Laura Provides Emotional and Motivational Support

For many, Laura provided emotional support that the participants did not otherwise have:

I needed somebody just to be there. I see the hospital doctors every six months, I only see my local doctor when I need scripts or something. Apart from that, who do you talk to?
[ID15: F: 66 years]

Supporting this premise is the fact that many survey respondents thought that Laura was trustworthy, and interacting with her made them feel confident. Another example of how Laura provided emotional support is described by one participant who expressed how her humor helped him feel better:

[She] used to make me laugh when she used to stand there with her hands on her hips waiting for me sometimes. Like my wife is saying it was probably good because if you felt down or something it made you feel better. Well it definitely bought a smile to may face a lot of times and my wife said that’s hard to do.
[ID18: M: 65 years]

A small number of survey respondents reported that interacting with Laura made them feel happy, demonstrating some support for the premise that she may have helped alleviate some of the burden of care.

Laura also provided additional motivation through enhanced monitoring and positive reinforcement:

She was keeping you on track and keeping you doing what you're supposed to be doing and keeping you doing the check-ups and that sort of stuff.
[ID16: F: 57 years]

When I was doing the exercise section, she would ask for me to record how much exercise I was doing for the week and when I’d come back [and do it], I actually almost got a pat on the back from her. I wasn't trying to be impressive for (Laura), but I think it just gave you that little bit more incentive.
[ID01: M: 63 years]

Similarly, many survey respondents reported feeling more motivated after their interactions with Laura.

Theme 3: Character Preference—Laura Is Engaging and Her Human-Like Character Is Appropriate

Interacting with Laura provided an additional dimension to the relational aspect of communication, resulting in reports of improved engagement:

instead of reading it, you're hearing it and can read at the same time. Instead of just hearing some voice, you're actually seeing [Laura] talk.
[ID05: M: 55 years]

Participants appreciated this additional dimension of communication, describing it as an attempt to “try and engage with you” and compared it with other apps where “you’re inputting information and you might get a summary,” but there was no “attempt to interact back with the user” (ID10: M: 49 years):

Laura was more personal so that's why I think I went on for the six months. The other apps were like just an impersonal graph or something, or just boxes where you put the things in.
[ID07: F: 67 years]

Some participants expressed a strong preference for Laura’s human-like character. Diabetes was described as “a human problem that should have a bit of stance and a bit of professionalism” (ID06: M: 71 years). Others thought that a nonhuman character such as a “fuzzy duck” or “Dobby the diabetes elf” (ID11: F: 62 years) would be better as it would be more “fun.” For these people, having “a character, even a fictitious character” was more “user friendly” and better than having “nothing there” (ID07: F: 67 years).

Participants who preferred a human-like character did not think a cartoon character could be taken seriously: “I’m not sure I would have given the same level of credibility to, for example, a dog or a cat or something like that” (ID04: M: 44 years). Two users put it as follows:

A cute puppy telling you that you got to exercise more or, you know, eat more greens, is going to be less convincing than a human. It just becomes a toy. Stick with somebody that looks like they know what they’re talking about. [Laura] fitted that bill.
[ID17: M: 66 years]

[A nonhuman character] would just make me want to throw the phone away completely! Because it's about a human interaction with someone who has information and resources about diabetes.
[ID13: M: 58 years]

However, there were those who did not care about what kind of character Laura was because she was “an inanimate object, not a person” (ID19: M: 59 years). Some did not “identify with or warm to Laura.” One participant said, “Laura had various statements [that were motivational] but I don’t have a relationship with Laura that caused me to value her opinion” (ID04: M: 44 years). Another participant said that although she “learnt from” Laura:

it’s not like if you went to your GP and you got your bloods done and it was physically down from the last six months, that’s a tangible quantity, but when it’s coming from an avatar, it didn’t really mean anything much.
[ID12: F: 61 years]

Some participants described being irritated by Laura’s “life” story, for example, when she said “I find that my family does such-and-such,” because she was pretending to be something she was not: “Don’t try and put it over me that this is a real person that I’m talking to” (ID16: F: 57 years). However, others liked Laura’s backstories:

Yeah, even though it's not real, but the way she talks about her kids and things like that. [I liked that] because it is more human.
[ID06: M: 71 years]

Theme 4: Room for Improvement—A Dissonance Between Laura’s Words and Actions

When prompted about Laura’s appearance, speech, and mannerisms, the interviewed participants described Laura as being “just another robot” (ID09: M: 71 years) that they “could not connect to” as she was “not human enough.” Interacting with Laura, for many, depended on “how far along are you going to pretend.” As one participant put it, “I couldn't suspend belief that Laura wasn't this algorithm working out what she needed to say to me or not say to me” (ID13: M: 58 years).

The primary reason given for this perception was Laura’s “monotone” (ID08: F: 42 years) voice that sounded similar to a “mechanised reading mechanism” with “a strange cadence and inflection to some of her sentences” (ID11: F: 62 years). Another reason was her “artificial movements” (ID14: M: 66 years) and dissonance between what was being said and her body movements. According to one user:

She said something, but her hand gestures were exactly the opposite of what they should have been. Like, rather than a big gesture, where a big gesture is needed, there was a little gesture.
[ID08: F: 42 years]

This may have been the reason why some survey respondents reported feeling frustrated after interacting with Laura and why a reasonable proportion of participants described Laura as boring, annoying and not real, or were undecided about these descriptions.

Although it seems as though Laura was not an entirely successful ECA, participants were willing to overlook her shortcomings as they understood the intention behind Laura and appreciated the effort made to make her engaging: they were willing to “cut them some slack” because “at least it’s trying to be personable.” Moreover, “They’re trying to make her look [real]—I can understand what they’re trying to do” (ID17: M: 66 years).

Principal Findings

Overall, the results suggest that an ECA is acceptable to people with T2D for the delivery of long-term self-management education and support. We found that people with T2D were willing to make compromises and adjust their expectations, while appreciating the effort of trying to create something more appealing and engaging than graphs and numbers on a screen. This implies that the increased interaction offered by an ECA may be valuable to users and a worthy avenue for developers to pursue when designing apps for people with chronic conditions such as diabetes [20].

Our findings corroborate earlier research suggesting that some users perceive an ECA to be less judgmental and more likable than a human counterpart [13,14]. This is an important finding as people with T2D often experience diabetes-related stigma, the consequences of which can include disengagement with or suboptimal self-care and diabetes-related health outcomes [42]. Suitably designed ECA support may be especially important in making people who experience diabetes-related stigma feel less judged and more open to sharing difficulties with self-management, thereby potentially increasing their engagement with appropriate self-care [43]. We also suggest that using supportive, nonblaming language is critical when designing ECAs for stigmatized conditions such as diabetes [44].

The results suggest several mechanisms through which an ECA may help establish and maintain a relationship with the user over time, such as increasing relational communication, providing ongoing emotional support and motivation, and alleviating some of the burden associated with chronic disease management through humor [18]. Our results suggest that another way to improve acceptability is to achieve a better match between an ECA’s appearance and users’ expectations of the ECA’s perceived role. For example, some participants expressed that diabetes is a serious human issue and viewed human-like characteristics as being more credible. Others expressed the desire to alleviate the burden of management by incorporating a fun character, supporting previous findings of a similar nature [26]. These varying opinions may reflect the nascent nature of the field and the fact that ECAs are not yet common.

Another related finding was that participants who perceived Laura to be a friendly coach were more open to receiving support from her when compared with those who perceived Laura to be similar to a health professional. The implication is that an ECA with a relaxed, friendly approach may be more successful in building a supportive relationship than an ECA that adopts a more authoritative role. Future attempts to develop ECAs for diabetes management could accommodate both viewpoints by striking a balance with a human-like, friendly, approachable character and avoiding patronizing messaging and mannerisms. More research is necessary to determine how expectations of users on the role that an ECA plays in self-management varies and how this informs their preference for the ECA’s character.

Another important consideration is just how human an ECA should be. Although participants reported a clear preference for a human-like character, which is supported by previous research [15], her presentation of a backstory might be a step too far as it did not seem credible to some participants, possibly because of an uncanny valley phenomenon [25]. This finding is supported by previous research on other relational agents whose personality traits and life stories are enjoyed by some users, whereas, to others, the attempt at making them too human-like is not appealing [18,45]. It will be interesting to explore attitudinal changes toward ECAs with personality as they become more common, and familiarity increases.

Our findings add to the mounting evidence that suggests that perfecting natural communication via congruence between verbal and nonverbal communication is critical to improving acceptability [20]. Nonverbal cues such as facial expressions, gaze, gestures, postures, and body movements have a deep impact on the process and outcome of communication, with approximately 65% of social meaning derived from nonverbal behavior [46]. Laura’s mannerisms and body movements were the main basis on which she had particular personality traits attributed to her, ranging from patronizing and censorious to funny and sassy. Although the MDC app attempted to create an ECA with natural communication, this effort was impeded by difficulties using the speech recognition function; lack of inflection in Laura’s voice; and a limited number of random body movements, rather than ones that match the context of the conversation. Understanding natural behaviors, biological processes that underlie them, and creating efficient algorithms to implement a convincing simulation via an ECA is challenging but critical to the success of future ECA-based self-management support [15,35].

Strengths and Limitations

This mixed methods study is one of the first to explore users’ experiences of a sophisticated ECA in a real-world setting over a 6-month period and offers several novel findings and suggestions for future research. Although conducted within the context of a randomized controlled trial, our participants used the app in the context of their everyday lives, which is a strength of the research. The mixed methods approach provides robust evidence based on responses from a wide range of participants. However, people who were retired, highly educated, and engaged with the app were overrepresented in the interviewed sample of participants.

Conclusions

The importance of the relational aspect of agents for health care is becoming an increasingly prominent theme in the literature. Our study adds to this literature by describing the long-term experiences of people using an ECA for diabetes self-management support and making recommendations for improvements and future research. These findings suggest that ECAs play a promising role in self-management support and education. However, accommodating user preferences and expectations of the role that an ECA may play in self-management and improving their natural communication are key to their success.

Acknowledgments

The MDC randomized controlled trial was conducted with funding from a National Health and Medical Research Council partnership grant (ID1057411), with additional financial and in-kind support provided by Diabetes Australia, Diabetes Queensland, Diabetes Victoria, Diabetes Western Australia, and Roche Diabetes Care. The development of the MDC app was the result of a collaboration among the University of Melbourne, Bupa Australia, The Bupa Foundation, and Clevertar. SB is supported by a postgraduate scholarship from the National Health and Medical Research Council, Australia, and Diabetes Australia. JS is supported by core funding to the Australian Centre for Behavioural Research in Diabetes, derived from the collaboration between Diabetes Victoria and Deakin University.

Authors' Contributions

BO conceived the MDC study and developed the MDC research program together with JS, DB, and the MDC research group (Emily D Williams, Michaela A Riddell, Paul A Scuffham, and Anthony Russell). SB developed the interview schedule and survey questions (with BO and JS). SB conducted the interviews, collected the survey responses, analyzed and interpreted the data, and prepared the first draft of the manuscript. GW analyzed some of the data. GW, BO, DB, and JS interpreted the data and reviewed and edited the manuscript for critical content. All authors approved the final version of the manuscript. The authors also thank Dr Mandy Cassimatis, Dr Fiona Cocker, Enying Gong, Prof Mark Harris, Anna Scovelle, Suman Shetty, Jillian Zemanek and Robin Zhou for their contributions to the MDC project.

Conflicts of Interest

BO and DB received some royalty payments for the development of the scripts for the MDC platform.

‎

Multimedia Appendix 1

Responses to question Q1: How well do the words below describe Laura? (N=66 ).

PNG File , 70 KB

‎

Multimedia Appendix 2

Responses to question Q2: How did interacting with Laura make you feel? (N=66 ).

PNG File , 65 KB

Cho N, Shaw J, Karuranga S, Huang Y, da Rocha JD, Ohlrogge A, et al. IDF diabetes atlas: global estimates of diabetes prevalence for 2017 and projections for 2045. Diabetes Res Clin Pract 2018 Apr;138:271-281. [CrossRef] [Medline]
Hu FB. Globalization of diabetes: the role of diet, lifestyle, and genes. Diabetes Care 2011 Jun;34(6):1249-1257 [FREE Full text] [CrossRef] [Medline]
World Health Organization. Global Report on Diabetes. Geneva, Switzerland: World Health Organisation; 2016.
Shrivastava S, Shrivastava P, Ramasamy J. Role of self-care in management of diabetes mellitus. J Diabetes Metab Disord 2013 Mar 5;12(1):14 [FREE Full text] [CrossRef] [Medline]
Chrvala CA, Sherr D, Lipman RD. Diabetes self-management education for adults with type 2 diabetes mellitus: a systematic review of the effect on glycemic control. Patient Educ Couns 2016 Jun;99(6):926-943 [FREE Full text] [CrossRef] [Medline]
Chatterjee S, Davies M, Heller S, Speight J, Snoek F, Khunti K. Diabetes structured self-management education programmes: a narrative review and current innovations. Lancet Diabetes Endocrinol 2018 Feb;6(2):130-142. [CrossRef] [Medline]
Laranjo L, Dunn AG, Tong HL, Kocaballi AB, Chen J, Bashir R, et al. Conversational agents in healthcare: a systematic review. J Am Med Inform Assoc 2018 Sep 1;25(9):1248-1258 [FREE Full text] [CrossRef] [Medline]
Bente G, Rüggenberg S, Krämer NC, Eschenburg F. Avatar-mediated networking: increasing social presence and interpersonal trust in net-based collaborations. Human Comm Res 2008 Apr;34(2):287-318. [CrossRef]
Bickmore TW, Caruso L, Clough-Gorr K, Heeren T. ‘It's just like you talk to a friend’ relational agents for older adults. Interact Comput 2005 Dec;17(6):711-735. [CrossRef]
Martínez-Miranda J, Martínez A, Ramos R, Aguilar H, Jiménez L, Arias H, et al. Assessment of users' acceptability of a mobile-based embodied conversational agent for the prevention and detection of suicidal behaviour. J Med Syst 2019 Jun 25;43(8):246. [CrossRef] [Medline]
Provoost S, Lau HM, Ruwaard J, Riper H. Embodied conversational agents in clinical psychology: a scoping review. J Med Internet Res 2017 May 9;19(5):e151 [FREE Full text] [CrossRef] [Medline]
Dworkin MS, Lee S, Chakraborty A, Monahan C, Hightow-Weidman L, Garofalo R, et al. Acceptability, feasibility, and preliminary efficacy of a theory-based relational embodied conversational agent mobile phone intervention to promote HIV medication adherence in young HIV-positive African American MSM. AIDS Educ Prev 2019 Feb;31(1):17-37. [CrossRef] [Medline]
Sillice MA, Morokoff PJ, Ferszt G, Bickmore T, Bock BC, Lantini R, et al. Using relational agents to promote exercise and sun protection: assessment of participants' experiences with two interventions. J Med Internet Res 2018 Feb 7;20(2):e48 [FREE Full text] [CrossRef] [Medline]
Bickmore T. Relational agents for chronic disease self management. In: Hayes BM, Aspray W, editors. Health Informatics: A Patient-centered Approach to Diabetes. Cambridge, MA: MIT Press; 2010:181-205.
Easton K, Potter S, Bec R, Bennion M, Christensen H, Grindell C, et al. A virtual agent to support individuals living with physical and mental comorbidities: co-design and acceptability testing. J Med Internet Res 2019 May 30;21(5):e12996 [FREE Full text] [CrossRef] [Medline]
Cooper H, Booth K, Gill G. Patients' perspectives on diabetes health care education. Health Educ Res 2003 Apr;18(2):191-206. [CrossRef] [Medline]
Grekin ER, Beatty JR, Ondersma SJ. Mobile health interventions: exploring the use of common relationship factors. JMIR Mhealth Uhealth 2019 Apr 15;7(4):e11245 [FREE Full text] [CrossRef] [Medline]
Bickmore T, Gruber A, Picard R. Establishing the computer-patient working alliance in automated health behavior change interventions. Patient Educ Couns 2005 Oct;59(1):21-30. [CrossRef] [Medline]
Baptista S, Trawley S, Pouwer F, Oldenburg B, Wadley G, Speight J. What do adults with type 2 diabetes want from the 'perfect' app? Results from the second diabetes miles: Australia (MILES-2) study. Diabetes Technol Ther 2019 Jul;21(7):393-399. [CrossRef] [Medline]
Gaffney H, Mansell W, Tai S. Conversational agents in the treatment of mental health problems: mixed-method systematic review. JMIR Ment Health 2019 Oct 18;6(10):e14166 [FREE Full text] [CrossRef] [Medline]
Kang S, Feng A, Leuski A, Casas D, Shapiro A. The Effect of An Animated Virtual Character on Mobile Chat Interactions. In: Proceedings of the 3rd International Conference on Human-Agent Interaction. 2015 Presented at: HAI'15; October 21-24, 2015; Daegu, Kyungpook, Republic of Korea p. 105-112. [CrossRef]
Trinh H, Shamekhi A, Kimani E, Bickmore T. Predicting User Engagement in Longitudinal Interventions with Virtual Agents. In: Proceedings of the 18th International Conference on Intelligent Virtual Agents. 2018 Presented at: IVA'18; November 5-8, 2018; Sydney, NSW, Australia. [CrossRef]
Bente G, Krämer N, Eschenburg F. Is there anybody out there? Analyzing the effects of embodimentnonverbal behavior in avatar-mediated communication. In: Konijn EA, Utz S, Tanis M, Barnes SB, editors. Mediated Interpersonal Communication. New York, USA: Routledge; 2008:131-157.
Seyama J, Nagayama RS. The uncanny valley: effect of realism on the impression of artificial human faces. Teleoperators Virtual Environ 2007 Aug;16(4):337-351. [CrossRef]
Mori M, MacDorman K, Kageki N. The uncanny valley [from the field]. IEEE Robot Automat Mag 2012 Jun;19(2):98-100. [CrossRef]
Ring L, Utami D, Bickmore T. The Right Agent for the Job? In: Proceedings of the International Conference on Intelligent Virtual Agents. 2014 Presented at: IVA'14; August 26-29, 2014; Boston, MA, USA. [CrossRef]
Black L, McTear M, Black N, Harper R, Lemon M. Appraisal of a Conversational Artefact and Its Utility in Remote Patient Monitoring. In: Proceedings of the 18th IEEE Symposium on Computer-Based Medical Systems. 2005 Presented at: CBMS'05; June 23-24, 2005; Dublin, Ireland. [CrossRef]
Harper R, Nicholl P, McTear M, Wallace J, Black L, Kearney P. Automated Phone Capture of Diabetes Patients Readings with Consultant Monitoring via the Web. In: Proceedings of the 15th Annual IEEE International Conference and Workshop on the Engineering of Computer Based Systems. 2008 Presented at: ECBS'08; March 31-April 4, 2008; Belfast, UK. [CrossRef]
Guetterman TC, Fetters MD, Creswell JW. Integrating quantitative and qualitative results in health science mixed methods research through joint displays. Ann Fam Med 2015 Nov;13(6):554-561 [FREE Full text] [CrossRef] [Medline]
Oldenburg B, Baptista S, Bird D, Shetty S, Zemanek J. Randomized controlled evaluation of an mhealth program for people with type 2 diabetes: My Diabetes Coach. Int J Behav Med 2018;25:S112-S113 [FREE Full text]
Baptista S, Zemanek J, Shetty S, Bird D, Oldenburg B. Baseline sample characteristics and 6-month evaluation of an mhealth program for people with type 2 diabetes-My Diabetes Coach. Ann Behav Med 2018;52(1):S290-S291 [FREE Full text]
YouTube. 2015. My Diabetes Coach: Laura's Introduction URL: https://www.youtube.com/watch?v=8nfw8Cpd8yA [accessed 2018-03-21]
American Diabetes Association. Glycemic targets: standards of medical care in diabetes-2020. Diabetes Care 2020 Jan;43(Suppl 1):S66-S76. [CrossRef] [Medline]
Ackerman S, Hilsenroth M. A review of therapist characteristics and techniques positively impacting the therapeutic alliance. Clin Psychol Rev 2003 Feb;23(1):1-33. [CrossRef] [Medline]
Vogeley K, Bente G. 'Artificial humans': psychology and neuroscience perspectives on embodiment and nonverbal communication. Neural Netw 2010;23(8-9):1077-1090. [CrossRef] [Medline]
Bickmore T, Caruso L, Clough-Gorr K. Acceptance and Usability of a Relational Agent Interface by Urban Older Adults. In: CHI '05 Extended Abstracts on Human Factors in Computing Systems. 2005 Presented at: CHI EA'05; April 2-7, 2005; Portland, Oregon, USA. [CrossRef]
Bickmore TW, Pfeifer LM, Byron D, Forsythe S, Henault LE, Jack BW, et al. Usability of conversational agents by patients with inadequate health literacy: evidence from two clinical trials. J Health Commun 2010;15(Suppl 2):197-210. [CrossRef] [Medline]
Kang S, Watt JH. The impact of avatar realism and anonymity on effective communication via mobile devices. Comput Hum Behav 2013 May;29(3):1169-1181. [CrossRef]
Braun V, Clarke V. Using thematic analysis in psychology. Qual Res Psychol 2006 Jan;3(2):77-101. [CrossRef]
Braun V, Clarke V. In: Charmichael M, editor. Successful Qualitative Research: A Practical Guide for Beginners. Thousand Oaks, CA: Sage Publications; 2013.
Sahin C, Naylor P. Mixed-methods research in diabetes management via mobile health technologies: a scoping review. JMIR Diabetes 2017 Feb 6;2(1):e3 [FREE Full text] [CrossRef] [Medline]
Browne JL, Ventura A, Mosely K, Speight J. 'I call it the blame and shame disease': a qualitative study about perceptions of social stigma surrounding type 2 diabetes. BMJ Open 2013 Nov 18;3(11):e003384 [FREE Full text] [CrossRef] [Medline]
Pal K, Dack C, Ross J, Michie S, May C, Stevenson F, et al. Digital health interventions for adults with type 2 diabetes: qualitative study of patient perspectives on diabetes self-management education and support. J Med Internet Res 2018 Jan 29;20(2):e40 [FREE Full text] [CrossRef] [Medline]
Speight J, Conn J, Dunning T, Skinner T, Diabetes Australia. Diabetes Australia position statement. A new language for diabetes: improving communications with and about people with diabetes. Diabetes Res Clin Pract 2012 Sep;97(3):425-431. [CrossRef] [Medline]
Cowan B, Pantidi N, Coyle D, Morrissey K, Clarke P, Al-Shehri S. 'What Can I Help You With?': Infrequent Users' Experiences of Intelligent Personal Assistants. In: Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services. 2017 Presented at: MobileHCI'17; September 4-7, 2017; Vienna, Austria. [CrossRef]
Burgoon J, Guerrero L, Manusov V. Nonverbal signals. In: Knapp ML, Daly JA, editors. The SAGE Handbook of Interpersonal Communication. Fourth Edition. Thousand Oaks, CA: Sage Publications; 2011:239-280.

‎

ECA: embodied conversational agent

HbA1c: glycated hemoglobin A1c

MDC: My Diabetes Coach

NDSS: National Diabetes Services Scheme

T2D: type 2 diabetes

Edited by G Eysenbach; submitted 17.11.19; peer-reviewed by M Sillice, T Bickmore; comments to author 31.12.19; revised version received 18.03.20; accepted 29.03.20; published 22.07.20

©Shaira Baptista, Greg Wadley, Dominique Bird, Brian Oldenburg, Jane Speight, The My Diabetes Coach Research Group. Originally published in JMIR mHealth and uHealth (http://mhealth.jmir.org), 22.07.2020.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR mHealth and uHealth, is properly cited. The complete bibliographic information, a link to the original publication on http://mhealth.jmir.org/, as well as this copyright and license information must be included.

This paper is in the following e-collection/theme issue:

Acceptability of an Embodied Conversational Agent for Type 2 Diabetes Self-Management Education and Support via a Smartphone App: Mixed Methods Study