An mHealth app using machine learning to increase physical activity in diabetes and depression: clinical trial protocol for the DIAMANTE Study
==============================================================================================================================================

* Adrian Aguilera
* Caroline A. Figueroa
* Rosa Hernandez-Ramos
* Urmimala Sarkar
* Anupama G Cemballi
* Laura Gomez-Pathak
* Jose Miramontes
* Elad Yom Tov
* Bibhas Chakraborty
* Xiaoxi Yan
* Jing Xu
* Arghavan Modiri
* Jai Aggarwal
* Joseph Jay Williams
* Courtney R. Lyles

## ABSTRACT

**Introduction** Depression and diabetes are highly disabling diseases with a high prevalence and high rate of comorbidity, particularly in low-income ethnic minority patients. Though comorbidity increases the risk of adverse outcomes and mortality, most clinical interventions target these diseases separately. Increasing physical activity might be effective to simultaneously lower depressive symptoms and improve glycemic control. Self-management apps are a cost-effective, scalable and easy access treatment to increase physical activity. However, cutting-edge technological applications often do not reach vulnerable populations and are not tailored to an individual’s behavior and characteristics. Tailoring of interventions using machine learning methods likely increases the effectiveness of the intervention.

**Methods and analysis** In a three-arm randomized controlled trial we will examine the effect of a text-messaging smartphone application to encourage physical activity in low-income ethnic minority patients with comorbid diabetes and depression. The adaptive intervention group receives messages chosen from different messaging banks by a reinforcement learning algorithm. The uniform random intervention group receives the same messages, but chosen from the messaging banks with equal probabilities. The control group receives a weekly mood message. We aim to recruit 276 adults from primary care clinics aged 18 to 75 years who have been diagnosed with current diabetes and show elevated depressive symptoms (PHQ-8 >5). We will compare passively collected daily step counts, self-report PHQ-8 and most recent HbA1c from medical records at baseline and at intervention completion at 6-month follow-up.

**Ethics and dissemination** The Institutional Review Board at the University of California San Francisco approved this study (IRB: 17-22608). We plan to submit manuscripts describing our User Designed Methods and testing of the adaptive learning algorithm and will submit the results of the trial for publication in peer-reviewed journals and presentations at (inter)-national scientific meetings.

**Registration** clinicaltrials.gov: [NCT03490253](http://medrxiv.org/lookup/external-ref?link_type=CLINTRIALGOV&access_num=NCT03490253&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom); pre-results

**Strengths and Limitations**

*   Novel approach of targeting diabetes and depressive symptoms using a smartphone application

*   Ability to compare adaptive messaging for increasing physical activity to messages delivered with equal probabilities

*   Testing of a smartphone application integrated within primary care settings in a low-income vulnerable patient population

*   Longitudinal design with 6-month follow-up enables assessing intervention effects over time

*   Challenges of this trial include supporting users in key behavior change in an automated manner with minimal in-person support

Keywords
*   Telemedicine
*   Health informatics
*   Depression and mood disorders
*   Diabetes and endocrinology

## INTRODUCTION

### Background

Both depression and diabetes are highly prevalent, often co-occurring diseases that are among the major causes of global disability (1,2,3). Comorbid diabetes and depression is associated with a worse prognosis of both diseases, including a higher rate of complications of diabetes, greater disability, and an increased risk of mortality (4). Vulnerable populations, including low-income, low health literacy and ethnic minority individuals experience higher prevalences and worse outcomes for both diabetes and depression.(5,6) There is a great need for the development of treatments that can target overlapping risk factors for diabetes and depression. A growing body of evidence suggests that physical activity is such a risk factor: it is linked to both mental health and diabetic outcomes (7,8,9,10,11)

Mobile applications have been found effective in helping patients engage in healthy behaviors including physical activity. For instance, a recent meta-analysis of nine RCT’s concluded that smartphone apps that focus on physical activity have a moderate positive effect on increasing physical activity levels, and another meta-analysis including 18 studies moderate to large effect in daily step changes(12,13). These effect sizes are similar to ‘face-to-face’ interventions(14). However, because around 70% of lower income Americans (including Latinos) currently own smartphones(15), and the ownership of smartphones is expected to increase in low income populations globally(16), mobile apps have great potential to reach individuals that normally do not have access to care. Further, mobile technology can help overcome existing barriers in access to care for vulnerable populations, including lower availability of psychological treatment in primary care settings, language and literacy barriers, stigma, cost, and inflexible employment schedules(17). Latinos in the US in particular show higher under-utilization rates of mental health treatment than non-Latino whites(18). Deploying effective mobile applications can therefore decrease existing decrease disparities in health.

However, in the U.S., low-income minority patients frequently receive their care in safety net settings (services in the public sector for those unable to attain private health insurance), where novel mobile technologies are not often designed, developed or implemented(19). This translational gap increases the probability that these interventions will ultimately fail when implemented in actual clinical settings that serve vulnerable populations.

Further, most mobile applications that target behavioral changes are not personalized(20), which could contribute to lower effect sizes of these interventions for trials with longer study durations (e.g. over 3 months)(12). Smartphone interventions allow for data collection by passive sensing technologies, which offer an opportunity for tailoring and personalizing interventions to users’ behaviors, preferences and needs. Personalization can be achieved by computer tailoring: tailoring interventions to observed behavior and characteristics of the participant. This can include feedback, goal setting, or user targeting (i.e. conveying that communication is designed specifically for the user)(21). However, more complex forms of personalization might be needed to increase and maintain engagement with PA interventions (22), a requirement for a digital intervention to be effective.

One promising approach is to use adaptive learning, which allows the prediction of which content might be effective for users, learning from its previous actions and participant data collected by mobile phones(23). For instance, an adaptive learning algorithm might be more effective as it can match motivation needs for each participant. Some intervention studies have attempted cultural tailoring to large groups of people. However, most tailoring approaches occur at the group level whereas we are bringing the tailoring down to the individual level(24). Research on the efficacy of these adaptive treatments is still in its early stages.

### Aim and hypotheses

The main aim of the “Diabetes and Mental Health Adaptive Notification Tracking and Evaluation” trial (DIAMANTE) is to test a smartphone intervention that generates adaptive messaging, learning from daily patient data to personalize the timing and type of text-messages. We will compare the adaptive content to **1**. a uniform random messaging intervention, in which the messaging content and timing will be delivered with equal probabilities (i.e. not adapted by a learning algorithm) **2**. a control condition that only delivers a weekly mood message. The primary outcomes for this aim will be improvements in physical activity at 6-month follow-up defined by daily step counts.

We will investigate the following specific hypotheses:

#### Primary Hypothesis

We expect that the group receiving adaptive messaging will have a statistically significant higher increase in step counts over the 6-month intervention period, compared to both the group receiving the uniform random messaging and the control group receiving only a weekly mood message. We expect that both the adaptive and the uniform random messaging groups will have a higher increase in step counts than the control group.

#### Secondary hypotheses

*   We expect that the group receiving adaptive, personalized messages will have significant reduction in HbA1c levels, depression scores, weekly mood ratings, higher behavioral activation, lower rumination and anxiety, more positive attitudes about physical activity and higher self-efficacy to manage chronic diseases at 6 months, compared to the group that receives the uniform random messaging intervention and the control group.

*   Exploratory: Because of the unique nature of the patient population enrolled in our study, we expect to find differences by language (English vs. Spanish). Language has been shown to be especially influential in mobile technology uptake and effectiveness in our previous work(25). Spanish speakers may engage more in the mobile intervention and might have different motivations for becoming physically active, which might result in differences in effectiveness of the various messages.

## METHODS AND ANALYSIS

### Design

This study is a randomized, controlled, single-center superiority trial with three groups and a primary endpoint of increase in daily number of steps during a 6-month intervention delivered by a smartphone app. Randomization will be performed as block randomization with a 1:1:1 allocation. Patients will be automatically randomized into groups through our secure server during onboarding of the app, thereby ensuring allocation concealment. Patients will be informed of the nature and frequency of the messages they will be receiving and will be allowed to discuss this with investigators during the course of the study.

Further, if messages are not being sent out appropriately, research assistants will contact the app developer to address errors within 24 hours. If physical activity data is not coming in, they may need to contact participants to ensure that the app is actively running on their phone, and assist participants with re-downloading the app if necessary. The necessity of these steps makes it unfeasible to blind the researchers. We used the SPIRIT checklist when writing our report(26).

### Population

Patients will be recruited within various clinics in the San Francisco Health Network, including the Richard Fine People’s Clinic, the Family Health Center, and from several endocrinologist providers at the Zuckerberg San Francisco General hospital (ZSFG); as well as the Department of Public Health in San Francisco, California in the United States that will serve as the study sites for this trial. These clinics serve a diverse, chronically ill population and, as in many public hospital settings, they employ multiple part-time providers, creating the risk for lack of continuity of care that mobile health programs may be able to supplement.

#### Inclusion criteria

Patients aged 18 to 75 years who have been diagnosed with diabetes and documented depressive symptoms (score > 5 on the PHQ-8), as determined by their medical health records, will be eligible for this study. If the patients’ health records show a diagnosis of diabetes but do not include information on PHQ-8 scores or depression diagnosis, we will assess patients’ PHQ-8 scores by telephone to determine their eligibility. After the assessment, this score will be uploaded into the patient health records, regardless of their eligibility for the study. Further, the researchers will reach out to the patient and his/her Primary Care Physician if participants screen > 20 on the PHQ-8 (severe depression). Eligible patients will need to use text messaging and have an iPhone or Android smartphone in order to download the pedometer app onto their phones, but they do not need to be proficient in using mobile phone applications. Concomitant care and interventions are permitted during the trial.

#### Exclusion criteria

We will exclude patients with: an inability to exercise due to physical disability; active psychosis or mania; active suicidal ideation; severe cognitive impairment; inability to read and write in English or Spanish; current pregnancy; plans to leave the country for extended periods of time during the 6-month trial.

#### Recruitment

Patients will be recruited using flyers placed in the waiting rooms, direct provider referrals and in-person recruitment strategies timed to visits. To identify potentially eligible patients, we will ask for permission from known providers to pull patient lists, and review and identify patients with dual diagnosis of diabetes and depression that would fit our inclusion criteria (e.g. is able to walk and is not pregnant).

#### Baseline visit

A researcher or healthcare provider will ask patients if they are interested in joining a study to increase physical activity and help manage their mood using their mobile phone. We will bring interested individuals into our offices at ZSFG for a baseline study visit and to obtain informed consent in Spanish or English. Thereafter we will collect all baseline survey measures of interests (outlined below) as well as patient demographics and information about current mobile technology familiarity and utilization. All patients will receive assistance in downloading a pedometer application onto their phone and will send test text-messages back to our system. The researcher will construct a plan for physical activity goals with the patient and instruct patients to have the app open at all times. Thereafter, users will be automatically randomized by the secure server, using a block randomization with block size 3 to allocate individuals into either the uniform random, adaptive, or control group.

#### Partial Patient and Public Involvement

Patients worked with us to design the text-messages, mobile app user interface, and were asked to assess the burden and time-commitment of the study, as part of our User Centered Design approach. We did not involve patients in other areas of the study design.

### Measures

Our primary outcome, change in daily step counts, will be passively collected by a mobile phone application during the time that patients remain in the intervention. For secondary outcomes, we will derive HbA1c, the average plasma glucose over the previous eight to 12 weeks, recommended as a means to diagnose diabetes(27), from patients’ electronic health records (EHR). We will use the most recent, available measurement from a maximum of 12 months before participating in the study. After 6 months, we will again assess the most recent HbA1c (pulling from patients’ EHR), ensuring that at least 3 months elapsed between baseline and follow-up HbA1c levels. For additional secondary outcomes, we will administer a survey at baseline and 6-month follow-up.

The project coordinator and research assistants will be responsible for managing patient data collection. Data will be stored on UCSF Qualtrics and the HealthySMS platform. Once data from all patients is collected, it will be stored on UCSF’s Secure Box, a secure cloud-hosted platform. See **Table 1** for all included questionnaires.

View this table:
[Table 1](http://medrxiv.org/content/early/2020/07/01/2020.06.29.20142943/T1)

Table 1 

#### Engagement measures

In addition to physical activity and the measures mentioned above, we will also examine engagement measures, such as (1) times that the app was opened, (2) time spent reading the messages, (3) Usability data, assessed by the Systems Usability Scale(28) and open ended qualitative questions about their opinions of the app.

### Procedure

#### Determining content of motivational message categories

We designed motivational text-messages according to a behavioral theory framework, using the Capability, Opportunity, Motivation, Behavior (COM-B) model(29). COM-B is a behavioral change model that proposes that engaging in a particular behavior depends on the dimensions of capability (physical and psychological), opportunity (social and physical) and motivation (need to engage in the behavior more than in other behaviors). Messages were designed to fit into these three dimensions. Before the start of the RCT, English and Spanish speaking patients tested the messages in the Amazon Mechanical Turk (MTurk) crowdsourcing platform. Patients on MTurk were given random messages and asked to categorize them into our overarching theoretical categories. MTurk workers and each coauthor also rated messages for their overall quality from 1 to 5.

#### User Centered Design Testing

We utilized user-centered design (UCD) methods(30) to iteratively develop the content and text messaging system. We conducted three iterative phases of UCD with ten patients each (total n=30). The first phase consisted of 1.5-hour individual semi-structured interviews that took place at the offices at ZSFG. Findings from phase 1 were used to inform content and information delivery decisions of the final intervention including selecting the thematic message categories and the design decisions. The second phase ran as a usability test in which patients tested out an early prototype of the mobile application. The third phase tested out the final DIAMANTE intervention including thematic message content and finalized application in order to address any user-related issues prior to launching the randomized control trial (See **Figure 1** for an overview of the different UCD phases and the RCT).

![Figure 1:](http://medrxiv.org/https://www.medrxiv.org/content/medrxiv/early/2020/07/01/2020.06.29.20142943/F1.medium.gif)

[Figure 1:](http://medrxiv.org/content/early/2020/07/01/2020.06.29.20142943/F1)

Figure 1: 
Overview User Centered Design process (UCD) and randomized trial. We first conducted three iterative phases of only UCD with ten patients each (total n=30). The DIAMANTE trial will have different intervention groups: adaptive (n=92), static (n=92) and control (n=92).

#### Interventions

We employ a mobile phone app, “DIAMANTE” developed by Audacious Software [https://diamante.healthysms.org/](https://diamante.healthysms.org/). This application tracks step counts by pooling from Google Fit, Apple HealthKit or the built-in pedometer on patients’ phones. We use a text-messaging platform HealthySMS, developed by Dr. Aguilera, to send text-messages and manage patient responses back to our system. The app only needs to be installed once, but has to remain open consistently. The app is designed in English and Spanish versions and is freely available as a download from the Apple App Store and Android Google Play application.

**Figure 1**. shows the different intervention groups during the trial period. Briefly, both the adaptive and uniform random group will receive the same types of messages: feedback (4 active categories plus no message) and motivation (3 active categories plus no message). However, the message categories, timing and frequency will be optimized by a reinforcement learning algorithm in the adaptive group, and will be delivered with equal probabilities in the uniform random group (following a uniform random distribution). The control group will not receive these messages (only a weekly mood check-in message). All groups will have the app downloaded on their phone and their steps will be passively tracked within the app. See Figure 1 for an overview.

### Control condition

Control patients will only install the app on their phone and will not receive any feedback messages. They will receive one message a week, on a fixed day, asking them to assess their mood in the previous week on a scale of 1 to 9. The message will be sent weekly at 10:00 am. Non-responders will receive reminders to submit their mood self-assessments in two-hour intervals.

### Uniform random message arm

We will send patients up to two messages per day within 4 randomly selected time intervals. These messages are based on the COM-B framework (examples shown in **Table 1 A/B**). In addition, they will receive one message, on the seventh day, which will ask patients to rate their mood on a scale from 1 to 9. Physical activity (step-count/day) will be passively monitored via the app on their smartphone.

### Adaptive message arm

Patients in the adaptive messaging arm will receive the daily COM-B messages (equal to the uniform random arm, examples shown in **Table 1 A/B**), but the message categories, timing and frequency, will not be chosen randomly, but by using a reinforcement learning (RL) algorithm. This allows us to adequately assess whether differences in effects are driven by the use of the RL algorithm.). In addition, they will receive one message, on the seventh day, which will ask patients to rate their mood on a scale from 1 to 9. Physical activity (step-count/day) will be actively monitored via the app on their smartphone.

Patients in all groups will receive reminders to open the app if no data is being transmitted. Additionally, patients in all groups can reply ‘STOP’ or ‘PARAR’ if they wish to stop receiving messages. Finally, the researchers will monitor the incoming step data. The researchers will contact the patients by phone for troubleshooting if patients’ phones are not transmitting data for more than 3 days.

### Adaptive Learning algorithm

We developed a reinforcement learning algorithm (an area of machine learning) based on previous work(31). On a daily basis, this algorithm assesses 1. which feedback message (**Table 2A**) 2. which motivational message (**Table 2B**) and 3. which time period of the day (in intervals of 2.5 hours from 9:00 am to 7:00 pm) is predicted to maximize the number of steps walked the next day. We use algorithms for contextual multi-armed bandit (MAB) problems, as these have been employed in different domains (including mobile health) and show promise to maximize cumulative rewards in sequential decision tasks (here, which sequences of messages optimally promote physical activity)(32). A contextual MAB problem is a reinforcement learning setting in which the algorithm choses between different treatment options which all have different reward functions. The reward functions depend on contextual variables.

View this table:
[Table 2A.](http://medrxiv.org/content/early/2020/07/01/2020.06.29.20142943/T2)

Table 2A. 
Different categories with feedback messages that the algorithm chooses from. The algorithm can also choose not to send a message (category no feedback).

View this table:
[Table 2B.](http://medrxiv.org/content/early/2020/07/01/2020.06.29.20142943/T3)

Table 2B. 
Different categories with Motivational messages that the algorithm chooses from. The algorithm can also choose not to send a message (category no feedback).

We use Thompson Sampling, a Bayesian method, which will allow us to continuously learn which feedback and motivational messages are effective for a user, based on contextual features like their previous physical activity, demographic and clinical characteristics (such as age, gender and PHQ-8 scores). Thompson sampling can effectively deal with small amounts of data and addresses the exploration/exploitation trade-off(33). As such, it frequently picks out from the most rewarding messages and occasionally explores the messages with uncertainty in their reward.

More specifically, each morning, the algorithm evaluates which messages will likely increase the physical activity for every participant in the upcoming day, and at which time period the messages should be delivered. The algorithm training data consists of the historical data of all participants (contextual variables), which include which messages were sent previously and within which time periods, and select clinical/demographic data (such as age, language and depression scores) to improve prediction abilities.

During the trial, patients will receive messages for 6 months. After 6 months, we will provide patients the possibility to remain receiving messages for up to 9 months.

### Statistical analysis plan

#### Baseline characteristics

We will provide descriptive summaries and examine any potential imbalances between the intervention and control groups using χ2-tests for categorical variables and independent samples t-tests for continuous variables.

#### Primary and secondary hypothesis

Analyses will be conducted on an intention-to-treat basis. We will use full-information maximum likelihood to handle missing data, which has been shown to be preferred over multiple imputations(34). We will include patients in the analysis for primary and secondary outcomes that have at least one month of data available. The primary outcome measure, increase in daily step counts, and the secondary outcomes measures (HbA1c, PHQ-8 and other questionnaire data, see measures) will be analyzed using a conditional growth model, in line with previous work(35), with intervention (adaptive, uniform random, control) being the fixed effect and time being nested within individuals. Let *j = 1*, …, *n, i = 1*, …, *t*, where *t* is the number of recorded days for the *j*th patient Yij is the daily steps count on the *i*th day of *j*th patient.

Let Aj = 2, 1 or 0 for patient *j* in adaptive, uniform random or control arm, and let Tij be the *i*th day for patient *j*.

The daily steps for patients given the treatment, in equation form:

At level 1 (within-patient): ![Formula][1]</img>  

At level 2 (between-patient): ![Formula][2]</img>  

As this is a randomized trial, we will not adjust for any patient characteristics or patient engagement measures in the primary intention-to-treat analysis unless there is a significant imbalance between the arms of the trial.

### Pre-planned subgroup analyses

We will conduct an exploratory sub-group analyses for our primary and secondary hypotheses for English versus Spanish speaking patients. We expect at least 50% of the sample to be Spanish speakers given our previous testing recruitment.

#### Power Calculations and Sample Size

Power for growth models were calculated based on Monte Carlo simulations using the ‘*nlme*’ package in R(36) We conducted simulations, using data from our own study team testing the app (n=47) and data from pilot patients (n=8) to estimate the model parameters for calculating the necessary sample size. We used an estimated mean increase of 1250 steps based on previous work (37, 38). With 5000 Monte Carlo simulation runs, number of days for each patient *t* ∼ *Uniform(28,180)*, we estimate that we need n = 160 for 2-arms (the adaptive vs. uniform random) for 80% power. Inflating to include the control arm and controlling for 15% drop-out, we aim to recruit a total of 276 patients. We believe that low drop-out is feasible as (1) the primary outcome of interest will be passively assessed and thus requires limited proactive engagement by the user, and (2) we have a comprehensive protocol to automatically text users who have closed the app or are not transmitting data.

#### Compensation

Patients will receive a compensation of $40 for participation in the baseline part of the study, and an additional $70 for completion of the 6-month follow up.

#### Ethics and Dissemination

The UCSF Institutional Review Board has approved this protocol. All protocol amendments will be communicated for approval to the USCF IRB. We will ensure that our text messaging content is publicly available through a creative commons licensing agreement. The HealthySMS system is available for use upon request.

#### Data statement

We will submit study-results for publication in peer reviewed journals and presentation at (inter)national meetings, taking into account relevant reporting guidelines (e.g. CONSORT(39)). We will attempt to publish all findings in open-access journals when possible, or in other journals with a concurrent uploading of the manuscript content into PubMed Central for public access.

Curated technical appendices, statistical code, and anonymized data will become freely available from the corresponding authors upon request.

### Safety protocol

We will review the clinician dashboard for patients in all arms of the trial to identify safety concerns that may emerge over the course of the trial. In addition, we will be automatically notified when key events, such as self-reported suicidality (messages with the words “Kill, Die, Suicide, Don’t want to live, Do not want, Bridge, What is the point, Harm, Hurt, Gun” for English speaking patients; and “Tiro, Matar, Morir, Suicidio, suicidar, no quiero, puente, para que, daño, dañar, dano, danar” for Spanish speaking patients) occur. We will provide immediate phone outreach and will refer patients to urgent or emergency care as necessary. Dr. Aguilera will be available to consult and intervene as necessary for mental health emergencies.

## DISCUSSION

In this randomized controlled trial, we aim to examine the effect of a smartphone app that uses reinforcement learning to predict the most effective messages for increasing physical activity in 276 low-income, ethnic and racial minority patients with diabetes and depression in urban public sector primary care clinics. We will compare this intervention to uniform random messages, delivered with equal and unchanging probabilities, and a control group that only receives a weekly mood message.

### Decreasing health disparities

Though the numbers of mHealth pilot studies are increasing in vulnerable populations, many of these fail to follow through with an implementation component to the study design(40). Here, we are using a blended design: while the intervention is in addition to current care, there are ways we are attempting to make it more a part of patients’ clinical care. For instance, patients are mainly approached through primary care health providers, which recommend eligible patients whom they think are directly interested in a physical activity intervention. In addition, we will make patients’ data available to providers: a summary of the step increase for that patient at the conclusion of the study and updated PHQ-8 and GAD scores entered into the record. Our study therefore is the first step to addressing this gap because of its integration in primary care clinics that serve low-income patients. Future work should focus more specifically on implementation of the app as part of routine clinical care.

### Physical Activity measure

We chose passively collected daily step count from patients’ pre-owned digital devices as a measure of physical activity. Although there are many different ways to measure physical activity, daily step count seems to be a particularly relevant measure because of: 1. Its relative ease to measure, 2. The clinical importance of individuals’ walking behavior. Low number of daily step counts have been associated with all-cause mortality in some longitudinal studies(41) and results from pooled population studies show clear dose–response effects of physical activity to overall mortality(42). In patients with Diabetes Type 2, several studies have now shown that increasing step counts can significantly decrease HbA1c levels. For instance, a 10,000 steps per day walking prescription increased steps and decreased HbA1c in patients with Diabetes Type 2(43). Further, Manjoo et al. found that each standard deviation increase in daily steps was associated with a 0.21% decrease in HbA1c(44). However, negative findings have also been reported. For instance, a meta-analysis by Qui et al. found that step-counter use was associated with increased steps per day (over 1800 more steps compared to a control group) among people with diabetes, but not with lowering of HbA1c(45).

In exploratory post hoc analyses, we will also be able to examine the more immediate effect of physical activity messages, e.g. on hourly steps in addition to daily steps, which will help to improve future physical activity interventions (e.g. deliver messages at the right times). For instance, it is possible that one could receive a message in the morning and make plans to walk in the afternoon or evening, or messages could have more of an immediate impact. This information is currently unknown.

### Personalization of intervention

The results of this RCT will help us understand if adaptive mHealth interventions for depression and diabetes are more beneficial than interventions that do not use learning algorithms. If mHealth interventions are not personalized, their efficacy might be low, due to low engagement and high drop-out rates(20). The use of machine learning to adapt interventions according to users’ characteristics and behaviors is still in its early stages, but shows promise(46). For instance, Yov-Tom et al. using an adaptive learning algorithm, found that adaptive feedback messages were more effective in increasing the amount and speed of physical activity and also reduced HbA1c in sedentary patients with diabetes type 2(31). Further, Zhou et al. showed short-term efficacy of using adaptive weekly step goals determined by reinforcement learning in healthy patients(47). The current study, with a relatively large group of patients, will further increase our understanding of the potential of machine-learning driven text messaging interventions.

## Limitations and strengths

### Limitations

First, the results of this study might be specific to this population of low-income ethnic minority patients and might therefore not be generalizable to other populations. However, the inclusion of vulnerable patients in a primary care setting increases the likelihood that this intervention will be effective for other underserved populations. Further, our study procedures do not allow a double-blind design, as researchers and patients need to be made aware of the nature and frequency of messages they are receiving. Additionally, as with all digital interventions, technical issues might arise leading to unreliable step-count data and reduced ability for the algorithm to predict the most effective messages. The researchers and technical personnel will frequently check our server for data collection troubleshooting. Patients will also be made aware when their data is not collected correctly.

### Strengths

Diabetes and depression are among the top ten causes of disability in the US(48). Developing cost-effective and scalable models of care for patients with common chronic conditions has been postulated as of key importance in improving the performance of health care systems(49). If mHealth apps that target diabetes and depression through their common risk-factor physical inactivity are effective, they can have a major public health impact. Further, because the learning algorithm that we apply in this study is automated and delivers adaptive messages based on patients’ behaviors, it can potentially be applied in other patient populations with a wide range of conditions.

## Conclusion

The outcome of this trial will provide information on the effectiveness of a text-message based smartphone app that uses machine learning to increase physical activity in low-income ethnic minority patients in primary care settings. The results will provide key information on the effectiveness of adaptive mobile applications, compared to more traditional static digital interventions. If effective, this application has the ability to decrease healthcare disparities by providing a type of personalized care to a diverse and traditionally hard to reach group of underserved patients.

## Competing interests

All authors report no competing interests.

## Author contributions

Dr. Aguilera and Dr. Lyles designed the overall study. Ms. Hernandez-Ramos, Ms. Gomez-Pathak, Mr. Miramontes, Ms. Cemballi, Dr. Sarkar and Dr. Figueroa were involved in the design and implementation of the User Centered Design phase and RCT. Dr. Jay Williams, Ms. Modiri, Mr. Aggarwal, Dr. Figueroa and Dr. Yom-Tov were involved in the design of the reinforcement learning algorithm. Dr. Figueroa wrote the first draft of the paper. Dr. Chakraborty, Ms. Yan and Dr. Xu revised the statistical analyses and power calculation sections of the paper. All authors revised the manuscript for relevant scientific content and approved the final version of the manuscript.

## Funding

This trial is funded by an R01 to Dr. Adrian Aguilera and Dr. Lyles, 1R01 HS25429-01 from the Agency for Healthcare Research and Quality (AHRQ). The sponsor did not have any role in the study design; collection, management, analysis, and interpretation of data; writing of the report; and the decision to submit the report for publication, not ultimate authority over any of these activities.

## Data Availability

Curated technical appendices, statistical code, and anonymized data will become freely available from the corresponding authors upon request. 

## Acknowledgements

We would like to acknowledge Chris Karr who helped develop and manage the automated text messaging service, passive data collection and reinforcement algorithm and Patricia Avila Garcia who was involved in the design and implementation of the User Centered Design Phase of the study.

## Footnotes

*   * Authors share first authorship

*   Received June 29, 2020.
*   Revision received June 29, 2020.
*   Accepted July 1, 2020.


*   © 2020, Posted by Cold Spring Harbor Laboratory

This pre-print is available under a Creative Commons License (Attribution-NonCommercial-NoDerivs 4.0 International), CC BY-NC-ND 4.0, as described at [http://creativecommons.org/licenses/by-nc-nd/4.0/](http://creativecommons.org/licenses/by-nc-nd/4.0/)

## References

1.  1.Vos T, Abajobir AA, Abate KH, Abbafati C, Abbas KM, Abd-Allah F, et al. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990–2016: a systematic analysis for the Global Burden of Disease Study 2016. The Lancet. 2017;390(10100):1211–59.
    
    
2.  2.Fisher EB, Chan JC, Nan H, Sartorius N, Oldenburg B. Co-occurrence of diabetes and depression: conceptual considerations for an emerging global health challenge. Journal of affective disorders. 2012;142:S56–S66.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/S0165-0327(12)70009-5&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23062858&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

3.  3.Chireh B, Li M, D’Arcy C. Diabetes increases the risk of depression: A systematic review, meta-analysis and estimates of population attributable fractions based on prospective studies. Preventive Medicine Reports. 2019;14:100822.
    
    
4.  4.Sartorius N. Depression and diabetes. Dialogues in clinical neuroscience. 2018;20(1):47–52.
    
    
5.  5.Thomas J, Jones G, Scarinci I, Brantley P. A descriptive and comparative study of the prevalence of depressive and anxiety disorders in low-income adults with type 2 diabetes and other chronic illnesses. Diabetes care. 2003;26(8):2311–7.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czo5OiIyNi84LzIzMTEiO3M6NDoiYXRvbSI7czo1MDoiL21lZHJ4aXYvZWFybHkvMjAyMC8wNy8wMS8yMDIwLjA2LjI5LjIwMTQyOTQzLmF0b20iO31zOjg6ImZyYWdtZW50IjtzOjA6IiI7fQ==) 

6.  6.Colon E, Giachello A, McIver L, Pacheco G, Vela L. Diabetes and depression in the Hispanic/Latino community. Clinical Diabetes. 2013;31(1):43–5.
    
    [FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiRlVMTCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2xpbiI7czo1OiJyZXNpZCI7czo3OiIzMS8xLzQzIjtzOjQ6ImF0b20iO3M6NTA6Ii9tZWRyeGl2L2Vhcmx5LzIwMjAvMDcvMDEvMjAyMC4wNi4yOS4yMDE0Mjk0My5hdG9tIjt9czo4OiJmcmFnbWVudCI7czowOiIiO30=) 

7.  7.Chekroud SR, Gueorguieva R, Zheutlin AB, Paulus M, Krumholz HM, Krystal JH, et al. Association between physical exercise and mental health in 1.2 million individuals in the USA between 2011 and 2015: a cross-sectional study. The Lancet Psychiatry. 2018;5(9):739–46.
    
    
8.  8.Sigal RJ, Armstrong MJ, Bacon SL, Boulé NG, Dasgupta K, Kenny GP, et al. Physical activity and diabetes. Canadian journal of diabetes. 2018;42:S54–S63.
    
    
9.  9.Cooney G, Dwan K, Greig C, Lawlor D, Rimer J, Waugh F, et al. Exercise for depression Cochrane Database. Syst Rev. 2013;9:CD004366.
    
    
10. 10.Mammen G, Faulkner G. Physical activity and the prevention of depression: a systematic review of prospective studies. American journal of preventive medicine. 2013;45(5):649–57.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1016/j.amepre.2013.08.001&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=24139780&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

11. 11.Avery L, Flynn D, Van Wersch A, Sniehotta FF, Trenell MI. Changing physical activity behavior in type 2 diabetes: a systematic review and meta-analysis of behavioral interventions. Diabetes care. 2012;35(12):2681–9.
    
    [Abstract/FREE Full Text](http://medrxiv.org/lookup/ijlink/YTozOntzOjQ6InBhdGgiO3M6MTQ6Ii9sb29rdXAvaWpsaW5rIjtzOjU6InF1ZXJ5IjthOjQ6e3M6ODoibGlua1R5cGUiO3M6NDoiQUJTVCI7czoxMToiam91cm5hbENvZGUiO3M6NzoiZGlhY2FyZSI7czo1OiJyZXNpZCI7czoxMDoiMzUvMTIvMjY4MSI7czo0OiJhdG9tIjtzOjUwOiIvbWVkcnhpdi9lYXJseS8yMDIwLzA3LzAxLzIwMjAuMDYuMjkuMjAxNDI5NDMuYXRvbSI7fXM6ODoiZnJhZ21lbnQiO3M6MDoiIjt9) 

12. 12.Romeo A, Edney S, Plotnikoff R, Curtis R, Ryan J, Sanders I, et al. Can Smartphone Apps Increase Physical Activity? Systematic Review and Meta-Analysis. J Med Internet Res. 2019;21(3):e12053.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.2196/12053&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=30888321&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

13. 13.Gal R, May AM, van Overmeeren EJ, Simons M, Monninkhof EM. The effect of physical activity interventions comprising wearables and smartphone applications on physical activity: a systematic review and meta-analysis. Sports medicine-open. 2018;4(1):42.
    
    
14. 14.Han Y, Yan J. The effect of face-to-face interventions in promoting physical activity. AJN The American Journal of Nursing. 2014;114(4):23.
    
    
15. 15.Schrire V, Beck W. Human heart transplantation--the pre-operative assessment. S Afr Med J. 1967;41(48):1263–5.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=4866698&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

16. 16.Center PR. Smartphone ownership on the rise in emerging economies. 2018 June 19.
    
    
17. 17.Vázquez MYG, Sexto CF, Rocha Á, Aguilera A. Mobile Phones and Psychosocial Therapies with Vulnerable People: a First State of the Art. Journal of Medical Systems. 2016;40(6):157.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

18. 18.Quality AfHRa. National Healthcare Quality Disparities Report 2016.
    
    
19. 19.Lyles CR, Handley MA, Ackerman SL, Schillinger D, Williams P, Westbrook M, et al. Innovative Implementation Studies Conducted in US Safety Net Health Care Settings: A Systematic Review. American Journal of Medical Quality. 2018;34(3):293–306.
    
    
20. 20.Polgreen LA, Anthony C, Carr L, Simmering JE, Evans NJ, Foster ED, et al. The effect of automated text messaging and goal setting on pedometer adherence and physical activity in patients with diabetes: A randomized controlled trial. PloS one. 2018;13(5):e0195797.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

21. 21.Monteiro-Guerra FM, Rivera-Romero O, Luque LF, Caulfield B. Personalization in Real-Time Physical Activity Coaching using Mobile Applications: A Scoping Review. IEEE journal of biomedical and health informatics. 2019.
    
    
22. 22.Cheung KL, Durusu D, Sui X, de Vries H. How recommender systems could support and enhance computer-tailored digital health programs: A scoping review. DIGITAL HEALTH. 2019;5:2055207618824727.
    
    
23. 23.Nahum-Shani I, Smith SN, Spring BJ, Collins LM, Witkiewitz K, Tewari A, et al. Just-in-Time Adaptive Interventions (JITAIs) in Mobile Health: Key Components and Design Principles for Ongoing Health Behavior Support. Annals of behavioral medicine: a publication of the Society of Behavioral Medicine. 2018;52(6):446–62.
    
    
24. 24.Lyles CR, Sarkar U, Osborn CY. Getting a technology-based diabetes intervention ready for prime time: a review of usability testing studies. Current diabetes reports. 2014;14(10):534.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1007/s11892-014-0534-9&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25173689&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

25. 25.Aguilera A, Berridge C. Qualitative Feedback From a Text Messaging Intervention for Depression: Benefits, Drawbacks, and Cultural Differences. JMIR mHealth uHealth. 2014;2(4):e46.
    
    
26. 26.Chan A-W, Tetzlaff JM, Altman DG, Laupacis A, Gøtzsche PC, Krleža-Jerić K, et al. SPIRIT 2013 statement: defining standard protocol items for clinical trials. Annals of internal medicine. 2013;158(3):200–7.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.7326/0003-4819-158-3-201302050-00583&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23295957&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 
    
    [Web of Science](http://medrxiv.org/lookup/external-ref?access_num=000314757900007&link_type=ISI) 

27. 27.Organization WH. Use of glycated haemoglobin (HbA1c) in diagnosis of diabetes mellitus: abbreviated report of a WHO consultation. Geneva: World Health Organization; 2011.
    
    
28. 28.Bangor A, Kortum PT, Miller JT. An empirical evaluation of the system usability scale. Intl Journal of Human–Computer Interaction. 2008;24(6):574–94.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1080/10447310802205776&link_type=DOI) 

29. 29.Michie S, Van Stralen MM, West R. The behaviour change wheel: a new method for characterising and designing behaviour change interventions. Implementation science. 2011;6(1):42.
    
    
30. 30.McCurdie T, Taneva S, Casselman M, Yeung M, McDaniel C, Ho W, et al. mHealth consumer apps: the case for user-centered design. Biomedical instrumentation & technology. 2012;46(2):49–56.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=22239360&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

31. 31.Yom-Tov E, Feraru G, Kozdoba M, Mannor S, Tennenholtz M, Hochberg I. Encouraging Physical Activity in Patients With Diabetes: Intervention Using a Reinforcement Learning System. J Med Internet Res. 2017;19(10):e338.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

32. 32.Tewari A, Murphy SA. From ads to interventions: Contextual bandits in mobile health. Mobile Health: Springer; 2017. p. 495–517.
    
    
33. 33.Agrawal S, Goyal N, editors. Analysis of thompson sampling for the multi-armed bandit problem. Conference on Learning Theory; 2012.
    
    
34. 34.Cham H, Reshetnyak E, Rosenfeld B, Breitbart W. Full Information Maximum Likelihood Estimation for Latent Variable Interactions With Incomplete Indicators. Multivariate Behav Res. 2017;52(1):12–30.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

35. 35.Korinek EV, Phatak SS, Martin CA, Freigoun MT, Rivera DE, Adams MA, et al. Adaptive step goals and rewards: a longitudinal growth model of daily steps for a smartphone-based walking intervention. Journal of behavioral medicine. 2018;41(1):74–86.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

36. 36.Pinheiro J, Bates D, DebRoy S, Sarkar D, Heisterkamp S, Van Willigen B, et al. Package ‘nlme’. Linear and Nonlinear Mixed Effects Models, version. 2017:3–1.
    
    
37. 37.Fukuoka Y, Haskell W, Lin F, Vittinghoff E. Short-and Long-term Effects of a Mobile Phone App in Conjunction With Brief In-Person Counseling on Physical Activity Among Physically Inactive Women: The mPED Randomized Clinical Trial. JAMA network open. 2019;2(5):e194281–e.
    
    
38. 38.Chase JA. Interventions to Increase Physical Activity Among Older Adults: A Meta-Analysis. The Gerontologist. 2015;55(4):706–18.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1093/geront/gnu090&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=25298530&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

39. 39.Grant SP, Mayo-Wilson E, Melendez-Torres G, Montgomery P. Reporting quality of social and psychological intervention trials: a systematic review of reporting guidelines and trial publications. PloS one. 2013;8(5):e65442.
    
    [CrossRef](http://medrxiv.org/lookup/external-ref?access_num=10.1371/journal.pone.0065442&link_type=DOI) 
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=23734256&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

40. 40.Anderson-Lewis C, Darville G, Mercado RE, Howell S, Di Maggio S. mHealth Technology Use and Implications in Historically Underserved and Minority Populations in the United States: Systematic Literature Review. JMIR Mhealth Uhealth. 2018;6(6):e128.
    
    
41. 41.Yamamoto N, Miyazaki H, Shimada M, Nakagawa N, Sawada SS, Nishimuta M, et al. Daily step count and all-cause mortality in a sample of Japanese elderly people: a cohort study. BMC public health. 2018;18(1):540.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

42. 42.Cavero-Redondo I, Peleteiro B, Alvarez-Bueno C, Artero EG, Garrido-Miguel M, Martinez-Vizcaino V. The Effect of Physical Activity Interventions on Glycosylated Haemoglobin (HbA1c) in Non-diabetic Populations: A Systematic Review and Meta-analysis. Sports Med. 2018;48(5):1151–64.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

43. 43.Fayehun AF, Olowookere OO, Ogunbode AM, Adetunji AA, Esan A. Walking prescription of 10 000 steps per day in patients with type 2 diabetes mellitus: a randomised trial in Nigerian general practice. The British journal of general practice: the journal of the Royal College of General Practitioners. 2018;68(667):e139–e45.
    
    
44. 44.Manjoo P, Joseph L, Dasgupta K. Abdominal adiposity and daily step counts as determinants of glycemic control in a cohort of patients with type 2 diabetes mellitus. Nutrition & diabetes. 2012;2(1):e25.
    
    
45. 45.Qiu S, Cai X, Chen X, Yang B, Sun Z. Step counter use in type 2 diabetes: a meta-analysis of randomized controlled trials. BMC medicine. 2014;12(1):36.
    
    
46. 46.Triantafyllidis AK, Tsanas A. Applications of Machine Learning in Real-Life Digital Health Interventions: Review of the Literature. J Med Internet Res. 2019;21(4):e12286.
    
    [PubMed](http://medrxiv.org/lookup/external-ref?access_num=http://www.n&link_type=MED&atom=%2Fmedrxiv%2Fearly%2F2020%2F07%2F01%2F2020.06.29.20142943.atom) 

47. 47.Zhou M, Fukuoka Y, Mintz Y, Goldberg K, Kaminsky P, Flowers E, et al. Evaluating machine learning–based automated personalized daily step goals delivered through a mobile phone app: Randomized controlled trial. JMIR mHealth and uHealth. 2018;6(1):e28.
    
    
48. 48.The US Burden of Disease Collaborators. The State of US Health, 1990-2016: Burden of Diseases, Injuries, and Risk Factors Among US States. US Burden of Diseases, Injuries, and Disease Risk Factors, 1990-2016US Burden of Diseases, Injuries, and Disease Risk Factors, 1990-2016. JAMA. 2018;319(14):1444–72.
    
    
49. 49.Institute of Medicine Committee on Quality of Health Care in A. Crossing the Quality Chasm: A New Health System for the 21st Century. Washington (DC>): National Academies Press (US)

 [1]: /embed/graphic-5.gif
 [2]: /embed/graphic-6.gif