PT - JOURNAL ARTICLE AU - Tim Althoff AU - Hamed Nilforoshan AU - Jenna Hua AU - Jure Leskovec TI - How Food Environment Impacts Dietary Consumption and Body Weight: A Country-wide Observational Study of 2.3 Billion Food Logs AID - 10.1101/2020.09.29.20204099 DP - 2020 Jan 01 TA - medRxiv PG - 2020.09.29.20204099 4099 - http://medrxiv.org/content/early/2020/09/29/2020.09.29.20204099.short 4100 - http://medrxiv.org/content/early/2020/09/29/2020.09.29.20204099.full AB - IMPORTANCE An unhealthy diet is a key risk factor for chronic diseases including obesity, diabetes, and heart disease. Limited access to healthy food options may contribute to unhealthy diets. However, previous studies of food environment have led to mixed results, potentially due to methodological limitations of small sample size, single location, and non-uniform design across studies.OBJECTIVE To quantify the independent impact of fast food and grocery access, income and education on food consumption and weight status.DESIGN, SETTING AND PARTICIPANTS Retrospective cohort study of 1,164,926 participants across 9,822 U.S. zip codes logging 2.3 billion consumed foods. Participants were users of the My-FitnessPal smartphone application and used the app to monitor their caloric intake for an average of 197 days each (min 10, max 1,825 days, STD=242).MAIN OUTCOMES AND MEASURES The primary outcomes were relative change in consumption of fresh fruits and vegetables, fast food, and soda, as well as relative change in likelihood of overweight/obese body mass index (BMI), based on food consumption logs. Food access measures for each zip code were computed from USDA Food Access Research Atlas and Yelp.com, and demographic, income and education measures were based on Census data. Genetic Matching-based approaches were used to create matched pairs of zip codes.RESULTS Access to grocery stores, non-fast food restaurants, income, and education were independently associated with healthier food consumption and lower prevalence of overweight/obese BMI levels. Substantial differences were observed between predominantly Black, Hispanic, and White zip codes. For instance, within predominantly Black zip codes we found that high income was associated with a decrease in healthful food consumption patterns across fresh fruits and vegetables and fast food. Further, high grocery access had a significantly larger association with increased fruit and vegetable consumption in predominantly Hispanic (7.4% increase) and Black (10.2% increase) zip codes in contrast to predominantly White zip codes (1.7% increase).CONCLUSIONS AND RELEVANCE Policy targeted at improving access to grocery stores, access to non-fast food restaurants, income and education may significantly increase healthy eating, but interventions may need to be adapted to specific subpopulations for optimal effectiveness.Note We will release all data aggregated at a zipcode level in order to enable validation, follow-up research, and use by policy makers.Question How does food consumption and weight status vary with food access, income and education in the United States?Findings In this country-wide observational study of 1,164,926 participants and 2.3 billion food entries, higher access to grocery stores, lower access to fast food, higher income and education were independently associated with higher consumption of fresh fruits and vegetables, lower consumption of fast food and soda, and lower likelihood of being overweight/obese, but these associations varied significantly across Black, Hispanic, and White subpopulations.Meaning Policy targeted at improving food access, income and education may increase healthy eating, but interventions may need to be targeted to specific subpopulations for optimal effectiveness.Competing Interest StatementThe authors have declared no competing interest.Funding StatementFunding/Support: T.A. and J.L. were supported by a National Institutes of Health (NIH) grant (U54 EB020405, Mobilize Center, NIH Big Data to Knowledge Center of Excellence). T.A. was supported by the SAP Stanford Graduate Fellowship, NSF grant IIS-1901386, and Bill Melinda Gates Foundation (INV-004841). H.N was supported by NSF REU #1659585 at the Stanford Center for the Study of Language and Information (CSLI). J.H is supported by Postdoctoral Fellowship in Cardiovascular Disease Prevention (T32) funded by the National Heart, Lung, and Blood Institute (NHLBI) at the National Institutes of Health (NIH). J.L. was supported by DARPA under Nos. FA865018C7880 (ASED), N660011924033 (MCS); ARO under Nos. W911NF-16-1-0342 (MURI), W911NF-16-1-0171 (DURIP); NSF under Nos. OAC-1835598 (CINES), OAC-1934578 (HDR), CCF-1918940 (Expeditions); Stanford Data Science Initiative, Wu Tsai Neurosciences Institute, Chan Zuckerberg Biohub, Amazon, Boeing, Chase, Docomo, Hitachi, Huawei, JD.com, NVIDIA, Dell. Roles of Funder/Sponsor: The funding sources had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.Author DeclarationsI confirm all relevant ethical guidelines have been followed, and any necessary IRB and/or ethics committee approvals have been obtained.YesThe details of the IRB/oversight body that provided approval or exemption for the research described are given below:Data handling and analysis was conducted in accordance with the guidelines of the Stanford University Institutional Review Board.All necessary patient/participant consent has been obtained and the appropriate institutional forms have been archived.YesI understand that all clinical trials and any other prospective interventional studies must be registered with an ICMJE-approved registry, such as ClinicalTrials.gov. I confirm that any such study reported in the manuscript has been registered and the trial registration ID is provided (note: if posting a prospective study registered retrospectively, please provide a statement in the trial ID field explaining why the study was not registered in advance).YesI have followed all appropriate research reporting guidelines and uploaded the relevant EQUATOR Network research reporting checklist(s) and other pertinent material as supplementary files, if applicable.YesNote We will release all data aggregated at a zipcode level in order to enable validation, follow-up research, and use by policy makers.