Clinical subgroup clustering analysis in a systemic lupus erythematosus cohort from Western Pennsylvania

Patrick Coit, Lacy Ruffalo, Amr H Sawalha
doi: https://doi.org/10.1101/2020.11.12.20230789
Patrick Coit
1Division of Rheumatology, Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
2Graduate Program in Immunology, University of Michigan, Ann Arbor, MI, USA
Lacy Ruffalo
3Lupus Center of Excellence, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
Amr H Sawalha
1Division of Rheumatology, Department of Pediatrics, University of Pittsburgh, Pittsburgh, PA, USA
3Lupus Center of Excellence, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
4Division of Rheumatology and Clinical Immunology, Department of Medicine, University of Pittsburgh, PA, USA
5Department of Immunology, University of Pittsburgh, Pittsburgh, PA, USA
  • For correspondence: asawalha@pitt.edu

Abstract

Objective Systemic lupus erythematosus (SLE) is a complex, heterogeneous autoimmune disease that can affect multiple organs. We performed a clinical clustering analysis to describe a lupus cohort from the University of Pittsburgh Medical Center.

Methods A total of 724 patients who met the ACR classification criteria for SLE were included in this study. Clustering was performed using the ACR classification criteria and the partitioning around medoids (PAM) method. Correlation analysis was performed using Spearman’s rho test.

Results Patients with SLE in our cohort clustered into 3 distinct clinical disease subsets. Patients in Cluster 1 were significantly more likely to develop renal and hematologic involvement, and African-American and male lupus patients were overrepresented in this cluster. Clusters 2 and 3 identified a milder disease, with a significantly lower likelihood of organ complications. Patients in Cluster 2 are characterized by malar rash and photosensitivity, while patients in Cluster 3 are characterized by oral ulcers, which are present in ∼90% of patients within this cluster. The presence of photosensitivity or oral ulcers appears to be protective against the development of lupus nephritis in our cohort.

Conclusions We describe a large cohort of SLE patients from Western Pennsylvania and identify 3 distinct clinical disease subgroups. Clustering analysis might help to better manage and predict disease complications in heterogeneous diseases like lupus.

Introduction

Systemic lupus erythematosus (SLE or lupus) is a chronic remitting-relapsing autoimmune disease characterized by the production of antinuclear antibodies. Lupus is heterogeneous and can affect multiple organ systems 1. Although it more commonly affects women, lupus tends to be more severe in men 2. In the United States, lupus is more common and more severe in patients of African-American descent compared to European-American patients with the disease 3.

The etiology of lupus is not fully understood. Genetic and environmental factors are thought to be involved in the pathogenesis of lupus 4. Further, a clear role for epigenetic dysregulation in the pathogenesis of lupus has been established 5; 6. The clinical heterogeneity of lupus is suggested to reflect variability in the underlying genetic background, epigenetic modifications, and immunologic dysregulation among individual lupus patients 7-10. While lupus is unified by the presence of autoantibodies directed against self-nuclear antigens, the clinical and molecular heterogeneity of the disease is an important factor hindering the success of clinical trials in lupus 11.

In this report, we describe a subset of lupus patients enrolled in the University of Pittsburgh Medical Center Lupus Cohort who meet the American College of Rheumatology (ACR) classification criteria for SLE 12. We implement a subgroup clinical clustering analysis and characterize 3 distinct clinical subsets of lupus in our Lupus Cohort.

Methods

Patients

We studied a subset of patients included in our UPMC Lupus Cohort who met the American College of Rheumatology classification criteria for SLE 12. All patients were evaluated in our clinics between January 2018 and April 2020. A total of 724 patients were studied.

Clustering

The 11 ACR classification criteria for SLE were used as input for calculating a distance matrix with Gower’s Distance method, as implemented in the cluster (v2.1.0) package in R. This method is intended for non-numeric data 13. All ACR criteria were entered as asymmetric binary values. The number of clusters (k) was determined a priori using the NbClust (v3.0) package 14. This package evaluates a collection of 30 clustering indices, which together suggested an optimal k = 3. Clustering of the Gower’s Distance matrix was performed using the partitioning around medoids (PAM) method in the cluster package, which builds each cluster around a single object (the medoid) that has minimal dissimilarity to all other objects within its cluster 15. PAM operates on the same principles as the k-means algorithm but is more robust to outliers 16. Assigned clusters had a combined average silhouette of 0.24 (Cluster 1 = 0.27, Cluster 2 = 0.25, and Cluster 3 = 0.19). Silhouette values can range from −1 to +1, with a higher value indicating better cohesion of the objects within the cluster 17. Cluster assignments for each sample were used to test for differences in the distribution of sex, race/ethnicity, and presence of ACR criteria across clusters.
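
The clustering steps described above can be illustrated with a minimal sketch. It is not the authors' R code: it assumes that, when every variable is asymmetric binary, Gower's dissimilarity reduces to the Jaccard distance (pairs where both patients lack a criterion are ignored), it uses a simplified greedy swap search in place of the full PAM build and swap phases, and the toy matrix of 6 patients by 4 criteria is hypothetical. All function names are illustrative.

```python
from itertools import combinations

def jaccard_distance(a, b):
    """Dissimilarity for asymmetric binary profiles: 0/0 pairs carry no information."""
    informative = sum(1 for x, y in zip(a, b) if x or y)
    if informative == 0:
        return 0.0  # assumption: two all-absent profiles are treated as identical
    mismatches = sum(1 for x, y in zip(a, b) if x != y)
    return mismatches / informative

def distance_matrix(rows):
    """Symmetric matrix of pairwise distances between patient profiles."""
    n = len(rows)
    d = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        d[i][j] = d[j][i] = jaccard_distance(rows[i], rows[j])
    return d

def total_cost(d, medoids):
    """Sum over objects of the distance to the nearest medoid."""
    return sum(min(d[i][m] for m in medoids) for i in range(len(d)))

def pam(d, k):
    """Simplified k-medoids: start from the first k objects, swap while cost drops."""
    medoids = list(range(k))
    improved = True
    while improved:
        improved = False
        for pos in range(k):
            for cand in range(len(d)):
                if cand in medoids:
                    continue
                trial = medoids[:pos] + [cand] + medoids[pos + 1:]
                if total_cost(d, trial) < total_cost(d, medoids) - 1e-12:
                    medoids, improved = trial, True
    labels = [min(range(k), key=lambda c: d[i][medoids[c]]) for i in range(len(d))]
    return medoids, labels

def mean_silhouette(d, labels):
    """Average silhouette width; objects alone in their cluster contribute 0."""
    n = len(d)
    widths = []
    for i in range(n):
        own = [j for j in range(n) if labels[j] == labels[i] and j != i]
        if not own:
            widths.append(0.0)
            continue
        a = sum(d[i][j] for j in own) / len(own)  # mean distance within own cluster
        b = min(  # mean distance to the closest other cluster
            sum(d[i][j] for j in range(n) if labels[j] == c) / labels.count(c)
            for c in set(labels) if c != labels[i]
        )
        widths.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(widths) / n

# Hypothetical toy data: rows are patients, columns are binary criteria.
toy = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 0, 1],
]
d = distance_matrix(toy)
medoids, labels = pam(d, k=2)
print(labels, round(mean_silhouette(d, labels), 2))
```

On well-separated toy data like this the average silhouette is much higher than the 0.24 reported for the cohort, which is consistent with real clinical profiles overlapping considerably between clusters.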

Statistical analysis

A Pearson’s chi-square test was performed to compare sex and the presence of ACR criteria across clusters. A Fisher’s Exact test was performed to compare race/ethnicity across clusters. P values for the differences between the presence of the 11 ACR criteria across clusters were adjusted using the Benjamini-Hochberg method to account for multiple testing. Sex and race/ethnicity P values were reported unadjusted. Odds ratios and Fisher’s Exact test P values were calculated for sex and the presence of ACR criteria across clusters using the epitools (v0.5-10.1) package in R without correction for multiple testing 18. A significance threshold of P < 0.05 was used for all statistical testing. Correlation analysis was performed using the non-parametric Spearman’s rho test, with Benjamini-Hochberg FDR-adjusted P values reported to correct for multiple testing, using the correlation (v0.4.0) package in R 19.
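
As an illustration of the multiple-testing correction mentioned above, the following is a minimal sketch of the Benjamini-Hochberg step-up adjustment in plain Python. The raw P values are hypothetical and are not taken from this study; the function name is illustrative.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the original order."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])  # indices, smallest P first
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted

raw = [0.01, 0.04, 0.03, 0.20]  # hypothetical raw P values
adj = benjamini_hochberg(raw)
print([round(p, 4) for p in adj])
```

In this toy input, the raw values 0.03 and 0.04 fall below 0.05 but their adjusted values do not, which shows why adjusted and unadjusted P values should be labeled clearly when both are reported.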

Results

We evaluated a total of 724 lupus patients included in the Lupus Cohort at the University of Pittsburgh Medical Center. These patients represent a subset of our Lupus Cohort who meet the American College of Rheumatology classification criteria for SLE, and were evaluated at our center between January 2018 and March 2020.

Our study population included 672 female and 52 male lupus patients. Of these, 73% (n=529) were European-American, 23% (n=168) African-American, 2% (n=16) Asian, and <2% (n=11) of other race/ethnicity (Table 1). The average and median ages of our patients were 48 and 47 years, respectively (range 19 to 86).

Table 1:

Clinical characteristics of 3 subgroups of patients with lupus in our cohort

To further characterize the patterns of disease involvement in our lupus patients, we performed a medoid clustering analysis using the 11 ACR classification criteria for lupus. The analysis revealed that our lupus patients cluster in 3 distinct clinical clusters (Figure 1).

Figure 1.

Clustering analysis of 724 lupus patients reveals 3 disease subsets. Clusters were determined using the partitioning around medoids method applied to a Gower’s Distance matrix of 11 ACR criteria reported for all patients.

Lupus Cluster 1 includes 270 (37%) patients with overrepresentation of organ-specific manifestations. This includes renal involvement in 30% of patients, compared to 11% and 7% in Clusters 2 and 3, respectively (P=5.79E-14), hematologic involvement (76%, compared to 11% and 20% in Clusters 2 and 3, respectively, P=7.11E-56), and discoid rash (18%, compared to 10% and 11% in Clusters 2 and 3, respectively, P=0.02). As shown in Table 1, among all lupus patients in our cohort who have renal involvement (n=119), hematologic involvement (n=277), and discoid rash (n=95), 69%, 74%, and 51%, respectively, are in Cluster 1. Not unexpectedly, the majority of our African-American lupus patients (98 of 168 patients) were in this cluster, which is also enriched with our male lupus patients (28 of 52 male patients in our cohort) (Table 1).

Patients in Cluster 2 (25%, n=179) were more likely to have non-chronic cutaneous involvement including malar rash (70% of patients, P=1.77E-39) and photosensitivity (79% of patients, P= 7.11E-56), and arthritis (91% of patients, P= 0.0057). Cluster 3 (38%, n=275) is characterized by oral ulcers in the vast majority of patients (89%, n= 246) and has the lowest rate of renal involvement among all 3 clusters (7%) (Table 1).

We next determined the odds of developing specific lupus features for patients in any given cluster (Table 2). Patients in Cluster 1 were 3.7 and 6.25 times more likely to develop lupus renal involvement compared to Clusters 2 and 3, respectively (P=5.16E-07 and 2.59E-13), and 25 and 12.5 times more likely to have hematologic involvement (P=6.25E-45 and 5.63E-41). Cluster 2 patients were 13.6 and 31.5 times more likely to have malar rash and photosensitivity, respectively, compared to Cluster 1 (P=1.85E-33 and 1.69E-51). Meanwhile, patients in Cluster 3 were ∼50 times more likely to have oral ulcers (OR=49.54, P=1.20E-76) and were protected from lupus nephritis (OR=0.16, P=2.59E-13) compared to patients in Cluster 1 (Table 2).
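
The relationship between an odds ratio below 1 and its reciprocal (for example, OR=0.16 for Cluster 3 versus Cluster 1 corresponds to 1/0.16 = 6.25 in the opposite direction) can be sketched as follows. The 2x2 counts are hypothetical, and the sketch uses the plain cross-product odds ratio with a Wald confidence interval, which may differ from the estimator used by the authors' software.

```python
import math

def odds_ratio_wald(a, b, c, d, z=1.96):
    """Cross-product odds ratio with a Wald 95% CI for a 2x2 table.

    a, b: comparison group with / without the feature
    c, d: reference group with / without the feature
    """
    estimate = (a * d) / (b * c)
    se_log = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)  # SE of log(OR)
    lower = math.exp(math.log(estimate) - z * se_log)
    upper = math.exp(math.log(estimate) + z * se_log)
    return estimate, lower, upper

# Hypothetical counts, not taken from Table 1 or Table 2.
estimate, lower, upper = odds_ratio_wald(a=20, b=80, c=60, d=40)
print(round(estimate, 3), round(lower, 3), round(upper, 3))

# Swapping which group is the reference inverts the odds ratio.
flipped, _, _ = odds_ratio_wald(a=60, b=40, c=20, d=80)
print(round(flipped, 3))

# The same inversion applied to the reported OR of 0.16:
print(round(1 / 0.16, 2))
```

Because these are ratios of odds rather than of probabilities, a value such as 6.25 describes the fold difference in odds between clusters.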

Table 2.

Odds ratios for differences in clinical characteristics and manifestations between the lupus subgroups identified in our study. Odds ratio values and 95% confidence intervals (CI) in Clusters 2 and 3 versus Cluster 1 are depicted.

A correlation analysis between the 11 ACR criteria was performed in our lupus patients. We detected a significant positive correlation between fulfilling the immunologic disorder criterion and both renal involvement and hematologic disorder (P<0.001 and <0.01, respectively). The presence of either photosensitivity or oral ulcers in our lupus patients was negatively correlated with the presence of renal disorder, hematologic disorder, and immunologic disorder (P<0.001 for all correlations) (Figure 2).

Figure 2:

Correlation matrix of 11 ACR criteria reported for 724 lupus patients included in our study. Correlation values were calculated using Spearman’s Rho test. P values were adjusted for multiple testing using the Benjamini-Hochberg false discovery rate method and adjusted P values are reported. *, P < 0.05; **, P < 0.01; ***, P < 0.001.
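
For readers unfamiliar with how Spearman's rho behaves on binary criteria, a minimal sketch follows. It computes rho as the Pearson correlation of tie-averaged ranks; for two binary variables this is equal to the phi coefficient of the 2x2 table. The two vectors are hypothetical and do not represent patients in this cohort, and the function names are illustrative.

```python
import math

def average_ranks(values):
    """1-based ranks, with tied values sharing the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for pos in range(start, end + 1):
            ranks[order[pos]] = mean_rank
        start = end + 1
    return ranks

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))

# Hypothetical presence/absence of two criteria in 8 patients.
photosensitivity = [1, 1, 1, 0, 0, 0, 1, 0]
renal_disorder = [0, 0, 1, 1, 1, 0, 0, 1]
rho = spearman_rho(photosensitivity, renal_disorder)
print(round(rho, 2))
```

A negative rho here simply means that the two criteria tend not to occur together in the same patients; it does not by itself establish a causal or protective relationship.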

Discussion

Systemic lupus erythematosus is a heterogeneous remitting-relapsing chronic autoimmune disease. In this report, we describe a lupus cohort from a single tertiary referral center in Western Pennsylvania. Clustering analysis based on the ACR classification criteria for systemic lupus erythematosus identified 3 distinct clinical lupus clusters. Of our lupus patients, 37% are within a cluster of more severe disease characterized by renal and hematologic involvement, 25% are in a cluster characterized by malar rash and photosensitivity, and the remaining 38% are in a cluster characterized by the presence of oral ulcers. Patients in the latter two clusters have less severe lupus with a significantly lower frequency of organ complications such as renal involvement. Intriguingly, our data suggest that the presence of photosensitivity or oral ulcers in lupus patients is protective against the development of lupus nephritis.

Clinical clustering in heterogeneous diseases helps to identify disease subsets and might have value in predicting patterns of disease involvement, expected disease severity, and organ complications 20. In lupus, our data suggest 3 clinical disease subsets with distinct patterns of clinical manifestations and differences in the odds of developing organ involvement. These data might have implications for the management of lupus patients.

Whether differences in the molecular mechanisms underlying lupus influence or determine the clinical clustering we observed in our patients remains to be determined. If that were the case, then clinical clustering might be a useful tool to reduce disease heterogeneity in lupus clinical trials, with the premise that this might improve the likelihood of achieving successful outcomes in lupus trials 21.

Our results are derived from a single cohort of lupus patients, and might not necessarily reflect the clinical subsets of lupus in other cohorts from different geographic locations or different ancestral groups. Expanding these observations and examining clinical clustering in lupus patients across different ancestries and locations is certainly warranted.

In summary, we describe lupus patients from our Lupus Cohort at the University of Pittsburgh Medical Center and identify distinct clinical subsets of lupus characterized by specific patterns of disease and organ involvement. These data might have implications for the clinical care of lupus patients. Further, clinical clustering might be a useful tool to reduce disease heterogeneity and improve outcomes in clinical trials in lupus and similar complex autoimmune diseases.

Data Availability

All data are provided in the manuscript.

Acknowledgements

This work was supported by the National Institute of Allergy and Infectious Diseases of the National Institutes of Health grant number R01AI097134, and the Lupus Research Alliance.

Footnotes

  • Conflict of interest: The authors have declared that no conflict of interest exists

References

  1. Tsokos, G.C. (2020). Autoimmunity and organ damage in systemic lupus erythematosus. Nat Immunol 21, 605–614.
  2. Hughes, T., Adler, A., Merrill, J.T., Kelly, J.A., Kaufman, K.M., Williams, A., Langefeld, C.D., Gilkeson, G.S., Sanchez, E., Martin, J., et al. (2012). Analysis of autosomal genes reveals gene-sex interactions and higher total genetic risk in men with systemic lupus erythematosus. Ann Rheum Dis 71, 694–699.
  3. Coit, P., Ognenovski, M., Gensterblum, E., Maksimowicz-McKinnon, K., Wren, J.D., and Sawalha, A.H. (2015). Ethnicity-specific epigenetic variation in naive CD4+ T cells and the susceptibility to autoimmunity. Epigenetics Chromatin 8, 49.
  4. Harley, J.B., Chen, X., Pujato, M., Miller, D., Maddox, A., Forney, C., Magnusen, A.F., Lynch, A., Chetal, K., Yukawa, M., et al. (2018). Transcription factors operate across disease loci, with EBNA2 implicated in autoimmunity. Nat Genet 50, 699–707.
  5. Weeding, E., and Sawalha, A.H. (2018). Deoxyribonucleic Acid Methylation in Systemic Lupus Erythematosus: Implications for Future Clinical Practice. Front Immunol 9, 875.
  6. Li, H., Tsokos, M.G., Bickerton, S., Sharabi, A., Li, Y., Moulton, V.R., Kong, P., Fahmy, T.M., and Tsokos, G.C. (2018). Precision DNA demethylation ameliorates disease in lupus-prone mice. JCI Insight 3.
  7. Sanchez, E., Nadig, A., Richardson, B.C., Freedman, B.I., Kaufman, K.M., Kelly, J.A., Niewold, T.B., Kamen, D.L., Gilkeson, G.S., Ziegler, J.T., et al. (2011). Phenotypic associations of genetic susceptibility loci in systemic lupus erythematosus. Ann Rheum Dis 70, 1752–1757.
  8. Coit, P., Renauer, P., Jeffries, M.A., Merrill, J.T., McCune, W.J., Maksimowicz-McKinnon, K., and Sawalha, A.H. (2015). Renal involvement in lupus is characterized by unique DNA methylation changes in naive CD4+ T cells. J Autoimmun 61, 29–35.
  9. Mok, A., Solomon, O., Nayak, R.R., Coit, P., Quach, H.L., Nititham, J., Sawalha, A.H., Barcellos, L.F., Criswell, L.A., and Chung, S.A. (2016). Genome-wide profiling identifies associations between lupus nephritis and differential methylation of genes regulating tissue hypoxia and type 1 interferon responses. Lupus Sci Med 3, e000183.
  10. Renauer, P., Coit, P., Jeffries, M.A., Merrill, J.T., McCune, W.J., Maksimowicz-McKinnon, K., and Sawalha, A.H. (2015). DNA methylation patterns in naive CD4+ T cells identify epigenetic susceptibility loci for malar rash and discoid rash in systemic lupus erythematosus. Lupus Sci Med 2, e000101.
  11. Merrill, J.T., Manzi, S., Aranow, C., Askanase, A., Bruce, I., Chakravarty, E., Chong, B., Costenbader, K., Dall’Era, M., Ginzler, E., et al. (2018). Lupus community panel proposals for optimising clinical trials: 2018. Lupus Sci Med 5, e000258.
  12. Hochberg, M.C. (1997). Updating the American College of Rheumatology revised criteria for the classification of systemic lupus erythematosus. Arthritis Rheum 40, 1725.
  13. Gower, J.C. (1971). A General Coefficient of Similarity and Some of Its Properties. Biometrics 27, 857–871.
  14. Charrad, M., Ghazzali, N., Boiteau, V., and Niknafs, A. (2014). NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set. Journal of Statistical Software 61.
  15. Reynolds, A.P., Richards, G., de la Iglesia, B., and Rayward-Smith, V.J. (2006). Clustering Rules: A Comparison of Partitioning and Hierarchical Clustering Algorithms. Journal of Mathematical Modelling and Algorithms 5, 475–504.
  16. Kaufman, L., and Rousseeuw, P.J. (1987). Clustering by means of Medoids. (Amsterdam: North-Holland).
  17. Rousseeuw, P.J. (1987). Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Journal of Computational and Applied Mathematics 20, 53–65.
  18. Aragon, T.J. (2020). epitools: Epidemiology Tools.
  19. Makowski, D., Ben-Shachar, M., Patil, I., and Lüdecke, D. (2020). Methods and Algorithms for Correlation Analysis in R. Journal of Open Source Software 5.
  20. Stafford, I.S., Kellermann, M., Mossotto, E., Beattie, R.M., MacArthur, B.D., and Ennis, S. (2020). A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases. NPJ Digit Med 3, 30.
  21. Dall’Era, M., Bruce, I.N., Gordon, C., Manzi, S., McCaffrey, J., and Lipsky, P.E. (2019). Current challenges in the development of new treatments for lupus. Ann Rheum Dis 78, 729–735.
Posted November 16, 2020.