Thromboembolic risk in lung cancer patients receiving systemic therapy

In this retrospective study, we investigated the influence of chemotherapy and immunotherapy on thromboembolic risk among US Veterans with lung cancer during their first six months of systemic therapy. Patients in the study cohort received treatment with common frontline agents that were divided into four groups: chemotherapy alone, immunotherapy alone, combination of chemo- and immunotherapies, and molecularly targeted therapies. The latter served as a control group of systemically treated lung cancer patients who received neither chemotherapy nor immunotherapy. The cohort experienced a 6.8% overall incidence of thromboembolic events with a median time to event of 49 days, but the analysis demonstrated significantly different rates among the different treatment groups. We explored models incorporating multiple confounding variables as well as the competing risk of death, and these results indicated that both chemotherapy and immunotherapy were associated with an increased incidence of thrombosis, either when given alone or combined, compared with the control group (6.91%, 9.09%, and 7.47% respectively versus 3.68%, p < 0.024). Both the Khorana score assessing thrombosis risk for cancer patients and the Charlson comorbidity score were found to be associated with increased risk of thrombosis in our analyses. Paradoxically, we found an association between risk of thrombosis and the use of prophylactic anticoagulation or aspirin during the first month of systemic treatment, accounting for several confounding variables including a patient's prior history of thrombosis. Additionally, our data suggest that thromboembolic events may occur more commonly in lung cancer patients treated with immunotherapy compared with chemotherapy. Further study is warranted to better determine the drivers of thromboembolic risk and to identify ways to mitigate this risk for patients.


Introduction
Venous thromboembolism (VTE) is a common complication of cancer with an incidence of 1-20% (1). While randomized trials have shown prophylactic anticoagulation roughly halves the relative risk of cancer-associated VTE, the absolute risk reduction among ambulatory patients with cancer is relatively modest, given that the baseline VTE risk in this population is about 3-5% in the first months of chemotherapy (2). Bleeding complications associated with anticoagulation are more common in cancer patients compared with those without cancer, and in patients at low risk of thromboembolism, this means that the risks of anticoagulation may exceed the benefits. Thus, risk stratification is an important consideration when deciding whether to recommend prophylactic anticoagulation in the cancer setting.
Arterial and venous thromboembolic events (TEEs) in cancer patients are associated with several known risk factors such as age, type and stage of cancer, hospitalization, and indwelling catheters. Active chemotherapy treatment has been shown to increase the incidence of TEEs in patients with breast cancer (3,4) and lung cancer (5,6). While cancer itself was associated with a 4.1-fold increase in the risk of thrombosis, the addition of chemotherapy increased the risk to 6.5 times the baseline level in a large population-based case control study (7). Lung cancer is among the most common cancer types; an estimated 2.1 million new cases were diagnosed in 2018, resulting in an estimated 1.8 million deaths (8). Over half of newly diagnosed patients are considered incurable due to the presence of metastatic disease (9) but are eligible for systemic therapy.
Current standard of care for first line treatment of advanced stage non-small cell lung cancer (NSCLC) varies depending on histology, the presence or absence of driver mutations, and programmed death ligand-1 (PD-L1) expression levels (10)(11)(12). Approximately 25-40% of NSCLCs have a PD-L1 tumor proportional score (TPS) of 50% or higher (13). Currently recommended first-line therapies for patients with higher PD-L1 TPS levels and without a driver mutation are either a single agent checkpoint inhibitor (e.g. pembrolizumab, nivolumab) or checkpoint inhibitor plus pemetrexed and platinum regimen. For patients with a confirmed and actionable driver mutation, first line therapy may employ molecularly targeted agents (e.g. crizotinib, erlotinib).
In this large retrospective population-based cohort study, we determined the TEE incidence and identified associated VTE risk factors among 95,466 lung cancer patients receiving systemic therapy.

Methods
We performed a retrospective cohort study of US national health records available within the Veterans Health Administration's (VA) Corporate Data Warehouse (CDW). Our VA CDW derived database contains individually identifiable clinical and demographic information from the 1990s through 2020 for ~19 million individual Veterans who received care provided or paid for by the VA. All work was performed under local IRB approval.
The inclusion criteria for this study were a diagnosis of a primary malignant neoplasm of lung or bronchus and subsequent treatment with any of fourteen systemic therapies. The specific chemotherapy agents studied were cisplatin, carboplatin, paclitaxel, docetaxel, pemetrexed, etoposide, and gemcitabine, and the immunotherapy agents studied were pembrolizumab and nivolumab. The molecularly targeted therapies included were crizotinib, alectinib, gefitinib, erlotinib, and afatinib. Diagnoses were identified in the database using ICD-9 or ICD-10-CM codes (Appendix 1), and therapeutic agents were identified by name using RxNorm (14). Using custom Structured Query Language (SQL) and R scripts (Appendix 2), 95,466 Veterans meeting these criteria were identified. We divided the overall cohort into four groups based on type of initial systemic therapy administered: 1) conventional chemotherapy, 2) immunotherapy, 3) combination chemo-and immunotherapies, and 4) molecularly targeted therapy (nonchemo non-immunotherapy control group). Characteristics of the overall patient cohort are shown in Table 1.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint Initial cancer diagnosis was defined as the first occurrence in the patient's record of one of the specified lung cancer diagnoses, and elapsed time until treatment was calculated using the initial diagnosis date and the beginning of systemic therapy with any of this study's specified agents.
Incidence of and number of days to any TEE was assessed for a 180-day period following a patient's first treatment with one of the specified systemic therapies. TEEs were defined as deep vein thrombosis, pulmonary embolism, stroke, or myocardial infarction and identified in the database using ICD-9 or ICD-10-CM codes (Appendix 1). Patients who experienced two or more TEEs simultaneously were counted as a single event, and for those who experienced multiple events at different times, only the first event was considered. History of prior TEE was defined as the occurrence in the patient's record of any of the specified TEE diagnoses at any time before the first day of systemic therapy with any of this study's specified agents. Patients who experienced a TEE on the first day of systemic therapy were excluded.
The 5,377 patients lost to follow up during the study period were censored from the analyses at the time of last follow up. Of the patients who did not die during the study, the average follow-up time was 179 days.
We calculated the Khorana score (as per Table 2) for prediction of VTE risk for our cohort (15). Pretreatment platelet counts, hemoglobin levels, and leukocyte counts were extracted from the database, using most recent value up to 90 days prior to start of systemic therapy. Body mass index (BMI) at the approximate time of treatment initiation was calculated using custom R and SQL scripts. We omitted 21,023 patients without sufficient laboratory data for Khorana score calculation, resulting in 74,443 patients with Khorana scores. The Charlson comorbidity index score at the beginning of systemic cancer treatment was calculated using custom R and SQL scripts (Appendix 2), adapted from published methods (16-18). Diagnoses pertaining to the index categories and occurring before the start of cancer therapy were extracted from the database and scored according to the index algorithm (Table 3). We omitted 187 patients without sufficient data for Charlson score calculation, resulting in 95,279 patients with Charlson scores. We also examined the presence and effects of prophylactic anticoagulation and prophylactic aspirin. Anticoagulant drugs studied were warfarin, low molecular weight heparin, enoxaparin, dalteparin, apixaban, rivaroxaban, dabigatran, and edoxaban. A patient was considered to be on prophylactic anticoagulation or aspirin if they had a prescription filled within a window of 5 days prior to 30 days after start of systemic therapy, so long as the prescription preceded (by at least one day) any diagnosed TEE while on treatment. All drugs were identified in the database by name using RxNorm (14).

Results
We first studied the incidence of TEE in our cohort, stratified by type of systemic therapy (chemotherapy, immunotherapy, combined chemo-and immunotherapies, or non-chemo non-immunotherapy control).
Note that the majority of events observed were pulmonary embolus (52.0%), followed by deep vein thrombosis (20.4%), cerebral infarction (18.7%), and acute myocardial infarction (6.79%); the remaining events were of multiple types. We identified an overall TEE rate of 6.8% (n = 6,493) within the first 180 days of treatment initiation, with a median time to TEE of 49 days (range 1 -180). Because the majority (88.0%) of our cohort received only chemotherapy, the TEE rate and median time to TEE were similar in the chemotherapy alone group compared to the cohort overall. A significantly higher proportion of patients receiving immunotherapy alone (9.09%, n = 234, p < 0.02) experienced a TEE with a median time to event of 32.5 days (range 1 -175 days), and a significantly lower proportion of patients receiving the non-chemo non-immunotherapy control agents (3.68%, n = 205, p < 0.001) experienced a TEE with a median time to event of 26 days (range 1 -178 days). Results for all groups are summarized in Table 4. Using Kaplan-Meier cumulative risk analysis, we noted that patients who received immunotherapy alone were not only most likely to experience a TEE, but that TEE in this group occurred sooner after the start of treatment when compared to patients receiving chemotherapy alone, combined chemo-and immunotherapies, or neither of these therapies ( Figure 1). These results were unchanged when using a competing risk analysis accounting for an observed death rate of 26.5% by 180 days (Supplementary Figure 1). is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020.

Figure 1. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Treatment Type
Risk of experiencing a TEE during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by treatment group: immunotherapy alone (red), chemotherapy alone (green), or combination of both chemo-and immunotherapies (blue), as well as a non-chemo non-immunotherapy control group (purple). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects a significant difference in cumulative risk of TEE for the immunotherapy v. chemotherapy (alone or combined) groups, the control v. chemotherapy (alone or combined) groups, and the immunotherapy v. control groups. P-value computed by logrank.
Because platinum-based chemotherapy agents have been shown to be more thrombogenic than other chemotherapy agents (27-29), we subdivided the treatment group who received chemotherapy alone into a group that received at least one platinum-based agent (n=77,522) and a group that received no platinumbased agents during their cancer treatment (n=6,459). These two subgroups differed significantly by Wilcox test in time to but not incidence of TEE (Supplementary Table 1). However, the Kaplan-Meier plot modeling both time to and incidence together demonstrates the similarity of the risk of TEE in these two subgroups ( Figure 2).

Figure 2. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Chemotherapy Regimen
Risk of experiencing a thromboembolic event during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by chemotherapy regimen: not platinum-based agents (red) or platinum-based agents (blue). The two groups did not experience statistically significant different risks of events (p = 0.77). Pvalue computed by log-rank.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020.
A prior history of TEE elevates a patient's risk of experiencing a TEE during systemic therapy (30, 31) and may play a role in the decision to prescribe prophylactic anticoagulant or aspirin. We hypothesized that either or both of these factors could significantly affect observed TEE rates in our cohort and could be influencing the differences we observed among systemic treatment groups. We therefore analyzed TEE in our study cohort according to prior history of TEE and prophylactic anticoagulation or aspirin use, alone and in interaction. Surprisingly, we found that patients receiving prophylactic anticoagulation or aspirin experienced TEE at both a higher rate and a shorter interval than patients who did not (Supplementary Figure 2 and Supplementary Figure 3, respectively). In particular, the cohort receiving prophylactic anticoagulation had a much higher incidence of TEE compared to those receiving aspirin. While additional undetermined clinical risk factors could be significantly influencing patient TEE outcomes, we assessed the incidence of a prior history of TEE and its influence on subsequent TEE risk in this context and note that it accounts for the majority of increased TEE risk (Supplementary Figure 4). However, the use of prophylactic anticoagulation or aspirin does remain associated with a higher incidence of TEE ( Figure 3 and Figure 4, respectively).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ;

Figure 3. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Prior TEE and Prophylactic Anticoagulation Use
Risk of experiencing a thromboembolic event during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by occurrence of prior TEE and prophylactic anticoagulation use: the presence of neither (red), the presence of a prior TEE and of anticoagulation use (purple), the presence of a prior TEE without anticoagulation use (green), or the use of prophylactic anticoagulation without a history of TEE (blue). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects significant differences in cumulative risk of TEE. P-value computed by log-rank.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ;

Figure 4. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Prior TEE and Aspirin Use
Risk of experiencing a thromboembolic event during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by occurrence of prior TEE and prophylactic aspirin use: the use of aspirin without a history of TEE (blue), neither (red), the presence of a prior TEE and aspirin use (purple), or the presence of a prior TEE without aspirin use (green). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects significant differences in cumulative risk of TEE. P-value computed by log-rank.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; We used the Cox proportional hazard model to quantify the effects of the different systemic treatments on patients' risk of TEE ( Figure 5). In the univariate model, patients treated with immunotherapy experienced 2.5 times greater risk of TEE compared to controls, while patients treated with chemotherapy experienced 1.7 times greater risk; whereas, combining chemo-and immunotherapies appeared to offset risk of TEE by 2.4-fold. To assess the relative importance of additional risk factors, we therefore created multivariate models describing the effects of the different systemic treatments combined with prophylactic anticoagulation or aspirin use, history of prior TEE, age, elapsed time between diagnosis and systemic treatment, Khorana score, and Charlson comorbidity score. When accounting for these covariates, the overall influence of treatment type was diminished, but the relative effects of the different treatment types remained the same ( Figure 6). Unsurprisingly, as both Khorana and Charlson scores increase, so does the risk of TEE. Increased age and increased time between initial diagnosis and systemic treatment both demonstrated a slight protective effect, likely due to a more robust underlying health of patients who could either safely delay systemic treatment or be considered good candidates for systemic treatment even at an advanced age. Unsurprisingly, prior TEE was a strong predictor of subsequent TEE (hazard ratio of 4.04, p < 0.001). However, both prophylactic aspirin and anticoagulant use paradoxically increased the residual risk of TEE (hazard ratios of 1.10 and 1.38, respectively, p < 0.04), despite their known TEE-preventative effects; we suspect that this may arise from other latent clinical characteristics influencing prophylactic prescription rates.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020.

Figure 5. Cox Proportional Hazard Model of Effect on TEE of Treatment Type
Hazard ratios are represented on the x-axis, with a neutral value of 1 represented by the vertical dotted line and corresponding to the relative TEE risk among controls. The calculated value for hazard ratio and the 95% confidence interval are shown for both chemotherapy and immunotherapy. P-values for each treatment type are displayed on the right-hand side of the plot, and the global p-value and Akaike information criterion (AIC) estimate are in the lower left corner.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint

Figure 6. Cox Proportional Hazard Model of Effect on TEE of Treatment Type and Covariates
Hazard ratios are represented on the x-axis, with a neutral value of 1 represented by the vertical dotted line and corresponding to the relative TEE risk among controls. The calculated value for hazard ratio and the 95% confidence interval are shown for both chemotherapy and immunotherapy and all covariates. P-values for each covariate are displayed on the right-hand side of the plot, and the global p-value and Akaike information criterion (AIC) estimate are in the lower left corner.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint We also used a competing risks model to determine Fine-Gray subdistribution hazards of TEE for the systemic treatments and covariates in consideration of the competing risk of death (Table 5). Treatment with chemotherapy demonstrated 1.2 times greater risk of TEE under this model, but the effect of treatment with immunotherapy no longer showed a significant effect; the hazard ratios of the covariates were similar to those developed in the Cox model. Finally, in recognition of the strong effect on our study cohort of prior history of TEE on the risk of TEE during systemic treatment of lung cancer, we eliminated the 8,351 patients who had experienced a TEE before beginning lung cancer therapy, resulting in a reduced cohort of 87,115 Veterans. This change substantially reduced the cumulative risk of TEE experienced by the group who received immunotherapy alone; nonetheless, both chemotherapy and immunotherapy retained an increased risk of TEE ( Figure 7).
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ;

Figure 7. Cumulative Risk of Thrombosis in Lung Cancer Patients with no prior history of TEE, stratified by Treatment Type
Risk of experiencing a TEE during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by treatment group: chemotherapy alone (green), immunotherapy alone (red), or combination of both chemotherapy and immunotherapies (blue), as well as a non-chemo non-immunotherapy control group (purple). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the yaxis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects a significant difference in cumulative risk of TEE for the immunotherapy v. chemotherapy (alone or combined) groups, the control v. chemotherapy (alone or combined) groups, and the immunotherapy v. control groups.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ;

Discussion
To the best of our knowledge, this is the largest study to-date exploring thromboembolic risk in the setting of lung cancer, and the first to report on these risks among a cohort of US Veterans. This is also the largest study to suggest a correlation in cancer patients between checkpoint inhibitor immunotherapy and increased thromboembolic risk. We surprisingly found that lung cancer patients on prophylactic anticoagulation or aspirin had a higher TEE incidence compared with those not on prophylaxis, and we speculate this counterintuitive finding may be due to latent clinical variables associated with a higher baseline thrombosis risk, in particular, but not wholly, the existence of a prior TEE.
Thromboembolic events have not been reported in phase III trials evaluating checkpoint inhibitors in patients with cancer, and little is known about thromboembolic risk in this setting. In one secondary analysis within a prospective observational protocol of 217 NSCLC patients treated with checkpoint inhibitors, an elevated thrombosis incidence of 13.8% was noted (32). Other small, single institutional studies have also observed a correlation between checkpoint inhibitor therapy and thrombosis in cancer patients (33, 34). In our study, we found a potential increased incidence of thromboembolic events among lung cancer patients who received first line checkpoint inhibitor monotherapy compared with chemotherapy either alone or combined with immunotherapy. These data likely merit further clinical and translational studies to investigate the potential thrombogenicity of checkpoint inhibitors in the lung cancer setting (whether alone or in combination with chemotherapy) and argue for incorporating this therapeutic class into thrombosis risk models and recommendations for prophylactic anticoagulation.
In the present study, we also found evidence of a weak correlation between thrombosis risk and the Khorana score (a widely utilized VTE risk model in cancer patients). However, we note that other studies have been unable to demonstrate a significant association between the Khorana score and VTE in lung cancer patients (35-37). These data call into question the routine clinical utility of the Khorana Score in the lung cancer setting.
We note differences in the overall observed TEE rates in our lung cancer cohort treated with systemic therapy compared to the published literature. For instance, a retrospective analysis of 204 lung cancer patients treated with cisplatin-based chemotherapy, found an 11.8% incidence of TEE during treatment or within 4 weeks of the last treatment (27). While this incidence is higher than the cumulative incidence we observe (6.80%), we note this discrepancy could be explained by different length of observation between the two studies (our study only investigated events occurring within 180 days of treatment initiation) and the small cohort size in the prior study.
There are limitations associated with administrative databases, including numerous sources of possible bias and missing or inaccurate data. Additionally, while the cohort we analyzed was national in scope, it is most reflective of the US Veteran population, and therefore contains relatively few women. Moreover, we did not explicitly study the influence of initial disease staging, potentially relevant laboratory results such as D-dimer, family history of VTE, or non-prescription medications. We only analyzed the impact of clinical predictors at the time of treatment initiation and did not assess risk factors that may have arisen later in the treatment course. Finally, we confined analysis to lung cancers, as they are known to be at increased risk for TEEs compared to most other malignancies (15) and did not discriminate between lung cancer subtypes (e.g. small cell).
In conclusion, given the continued high rate of TEEs in lung cancer and the increased risks associated with chemotherapy and immunotherapy, future thromboprophylaxis risk-adapted trials in this setting may be warranted, particularly accounting for immunotherapy-based treatment regimens.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint Appendix 1 Diagnostic criteria for "lung cancer" and "TEE" is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ;

Identify patients with specified cancer diagnoses
A custom R function was used to connect with both the CDW and the research database and perform the following steps. The relevant dimension tables of the data warehouse were queried to determine the identifiers associated with ICD codes for the diagnoses listed in Appendix 1. All relevant views of the research database were queried for these identifiers to identify all instances of diagnoses. The resultant dataset comprised all unique instances of the specified diagnoses and associated dates, aggregated to the highest level (location-agnostic) of unique patient identifier. From this dataset, a table was created in the research database containing patient identifiers and the earliest associated date of diagnosis (one row per individual patient).

Identify patients with specified treatment agents
This portion of the work was performed in SQL. RxNorm was used to ensure all variants of agent names, as listed in the main body of the paper, were included. The names were used to determine the identifiers associated with the specified treatment agents in multiple dimension tables of the data warehouse: CPT codes, IV usage, EHR orderable items, and drugs. All relevant views of the research database were queried for these identifiers to identify all instances of an association between a patient and a treatment agent. The resultant dataset comprised all unique instances of the specified treatment agents and associated dates, aggregated to the highest level (location-agnostic) of unique patient identifier. From this . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Form cohort from intersection of patients with specified diagnoses and treatments
This portion of the work was performed in SQL. The study cohort comprised the patients represented by rows in both the created diagnosis and treatment tables, with the restriction that treatment must be both after diagnosis and within 180 days of earliest treatment (for patients treated with multiple agents). SELECT    This portion of the work was performed in SQL. Anticoagulants, as listed in the main body of the paper, were identified by name and retrieved using the same methods as described for the treatment agents. SELECT

Calculate BMI
A custom R function was used to determine each patient's BMI at the time of treatment start.

Retrieve lab information (platelets, hemoglobin, leukocytes)
This portion of the work was performed in SQL. LOINC codes were used to identify relevant lab tests. SELECT

Calculate Charlson score
A custom R function was used to determine each patient's Charlson score at the time of treatment start.

Identify TEEs
TEEs were identified using the same custom R function as described for cancer diagnoses. Using the resultant dataset, patients with a history of prior TEE were identified as any patient with an instance of a TEE diagnosis occurring before the first day of treatment, patients who experienced a TEE during treatment were identified as any patient with an instance of a TEE diagnosis between days 1 and 180 of treatment, and patients with an instance of a TEE diagnosis occurring exactly on the first day of treatment were deleted from the cohort. is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint

Participants
13*  (a) Report numbers of individuals at each stage of study-eg numbers potentially eligible, examined for eligibility, confirmed eligible, included in the study, completing follow-up, and analysed

Other information
Funding 22  Give the source of funding and the role of the funders for the present study and, if applicable, for the original study on which the present article is based *Give information separately for exposed and unexposed groups.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint

Supplementary Figure 1. Cumulative Probability of TEE and Death Stratified by Treatment Group
Risk of experiencing death (upper panel) or TEE (lower panel) during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by treatment group: chemotherapy alone (green), immunotherapy alone (blue), or combination of both chemo-and immunotherapies (orange), as well as a non-chemo nonimmunotherapy control group (red). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk along the y-axes.

Supplementary Table 1. Incidence of and Time to TEE Stratified by Chemotherapy Regimen
The incidences of TEE in chemotherapy patients receiving platinum-based or not platinum-based regimens did not differ significantly, but the median time to event was significantly shorter in patients receiving regimens without platinum-based agents.
Platinum-based n = 77,522 Not platinum-based n = 6,459 is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint

Supplementary Figure 2. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Prophylactic Anticoagulation Use
Risk of experiencing a TEE during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by prophylactic anticoagulant: presence of prophylactic anticoagulation (blue) or absence of prophylactic anticoagulation (red). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects a significant difference in cumulative risk of TEE for the two groups.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint

Supplementary Figure 3. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by Prophylactic Aspirin Use
Risk of experiencing a TEE during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by prophylactic aspirin: presence of prophylactic aspirin (blue) or absence of prophylactic aspirin (red). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects a significant difference in cumulative risk of TEE for the two groups.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint

Supplementary Figure 4. Cumulative Risk of Thrombosis in Lung Cancer Patients Stratified by History of Prior TEE
Risk of experiencing a TEE during the first 180 days after the start of treatment for lung cancer patients. The cohort is stratified by history of prior TEE: the presence thereof (blue) or the absence thereof (red). Time (in days from start of treatment) is shown along the x-axis, with cumulative risk of TEE along the y-axis. The lower table depicts number of individuals at risk for each 30-day interval. P-value reflects a significant difference in cumulative risk of TEE for the two groups.
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. (which was not certified by peer review) preprint The copyright holder for this this version posted December 22, 2020. ; https://doi.org/10.1101/2020.12.21.20248675 doi: medRxiv preprint