Transcranial magnetic stimulation language mapping analysis revisited: Machine learning classification of 90 patients reveals distinct reorganization pattern in aphasic patients.

Repetitive TMS (rTMS) allows to non-invasively and transiently disrupt local neuronal functioning. Its potential for mapping of language function is currently explored. Given the inter-individual heterogeneity of tumor impact on the language network and resulting rTMS derived functional mapping, we propose to use machine learning strategies to classify potential patterns of functional reorganization. We retrospectively included 90 patients with left perisylvian glioma tumors, world health organization (WHO) grade II-IV, affecting the language network. All patients underwent navigated rTMS language mappings. The severity of aphasia was assessed preoperatively using the Berlin Aphasia Score (BAS), which is adapted to the Aachener Aphasia Test (AAT). After spatial normalization to MNI 152 of all rTMS spots, we calculated the error rate (ER) in each cortical area by automated anatomical labeling parcellation (AAL) and used support vector machine (SVM) as a classifier for significant areas in relation to aphasia. 29 of 90 (32.2%) patients suffered from aphasia. Univariate analysis revealed 11 perisylvian AVOIs’ ERs (eight left, three right hemispheric) that were significantly higher in the aphasic than non-aphasic group (p < 0.05), depicting a broad, bihemispheric language network. After feeding the significant AVOIs into the SVM model, it showed that additional to age (w = 2.95), the ERs of right Frontal_Inf_Tri (w = 2.06) and left SupraMarginal (w = 2.05) and Parietal_Inf (w= 1.80) contributed more than other features to the model. The model’s sensitivity was 89.7%, the specificity was 82.0%, the overall accuracy was 81.1% and AUC was 88.7%. Our results transcranial demonstrate an increased vulnerability of the right inferior frontal gyrus to rTMS in patients suffering from aphasia due to left perisylvian gliomas. This confirms a functional relevant involvement of the right frontal area in relation to aphasia. While age as a feature improved our SVM model the most, the tumor location feature didn’t affect the SVM model. This finding indicates that general tumor induced network disconnection is relevant to aphasia and not necessarily related to specific lesion locations. Additionally, our results emphasize the decreasing potential for neuroplasticity with age.

interlinked brain regions encoding function within large-scale networks are currently simplified and described as associationist and hodotopical models of connectivity. (Catani et al., 2012;Duffau et al., 2014) Tumor induced neural-reorganization mechanisms further increase the challenge to map language in general. According to Duffau's and Mesulams hodotopical models of connectivity, the central nervous system (CNS) is organized in parallel networks that not only interact but also can compensate each other to a certain extent. (Mesulam, 1990;Duffau, 2014Duffau, , 2015 Neuroplasticity is a dynamic process that enables redistribution within remote networks and plays a central part in ontogeny, learning as well as recovery after brain injury. (Duffau, 2005(Duffau, , 2015 Cognitive mapping is additionally complicated by the non-specificity of tasks resulting in uncontrolled co-activations and by the patient's capability and willingness to cooperate. To understand how the brain's structural architecture supports and enables cognitive processes and further advance the current understanding of brain network, researchers are developing computational models and graph-based analysis to investigate the function of brain networks, for example to determine how neuronal activity propagates along structural For these reasons, it is necessary to critically question the performance of rTMS for mapping language function in order to better understand its potential and identify suitable use cases. In this study, we describe and investigate the clinical course of 90 patients with brain tumors in the language area and their rTMS-based language maps using spatial normalization methods and a machine learning support vector machine (SVM) model to generate data-driven classification.
were: 1. Frequent generalized seizures (more than one per week); 2. Aphasia with more than 28% errors in the baseline object naming task and 3. multiple neoplasms. All patients gave their informed written consent, in line with the declaration of Helsinki as well as approved by the local Human Research Ethics Committee.

rTMS language mapping
Navigated rTMS language mapping was performed with nTMS eXimia NBS version 3.2.2, Nexstim NBS 4.3 and NexSpeech module (Nexstim Oy, Helsinki, Finland). The rTMS language mapping was conducted as previously reported. (Schwarzer et al., 2018) The degree of discomfort or pain during the mapping was evaluated with visual analogue scale (VAS). Not properly named images, or not immediately named or named with difficulties were excluded from the following rTMS examination to limit misnaming unrelated to rTMS. The baseline was performed up to three times, in case errors were still made in the second baseline. The images remaining after the final baseline cycle were used for rTMS mapping, which covered the perisylvian cortex of both hemispheres. (Schwarzer et al., 2018) The rTMS coordinates were exported as text files for subsequent analysis and spatial normalization. rTMS induced language errors were classified into two categories: Performance errors (e.g. stuttering), phonological errors (vocalization), no responses (absence of reply) and hesitation errors (delayed response) were grouped in category I, semantic errors (misnaming error) were grouped in category II. This grouping allowed to disentangle errors that could have been caused by issues with uttering a response alone (Category I) from errors that pointed directly to an impairment of semantic processing (Category II) in picture naming.
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .

Aphasia grading
The severity of aphasia was assessed preoperative using the Berlin Aphasia Score (BAS). The BAS is used and developed by physicians of the Charit University Hospital and adapted to the Aachener Aphasia Test. (Huber et al., 1980) The test classifies patients into 4 categories: 0 = no aphasia, 1 = mild aphasia, 2 = moderate aphasia, 3 = severe aphasia, A = motor accented, B = sensory accented. (Picht et al., 2013;Schwarzer et al., 2018) All patients with a BAS score of 0 were grouped into the non-aphasic cohort, others were classified as aphasic patients.

Spatial normalization & anatomical labelling
In order to optimize the registration process to MNI ICM 152 space, we skull-stripped all T1 MPRAGE data sets applying the ANTs brain extraction tool in combination with the public ANTs/ANTsR IXI brain template (

SVM
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .
With the progression of computational power over the last two decades, SVM has received growing attention and is increasingly being used in biomedicine. SVMs are supervised machine learning models and aim to classify data points by maximizing the margin between classes. (Cortes and Vapnik, 1995) SVM can be used for nonlinear classification using kernel tricks by implicitly mapping inputs to high-dimensional feature spaces using various kernel functions. (Liu et al., 2019) In the present study we used the linear kernel function, followed by the recursive feature elimination (RFE) method for feature selection using the training data set to classify the patients into aphasic or non-aphasic groups. Features which had less than 10 % missing values were included to fit in the SVM model. Missing data were interpolated with the mean or median, depending on whether the variables followed normal distribution or not. All 90 patients were selected for training and subsequent testing. A grid search strategy was used to tune the classifier parameters. A leaveone-out cross validation (LOO-CV) approach was applied for training. Parameter C was determined to be 1.15. Due to the unbalanced data, we adjusted the weight of the aphasia group to 2.0. For machine learning coding we used MATLAB R2014b (MathWorks, Natick, MA, US) with LIBSVM.(Chang and Lin, 2011) We evaluated the performance of the SVM model by sensitivity, specificity, its overall accuracy as well as area under the receiver operating characteristic curve (AUC). The overall accuracy is the ratio of correctly predicted classification to the entire cohort in the aphasic or non-aphasic group.

Statistical analysis
Statistical analysis was performed by using MATLAB R2014b (MathWorks, Natick, MA, US) and SPSS22 IBM SPSS, Armonk, New York, US). To compare continuous variables, two tailed Student's t-tests or Mann-Whitney U tests were performed separately, according to normally distributed data. Significant effects were considered at P = 0.05. Fisher's exact test (for expected values less than five) or Pearson's chi-squared test (larger values) were applied for the comparison of parameter variables.

Data availability
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .
The data that support the findings of this study are not publicly available due to information that could compromise the privacy of the research participants but are available from the corresponding author on reasonable request.

Presurgical rTMS mapping
Presurgical rTMS speech mapping was successful in all patients and generally well tolerated. 22 patients were only mapped on their left hemisphere due to fatigue or decreasing attention level.
The mapping results of frontopolar and temporopolar cortices were not considered for analysis due to the discomfort evoked by the rTMS mapping in these areas (Fig. 2). The mean VAS score during rTMS mapping was 3.9±2.9 in the left and 3.7±2.8 in the right hemisphere. The ER of the entire brain mapping in the aphasia group (7.49 [IQR 5.82,9.53]) was significantly higher than the ER (3.48 [IQR 2.48,5.79]) of the entire brain mapping in the non-aphasia group (P < .001, Z = -4.606, η 2 = .238).
The aphasic patients showed a significantly higher ER in their left hemisphere (aphasia group: 8.00 [IQR 6.32,10.80]  author/funder, who has granted medRxiv a license to display the preprint in perpetuity. is the (which was not peer-reviewed) The copyright holder for this preprint . aphasia group showed higher ERs than the non-aphasia group (aphasia group: 1.79 (IQR 0.59,3.43); non-aphasia group: 0.65 (IQR 0,1.61); P = .003, Z = -2.959, η 2 = .098). The overall rTMS positive spot distribution showed no clear pattern, not favoring specific cortical areas (Fig.   2). Moreover, the closely matching cortical distribution of both positive and negative rTMS spot distribution further demonstrates the non-occurrence of a particular pattern.

AAL labelling
The analysis of aphasic and non-aphasic patients in relation to rTMS ERs distribution showed specific cortical patterns. The AVOIs of ERs were calculated for each patient (Supplementary Table 1). To increase the reliability of the ER results, AVOIs which were stimulated less than 250 times in the sum of all patients, resulting in a total inclusion of 28 AVOIs (Fig. 3 & Table 2).

SVM classification
We built three different SVM models. One based purely on the ERs, one with a feature for age and another with a feature for tumor location. By adding a feature for tumor location, the model could not be improved compared to the pure ER model. However, by adding a feature for age, the model was improved (improvement of accuracy from 75.6% to 81.1%). The above 11 ER AVOIs with significant differences were taken as input features for the SVM-RFE model. After RFE was embedded within a LOO-CV framework, three regions were selected as the most important features. Fig. 4 and Table 3 illustrate the weight of these three regions (Frontal_Inf_Tri_R, SupraMarginal_L, Parietal_Inf_L). For the test, sensitivity was 89.7%, specificity was 82.0%, overall accuracy was 81.1% and AUC was 88.7%. Fig. 5 illustrates the model's receiver operating characteristic (ROC) curve and indicates the AUC.

Discussion
In this study, we examined the language network in a spatially normalized cohort of 90 rTMS mapped glioma patients applying group analysis methodology. Our results confirm current neuroplasticity models from stroke research with regard to aphasia as well as tumor-induced language neuroplasticity findings. (Thiel et al., 2005;Piai, 2019;Stefaniak et al., 2019) Interestingly, the pattern of overall positive rTMS spots does not clearly differ from the pattern of overall negative rTMS spots (Fig 1). Furthermore, the rTMS-based analysis does not confirm traditional localizationist models of the language network, but rather illustrates a perisylvian distribution of cortical areas where the patients' language network was disturbed by rTMS. This . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .
fMRI experiments show asymmetric left lateralization patterns for language function in healthy subjects and thus weaken a possible hypothesis of an a priori bihemispheric configuration. (Raemaekers et al., 2018) In contrast to the rTMS ERs analysis, the SVM results do not show a proportional increase of ER in all areas, but rather an increment in specific right frontal and left parietal areas, such as Frontal_Inf_Tri_R, SupraMarginal_L, Parietal_Inf_L. This result directs us towards an interpretation of a functional shift, indicating a functional relevant involvement of the right frontal area in relation to aphasia. Furthermore, Table 2 indicates that patients with aphasia show an increased rTMS error susceptibility left and right frontally as well as a larger right perisylvian distribution of increased ERs. Moreover, while it is of crucial importance to interpret these results with regard to the tumor location, the location feature didn't affect the SVM model, whereas the age feature improved the SVM model. This result is confirmed by an early study predicting language dysfunction, that correlated age and tumor grade but not tumor location with aphasia. (Recht et al., 1989) This finding supports the notion that general tumor induced network disconnection is relevant to aphasia and not necessarily related to specific lesion locations.
Additionally, the results emphasize the decreasing potential for neuroplasticity with age, confirming earlier studies. (Lu et al., 2004) The relationship of aphasia and the right hemisphere's susceptibility to rTMS induced errors points towards rTMS being a marker for language network impairments and resulting reorganization. By analyzing the overall increase of errors in aphasic patients it can be stated that even after baseline correction of the object-naming image stack, patients with neuropsychological impairment tend to make more errors during mapping of both hemispheres.
This unspecific effect can be attributed to purely false positive errors or to an unspecific increase of the language network's susceptibility to rTMS disruption of language processing. In a previous study a cut-off value of 28% errors during baseline naming of the object naming task was identified, above which the naming task cannot be reliably enough performed by the subjects. (Schwarzer et al., 2018) Yet, the SVM results demonstrate that the overall increase in ER over both hemispheres in the aphasic patients manifests in a disproportional increase of errors on the right cortex, sustaining a specific effect of left hemispheric pathology on functional reorganization, specifically highlighting the relevance of rIFG, as previously claimed. (Thiel et al., 2005;Rosler et al., 2014) The functional relevance of the right hemispheric areas with . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .
increased vulnerability towards rTMS is still unclear. In stroke studies recruitment of the rIFG after left hemispheric stroke was associated with unfavorable outcome compared to cases with predominant regional ipsilateral recruitment. (Winhuisen et al., 2007) Yet, in brain tumor patients, there is evidence that a shift towards the right hemisphere is associated with better outcome after left hemispheric surgery, sustaining the hypotheses of increased functional reserve in patients with more bilateral distribution of language function. (Ille et al., 2016)

Limitations
The number of rTMS stimulations per area is heterogenous, with the discomfort evoked by the current rTMS methodology limiting cortical coverage. In addition, the use of only one task (object naming) can induce a systematic error due to potential location specificity. (Krieg et al., 2017) Also, error annotation is user dependent leading to difficulties to compare results across institutions and warranting objective measures for analysis of language performance. Finally, as long as task performance is necessary, cognitive mapping in general is heavily depending on the patient's performance, inducing a difficult to control bias.

Conclusion
The results of this study based on group analysis, cortical parceling and machine learning classification with an SVM model, show that the pattern of rTMS-induced errors in aphasic patients differs distinctly from the pattern in non-aphasic patients, with the right inferior frontal gyrus in particular indicating increased vulnerability to rTMS. Furthermore, the use of SVM classification offers new perspectives for rTMS data analysis and allows new interpretations of the method, which contributes to our understanding of rTMS in tumor patients with challenged neuronal networks. While reliable mapping of the functional language-network remains a major challenge in individual brain tumor patients, the results of this study suggest that machine learning adds to detecting distinct patterns of functional reorganization in patients with language eloquent brain tumors. This study constitutes the first machine learning based classification of rTMS language mapping results. . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .   . CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Figure captions
is the (which was not peer-reviewed) The copyright holder for this preprint .

Table1
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the (which was not peer-reviewed) The copyright holder for this preprint .

Table3
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.

Figure1
. CC-BY 4.0 International license It is made available under a author/funder, who has granted medRxiv a license to display the preprint in perpetuity.
is the

(which was not peer-reviewed)
The copyright holder for this preprint