IE-Map: A novel in-vivo atlas template of the human inner ear

Brain atlases and templates are core tools in scientiﬁc research with increas-ing importance also in clinical applications. Advances in neuroimaging now allowed us to expand the atlas domain to the vestibular and auditory organ, the inner ear. In this study, we present IE-Map, an in-vivo template and atlas of all known substructures of the human labyrinth derived from multi-modal high-resolution magnetic resonance imaging data in a non-invasive manner (no contrast agent or radiation). We reconstructed a common template from 126 inner ears (63 normal subjects) and annotated it with 94 established landmarks and semi-automatic segmentations. Quantitative substructure analysis revealed a correlation of labyrinth parameters with total intracranial volume. No eﬀects of gender or laterality were found. We provide the validated templates, atlas segmentations, surface meshes and landmark annotations as open-access material, to provide neuroscience researchers and clinicians in neurology, neurosurgery, and otorhinolaryngology with a widely applicable tool for computational neuro-otology. a post-correction alpha level of p < 0 . 05 . For the main general inner ear measures (total inner ear volume, total inner ear length, oblique cochlea height, and radii of anterior/posteri-or/lateral SCCs), we investigate a potential correlation with overall head size,


Introduction
The pervasiveness of brain imaging techniques across many disciplines has increased the relevance of standardized templates of brain structures and developing atlases for the structural and functional substructures within these templates. Almost the entire human central nervous system has been mapped: various templates and atlases have charted out the human cortex, the subcortical nuclei, charting the thalamus and the cerebellum [1, 2, 3, 4] based on MRI.
Recently, a dedicated neuroimaging template was also developed for the brain stem and spinal chord [5]. These templates provide a means to make group comparisons and to do quantitative assessments of individual patients, and as such have found applications in neurology [6,7], in neurosurgery [8,9] and in neuroscience [10]. In comparison, few works have proposed templates and corresponding atlases to chart the anatomy of sensory organs like the inner ear, and the few existing studies have focused on ex-vivo imaging using (micro-)CT, for comparative neuroanatomy in the field of anthropology.
The inner ear contains several important substructures [11,12]: (1) The three semicircular canals with their ampullae that shelter the cupula containing the sensory hair cells, which detect head rotation. (2) The macular organs (utricle and saccule), which respond to translational head motion and convey the gravitational force vector [13]. Anatomically, the utricle and the saccule together form the vestibule. (3) The cochlea, which is a separate inner ear organ for hearing. The cochlea consists of the scala vestibuli, the scala tympani and the cochlear duct (with hair cells). These structures spiral 2.57 ± 0.28 times from the base to the cochlear cupula, or apex of the cochlea [12]. Physiological and morphological alterations of the cochlea can result in hearing deficits or loss whereas affections of the semicircular canals or otoliths lead to vertigo and balance disorders [14,15]. The shape of the labyrinth structures carries valuable functional and phylogenetic information about the inner ear over the course of 2 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint mammalian evolution [16]. Inner ear imaging aids neurological examination and neuroscientific analysis. A quantitative assessment of the inner ear organs often requires a preceding segmentation of the relevant target regions and a subparcellation into its key structures like the semicircular canals, vestibule and cochlea. As has been shown for brain images, segmentation with a template image and atlas is a versatile and robust tool for anatomic delineation [17].
Beyond deformable atlas segmentation, image templates and structural atlases are fundamental tools for statistical shape analysis [18,8] and group-based image analysis [19].
Previous anatomical and morphological analyses of the inner have emphasised the cochlea and avoided the complex anatomy of the peripheral vestibular system. Models of the cochlea have been assessed for cross-species comparative neuroanatomy [16,20,21,22], neuronavigation of cochlear electrode implantation and simulation [23,24,25], cochlear shape modeling and surgical trauma assessment [12]. In the field of anthropology, previous studies have investigated the inner ear through shape comparison of labyrinthine structures in ancient Egyptian mummies versus today [26], or in form and function of the mammalian inner ear across species [27].
The majority of these analyses were based on ex-vivo micro-CT as the imaging modality [28,29], few also providing cyto-architecture scans based on synchrotron radiation [30] or ex-vivo ultra-high-field MRI [31]. A few studies have reported detailed measurements of the inner ear's membraneous structures [32], and of the semicircular canals in particular, again based on ex-vivo micro-CT [33,21] and multi-detector CT [34]. Recently, in-vivo visualization of the inner ear with MRI, either with [35,36] or without Gadolinium-based contrast agents [37,38,39,40,41] has been used for the detection of endolymphatic hydrops [39] and for diagnosis of sensorineural hearing loss associated with inner ear lesions [42]. Furthermore, MRI sequences, such as Constructive Interference in Steady State (CISS) imaging, which have low susceptibility and a high contrastto-noise ratio, have been used to image the inner ear for the detection of leakage of labyrinthine and cerebrospinal fluids [43].

3
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint The purpose of this study was to build a template and atlas of the human inner ear and its substructures, based on multiple non-invasive in-vivo MRI sequences. By harnessing the signal and contrast-to-noise advantages of multiple MRI sequences, all without invasive contrast agents or radiation, we believe the inner ear imaging procedures and corresponding template will have a widespread clinical applicability. We then demarcated the anatomical substructures of the inner ear by registering a previously available inner ear atlas, based on micro-CT, to our template. With this atlas, quantitative size and volume information of the inner ear substructures can be extracted for each individual and side. After validation of the template and atlas with the previous literature, we demonstrate the usability of the template and atlas by examining three questions about the population statistics of the inner ear. 1. Does head size, as measured by intracranial volume, influence labyrinth measurements? 2.
Does gender have an effect in inner ear structure size? 3. Does ear laterality play a role in inner ear dimensions? In summary, we successfully created an inner ear template and atlas that can be applied to a wide range of MRI sequences. Our template and atlas are comprehensive, applicable, accessible, and generalizable, forming the basis for future non-invasive neuroimaging studies of the inner ear under the umbrella term computational neuro-otology.

Subject cohort
The template was created from a cohort of 63 healthy, right-handed subjects ( an optimized contrast between tissue types for the atlas generation, to detect brain pathologies and achieve whole-head coregistration for all data. In addition, we acquired an optimised CISS sequence for labyrinthography (TR = 8.56 ms, TE = 3.91 ms, flip angle = 50 deg., FoV = 150 mm, slice thickness= 0.50 mm, right-left phase encoding direction, sequential multi-slice mode, tune up shimming, activation of head coil elements 5-7 and the cranial neck coil elements only, bandwidth 460Hz/pixel, the average of two repetitions was built, total time of acquisition 13:40 minutes) for achieving the best possible contrast and resolution for inner ear structures.
Head motion was controlled by a dedicated head fixation device ( R Pearltec Crania Adult, Schlieren, Switzerland). All MRI image data underwent stringent quality assessment before inclusion in the study, using a state-of-the-art MRI quality control toolkit (MRIQC) [44]. As pre-processing, we performed a correction for MRI field inhomogeneity using the N4 bias correction algorithm [45].
Further, we performed a rigid intra-subject registration of T1 and CISS volumes to the T2 volume, before proceeding to template building. Intra-subject registration was performed using Advanced Normalization Tools (ANTs), with normalized mutual information for multi-modal alignment [19].
Total intracranial volume (TIV) as a potential covariate for head size was 5 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019.

Template building
Template building was performed in three steps: 1) full-brain template building and annotation, 2) axial re-orientation of all subjects to the Reid's axial plane, and 3) final inner ear template building. The template building method utilized in steps 1 and 3 is a state-of-the-art algorithm provided by the ANTs toolkit [19]. It is designed for computation of a geodesic mean template that optimally represents the average anatomic shape from a set of input volumes.
Critically, all ANTs components are capable of multi-variate image registration, allowing us to fully leverage the multi-sequence appearance of the inner ear and its surrounding anatomy in T1, T2 and CISS MRI during template reconstruction. The high accuracy and robustness of ANTs registration and template normalization methods have been validated by demonstrating state-of-the-art performance in several internationally recognized medical image processing challenges [17,46].
In the first step, we computed a full-brain template for our cohort using T1and T2-weighted MRI sequences. For the full-brain template, CISS volumes had to be omitted since they were acquired only within a narrow axial field-ofview (FOV) centred around the inner ear region. In the full-brain template, we manually annotated four landmarks for identification of the Reid's axial plane, i.e. the locations of the infraorbital point and external auditory meatus in the left and right hemisphere [47]. We further placed two landmarks at the locations of the left and right inner ear, for a consistent localization of the inner ears across all subjects. This procedure is illustrated in Fig. 1.
In the second step, we re-oriented source volumes to Reid's standard plane.
This was a necessary pre-processing step for the inner ear template building 6 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review)  Figure 1: Full-brain template building and annotation, for re-orientation of the axial plane to Reid's standard plane, and for localization of left/right inner ear region-of-interests (ROI).
(Print in colour.) (step 3), which took crops around the inner ear's region-of-interest (ROI) as input. Pre-aligning these crops resulted in less out-of-border interpolation, i.e.
less black margins and a better registration result at the ROI borders during template building. A side benefit of this step was that the lateral semicircular canal became approximately aligned with the axial plane, given the common assumption of coplanarity of the lateral semicircular canals with Reid's standard plane [48,49,50]. The landmarks for Reid's plane and inner ear FOV centres were transformed into the volumes of all input subjects beforehand, using a full-brain registration from template to individual spaces, followed by visual inspection for quality control. We then computed Reid's plane parameters using least-squares fitting to the four landmarks, and re-oriented the axial plane in all volumes accordingly.
In the third step, we used the inner ear landmarks identified in step 2 to place region-of-interest (ROI) bounding boxes at the left and right inner ear. At these locations, the T2, CISS and T1 volumes were cropped (4×4×3 cm) and resampled (0.2 mm isotropic, bi-cubic B-spline interpolation) with a ROI bounding 7 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019.  box that included all relevant inner ear structures as well as the vestibulocochlear nerve. As performed during pre-processing on the full brain volumes, we linearly registered the CISS and T1 crops to the T2 crop, to ensure a perfect alignment of fine structures of the inner ear across all MRI contrasts. Cropped volumes were centred and all left inner ears were mirrored with respect to the sagittal plane, to match the orientation of the right inner ear. This latter step ensured that the obtained template is unbiased with respect to afference laterality (sidedness) of the inner ear structures. The above steps resulted in 126 localized, multi-variate MRI volumes, oriented to Reid's plane and hence roughly oriented with the lateral semicircular canal in the axial plane. These volumes served as input to the multi-variate template building algorithm, which computed three inner ear templates at an isotropic target resolution of 0.2 mm.
The procedure and resulting template are illustrated in Fig. 2.

Atlas annotation
The multi-sequence template was segmented into anatomic regions in order to obtain an atlas of the inner ear. Segmentations were obtained in parts by transferring fine-grained surface meshes of the semicircular canals, vestibule 8 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint and cochlea from Ariadne, a publicly available atlas and segmentation toolkit of the inner ear derived from ex-vivo micro-CT data [20]. From this toolbox, we leveraged micro-meter accurate surface meshes as delineations of inner ear structures, by registering them to our multi-variate MRI template.
Due to the lack of an accompanying micro-CT intensity volume in the Ariadne toolkit, we performed a volumetric registration and geometric transfer through alignment of the entire inner ear's outer surface. For our T2 template volume, we computed the outer iso-surface of the inner ear at an optimal intensity threshold obtained by Otsu's method [51], followed by manual removal of background structures that were obtained during thresholding. For the micro-CT data, we utilized the bony surface mesh provided in the Ariadne toolbox.
Next, we voxelized both the T2 surface mesh and the micro-CT surface mesh of the inner ear at an isotropic resolution of 0.1 mm, and computed signed Euclidean distance transforms (SDT) for both meshes [52]. These SDT representations served as surrogate image volumes during deformable image registration, yielding a non-linear deformation field from Ariadne atlas space into our template space. All micro-CT surfaces including semicircular canals, vestibule, and cochlear ducts (i.e. scala vestibuli, scala tympani and cochlear duct) were then transferred onto our template volume and voxelized at the resolution of 0.2 mm. The resulting labelmap was manually post-processed, based on visual inspection, to ensure alignment of labels with the three intensity channels T2, T1 and CISS.
The cochlea regions were pre-segmented by intensity-based thresholding of the CISS template. The threshold was set to obtain a clear separation of the scala tympani and vestibuli. Both regions were refined manually using basic image processing operations such as connected components, and surface smoothing. The cochlear duct was transformed from the David micro-CT atlas [20] using the computed deformation fields.
The procedures described above were realized using various tools in the open-source software toolkits ANTs (deformable surface registration) [19], 3D Slicer (segment editor effects: threshold, connected components, paint, scis-9 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint sors, margin) [53] and SimpleITK (Euclidean SDT computation with Signed-DanielssonDistanceMapImageFilter) [54].
Using landmark annotation in 3D Slicer [53], we further annotated multiple anatomic landmarks of the inner ear structures described in related literature.
For the sake of reproducibility and annotation consistency, we placed all landmarks on the Otsu-threshold iso-surface of the T2 template volume. The landmark locations on the T2 iso-surface are visualized in detail in Fig. 3. Following guidelines and 3D visualizations in [33], we annotated the three semicircular canals regarding width and height, as well as inner widths and heights measured at three positions ( Figure 3, panels A and B). We also followed instructions in [33] to compute the height and three width measures of the common crus. The height and width of the vestibule were described in [55]. The total inner ear length was described in [56] (Figure 3, panel C). The placement of landmarks for measurement of cochlea width, length and height was described in [57] (Figure 3, panel D). While cochlear height here was defined on an oblique 3D cut through the volume, earlier approaches defined cochlear height on the coronal plane [58,59], which we also measured for comparison.

Statistical methods
First-order group statistics on inner ear dimensions and volumes were computed in terms of mean and standard deviation. Normality of variables throughout this work was tested using a Kolmogorov-Smirnov (KS) test with Lilliefors correction, at an alpha level of p < 0.05 [60]. Comparisons of the semicircular canal dimensions between the left and right inner ear sides were performed using multiple paired statistical tests (t-test for normal distributed data, Mann-Whitney U test for non-normal distributions). We corrected p-values after multiple testing using the Benjamini-Hochberg False Discovery Rate (FDR) algorithm [61], results were considered as significant at a post-correction alpha level of p < 0.05. For the main general inner ear measures (total inner ear volume, total inner ear length, oblique cochlea height, and radii of anterior/posterior/lateral SCCs), we investigate a potential correlation with overall head size,

10
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. was tagged and measured using both the coronal plane approach (gold arrow) [58,59], and the oblique plane (i.e. 3D) approach (grey arrow) [57]. H = height, W = width, L = length.

11
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint represented by TIV. Correlation is reported as Pearson's correlation coefficient r, again at a significance level of p < 0.05. After clarification of the role of TIV, we tested gender effects, i.e. whether there was a significant difference between male and female cohort subgroups. These tests were performed on the two major inner ear measures, total inner ear volume, total inner ear length. To account for head size, we performed an Analysis of Covariance (ANCOVA), with TIV as the relevant covariate [62]. Statistical computation was performed using python implementations in scipy.stats [63], Pingouin [64] and Statsmodels [65], as well as MATLAB (The Mathworks Inc., MA, USA). Where available and applicable, we provide comparable values from literature that we are aware of, by referencing reported mean and standard deviation values.

Inner ear template and atlas
The reconstruction of the T2-and T1-weighted MRI volumes into unbiased full-brain templates was successful. Using this template, the localization of both inner ears and the re-orientation of the axial slice to Reid's standard plane could be executed for all subjects and inner ears without manual intervention.
Subsequently, the reconstruction of the T2-, T1-weighted and CISS inner ear template volumes at 0.2 mm isotropic resolution was performed successfully using the ANTs toolkit.
Volume renderings can be seen in Fig. 4. The rendering transfer function was set as a linear ramp between the minimum and maximum voxel intensity for each template, no manual adjustments other than a shift of the centre intensity (T1: 1.2; T2: 1.5; CISS: 2.3) was necessary to achieve the visibility of inner ear structures as displayed. As noticeable from the rendering, the inner structures of semicircular canals, vestibule and cochlea are clearly visible and are morphologically correct. The volume rendering does not suffer from noise artefacts typical in single-subject volumes, which we attribute to the fact that the multi-variate templates on average have a 1.7 times higher signal-to-noise 12 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review)  Figure 4), and fine-grained structures like the separation of cochlear ducts become visible in templates that are sometimes not even clearly discernible in individual volumes (cf. Figure 4, panel E).
ratio (SNR) than the mean-SNR in single-subject volumes (see Table 1). SNR is approximated as the mean voxel intensity divided by the standard deviation of voxel intensities inside the inner ear ROI [66].

Cohort measurements and atlas validation
The characterization of the three semicircular canals (see Table 2) was compared to previously reported results using micro-CT in ex-vivo specimen [33].
In general, our atlas produces larger values than reported in literature, however, the error margin never exceeds the original image resolution of T1-and T2-weighted MRI (0.75 mm). Overall, the sub-voxel accurate distance measures demonstrate that the computed template has morphometric properties that are in line with previously ex-vivo micro-CT studies. The same applies to landmarks of the common crus (see Table 3) and the cochlea (see Table 4), which all corresponded to values in literature within a margin of less than 0.5 mm.
Inner ear volume measurements are also largely in line with related literature (see Table 5

14
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint a hemisphere of 2 mm diameter (i.e. 2.1 mm 3 ; ours: 2.6 mm 3 ), utricle as an irregular oval averaging 1.35 mm in diameter and 5.5 mm in length (i.e. 7.9 mm 3 ; ours: 8.8 mm 3 ), and the cochlea as a cone with 8 mm diameter and 5 mm height (i.e. 83.8 mm 3 ; ours: 94.4 mm 3 ). Especially the larger volume measurements of the cochlea and the total inner ear region indicate an increased estimate of the inner ear region by our T2 iso-surface, by 11% and 12%, respectively.

Cohort analysis and statistics
As detailed in Table 2, paired univariate testing with multiple testing correction yielded no significant differences regarding sidedness in any of the semicircular canals measures (height, width, radius, as well as internal height and width measured at three distinct points from the distal to the common crus).
We further investigated the major inner ear dimensions, i.e. total inner ear length, total inner ear volume, oblique cochlea height, and radii of the three semicircular canals. First, we found all these values to be normally distributed (KS-test with Lilliefors correction, p > 0.05). Second, we investigated a possible correlation of selected inner ear measures with brain size measured by TIV. We found the total inner ear volume and length, as well as the radii of the anterior and posterior semicircular canals (SCC) to be moderately and significantly correlated with brain size, while the oblique cochlea height and the radius of the lateral SCC were not significantly correlated (cf. Figure 5). Third, we investigated a potential difference between male and female subjects in our cohort regarding the main and most robust inner ear measures, i.e. the total inner ear length and total inner ear volume. Since TIV was found to be a confounding variable for these measures, we performed an ANCOVA, correcting for TIV as a covariate. Our ANCOVA tests yielded no significant differences for the investigated inner ear measures (total inner ear length: p = 0.182; total inner ear volume: p = 0.088). In other words, after accounting for overall brain size, gender difference cannot explain residual differences in measured inner ear characteristics.

15
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019.

16
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review)

17
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review)

18
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint

Discussion
In this work, we presented a human inner ear atlas derived from multisequence magnetic resonance imaging in-vivo and non-invasively. To the best of our knowledge, this is the first time a multi-sequence MRI template-atlascombination of the human labyrinth covering vestibular and auditory structures has been proposed, and the first time its has been constructed from a sizable cohort (>50 subjects). The segmentation and annotation of templates in our work benefitted predominantly from the SNR of the source whole-head T2 images in combination with the high spatial resolution of the CISS sequence. The data analyses resulted in an inner ear atlas uniquely depicting the vestibule (including utricle and saccule), the three semicircular canals and their ampullae, and the cochlea with its three ducts (scala vestibuli, scala tympani and cochlear duct) as well as the cochlear cupula. The TIV was found to be a relevant covariate for measurements of the labyrinth and its substructures. After controlling for TIV as a covariate, no gender or laterality effects were found.
Analogous to the volumetric measures, our reported landmark distances were very much in line with the published literature. We were able to reproduce these landmarks and the distances in-between mainly with the help of detailed illustrations of landmark locations with high-quality 3D visualizations in several works [33,57], which we followed closely in our work, and which are presented in a similar fashion for comparison purposes in Fig. 3. Earlier descriptions of landmark placement were often based on 2D volume planes, which can be more difficult to understand and reproduce. For example, the cochlear height was previously defined either in the coronal image plane [59,58], or using an oblique volume cut through the centre of the cochlear basal turn and the cochlear apex [57]. While the former depends on the orientation of the inner ear anatomy with respect to the coronal plane, the latter is uniquely defined in 3D space.
Consequently, our measurements of the former has a larger ∆Mean from measurements reported in either [59] or [58], while the measurement of the latter is almost identical to the ones in [57]. A further improvement without going to 19 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint higher MRI field strengths is difficult to achieve without a significantly higher voxel resolution in the source material. In any case, an exact replication of reported values in literature is not relevant at this point because i) we relied on a young healthy cohort, while several other works we compared to included pathologies [59,58,70] ii) we relied on a single-rater placement of landmarks, and neither we nor related works considered intra-or inter-rater variability on these localizations, and iii) it is not clear yet whether and to what degree distance measures in MRI can be identical to micro-CT.
Overall, no other publicly available template or atlas of the inner ear has assembled and published a comprehensive list of inner ear landmarks for distance measurements. It should be noted that the literature-oriented landmark annotation in our work was time-consuming and non-trivial. By publicly providing these landmarks along with our templates and segmentations, we hope to provide a reproducible and consistent basis for such measurements in future studies as well as to pave the road for population-based standard values. Recently, another inner ear MRI template was presented, albeit including the application of Gadolinium contrast agent and with a less extensive quantitative assessment of template morphology, without investigation of potential confounds, and with separate (i.e. biased) templates for the left and right ear [71]. In comparison, we created an unbiased template and atlas, with detailed labels for sub-structures of the semicircular duct system, vestibule and cochlea. These were obtained through manual labelling and registration of an atlas from micro-CT [20]. We validated the template and atlas by performing a morphometric analysis of the semicircular canals, vestibule and cochlea, and by comparing anatomical measurements to established measures from literature.
It is important to note that the multi-sequence intensity templates provide a lot of flexibility for usage of this atlas. Future studies based on T1-, T2weighted data and CISS-sequences can directly utilize the provided templates through intra-modal atlas registration (in a single-or multi-variate approach).
Other structural MRI sequences (with or without contrast), MRI paradigms (e.g. diffusion-weighted MRI) or even other modalities (e.g. CT, micro-CT) 20 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint can still register and utilize the atlas through multi-modal image similarity functions (e.g. normalized mutual information).
Our atlas deviates slightly from the related literature, in particular regarding volumetric measures. For example, the overall inner ear volume appears to be marginally over-estimated. This may be due to partial volume effects and signal blurring after template building. Other volumetric measures like the ampullae look to be slightly under-estimated compared to the literature. This is likely due to simplified geometric assumptions for anatomic shapes made by the authors in the previous work [67], e.g. a sphere for the ampullae. These simplifications were probably helpful in this pioneering study, which approximated 3D volumes through 2D cross-cuts but should be dropped in the long run at higher resolutions to reach anatomical ground truth. Materials like our 3D templates and surface meshes should allow for more accurate volumetric and thus more realistic measurements. Overall, it is remarkable in our opinion that the outer surface of the inner ear in the T2-weighted template (i.e. the Otsu-threshold iso-surface) seems to very accurately represent the outer surface of bony structures as measured by micro-CT, a result that was not expected prior to the conductance of this study. This result in return then also justifies the surface-based co-registration of the bony surface mesh from Ariadne to our T2 iso-surface, in order to transfer internal surface meshes into our template.
Implications. Our atlas can be applied for localization, segmentation and subparcellations of the entire human inner ear for clinical and scientific questions.
With co-registration to the template, individual differences can be detected (e.g. of the anatomical structures itself or with respect to deviations from Reid's plane). The atlas may also be helpful for surgical planning of implanting the electrodes of cochlear or vestibular implants. Even though malformations were not used to build our atlas, image-based registration is often robust enough to compensate for a wide range of deformations, such that we expect the material to be useful for in-vivo detection and comparison of morphological abnormalities of the inner ear structures as well.

21
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org /10.1101/19011908 doi: medRxiv preprint Limitations. Despite the high isotropic voxel resolution of 0.2 mm in our final template, our results still do not resolve every fine detail of the inner ear, especially not of the auditory aspects. We were able to visualize the three major parts of the cochlea (the scala vestibuli, the scala tympani and the cochlear duct), but the results were still insufficient for e.g. a visualization of the organ of corti that lies inside the cochlear duct and houses the hair cells. The same holds true for the structure inside the ampullae: the cupulae. These structures are not ossified and it is far more complex to detect them with 3T MRI contrasts due to the tissue type, achievable isotropic resolution, and mitigating head motion during data acquisition. In this context, it seems feasible and realistic to achieve improvements to the resolution of a high-resolution T2-SPACE with coverage of the temporal bone alone and the CISS sequence in the target range of 0.4mm. It should further be noted as a limitation that the cohort's mean age was 26 +/-2.3 years, i.e. after development of a fully formed adult human skull. Changes due to age cannot be addressed within our cohort. However, we would not expect a change in bony structures in elderly patients with hearing or vestibular problems. Typically, a degeneration of soft tissue like hair cells in the organ of corti or the cochlear nerve lead to age related hearing loss. Further inner ear malformations reported previously [70] are not represented by our atlas, since we only investigated MRI images of normal subjects. Nonetheless, as noted, malformations may well be detectable as significant structural deviations, through co-registration and overlay on our template on a subject-basis.
Future Work. There are several ways in which the proposed inner ear atlas can be extended. First, the atlas can be augmented with additionally labelled regions nearby. The auditory and the two vestibular nerves appear as separable structures in the templates and are hence of particular interest, but were out of scope for this inaugural study. Once these nerves are included, the facial nerve can as well be segmented as it runs in close proximity of the vestibule.
Similar to the inner ear micro-CT atlas [20] used in this work, other micro-CT atlases as in [29] can help in identifying and accurately delineating nerves, vas-

22
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint culature and bones in our template space. For nerves, the T1-weighted template could become more relevant than in this study. In T1, nerve structures are not only visible (compared to the rest of the inner ear structures), but they have a complementary, hyper-intense appearance, while appearing hypo-intense in the T2 and CISS templates (see Figure 4, panel C). Adding the angles of the semicircular canals towards each other on a single-subject basis and analysing the coplanarity of functional pairs [72], together with the orientation of the eye muscles can further increase the clinical relevance of our IE-Map atlas. Another next step is to develop an automatic segmentation method for prospective validation and application of the templates and atlas. In initial registration experiments, we have verified that segmentation of new subjects is possible using pair-wise single-and multi-variate image registration methods, again provided by ANTs [19]. A fully automatic segmentation method would require the toolchaining of left and right inner ear localization, template displacement, and local deformation to the inner ear structures of test subjects. The implementation and validation of such a toolkit is also left for future work. A further exciting avenue for improvement stems from the multi-modal fusion of our invivo template with micro-CT volumes from multiple subjects. In-vivo MRI may never resolve a similarly fine-grained structural information as micro-CT, but we have demonstrated the feasibility of geometric fusion through multi-variate image registration and surface alignment. Eventually, this could allow for the construction of probabilistic label maps in micro-meter resolution, within the unbiased geometry of our template space.

Conclusion
In this work, we presented a comprehensive human inner ear atlas and three unbiased templates. Data were acquired in-vivo in a completely non-invasive approach from the largest dedicated cohort for vestibular substructures to date.
Semimanual segmentation allowed for successful quantitative assessment of all relevant vestibular and auditory substructures of the inner ear and landmark-23 . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint based annotation. Total intracranial volume was found to be a confounding covariate with inner ear measurements. We found no differences in inner ear measurements with respect to gender or ear. Due to the absence of laterality effects, a single unified multi-variate template and atlas was published. It is our hope and aim that template and atlas can find application in numerous domains and disciplines studying neurootological health and disease. Potential applications include inner ear localization, segmentation and sub-parcellation, to quantitatively assess peripheral vestibular and auditory anatomy in group comparisons as well as over the course of longitudinal studies. By providing the templates, atlas segmentation, surface meshes and landmark annotations freely and publicly, we hope to provide clinicians and researchers in neurology, neuroscience and otorhinolaryngology with a new tool and robust basis as a starting point for computational neuro-otology grounded in neuroimaging.

Data availability
The atlas material in this repository can be downloaded and used for noncommercial purposes from our public github repository under the URL: https://github.com/pydsgz/IEMap.

24
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint

26
. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint . CC-BY 4.0 International license It is made available under a is the author/funder, who has granted medRxiv a license to display the preprint in perpetuity. not certified by peer review) (which was The copyright holder for this preprint this version posted December 5, 2019. . https://doi.org/10.1101/19011908 doi: medRxiv preprint