Main

Pioneering studies in invertebrates have established that wiring diagrams of complete neuronal circuits at synaptic resolution are valuable tools for relating nervous system structure and function3,4,5,6,7. These studies benefited from their model organisms’ small sizes and stereotypy, which enabled complete ssEM of an entire specimen or the assembly of mosaics from multiple individuals.

Vertebrate nervous systems, however, are considerably larger. Consequently, ssEM of whole vertebrate circuits requires rapid computer-based technologies for acquiring, storing, and analysing many images. Because vertebrate nervous systems can vary substantially between individuals8, anatomical data often must be combined with other experiments on the same animal9,10,11 to define relationships between structure, function, and behaviour. For mammalian brains, this analysis requires the imaging of very large volumes that are still technically out of reach (but see ref. 12), thus confining studies to partial circuit reconstructions13,14,15,16,17,18,19. One strategy for capturing brain-wide circuits is to generate high-resolution whole-brain datasets in smaller vertebrates.

The larval zebrafish is an ideal system for this endeavour. It is nearly transparent, offering convenient optical access that permits whole-brain calcium imaging20. Additionally, its small size is well suited for ssEM, having already enabled studies of specific brain subregions21,22. Integrated with established genetic toolkits and quantitative behavioural assays21, it is an excellent model organism in which to investigate the neuronal basis of behaviour23.

Our goal was to develop a framework for ssEM of complete larval zebrafish brains at 5–7 days post-fertilization (dpf), when complex behaviours such as prey capture24 and predator avoidance25 emerge. To preserve the ultrastructure across the brain, we developed dissection techniques to remove skin and membranes from the dorsum that resulted in high-quality fixation and staining (Extended Data Fig. 1). Sectioning perpendicular to most axon and dendrite paths is preferable for ease and reliability in reconstructing neuronal morphology. Therefore, we oriented our cutting plane orthogonal to the long (anterior–posterior) axis, despite this requiring around 2.5 times more sections than the horizontal orientation. We improved sectioning consistency by embedding samples surrounded by support tissue from mouse cerebral cortex, yielding a section library that could be imaged multiple times at different resolutions26,27 (Extended Data Fig. 2).

Overview images were acquired to survey all sections (Extended Data Figs 3, 4; Supplementary Videos 1, 2), resulting in a 1.02 × 1010 μm3 image volume with 3.01 × 1011 voxels and occupying 310 gigabytes. In total, 17,963 sections, each around 60 nm thick, were collected from 18,207 attempted, leaving 244 (1.34%) lost, 283 (1.55%) containing partial tissue regions (Extended Data Fig. 5), no adjacent losses, and five (0.03%) adjacent lost–partial or partial–partial events. These low-resolution data confirmed that our approach enabled stable sectioning through a millimetre-long region spanning from myotome 7 to the anterior-most structures—encompassing some spinal cord and the entire brain.

We next selected subregions to capture areas of interest at higher resolutions, first performing isotropic imaging over the anterior-most 16,000 sections (Fig. 1a–f; Supplementary Video 3). All cells are labelled in ssEM, so these data offer a dense picture of the fine anatomy across the anterior quarter of the larval zebrafish, including the brain, sensory organs, and other tissues. Furthermore, its 56.4 × 56.4 × 60 nm3 per voxel resolution is about 500 times greater than that afforded by diffraction-limited light microscopy. The resulting 2.28 × 108 μm3 volume consisted of 1.12 × 1012 voxels and occupied 2.4 terabytes. In these data, one can reliably identify cell nuclei and track large-calibre myelinated axons (Fig. 1e, f; Supplementary Video 4). To resolve its tightly packed structures, 18.8 × 18.8 × 60 nm3 per voxel imaging of the brain was performed over 12,546 sections (Fig. 1g, h). The resulting 5.49 × 107 μm3 volume consisted of 2.36 × 1012 voxels and occupied 4.9 terabytes. Additional 4.0 × 4.0 × 60 nm3 per voxel acquisition was used for inspecting regions of interest, resolving finer axons and dendrites, and identifying synapses between neurons (Fig. 1i–k). Image co-registration across sections and scales then formed a coherent multi-resolution dataset (Extended Data Fig. 6).

Figure 1: Targeted, multi-scale ssEM of a larval zebrafish brain.
figure 1

a, The anterior quarter of a 5.5 dpf larval zebrafish was captured at 56.4 × 56.4 × 60 nm3 per voxel resolution from 16,000 sections. b, The Mauthner cell (M), axon cap (AC), and axon (Ax) illustrate features visible in the 56.4 × 56.4 × 60 nm3 per voxel image volume. c, Posterior Mauthner axon extension. d, Targeted re-acquisition of brain tissue at 18.8 × 18.8 × 60 nm3 per voxel (dashed) from 12,546 sections was completed after 56.4 × 56.4 × 60 nm3 per voxel full cross-sections (solid). e, f, Peripheral myelinated axons (arrowheads) recognized from 56.4 × 56.4 × 60 nm3 per voxel imaging of nerves (e) and the ear (f). g, h, Neuronal processes including myelinated fibres can be segmented at 18.8 × 18.8 × 60 nm3 per voxel resolution. ik, Targeted re-imaging to distinguish finer neuronal structures and their connections. Scale box: a, 50 × 50 × 50 μm3. Scale bars: b, c, 10 μm; d, 50 μm; e, f, 5 μm; g, h, 1 μm; ik, 500 nm.

PowerPoint slide

With a framework in place for whole-brain ssEM, we tested our ability to identify the same neurons or regions across imaging modalities9,10,11 at this scale (Extended Data Figs 7, 8). Using common structural features, we matched nuclei in ssEM data to their locations in two-photon calcium imaging data from the same animal (Supplementary Video 5). Reference atlases containing molecular labels were similarly co-registered. These results serve as proof-of-principle for the integration of rich activity maps with subsequent whole-brain structural examination of functionally characterized neurons and their networks.

We next tested the general applicability of this dataset for neuron reconstruction. First, we reconstructed a peripheral lateral line afferent neuron that innervated a dorsal neuromast sensory organ (Fig. 2a–e; Supplementary Video 6). By re-imaging at 4.0 × 4.0 × 60 nm3 per voxel, we identified synapses that connected this afferent with neuromast hair cells. We then annotated a myelinated spinal motor neuron that directly contacted muscle (Fig. 2g). Myelinated axons could also be identified and tracked within the brain. These reconstructions highlight the utility of multi-resolution ssEM for reassembling neuron morphologies from sensory inputs, throughout the brain, and to peripheral innervation of muscle.

Figure 2: Neuron reconstructions capturing sensory input and motor output.
figure 2

a, Bipolar lateral line afferent neuron tracked from a neuromast (bd) through its ganglion (e) into the hindbrain over approximately 5,000 serial sections. b, Dorsal neuromast innervated by the afferent. c, Ribbon synapse connecting the afferent and a hair cell. d, The afferent exiting the neuromast and becoming myelinated. e, Myelinated perikarya evident in the posterior lateral line ganglion. f, Volume rendering depicting reconstructions in this figure. g, Caudal primary motor neuron leaving the spinal cord and innervating myotome 6. Scale bars: a, e, g, 10 μm; bd, 1 μm; f, 100 μm.

PowerPoint slide

To extend our analysis, we produced a ‘projectome’ reconstruction consisting of all myelinated axons (Fig. 3a; Supplementary Video 7). We reconstructed 2,589 myelinated axon segments along with many attached somata and dendrites to yield 39.9 cm of combined length. Of these, 834 myelinated axons comprising 30.6 cm were easily followed to their somata, whereas unmyelinated stretches made it difficult to reach somata for the remaining 9.3 cm. The longest reconstruction, of a trigeminal sensory afferent, was 1.2 mm and extended from anterior skin sensory terminals to the hindbrain.

Figure 3: Reconstruction of a larval zebrafish projectome.
figure 3

a, Myelinated axon reconstructions from top (upper) and side (lower) views. Colours assigned randomly. b, Lateral line afferent reconstructions. Afferents innervating identified neuromasts are labelled anterior (purple, darker more anterior), whereas posterior lateral line nerve members are labelled posterior (yellow). c, Reticulospinal neuron reconstructions, including the Mauthner cells and the identified nucleus of the medial longitudinal fasciculus (nucMLF) neurons MeLc (green), MeLr (yellow), MeLm (orange), and MeM (blue). Note bilateral symmetry apparent in b and c. Scale bars: ac, 100 μm.

PowerPoint slide

The resulting projectome included 94 lateral line afferents that innervated 41 neuromasts (Fig. 3b). These reconstructions revealed striking bilateral symmetry in the lateral line system (Supplementary Video 8). Only one neuromast and its afferents lacked contralateral counterparts. This may be an important anatomical feature that facilitates comparisons of local velocity vector fields for detecting differential flow along the left and right sides, which is essential for rheotaxis in larval zebrafish28.

Also included was a substantial fraction of midbrain and hindbrain reticulospinal neurons, which send axons to the spinal cord (Figs 3c, 4a). Similar to lateral line neurons, these appeared bilaterally symmetrical (Supplementary Video 9). However, our ability to identify reticulospinal neurons by their known positions and morphologies29 afforded us the opportunity to precisely examine the extent of their symmetry. We selected 22 identified left–right reticulospinal neuron pairs (44 total neurons) whose myelinated axons form the medial longitudinal fasciculus (MLF) to quantify the degree of bilateral symmetry (Fig. 4a–d). Developing a cost metric allowed us to investigate whether myelinated MLF axons of one hemisphere were symmetrical in three-dimensional shape and position to axons of their contralateral homologues (Extended Data Fig. 9a–c). Notably, globally optimal pairwise assignment based on computed costs matched left–right homologues in all but one pair (Fig. 4b; Extended Data Fig. 9d).

Figure 4: Bilateral symmetry in myelinated reticulospinal axon reconstructions.
figure 4

ad, Analysis of symmetry in 3D position and shape for 22 identified left–right neuron pairs with axons in the MLF. a, Plane of symmetry fit from reticulospinal reconstructions, which were identified by morphology and overlap with the Z-Brain reference atlas spinal backfill label. b, Costs computed from comparisons of each axon with every reflected contralateral axon. Globally optimal pairwise assignment matched left–right homologues (asterisks) for all but one pair (red). Low off-diagonal costs highlight similarities across neuron types. c, d, Highest (c) and lowest (d) cost comparisons. ek, Analysis of symmetry in 2D neighbour relations for a subset of six left–right pairs. e, Apparent mirror-symmetrical relative positioning across the midline. f, Angle and distance differences from one slice, with every vector between two left axons compared to the reflected vector between the right axons with the same identities. g, Examined myelinated axon subset. h, i, Mirror-symmetrical relative positional arrangements over long MLF stretches evident by concatenating linearized angle (h) and distance (i) differences. Neighbour relations return to a symmetrical state despite local perturbations. Black indicates insufficient data. j, Summing normalized differences uncovered regions with perturbations (peaks), such as where axons entered the MLF. Artificially swapping two left axon identities nearly doubled this sum. k, Axon sets with weaker neighbour relations exhibit greater variance in angle and distance difference across slices. Scale bars: a, 100 μm; c, d, g, 50 μm; e, 5 μm.

PowerPoint slide

We also noticed that axons appeared to occupy similar domains within the left and right tracts (Fig. 4e), leading us to investigate possible symmetry in neighbour relations. We selected a subset of six left–right pairs and analysed their spatial relationships on every slice by comparing the vector between each set of two left axons to the reflected vector between the right axons with the same identities (Extended Data Fig. 9e–h). From all 15 pairwise combinations, we observed that these positional arrangements within the MLF were mirror-symmetrical over long stretches (Fig. 4f–j; Supplementary Video 10). Moreover, the neighbour relations returned to a symmetrical state away from local perturbations (for example, a new axon entering the MLF). Similar relationships were seen in the larger set of 22 reticulospinal axon pairs (Extended Data Fig. 9i–k).

Although these axons originated from stereotyped locations29, we expected that the MLF would become progressively more scrambled in the same manner as peripheral mammalian motor nerves, which show no mirror symmetry8. If this were true, MLF configurations should become less symmetrical as they travel further posterior. The fact that scrambling does not occur suggests that axon bundles preserve positional information along their length, an idea with precedent in ribbon-shaped optic nerves of certain fish species but thought to be a special feature of this structure30. Our results instead indicate that stereotyped neighbour relations may be a general feature of central nerve tracts.

However, we can only speculate about the purpose of maintaining this positional information. We cannot say whether symmetrical axon trajectories are accompanied by symmetrical connections. If so, symmetry might assure that axons find appropriate postsynaptic targets. Importantly, not all MLF axons exhibited strong positional stereotypy (Fig. 4k; Extended Data Fig. 9i–k). Perhaps the central nerve tracts of larval zebrafish contain both axons that rely on intrinsically stereotyped positions to innervate specific targets and others that do not. Developing axons could rely on fasciculation with existing axons that previously pioneered the pathway. Alternatively, positional information acting on a growth cone could maximize the likelihood of reaching the intended postsynaptic partners. Whatever its purpose and mechanism, the evident stereotypy indicates that some neurons are identifiable from their axon’s precise locations within nerve tracts.

Here, we demonstrate the feasibility of whole-brain ssEM for neuron reconstruction in larval zebrafish and illustrate the utility of re-imaging at multiple scales for reducing imaging time and data storage requirements. Finally, the presented dataset is not limited to nervous system analyses. It also contains other organ systems, including musculoskeletal, cardiac, intestinal, and pancreatic tissues (Extended Data Fig. 10), thus serving as an open-access resource that is available for the scientific community (hosted by NeuroData at http://neurodata.io/data/hildebrand16 and http://zebrafish.link/hildebrand16).

Methods

No statistical methods were used to predetermine sample size. The experiments were not randomized and the investigators were not blinded to allocation during experiments and outcome assessment.

Animal care

Adult zebrafish (Danio rerio) for breeding were maintained at 28 °C on a 14 h:10 h light:dark cycle following standard methods31. The Tg(elavl3:GCaMP5G)a4598 transgenic line32 used in this study was of genotype elavl3:GCaMP5G+/+; nacre (mitfa–/–), conveying nearly pan-neuronal expression of the calcium indicator GCaMP5G33 and increased transparency due to the nacre mutation34. The larval zebrafish samples described in this study were raised in filtered fish facility water31 until 5–7 dpf.

Mice from which support tissue was collected had been previously killed for other experiments. Only unused, to-be-discarded tissue was collected to serve as support tissue.

The Standing Committee on the Use of Animals in Research and Training of Harvard University approved all animal experiments.

Two-photon laser-scanning microscopy

Larval zebrafish were immobilized by immersion in 1 mg ml–1 α-bungarotoxin (Invitrogen) and mounted dorsum-up in 2% low-melting-temperature agarose in a small dish containing a silicone base (Sylgard 184, Dow Corning). Upon agarose hardening, E3 solution (5 mM NaCl, 0.17 mM KCl, 0.33 mM CaCl2 and 0.33 mM MgSO4) was added to the dish. In vivo structural imaging of elavl3-driven GCaMP5G signal was conducted with a custom-built two-photon microscope equipped with a Ti:Sapphire laser (Mai Tai, Spectra-Physics) excitation source tuned to 800 nm. Frames with a 764.4 × 509.6 μm2 field of view size (1,200 × 800 pixels2) were acquired at 1-μm intervals (0.637 × 0.637 × 1 μm3 per voxel) at approximately 1 Hz with a scan pattern of four evenly spaced, interlaced passes35. A low-noise anatomical snapshot of brain fluorescence was captured in 300 planes, each the sum of 50 single frames. All light-based imaging was performed without any intentional stimulus presentation.

Dissection and tissue preparation

Initial attempts at high-quality larval zebrafish brain preservation were impeded by skin and membranes, which prevented sufficient fixation with whole-fish immersion alone (Extended Data Fig. 1a). To overcome this, skin and membranes covering the brain36 were dissected away.

Each larval zebrafish, which had been previously immobilized and embedded for two-photon laser-scanning microscopy, was introduced to a dissection solution (64 mM NaCl, 2.9 mM KCl, 10 mM HEPES, 10 mM glucose, 164 mM sucrose, 1.2 mM MgCl2, 2.1 mM CaCl2, pH 7.5; ref. 37) containing 0.02% (w/v) tricaine mesylate (MS-222, Sigma-Aldrich). Flow of red blood cells through the vasculature was confirmed before proceeding as an indicator of good health. A portion of agarose was removed to expose the dorsum from the posterior hindbrain to the anterior optic tectum. The dissection was initiated by puncturing the thin epithelial layer over the rhombencephalic ventricle above the hindbrain38 with a sharpened tungsten needle. Small incremental anterior-directed incisions were made along the midline as close to the surface as possible until the brain was exposed from the hindbrain entry site to the anterior optic tectum (Extended Data Fig. 1b). The majority of damage associated with this dissection was restricted to medial tectal proliferation zone progenitor cells39 that are unlikely to have integrated into functional neuronal circuits.

Dissections lasted 1–2 min, upon which time the complete dish was immersed in a 2.0% formaldehyde and 2.5% glutaraldehyde fixative solution (Electron Microscopy Sciences) overnight at room temperature (Extended Data Fig. 1d). Following washes, larval zebrafish were cut out from the dish in a block of agarose with a scalpel and moved to a round-bottomed microcentrifuge tube. Specimens were then incubated in post-fixation solution containing 1% osmium tetroxide and 1.5% potassium ferricyanide for 2 h (Extended Data Fig. 1e), washed with water, washed with 0.05 M maleate buffer (pH 5.15), and stained with 1% uranyl acetate in maleate buffer overnight (Extended Data Fig. 1f). During the subsequent wash step with maleate buffer, larval zebrafish were freed from the surrounding agarose block and moved to a new microcentrifuge tube. Next, specimens were washed with water, dehydrated with serial dilutions of acetonitrile in water (25%, 50%, 70%, 70%, 80%, 90%, 95%, 100%, 100%, 100%) for 10 min each, and infiltrated with serial dilutions of a diepoxyoctane-based low viscosity resin40 in acetonitrile (25%, 50%, 75%, 100%) for 1 h each. The samples were then embedded in diepoxyoctane-based resin with surrounding support tissue and hardened for 2–3 days at 60 °C (Extended Data Fig. 1g, h). Aqueous solutions were prepared with water passed through a purification system (typically Arium 611VF, Sartorius Stedim Biotech). This process resulted in high-quality ultrastructure preservation (Extended Data Fig. 1c, i).

Additional solution, washing, and timing details were described previously in a step-by-step protocol41.

Serial sectioning

Consistent ultrathin sectioning was difficult to achieve in larval zebrafish samples, which contain heterogeneous tissues, but imperative for reconstructing 3D structure from a series of 2D sections. Tests revealed that errors occurred primarily when the sample composition changed markedly (for example, borders between tissue and empty resin). We overcame this by embedding samples surrounded by a mouse cerebral cortex support tissue (Extended Data Figs 1h, 2f, 3a, 4a).

We preferred sectioning perpendicular to most axon and dendrite paths for ease and reliability in reconstructing neuronal morphology. For this reason, our cutting plane was oriented perpendicular to the long (anterior–posterior) axis, which required around 2.5× more sections than alternative orientations. This was made possible by customizing an automated tape-collecting ultramicrotome26,27 by extending the device’s main mounting plate and enlarging its reels (compare Extended Data Fig. 2a with Fig. 1e from ref. 26) to accommodate one long tape stretch capable of collecting all sections.

Sections were continuously cut with a diamond knife (Extended Data Fig. 2b, c) affixed to an ultramicrotome (EM UC6, Leica) and collected onto 8-mm-wide and 50–75-μm-thick tape (Kapton polyimide film, DuPont). Restarts were occasionally required for three reasons: fine-tuning of tape positioning or settings is necessary at the beginning of a run; the ultramicrotome design is constrained by a cutting depth range of about 200 μm; and diamond knives must be shifted after cutting several thousand sections to expose the sample to a fresh edge before dulling impairs sectioning quality. When necessary, restarts were completed as quickly as possible (typically 1–2 min) to minimize possible thermal, electrostatic, or other fluctuations. For the same reason, tape reels were fed continuously without ever being reloaded or exchanged. This combination of fast restarts and continuous tape feeding successfully maintained a steady state across restarts.

We sectioned two larval zebrafish specimens. These represent the only two samples we have attempted to cut since adopting the surrounding support tissue approach. The primary focus of this study was a 5.5 dpf larval zebrafish sectioned with a 45° ultra diamond knife (Diatome) and a nominal sectioning thickness that averaged 60 nm with a variable setting ranging from 50–70 nm depending on sectioning consistency. Restarts occurred after sections 276, 3,669, 6,967, 10,346, 12,523, 12,916, and 15,956. Knife shifts occurred after sections 6,967 and 12,916. After sectioning, the tape was cut into segments with a razor blade between collected sections and adhered with double-sided conductive carbon adhesive tape (Ted Pella) to 4-in-diameter silicon wafers (University Wafer), which served as an imaging substrate. A total of 17,963 sections, each approximately 60 nm thick, were spread across 80 wafers (Extended Data Figs 2d, e, 3).

One potential limitation of the 5.5 dpf larval zebrafish series is the section thickness. Minimizing section thickness is an important factor in the success of axon and dendrite reconstruction1. Small neuronal processes are difficult to reconstruct in thicker sections, especially when they are running roughly parallel to the plane of the section. To be sure that our approach was not fundamentally limited to thicker sections, we sectioned the second sample—a 7 dpf larval zebrafish—with a nominal sectioning thickness that remained constant at 50 nm throughout the entire cutting session using a 45° histo diamond knife (Diatome). Restarts occurred after sections 296, 312, 4,114, 8,233, and 12,333. Knife shifts occurred after sections 4,114 and 12,333. A total of 15,046 sections, each approximately 50 nm thick, were obtained from 15,052 attempted (Extended Data Fig. 4) and spread across 70 wafers. The thinner sections did not result in more lost material: this series contained 6 losses (0.04%; Extended Data Fig. 4d upper), 25 partial sections (0.17%; Extended Data Fig. 4d middle), no adjacent losses, and 6 adjacent lost–partial or partial–partial events (0.04%; Extended Data Fig. 4d lower).

The nominal section thickness of approximately 60 nm made it possible to span the entire 5.5 dpf larval zebrafish brain in about 18,000 sections, as determined by finding the location of the spinal cord–hindbrain boundary42. Although the 7 dpf sample was sectioned at 50 nm, it was not made the focus of subsequent imaging because it contained less of the brain. However, improved reliability for this sample despite a 16% reduction in nominal sectioning thickness suggests that yet higher axial resolution is attainable. A section thickness of ≤30 nm would increase confidence in the ability to reconstruct complete neuronal circuit connectivity, and cutting at such thicknesses is known to be possible for mammalian brain sections of comparable sizes27,43.

Wafers containing tape segments were made hydrophilic by brief glow discharging, post-section stained for 1–2 min inside a chamber containing sodium hydroxide pellets using a stabilized lead citrate solution (UltroStain II, Leica) filtered through a 0.2 μm syringe filter, and then washed thoroughly with water. A thin layer of carbon was then deposited onto each wafer to prevent charging during scanning electron microscopy.

Electron microscopy

WaferMapper software was used with light-based wafer overview images to semi-automatically map the positions of all sections and relate them to fiducial markers. This enabled targeted section overview acquisition (758.8 × 758.8 × 60 nm3 per voxel for 5.5 dpf; 741.5 × 741.5 × 50 nm3 per voxel for 7 dpf). Semi-automated alignment of section overviews in WaferMapper then permitted targeting for imaging at higher resolutions26.

Field emission scanning electron microscopy of back-scattered electrons was primarily conducted on a Zeiss Merlin equipped with a large-area imaging scan generator (Fibics) and stock detector. An accelerating voltage of 5.0 kV and beam current of 7–10 nA were used for most acquisition. Imaging of back-scattered electrons at the highest resolution (4.0 × 4.0 × 60 nm3 per voxel) was performed on an FEI Magellan XHR 400L with an accelerating voltage of 5.0 kV and beam current of 1.6–3.2 nA. Field of view sizes acquired from a given section varied depending on the cross-sectional area occupied by tissue. All acquisition was performed with a scan rate at or under 1 megapixel per s. For the 5.5 dpf larval zebrafish, this resulted in overhead-inclusive acquisition times of 5.4 days for section overviews (758.8 × 758.8 × 60 nm3 per voxel), 97 days for isotropic full transverse cross-sections (56.4 × 56.4 × 60 nm3 per voxel), and 100 days for high-resolution brain images (18.8 × 18.8 × 60 nm3 per voxel).

Continued development of faster electron microscopy technologies44 will hasten the re-imaging process and permit whole-brain studies to be carried out in a fraction of the time required here.

Image alignment and intensity normalization

Producing anatomically consistent image registration over about 18,000 sections required control of region of interest drift, over-fitting, magnification changes, and intensities. To quickly assess the quality of the dataset and begin reconstructions, we initially performed affine intra- and inter-section image registrations with Fiji45 TrakEM2 alignment plug-ins46. These results revealed that additional nonlinear registration was required in order to compensate for distortions that were likely caused by section compression during cutting and sample charging during imaging. While the state-of-the-art elastic registration method47 also provided in Fiji45 as a TrakEM2 alignment plug-in achieved excellent local registration, we experienced difficulty—at least without modification to the existing implementation—in achieving an anatomically consistent result that preserved the overall larval zebrafish structure, largely due to struggles with constraining region of interest drift across magnification changes and correcting for shearing caused by sectioning. We also determined that the similar AlignTK9 method, which uses Pearson correlation as the matching criterion coupled with spring mesh relaxation to stabilize the global volume, was likely to suffer from similar problems and would require substantial additional data handling to operate on our multi-resolution dataset.

Therefore, in order to preserve the overall larval zebrafish structure and simultaneously achieve high-quality local registration, we turned to a new Signal Whitening Fourier Transform Image Registration (SWiFT-IR) method43,48. Compared to conventional Pearson or phase correlation-based registration approaches, SWiFT-IR produces more robust image matching by using modulated Fourier transform amplitudes, adjusting its spatial frequency response during matching to maximize a signal-to-noise measure as its indicator of alignment quality. This alignment signal better handles variations in biological content and typical data distortions. Additionally, SWiFT-IR achieves higher precision in block matching as a result of the signal whitening, improves processing speeds with the computational complexity advantages of fast-Fourier transforms, and reduces iterative convergence from thousands to dozens of steps. Together, these capabilities enable a model-driven alignment in place of the usual approach of comparing and aligning a given section to a pre-selected number of adjacent sections.

The SWiFT-IR model we used consisted of an estimate of local aligned volume content formed by a windowed average, typically spanning 6 μm along the axis orthogonal to the sectioning plane (z, anterior–posterior). Damaged regions, in particular partial sections, were removed from the model to avoid adversely influencing alignment results. This model then served as a registration template, in which raw images were matched to the current model rather than nearby sections. Alignment proceeded in an iterative fashion starting at 758.8 × 758.8 × 60 nm3 per voxel (section overviews) and progressing incrementally to 56.4 × 56.4 × 60 nm3 per voxel for regions outside the brain and 18.8 × 18.8 × 60 nm3 per voxel for regions inside.

At each resolution, source images were iteratively aligned to the current model until no further alignment improvement could be achieved, as indicated by the SWiFT-IR signal-to-noise figure of merit. The model was then transferred to higher resolution data by applying the current warpings to the source data for that scale. Iterative model refinement then continued at this subsequent level. Although most computations were locally affine, residual nonlinear deformations, particularly at the highest resolutions, were represented by a triangulation mesh that deformably mapped raw data onto the model volume.

Importantly, access to the lowest resolution section overview data permitted us to build an initial model that constrained subsequent registration steps to the overall larval zebrafish structure. Although their resolution and signal quality were intentionally sacrificed in favour of rapid acquisition, the fact that overviews were quickly captured with the same microscope settings and included support tissue provided key constraints for model refinement that resulted in a more accurate global result.

More specifically, the 17,963-section overview image volume was processed using SWiFT-IR to produce an initial model at 564 × 564 × 600 nm3 per voxel. Although the lowest-resolution section overview images were each captured at 758.8 × 758.8 × 60 nm3 per voxel, the relative oversampling orthogonal to the sectioning plane enabled a geometrically accurate model at 564 × 564 × 600 nm3 per voxel. This initial model was then cropped and warped using SWiFT-IR–driven matching across the midline axis to remove cutting compression, rotations, and other systematic variations in the specimen pose. The 16,000-section 56.4 × 56.4 × 60 nm3 per voxel volume was next downsampled to 564 × 564 × 600 nm3 per voxel and aligned to the initial overview model, resulting in an improved model. The matching and remodelling process was iterated at this scale until there was no further improvement in SWiFT-IR match quality. The final model at this scale was then expanded to 282 × 282 × 300 nm3 per voxel and similarly aligned in an iterative fashion. This model volume (about 6 gigavoxels; 1,600 × 1,400 × 2,667 voxels) was convenient for rapid viewing to identify and manually correct defects and refine the pose. Further scales at 169.2 × 169.2 × 180 nm3 per voxel and 56.4 × 56.4 × 60 nm3 per voxel were similarly processed by successively expanding the model and aligning until no meaningful improvement in the figure of merit was reached. The 12,546-section 18.8 × 18.8 × 60 nm3 per voxel image set was then registered using the final 56.4 × 56.4 × 60 nm3 per voxel volume as its model.

Image intensity was adjusted across sections to achieve a consistent background level by matching the average over a tissue-free region defined by a 256 × 256 pixels2 area. Many images were acquired at 16-bit depth and were converted in this process to 8-bit depth. The target background level was mapped to intensity 250, which left headroom for bright pixels while keeping most tissue of interest from saturating. Next, a linear intensity fit between the background and a second level, typically the average grey level of a continuous trajectory region on the right side of the brain, was made to adjust the intensity values for each section.

Correspondence across light and ssEM datasets

Correspondence of individual neurons or functional reference atlas regions across imaging modalities was achieved with landmark-based 3D thin-plate spline warping of each fluorescence dataset to the ssEM dataset using BigWarp49.

For matching in vivo two-photon laser-scanning microscopy data from the same specimen, we primarily chose landmarks consisting of distinctive arrangements of low-fluorescence regions where GCaMP5G was excluded and could be easily matched to similar patterns of nuclei in the ssEM dataset. This process was difficult in regions with low fluorescence signal (Extended Data Fig. 7e), where many cells were packed closely together (Extended Data Fig. 7f), and at locations where new neurons were likely to have been added between light microscopy and preparation for ssEM (Extended Data Fig. 7g). In the future, improving the light-level data with specific labelling of all nuclei and faster light-based imaging approaches should improve the ease and accuracy of matching neuron identity.

Two functional reference atlases with many separate labels were also registered to the ssEM dataset. For matching the Z-Brain atlas50, we chose landmarks based on identifiable structures in the Z-Brain averaged elavl3:H2B-RFP or anti-tERK fluorescence image stacks that were also observed in the ssEM dataset. These structures primarily consisted of region boundaries, discernable clusters of neurons, midline points, ganglia, and the brain outline. The same Z-Brain landmarks were used for transforming a version51 of the Zebrafish Brain Browser52 that was previously registered to the Z-Brain atlas.

Image annotation and neuron reconstruction

Reconstruction across multi-resolution ssEM image volumes profits from being able to simultaneously access and view separate but co-registered datasets. Without this, some of the time benefits of our imaging approach would be offset by the need to register and track each structure across volumes that span both low-resolution, large fields of view and high-resolution, specific regions of interest. With this in mind, we added a feature to the Collaborative Annotation Toolkit for Massive Amounts of Image Data (CATMAID) neuronal circuit mapping software53,54 to overlay and combine image stacks acquired with varying resolutions in a single viewer (Extended Data Fig. 6). This is made possible by rendering using WebGL. Additionally, this new feature combines stacks via a configurable overlay order, introduces blending operations for each overlaid stack, and enables programmatic shaders for dynamic image processing. When overlaid stacks’ resolutions differ, the nearest available zoom level for each stack is interpolated. Missing data regions can be omitted or rendered with interpolation. To account for the increase in data storage and bandwidth when viewing multiple image stacks, the CATMAID image data hierarchy was extended with a shared graphics card memory cache of image tiles using a least-recently-used replacement policy. All additions and modifications to the CATMAID software are now incorporated into the main open-source release.

Manual reconstruction was conducted using our modified CATMAID version by placing nodes near the centre of each neuronal structure on every section in which it could be clearly identified. This led to a wire-frame model (‘skeleton’) for each annotated structure. Starting points for reconstruction (‘seeds’) of myelinated processes were manually identified by searching for profiles surrounded by the characteristic thick, densely stained outline associated with staining of the myelin sheath55 (Fig. 1e–g, i). The search protocol consisted of viewing all tissue on a given section from the upper-left to lower-right corner at the highest available resolution. To obtain seeds for the projectome reconstruction, searching was repeated every 50 sections throughout the 16,000 sections acquired at or higher than 56.4 × 56.4 × 60 nm3 per voxel. Many annotations were produced in an affine-only alignment space before being mapped into the final SWiFT-IR alignment space. The reconstructions reported here represent about 450 days of uninterrupted (24 h per day) human annotation.

For visualization and reported length measurements, each skeleton was smoothed using a custom python-based implementation of a Kalman smoothing algorithm on a space defined by manually annotated points within unique segments. The initial state variables for smoothing were derived by an optimization of point-to-line distance to connected reconstruction segments. Other variables were tuned with the Estimation Maximization algorithm of the pykalman library to compensate for a lack of human input where data was unavailable because of lost or partial sections. Because the final image alignment was of good quality, smoothing in this manner should produce a slight underestimate in reported reconstruction path lengths.

Neurons with known projection patterns or identities were named in the CATMAID database. For example (and subject to change), the reconstruction of a neuron innervating the right anterior macula (utricle) might be named as ‘Ear_AnteriorMacula_R_01’, while an identified neuron such as the left Mauthner neuron was named ‘Mauthner_L’. Two identifiable left–right reticulospinal neuron pairs belonged to the MeM class, which emanates from the nucleus of the medial longitudinal fasciculus (nucMLF). On each side, these were differentiated into dorsal (MeMd) and ventral (MeMv) subclasses based on consistent soma positioning.

Visualization

Image volumes and reconstructions were primarily visualized using Vivaldi56, a domain-specific language for rendering and processing on distributed systems, because it provides access to the parallel computing power of multi-GPU systems with language syntax similar to python.

For volume visualizations, we used a direct volume rendering ray-casting technique in which an orthogonal or perspective iterator was marched along a viewing ray while sampled voxel colours were accumulated using an alpha compositing algorithm. We screened out regions containing only support tissue during rendering with labelled volumes constructed by interpolating between manually produced masks indicating which image voxels belong to each separate tissue region. In cases where separate image volumes of the same region were rendered together (for example, ssEM and fluorescence combined), direct volume rendering was performed by combining front-to-back colour and alpha compositions formed from the different transfer function belonging to each image volume.

For volume visualizations including reconstructions, direct volume rendering of image data was combined with streamline rendering of reconstructions using two different techniques. The first combined an OpenGL framebuffer with the Vivaldi volume rendering. In this case, each streamline was rendered using OpenGL as a tube into an off-screen buffer (that is, Framebuffer Object). Vivaldi then compared the resulting render and depth buffers to perform direct volume rendering of only the image data above the streamline depth value. This made it possible to ignore image voxels obscured by streamlines, which were treated as opaque. The second technique involved generating a complete streamline volume by 3D rasterization. This streamline volume was then combined with the image volume for direct volume rendering. The former technique is faster and can cope with dynamic streamline changes, but the latter was found to yield better overall rendering quality for our purposes. Visualizations of reconstructions without the image volume context were rendered either in the CATMAID 3D WebGL viewer or plotted in MATLAB. When reconstructions are shown without specific labelling, colours were assigned randomly from a custom palette.

Reference plane (for example, horizontal, sagittal, and section) indicators were rendered with Vivaldi by detecting the zero-crossing of each viewing ray and the plane. Support for viewing opaque data views in some spatial regions alongside the semi-transparent volume visualization views in other regions was introduced as a new Vivaldi function, clipping_plane. Similarly, contour (nonplanar) reslice support was added to illustrate a flattened view along a specific reconstruction path consisting of vertical line segments extracted from the image volume.

For many cases, the size of the volume being rendered was larger than available memory. In order to support out-of-core processing, we developed and integrated into Vivaldi a slice-based streaming computing framework using the Hadoop distributed file system (HDFS) that will be reported elsewhere.

Symmetry analyses

Initial observations of apparent myelinated axon symmetry were found during visual inspection (Fig. 3; Supplementary Videos 8, 9). To quantitatively assess the extent of symmetry, we developed a 3D symmetry plane fitting method and two symmetry analyses: one that produces a cost associated with the 3D shape and position similarity between reconstructed structures and another that compares the relative 2D (cross-sectional) positioning of two identified neuron axons on one side with that for the contralateral axons with the same identities. Only the longest reconstructed path from the soma through the myelinated axon projection was considered in plane fitting and symmetry analyses. Dendrites or short axonal branches were ignored. Each resulting reconstruction path (skeleton) was represented as an ordered list of nodes (points) taken directly from manual reconstructions. Sidedness (left or right) was determined by soma position.

The new 3D symmetry plane fitting and 3D symmetry comparison analysis approaches have been described elsewhere57. The symmetry plane fitting, in brief, involves choosing an approximate symmetry plane, reflecting the complete set of points belonging to the reconstruction subset of interest with respect to this plane, registering the original and reflected point clouds with an iterative closest point algorithm, and inferring the optimal symmetry plane from the reflection and registration mappings. The subset of reconstructions from which this plane fitting was performed consisted entirely of identified neurons whose axon projections formed part of the approximately 30-μm-diameter MLF, recognized with the help of refs 29, 58.

The 3D symmetry comparison for each template reconstruction on one side, in brief, involved reflecting all contralateral skeletons and computing a matching cost via dynamic time warping (DTW) between the template and each reflected skeleton. The reconstruction subset analysed in this fashion was restricted to identified neuron classes with 1–2 members per side whose axons formed part of the MLF. For our purposes, the DTW cost was taken as the sum of the Euclidian distances between all matched points normalized to the number of matched point pairs (Extended Data Fig. 9a–c). The DTW gap cost parameter for matching a point in one sequence with a gap in another was set to zero because our data was sampled at a nearly constant rate and we sought the optimal subsequence match even in cases where one is shorter than or offset with respect to the other. To compensate for unmatched regions (that is, overhangs), the DTW cost was then multiplied by a penalty factor proportional to the sequence lengths remaining unmatched (total length divided by matched length). Comparing each reconstruction on one side to all reconstructions from the opposite side formed a cost matrix (Fig. 4b) from which an optimal pairwise assignment could be determined without any bias introduced from the previously determined identities. The Munkres algorithm59 was then used to compute a globally optimal pairwise assignment.

We also sought to compare the relative 2D positioning for each set of two axons on one side with the contralateral set that had the same identities. The reconstruction subset analysed in this fashion was restricted either to the Mauthner cells and nucMLF neurons (Fig. 3c, 4e, g) or the larger set of 44 identified reticulospinal neurons (Fig. 4b; Extended Data Fig. 9i).

To start, we compensated for a small angle offset in the sectioning plane relative to the true transverse plane by projecting the point coordinates of reconstructions such that the previously computed symmetry plane became the plane x = 0. Given the transverse planes z0, z1 and a projected skeleton S containing points s = (sx, sy, sz), we let . That is, was taken as the subset of points from S whose coordinates are contained in the interval . We refer to the subset of ℝ3 bounded by as the slice . For each slice and skeleton S, we defined as the mean of the elements in . This mean was then taken as representative of the skeleton S in slice for analysis and plotting. Note that all analysis and plotting presented in static form was based on a slice thickness corresponding to a single section (approximately 60 nm), where each slice consisted simply of adjacent sections. Larger slice sizes were used for dynamic presentation (Supplementary Video 10) in order to reduce video duration and size.

For comparing a set of two axons with its contralateral counterpart, we then took s1,..., sn to be the set of representative points in a fixed slice for skeletons S1,..., Sn and took t1,..., tn to be the representative points (for the same slice) of the respective skeletons T1,..., Tn that were previously matched to S1,..., Sn by the Munkres algorithm assignment after 3D symmetry analysis.

To quantify the degree of similarity, we devised two measures (Extended Data Fig. 9e). The first, termed the angle difference, ai,j, between a set of two axons and their contralateral counterparts, was defined as:

where i, j were skeleton indices, was the reflections of with respect to the computed plane of symmetry, was the dot product between x and y, and was the norm of x.

The second, termed the distance difference, di,j, between a set of two axons and their contralateral counterparts, was defined as:

where i, j were skeleton indices, was the reflections of with respect to the computed plane of symmetry, was the absolute value of x, was the norm of x, and M was the maximum of across all axon sets and all slices. Note that ai,j and di,j were normalized such that they could vary from 0 (no difference, 0° or 0 μm) to 1 (maximum difference, 180° or 8 μm). Further, when the points si and sj were perfectly symmetrical with respect to points ti and tj, then ai,j = 0 and di,j = 0.

To visualize this quantification, a difference matrix, D, was generated for each slice such that D(i,j) = ai,j if j > i and D(i,j) = di,j if j < i (Fig. 4f; Extended Data Fig. 9f; Supplementary Video 10). Calculating the variance for each element in D across all slices showed which axon sets deviated most with respect to the reflection of their contralateral counterparts (Fig. 4k). Heatmaps of the vectorized upper (j > i) and lower (j < i) triangles of D across slices additionally revealed locations with differences between axon sets and their contralateral counterparts (Fig. 4h, i; Extended Data Fig. 9h, j, k; Supplementary Video 10), with black values representing insufficient data at slice positions where at least one of the compared axons was not annotated. Plotting the sum of all ai,j and di,j values for a given slice further illustrated positions where differences were present (Fig. 4j). Finally, the same analysis was performed after artificially swapping the identities (assignment) of the two axon reconstructions with the lowest 3D symmetry analysis costs (MeLc and Mauthner) to provide a basis for comparison (Fig. 4j).

Code availability

Custom software tools generated for data handling, visualization, and analysis are publicly available (http://zebrafish.link/hildebrand16/code). Our modifications to CATMAID53,54 software are included in the main open-source release (http://github.com/catmaid/catmaid). More information on SWiFT-IR alignment software is publicly available (http://www.mmbios.org/swift-ir-home).

Data availability

All aligned ssEM data, reconstructions, transformed functional reference atlases, and an introductory guide are publicly available (hosted by NeuroData at http://neurodata.io/data/hildebrand16 and http://zebrafish.link/hildebrand16). Image data are served as a collection of 8-bit 1,024 × 1,024 pixel2 PNG images with an optional tRNS value of 255 specified to enable transparency. The original resolution for each image stack was downsampled multiple times to create a resolution hierarchy that provides a smooth visualization experience. The entire aligned image dataset requires about 2.7 terabytes of disk space as compressed PNG images (607 gigabytes for 56.4 × 56.4 × 60 nm3 per voxel ssEM data, 1,824 gigabytes for 18.8 × 18.8 × 60 nm3 per voxel ssEM data, 355 gigabytes for 4.0 × 4.0 × 60 nm3 per voxel ssEM of dorsal neuromasts, 1 gigabyte for 600 × 600 × 1200 nm3 per voxel Z-Brain data, and 3 gigabytes for 600 × 600 × 1200 nm3 per voxel Zebrafish Brain Browser data). Data and reconstructions are served to end users via Amazon Web Services (AWS), with an instance of our modified CATMAID53,54 software deployed on the Elastic Compute Cloud (EC2) that points to static images hosted by the Simple Storage Service (S3) built-in web server.