Keywords

1 Introduction

Fluorescence microscopy is an invaluable tool for the investigation of spatial distribution and dynamics of cellular and tissue structures in biological samples. Fluorescence allows reaching high contrast and selective labelling of proteins of interest in a live-cell compatible manner. Two main strategies can be distinguished in fluorescence microscopy, according to the way the image is created. In the case of widefield microscopy, the whole sample is excited through an even and extended illumination, and the fluorescence is collected at once with your eye or on an imaging detector (Fig. 1a). Alternatively, in scanning approaches, the sample is probed point-by-point using a sharply confined spot of excitation light. To enhance the image quality of a scanning microscope, a pinhole is placed in the image plane of the detection path to spatially filter out and select primarily the light emitted from the focal plane of the sample (confocal microscope, Fig. 1b). Confocal microscopy introduces optical sectioning and increased spatial resolution compared to widefield approaches. Widefield systems can however image significantly faster than scanning microscopes on extended fields of view (FOV) and are simpler and cheaper to build and maintain. Both widefield and scanning microscopes are limited in spatial resolution by the wavelength of light, as first realized by Ernst Abbe in 1873 when he formulated the theory behind the diffraction limit [1]. The wave nature of light means that it cannot be focused to a spot smaller than roughly half its wavelength. This characteristic consequently sets a limit on the size and shape of the so-called point spread function (PSF) of a microscope, which describes the image of an isolated emitting point in the sample. If two adjacent emitting points are distanced significantly closer than the size of the PSF and imaged through an optical system, their PSFs will overlap and they will thus not be resolvable in the image.

Fig. 1
figure 1

Fluorescence microscopy. (a) A widefield fluorescence microscope illuminates the whole sample with excitation light, creating fluorescence emission that is imaged through the objective onto a camera. (b) An image can also be generated by sequentially scanning a focused probing beam over the sample, at each point measuring the amount of emitted fluorescence. In a confocal scanning microscope, the detected light is also passed through a pinhole to reject emission from out-of-focus planes. (c) The wave nature of light means that it cannot be focused onto an infinitesimally small spot. The smallest achievable focal point is around half the wavelength of the light that is focused. This limit is called the diffraction limit. (d) The image of an emitting point is called the point spread function (PSF) and its size is limited by the diffraction limit. Two adjacent emitting points can only be resolved if they reside further away from each other than about half the size of the PSF of the imaging system

Super-resolution fluorescence microscopy techniques emerged in the late 1990s, pushing the resolution beyond the diffraction limit imposed by the wave nature of light. The key to surpassing the diffraction limit lies in the possibility to switch a fluorescent molecule between an on and an off state [2,3,4] (Fig. 2a). This can be done either in a deterministic way, using the on/off switching to reduce the effective PSF, or in a stochastic way, by recording the emission of many individual molecules within a diffraction-limited spot one at a time (single molecule localization microscopy, SMLM) [5,6,7].

Fig. 2
figure 2

Coordinate-targeted nanoscopy. (a) Illustration of the on/off switching between a fluorescent and a non-fluorescent generic molecular state. The on-to-off transition is induced by red light, while the off-to-on is induced by blue light. (b) Spatial distribution of the light and the state distribution of the molecules in the foci of a coordinate-targeted switching point-scanning microscope in 2D and (c) in 1D. To optimize on-state confinement (orange line), the minimum of the off-switching light (red line) should be exactly at the maximum of the on-switching (blue line). (d) Jablonski diagram illustrating electronic transitions of the standard pathway of fluorescence emission and how stimulating emission can inhibit the emission of fluorescence by forcing the molecule to emit the excess energy as stimulated emission photons. Below the state diagram is an illustration of the pulse sequence and dwell time for a STED recording. In the pulse sequence, the time delay between the different pulses and emission is highlighted. (e) Diagram illustrating a molecular transition where the molecule can instead be switched between two metastable molecular states, referred to as on and off state. The probes can be switched between the two states through illumination with different wavelengths of light. Below the state diagram, an example RESOLFT pulse scheme and dwell time is shown on a time scale. (f) Example of STED imaging. Confocal and STED images of the nucleus of a U2OS cell with inset showing zoomed-in area. The nuclear pore complex protein Nup153 has been labeled with Abberior STAR635P. Scale bars: 5 μm, and 500 nm, respectively

This chapter will focus on the deterministic methods, alternatively called coordinate-targeted switching approaches, since they rely on the point-by-point investigation of the sample typical of a scanning microscope. The optical suppression of the fluorescence can be achieved either by STimulated Emission Depletion (STED) [8, 9] or more generally by Reversible Saturable OpticaL Fluorescence Transitions (RESOLFT) [10,11,12]. STED is based on optically perturbing the electronic transition pathway that generates the fluorescence, and can therefore act on the ps-ns scale (Fig. 2d). In RESOLFT, the fluorescence properties of the molecule changes in response to a perturbation of its chemical configuration (Fig. 2e). Since the dark states involved in RESOLFT have a longer lifetime, the switching is notably slower (μs-ms scale) but requires magnitudes lower illumination intensities. Starting from a conventional confocal microscope, super-resolution can be achieved using STED or RESOLFT by superimposing a patterned illumination on top of the excitation illumination. The patterned illumination suppresses fluorescence emission from the outer regions of the diffraction-limited excitation spot and allows fluorescence emission only from the very central point where the suppressing light is zero. In this way, the effective excitation spot is shrunk, giving a finer probing of the sample structure (Fig. 2f). Expanding scanning systems to these super-resolution implementations however also adds significant complexity and comes with new challenges, both from an optical engineering perspective and in terms of sample preparation and label design.

STED and RESOLFT approaches were first introduced in a point-scanning implementation, where images are acquired by sampling over the FOV using a single probing spot. This adds a dependency between the temporal resolution and the imaged area (or volume) since the dwell time at each position is multiplied with the number of sampling points. Thus, point-scanning microscopy methods reach a high temporal resolution for small areas and volumes but imaging time quickly increases when scaling up the FOV. To overcome these limitations, multiple approaches to further increase the imaging speed have been proposed and demonstrated since the introduction of the methods, including minimizing the recording time and illumination doses by using smart scanning or by introducing light patterns to parallelize the acquisition. Smart scanning refers to systems where a single excitation spot is adaptively scanned over the sample, meaning that the scanning speed or illumination intensities are modulated in real time during the scan. Therefore, scanning of regions of the sample which do not contain structures of interest can either be completely avoided or illumination levels can be adapted to the structure. This increases the speed and/or lowers the illumination dose in an adaptable way for each FOV and its biological content. Smart or adaptive scanning systems have been demonstrated to improve performance both in STED and RESOLFT [13, 14]. Another approach to increase acquisition speed is to instead utilize the concept of parallelization where a large array of confined PSFs is generated in each illumination sequence. In these methods, the temporal resolution does not scale linearly with the FOV size, since the total sampling area is instead determined by the distance between two adjacent points in the array (which can be as close as 250 nm, limited only by diffraction). As a consequence, parallelized systems have a clear advantage when imaging larger areas.

In this chapter, we will describe the underlying concepts of STED and RESOLFT microscopy, and outline their strengths and limitations. In particular, we will review the approaches and ideas demonstrated to increase the imaging speed and decrease the illumination dose with the main goal of increasing their compatibility with live-cell imaging over extended periods of time.

2 Light-Induced Reversible State Transitions

Many state transitions can be used as reversible on/off switches. Their suitability for coordinate-targeted switching techniques depends on how efficiently molecules can be switched to a certain state at a specific time and be reversibly brought back to the original state with minimal loss. This translates into spatiotemporal control of the saturation of state populations within the fluorophore population which is necessary for successful imaging. In practice, the biological application will lend itself better to the use of one type of transition over another, as the application sets the requirements on illumination, light source, recording time, photobleaching, and imaging environment. Regardless of the nature of the transition, the effective excitation spot will be given by the equation:

$$ d=\frac{\lambda }{2\mathrm{NA}}\frac{1}{\sqrt{1+I/{I}_S}} $$
(1)

where the intensity of saturation for the on-to-off transition is IS = kon/σoff, with σoff being the molecular cross-section for the transition and kon accounting for the competing process that restores the initial fluorescent state [15]. The saturation intensity reports the intensity required to suppress the fluorescence to half its initial level and it is the key to obtaining the spatially sub-diffracted confinement of emitting molecules.

2.1 Reversible Electronic Transitions: Stimulated Emission

Following excitation from the ground state to the excited state, a fluorophore can release energy through spontaneous fluorescence emission or thermal relaxation. The return to the ground state can, alternatively, be induced by stimulated emission in an interaction with incoming photons. Illumination with light in which the photons have an energy equal to the energy gap between the excited and ground state will drive the electrons back to the ground state, resulting in the emission of photons of the same energy as the stimulating illumination, as illustrated in Fig. 2d. In STED, therefore, stimulated emission represents a competing process to fluorescence emission, effectively able to deplete the molecules from the excited state and thereby suppress the spontaneous fluorescence emission. Commonly used STED microscopes feature an illumination beam shaped like a doughnut to induce stimulated emission (depletion beam) overlapped onto a Gaussian-shaped beam of light used for fluorescence excitation. In the center, only the excitation light is present as the doughnut features a local “zero” here (Fig. 2c), and the molecules residing here will thus emit fluorescence. Instead, on the crests of the doughnut where the intensity of the STED beam is higher than the saturation level, Is, the stimulated emission will prevail over fluorescence emission, effectively turning off the molecules in this peripheral area.

To efficiently drive the molecules to the off state, the stimulated emission transition has to outcompete spontaneous fluorescent emission, which has a lifetime of a few nanoseconds (1–5 ns). Combined with the low stimulated emission cross-section (σ ≈ 10−16 cm2), this translates into a requirement of high photon flux (Is ~ MW–GW/cm2) to achieve saturation of the on-to-off transition.

The relatively high intensity required by the STED beam sets specific demands on the fluorescent probes [16,17,18]. The main parameter for a probe to perform well in STED microscopy is photostability, meaning the ability of the probe to withstand several excitation-depletion cycles without significant photobleaching. The depletion wavelength depends on the emission spectrum of the fluorophore, and moving it toward the peak of emission gives a larger cross-section and therefore reduces the STED power needed [19, 20]. However, to avoid direct excitation by the depletion beam and to minimize interference with the fluorescence window, the depletion wavelength is generally moved to the red tail of the emission spectra. Even in such optimized depletion conditions, the STED image quality can be compromised by the interference of the STED beam with other photophysical pathways. This can lead to excited state absorption [21, 22] which, in turn, can result in a higher probability of bleaching.

Stimulated emission is a general process universal to all fluorophores. As such, STED has been demonstrated with many fluorophores of different nature, e.g., organic molecules [16, 23], fluorescent proteins [24,25,26], and inorganic nanocrystals [27,28,29] (e.g., nanodiamonds, quantum dots, and lanthanide upconversion nanoparticles). Besides matching the photophysical parameters optimal for STED imaging, the choice of probe is also linked to its biocompatibility. Cytotoxicity can arise from the intracellular oligomerization and aggregation of probes, and the high illumination power can induce photothermal effects around the probe as well as the formation of free radicals and singlet oxygen molecules. It is difficult to absolutely predict and directly detect the damage induced during imaging [30, 31]. Therefore, it is often estimated by assessing cell viability before and after the imaging experiment. Phototoxicity is highly dependent on the irradiation wavelength, therefore a common strategy is to shift the illumination to the red region of the spectra. This also has the additional advantage of being less susceptible to scattering and creating less autofluorescence, thus increasing the possible penetration depth. Labeling strategies used for live-cell and in vivo STED microscopy rely on genetically encoded markers like fluorescent proteins; self-labeling protein tags [32,33,34,35] (SNAP-, HALO- or CLIP-tags) or unnatural aminoacids [36] combined with cell-permeable dyes; or small cell-permeable molecules with high labelling specificity that can be for example drug-based or lipid-based (e.g., NileRed and SiR-Actin) [37].

The nature of the STED process and in particular the need to act within the fluorescent lifetime of the fluorophore sets general requirements on the hardware used in STED microscopes [22]. STED lasers need to deliver high illumination power to the sample during the lifetime of the excited state and thus be precisely timed after the excitation (delay in the order of ~ps to allow relaxation to the lowest excited state). Pulsed lasers with picosecond pulse widths match these power (MW–GW/cm2) and time (~500 ps) requirements for optimal depletion. The fast time scale of stimulated emission depletion directly translates into fast imaging times. Pixel dwell times in STED microscopy can vary from 1–300 μs depending on the fluorophore brightness and density. This requires a fast and precise scanning system, often based on galvanometric mirrors, resonant scanners [38], or electro-optical scanners [39]. Additionally, the fast scanning of STED systems also requires fast detectors and electronics able to record and distinguish photons from adjacent scanning points. Common detector types for STED microscopes are, for example, photo-multiplying tubes (PMTs), avalanche photodiodes (APDs), or single-photon avalanche diodes (SPAD).

2.2 Reversible Molecular Transitions

RESOLFT encompasses all the coordinate-targeted super-resolution approaches that rely on metastable or long-lived fluorescent and non-fluorescent states, which can be reversibly and efficiently populated upon illumination with specific wavelengths. The use of molecular transitions between metastable states with long lifetimes, such as isomers, decreases the intensity required since the transitions and population saturation do not have to be induced on the nanosecond time scale. Compared to using electronic transitions, the intensities used while imaging can be decreased by several orders of magnitude. For example, in states with lifetimes on the microsecond timescale, the intensity required for saturation is on the order of a few W/cm2.

RESOLFT techniques set strong requirements on the fluorescent label, especially in terms of switching kinetics and photobleaching resistance [40]. In STED microscopy, the use of electronic transitions on the nanosecond timescale makes the process quasi-instantaneous. In RESOLFT, the kinetics between the on and the off state reside within the microsecond–millisecond timescale and therefore directly influence the image recording time negatively (Fig. 1e). To be suitable for RESOLFT imaging, the conformational changes that underpin photoswitching needs to withstand many cycles. For example, to be able to acquire an image with an expected resolution of 50 nm in the focal plane, a step size of 25 nm is needed (according to Nyquist-Shannon sampling theory). Assuming that proteins within a 200 × 200 nm2 area are switched on in every illumination cycle, the proteins will need to cycle at least (200/25)2 ≈ 60–70 times without a significant loss in fluorescence to allow for an image to be acquired. The ability of a probe to be photoswitched many times without significant loss of fluorescence is referred to as fatigue resistance.

Reversible on-off switching mechanisms can be found in reversibly photoswitchable fluorescent proteins (RSFPs) and organic photochromic fluorophores (Fig. 3). In GFP-like proteins, reversible photoswitching is generally a result of a photoinduced isomerization coupled with a protonation/deprotonation of the chromophore [41]. Three types of photoswitching can be identified according to how light controls the switch: positive, negative, and decoupled. The three types and their response to illumination schemes are illustrated in Fig. 3a–c.

Fig. 3
figure 3

Reversibly switchable probes. Example of photoswitching behavior and related effects on the pulse scheme used for RESOLFT. (a) Negative switchers are fluorescently excited by the same wavelength as that which causes them to switch off. (b) Positive switchers are fluorescently excited by the same wavelength as that which switches them on. (c) Decoupled switched are switched on and off and are excited by three separate wavelengths. (d) Example of used switching mechanisms for RESOLFT. In reversibly switchable fluorescent proteins, the switch is linked to a cis/trans isomerization combined with a protonation/deprotonation step or by a rearrangement of the hydrogen network. Among the photoswitching dyes, some of the molecular mechanisms used are either directly switching the dye between an “open” and a “closed” form or indirectly using this switching to enable or impair the fluorescence of a covalently-linked dye. The molecular structure and schemes are readapted from the original publications [59, 60]

In negative switchers, the excitation wavelength also induces off-switching (Figs. 1e and 3a). The coupling of the excitation and off-switching wavelength imposes a sequential recording scheme where, at first, an ensemble of molecules is switched to their emitting states. Subsequently, a spatially patterned light featuring a minimum in the center switches off all the molecules in the periphery and leaves only the ones in the center able to emit. At last, those are excited and their fluorescence is induced upon illumination with a third beam of the same wavelength (Fig. 3a). Negative RSFPs have varying off-switching kinetics affecting the dwell time used for imaging, from tens of μs to ms. For fast switchers like rsEGFP2 or Dronpa2, the dwell time is generally 300 μs in a point-scanning recording and 3 ms in a parallelized system where lower intensities are generally used. The most common types of negative switchers have excitation maxima in the blue (~488 nm) and on-switching in the UV–violet. The ability to completely cycle a population within hundreds of microseconds together with their ability to withstand thousands of switching cycles made rsEGFP2 [12] and Dronpa2 [42] fundamental for the development of the RESOLFT concept and the possibility to achieve the best trade-off between temporal and spatial resolution. Additional variants such as rsGreen [43] and rsFolder [44] were also developed to enhance the expression levels in mammalian and bacterial cellular systems. The need for new and brighter labels led to the further development of the GMars family [45, 46], whose biphasic photobleaching trend can be used for prolonged timelapse imaging. The development of the rsGamillus family [47], instead, extended the imaging to biological processes in more acidic pH compared to the physiological one. Furthermore, the use of photoswitching mechanisms outside the GFP superfamily and into the bacterial photoreceptors, such as the LOV domain of YtvA from Bacillus subtilis, adds to the list a smaller oxygen-independent alternative compatible with anaerobic environments [48].

For the positive switchers, the excitation wavelength is coupled to the on-switching path while the off-switching is driven with violet light (Fig. 3b). All the reported positive switchers, such as Padron [49] and Kohinoor [50], are in the green spectral range, with on-switching and excitation around 488–500 nm and off-switching in the UV–violet range. With the faster switching speed and the different dependency on the wavelengths exhibited by the newly developed Padron2, Konen et al. demonstrated point-scanning RESOLFT imaging with illumination patterns superimposed both spatially and temporally and with pixel dwell times of ~500 μs [51].

For the decoupled switchers, the switching is fully decoupled from the excitation wavelength (Fig. 3c). The RSFP Dreiklang is switched on and off by illumination at ~365 nm and ~405 nm, respectively, while the excitation wavelength is at ~515 nm [52, 53]. In contrast to the other groups, the switching is the result of a reversible hydration/dehydration reaction that modifies the chromophore. In terms of imaging, these switching properties allow maximization of the photon budget of the protein, potentially increasing the SNR. However, the switching speed of current decoupled switchers is significantly slower and has so far required pixel dwell times of ~20–50 ms, thus liming their use for dynamic live-cell imaging.

The reported switching mechanisms are predominantly in the green region of the spectra and require the use of UV–violet light to a varying extent. To reduce any potentially phototoxic effects of the shorter wavelengths and thereby enhance the live-cell compatibility of RESOLFT, there is a continuous push to develop probes in the red region of the spectra. Examples of variants successfully used in RESOLFT are asFP595 [10] and rsCherryRev1.4 [54]. They are negative switchers, with excitation maxima in the orange (~590 nm) and a broad on-switching range between UV–violet and green (400–510 nm). The newly reported red fluorescent protein variants of rsFusionRed show a sevenfold decrease in off-switching time, allowing imaging speeds compatible with living cells and enough photostability to follow their dynamics over multiple frames [55].

Different photoswitching strategies have been identified also in synthetic dyes (Fig. 2d). The covalently-linked dye pair of Cy3 and Alexa647 can act as an organic photoswitcher, using the long-lived non-fluorescent state of the dye in a thiol buffer solution and the possibility to revert it by a different wavelength of light [56]. Fluorescence bis-sulfone diarylethenes have also been used for RESOLFT [57,58,59]. These compounds present an “open” and “closed” chemical form of the chromophore and upon illumination with UV–visible light (350–405 nm), they switch to the fluorescent isomer. Alternatively, the fluorescence of a dye can be modulated by a covalently-linked photoswitchable quencher [60]. This strategy employs a different class of photoswitchers called spironaphthoxazines, which can switch from a yellow spiro form to a deep violet merocyanine form upon 405 nm illumination, enabling Förster energy transfer and consequently quenching of the second fluorophore. Even if brighter than fluorescent proteins, the overall performance of synthetic dyes in RESOLFT microscopy is limited. The major challenges are still photobleaching, reduced number of cycles, and often the need for UV light or a specific environment for optimized switching. Nevertheless, the possibility to open up the technique to new labeling approaches for living cells (like SNAP-Tag or HaloTag technologies) is an area of interest in the field.

From the hardware point of view, the nature of the transitions used in RESOLFT eases the requirement of power and time compared to STED microscopes, and thus the requirements of the lasers. The much lower peak intensities (kW–W/cm2) required for molecular switching are compatible with continuous wave (CW) lasers. A key technical aspect is also the possibility to digitally modulate the illumination on the μs-ms timescale, achievable both through commercial CW lasers or acuosto-optic modulators. The different pulse sequences that have been implemented in RESOLFT are summarized in Fig. 3.

3 Adaptive and Smart Scanning of the Illumination

In a point-scanning system, an image is acquired by scanning a probing beam over the FOV to be imaged, point-by-point and line-by-line (Fig. 4a). The residence time of the beam in each pixel is called the dwell time, ts, while the step size from one point to the other (corresponding to the final pixel dimension) is called the scanning step, Δx. In the context of a single-point-scanning system, the imaging speed is determined by the number of scanning points, ns, required to construct the image and the dwell time required to measure at each scanning position. The number of scanning points in turn depends on the scanning step size and the size of the imaged field of view (lfov), while the dwell time required to collect sufficient signal depends on the illumination power, switching kinetics, brightness, and labelling density. In most traditional systems, the aforementioned imaging parameters, ns, ts, and lfov, are set to fixed values before image acquisition begins. This means that the same parameters will be used for every scanning point, regardless of the sample properties at that point. Consider, for example, a large sample area in which labeled structures of interest are sparsely dispersed. Large regions of the area will not contain structure and thus not generate any signal, but will still be imaged using the same parameters as the regions containing structure. This results in unnecessarily long acquisition times and large total illumination dose also to samples containing small or sparse structures (Fig. 4a). Smart scanning or adaptive illumination systems feature a more flexible and sample-dependent scanning scheme, where the dwell times or illumination intensities are adjusted depending on the properties of the sample in the currently scanned region (Fig. 4b–c).

Fig. 4
figure 4

Point-scanning approaches. (a) Conventional point scanning. By scanning a probing focal spot over the sample in a raster-scanning geometry and detecting the amount of fluorescence generated at each point, a point-scanning microscope builds up an image of the sample pixel-by-pixel. According to the Nyquist-Shannon criteria, the scan step should be no more than half the expected resolution. Therefore, if a 10× resolution increase is expected over a diffraction-limited image, the scanning steps required are 100× more in a 2D image. This leads to a significantly slower recording time and increased light dose on the sample. The size of the probing focal spot determines the level of detail i.e. the resolution of the image. (b, c) Point-scanning microscopy can be made more efficient by implementing adaptive or smart scanning approaches. (b) Adaptive scanning refers to the adaptation of illumination intensities to the sample structure. A weak intensity is used as an initial probe. When sample structure is detected, the illumination powers are increased to optimize image quality at regions containing structure of interest. (c) Smart scanning approaches also adapt the temporal pulse sequence at each pixel to the sample, essentially skipping over pixels that do not generate significant signal when probed with a probing pulse. Adaptive scanning minimizes the illumination dose on the sample and smart scanning additionally decreases the required scanning time

Adaptive illumination has been used to decrease the illumination dose of STED microscopy (Fig. 4b). In adaptive illumination STED microscopy, the power of the STED laser is modulated during the scan in response to the structure of the sample. The sample is scanned with a diffraction-limited excitation spot with constant intensity. As long as the excitation spot does not generate fluorescence, no STED illumination is applied. When fluorescence exceeds a certain threshold the STED beam is activated (RESCue-STED) [61]. In DyMIN (Dynamic Intensity Minimum), the STED beam is instead switched on at increasing power the higher the fluorescence detected, and therefore the structure of interest more centered in the recorded area. This strategy minimizes the STED illumination, using the STED beam only when and at the intensity needed to maximize information instead of always at the maximum level independently from the characteristics of the sample. DyMin has been tested on periodic actin structures as well as the gephyrin clusters at inhibitory synapses of rat hippocampal neurons [14], with a reduction of total light dose on the sample up to 45-fold. Moreover, the more efficient illumination scheme allows the use of longer pixel dwell times without bleaching, enabling imaging at higher signal-to-noise ratio compared to conventional scanning.

When the regions of interest are within the scale of the diffraction limit, the concept of sample-dependent scanning can be adapted accordingly. In MINFIELD [62], a low-resolution image of a sparse sample is used to localize small substructures and define small regions of interest (~100 × 100 nm2) that are sequentially imaged with the STED beam active. Since the region of interest containing the structure is smaller than the size of the STED beam (central zero FWHM ~ 280 nm), MINFIELD minimizes the exposure of fluorophores to STED illumination as they will not see the maximum intensity of the doughnut crest.

These approaches are efficient in decreasing the light dose to the sample. A further improvement toward live-cell compatibility is to speed up the acquisition with smart scanning approaches. Smart scanning implementations in RESOLFT involve using a short initial probing pulse to detect the presence of any fluorescent structure [13] in a specific scanning step. If the probing pulse generates a low signal, the region is ignored and the scan rapidly moves on to the next scanning position. If the probing pulse generates sufficient photons, the region is considered to be of interest, and the full RESOLFT pulse scheme is applied at the current scanning point (Fig. 4c). This smart scanning approach saves time in the acquisition by not dwelling on pixels that do not add information to the image, and the speed-up depends on the type of structure being imaged. The method reaches up to 6 times faster recordings and is an especially efficient approach when imaging, for example, neuronal protrusions or sparse point-like structures, where only a small fraction of the pixels (15%) in the final image will contain relevant information. Using smart scanning RESOLFT, Dreier et al. show peroxisome dynamics in small fields of view at imaging speeds of 2–5 Hz and mitochondria membranes at imaging speeds of 27 Hz, essential to remove motion artifacts. Furthermore, it was shown that the illumination dose could be decreased by 70–90% [13] since regions of the sample not containing labeled structures are not exposed to the RESOLFT imaging scheme but only to the probing illumination.

The use of fast scanning devices, like electro-optic deflectors (EODs), introduces the possibility of pixel hopping also in STED. FastRESCue STED [63] is a direct evolution of the RESCue-STED [61] technique. If no signal is detected, not only is the STED beam inactive but also the scan skips to the next pixel. This strategy increases the imaging speed by up to 4–5 times compared to RESCue-STED, with a decreased light dose of ~20% [63] in a biological sample compared to conventional STED.

4 Optimized Patterns and Adaptive Detection

The methods described above optimize the scanning strategies to improve image quality and speed while lowering illumination dose. Other methods instead focus on the illumination or detection side.

In the illumination, different spatial distributions of the STED beam have been explored. In particular, the use of one-dimensional depletion patterns instead of a two-dimensional doughnut shape allows for a more efficient depletion (due to steeper minima and consequently lower saturation light). A two-dimensional resolution improvement is then obtained by recording one-dimensional high-resolution images at multiple angles of the depletion pattern. For this reason, the strategy is called tomographic STED [64].

In the detection, by measuring the arrival time of photons at the detection site, information regarding their origin across the PSF can be inferred. Gated-STED is an implementation mainly applied to STED using CW depletion lasers [65]. The lower peak powers of CW lasers decreases the efficiency of stimulated emission and thus the STED beam has not had time to induce sufficient suppression of fluorescence at early times after the excitation pulse. With time-gated detection as in gated-STED, only the photons arriving in the later parts of the lifetime window and thus increasingly from the central parts of the PSF are used to construct the image, increasing the resolution. A similar concept is used in the SPLIT-STED implementation, where a phasor analysis is used to separate “high-resolution photons” (mainly from the central parts of the PSF) from “low-resolution photons” (mainly from the outer parts of the PSF) based on photon arrival times [66,67,68]. Optimized detection and photon assignment schemes like these enable high-resolution imaging to be performed with lower and less perturbing illumination intensities. Alternatively, the strategies can be used to improve the resolution without the need of increasing the illumination dose.

One additional strategy to compensate for photobleaching, therefore allowing long recording time, focuses on the probe and more specifically the labelling strategy. Combining the PAINT (Point Accumulation for Imaging in Nanoscale Topography) [69] and the STED approach, fluorogenic labels that transiently bind to their target structure can be used to obtain a constant exchange of labels between the solution and the target structure [70, 71]. As a consequence, the photobleached population of fluorophores is constantly exchanged with a new set of bright fluorophores, allowing prolonged STED recordings. The potential of this approach was demonstrated by extended timelapse and volumetric recording without degradation of the image quality. Alternatively, a photobleaching-immune approach has been developed for the study of the extracellular space of the brain with 3D STED microscopy. The entire extracellular fluid is labeled with a fluorescent marker to enable super-resolution shadow imaging (SUSHI [72]) of the extracellular space in living organotypic brain slices, and thus negative imaging of the complete cellular outlines. The large volume and continuous refreshing of fluorescent labels makes the technique unaffected by photobleaching and protected from phototoxicity since phototoxic molecules are produced outside of the cells.

5 Parallelization

Another way of increasing the imaging speed, especially for large FOV images, is through parallelization of the illumination and read-out (Fig. 5). In coordinate-targeted switching techniques, the parallelized acquisition means that the illumination confines and probes the fluorescence of the sample in multiple points at the same time. Using different types of light patterns, arrays of sharply confined emitting points are generated, each probing a spatially separate part of the sample. For arrays of emitting points covering the whole FOV, the scanning time is then decoupled from the FOV and instead defined by the distance between the individual illumination points. The array of emitting points needs to be read out at the same time but in spatially distinct pixels achievable either with multiple point detectors or using large camera sensors. The emission from each point is quantified by measuring the recorded fluorescence signal, which will be assigned to the corresponding pixel in the final image. Parallelized scanning has been proposed and implemented both in STED and RESOLFT systems.

Fig. 5
figure 5

From point scanning to parallelization. (a) Representation of a parallelized workflow. Parallelization of coordinate-targeted switching techniques relies on creating multiple emitting points in the sample that are imaged onto an extended sensor. If the emitting points are sufficiently far apart, the emission from the different points can be independently quantified and assigned to the corresponding pixel in the final image. Thanks to the parallelization, the number of scanning steps can be greatly reduced. (b) From a single scanning point, the degree of parallelization can be gradually increased by multiplexing the foci and doughnut patterns or pushed to the limit using large patterns generating tightly packed emitting spots separated only by a distance just above the diffraction limit

The different characteristics of STED and RESOLFT introduce different challenges and limitations regarding the extent of parallelization. In STED, the scale of achievable parallelization is limited by the high intensity required for efficient depletion, while in RESOLFT the lower intensity allows for extended parallelization of the illumination. In both STED and RESOLFT, the temporal resolution of parallelized systems is to a certain extent limited by the current camera technology. Read-out times of state-of-the-art cameras are currently limited to a few milliseconds for the often required megapixel frames. In STED microscopy, the required pixel dwell time is usually only a few tens of microseconds. In point-scanning systems with practically instantaneous read-out, the pixel dwell time is thus usually also a few tens of microseconds. In a highly parallelized STED system, though, every scan step requires a camera frame to be read out, resulting instead in pixel dwell times on the millisecond time scale. Much of the gain from the reduced number of scan steps is therefore lost with the increased pixel dwell time. On the other hand, the switching cycle in RESOLFT systems requires several hundreds of microseconds or even milliseconds to achieve the on-state confinement in each scan step. The camera read-out time, therefore, increases the pixel dwell time by a relatively low amount and parallelization of RESOLFT systems can thus give a vast improvement in imaging speed compared to point-scanning systems. Nonetheless, even a parallelized RESOLFT system only outperforms a point-scanning STED system in terms of imaging speed if the side of a squared imaged region surpasses ~5–10 μm.

The use of cameras as detectors adds some additional complexity, but also offers more possibilities to the processing of the detected signal. In contrast to a single-point detector, which inevitably integrates all photons on the photosensitive area, a camera can capture images of the intensity distribution detected. This allows for different types of digital pinholing or more advanced statistical analysis of the detected signal.

5.1 Parallelized STED

STED has been demonstrated with different approaches and degrees of parallelization [73,74,75]. With four illumination spots at a distance of 5.7 μm, the first implementation could achieve only a moderate degree of parallelization. The moderate degree of parallelization however allowed for the continued use of point detectors, leading to an increased recording speed by four times compared to a single-point-scanning approach [74] (Fig. 5a). This however comes at the expense of system complexity and scalability of the approach. This together gives a four times faster system able to image structures down to ~35 nm (Fig. 5b, center).

A significantly higher degree of parallelization was achieved by introducing a new STED beam pattern exhibiting zero-intensity points over a large region and at a periodicity of 290 nm [73, 75] (Fig. 5b, right). Each zero-intensity point allows for highly confined emission creating a large array of emitting points. To detect the highly parallelized emission, point detectors are substituted by a camera sensor. As discussed, the slower reading time of a camera compared to a point detector in a STED microscope limits the pixel dwell time. For this reason, the 1,000-fold parallelization achieved in the implementations based on large interference patterns for depletion can increase the recording speed to seconds, but not to the theoretical recording time of 100 frames per second (potentially achievable considering the pixel dwell times of STED). Furthermore, to achieve such speeds, the extension of the parallelization requires high laser intensities to induce stimulated emission, demanding powerful laser sources or a smaller number of foci. The use of a spatial light modulator to create the optical lattice for the depletion could reach ~70 nm resolution over 3 × 3 μm2 FOV [73]. Improvement on the laser source (low repetition rate and high pulse energy) could extend the FOV to 20 μm in diameter [75]. The continued development of laser and detector technology may further extend the current limit of parallelized STED.

5.2 Parallelized RESOLFT

In contrast to STED microscopy, the molecular switching required for RESOLFT microscopy leads to pixel dwell times on the order of hundreds of microseconds to milliseconds. Since this is on par with the speed of fast exposure times using state-of-the-art sCMOS cameras, significant gains can be made in terms of imaging speed by parallelizing the raster-scanned acquisition of RESOLFT microscopes. Many different implementations of parallelized RESOLFT exist, all aiming at achieving high degrees of parallelization by using illumination patterns covering large areas of the sample (up to 100 × 100 μm2) [76,77,78,79].

5.2.1 Widefield-RESOLFT

The implementation of parallelized RESOLFT that gives the highest degree of parallelization uses homogeneous widefield illumination for the on-switching and read-out step of the pulse scheme and a superposition of orthogonal sinusoidal patterns in the off-switching step to achieve the confined emission. This implementation is termed Widefield-RESOLFT (wf-RESOLFT) [76] and is illustrated in Fig. 6. The distribution of confined emitting points in wf-RESOLFT is solely determined by the geometry of the off-switching pattern. To achieve sharply confined on-state points in a short time, it is beneficial to use an off-switching pattern with a steep intensity gradient surrounding the intensity zeroes. The shorter the periodicity of the pattern, the lower the intensity at its peaks is needed to achieve the same confinement [75, 80]. As an additional benefit, the contraction of the periodicity translates into an increased speed of recording, although it comes with the drawback of cross-talk between neighboring emitting spots, whose PSFs will start to overlap in the detected image. Overlapping PSFs on a parallelized detector can be separated using computational unmixing or deconvolution algorithms [76, 77], although the presence of noise will quickly deteriorate the image quality. In practice, this limits the accessibility of the technique to bright and flat samples, while axially extended samples where out-of-focus emission has a larger contribution or signal-to-noise is low are difficult to image. Balancing all these aspects, wf-RESOLFT has been implemented using off-switching patterns with a periodicity, P, of 320–360 nm [46, 76, 77], and with a massive improvement in throughput covering a FOV of 100 μm in diameter. Overall, the time of the RESOLFT pulse scheme multiplied by ns (now defined as (P/ds)2 instead of (lfov/ds)2 as in point scanning) leads to a frame time of around 3 s for a lfov = 100 × 100 μm2. The low illumination powers required for RESOLFT imaging makes wf-RESOLFT a suitable method for acquiring timelapse recordings of large and flat samples.

Fig. 6
figure 6

wf-RESOLFT patterns. A high degree of parallelization can be achieved by switching on the whole FOV with a widefield illumination. A patterned illumination exhibiting an array of zeros switches off everything except the fluorophores residing in these zero-intensity points. A final widefield illumination excites fluorescence from the remaining on-state fluorophores. Scale bar: 1 μm

5.2.2 MoNaLISA or Multifoci Parallelized RESOLFT

To extend the parallelized approach to thicker 3D samples, the influence of out-of-focus emitting regions and the cross-talk between in-focus emitters needs to be minimized or eliminated. This has been achieved by independently modulating the periodicity of the on-switching and read-out illumination, named Pon and Pro, from the one of the off-switching illumination, Poff, in the RESOLFT pulse scheme (Fig. 7). In practice, this can be done by creating an array of intensity maxima in the focal plane with the use of microlenses and relay optics. Spatially modulated on-switching light switches on primarily the fluorophores residing at these points (Fig. 7). The co-alignment of the intensity maxima of the on-switching multifoci with every minimum of the off-switching array generates sharply confined nanosized regions of fluorophores (Fig. 7). The additional multifoci read-out pattern co-aligned with the on-switching pattern is used to excite fluorescence from these spatially confined on-state molecules. The multifoci pattern in the on-switching and read-out step not only allows for better signal detection from in-focus emitters but also gives the system improved optical sectioning. Indeed, the non-overlapping PSFs of the emitting points allow for digital pinholing to discard most of the out-of-focus signal. Additionally, the combination of illumination patterns creates emission primarily from the focal plane of the system (Fig. 8a), both due to the two-step process of on-switching and read-out but also since much of the out of focus on-switched molecules will be switched off by the off-switching pattern.

Fig. 7
figure 7

wf-RESOLFT and MoNaLISA. The wf-RESOLFT configuration forces a trade-off between the ability of the off-switching pattern to confine the emission and the distance between adjacent emitting points. An off-switching pattern with high modulation will create sharply confined emitting spots, but at such a short distance from each other that they will be hard to distinguish in the diffraction-limited detection. Relaxing the modulation of the off-switching pattern will bring the emitting points further apart but will also compromise the confinement of the emitting spots. The MoNaLISA scheme alleviates this problem by decoupling the modulation of the off-switching pattern from the distance between adjacent emitting spots

Fig. 8
figure 8

MoNaLISA patterns and data. (a) In the MoNaLISA scheme, additional patterns are added into the pulse scheme causing the on-switching and read-out light to be confined to a sparser subset of zero-points in the off-switching pattern and also inducing an axially confined emission. (b) Comparison of a traditional widefield image with MoNaLISA image of U2OS cell endogenously expressing Vimentin-rsEGFP2. Scale bar: 5 μm. (c) Zoom-ins of ROI I and II showing a comparison of the resolution and contrast achieved with widefield imaging, enhanced confocal (using multifoci arrays without the off-switching pattern), and the full MoNaLISA scheme. Scale bars: 500 nm. Image adapted from Masullo et al. [78]

Having decoupled periodicities (Pon = Pro ≠ Poff) allows shrinking Poff down to the diffraction limit ~250 nm, without affecting the cross-talk of the emitting spots, which is defined by Pon. The MoNaLISA (Molecular Nanoscale Live Imaging with Sectioning Ability [78]) system delivers fast high-resolution imaging of living cells by balancing the trade-off between imaging speed and image quality and by providing optical sectioning through axially confined emission and digital pinholing. Fields of view of 50 × 50 μm2 are recorded at around 1 Hz inside the three-dimensional structures of the cell. The enhanced contrast gained by the full integration of the emitting volume enables the use of fast and resistant RSFPs that due to lower brightness are generally difficult to image in other parallelized systems (Fig. 8b). Dim signal, like the endogenous level of expression of vimentin tagged with rsEGFP2 protein, can be visualized thanks to the improved contrast of MoNaLISA. In Fig. 8c, the contrast and resolution of MoNaLISA images are compared with widefield and enhanced confocal (using multifoci arrays without the off-switching pattern) images of the same structures.

5.2.3 3D Parallelized RESOLFT

Although the multifoci arrays used in the MoNaLISA microscope provide optical sectioning abilities, the axial resolution is still diffraction limited. The off-switching pattern used in these implementations does not provide any additional confinement along the axial direction. Three-dimensional super-resolution has however been demonstrated using a more complex off-switching pattern that exhibits three-dimensionally confined zero-intensity points [79]. An example of a 3D-confining pattern is shown in Fig. 9a. This pattern, in combination with the multifoci array used for on-switching and read-out, creates emitting volumes that are confined in all three spatial dimensions. Probing the spatial distribution of fluorophores in 3D requires the sample to be raster scanned along the x-, y- and z-dimension with a scanning step satisfying the Nyquist-Shannon sampling criterion.

Fig. 9
figure 9

3D pRESOLFT pattern and data. (a) Extension to 3D super-resolution can be achieved by creating an off-switching pattern that exhibits zero-volumes confined in all three dimensions, creating a 2D lattice structure in both the x–y, x–z and y–z planes. (b) Full volumetric recording of a mitochondrial network, together with an x–z slice of a single mitochondria compared with an enhanced confocal recording of the same structure showing the drastically improved resolution in all dimensions. Cells are U2OS expressing rsEGFP2-Omp25. Star (*) denotes deconvolved data. Scale bars: 2 μm (top) and 500 nm (bottom). (c) Graphs showing approximate imaging time taken for volumetric imaging using psRESOLFT, psSTED, and 3D pRESOLFT. The top graph assumes a fixed axial size of 1.6 μm with increasing lateral FOV. The bottom graph assumes a fixed 40 × 40 μm2 lateral FOV with increasing axial size. Calculations assume standard pixel dwell times of the different techniques (psSTED: 30 μs, psRESOLFT: 400 μs, 3D pRESOLFT: 6 ms) and a scan step size of 50 nm

Examples of improved axial resolution on a point-scanning architecture have been proved by utilizing interference between two opposite objectives (isoSTED [81, 82], 4Pi-RESOLFT [83]) or using axial depletion patterns in a single objective configuration (3DSTED [9, 84]), shrinking the PSF down to 30–50 nm. This 3D parallelized RESOLFT (3D pRESOLFT) implementation has been shown to achieve a spatial resolution below 80 nm in all three dimensions. As such, it is a powerful tool for measuring the lateral and axial spatial distribution of proteins in micrometer thick sections of axially extended samples as shown in volumetric images of mitochondria of U2OS cells presented by Bodén et al. [79] and shown in Fig. 9b. Parallelization on a volumetric recording translates into a 200-fold improvement in the speed of recording, comparing the same RESOLFT dwell time (from 4.4 h for point-scanning RESOLFT to 78 s in 3D pRESOLFT, Fig. 9c).

5.3 Digital Pinholing of Camera Data

When multiple emitting points are imaged onto a camera, a consecutive step is needed to quantify the emissions from the emitting points and assign them to the correct pixels in the final image. The most straightforward approach is the use of a binary mask on the camera frame, aligned to the a priori knowledge of the geometry of the emitting points. If the emitting points are sufficiently far apart compared to the diffraction limit, this approach is the digital analogy to the use of a physical pinhole in a point-scanning system. This approach has been used in wf-RESOLFT systems with a Gaussian pinhole function instead of a binary mask [76]. If the images of the emitting points are too close to each other so that their respective PSFs overlap, very small pinholes need to be used in order to minimize cross-talk (around one-fifth of the Airy disc). Alternatively, the cross-talk can be unmixed by defining the problem as a system of linear equations to be solved through least-squares minimization or maximum likelihood estimations [75, 77]. High cross-talk between emitting regions reduces the total signal extractable from each spot. Alternatively, when the emitting points are non-overlapping, as in MoNaLISA and 3D pRESOLFT systems, they can be considered independent from each other. Therefore, each camera region can then be processed individually and the signal can be fully extracted and separated from the background contribution.

6 STED and RESOLFT for Live-Cell Imaging

STED and RESOLFT microscopy are versatile imaging tools that can be used for a plethora of different imaging applications in material and biological sciences including plants, bacterial, and mammalian systems. Their contribution to these disciplines is extensively covered in many recent reviews [85,86,87]. Here we limited our attention to the new strategies introduced in coordinate-targeted nanoscopy to push their ability toward live-cell imaging at the nanoscale. Within this goal, three aspects assume a central role: temporal resolution, phototoxicity, and cell accessibility. We summarized the techniques described in the previous paragraphs according to these metrics to give a comparative view of them (Table 1).

Table 1 Comparison of the super-resolution deterministic strategies for increased live-cell imaging compatibility

Since biological processes take place on many time scales, from milliseconds to hours, following sub-second dynamics without sacrificing spatial resolution or the overall cellular context represents an important challenge in biological imaging. STED microscopy, providing instantaneous nanoscale confinement, reaches extremely short pixel dwell times (<100 μs). Though parallelized approaches have been demonstrated, the most common implementation of STED still uses a single scanning point due to the requirements on power density and limitations imposed by current camera technology. This intrinsically links the temporal resolution of the technique to the size of the FOV recorded. For smaller fields of view, STED systems can reach tens of milliseconds temporal resolution enabling the study of fast processes in small localized regions. RESOLFT on the contrary has intrinsically slower dwell time (>100 μs) but offers more modularity in recording strategy. The ability to work with lower intensities allows moving from single-point scanning to extended parallelized recordings.

The nanoscale resolution for both techniques also increases the need for sensitivity. Higher resolution ultimately means a decrease in the emitting volume and the number of fluorophores contributing to the signal. The universal nature of stimulated emission provides a great variety of fluorophores to choose from. Especially useful are the bright and photostable dyes that can push the resolving ability toward 20–30 nm. For RESOLFT, instead, the primary type of labels to date are the RSFPs that often offer a limited photon budget highly dependent on the illumination condition being used. When the system under study is dynamic, it is not only crucial to achieve high resolution on a snapshot of the sample, but also to describe its time evolution. The need for stable fluorophores to meet this demand is a common requirement among super-resolution techniques, as that photostability ultimately limits the sampling and the time window available.

Considerations on light damage in live-cell experiments are of great importance and a complex topic where many factors come into play. High irradiance on cells generally increases the risk of phototoxicity but how the light is distributed in space and time on the sample is also crucial to the effect. Slow scanning (i.e., long dwell time) induces more pronounced stress on the cells compared to fast scanning (i.e., short dwell time, compensated by line or frame averaging to reach a similar contrast [88]). The level of resolution achievable in the deterministic approach is tuned by the intensity of the off switching beam, which allows tuning of the power used according to the desired resolution. The smart scanning approaches push this concept of adaptability to the sample further by actively investigating the space and introducing a decision in the recording. In each step, a decision is taken whether to use the STED beam or not, or whether to stay or to move. Another parameter that can influence the level of phototoxicity in the sample is the wavelength of light in use, with red-shifted light being less toxic to the cells. Moving the STED beam toward the red has been crucial to increasing its live-cell compatibility. For RESOLFT almost all reported switching mechanisms involve the use of UV–violet light at different doses depending on the specific photophysics in use. The complex photophysics of some red RSFPs allows moving away from this requirement and completing the whole switching cycle with light above 500 nm. Active research into new switching mechanisms can further help to move away from this requirement. Especially for negatively switching RSFPs, the UV light dose is minimal and does not preclude imaging at different scales, from adherent cells to full organisms like Drosophila [89] or C. Elegans [13]. In general, RESOLFT microscopy with its significantly lower illumination intensities is a technique aimed primarily at live-cell imaging, allowing prolonged recordings of living cells and tissues. However, the performance of the technique is highly dependent on the use of a few select fluorescent markers. A broader adaptation to a wider range of biological questions is dependent on the continued development of new probes and illumination schemes.

7 Challenges and Outlooks

Coordinate-targeted switching approaches to super-resolution microscopy have in the last 20 years moved from being a niche technique, available only to highly specialized labs and for selected biological applications to being a technique widely used in life science institutes around the world. The psSTED microscope, for example, is today a workhorse in many imaging facilities. This is largely due to the versatility of the technique, easily extendable to multicolor and able to achieve high resolution in fixed, thin and thick samples. It also benefits from the possibility to use conventional dyes and to be added as an extra module onto a standard confocal microscope. However, for applications requiring long-term and minimally invasive monitoring of living cells, point-scanning systems often fall short due to the time taken to scan the image with a single point and the energy applied by focused light. Novel smart and adaptive scanning systems partially overcome this issue by optimizing the scanning trajectory and illumination sequence. Development in this direction is already available in some commercial microscopes and open-source initiative. RESOLFT microscopes are also commercialized, to some extent also with the parallelized recording (i.e., wfRESOLFT), even if these are less present in imaging facilities. The lack of commercial RESOLFT systems is likely due to the need for specific probes and that the technique is mainly targeted at live-cell imaging. Multicolor RESOLFT, though feasible, is also more challenging due to the need for multiple laser wavelengths for each RSFP requiring multiple lasers for multicolor imaging. Finally, the use of RESOLFT for live-cell imaging is still fairly recent, with the first successful results emerging only about 10 years ago.

Going from a single-point-scanning microscope to a parallelized system adds significant optical complexity to the setup but can provide orders of magnitude improvements in imaging speed and decrease of illumination dose. The added optical complexity of parallelized schemes arises particularly when different patterned illuminations are used in multiple steps of the illumination scheme to accommodate the need of specialized photoswitchable probes. In MoNaLISA and 3D pRESOLFT systems, three different illumination patterns need to be crafted and co-aligned to each other in order to generate high-quality images. Additionally, as in single-point-scanning approaches, the quality of the zeros in the off-pattern needs to be close to perfect in order to achieve sharp on-state confinement. Refractive index variations or other optical imperfections in the sample may cause aberrations to the illuminating beams distorting the geometry or degrading the quality of the zero-intensity points. The issues become more severe when imaging deeper into the sample. Imaging tens of micrometers inside tissue is thus challenging even with the improved optical sectioning provided by these systems. Future implementation of adaptive optics may provide solutions to restore the quality of the wave front inside the sample enabling high-resolution imaging further inside tissues. Creating patterns with controllable shape and maintained quality throughout thicker samples may also allow for even faster volumetric imaging with parallelization also along the axial dimensions.

Highly parallelized imaging schemes often rely on cameras for detection. Although they provide the properties that make this type of imaging possible, they also limit the systems in some aspects. One such limitation comes from the relatively long time required to read out the data from a camera frame. Even state-of-the-art sCMOS cameras require several milliseconds to read out a full camera frame (~4 MP). Since every scan step in a parallelized scanning sequence requires a frame to be read out, the camera read-out time is often a major bottleneck in the data acquisition scheme. In parallelized RESOLFT schemes, the camera read-out time usually accounts for about 20–50% of the pixel dwell time [55, 78, 79]. In parallelized STED systems, this relative time may be significantly higher due to the fast timescale of stimulated emission allowing for microsecond integration times. Cameras also introduce other challenges stemming from their noise properties. Not only do they exhibit non-zero amounts of read-out noise (not present in APDs), but the use of cameras can also introduce additional pixel-dependent noise. Considerations need to be taken to handles all these aspects and future camera and detector development will likely contribute greatly to further push these techniques in terms of image quality and speed.

Another challenge likely to be undertaken is to extend these super-resolution techniques to multi-channel or multicolor imaging. STED microscopy is commonly implemented in multicolor imaging of up to four colors [92]. RESOLFT, however, is still limited to the library of existing RSFPs. Finding multiple spectrally separated RSFPs with switching kinetics that allow for fast imaging remains a challenge. Additionally, if the different RSFPs are switched by different wavelengths, the illumination patterns need to be created and aligned in multiple beam paths adding again to the complexity of the optical system. Other ways to generate multicolor RESOLFT images have been proposed as being preferable, for example, by separating different RSFP labels by their switching kinetics [93, 94]. This has the benefit of requiring only one spectral illumination and detection channel. However, it still requires RSFPs that differ not in spectral but in photoswitching behavior, more specifically fast and slow off-switching kinetics. As a consequence, the imaging speed is limited by the switching speed of the slower switcher. Utilizing both spectral and dynamic parameters to separate channels may allow future implementations with three or more separable channels. More in general, the development of new switchable proteins in different regions of the spectra, with different photophysical behavior and with an added sensing module (e.g., calcium [95, 96] or pH [97]) will likely pave the way for new technical innovations and applications.

In this chapter, we covered the strategies used in deterministic super-resolution techniques to reach higher compatibility toward fast live-cell imaging. The deterministic coordinate-targeted switching approaches compose a subset of super-resolution fluorescence microscopy techniques. The distinction between coordinate-targeted approaches and stochastic localization techniques has however narrowed as techniques are emerging combining stochastic switching with scanning systems and illumination patterns. The concept was first introduced with the MINFLUX technique [98,99,100] and has been further adapted in other forms and to other applications [101,102,103,104]. The successful utilization of well-designed illumination strategies together with accurate knowledge of photophysical behavior will likely be the key to further refinement of optical systems for the interrogation of biological structures and dynamics. The continued improvement of such systems is essential for the advancement of biological research in the life sciences.