Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

11.1 Introduction and Overview

A wide range of timescales are involved in biological processes, ranging from sub-microseconds for helix formation, to milliseconds for translation of an mRNA to a chain of amino acids and, in some cases, seconds for the folding of the chain to a functional specific three-dimensional structure. An overview is given in Fig. 11.1. A goal of the biophysicist is to understand the thermodynamics and mechanism of how these and related processes occur. A structural probe that a biophysicist can use with a time resolution to match the biological/biochemical process is a critical tool for revealing the underlying physics.

Fig. 11.1
figure 1

Timescales for macromolecular dynamic processes (lower half) and techniques used to probe them (upper half). Copyright—2011 Wiley Periodicals, Inc. [31]

An expansive set of tools have been used to achieve a mechanistic and structural understanding of protein-protein, protein-RNA, and protein-DNA interaction, or protein folding and RNA folding. It is important to recognize the complementarity and synergy of many of these techniques. For example, in the case of protein folding, the design of a FRET pair for probing folding dynamics benefits significantly from the availability of a native crystal structure, radius of gyration measurements of the native and denatured state ensembles and residue level hydrogen-deuterium exchange protection information. The present overview of microsecond mixing interfaced with TCSPC is presented with this context in mind, that it can be a valuable tool in the toolkit of the biophysicist. It is also worth noting that numerous other complementary techniques are available with significantly greater time resolution than microfluidic mixing (Fig. 11.1), [31] such as temperature-jump, [14, 20, 24, 30, 44, 53, 67] NMR relaxation methods [28, 51] and FCS, [17, 18] but these have their own limitations with respect to initiation of the reaction and appropriateness for a given protein. The microfluidic methods discussed in this review are ideal for kinetic reactions initiated by a dilution reaction, as in the case of protein folding, [62] RNA folding, [50] or enzymatic reactions [34] in which a reagent, such as ATP or an analogue, is added. For protein folding studies, the mixing studies provide a means for studying high-energy partially folded structures that are transiently populated as an approximately random-coil polypeptide folds to its native structure. The mixing approaches are able to achieve relatively high populations of these high-energy states, which may have too low of an equilibrium population for single-molecule studies for proteins with large stability, see Fig. 11.2.

Fig. 11.2
figure 2

Cartoon representation of the energy landscape for a protein. Some partially folded states are rarely accessed in equilibrium experiments. Non-equilibrium kinetic methods are therefore advantageous in studying them

11.1.1 Micromachining Advances Usher in a New Era

Although microsecond mixing devices have appeared in the literature for over three decades (without a dramatic improvement in mixing times) [33], they had not been widely adopted by the biophysics and enzymology researchers, who have favoured the more established stopped-flow method with its millisecond time resolution limitation. Over the past decade and a half, however, advances in micromachining methods (e.g., laser machining, photolithographic methods and DRIE etching) have enabled smaller micron-sized features and more intricate patterns to be machined. The devices have also been increasingly more robust, and ushered in new mixing techniques, such as laminar and chaotic mixing, in addition to the more traditional turbulent mixing methods [41, 54].

11.1.2 Concurrent Advances in TCSPC

Concurrent with developments of microfluidic based mixing devices in the late 1990s and early 2000s came the introduction of compact PCI-based high-speed TCSPC electronics. The higher counting rates possible with these PC-based TCSPC cards, their multi-dimensional recording capabilities [2, 3] (see Chap. 1) and high-repetition rate lasers made kinetic experiments, particularly of the ‘double-kinetic’ variety, [7, 9] (simultaneously performing reaction kinetics and TCSPC based time-resolved fluorescence kinetic) more accessible. These experiments, and their counterparts using camera-based detection [35], allowed researchers to probe millisecond timescale kinetic reactions using lifetime detected FRET. These developments allowed users to obtain not just an average end-to-end intramolecular distance for a given set of donor-acceptor labelled residues of a protein but to also obtain, via Laplace inversion, the time evolution of a distance distribution during a biochemical reaction [35].

11.1.3 Merging Microfluidics and TCSPC

The availability of continuous-flow microfluidic mixers opened the possibility of bringing the time-resolution of the ‘double-kinetic’ experiment into the microsecond time range. The availability of performing TCSPC-based lifetime-resolved FRET and anisotropy studies is significant in that many fundamental events, such as the formation of loops and hairpins of proteins [15, 26, 47, 67], and the folding of some globular proteins [37, 71, 72] occur on this timescale.

11.1.4 Matching Simulation and Experiment

The accessibility of the microsecond time regime in experiments has been paralleled over the past decade with complementary advances in computational molecular dynamics simulations. All-atom simulations can now be performed on proteins greater than 100 amino acids and extend into the millisecond timescale [40, 43]. The overlap between experiment and simulations allows validation of the simulations and provides atomistic insight into the mechanism governing the folding process and dynamics of a globular protein. The availability of distance distributions from lifetime-based FRET data [13] (see Chap. 7) obtained from TCSPC-based measurements in microfluidic devices, as discussed later in this chapter, can be a useful metric for comparing with simulations.

In this review, an overview of microfluidic devices interfaced with TCSPC will be presented, with an emphasis on experimental details and data analysis. This area is rapidly evolving, and the material presented here will perhaps foster new ideas and approaches for structural measurements of biological complexes in the microsecond time range.

11.2 Microfluidic Mixing Methods

Microfluidic mixers with microsecond time resolution have typically followed two general approaches (Fig. 11.3).

Fig. 11.3
figure 3

Schematic overview of turbulent/chaotic and laminar mixing approaches to microsecond mixing. Channel widths are ~10 µm for laminar mixers and 30–100 µm for turbulent/chaotic mixers. The central sheath in the hydrodynamically focused laminar mixer is ~0.1 µm. Copyright © 2011 Wiley Periodicals, Inc. [32]

In one case, the mixer operates in the laminar regime and the mixing is accomplished by focusing of a central sheath of flow to a width of approximately 0.1 μm. Diffusion of the solvent molecules across this narrow sheath can occur on the microsecond timescale. These mixers are very efficient, allowing full experiments to be conducted with femtomoles of samples when used with a confocal microscope. The central 0.1 μm (or narrower) wide sheath is narrower than the sample volumes but this is not a serious limitation for fluorescence experiments, especially if a confocal instrument with a high numerical aperture objective is used. For coaxial laminar mixers, the small dimensions of the central sheath are an advantage as there is less out-of-focus light because the central sheath is less than the dimensions of the confocal volume [27]. By appropriate selection of the flow stream, a macroscopic observation volume is attainable for laminar flow mixers [22]. As shown in Fig. 11.4, the central sheath is selected and expanded in an effort to slow the flow rate sufficiently for single-molecule studies. Spectroscopic and kinetic studies have been performed with laminar mixers in the visible [22, 27, 52] and ultraviolet wavelength region [41, 49] (with extrinsic and intrinsic chromophores of proteins, respectively) with both ensemble and single-molecule experiments. However, these experiments have not been used with TCSPC detection.

Fig. 11.4
figure 4

A laminar mixer covering the 200 µs to seconds time range suitable for TCSPC and single-molecule studies. A 20 nm central sheath (yellow) is picked out of the flow and expanded (orange) to slow the flow rate. Reprinted by permission from [22]. Copyright (2011)

Another popular approach to microsecond mixers relies on turbulent or chaotic mixing [33, 57]. Two solutions flowing at a fairly high flow rate (10–100 mL/min) in channels of 50–250 μm width are brought into contact in a way that generates Reynolds numbers on the order of several thousand (Fig. 11.5).

Fig. 11.5
figure 5

An alternative simpler laminar mixer covering the ms to minutes time range suitable for TCSPC and single-molecule studies. The solutions mix in a 2.5 µm central channel (yellow to red transition) before expanding to ~40 µm in the observation channel. The flow velocity is ~1 µm/ms. Reprinted by permission from [70]. Copyright (2013)

The solution breaks up such that the largest eddies are on the order of 0.1 μm, allowing diffusion, and mixing, to occur on the tens of microsecond timescale. These mixers tend to consume significantly more material than laminar mixers but allow a larger volume (and thus more signal) to be detected. The larger sample region makes these devices straightforward to interface with ensemble TCSPC setups. However, the high flow rates in these mixers give rise to very low residence times for sample molecules in the focal volume (~few microseconds) and make them less than ideal for single-molecule measurements.

The early capillary mixers used steady-state fluorescence detection [56, 60], but an early report by the Rousseau group showed that interfacing of a stainless steel mixer to a TCSPC detection setup could be readily accomplished [64]. An optical quality quartz cuvette mounted on the T-shaped mixer with 250 × 250 μm2 channels achieved a mixing time of ~400 μs. A subsequent study using laser machined mixers constructed from 127 μm thick Kapton (polyimide) or PEEK (poly-ether-ether-ketone) used an epifluorescence detection arrangement with a TCSPC detection setup. Using this setup, approximately 100 TCSPC decay traces of horse heart cytocrome c were obtained over a 30–1200 µs range along the time range of the folding reaction [10].

A variety of mixers relying on split-and-recombine and/or flow obstructions that operate at Reynolds numbers in between the laminar and turbulent regimes have been reported [42, 59, 68]. While these “chaotic mixers” generally do not produce mixing on the same timescales as the turbulent mixing or hydrodynamic flow focusing techniques, they operate at relatively lower flow rates than the turbulent mixers and have larger detection volumes than the flow focused laminar mixers. These mixers can be easily interfaced with fluorescence detection using total intensity and TCSPC for both ensemble and single molecule techniques [21, 70]. An illustrative example is shown in Fig. 11.5, where mixing is achieved in a 2.5 µm wide channel before widening of the completely mixed solution. This approach slows the flow sufficiently for single-molecule experiments and also enables a large uniform volume to be used for various types of spectroscopy. Lastly, these mixers can achieve longer timescales ranging from a few hundred microseconds to minutes by introduction of a serpentine pattern [70]. While these timescales are easily accessible by most modern stopped-flow devices, the mixing efficiency of chaotic mixers is far superior to that of stopped-flow devices, consuming significantly less material. These devices also operate under lower turbulence and pressure regimes. Further, as the sample is in continuous flow mode, photobleaching is seldom a problem.

11.2.1 Materials

The last few years have seen significant advances in new mixer designs, facilitated in part due to user-friendly off-the-shelf simulation software and the availability of micromachining capabilities. Photolithography followed by etching is widely being used for micromachining of silica wafers [70] that are readily used in hydrodynamic flow focusing laminar mixer devices. A range of materials like quartz cover slips and silicon based polymer materials, PDMS, PMMS, POM, etc. can be used as window material on these chips for spectroscopic applications under low-pressures.

The higher pressures in turbulent flow mixing techniques prevents the use of delicate window materials and is generally circumvented by separating the mixing, which is performed in metal or PEEK mixers, from the observation channels. The optical quality of quartz along with its inert nature makes it ideal for most mixer types. The use of a femtosecond painting method for micro-etching quartz, developed by Bado and coworkers [8] has enabled the use of fused “single piece” quartz mixers [31, 33] with virtually no design limitations even under high pressures.

11.2.2 Optimization of Mixer Designs by Simulation

Robust commercially available computational fluid dynamics packages, e.g. COMSOL (Fig. 11.6) and ANSYS/Fluent, have led to systematic optimization of various mixer design parameters to achieve the highest mixing efficiencies at the lowest flow rates possible [16, 42, 46, 63]. Of the parameters optimized for various flow geometries, most groups have found that constricting the flow path at the point of mixing of two fluids in conjunction with an obstruction in the form of sharp turns is the most efficient under non-laminar flow conditions. Under laminar conditions, the SAR, serpentine and herringbone patterned mixers appear to be more effective as they provide greater opportunity to increase the contact surface area between the two fluids and enhance passive diffusion.

Fig. 11.6
figure 6

Simulation using COMSOL. Fluid velocities at different positions along the channel are shown. The two side input channels (50 μm) are flowing at 2 m/s and the central channel (30 μm) flowing at 0.4 m/s. The flow velocity is uniform within 0.5 mm from the point of mixing

One of the major limitations of continuous-flow mixers is that they have historically required either higher concentrations (laminar mixers) or amounts (turbulent mixers) of sample relative to stopped-flow experiments. Chaotic mixers can potentially overcome both of these limitations. Simulations play a significant role in developing geometries to optimize flow paths to achieve faster and more efficient mixing and to minimize dead spaces to reduce the priming sample volume required for uniform flow.

11.3 Interfacing Microfluidic Chips with TCSPC

11.3.1 Advantages of TCSPC

The motivation for interfacing microsecond mixing devices with TCSPC instrumentation [2, 3] stems from several factors. For FRET studies, the advantages offered by TCSPC can be substantial, see Chaps. 7 and 8 of this book. By collecting the time-resolved decay of the donor (and, where available, the acceptor), the user is able to record the average distance as well as the distribution of distances between a donor and acceptor fluorescent label on the molecule of interest. Therefore, by having the TCSPC axis, the user can distinguish whether the average FRET efficiency arises from a single population or multiple sub-populations. Additionally, the average FRET efficiency calculated from the donor lifetime is more accurate than that obtained through steady-state methods [13], see Chaps. 7 and 8. In steady-state methods, matching the concentrations of donor-only and donor-acceptor controls is a source of error in the average FRET efficiency determination. The fluorescence lifetime, however, is independent of concentration unless the concentration is sufficiently high that higher order oligomers, exciplexes or other quenching reactions are introduced. Furthermore, the theoretical standard deviation of a lifetime measurement is √N/N for N photons [36], where N is on the order of 106. TCSPC comes very close to this ideal value. This latter point is helpful for the continuous-flow TCSPC experiment as the first moment of the lifetime decay can be used as a cross-check to confirm the intensity normalization, as will be discussed further below. Advantages of TCSPC detection also extend to anisotropy measurements, where the measurement of rotational correlation times can be very useful for assessing local structure and hydrodynamic size, see Chap. 12 of this book.

With the availability of compact high-repetition rate lasers over a broad spectral range and PC-based TCSPC cards, the adoption of TCSPC detection for microfluidic mixing devices is likely to increase. Below we present some of the instrumentation and design considerations in the practical application of continuous-flow microfluidic mixers with TCSPC detection.

11.3.2 Instrumentation

The basic setup for continuous-flow TCSPC experiments is shown in Fig. 11.7.

Fig. 11.7
figure 7

Schematic of epifluorescence setup for continuous-flow TCSPC measurements. Abbreviations: L lens, F bandpass filter, PMT photomultiplier tube, PD fast photodiode. The flow channel is ~50–70 µm wide, ~50–100 µm deep and 30 mm long. The flow velocity is ~(30 µs/mm)−1

The setup is essentially that of an epi-fluorescence microscope equipped with an xy-scanning stage. The implementation currently in use at the University of Massachusetts Medical School utilizes the doubled or tripled output of a Ti:sapphire laser operating at 76 MHz repetition rate as the excitation source. The repetition rate is dropped to 3.8 MHz using an acousto-optical pulse picker for most experiments to allow more accurate determination of longer lifetimes (e.g., >4 ns), such as, experiments using Trp and cyano-Phe [1, 58, 65] fluorescence. The 3.8 MHz is due to the limitation of our pulse picker, in fact a repetition rate on the order of 20 MHz would be more appropriate. A laser power of several hundred µW is available but only a power of ~100 µW is needed for excitation.

For experiments at 290–295 nm, a dichroic filter (FF310, Semrock Inc., Rochester, NY) is used to reflect the excitation to the sample. A single element plano-convex lens (35 mm focal length) is used to focus the excitation to an approximately 20 µm spot. Although a higher numerical aperture objective can be used, maintaining a constant intensity along the length of the 30 mm flow channel becomes more challenging. A thin quartz plate is used to pick off a fraction of the excitation beam and directed into a power meter.

The microfluidic mixer is mounted horizontally onto a xy-stage (Ludl Biopoint 2). A positioning accuracy of a few microns is sufficient for channels that are >30 µm in width.

The fluorescence from the sample solution inside the channel is collimated, and transmitted by the dichroic mirror. The fluorescence light is further split in two components, which are projected onto UV sensitive PMT modules (PMH100-6) or microchannel plate (MCP) PMTs. The outputs of the PMTs are fed into two separate SPC150 TCSPC cards [3].

Wavelength selection is accomplished via bandpass filters (e.g., FF357/44 for tryptophan experiments), although the option of using a monochromator is available. The greater spectral width and excellent out-of-band rejection available with bandpass filters renders them very practical. A motorized Glan-Taylor polarizer at magic-angle is present in each detection path. The polarisers can be set to detect parallel and perpendicular polarisation components or for magic-angle detection. Magic-angle detection rejects contributions from chromophore reorientational dynamics from the decay data. Please note that the commonly used magic-angle of 54.7° strictly applies only for excitation and detection in parallel beams of light. The NA of our detection optics is 0.25, where the magic angle is still very close to the value for parallel beams. For excitation and detection at higher NA the magic angle decreases and approaches 45° for NA = 1 [19].

11.3.3 Alignment

The initial setup for the experiment consists of aligning the excitation beam, the detection path and in locating the start, stop and centre of the channel. Alignment of the excitation and detection paths is analogous to that of a confocal microscope without the pinhole. A concentrated dye solution is placed on the xy-stage instead of the mixer to view the fluorescent light path. The dichroic also transmits a small amount of the excitation light, which is useful for alignment purposes.

The precise location of the mixer flow channel is accomplished by placing an approximately 10-fold higher concentration of a fluorescent small molecule, appropriate for the experiment excitation and emission wavelengths, in the mixer channel (e.g., N-acetyl-tryptophanamide for tryptophan based experiments). The channel is point-scanned (at ~5 µm resolution) perpendicular to the flow direction at approximately 1 mm intervals along the flow direction to locate the centre of the channel along the 30 mm flow path. A linear least squares fit to the measured centre positions yields the slope and intercept that allows calculation of a lookup table. With a lookup table in hand, an additional point-scan is recorded parallel to the flow channel along the centre. The mid-point of the rise of the signal provides the location of the start of the channel and provides an estimate of the zero-time of the experiment. Because the beam focus diameter is smaller than the channel width, the intensity variation along the flow channel is small, approximately 5 %, usually caused by the tolerance in thickness of the quartz plates used to construct the mixer. This intensity profile is recorded with each experiment and used to normalize the TCSPC traces at each point along the channel. The mixer is brought into focus in the z-direction by repeated scans perpendicular to the direction of flow while optimizing the signal intensity.

11.3.4 Data Acquisition Protocol

In a typical experiment the sample and control will be placed in separate sample loops. The blank solution (e.g., matching the sample in every respect except without protein) is placed in the pump pushing out the contents of the sample loop. The other pump contains the dilution buffer. The flow rate is on the order of 2–6 mL/min total, although efforts are underway to reduce this volume to below 1 mL/min. Control of the pumps and valves is under computer control.

A typical experiment uses 10 mL sample loops giving a data collection time of approximately 15–20 min for the sample. As the sample flows, a scan is made along the flow direction every minute, consisting of 60–100 points. Although the TCSPC cards can be operated in “continuous-flow” mode, recording photons without any gap, the double time mode tends to be more commonly employed. In the latter mode, at each step along the channel the xy-stage is positioned and TCSPC acquisition for approximately 0.5 or 1 s is individually triggered, keeping the stage stationary during acquisition. The duty cycle is therefore ~50–75 %.

Count rates for these experiments tend to be in the 5 × 104 to <1×105 cps per detection channel. For an excitation pulse frequency of 3.8 MHz used in our experiments this is well within the range where pile-up is not an issue [2, 3]. Higher count rates may be available with more efficient detectors [4]. Even then, adverse pulse pile-up effects [11] can be avoided by using higher excitation pulse frequency, and by using multiple detectors and parallel TCSPC channels [2, 3].

Data acquisition consists of recording three separate measurements: a blank, sample and control. Each data set consists of approximately 15–20 scans with each scan comprising ~60-100 TCSPC traces with 4096 points in the excited state decay. The experiment is visualized by examining the integrals of the decays (Fig. 11.8).

Fig. 11.8
figure 8

Visualization of the experiment via the total intensity of each scan. The blue bars correspond to scans where the control is flowing through the mixer. The magenta bars correspond to the protein sample, the white bars are the blanks and the grey bars correspond to the traces where the sample flow was started or stopped and the entire channel has not reached flow-equilibrium

Traces which overlay are summed. The control scans (e.g., NATA) are summed along the TCSPC axis since only the integrated intensity is used in the normalization. A corrected data set for the sample is therefore obtained according to the following relationship:

$$ I_{corr} (t_{kin} ,\text{ }t_{TCSPC} ) = \frac{{I(t_{kin} ,\,t_{TCSPC} ) - Blank(t_{kin} ,\,t_{TCSPC} )}}{{\sum\nolimits_{{t_{TCSPC} }} {\left( {NATA(t_{kin} ,\,t_{TCSPC} ) - Blank(t_{kin} ,\,t_{TCSPC} )} \right)} }}. $$
(11.1)

In the above expression the kinetic time axis (corresponding to distance along the flow direction in the channel) is denoted by t kin and the TCSPC time axis is denoted by t TCSPC . The former is in microseconds and the latter in ns. An analysis program with a graphical interface allows the user to analyze a data set in a few minutes, providing rapid feedback on the experiment. A representative data set, collected for horse heart cytochrome c is shown in Fig. 11.9.

Fig. 11.9
figure 9

Representative continuous-flow TCSPC data. Continuous-flow refolding of horse heart cytochrome c is shown over the 30–140 μs time range, where the collapse transition takes place. Reprinted from [32], Copyright (2014), with permission from Elsevier

With binning of several frames, the counts can be on par with what is necessary for distance distribution modelling and maximum entropy analysis, as is discussed further in this chapter.

11.3.5 Point Scanning TCSPC Versus CCD Based Steady-State Acquisition

The point scanning protocol used in these experiments has several pros and cons relative to the CCD based detection that is used by some groups [60]. Other than the advantage of TCSPC based acquisition, the point scanning allows full use of the incident beam intensity, which is very helpful in cases where the experiment is not limited by counting rate and is excitation flux limited. The CCD approach can be advantageous when excitation flux is not limiting. The CCD based approach provides the full kinetics simultaneously for all positions along the channel. The intensity variation along the channel tends to be larger for the CCD based approach but appears to not limit the data quality. The choice of using a CCD based approach or point scanning TCSPC depends on whether the fluorescence decay parameters are desired and the excitation flux available.

11.3.6 Alternative Approaches

Scanning along the flow channel can be avoided by using the multi-channel TCSPC architecture described in Chap. 1, Sect. 1.4.1. The entire flow channel would be illuminated simultaneously, an image of it projected on a multi-anode PMT, and the signals detected by the individual channels of the multi-anode PMT recorded simultaneously. The approach would increase the efficiency of the measurement enormously, and thus decrease the acquisition time and the amount of sample fluid needed for the experiment. A possible problem is that the number of data points along the channel would be limited by the number of PMT channels, typically 16. To spread the 16 data points reasonably over the protein folding time axis the flow speed must be adjusted to the expected folding time. An instrument using the multi-channel PMT technique has not been described yet.

Although the scope of this chapter is on the application of TCSPC, it is worth noting that other recording techniques can also be used for obtaining time-resolved fluorescence decays along a microfluidic microsecond mixer. One option is to use a fast digital oscilloscope (13 GHz bandwidth) in combination with an MCP-PMT, as implemented by Haas and coworkers [29]. An advantage of this approach is that multiple photons can be collected in a single detection channel per laser pulse. This is useful for experiments utilizing low-repetition rate lasers [55]. However, there is no need to use an extremely low repetition rate. For high repetition-rate lasers the problem of detecting multiple photons per laser pulse does not exist. The efficiency of analog recording is lower than for TCSPC, especially if the system is operated at low photon rates. The time resolution is limited by the single-electron response of the detector, not by the transit-time-spread as in the case of TCSPC, see Chap. 1, Fig. 1.4. It is about 300 ps for an MCP PMT, and 1–2 ns for a conventional PMT, compared to 30 and 200–300 ps in the case of TCSPC. Moreover, the current output of an MCP PMT tends to become a limiting factor: If several photons per excitation pulse are detected channel saturation almost certainly causes nonlinearity in the detected decay curves.

Lifetime-based FRET studies using a microsecond mixer were also performed using a streak camera by the Winkler group [35]. The picosecond streak camera allows the entire mixer to be imaged while providing excellent resolution along the excited state fluorescence decay (Fig. 11.10).

Fig. 11.10
figure 10

Schematic of streak camera based fluorescence lifetime measurements. Copyright—2006 by The National Academy of Sciences of the USA [35]

Their study provided high signal-to-noise picosecond resolved excited state decays at 25 points in the folding of a cytochrome c’ four-helix bundle. The dynamic range of the camera allows the full excitation pulse (from an amplified laser) to be used. Other than the cost of the camera, the only limitation for this approach appears to be the coupling of the light into a fibre optic array bundle, which appears to have limited the earliest observation point. It seems likely that these limitations may be readily overcome with more recent microfluidic mixer designs.

11.4 Data Analysis

The continuous-flow TCSPC experiment presents the user with a rich data set, consisting of ~40 scans with ~100 kinetic time points and 4096 TCSPC time channels in each of the detection channels. One of the challenges is to rapidly evaluate the data set to gauge whether the experimental time window and signal-to-noise are sufficient and determine if a kinetic process is being observed. For time-resolved FRET and anisotropy experiments, this is done in several stages of analysis with each stage increasing in complexity and rigor. A brief overview of some of the analysis approaches is presented, with an emphasis on FRET and anisotropy data sets.

11.4.1 First Moment

The first moment of the histogram of the photon arrival times (often referred to as the “centre of mass”) provides a quick overview of a data set. The first moment is known to a relative precision of √N/N and is an easy to calculate metric that is also model independent and relatively insensitive to the number of photons. The first moment is calculated according to:

$$ \langle \tau \rangle = \frac{{\sum\nolimits_{i} {I_{i} (t_{i} - t_{o} )} }}{{\sum\nolimits_{i} {I_{i} } }} $$
(11.2)

where I i is the photon counts in the ith TCSPC channel, t i is the TAC time of the ith channel and t o is the excitation pulse arrival TAC time. This is mathematically equivalent to a quantum yield weighted average lifetime:

$$ \langle \tau \rangle = \frac{{\sum\nolimits_{i} {\alpha_{i} \tau_{i}^{2} } }}{{\sum\nolimits_{i} {\alpha_{i} \tau_{i} } }} $$
(11.3)

where α i is the amplitude of the ith exponential phase with relaxation time constant τ i .

One potential drawback to the use of the first moment as a metric for kinetic analysis is that it is not rigorously proportional to mole fraction (concentration). Therefore, the time constant(s) of the kinetics can be biased to a lower or higher value depending on the relative quantum yields of the reactants. A workaround is to include the total intensity data I(t kin ), which has the quantum yield information, and fit it globally with the first moment. Assuming that the amplitudes in (11.3) are linearly proportional to concentration, α i  = q i c i , for specie, i, the first moment at the reaction time, t kin , can be written as:

$$ \langle \tau \rangle (t_{kin}) = \frac{{\sum\nolimits_{i} {q_{i} c_{i} (t_{kin} )\tau_{i}^{2} } }}{{\sum\nolimits_{i} {q_{i} c_{i} (t_{kin} )\tau_{i} } }} $$
(11.4)

and the total intensity as:

$$ I(t_{kin} ) = \sum\limits_{i} {q_{i} c_{i} (t_{kin} )\tau_{i} } . $$
(11.5)

For a two-state kinetic reaction (i = 1, 2),

$$ \begin{aligned} c_{1} (t_{kin} ) & = c_{1} (0)e^{ - k \cdot t} \\ c_{2} (t_{kin} ) & = c_{2} (0)(1 - e^{ - k \cdot t} ) \\ \end{aligned} $$
(11.6)

where k is the rate constant of the reaction, q i are the relative quantum yields, and c i , are the concentrations. The parameters q i and τ i are globally optimized simultaneously for the two kinetic traces. The approach can be generalized to an arbitrary kinetic mechanism.

This analysis of the first moment is analogous to that proposed previously for the analysis of anisotropy kinetic data [48]. It is worth noting that if the quantum yields are different for the two states the first moment and total intensity will not track each other. For the case where anisotropy and lifetime are collected, the global analysis would also include the steady-state anisotropy data:

$$ r\text{(}t_{kin} \text{) = }\frac{{\sum\nolimits_{i} {q_{i} c_{i} (t_{kin} )r_{i} } }}{{\sum\nolimits_{i} {q_{i} c_{i} (t_{kin} )} }} $$
(11.7)

where r i are the steady state anisotropy values of the states in the kinetic mechanism obtained from the vertical and horizontally polarized data in the customary manner [48]. Similar, to the first moment, the parameters q i and r i are globally optimized. The first moment data calculated from the total intensity (I = V + 2GH) can also be included in the fitting to improve parameter estimation and kinetic model discrimination.

For the case where a FRET efficiency is sought, the average FRET efficiency, E, can be approximated (with the above caveats) from the average lifetimes of the donor-only, \( \langle\tau_{D}\rangle\), donor-acceptor, \( \langle\tau_{DA}\rangle\), and \( \langle\tau_{DA}\rangle\) App , lifetimes:

$$ \begin{aligned} \langle\tau_{DA}\rangle_{App} & = x \cdot\langle\tau_{DA}\rangle \; + \;(1 - x) \cdot \langle\tau_{D}\rangle \\ E & = 1 - \frac{{\langle\tau_{DA}\rangle}}{{\langle\tau_{D} \rangle}} = 1 - \frac{{\langle\tau_{DA}\rangle_{App} }}{{x \cdot \langle \tau_{D} \rangle}}\,+\,\frac{1 -x}{x} \\ \end{aligned}$$
(11.8)

where x represents the acceptor labelling efficiency. If x is less than unity, the measured lifetime of the donor in the presence of the acceptor, \( \langle\tau_{DA}\rangle\) App , may be related to the true value, \( \langle\tau_{DA}\rangle\), via the fractional labelling efficiency, x. An example of a first moment analysis for FRET data on a TIM barrel protein is shown in Fig. 11.11.

Fig. 11.11
figure 11

First moment analysis of FRET data from a TIM barrel protein. The first moment of the donor-only (red) and donor-acceptor (green) labelled protein is shown in the top panel. The labelling efficiency corrected FRET efficiency (blue) is shown in the bottom panel. Copyright—2008 by The National Academy of Sciences of the USA [69]

Use of the first moment data in practice requires subtraction of the pulse arrival time and consistency in the relative time range (relative the pulse arrival time) used for the calculation. It should also be kept in mind that using the steady-state anisotropy and first moments provides an efficient method of determining the number of kinetic steps but comes at the expense of not fully utilizing the information available in the time-resolved FRET and anisotropy data. This information can be obtained alongside the kinetic analysis via a full global analysis, as discussed below.

11.4.2 Global Analysis

Global analysis [6, 7, 9] is a more rigorous approach that involves direct fitting of the raw data. A global analysis allows more stringent testing of a model (e.g., kinetic model and/or distance distribution model) because the linkage of parameters over data sets reduces the number of parameters. The continuous-flow TCSPC data obtained with microfluidic mixers is ideal for this type of analysis although, depending on the extent of the global analysis, can be complicated to set up. In practice, the most robust parameter estimates are obtained if the full raw data set can be parameterized using a kinetic model for the continuous-flow time axis and using a distance distribution or anisotropy decay model for the TCSPC time axis. Below, an overview of a global analysis strategy of time-resolved FRET data is presented.

In one type of global analysis of tr-FRET data, decays from each kinetic time point (e.g., decays corresponding to donor-only and donor-acceptor samples) are globally analyzed using a distance distribution function. In practice, the global analysis is often performed stepwise: the donor-only decay is fit to a sum of exponentials, yielding the fractional amplitudes (α i ), total amplitude (I d (0)) and decay constants \(\text{(}{k}_{d_{i}} \text{)}\).

$$ I_{d} (t_{TCSPC} ) = I_{d} (0) \cdot \sum\limits_{i}^{N} {\alpha_{i} e^{{ - k_{{d_{i} }} \cdot t_{TCSPC} }} \; + \;const} $$
(11.9)

These parameters are then held fixed in the next step of the analysis, performed on the donor-acceptor decay, in which only the normalized distribution function parameters, p(k ET ), are allowed to vary:

$$ I_{da} (t_{TCSPC} ) = \int\limits_{0}^{\infty } {} \sum\limits_{i}^{N} {\alpha_{i} e^{{ - k_{{d_{i} }} \cdot t_{TCSPC} }} \cdot p(k_{ET} ) \cdot e^{{ - k_{ET} \cdot t_{TCSPC} }} \cdot dk_{ET} + const.} $$
(11.10)

In the above expressions, α i is the fractional amplitude of the ith decay component with rate k di , k ET is the energy transfer rate and p(k ET ) is the energy transfer rate distribution. An assumption in this analysis is that, if the donor has multiple sub-populations with different donor decay rates (e.g. tryptophan), they are assumed to have the same energy-transfer rate distribution. (This assumption is not made in the 2D maximum entropy analysis discussed further below). The energy transfer rate distribution, p(k ET ), is related to the pair-distance distribution, p(r), according to the Förster equation:

$$ r^{6} = R_{0}^{6} \cdot \left( {\frac{{k_{Dave} }}{{k_{ET} }}} \right) $$
(11.11)

where k Dave is the inverse of the average lifetime of the donor in the absence of the acceptor. The distribution p(k ET ) is often described by a sum of Gaussian distribution functions:

$$ p(r) = \sum\limits_{i} {\frac{{a_{i} }}{{\sigma \sqrt {2\uppi } }}e^{{ - (r - \omega_{i} )^{2} /2\sigma_{i}^{2} }} } $$
(11.12)

where a i is the amplitude, ω i the centre and σ i the width of the ith Gaussian. The width of the Gaussian can be converted to the full-width at half-maximum by FWHM = σ/2.354. The adjustable parameters in the non-linear least squares optimization are the amplitude(s), α i , width, σ i , and centre, ω i of the Gaussians and the offset (const.) [69]. Distribution functions other than Gaussians may be used. For example, if the distance distribution function of one of the components is expected to correspond to a random-coil, one of the Gaussian functions may be replaced by a worm-like chain model [66]:

$$ p(r) = \frac{{4\uppi aNr^{2} }}{{l_{c}^{2} \left( {1 - \left( {\frac{r}{{l_{c} }}} \right)^{2} } \right)^{9/2} }}\exp \left( {\frac{{ - 3l_{c} }}{{4l_{p} \left( {1 - \left( {\frac{r}{{l_{c} }}} \right)^{2} } \right)}}} \right) $$
(11.13)

where one fewer parameter is required (the amplitude, a, and the persistence length, l p ) to describe the functional form. The contour length, l c , is obtained from the number of amino acids, Nres (3.4* Nres) and is held fixed.

An extension of this analysis parameterizes both the time axis of the reaction kinetics and the TCSPC time axis in a combined analysis. This was performed, for example, for an associative anisotropy model analysis of the TCSPC axis together with a kinetic folding mechanism [9]. The analysis allows deconvolution of physical parameters (e.g., rotational correlation times, lifetimes, fundamental anisotropy etc.) for individual species in the reaction even if the species is not fully populated at 100 %. A similar approach can be applied to tr-FRET data as well. For example, for a kinetic reaction, where A1 ⇔ A2 ⇔ … An, the distribution function at any given time is the population weighted linear combination of the distribution functions, p i (r), of the contributing species, Ai, i = 1, n:

$$ p(r,\,t_{kin} ) = \sum\limits_{i} {c_{i} (t_{kin} ) \cdot p_{i} (r)} $$
(11.14)

The populations (concentrations) can be obtained from the solution to the general rate equation:

$$ \frac{{d\vec{C}(t_{kin} )}}{dt} = \hat{S} \cdot \vec{C}(t_{kin} ) = - \sum\limits_{j = 1,\,n;\,j \ne 1} {k_{ij} \cdot c_{i} (t_{kin} ) \,+ } \sum\limits_{j = 1,\,n;\,j \ne 1} {k_{ji} \cdot c_{j} (t_{kin} )} $$
(11.15)

where C is a vector of concentrations, S is the system matrix and t kin denotes the time along the microsecond reaction kinetics axis (to distinguish from the TCSPC axis). The rate k ij denotes the microscopic rate from species i to species j. In practice, signal-to-noise limits the number of distributions that can be reliably delineated.

11.4.3 Global Analysis with Diffusion of Donor and Acceptor

A further refinement of the global analysis described above takes into account the relative intramolecular diffusion of the donor and acceptor chromophores during the lifetime of the donor excited state [5]. For unfolded proteins and peptides, values of the diffusion constant have ranged from 5 to 15 Å2/ns [25, 39]. Consideration of diffusion, therefore, becomes significant for highly flexible macromolecules probed with a donor having a long lifetime (>4 ns) and a short Förster distance. A straightforward implementation of diffusion begins with the distance distribution function, p(r, t kin , 0), and requires solving the diffusion equation for each of the times along the TCSPC time axis to obtain p(r, t kin , t TCSPC ). If the diffusion coefficient is a constant, then, the distribution function at TCSPC time, t TCSPC , is obtained according to a one-dimensional diffusion equation in the absence of a potential,

$$ \frac{{{\partial p(r,\,t_{kin} ,\,t_{TCSPC} )} }} {\partial t}= D \cdot \frac{{\partial^{2} }}{{\partial r^{2} }}p(r,\,t_{kin} ,\,t_{TCSPC} ) $$
(11.16)

that can be readily solved using finite difference numerical methods.

11.4.4 SVD: Singular Value Decomposition

SVD is a valuable data reduction strategy that provides a quick model-independent assessment of the number of distinguishable lifetime decays or “spectra” in the data. The SVD algorithm reduces the data into the minimum number of orthonormal basis vectors along both, the kinetic time axis, t kin , and the TCSPC time axis, t TCSPC . This is expressed as:

$$ A = UWV^{T} $$
(11.17)

where the columns of U represent the kinetic basis vectors, the columns of V are the TCSPC axis basis vectors and W is diagonal matrix containing the singular values. In practice, only the first few basis vectors of U and V are significant, with the remaining vectors being essentially random noise. SVD can, therefore, be used to filter data while knowing exactly what is being thrown out. The sum of the outer product of the first few significant vectors represents a least-squares approximation to the full data. The number of significant basis vectors also gives the user an indication of the minimum number of distinct species contributing to the data. In the example shown in Fig. 11.12, the SVD analysis suggested that only two dominant intermediates were present during the course of the collapse of cytochrome c, arguing for a concerted process [32].

Fig. 11.12
figure 12

SVD analysis of continuous-flow TCSPC data. The first few columns of the U and V matrices are shown. The remaining columns are essentially noise, similar to the third column. Data correspond to refolding of horse heart cytochrome c shown in Fig. 11.9. Reprinted from [32], Copyright (2014), with permission from Elsevier

SVD can also be used as an efficient route to a full global analysis. The parameters obtained from a global analysis of the SVD vectors (either the U or the V vectors) can be propagated using the coefficients of the other matrices to reconstruct a full parameter set. Application of SVD to trFRET and time-resolved anisotropy data sets is complicated by the requirements that two data sets need to be analyzed globally and examples of this have not appeared in the literature.

11.4.5 Maximum Entropy

Another model independent analysis approach suitable for fluorescence lifetime, anisotropy and FRET is the maximum entropy method (MEM). The MEM describes a data set (e.g., fluorescence excited state decay) with a distribution of rates, choosing the widest and smoothest distribution that is consistent with the data. MEM starts out with a flat a priori distribution and the amplitude of a given rate deviates from this a priori distribution only as warranted by the data. The algorithm simultaneously minimizes χ2 and maximizes the entropy of the distribution, the balance between the two determined by the Lagrange multiplier. An attractive feature of MEM is that it provides the most objective assessment of the true information content of the data [12, 45, 61]. MEM has been widely used in the TCSPC literature for both static and kinetic data [12, 35, 45]. For trFRET data one of the potential applications of MEM is to provide a model independent recovery of the donor-acceptor distance distribution function. Additionally, MEM offers the possibility of avoiding the assumption of the same distance distribution function for each sub-population of the donor excited state.

The ability of MEM to accurately recover the distribution of decay rates has been well established [38]. Extending MEM to time-resolved FRET involves analysis of both the donor and the donor-acceptor excited state decays. This approach parallels the MEM analysis of time-resolved anisotropy [23]. A concern about degeneracy in the 2D MEM distribution was raised; however, the MEM algorithm proposed by Kumar et al. [38] appears to not be susceptible to this, rendering the approach promising. The donor excited state decay is described according to (11.18):

$$ \begin{aligned} I_{d} (t_{TCSPC} ) =& \int\limits_{0}^{\infty }{P(K_{d} ) \cdot e^{{ - (k_{d} ) \cdot t_{TCSPC} }} dk_{d} } \hfill\\ k_{d}\,\equiv\,& 1/\tau_{d} \hfill \\ \end{aligned} $$
(11.18)

where k d is defined, for convenience, as the inverse of the donor lifetime and p(k d ) is the distribution of donor excited state decay rates. For the donor-acceptor labelled system the excited state decay is given as

$$ I_{da} (t_{TCSPC} ) = \int\limits_{0}^{\infty } {\int\limits_{0}^{\infty } {P(k_{d} ,\,k_{ET} ) \cdot e^{{ - (k_{d} + k_{ET} ) \cdot t_{TCSPC} }} dk_{d} } } dk_{ET} $$
(11.19)

where k ET is the energy transfer rate given by the Förster equation. The two-dimensional distribution p(k d , k ET) describes the distribution of donor rates and energy-transfer rates. The distribution p(k d , k ET) is usually approximated in one-dimensional analyses as separate one-dimensional distributions giving rise to a “non-associative” model:

$$ I_{da} (t_{TCSPC} ) = \int\limits_{0}^{\infty } {P(k_{d} ) e^{{ - k_{d} \cdot t_{TCSPC} }} dk_{d} } \cdot \int\limits_{0}^{\infty } {P(k_{ET} )e^{{ - k_{d} \cdot t_{TCSPC} }} dk_{ET} } $$
(11.20)

This assumes that every subpopulation responsible for a different donor rate has the same energy-transfer rate distribution. The pair distance distribution is then calculated from the rate distribution according to the Förster equation [39]. Although this approximation results in significant computational advantages, the underlying assumption is not generally applicable. For example, a partially folded state and the unfolded state may be equally populated and the donor may exhibit different excited state lifetimes and a different donor-acceptor distance in each state. Scenarios such as this become especially likely at early times in folding reactions where marginally populated partially folded intermediates may interconvert rapidly. Discriminating these sub-populations is one of the goals of continuous-flow FRET studies. An analysis employing this 2D MEM approach was applied to the early steps in the folding of a TIM barrel protein (Fig. 11.13).

Fig. 11.13
figure 13

2D MEM analysis at various time points during refolding of a TIM barrel protein. Used with permission [69]. Copyright—2008 by The National Academy of Sciences of the USA

Although the FRET efficiencies were low and not ideal, the algorithm was able to discriminate two sub-populations. The main limitation of the 2D MEM approach is that the accuracy at low FRET efficiencies (e.g., 20 % and lower) is significantly reduced because of insufficient information in the data.

One workaround to this limitation of the 2D MEM analysis [69] is to include an additional experimental axis or dimension such as the acceptor excited state decay. In preliminary tests this was found to yield significantly more reliable distribution functions in the low FRET efficiency tails of distributions. The computational cost of including the additional dimension scales non-linearly although the 3D MEM approach is within reach of current available computational power. The only potential drawback to MEM approaches is that these do not currently consider the diffusion of donor and acceptor chromophores during the excited state lifetime of the donor. However, for sufficiently large Förster distances and short lifetimes, this may still provide significant insights into the sub-populations present during a microsecond kinetic reaction.

11.5 Future Prospects

Significant advances in microfluidic technology have taken place over the past 5 years. Sample consumption, time resolution and temporal dynamic range have improved and the availability of simulation CFD software and microfabrication tools suggests that these advances will continue. These devices and TCSPC detection are used by a relatively small number of laboratories but the robustness of the devices and decreasing cost of TCSPC instrumentation point to increased adoption by the biochemistry community. These tools are likely to play an increasingly important role in FRET studies delineating enzyme mechanisms, protein-protein/RNA/DNA interactions and in enzymatic assays.