Optimizing the automatic selection of spike detection thresholds using a multiple of the noise level

Rizk, Michael; Wolf, Patrick D.

doi:10.1007/s11517-009-0451-2

Optimizing the automatic selection of spike detection thresholds using a multiple of the noise level

Original Article
Published: 10 February 2009

Volume 47, pages 955–966, (2009)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Medical & Biological Engineering & Computing Aims and scope Submit manuscript

Optimizing the automatic selection of spike detection thresholds using a multiple of the noise level

Download PDF

Michael Rizk¹ &
Patrick D. Wolf¹

487 Accesses
20 Citations
3 Altmetric
Explore all metrics

Abstract

Thresholding is an often-used method of spike detection for implantable neural signal processors due to its computational simplicity. A means for automatically selecting the threshold is desirable, especially for high channel count data acquisition systems. Estimating the noise level and setting the threshold to a multiple of this level is a computationally simple means of automatically selecting a threshold. We present an analysis of this method as it is commonly applied to neural waveforms. Four different operators were used to estimate the noise level in neural waveforms and set thresholds for spike detection. An optimal multiplier was identified for each noise measure using a metric appropriate for a brain–machine interface application. The commonly used root-mean-square operator was found to be least advantageous for setting the threshold. Investigators using this form of automatic threshold selection or developing new unsupervised methods can benefit from the optimization framework presented here.

A normalized template matching method for improving spike detection in extracellular voltage recordings

Article Open access 19 August 2019

Bayes optimal template matching for spike sorting – combining fisher discriminant analysis with optimal filtering

Article Open access 05 February 2015

A Template-Based Sequential Algorithm for Online Clustering of Spikes in Extracellular Recordings

Article 08 January 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

High channel count fully implantable neural data acquisition systems must perform data reduction before transmitting recorded neural data over a wireless link [17, 29, 33, 36]. However, these systems have significant size and power constraints and are limited in the amount and complexity of processing that they can perform [16]. In addition, timing constraints must be imposed on systems that are used in a real-time application such as a brain–machine interface (BMI). Consequently, the use of amplitude thresholding for spike detection is especially attractive for these systems due to its relative simplicity. Voltage thresholding has been the method of choice for spike detection in recently developed implantable neural signal processors [17, 29, 33, 36].

We have developed a 96-channel neural signal processor as part of an implantable BMI system [33]. The processor automatically sets thresholds for each channel. The threshold for a given channel is set to a multiple of an estimate of the noise level on that channel. We wanted to know the optimal multiplier and the optimal noise measure to use in this system. Several groups have used a multiple of some measure of the noise level as a threshold; however, a few different noise measures have been used. The most commonly used measure of noise is the root-mean-square (RMS) of the data over a given window, essentially the standard deviation of the data [9, 11, 14, 19, 29, 31, 35, 41]. Other noise measures that have been used are the mean deviation [33, 37] and the value of the data located at a given percentile within a ranked set of data [15, 32, 40]. It must be noted that all of the noise measures mentioned above do not measure only the noise. In reality, they are calculated from the entire neural signal. The samples that fall within a given window may simply be noise but they may also be spikes. So, the distribution of the samples is dependent not only on the distribution of neural noise but also on the size, shape, and number of spikes that fall within the window.

In order to effectively perform automatic threshold selection using a multiple of a noise measure, we would like to select a single value of the multiplier and trust that it will work reasonably well for all channels—even across channels that have different firing rates and SNRs. So, we would like to know whether or not there is a relatively consistent optimal multiplier for a given noise measure. A multiplier that is near optimal across waveforms with varying characteristics would be suitable for all channels in a high channel count system such as ours.

This paper investigates the problem of automatically selecting a threshold for spike detection. Thresholds computed as multiples of four different noise measures are compared as SNR, firing rate, and window size are varied. In addition, this study investigates the selection of an optimal multiplier for each of these noise measures in neural waveforms that contain spikes from either one or two neurons. The work presented here is motivated by the very real problem of selecting a noise measure and a multiplier for use in the 96-channel neural signal processor described in [33]. While the results presented in this paper are specific to our particular method of spike detection, the concepts and optimization framework used to perform the analysis are applicable more generally.

2 Methods

2.1 Simulated waveforms

We compared the performance of thresholds computed as multiples of four different noise measures on simulated realistic neural signal waveforms. In order to simulate waveforms, we first constructed two libraries, one containing noise waveforms and the other containing spike templates. The noise library contained 17 5-s segments each from different channels of cortical microelectrode recordings from a rat. Each of these segments had no visible spikes and was thus classified as noise. The spike template library consisted of ten spike templates from rat and monkey data. Each template was the average of similarly shaped spikes extracted from a single channel. The templates are shown in Fig. 1. All recordings were made using the neural data acquisition system described in [33]. Briefly, this system had a sampling rate of 31.25 kHz and a passband of 500 Hz to 6 kHz. Each data point was an eight-bit sample. For each channel, the system computed the mean value on that channel and subtracted the mean from incoming samples. This resulted in essentially zero-mean data.

For each simulated waveform, a 2-s span from one of the segments contained in the noise library was randomly selected. A template was also randomly selected to be used as the spike waveform to be inserted at locations within the 2-s noise waveform. This method of simulating neural signals has been used widely throughout the spike detection literature [2, 4, 5, 7, 24].

Each simulated waveform was described by two parameters—SNR and firing rate. The SNR was used to scale the template. We define SNR as

$$ {\rm SNR}={\frac{\hbox{peak\,amplitude\,of\,scaled\,template}} {\hbox{standard deviation of 2-s noise waveform}}} $$

(see Sect. 4.7.1 for more on the definition of the SNR). Spike times were selected to determine where in the noise waveform to place the spike waveforms. For a given firing rate, the appropriate number of spikes was placed into the waveform (for example, for a firing rate of 20 Hz there would be 40 spikes in a 2-s waveform). Spike times were randomly selected within the 2-s period; a refractory period of 1.6 ms was enforced. For each selected spike time, the spike waveform was added to the noise waveform (resulting in a noisy version of the “clean” spike) with the portion of the spike waveform with the highest amplitude placed at the selected spike time. Thus, the simulation resulted in realistic neural waveforms with known SNRs and spike times.

Rather than simulating an entirely new waveform (with a new noise waveform, a new spike template, and completely new spike times) for each combination of SNR and firing rate, sets of waveforms were simulated in a manner that limited all variation due to factors other than changing SNRs or firing rates. Each set of waveforms was generated in the following manner:

1.
Randomly select a 2-s noise waveform.
2.
Randomly select a spike template.
3.
For the lowest firing rate of interest that has not yet been simulated, generate spike times. If spike times have already been generated for lower firing rates, start with those and generate more until the number of spikes corresponding to the current firing rate has been generated.
4.
For each SNR of interest, use the noise waveform, spike template, and spike times to generate a full waveform.
5.
If simulations have not yet been run for the highest firing rate of interest, go back to step 3.

The procedure described above generates neural waveforms that can be thought of as containing spikes from a single neuron. Commonly, signals from multiple neurons (or “units”) are observed on a single extracellularly recorded waveform. In order to simulate this circumstance, we generated two-unit waveforms. This was done by first simulating a single-unit waveform as described above. Then a second unit, with its own SNR and firing rate, was added to the waveform. The spike times for the second unit were selected independently of the spike times for the first unit. This created the potential for overlapping spikes from different units, a phenomenon often observed in real neural recordings. A template was scaled according to the SNR of the second unit and spikes were added into the waveform at the selected times regardless of if they overlapped with spikes from the first unit.

2.2 Noise measures

For each simulated waveform, a sliding window was passed across the waveform. The sliding window was advanced one point at a time along the waveform. Each unique position of the sliding window will be referred to as an epoch. For each epoch, a noise measurement was made using four different operators—MD, RMS, P84, and PA68. These noise metrics are all based on measures that have been used previously in the neural spike detection literature for setting thresholds (see Sect. 1). MD took the mean deviation of the points in the window. The simulated waveforms were essentially zero-mean due to the fact that the noise waveforms were nearly zero-mean (see Sect. 2.1), the biphasic nature of the spike templates (allowing the spike templates to be close to zero-mean), and the fact that the spikes make up only a relatively small percentage of the overall waveform. Consequently, to compute MD we simply took the mean of the absolute value of the points in the window. RMS took the root-mean-square of all of the values in the window. P84 found the 84th percentile of the data in the window. This operator was based on a spike detection technique that has been implemented in a mixed-signal neural signal processor [15, 40]. PA68 found the 68th percentile of the absolute value of the data in the window. This operator was chosen because it combines characteristics of the MD and P84 operators (taking the absolute value and finding a percentile, respectively). For zero-mean, normally distributed data, RMS, P84, and PA68 should approximate the standard deviation of the data. MD should give $\sigma\sqrt{{\frac{2}{\pi}}},$ where σ is the standard deviation of the data [8]. The presence of high-amplitude spikes, however, causes neural data to be non-Gaussian [39]. Thus, the RMS, P84, and PA68 operators will not necessarily give identical results.

The MD and RMS operators were defined as follows:

$$ \begin{aligned} {\rm MD}({\bf X})&={\frac{1}{N}}\sum_{i=1}^N |x_i| \\ {\rm RMS}({\bf X})&=\sqrt{{\frac{1}{N}}\sum_{i=1}^N x_i^2} \end{aligned} $$

where N is the window size; the vector X = {x ₁,...,x _N} contains the waveform samples that fall within the window; and |·| is the absolute value function. Percentiles for the P84 and PA68 operators were found using Matlab’s quantile function.

2.3 Detection of spikes

In order to perform spike detection, nonadaptive thresholds were applied to the absolute value of the data. This method of spike detection has been shown to be appropriate for computationally limited systems [28] and is used in the neural signal processor presented in [33]. Thresholds were computed as multiples of the noise measures. For each combination of noise measure and multiplier, a nonadaptive threshold was computed for each epoch and applied to the entire waveform. Thus, we applied tens of thousands of individual thresholds (one for each noise measure/multiplier pair for each epoch; each 2 s waveform has 62,500 data points) to each simulated waveform. A refractory period of 1.28 ms (40 samples) was imposed so that a single spike would not cause multiple detections. Detections that occurred within 0.48 ms (15 samples) of the true spike time were considered to be accurate detections. Detections falling outside of this time window were considered to be false positives.

2.3.1 Probability of detection and rate of false alarms

For each combination of noise measure and multiplier, we computed the thresholds, performed spike detection, and generated a pair of values to characterize the spike detection performance—the probability of detection (P _D) and the rate of false alarms (R _FA). The probability of detection is defined as

$$ P_{\rm D}={\frac{N_{\rm correct}}{\hbox{actual number of spikes in the waveform}}} $$

and the rate of false alarms is defined as

$$ R_{\rm FA}={\frac{\hbox{total\,number\,of\,detections}-N_{\rm correct}} {\hbox{temporal duration of waveform}}} $$

where N _correct is the number of correctly detected spikes. Note that while P _D is unitless, R _FA has units of false alarms per second. The values of P _D and R _FA are computed as averages across the performance using thresholds computed for each epoch (see Sect. 2.3). By using many different multipliers (and consequently many different thresholds), we were able to generate full receiver operating characteristic (ROC) curves. In this study, we used 31 multipliers ranging from 0 to 15 in steps of 0.5.

2.3.2 Optimality index

In order to determine the optimal multiplier for each noise measure, we devised an optimality index (OI). The optimality index is a function of the probability of detection and the rate of false alarms, so each point on the ROC curve has a corresponding optimality index. As mentioned earlier, one application in which amplitude thresholding is a suitable means of spike detection is a fully implantable neural data acquisition system. One concern in such systems is the limited bandwidth available for transmitting data recorded inside the body to an external device. In these systems, false alarms can have a true cost by increasing latencies and preventing true spikes from being transmitted [6, 33]. We formulated the optimality index using parameters suitable for this type of application.

Our formula for computing the optimality index is based on the cost function used in [28]. We define the optimality index as

$${\rm OI}=P_{\rm D}-w\times{\frac{(r \cdot P_{\rm D} + R_{\rm FA}) \cdot n \cdot b}{\rm BW}} $$

where P _D is the probability of detection, w a unitless weight, r the expected firing rate per channel, R _FA the rate of false alarms per channel, n the number of channels, b the number of bytes that will be transmitted for each detected spike, and BW the available bandwidth, or data rate, of the data acquisition system. The second term of the formula represents the fraction of the available bandwidth that is needed to transmit information about detections (whether true spikes or false alarms) multiplied by the cost, w, of using all of the available bandwidth. Throughout this study, we used 96 channels for n, 48 bytes per spike for b, and 96,000 bytes per second for BW. These values were taken from a 96-channel system that sends a channel number, timestamp, and extracted spike waveform for each spike detected and has a limited wireless data rate [33]. We also used a value of 50 spikes per channel per second for r and a value of 0.1 for w, based on values used in [28]. The value used for r is a reasonable average for a channel with one or two neurons. The value of w can be adjusted based on how costly false detections are deemed to be. This is discussed in Sect. 4.6.

In this study, we computed the optimality index separately for each SNR. This allowed us to find an optimal multiplier for each noise measure for each SNR.

2.3.3 Optimality score

We used the optimality score to find the optimal multiplier when all SNRs were taken into consideration together. For a given multiplier, the optimality score is a weighted average of the optimality indices at that multiplier and each of the SNRs.

$${\rm optimality}\;{\rm score}=\sum_{\rm SNR}w_{\rm SNR}\cdot {\rm OI}_{\rm SNR} $$

where w _SNR is the weight used for a given SNR and OI_SNR is the optimality index computed at that SNR. Table 1 presents the weights that we used. These weights correspond to the frequency with which waveforms of a given SNR were found in a sample real neural recording. The weights are based on the distribution of SNRs presented in figure 13 of [38]. That figure contains histograms that show the distribution of SNRs observed on electrodes implanted in the motor cortex for each of three monkeys.

Table 1 Weights used for each SNR in computing the optimality score

Full size table

It must be pointed out that SNR is defined differently in this paper and in [38]. In [38], SNR is defined as the peak-to-peak amplitude of the spikes divided by two times the standard deviation of the noise. The spike templates used in our study had, on average, peak-to-peak amplitudes that were approximately 1.5 times the size of their peak amplitudes. In order to account for this, we scaled the SNR values presented in [38] by 4/3 so that they would correspond to the SNR values used here. Once this was done, we used the frequency with which waveforms of a given SNR were found in the recordings from [38] to determine the weights used in the optimality score calculation.

3 Results

3.1 Single-unit waveforms

We simulated ten sets of waveforms. Each set had spikes from a single unit with SNRs ranging from 1 to 15 in steps of 1 and firing rates ranging from 5 to 50 Hz in steps of 5 Hz. Each set of waveforms was generated as described in Sect. 2.1. On each waveform, we computed all four noise measures using window sizes of 512, 1,024, 2,048, 4,096, and 8,192 samples and performed spike detection as described in Sect. 2.3.

For the ten waveforms with a given SNR (from 1 to 15) and firing rate 20 Hz, the probability of detection and rate of false alarms values were used to construct optimality index curves for each of the four noise measures. These are plotted in Fig. 2 as a function of the multiplier value. The plots are based on spike detection performance using a window size of 4,096 samples. The optimality indices were computed using overall probabilities of detection and rates of false alarms measured across ten waveforms with identical firing rates and SNRs. Curves are plotted for SNRs from 1 through 15. Figure 3a shows the maximum optimality index achieved for each noise measure at each SNR and Fig. 3b shows the optimality score for each multiplier for each noise measure. The multiplier that received the highest optimality score for each noise measure is listed in Table 2. Optimality scores were also calculated for the other combinations of firing rate and window size. Table 3 shows the maximum optimality score obtained for each noise measure for selected parameter combinations. Overall, for a given set of parameters, comparable maximum optimality scores were achieved by all of the noise measures for the single-unit waveforms.

Table 2 Comparison of optimal multipliers across the different noise measures

Full size table

Table 3 Maximum optimality score obtained for each operator for several combinations of firing rate and window size

Full size table

3.2 Two-unit waveforms

We simulated two-unit waveforms in which the first unit had an SNR of 8 and each unit had a firing rate of 20 Hz. For each set of waveforms, the SNR of the second unit was varied from 1 to 8 in steps of 1. Ten of these sets were simulated. Similar to Sect. 3.1, spike detection was performed and optimality indices and scores were computed. These results are shown in Fig. 4. Since the SNR of the first unit was held constant at a value of 8, all variations in SNR referred to in the plots and calculations correspond to the SNR of the second unit.

Similarly, we also simulated two-unit waveforms in which the first unit had an SNR of 12. The results for this case are shown in Fig. 5.

3.3 Application of thresholds to real neural data

The key difference between applying thresholds to simulated data and to real neural data is that the true spike times are known for the simulated data while the true spike times are not known for the real data. As a result, analyses evaluating the performance of the threshold (or any other spike detection technique) cannot be performed on real data as they can be on simulated data. Nevertheless, it is useful to visualize the application of a given spike detection technique to real neural data. We took rat neural data recorded using the neural data acquisition system mentioned in Sect. 2.1 and computed thresholds in the manner used to compute thresholds for the simulated data. For each combination of noise measure and multiplier, an individual threshold was computed for each epoch. Thus, for each noise measure/multiplier pair, a distribution of thresholds was generated for each neural waveform. Figure 6 presents two examples of neural data along with the distributions of thresholds computed for selected combinations of noise measures and multipliers. To reduce clutter, the figure only shows negative thresholds. When applying a threshold to the absolute value of data, however, a positive threshold of equal magnitude would also be applied. The waveform shown in Fig. 6a appeared to have spikes from a single unit. The waveform shown in Fig. 6b appeared to have spikes from two distinct units.

4 Discussion

4.1 Modified criteria for evaluating spike detectors

Spike detectors are generally evaluated and compared based on performance either with a fixed set of parameters (for example, using a particular threshold) or across a range of parameters (resulting, for example, in an ROC curve). For certain types of systems, the performance of a detector (generally expressed in terms of metrics such as probability of detection and rate of false alarms) should not be looked at in isolation—other factors must be taken into account as well. Specifically, for high channel count systems with limited resources, such as implantable BMIs, it should be possible to set the parameters to satisfactory values relatively easily. These parameters ideally should be automatically determined. The usefulness of a detector that can perform very well when its parameters are set optimally but will perform poorly otherwise is limited by how reliably the correct parameters can be selected. Another factor that must be taken into account is the complexity of implementing the spike detector [28]. Developers of implantable neural signal processors have been willing to accept suboptimal spike detection performance compared to other types of spike detectors in return for the simplicity of implementing a voltage amplitude thresholding spike detection scheme. These systems already have high channel counts and the number of channels will only continue to increase in the future. As a result, the problem of automatically selecting thresholds is an important one. Spike detectors for these systems should be evaluated not only on their achievable levels of performance, but also on their complexity and ability to perform well using parameters that are selected without supervision.

4.2 Threshold stability

Neural data is often thought of as consisting of large amplitude spikes superimposed on a noise cloud primarily representing background neural activity. Calculations that use a portion of the recorded data to estimate the noise level will be affected by spikes that happen to fall within the window over which the calculation is made. The level to which the calculation is affected depends on the firing rate and SNR of the spikes as well as the type of calculation.

A relatively straightforward way to automatically compute a threshold is to use a multiple of a noise measure as the threshold. In this case, having a relatively stable noise measure is desirable. Thresholds calculated based on the noise measure made during an initial monitoring period [33] ideally should not be significantly influenced by the location in time at which the monitoring period occurs (since that is essentially random).

4.3 Optimal multipliers for single-unit waveforms

Having to select a multiplier for each channel individually essentially defeats the purpose of trying to automatically set thresholds. If an individual is required to look at each channel in order to choose the multiplier, he might as well directly set the threshold for each channel. It is valuable to know the range over which multipliers are near optimal for waveforms with different firing rates and SNRs. This way a multiplier that can perform well across many channels can be selected. There can still be some design flexibility in picking the multiplier. A value toward the high end of the range would limit false detections and a value toward the low end would increase the probability of detection. At the same time, it is desirable to have a wide range of multipliers that are near optimal so that performance will not suffer greatly as long as a multiplier near the optimal multiplier is used. We used the optimality index described in Sect. 2.3.2 and computed in Sect. 3.1 to compare the optimal multipliers for the different noise measures.

From Fig. 2 it is clear that for a given noise measure, the optimality index can reach its peak at slightly different multiplier values for different SNRs. As a result, a single optimal multiplier cannot be identified. Nevertheless, across SNRs the optimal multiplier does generally fall within a small range. These ranges are different for the different noise measures (Table 2). In order to pick a single multiplier to be used across channels with a variety of SNRs, an optimality score that is computed using weights corresponding to the frequency with which a given SNR occurs is appropriate.

While the optimality index curves shown in Fig. 2 look relatively similar for the different noise measures, one noticeable difference is the range over which the individual curves remain near their peaks. Curves remain near their peaks over the widest range for the MD operator. The range is slightly narrower for the P84 and PA68 operators. The RMS operator has the narrowest range. A review of the literature shows that many different multipliers have been used [9, 11, 15, 19, 29, 31, 32, 33, 35, 40, 41]. In some cases, these may not have been optimal.

4.4 Effect of multiple units

While the spike detection performance of all four operators was quite similar for waveforms containing spikes from only one unit, a couple of interesting observations can be made regarding the performance for waveforms containing spikes from two units. First, when large amplitude spikes from one unit are present along with lower amplitude spikes from a second unit, the maximum optimality indices achieved by the RMS operator at the SNRs that are most common (SNRs of 3 through 6 according to Table 1) are lower than those achieved by the other operators. (Compare Figs. 4b, 5b with 3a.) Second, the optimality score curve for the RMS operator peaks farther to the left (at a lower multiplier value) in the two-unit case than it does in the single-unit case. This can be seen by comparing Fig. 3b with 5c. For the other operators, the multiplier for which the optimality score is highest remained unchanged across all of the different parameter combinations and circumstances investigated. This suggests that a single multiplier may prove to be optimal more generally for the MD, P84, and PA68 operators than for the RMS operator.

4.5 Comparison of noise measures

The complexity of neural noise should not be underestimated. It has been observed to be colored and non-stationary [12, 24]. The complexity of the neural waveforms over which noise calculations are performed is further complicated by the presence of spikes in addition to the noise. These spikes distort the distribution of the samples in the waveform since they contain a disproportionate number of high amplitude samples [39]. As a result, the noise level estimates (and consequently the thresholds) computed by the different noise measures should not automatically be assumed to lead to equal performance as they might when applied to large samples of perfectly Gaussian data. Since the statistical nature of the overall neural waveform is not simple or obvious, an experimental approach (using real data as opposed to a statistical model) to the problem under consideration is warranted.

While the overall performance of all four measures investigated in this study was fairly similar, the RMS operator did exhibit a few disadvantageous characteristics. The range of multipliers over which a given optimality index curve remained near its peak was narrowest for the RMS operator (Sect. 4.3). In the two-unit case, the RMS operator had lower maximum optimality indices and a less consistent optimal multiplier than the other operators (Sect. 4.4). Additionally, performing the squaring and square root operations used by the RMS operator can be problematic in certain types of systems. This suggests that using a percentile operator may be preferable for a system utilizing a mixed-signal processor [15, 40] and using the mean deviation operator may be preferable for a system utilizing a digital processor with limited memory resources [33].

4.6 Limitations and extensions

It should be noted that the results presented in Sects. 3.1 and 3.2 and discussed in Sects. 4.3 and 4.4 use a specific value of the weight w. This weight is related to the cost incurred by false detections. The effect of false detections can range from simply being a nuisance to causing a severe drop in performance depending on the circumstances. For example, in a wired recording system in which spikes are detected and extracted with essentially no limit on the number of spikes and in which a reliable spike sorting process will be performed (meaning that most of the false extractions will later be categorized as noise rather than as true spikes), false detections are undesirable but should have only a minor impact on the performance of the system. In this case, the emphasis should be on picking a multiplier that results in a high probability of detection.

In the case of a fully implantable neural data acquisition system with a limited data rate, false detections will take up valuable bandwidth and could possibly prevent some true spikes from being transmitted [33]. Even if spike sorting is performed after the extracted spikes are received outside of the body (again categorizing false detections as noise), some of the true spiking information may be irretrievably lost due to the false detections. The cost of false detections might be relatively high in this case if the data rate necessary for true spikes is close to the available data rate.

Another case is a system that does not use spike sorting but instead simply processes spike detection information. This system would end up treating false detections exactly like true spikes. Thus false detections would never be removed and the performance of the system could be impacted negatively. In this case, multipliers that lead to many false detections would not be considered optimal.

From these examples it is clear that the multiplier that is considered to be optimal in one case may not be considered optimal in another. In this study, we looked at one possible case. From Fig. 6, we can see that the thresholds computed with the multipliers that were deemed to be optimal in this study fall very near the boundary of the noise cloud. It is likely that using these thresholds, some false detections will occur. Using larger multipliers would decrease the likelihood of false detections but may also result in a higher number of undetected true spikes.

This study, as all studies must, had a limited scope. Seemingly endless combinations of SNRs, firing rates, and window sizes could have been investigated, especially when allowing spikes from multiple neurons to be present in a given waveform. This study investigated only a limited subset of the possible parameter combinations. Nevertheless, many of the ideas and methods presented in this paper can be extended and applied to circumstances and spike detection techniques that do not exactly match those presented in this study. For example, we performed the same analyses described in this paper using adaptive thresholds rather than nonadaptive thresholds. The adaptive thresholds were constructed by setting the threshold at each point to a multiple of the noise measure computed over the epoch ending at that point. We found that the results using adaptive thresholds were very similar to the average performance of nonadaptive thresholds presented here.

Also, this work is applicable beyond the realm of implantable neural data acquisition systems. Voltage thresholding is used widely to perform neural spike detection in many different contexts [9, 11, 14, 15, 17–19, 23, 26, 29–33, 35–37, 40, 41]. Other means of spike detection exist and have the potential to have improved spike detection performance [1–3, 7, 10, 13, 20, 21, 22, 24, 39]. Nevertheless, voltage amplitude thresholding, even with its less than ideal spike detection performance, is the predominant method of spike detection. This is in large part due to its simplicity and its ease of implementation. Automatic threshold selection is desirable in a wide variety of circumstances.

Finally, we suggest that the ability to easily and robustly select parameters that result in good performance across a wide range of realistic circumstances should be considered while developing and evaluating spike detection techniques. This is especially true for spike detection techniques intended for use in implantable systems. The optimization framework presented here may be useful in the development and evaluation of many of these techniques.

4.7 Final notes

4.7.1 SNR

No consistent definition of SNR is found in the neural spike processing literature. Two commonly used definitions are the one used in this paper (see Sect. 2.1) [7, 24, 41] and one in which the numerator of the ratio is the RMS value of the template rather than the peak amplitude of the template [2, 9, 25, 34, 39]. Other definitions are used as well [5, 10, 21, 22, 38]. A definition of SNR that relies on the peak amplitude fits well with a detection scheme that uses an amplitude threshold since only the high amplitude portion of the spike is likely to cross the threshold and lead to a detection. Also, SNR computations that rely on the rms value of a spike are influenced by the assumed temporal duration of the spike. In general, exactly what portion of a full neural waveform should be considered to be part of a spike is not well defined. For a useful note on SNR see [24].

4.7.2 Spike detection performance parameters

In Sect. 2.3.1, we characterize the spike detection performance by two parameters—P _D and R _FA. We defined R _FA as the number of false alarms divided by the temporal duration of the waveform. In the spike detection literature, it is more common to plot P _D versus the probability of false alarm (P _FA) or the “false alarm ratio” [5, 10, 18, 21, 24]. These are generally defined as the number of false alarms divided by the total number of detections. Since for realistic firing rates a single neuron’s spikes make up only a small percentage of the overall neural waveform, the number of false alarms is essentially independent of the firing rate. Rather, the number of false alarms is dependent on the noise characteristics and the spike detection parameters. The denominator of R _FA is also independent of the firing rate; thus, R _FA is essentially independent of the firing rate. On the other hand, the denominator of P _FA (namely, the sum of false alarms and true detections) is dependent on the firing rate. Consequently, P _FA can be very dependent on the firing rate. In a rough sense, for a given threshold, R _FA can be thought of as a measure of the noise level and P _D can be thought of as a measure of the signal level, both independent of firing rate. While plots of P _D versus R _FA do not exactly correspond to classical ROC curves, they do present the same general type of information. A few others have also plotted the probability of detection versus the rate of false alarms [22, 27].

5 Conclusion

We have investigated methods for automatically setting a spike detection threshold as a multiple of an estimate of a given channel’s noise level. The original motivation for this work was the problem of performing spike detection in the implantable neural signal processor presented in [33]. The results of this study confirm that the mean deviation operator is an appropriate noise measure for use within our implantable neural signal processor. The results also provide guidance regarding the selection of appropriate multipliers for this means of spike detection.

References

Aksenova TI, Chibirova OK, Dryga OA, Tetko IV, Benabid AL, Villa AEP (2003) An unsupervised automatic method for sorting neuronal spike waveforms in awake and freely moving animals. Methods 30(2):178–187
Article Google Scholar
Bankman IN, Johnson KO, Schneider W (1993) Optimal detection, classification, and superposition resolution in neural waveform recordings. IEEE Trans Biomed Eng 40(8):836–841
Article Google Scholar
Bar-Hillel A, Spiro A, Stark E (2006) Spike sorting: Bayesian clustering of non-stationary data. J Neurosci Methods 157(2):303–316
Article Google Scholar
Benitez R, Nenadic Z (2008) Robust unsupervised detection of action potentials with probabilistic models. IEEE Trans Biomed Eng 55(4):1344–1354
Article Google Scholar
Borghi T, Gusmeroli R, Spinelli AS, Baranauskas G (2007) A simple method for efficient spike detection in multiunit recordings. J Neurosci Methods 163(1):176–180
Article Google Scholar
Bossetti CA, Carmena JM, Nicolelis MAL, Wolf PD (2004) Transmission latencies in a telemetry-linked brain–machine interface. IEEE Trans Biomed Eng 51(6):919–924
Article Google Scholar
Brychta RJ, Tuntrakool S, Appalsamy M, Keller NR, Robertson D, Shiavi RG, Diedrich A (2007) Wavelet methods for spike detection in mouse renal sympathetic nerve activity. IEEE Trans Biomed Eng 54(1):82–93
Article Google Scholar
Burrows BL, Talbot RF (1985) The mean deviation. Math Gazette 69(448):87–91
Article Google Scholar
Chandra R, Optican LM (1997) Detection, classification, and superposition resolution of action potentials in multiunit single-channel recordings by an on-line real-time neural network. IEEE Trans Biomed Eng 44(5):403–412
Article Google Scholar
Choi JH, Jung HK, Kim T (2006) A new action potential detector using the MTEO and its effects on spike sorting systems at low signal-to-noise ratios. IEEE Trans Biomed Eng 53(4):738–746
Article Google Scholar
Farshchi S, Pesterev A, Guenterberg E, Mody I, Judy JW (2007) An embedded system architecture for wireless neural recording. In: 3rd Int IEEE/EMBS Conference on Neural Engineering, pp 327–332
Fee MS, Mitra PP, Kleinfeld D (1996) Variability of extracellular spike waveforms of cortical neurons. J Neurophysiol 76(6):3823–3833
Google Scholar
Gozani SN, Miller JP (1994) Optimal discrimination and classification of neuronal action potential waveforms from multiunit, multichannel recordings using software-based linear filters. IEEE Trans Biomed Eng 41(4):358–372
Article Google Scholar
Guillory KS, Normann RA (1999) A 100-channel system for real time detection and storage of extracellular spike waveforms. J Neurosci Methods 91(1–2):21–29
Article Google Scholar
Harrison RR (2003) A low-power integrated circuit for adaptive detection of action potentials in noisy signals. Conf Proc IEEE Eng Med Biol Soc 4:3325–3328
Google Scholar
Harrison RR (2008) The design of integrated circuits to observe brain activity. Proc IEEE 96(7):1203–1216
Article Google Scholar
Harrison RR, Watkins PT, Kier RJ, Lovejoy RO, Black DJ, Greger B, Solzbacher F (2007) A low-power integrated circuit for a wireless 100-electrode neural recording system. IEEE J Solid State Circuits 42(1):123–133
Article Google Scholar
Hatsopoulos NG, Xu Q, Amit Y (2007) Encoding of movement fragments in the motor cortex. J Neurosci 27(19):5105–5114
Article Google Scholar
Jensen W, Rousche PJ (2006) On variability and use of rat primary motor cortex responses in behavioral task discrimination. J Neural Eng 3(1):7–13
Article Google Scholar
Kim KH, Kim SJ (2000) Neural spike sorting under nearly 0-dB signal-to-noise ratio using nonlinear energy operator and artificial neural-network classifier. IEEE Trans Biomed Eng 47(10):1406–1411
Article Google Scholar
Kim KH, Kim SJ (2003) A wavelet-based method for action potential detection from extracellular neural signal recording with low signal-to-noise ratio. IEEE Trans Biomed Eng 50(8):999–1011
Article Google Scholar
Kim S, McNames J (2007) Automatic spike detection based on adaptive template matching for extracellular neural recordings. J Neurosci Methods 165(2):165–174
Article Google Scholar
Leiser SC, Moxon KA (2007) Responses of trigeminal ganglion neurons during natural whisking behaviors in the awake rat. Neuron 53(1):117–133
Article Google Scholar
Nenadic Z, Burdick JW (2005) Spike detection using the continuous wavelet transform. IEEE Trans Biomed Eng 52(1):74–87
Article Google Scholar
Nenadic Z, Burdick JW (2006) A control algorithm for autonomous optimization of extracellular recordings. IEEE Trans Biomed Eng 53(5):941–955
Article Google Scholar
Nicolelis MA, Dimitrov D, Carmena JM, Crist R, Lehew G, Kralik JD, Wise SP (2003) Chronic, multisite, multielectrode recordings in macaque monkeys. Proc Natl Acad Sci USA 100(19):11041–11046
Article Google Scholar
Obeid I (2007) Comparison of spike detectors based on simultaneous intracellular and extracellular recordings. In: 3rd Int IEEE/EMBS Conference Neural Engineering, pp 410–413
Obeid I, Wolf PD (2004) Evaluation of spike-detection algorithms for a brain–machine interface application. IEEE Trans Biomed Eng 51(6):905–911
Article Google Scholar
Olsson RH III, Wise KD (2005) A three-dimensional neural recording microsystem with implantable data compression circuitry. IEEE J Solid State Circuits 40(12):2796–2804
Article Google Scholar
Paninski L, Fellows MR, Hatsopoulos NG, Donoghue JP (2004) Spatiotemporal tuning of motor cortical neurons for hand position and velocity. J Neurophysiol 91(1):515–532
Article Google Scholar
Pouzat C, Mazor O, Laurent G (2002) Using noise signature to optimize spike-sorting and to assess neuronal classification quality. J Neurosci Methods 122(1):43–57
Article Google Scholar
Quiroga RQ, Nadasdy Z, Ben-Shaul Y (2004) Unsupervised spike detection and sorting with wavelets and superparamagnetic clustering. Neural Comput 16(8):1661–1687
Article MATH Google Scholar
Rizk M, Obeid I, Callender SH, Wolf PD (2007) A single-chip signal processing and telemetry engine for an implantable 96-channel neural data acquisition system. J Neural Eng 4(3):309–321
Article Google Scholar
Rutishauser U, Schuman EM, Mamelak AN (2006) Online detection and sorting of extracellularly recorded action potentials in human medial temporal lobe recordings, in vivo. J Neurosci Methods 154(1):204–224
Article Google Scholar
Snider RK, Bonds AB (1998) Classification of non-stationary neural signals. J Neurosci Methods 84(1–2):155–66
Article Google Scholar
Sodagar AM, Wise KD, Najafi K (2007) A fully integrated mixed-signal neural processor for implantable multichannel cortical recording. IEEE Trans Biomed Eng 54(6):1075–1088
Article Google Scholar
Soto E, Manjarrez E, Vega R (1997) A microcomputer program for automated neuronal spike detection and analysis. Int J Med Inform 44(3):203–212
Article Google Scholar
Suner S, Fellows MR, Vargas-Irwin C, Nakata GK, Donoghue JP (2005) Reliability of signals from a chronically implanted, silicon-based electrode array in non-human primate primary motor cortex. IEEE Trans Neural Syst Rehabil Eng 13(4):524–541
Article Google Scholar
Thakur PH, Lu H, Hsiao SS, Johnson KO (2007) Automated optimal detection and classification of neural action potentials in extra-cellular recordings. J Neurosci Methods 162(1):364–376
Article Google Scholar
Watkins PT, Santhanam G, Shenoy KV, Harrison RR (2004) Validation of adaptive threshold spike detector for neural recording. Conf Proc IEEE Eng Med Biol Soc 6:4079–4082
Google Scholar
Zumsteg ZS, Kemere C, O’Driscoll S, Santhanam G, Ahmed RE, Shenoy KV, Meng TH (2005) Power feasibility of implantable digital spike sorting circuits for neural prosthetic systems. IEEE Trans Neural Syst Rehabil Eng 13(3):272–279
Article Google Scholar

Download references

Acknowledgments

We would like to thank Dr. Iyad Obeid for his helpful comments. This work was supported by Grant Number F31EB007897 from the National Institute of Biomedical Imaging and Bioengineering.

Author information

Authors and Affiliations

Department of Biomedical Engineering, Duke University, Durham, NC, 27708, USA
Michael Rizk & Patrick D. Wolf

Authors

Michael Rizk
View author publications
You can also search for this author in PubMed Google Scholar
Patrick D. Wolf
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael Rizk.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Rizk, M., Wolf, P.D. Optimizing the automatic selection of spike detection thresholds using a multiple of the noise level. Med Biol Eng Comput 47, 955–966 (2009). https://doi.org/10.1007/s11517-009-0451-2

Download citation

Received: 05 August 2008
Accepted: 17 January 2009
Published: 10 February 2009
Issue Date: September 2009
DOI: https://doi.org/10.1007/s11517-009-0451-2

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Optimizing the automatic selection of spike detection thresholds using a multiple of the noise level

Abstract

Similar content being viewed by others

A normalized template matching method for improving spike detection in extracellular voltage recordings

Bayes optimal template matching for spike sorting – combining fisher discriminant analysis with optimal filtering

A Template-Based Sequential Algorithm for Online Clustering of Spikes in Extracellular Recordings

1 Introduction