Professional Documents
Culture Documents
Absract-In brain-machine interface (BMI) applications, the number of regularization techniques used in data mining,
neural firing activities have been represented by spike counts the method based on L1-norm penalty will be more suitable
with a fixed-width time bin. Adaptive models have been designed since it is able to select input variables. It will also enable us
to utilize these bin counts for mapping the associated behavior to understand the structure of the neural activity by selecting
which is typically 2D or 3D hand movement. However, the
more correlated neurons with the associated behavior.
representation of the firing activities can be enriched by binning
neural spikes with multiple time scales based on multiresolution In this paper, we present the multiresolution analysis of
analysis. This multiresolution representation of neural activities neural spikes and the application of modeling on those
can provide more accurate prediction of the hand movement representations using the L1-norm based regularization
parameters. Data mining techniques must be applied to models method.
using multiresolution representation in order to avoid overfitting.
In this paper, we demonstrate that the multiresolution II. MULTIRESOLUTION ANALYSIS OF NEURAL SPIKES
representation improves the performance of the linear model for
BMIs compared to the model with the fixed-width time bin. The multiresolution representation of a neural spike train
can be performed via the wavelet transform. To facilitate the
I. INTRODUCTION computation, we apply the discrete wavelet transform with the
dyadic Haar wavelets. This dyadic Haar wavelet is basically
The main function of the signal processing module in brain-
utilized in the à trous wavelet transform which can be
machine interfaces (BMIs) is to learn mappings between
implemented very effectively in hardware [9]. The smallest
neural activity patterns and motor parameters. To represent the
scale should be larger than 1ms because of the refractory
neural activity, most of experimental designs utilize the
period of a neuron. The largest scale should not exceed 1sec
estimate of local mean firing rate of neurons. The local mean
since it has been reported that the neural activity up to 1sec is
firing rate has been estimated by binning neural spikes with a
correlated with the associated behavior [1]. In our model, we
non-overlapping sliding time window of the length ranging
select eight scales starting at 5ms1 up to 640ms with dyadic
from 50ms up to 100ms [1-5]. Those representations of the
scaling: 5, 10, 20, 40, 80, 160, 320, and 640ms.
firing rate have been used for modeling of the relationship
With a set of scales, the wavelet transform is computed on
with responsive motor parameters. Adaptive models
each neuronal spike train. The Haar wavelet transform can be
(including the simple Wiener filter [6]) based on this estimate
regarded as multi-scale binning: at any given time, neural
have predicted motor parameters with a correlation coefficient
spikes are binned with different time window widths. The
between 0.6 ~ 0.8. It has also been shown that all the proposed
multi-scale binning process is repeated at each time instance at
models reached the same basic performance level for a target
a sampling rate of 200Hz, therefore time windows are sliding
reaching task, which may not be sufficient for more involved
with overlaps except the 5ms bin. In that case, the larger scale
real applications [7]. This led us to explore different
with wider overlaps represents a smoother temporal firing
representations of neural activity to help improve the accuracy
pattern. This procedure is illustrated in Fig. 1.
of the models. One possible way is to estimate the firing rate
Fig. 2 demonstrates an example of the wavelet transform
with multi-scale binning, which can be referred as the
coefficients (or multi-scale binned data) for some particular
multiresolution analysis of neural spike trains [8]. In this
neuron. The coefficients (for 5sec-long data) are presented
analysis, the short-time neural activity as well as the large-
along with the associated hand movement trajectories in the
scale activity can be represented. This multi-scale binning
bottom panel. In order to present the association between
possibly discloses information that may not be available with
wavelet coefficients and the movement, the temporal pattern
fixed-width binning.
of the wavelet coefficients for each scale is plotted on top of
With the multiresolution representation of neural spikes,
the hand position trajectory (x-coordinate) in fig. 3. The figure
however, the dimensionality of inputs to the mapping model
demonstrates that temporal patterns with larger scales seem to
will be considerably increased. Also, input variables are likely
to be collinear, thus increasing the condition number of the
input covariance matrix. Therefore, regularization methods
1
seem very appropriate to avoid poor generalization. Among The minimum scale (5ms) was empirically determined for which binning
yielded significantly different time series from raw spike trains.
t0
5ms 0
10ms 0
20ms 1
40ms 2
80ms 4
160ms 6
320ms 12
640ms 21
Fig. 3. The temporal trajectory of neural spike counts with multi-scale binning
along with the x-coordinate of the hand position. The solid line depicts the
wavelet coefficients while the dotted line does the trajectory. Note that each
trajectory is normalized to fit in the dynamic range of hand trajectory for the
visual purpose.
REFERENCES
Fig. 4. The distribution of the selected input variables for (a) x-coordinate, (b) [1] J. Wessberg, et. al., “Real-time prediction of hand trajectory by
and y-coordinate of position, and (c) x-coordinate, and (d) y-coordinate of ensembles of cortical neurons in primates,” Nature, vol. 408, pp. 361-
velocity. Black pixels denote the selected inputs. Neuron indices are aligned in 365, 2000.
the x-axis, and scales are in the y-axis. [2] Y. Gao, M.J. Black, E. Bienenstock, W. Wu, J.P. Donoghue, “A
quantitative comparison of linear and non-linear models of motor
cortical activity for the encoding and decoding of arm motions,” IEEE
We used two performance measures including the EMBS CNE, Capri, Italy, 2003.
correlation coefficients (CC) and the normalized MSE. The [3] A.B. Schwartz, D.M. Taylor, S.I.H. Tillery, “Extraction algorithms for
CC varies within [-1 1] where 0 indicates no correlation and 1 cortical control of arm prosthetics,” Current Opinion in Neurobiology,
vol. 11, pp. 701-708, 2001.
indicates the perfect correlation between the desired response [4] M.D. Serruya, N.G. Hatsopoulos, L. Paninski, M.R. Fellows, J.P.
and the model output. The normalized MSE was computed as Donoghue, “Brain-machine interface: Instant neural control of a
dividing MSE by the desired signal power. These measures movement signal,” Nature, vol. 416, pp. 141-142, 2002.
were evaluated for the test dataset to assess generalization [5] J.M. Carmena, et. al., “Learning to Control a Brain-Machine Interface
for Reaching and Grasping by Primates,” PLoS Biology, vol. 1, pp. 193-
performance. This analysis resulted in superior performance 208, 2003.
of the linear model with multiresolution data compared to the [6] S. Haykin, Adaptive filter theory. Upper Saddle River, NJ: Prentice Hall,
one with fixed-width binned data, as summarized in table IV. 1996.
To assess the statistical difference of performances between [7] S.P. Kim, et. al., “A comparison of optimal MIMO linear and nonlinear
models for brain-machine interfaces,” unpublished.
two models, we performed the one-tail t-test based on MSE as [8] F. Murtagh, J.L. Starck, O. Renaud, “On neuro-wavelet modeling,”
proposed in [7]. The null hypothesis of no difference between Decision Support Systems Journal, vol. 37, pp. 475-484. 2004.
model performance was rejected at significance level of [9] M.J. Shensa, “Discrete wavelet transfoms: Wedding the à trous and
0.05/0.01 (p<0.001) for both position and velocity, Mallat algorithms,” IEEE Trans. on Sig. Proc., vol. 40, pp. 2464-2482,
1992.
statistically proving a superior performance of the [10] A.E. Hoerl and R.W. Kennard, “Ridge regression: Biased estimation for
multiresolution model. nonorthogonal problems,” Technometrics, vol. 12, pp. 55-67, 1970.
[11] A. Krogh and J.A. Hertz, “A simple weight decay can improve
TABLE IV generalization,” In Advances in NIPS IV, San Mateo, CA, pp. 950-957,
PERFORMANCE COMPARISON 1995.
[12] R. Tibshirani, “Regression shrinkage and selection via the Lasso,” J.
Single Resolution Multiresolution Royal Stat. Soc. B, vol. 58, pp. 267-288, 1996.
Measures CC MSE CC MSE [13] M.A.T. Figueiredo, “Adaptive sparseness for supervised learning,” IEEE
Trans. on Pattern Analysis and Mach. Intell., vol. 25, 2003.
Position-X 0.712 0.474 0.755 0.409 [14] B. Efron, I. Johnstone, T. Hastie, and R. Tibshirani, “Least Angle
Position-Y 0.699 0.636 0.734 0.585 Regression,” Annals. of Statistics, in press.
Velocity-X 0.697 0.529 0.715 0.499 [15] C.M. Bishop, Neural networks for pattern recognition. Oxford, UK:
Oxford University Press, 1995.
Velocity-Y 0.763 0.419 0.779 0.396