Skip to main content

Motor-cognitive functions required for driving in post-stroke individuals identified via machine-learning analysis

Abstract

Background

People who were previously hospitalised with stroke may have difficulty operating a motor vehicle, and their driving aptitude needs to be evaluated to prevent traffic accidents in today’s car-based society. Although the association between motor-cognitive functions and driving aptitude has been extensively studied, motor-cognitive functions required for driving have not been elucidated.

Methods

In this paper, we propose a machine-learning algorithm that introduces sparse regularization to automatically select driving aptitude-related indices from 65 input indices obtained from 10 tests of motor-cognitive function conducted on 55 participants with stroke. Indices related to driving aptitude and their required tests can be identified based on the output probability of the presence or absence of driving aptitude to provide evidence for identifying subjects who must undergo the on-road driving test. We also analyzed the importance of the indices of motor-cognitive function tests in evaluating driving aptitude to further clarify the relationship between motor-cognitive function and driving aptitude.

Results

The experimental results showed that the proposed method achieved predictive evaluation of the presence or absence of driving aptitude with high accuracy (area under curve 0.946) and identified a group of indices of motor-cognitive function tests that are strongly related to driving aptitude.

Conclusions

The proposed method is able to effectively and accurately unravel driving-related motor-cognitive functions from a panoply of test results, allowing for autonomous evaluation of driving aptitude in post-stroke individuals. This has the potential to reduce the number of screening tests required and the corresponding clinical workload, further improving personal and public safety and the quality of life of individuals with stroke.

Introduction

Approximately 1.1 million individuals in Japan have suffered from stroke [1], which manifests via various symptoms, including paresis, aphasia, and cognitive function, depending on the location of the lesion. Among these symptoms, more than two-thirds of post-stroke individuals are thought to suffer from some degree of cognitive disturbances [2], resulting in interfere with activities of daily living (ADLs) and the need for additional care [3]. Even if post-stroke individuals are able to engage in ADLs, their social life can be severely impacted. Driving a car is an effective way for people with impaired walking ability to participate in social activities [4]; however, driving is a complex task that requires many cognitive and executive functions to regulate physical behaviors [5, 6]. On the other hand, the main causes of traffic accidents among drivers include incorrect perception, poor judgment, and the inability to respond to emergencies in a timely manner. These risk factors can be exacerbated by stroke or dementia with age [7], as evidenced by the increasing number of traffic accidents compared to the control group depending on the type of stroke [2]. In addition, traffic accidents caused by the elderly and people with dementia has increased in recent years. Therefore, for many individuals with cognitive impairment due to these disorders, evaluations of driving aptitude are required for today’s car-based society.

On-road driving tests using a real car are the gold standard for evaluating driving aptitude and previous studies showed no increase in the number of car accidents among post-stroke drivers who passed the test compared to the control group [8]. Whereas it is difficult to perform on-road test to all participants with stroke, there is a need to screen for the necessity of road testing. Therefore, in medical institutions, evaluation with a driving simulator and physical and cognitive function tests are also used [9]. A driving aptitude evaluation method using a simulator to reproduce on-road driving test situations has been developed. The simulator evaluation can provide an evaluation of situations that cannot be evaluated using on-road testing, such as bad weather, collision avoidance, and abrupt changes in the road [2]. However, there are various types of simulators with different standards, making it difficult to obtain better and constant results than on-road testing [2].

Physical and cognitive function tests have been performed in medical institutions to perform rehabilitation for individuals with stroke. Several studies have investigated the association between physical and cognitive function tests and driving aptitude [9,10,11]. Reger et al. reported relationships among the results of cognitive function tests, including attention, visuospatial cognition, memory, and executive function, and the results of on-road testing and the evaluation results from a driving simulator [10]. Yamada et al. performed two cognitive function tests, the mini-mental state examination (MMSE) and the trail-making test (TMT), on participants with stroke and reported relationships between the results of tests and driving aptitude [11]. A number of physical and/or cognitive function tests have shown the significant relation with driving aptitude, while too few studies have supported their reliability and validity in predicting driving aptitude. It was because that there are too many cognitive tests for measuring various cognitive dysfunction after stroke, and very few reports examined by the single method and verified the predictive ability of operation aptitude. As a result, there is no consensus regarding a valid and reliable test to determine the driving aptitude for individuals with stroke. Therefore, there is a need at this stage to conduct on-road driving tests to judge driving aptitude in subjects with hospital screening subtractions [2].

This paper investigates the association between driving aptitude and physical impairment, and cognitive dysfunction in participants with stroke. In this study, 10 tests of physical and cognitive function were administered to participants with stroke, taking up to 5 h per participant and lasting a week or more to complete. Therefore, it is necessary to use it as screening after reducing the number of essential tasks out of a physical-cognitive function test group. Here, we proposed a machine learning algorithm that can automatically select input indices related to driving aptitude by introducing sparse regularization, aiming to develop an effective method to identify subjects who must undergo the on-road driving test through prior hospital-based tests. In addition, we also analyzed the relationship between the indices obtained from physical and cognitive tests and driving aptitude using the proposed algorithm, thus promising to provide a theoretical basis for further valid evaluation.

Table 1 Clinical characteristic of the participants

Materials and methods

Participant data

Fifty-five participants with stroke (age: 61.3 ± 11.3 years) of Hibino Hospital participated in this study. Of these, 41 participants (age: 61.7 ± 11.2 years) were evaluated as a group with driving aptitude, and 14 participants (age: 61.2 ± 11.4 years) were evaluated as a group without driving aptitude. Table 1 shows the baseline data for participants in the groups with and without driving aptitude. The participants were asked to take the following 10 types of tests: the functional independence measure (FIM) [12], a type of functional independence assessment method; MMSE [13, 14], TMT [15, 16], which are cognitive function tests; Rivermead behavioral memory test (RBMT) [17], which is a type of memory test; CAT [18], which indicates attention function; the Japanese perceived stress scale (JPSS) [19, 20], which is a type of self-perceived stress test; the Brunnstrom stages of motor recovery (BRS) [21], which indicates the degree of paralysis; dominant hand side before stroke and affected side after stroke were also included in this analysis; the BIT [22], which indicates the degree of unilateral spatial neglect; the hospital anxiety and depression scale (HADS) [23, 24], which determines the presence or absence of depression and anxiety; the apathy score (AS) [25, 26], which determines the presence or absence of apathy.

The participants performed an on-road driving test and simulator test at the Numata driving school to evaluate their driving aptitude as described previously [27]. Driving aptitude was evaluated by driving school instructors based on the results of these tests. Based on the driving simulator results and the on-road testing, the participants were classified into two groups: a group with driving aptitude that the driving school instructor judged to be able to drive, and a group assessed to be unable to drive or require further training. The study was approved by the Ethics Review Committee of the Hiroshima University Epidemiological Research (E-1554-2, E-466-3) and was performed per relevant guidelines and regulations. Written informed consent was obtained from all participants.

Proposed neural network model

The log-linearized Gaussian mixture network (LLGMN) [28], a neural network based on a discriminative Gaussian mixture model, can estimate the posterior probability of the target class that the input data belong to by estimating the statistical distribution of the sample data. So far, several studies have demonstrated its effectiveness in recognizing biological information [29, 30], including an application to analyze the relationship between driving aptitude and physical and cognitive functions [31]. Moreover, since only the number of mixtures is a hyperparameter, the tuning cost is lower than that of general neural networks such as the multilayer perceptron [32]. However, simply evaluating driving aptitude using the LLGMN does not reveal the relationship between each participant’s clinical index and driving aptitude, rendering it challenging to realize the original intent of establishing an effective method for identifying subjects who must undergo an on-road driving test through hospital-based physical and cognitive function screening tests. Therefore, this study proposes an LLGMN-based sparse neural network that can automatically select indices related to driving aptitude by adding a dimensionality reduction layer with sparse weights before the input layer of the LLGMN.

Fig. 1
figure 1

Overview of the proposed neural network model for evaluating driving aptitude. The proposed network is composed of two parts: a dimensionality reduction layer with \(L_1\) regularized weights and a log-linearized Gaussian mixture network (LLGMN) [28]. This network calculates the posterior probability \(p(c|\textbf{x})\) of the presence or absence of driving aptitude (i.e., drivable or undrivable) using the indices \(\textbf{x}\) obtained from the physical and cognitive function tests as input. The weight parameters of the dimensionality reduction layer are denoted by \(\textbf{w} = \{w_i\}_{i=1}^P\)

The overview of the proposed network is shown in Fig. 1. The network’s input is a P dimensional index value \(\textbf{x} \in \mathbb {R}^P\) obtained from 10 physical and cognitive tests. The output is the posterior probability \(p(c|\textbf{x})\) of class \(c \in \{1, 2\}\) representing the presence or absence of driving aptitude. The following equation can express the relationship between the input and output in the dimensionality reduction layer:

$$\begin{aligned} y_i=w_i x_i \ \ \ (i=1,2,...,P), \end{aligned}$$
(1)

where \(y_i\) is the dimensionality reduction layer’s output, and \(w_i\) is the dimensionality reduction layer’s weights. In the proposed network, the corresponding input dimension is reduced by sparsifying the dimensionality reduction layer’s weights.

Given a set of training data \(\textbf{x}_n\) and corresponding target values \(t_{nc}\) indicating the presence or absence of driving aptitudes (\(n = 1, \ldots , N\); N is the number of participants for learning), let us consider the training of the proposed network. The target \(t_{nc}\) is a vector of one-of-C form, indicating the presence or absence of driving aptitude, where \(t_{nc} = 1\) if \(\textbf{x}_n\) belongs to class c, \(t_{nc} = 0\) otherwise. The solution of the proposed network can be obtained at the same time as learning by applying \(L_1\) regularization to the weights \(w_i\) of the dimensionality reduction layer. Here, the energy function E of the proposed network is defined as follows:

$$\begin{aligned} E = -\sum ^{N}_{n=1}\sum ^{2}_{c=1} t_{nc} \ln p(c|\textbf{x}_n)+\lambda \sum _{i=1}^{P} |w_i|, \end{aligned}$$
(2)

where the first term represents the cross-entropy error and the second term is the \(L_1\) regularization term, which penalizes the error according to the magnitude of the absolute value of the dimensionality reduction layer’s weights (\(w_i\)). The parameter \(\lambda\) is a regularization coefficient that determines the strength of \(L_1\) regularization.

The proposed network minimizes the energy function E based on the error backpropagation method. Hence, the parameters of the entire network can be learned in an end-to-end fashion to reduce the error within the range where \(w_i\) does not become large. Some \(w_i\) will be completely zero at the end of training by applying \(L_1\) regularization to the dimensionality reduction layer’s weights. Thus, input indices corresponding to \(w_i\) that become zero can be excluded from the model, and input indices related to driving aptitude can be selected automatically. As a result, redundant input dimensions may be minimized, further identifying crucial indices that are highly relevant to the evaluation of driving aptitude and reducing test time in future applications.

Importance of indices

It is important to identify the key features of a model in relational analysis using machine learning. \(L_1\) regularization is applied to \(w_i\) in the dimensionality reduction layer in the proposed network to perform parameter reduction estimation and dimensionality reduction by actively reducing the \(w_i\) with a low contribution to zero error reduction. The \(w_i\) remaining after learning represents the contribution of each index to the discrimination result; hence, the importance of each index may be obtained by examining the magnitude of \(w_i\).

We also examined the importance of indices using the permutation importance [33], which is one way to evaluate the importance of machine learning inputs, and compared it with the learned \(w_i\) in the proposed network. The permutation importance used the following algorithm to evaluate the importance of indices.

  1. 1.

    After splitting the whole dataset into training and validation datasets based on the cross-validation procedure, the model is trained using the training dataset.

  2. 2.

    The validation data, \(\textbf{X}^\text{val} = \{\textbf{x}^\text{val}_n\}\) (\(n = 1, 2, \ldots ,N^\text{val}\); \(N^\text{val}\) is the number of participants in the validation data), are input to the trained model to calculate the cross-entropy error e which is a measure to evaluate the accuracy of the model:

    $$\begin{aligned} e = L(t^\text{val}_{nc}, \textbf{X}^\text{val}) = -\sum ^{N^\text{val}}_{n=1}\sum ^{2}_{c=1} t^\text{val}_{nc} \ln p(c|\textbf{x}^\text{val}_n), \end{aligned}$$
    (3)

    where \(t^\text{val}_{nc}\) is the corresponding target value.

  3. 3.

    Shuffle the order of the i-th feature in the validation data (i.e., permutation) to create a new validation data matrix \(\widetilde{\textbf{X}}^\text{val}\).

  4. 4.

    Input \(\widetilde{\textbf{X}}^\text{val}\) into the trained model and compute the error \(e_i = L(t^\text{val}_{nc}, \widetilde{\textbf{X}}^\text{val})\) based on the output.

  5. 5.

    Calculate the difference in the error before and after permuting i-th feature, \(PI_i\), which is the evaluation value of permutation importance:

    $$\begin{aligned} PI_i = e_i - e. \end{aligned}$$
    (4)
  6. 6.

    Repeat steps 3 to 5 for all features \(i = 1,...,P\).

The above algorithm obtains the evaluation value of permutation importance \(PI_i\) for each feature. Since the cross-entropy error is used as the score for evaluating the model’s accuracy, the larger \(PI_i\) value indicates that the model’s accuracy deteriorates when the data order is randomized, suggesting that features with larger \(PI_i\) are important.

Relationship between evaluation indices and driving aptitudes

More than 300 indices were obtained from the 10 physical and cognitive function tests. Analyses of all these indices would require a great deal of time. Therefore, the indices that were considered to be related to driving aptitude were selected from all the indices under specialized physicians’ guidance. Indices that were highly related to driving aptitude were rated as \(\circledcirc\), those that were related as \(\bigcirc\), and those that were slightly related as \(\triangle\). Table 2 shows the evaluation results for each test. The results were \(\circledcirc\): 48 indices, \(\bigcirc\): 17 indices, and \(\triangle\): 7 indices. Moreover, the effective indices for driving aptitude evaluation were selected by machine learning using the proposed network using the three datasets created based on the evaluation results. Dataset 1 consisted of 48 indices with a rating of \(\circledcirc\). Dataset 2 consisted of 65 indices with a rating of \(\circledcirc\) and \(\bigcirc\). Dataset 3 consisted of 72 indices with a rating of \(\circledcirc\), \(\bigcirc\), and \(\triangle\). The data was standardized (mean = 0, standard deviation = 1) for each index. Table 3 shows the results of ROC analysis of the identification results by the proposed network. The AUC of dataset 1, dataset 2, and dataset 3 were 0.918, 0.946, and 0.862. Therefore, dataset 2 consisting of 65 indices of physical and cognitive functions listed in Table 4 were selected for analysis in this study (i.e., \(P = 65\)). The presence or absence of driving aptitude was used as the target value, and the values of 65 indices shown in Table 4 were used as input data. The experiments were run on a computer with an Intel Xeon X5-2620 (8 cores, 2.1\(-\)3.0 GHz) processor and 16 GB RAM.

Table 2 Evaluation results by physicians
Table 3 Results of ROC analysis based on datasets

The indices for driving aptitude evaluations were selected by machine learning using the proposed network, thereby judging the indices whose \(w_i\) was non-zero at least once to be effective. In this study, 11-fold cross-validation was used for analysis. The ROC analysis was also conducted using the posterior probability of driving aptitude identified by LLGMN and the presence or absence of driving aptitude. The classification accuracy of driving aptitude by the selected indices was evaluated using AUC values. This study examines the validity of the indices selected based on the proposed method by comparing the results with those of permutation importance.

The evaluation value of permutation importance, \(PI_i\), was calculated based on the 11-fold cross-validation using LLGMN on the dataset created with only the indices selected by the proposed network to verify whether the weight \(w_i\) of the proposed network expresses the importance of the indices. Since the permutation importance calculates the evaluation value \(PI_i\) for each fold of the cross-validation, 11 evaluation values can be obtained for each index i. This study calculates the average of \(\{PI^{(1)}_i, PI^{(2)}_i, \ldots , PI^{(11)}_i\}\) obtained for each index in cross-validation as permutation importance for the final evaluation.

We also evaluated the prediction ability of the proposed method by comparing it with conventional methods. The LLGMN with dimensionality reduction by partial KLI [31, 34, 35] and Lasso regression were used as conventional methods. As an indicator of the importance of the selected indices, the reduction rate of the AUC value generated by deleting each index after dimensionality reduction was used in LLGMN with dimensionality reduction by partial KLI, and the magnitude of the standardized partial regression coefficient was used in Lasso regression.

Table 4 List of indices used in the analysis

The proposed network was trained using stochastic gradient descent (SGD) with a learning rate of 0.01, a batch size of 50, and a number of epochs of 10,000. The learning was terminated when the loss did not improve by more than \(1\times 10^{-4}\) for more than 5 epochs. The regularized parameter \(\lambda\) of the proposed network was optimized using the tree-structured Parzen estimator (TPE) [36]. The detailed settings of the TPE are described in Additional file 1.

Statistical analysis

Fisher’s exact test was performed to confirm the relationship between the evaluation of driving aptitude by the indices selected by the proposed network and the evaluation by the instructor of the driving school at a significance level of 5%. The evaluation of driving aptitude by the indices selected by the proposed network was discriminated against using thresholds of the maximum sum of sensitivity and specificity attributed to the ROC analysis. Additionally, Yule’s correlation coefficient was calculated to confirm the strength of the relationship. The AUC values for the conventional methods and the proposed method were calculated and compared using the Holm-adjusted Delong test at a significance level of 5%.

Results

In this study, we used 65 indices obtained from physical and cognitive function tests conducted on 55 participants with stroke (Table 4). These indices were input into a neural network model to evaluate driving aptitude. The proposed neural network model consists of a dimensionality reduction layer and a discriminative layer (Fig. 1). In the dimensionality reduction layer, there is a single weight parameter corresponding to each input index. The value of the weight parameter is determined according to the contribution of each corresponding input index to the prediction of driving aptitude. For a discriminative layer, we used the log-linearized Gaussian mixture network (LLGMN), which is a neural network based on discriminative Gaussian mixture models. During the model training, the weight parameters of the entire network were optimized based on the loss function with the \(L_1\) norm on the weights of the dimensionality reduction layer. This learning method provides a sparse solution for the weight parameters of the dimensionality reduction layer, thereby effectively eliminating less important input variables in the learning process. Therefore, the proposed neural network model can find important evaluation indices as well as evaluate driving aptitude.

Prediction accuracy of driving aptitude

Dimensionality reduction and prediction of driving aptitude based on the proposed method were performed using 65 indices as input variables. The training and testing processes were conducted by 11-fold cross-validation. For comparison, we used two baselines: the LLGMN with dimensionality reduction by partial Kullback–Leibler information (KLI) and the least absolute shrinkage and selection operator (Lasso) regression. The partial KLI-based dimensionality reduction is a discrete variable selection method that reduces the input variables stepwise based on the class posterior probabilities output by LLGMN. The Lasso regression is a linear regression analysis method with \(L_1\) regularization, which allows continuous variable selection. Each analysis’s duration was 1.7 h for the proposed network, 19.6 h for LLGMN with dimensionality reduction using partial KLI, and 0.016 h for Lasso regression, as shown in Fig. 2c.

Figure 2a and Table 5 show the receiver operating characteristic (ROC) analysis of each method. The area under the curve (AUC) of the proposed network and LLGMN with partial KLI-based reduction were both \(> 0.94\), showing that these methods could discriminate driving aptitude with high accuracy. Significant differences between the proposed network and Lasso regression and between LLGMN with partial KLI-based reduction and Lasso regression were observed (proposed network vs. Lasso regression: \(p=9.8 \times 10^{-3}\), LLGMN with dimensionality reduction using partial KLI vs. Lasso regression: \(p=7.0 \times 10^{-3}\)).

Fig. 2
figure 2

Results of driving aptitude evaluation. a ROC curve for each method. Red solid line, green dotted line, and gray dashed line represent the proposed network, LLGMN with partial KLI-based dimensionality reduction, and Lasso regression, respectively. b Confusion matrix for comparing the diagnostic results with the results analyzed by each method. c Comparison of predictive ability and time complexity of three methods. The statistical test results based on the Delong test with Holm adjustment are also shown (significance level: 0.01)

Table 5 Results of ROC analysis

Figure 2b shows the confusion matrices of the evaluation of driving aptitude by each method and the evaluation by the instructors of the driving school. There were significant relationships between the evaluations of each method and the instructors (proposed network: \(p=2.3 \times 10^{-7}\), LLGMN with partial KLI-based reduction: \(p=1.7 \times 10^{-8}\), and Lasso regression: \(p=1.8 \times 10^{-3}\)). Yule’s coefficient of association for each method was 0.974 (proposed network), 0.983 (LLGMN with partial KLI-based reduction), and 0.797 (Lasso regression), meaning that the neural network-based dimensionality reduction showed a high degree of relationships between actual and predicted driving aptitudes.

Importance of indices for driving aptitude evaluation

The importance of each index can be evaluated by examining the values of the weight parameters, \(w_i\), in the dimensionality reduction layer of the proposed network, since the \(L_1\) regularization shrinks the weight parameters related to the indices with a low contribution to the driving aptitude. Therefore, the indices with larger values of the corresponding weight parameters can be interpreted as more important variables in evaluating driving aptitude.

Fig. 3
figure 3

Results of importance evaluation of indices. a Optimized weights (\(w_i\)) of indices selected by the proposed network. b Results of permutation importance for indices selected by the proposed network. c Standard partial regression coefficients for Lasso regression. “\(\times\)” in the boxplot denotes the average value of each index

Resultantly, the original 65 input indices were reduced to 20 after dimensionality reduction using the proposed network, whose corresponding weight parameter values have never been reduced to zero due to the 11-fold cross-validation analysis. Figure 3a shows the boxplots of weight parameters, \(w_i\), of the indices selected by the proposed network, sorted in descending order by their mean values. The index with the largest averaged \(w_i\) was CPTSRT average reaction time. MMSE: Orientation-time, CPTSRT coefficient of validation, Affected side Left, and Copying test had the next largest averaged \(w_i\), in that order. In addition, as shown in Table 1, four of these five indices were significantly different between the two groups with and without driving aptitude: CPTSRT average reaction time (\(p=0.0499\)), MMSE: Orientation-time (\(p=0.0004\)), Affected side Left (\(p=0.0286\)), and Copying test (\(p=0.0058\)).

We also conducted a similar analysis using permutation importance [33] as another way to evaluate the importance of indices using a learned model. Figure 3b shows the result of the calculated permutation importance, denoted as PI, using the 20 indices selected using the proposed network. The order of the indices was the same as in Fig. 3a. The index with the largest average PI is CPTSRT accuracy. CPTSRT average reaction time, Copying test, MMSE: Orientation-time, and Digit span backward were the next indices with the largest average PI in that order. These results showed that four out of the top five indices with the largest PI were consistent with the top six indices with the largest average \(w_i\) of the proposed network. It is worth noting that the CPTSRT accuracy of the two groups of participants was also significantly different (\(p=0.0168\)) as shown in Table 1.

Table 6 Ranking list of the indices selected by LLGMN with partial KLI-based dimensionality reduction

Table 6 shows the indices selected for dimensionality reduction by partial KLI and the reduction rate of AUC values after removing indices in descending order. Nineteen indices were obtained after dimensionality reduction by partial KLI. The maximum reduction in the AUC was 15.8% of CPTSRT accuracy. Additionally, the decrease in the AUC for CPTAX average reaction time was 14.8%, for CPTSRT coefficient of validation was 11.8%, for Digit span backward was 11.2%, and for Letter cancellation test was 9.2%. Figure 3c shows the mean of the standardized partial regression coefficients of Lasso regression in descending order of absolute value. The standardized partial regression coefficients were largest for MMSE: Orientation-time; the next largest coefficients were for Copying test, CPTSRT accuracy, CPTSRT average reaction time, and Lower limbs on discharge. The top five most important indices for each analysis method were MMSE: Orientation-time of MMSE, 2 sub-tests (CPTSRT average reaction time and CPTSRT accuracy) of continuous performance test (CPT) of clinical assessment for attention (CAT), and 2 sub-tests (Copying test and Letter cancellation test) of behavioral inattention test (BIT) (Fig. 3 and Table 6). These results indicated that the same indices were selected by all analysis methods.

Discussion

In this paper, we present a neural network model with a dimensionality reduction layer and analyzed the physical and cognitive function test results of post-stroke individuals associated with driving aptitude. In the proposed network, \(L_1\) norm-based sparse regularization is applied to the weight parameters of the dimensionality reduction layer, and the entire network is trained end-to-end for continuous variable selection of input indices. Using the various indices of each participant as input data and the corresponding driving aptitude as output labels, the proposed approach enables the automatic extraction of important indices for evaluating the driving aptitude of post-stroke individuals out of a huge number of medical indices.

The AUC values of the indices selected by the proposed network and the indices selected by dimensionality reduction with partial KLI showed high accuracy in discriminating driving aptitude (Table 5), whereas that of Lasso regression showed lower accuracy. Fisher’s exact test showed a significant relationship for each method, but Yule’s coefficient of relationship for the proposed network and LLGMN with dimensionality reduction using partial KL information were higher than that for Lasso regression, reaching \(>0.97\). This suggests that the nonlinear analysis methods, the proposed network and LLGMN with dimensionality reduction using partial KL information, are more suitable for analyzing the relationship between physical and cognitive function tests and driving aptitude compared with a linear analysis method, Lasso regression. In addition, the analysis time of the proposed network is much shorter than that of the partial KLI-based method (Fig. 2c). This is because the proposed network can select indices continuously by end-to-end training, while the partial KLI-based method selects indices stepwise by repeating the training for the number of indices.

Four of the top five indices with the largest PI coincided with the top six indices with the largest average weight \(w_i\) of the proposed network (Fig. 3a and b), all of which are also significantly between the two groups of participants with and without driving aptitude. This suggests that the average weights \(w_i\) of the proposed network reflect the importance of the indices in terms of their contribution to discrimination. In contrast, some indices, such as Affected side Left, have a large average of weight \(w_i\) without a large PI. The distribution of \(w_i\) was confirmed to be wide by focusing on the weight \(w_i\) of Affected side Left. This suggests that the PI became small even for indices with a large average of \(w_i\) because the weights for each fold in the cross-validation differed greatly. However, this Affected side Left suggests the importance of right hemisphere damage in determining driving aptitude. In our previous studies, the right hemisphere has been shown to have cognitive functions related to car driving, directed attention, and sustained attention. In addition, our previous research has shown that the left and right hemispheres of the brain can coordinate to regulate the speed and accuracy of thought and action processing [27]. Therefore, both left and right affected sides were included in the remaining indices in the evaluation.

For each analysis method, MMSE: Orientation-time, CPTSRT average reaction time, CPTSRT accuracy and other indices obtained by the CPT, and the Copying test and Letter cancellation test, which are indices obtained by the BIT, were selected (Fig. 3, Table 6). This consistent result suggests that the indices selected by the proposed method are appropriate since the proposed method can select indices similar to those of the conventional methods. The relationship between these indices and driving aptitude is discussed below. MMSE: Orientation-time is an index to evaluate the orientation of time. This index may be selected as an index of declining cognitive function because conventional research has reported that declining orientation of time is an early stage of declining cognitive function required for driving [37]. The CPT, a sub-test of the CAT, evaluates sustained attention. Sustained attention is considered an index related to driving aptitude. It has been reported that slow cognitive processing speed related to sustained attention [38] is associated with traffic accidents [39]. Copying test and Letter cancellation test which are sub-tests of BIT are obtained. BIT is used for the purpose of measuring unilateral spatial neglect which has been known to interfere with driving a car [40]. For these reasons, the indices selected for each analysis method are related to driving aptitude and are considered important indices for evaluating driving aptitude. Typically, it takes a total of about 5 h to complete all 10 physical and cognitive function tests, which can last one week or more. However, if the number of indices can be reduced based on the results of this study, effective evaluation of driving aptitude can be achieved by completing the physical and cognitive function test in one day. In conclusion, the proposed method provides compelling evidence for the effective evaluation of driving aptitude in individuals with stroke by specifying highly relevant tests of physical/cognitive functions and reducing the number of tasks required in the screening test, potentially raising the possibility of their resumption of car driving as well as reducing the risk of accidents.

Limitations and future work

In this paper, we evaluated driving aptitude in 55 participants with stroke using the developed machine learning model. Whereas current work has narrowed down evaluation indices that would have taken a week to obtain to the extent of a single day, it remains challenging to justify a full day of driving assessment for participants with stroke. In addition, the specificity of all three investigated classification methods may be questioned in the clinical setting by the 4.9\(-\)29.3% of false positives.

On the one hand, the dataset of the same participants was used for the cross-validation to select indices and for the cross-validation to evaluate the driving aptitude with the selected indices due to the small number of participants enrolled. Therefore, the evaluation of driving aptitude in this study is not a rigorous evaluation of completely unknown data. In the future, priority should be given to increasing the number of participants and confirming the evaluation accuracy of unknown data.

On the other hand, previous studies have reported an association between motor-cognitive functions and the location of brain damage [27]. Therefore, we intend to include brain imaging analysis in the future to identify brain regions associated with driving aptitude, which may contribute to higher specificity while maintaining the evaluation duration, thus improving the clinical feasibility. Furthermore, although the w of the dominant side factor before stroke is low, it is still possible that driving ability was related to whether the affected side was on the dominant side. In the future, we would like to clarify this relationship by analyzing lesion sites.

Conclusion

This paper demonstrated that the sparse neural network, LLGMN with \(L_1\) dimensionality reduction layer, can analyze physical and cognitive function test results associated with driving aptitude more efficiently than the conventional method and identifies the presence or absence of driving aptitude with high accuracy. In addition, the indices obtained from the MMSE and the CPT, which is a sub-test of the CAT and the BIT, were important in the evaluation of driving aptitude. This indicates the possibility of evaluating driving aptitude with fewer tests than those that are currently required.

Availability of data and materials

The datasets supporting the conclusions of this article are available on request and if no ethical reasons prohibit sharing.

Abbreviations

ADLs:

Activities of daily living

AS:

Apathy score

AUC:

Area under the curve

BIT:

Behavioral inattention test

BRS:

Brunnstorm stages of motor recovery

CAT:

Clinical assessment for attention

CPT:

Continuous performance test

CPTAX:

Continuous performance test AX version

CPTSRT:

Continuous performance test simple version

CPTX:

Continuous performance test X version

HADS:

Hospital anxiety and depression scale

JPSS:

Japanese perceived stress scale

KLI:

Kullback–Leibler information

Lasso:

Least absolute shrinkage and selection operator

LLGMN:

Log-linearized Gaussian mixture network

MMSE:

Mini-mental state examination

PASAT:

Paced auditory serial addition test

RBMT:

Rivermead behavioral memory test

ROC:

Receiver operating characteristic

SDMT:

Symbol digit modalities test

SGD:

Stochastic gradient descent

TMT:

Trail-making test

TPE:

Tree-structured Parzen estimator

References

  1. Ministry of Health Labour and Welfare: Summary of patient survey 2017. (Accessed 29 Oct 2020). https://www.mhlw.go.jp/toukei/saikin/hw/kanja/17/index.html

  2. Hird MA, Vetivelu A, Saposnik G, Schweizer TA. Cognitive, on-road, and simulator-based driving assessment after stroke. J Stroke Cerebrovasc Dis. 2014;23(10):2654–70.

    Article  PubMed  Google Scholar 

  3. Matérne M, Strandberg T, Lundqvist L-O. Change in quality of life in relation to returning to work after acquired brain injury: a population-based register study. Brain Injury. 2018;32(13–14):1731–9.

    Article  PubMed  Google Scholar 

  4. Collia DV, Sharp J, Giesbrecht L. The 2001 national household travel survey: a look into the travel patterns of older Americans. J Saf Res. 2003;34(4):461–70.

    Article  Google Scholar 

  5. Akinwuntan AE, Devos H, Feys H, Verheyden G, Baten G, Kiekens C, De Weerdt W. Confirmation of the accuracy of a short battery to predict fitness-to-drive of stroke survivors without severe deficits. J Rehabil Med. 2008;39(9):698–702.

    Article  Google Scholar 

  6. Akinwuntan AE, Wachtel J, Rosen PN. Driving simulation for evaluation and rehabilitation of driving after stroke. J Stroke Cerebrovasc Dis. 2012;21(6):478–86.

    Article  PubMed  Google Scholar 

  7. Shangguan Q, Fu T, Wang J, Luo T, et al. An integrated methodology for real-time driving risk status prediction using naturalistic driving data. Accident Anal Prev. 2021;156: 106122.

    Article  Google Scholar 

  8. Devos H, Akinwuntan AE, Nieuwboer A, Truijen S, Tant M, De Weerdt W. Screening for fitness to drive after stroke: a systematic review and meta-analysis. Neurology. 2011;76(8):747–56.

    Article  CAS  PubMed  Google Scholar 

  9. Mimura M. Driving a car for people with higher brain dysfunction. Higher Brain Funct Res. 2011;31(2):157–63.

    Article  Google Scholar 

  10. Reger MA, Welsh RK, Watson G, Cholerton B, Baker LD, Craft S. The relationship between neuropsychological functioning and driving ability in dementia: a meta-analysis. Neuropsychology. 2004;18(1):85–93.

    Article  PubMed  Google Scholar 

  11. Yamada K, Sasaki T, Kudo A, Sengoku Y. Relationship between neuropsychological testing and actual driving assessment in patients with cerebrovascular disease. Higher Brain Funct Res. 2013;33(2):270–5.

    Article  Google Scholar 

  12. Christiansen CH. Evaluation and management of daily self-care requirements. Rehabilitation Medicine: Principles and Practice. 1998;145–7.

  13. Folstein MF, Folstein SE, McHugh PR. ‘Mini-mental state’’: a practical method for grading the cognitive state of patients for the clinician. J Psychiatric Res. 1975;12(3):189–98.

    Article  CAS  Google Scholar 

  14. Mori E, Mitani Y, Yamadori A. Usefulness of the Japanese version of the mini-mental state test in patients with neurological diseases. Jpn J Neuropsychol. 1985;1(2):82–90.

    Google Scholar 

  15. Reitan RM. The relation of the trail making test to organic brain damage. J Consult Psychol. 1955;19(5):393–4.

    Article  CAS  PubMed  Google Scholar 

  16. Hirota C, Watanabe M, Sun W, Tanimoto Y, Kono R, Takasaki K, Kono K. Association between the trail making test and physical performance in elderly Japanese. Geriatr Gerontol Int. 2010;10(1):40–7.

    Article  PubMed  Google Scholar 

  17. Wilson B, Cockburn J, Baddeley A, Hiorns R. The development and validation of a test battery for detecting and monitoring everyday memory problems. J Clin Exp Neuropsychol. 1989;11(6):855–70.

    Article  CAS  PubMed  Google Scholar 

  18. Japan Society for Higher Brain Dysfunction. Clinical assessment for attention. Shinko-Igaku Press; 2006.

  19. Cohen S, Kamarck T, Mermelstein R. A global measure of perceived stress. J Health Soc Behav. 1983;24:385–96.

    Article  CAS  PubMed  Google Scholar 

  20. Iwahashi S, Tanaka Y, Fukudo S, Hongo M. The development of the Japanese version of the perceived stress scale. Jpn J Psychosom Med. 2002;42(7):459–66.

    Google Scholar 

  21. DeLisa JA, Gans BM, Walsh NE. Physical medicine and rehabilitation: principles and practice. Philadelphia: Lippincott Williams & Wilkins; 2005.

    Google Scholar 

  22. Wilson B, Cockburn J, Halligan P. Development of a behavioral test of visuospatial neglect. Arch Phys Med Rehabil. 1987;68(2):98–102.

    CAS  PubMed  Google Scholar 

  23. Zigmond AS, Snaith RP. The hospital anxiety and depression scale. Acta Psychiatrica Scandinavica. 1983;67(6):361–70.

    Article  CAS  PubMed  Google Scholar 

  24. Matsudaira T, Igarashi H, Kikuchi H, Kano R, Mitoma H, Ohuchi K, Kitamura T. Factor structure of the hospital anxiety and depression scale in Japanese psychiatric outpatient and student populations. Health Qual Life Outcomes. 2009;7:42.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Starkstein SE, Fedoroff JP, Price TR, Leiguarda R, Robinson RG. Apathy following cerebrovascular lesions. Stroke. 1993;24(11):1625–30.

    Article  CAS  PubMed  Google Scholar 

  26. Yamagata S, Yamaguchi S, Kobayashi S. Impaired novelty processing in apathy after subcortical stroke. Stroke. 2004;35(8):1935–40.

    Article  PubMed  Google Scholar 

  27. Shimonaga K, Hama S, Tsuji T, Yoshimura K, Nishino S, Yanagawa A, Soh Z, Matsushige T, Mizoue T, Onoda K, et al. The right hemisphere is important for driving-related cognitive function after stroke. Neurosurg Rev. 2021;44:977–85.

    Article  PubMed  Google Scholar 

  28. Tsuji T, Fukuda O, Ichinobe H, Kaneko M. A log-linearized Gaussian mixture network and its application to EEG pattern classification. IEEE Trans Systems, Man, and Cybern Part C. 1999;29(1):60–72.

    Article  Google Scholar 

  29. Furui A, Eto S, Nakagaki K, Shimada K, Nakamura G, Masuda A, Chin T, Tsuji T. A myoelectric prosthetic hand with muscle synergy-based motion determination and impedance model-based biomimetic control. Sci Robot. 2019;4(31):6339.

    Article  Google Scholar 

  30. Soh Z, Matsuno M, Yoshida M, Furui A, Tsuji T. Measurement of emotional states of zebrafish through integrated analysis of motion and respiration using bioelectric signals. Sci Rep. 2021;11:187.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Hama S, Yoshimura K, Yanagawa A, Shimonaga K, Furui A, Soh Z, Nishino S, Hirano H, Yamawaki S, Tsuji T. Relationships between motor and cognitive functions and subsequent post-stroke mood disorders revealed by machine learning analysis. Sci Rep. 2020;10:19571.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Doi H, Iijima N, Furui A, Soh Z, Yonei R, Shinohara K, Iriguchi M, Shimatani K, Tsuji T. Prediction of autistic tendencies at 18 months of age via markerless video analysis of spontaneous body movements in 4-month-old infants. Sci Rep. 2022;12:18045.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Breiman L. Random forests. Mach Learn. 2001;45(1):5–32.

    Article  Google Scholar 

  34. Shibanoki T, Shima K, Tsuji T, Otsuka A, Chin T. A quasi-optimal channel selection method for bioelectric signal classification using a Partial Kullback-Leibler information measure. IEEE Trans Biomed Eng. 2012;60(3):853–61.

    Article  PubMed  Google Scholar 

  35. Shibanoki T, Shima K, Takaki T, Kurita Y, Otsuka A, Chin T, Tsuji T. A novel class selection method based on a Partial Kullback-Leibler information measure and its application to motion classification problem for EMG signal discrimination. Trans Soc Instrum and Control Eng. 2012;48(9):580–8.

    Article  Google Scholar 

  36. Bergstra J, Bardenet R, Bengio Y, Kégl B. Algorithms for hyper-parameter optimization. Adv Neural Inf Process Syst. 2011;24:2546–54.

    Google Scholar 

  37. Ikeda M. Clinical skills for dementia. Higher Brain Funct Res. 2009;29(2):222–8.

    Article  Google Scholar 

  38. Lundqvist A, Alinder J, Alm H, Gerdle B, Levander S, Ronnberg J. Neuropsychological aspects of driving after brain lesion: simulator study and on-road driving. Appl Neuropsychol. 1997;4(4):220–30.

    Article  CAS  PubMed  Google Scholar 

  39. Anstey KJ, Wood J, Lord S, Walker JG. Cognitive, sensory and physical factors enabling driving safety in older adults. Clin Psychol Rev. 2005;25(1):45–65.

    Article  PubMed  Google Scholar 

  40. Tant MLM. Visual performance in homonymous hemianopia: assessment, training and driving. PhD thesis, University of Groningen, Groningen, Netherlands; 2002.

Download references

Acknowledgements

We thank A. Kandori for helpful comments leading to design approaches; K. Yoshimura for help in developing the machine learning model.

Funding

This work was supported by Adaptable and Seamless Technology transfer Program through Target driven R &D (A-STEP) from Japan Science and Technology Agency (JST) Grant Number JPMJTR22R8 and JSPS KAKENHI Grant Number JP19H04442.

Author information

Authors and Affiliations

Authors

Contributions

GT developed the machine learning model, analyzed data, and wrote the manuscript. SH, AY, and KS performed experiments and collected data. AF and TT devised the machine learning model. SH, AF, ZX, ZS, HH, and TT revised the manuscript. SH and TT contributed to data interpretation. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Seiji Hama or Toshio Tsuji.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the Ethics Review Committee of the Hiroshima University Epidemiological Research (E-1554-2, E-466-3) and was performed per relevant guidelines and regulations. Written informed consent was obtained from all participants.

Consent for publication

All authors have approved the manuscript for submission. All participants gave written informed consent to publish these data. No individual details, images or videos from a person are included in the manuscript.

Competing interests

The authors declare that they have no competing interests and there are no financial competing interests to declare in relation to this manuscript.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1:

 Optimization method for regularization parameters.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Tabuchi, G., Furui, A., Hama, S. et al. Motor-cognitive functions required for driving in post-stroke individuals identified via machine-learning analysis. J NeuroEngineering Rehabil 20, 139 (2023). https://doi.org/10.1186/s12984-023-01263-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12984-023-01263-z

Keywords