You are viewing a javascript disabled version of the site. Please enable Javascript for this site to function properly.
Go to headerGo to navigationGo to searchGo to contentsGo to footer
In content section. Select this link to jump to navigation

Pilots’ mental workload prediction based on timeline analysis

Abstract

BACKGROUND:

The aircraft cockpit is a highly intensive human-computer interaction system, and its design directly affects flight safety.

OBJECTIVE:

To optimize the display interface design in complex flight tasks, the present study aimed to propose a dynamic conceptual framework and a timeline task analysis method for the quantization of the dynamic time effect of mental workload and the influencing factors of task types in the mental workload prediction model.

METHODS:

The multi-factor mental workload prediction model based on attention resource allocation was integrated to establish the dynamic prediction model of mental workload. The ergonomics simulation experiment was carried out by recording the data on the performance of embedded subtasks, National Aeronautics and Space Administration-Task Load Index (NASA-TLX) subjective evaluation, and eye tracking.

RESULTS:

The results indicated that the prediction model had a good prediction accuracy and effectiveness under different simulated interfaces and complex tasks, and the real-time monitoring of pilots’ mental workload state was realized.

CONCLUSION:

In conclusion, the prediction model and the experimental method could be applied to avoid the overload of the pilot throughout the flight phase by optimizing the display interface and adjusting the flight task.

1.Introduction

Pilots often need to process large amounts of information in a relatively short time and make quick response decisions to deal with possible urgent airspace situations [1, 2]. Moray et al. proposed that rationally optimizing the operator’s mental workload distribution could effectively reduce human error and improve system reliability and operator comfort [3, 4]. Therefore, the study of pilots’ mental workload prediction can maximally prevent flight accidents by effectively adjusting the flight task and the pilots’ mental state in time [5, 6].

At present, the quantitative modeling research of mental workload is still in its infancy [7, 8]. First, most models are descriptive [9, 10], and the quantitative modeling of mental workload is mostly based on post-mortem measurement of mental workload [11, 12]. Second, still many imperfections exist in the quantitative prediction of mental workload. Bin and Salvendy [13, 14] proposed a dynamic time conceptual model for mental workload prediction, including instantaneous mental workload (IMW), average mental workload (AMW), cumulative mental workload (CMW), maximum mental workload, and total mental workload (TMW), independent of a particular field of mission scenario. Laughery et al. [15, 16] conducted a task analysis method to calculate mental workload by decomposing complex tasks into multiple simple behavioral elements and connecting the elements hierarchically and logically. However, the massive task was difficult to analyze in detail. According to the single-channel theory, Siegel and Wolf [3] introduced the timeline analysis and prediction (TLAP) method in which the time workload equaled time required/time available (TR/TA). However, these models do not consider the influencing factors of mental workload in the aviation field, nor do they take into account the impact of different task types on mental workload. Xiao et al. [17] comprehensively adopted factors such as information amount, time pressure, visual coding, and attention resource allocation to establish a multi-factor mental workload prediction model. However, the model does not consider the conflicts within task types and dynamic time effect.

Considering the aforementioned problems, the present study aimed to optimize the display interface design under different flight tasks. On account of the dynamic concept of mental workload, the timeline task analysis method was used to decompose complex tasks into several task units and simple behaviors that changed dynamically with time and employed the McCracken-Aldrich scale [5] to divide and assign each simple behavior. A dynamic prediction model of mental workload based on timeline analysis was established according to the mental workload prediction based on attentional resource allocation and information processing [17]. The model was verified by carrying out an ergonomic experiment on the civil cockpit simulator. A correlation analysis was promoted between model prediction and experimental results, concerning comprehensive metrics including the performance of embedded subtasks, National Aeronautics and Space Administration-Task Load Index (NASA-TLX) subjective evaluation, eye tracking, and other technical indicators. The significant main effects and correlations indicated that the model has a good validity and availability for different interfaces and complex tasks with dynamic time variation.

2.Model development

2.1Timeline task analysis

In practice, tasks, especially high-mobility military tasks, are complex and usually consist of one or more simple tasks [1]. Meanwhile, the flight information required by the pilot is different under different flight phases. As a result, to accurately determine the mental workload state of the pilot over time, the timeline analysis method is used to decompose the complex tasks and derive the time-variant basic task units and the time attributes of each task unit, including start time t1j, end time t2j, and duration Δtj (where j refers to the j-th task unit).

2.2Task type correlation coefficient solution

Since different task types have different effects on mental workload, they need to be quantified according to the internal cognitive mechanism of task types [18]. Each task unit contains one or several specific behaviors [15]. The behavior of task segmentation is executed by different parts and actions of the operator [16]. The McCracken-Aldrich Mental Stress Assessment Scale [5] divides behaviors into four channels: visual, auditory, cognitive, and motor. Each channel is further subdivided into different scales of complexity. According to the multi-resource theory [3], behaviors that are not in the same channel can be executed simultaneously. The task type workload value kqj of the q-th decomposition behavior can be quantified. Then, the task type workload value kj is the summary value of Q decomposed behaviors of the j-th task unit.

(1)
kj=k1j+k2j++kQj=q=1Qkqj

The task type workload value calculated using the McCracken-Aldrich scale is considered to be overly heavy if more than 8 [5]. Therefore, the task type correlation coefficient Kjl of a task unit is calculated as follows, where m is the total number of task units decomposed by the l-th task. When Kjl=1, the mental workload caused by the l-th task is considered overwhelmingly high.

(2)
Kjl=kj/8=18q=1Qkqj,j=1,2,,m

2.3Multi-factor mental workload prediction

The multi-factor mental workload prediction based on attention resource allocation [17] is triggered by two decisive factors: external task demand and internal resource allocation of mental workload, integrating the information amount Hi, time pressure Ti, visual coding Ci, and attention resource allocation fi.

The average amount of information Hi of different information appearing over time is calculated as follows, wherein the information occurrence probability of the i-th areas of interest (AOI) Pi represents a coefficient related to the information number and information value.

(3)
Hi=Pi[log2(1/Pi)]

During the information perception, the attentional allocation factor fi is used to indicate the intrinsic mental resources that the operator needs to consume due to the information of a certain AOI. The attention resource occupied by n AOIs is A; then fi is the ratio of the attention resource to the total attention resource for the i-th AOI.

(4)
A=(A1,A2,,Ai,An)
(5)
fi=nAi/i=1nAi

After the information is captured by the visual system and transmitted to the mental procession by the nervous system, the attention resources allocated to the i-th AOI are as follows, where each of these information has a certain probability βi of being noticed in one gaze. Information prominence Si is quantified by the applicability of the highlighted format. Ei indicates the effort that the vision system needs to pay to acquire the i-th AOI, and Vi is the information importance attribute associated with the assignment task.

(6)
Ai=βiViSiEi-1

Ci [17] is the visual coding comprehensive performance eigenvalue, with 0<Ci<0.1. wij is the weight coefficient corresponding to the j-th visual coding in the i-th AOI, which is obtained by the G1 method through expert evaluation. vij is the comprehensive performance eigenvalue of the j-th visual coding obtained by meta-analysis, and is the fuzzy weighted average operator.

(7)
Ci=0.001j=1nwijvij

Related experimental results [2, 5, 17] show that the influence of visual coding on mental workload is less significant than that of information amount and time pressure. Therefore, the square root of visual coding is applied as the coordinated quantitative result in the model.

Considering the dynamic time effect of the mental workload, the present study improved the method of quantifying time pressure [17]. According to the TLAP, the time pressure affecting the mental workload is highly correlated with TR/TA on the timeline process. Therefore, the adapted calculation of the time pressure Ti is as follows, where TAi is the available time of the i-th AOI, given by the system for the operator to perform a task, and TRi is the required time of the i-th AOI, needed by the operator.

(8)
Ti=TRi/TAi

TRi can be represented by the time required by the operator from information presentation to perception, understanding, and reaction. According to the “behavior time prediction” method, the time from perception to reaction includes the reaction time Tci and the exercise time Tdi. Tci can be subdivided into simple reaction time and selective time. The simple reaction time of different sensory channels can refer to empirical data [3]. Hick-Hyman Law [11] can be employed in calculating the selective time. Ic is the information quantity constant, with the value 50–157 ms, and the average is 92 ms. H is the average information amount of the stimulation signal in the i-th AOI. The exercise time Tdi is calculated by Fitts’ Law [7], where Im is the motion control constant, with the value 70–120 ms and the average 100 ms; log2(D/S+1) is the motion difficulty index, where D is the target distance and S is the width of the target object.

(9)
TRi=Tci+Tdi=Ic×H+Im×log2(D/S+1)

Altogether, the multi-factor mental workload prediction value mwjl of the j-th task unit decomposed by task 1 is obtained:

(10)
mwjl=i=1n(HiTiCi-2)nfi=i=1n{Pi[log2(1/Pi)](TRi/TAi)Ci-2(nAi/i=1nAi)}=i=1n{Pi[log2(1/Pi)]([Ic×H+Im×log2(D/S+1)]Ci-2(nAi/i=1nAi)}

2.4Dynamic time effect calculation of mental workload

To monitor the pilots’ mental workload state in real time, the IMW, CMW, TMW, and AMW are established under the dynamic conceptual framework of mental workload [13, 14].

IMW is defined as the instantaneous amount of mental workload at a given time. Combined with Kjl and mwjl, the IMW MWjl(t) (in bit) for the j-th task unit decomposed by task l can be obtained:

(11)
MWjl(t)=Kjlmwjl=Kjli=1n(HiTiCi-2)nfi

The TMW MWTl (in bit × s) is defined as the total accumulation of mental workload from the start time to the end time. In the formula, T0 represents the initial time, and T represents the termination time. Mathematical expressions of other mental workloads can be obtained from the mathematical expressions of IMW.

(12)
MWTl=T0TMWjl(t)𝑑t=j=1mMWjl(t)Δtj=j=1mKjlmwjlΔtj

The AMW (in bit) is defined as the average intensity of mental workload throughout the time course.

(13)
MW¯l=MWTlT-T0=j=1mMWjl(t)ΔtjT-T0=j=1mKjlmwjlΔtjT-T0

3.Experimental method

3.1Participants

The study was performed on 21 participants (19 males and 2 females), with an average age of 24.5 years (N = 21, Mage = 24.5, SD = 1.0). They had an aeronautical knowledge background, right hand, normal or corrected vision, and no color blindness. The participants were required to participate in flight simulation training until they were fully familiar with the flight operations.

3.2Study design

The study was based on the actual flight tasks, using the display interface (3) × flight task (4) two-factor repeated measurement of the interior design of the test. As shown in Fig. 1, the display interface included three levels: the virtual main flight display (PFD) of B737, A320, and ARJ21, with proper simplification and abstraction. The flight phase included the takeoff phase and three cruise phases (90 turn, straight, and 180 turn). To avoid the practice and fatigue effects, the experimental sequence was designed in Latin.

Figure 1.

Simulated interface of B737, A320, and ARJ21, cruising flight phase (left) and simulation (right).

Simulated interface of B737, A320, and ARJ21, cruising flight phase (left) and simulation (right).

3.3Materials

The experiment was carried out in a Boeing 737–800 flight simulator, with real-time performance and eye movement indicators monitored. Within 15 min after the end of each experiment, the participants were asked to finish the subjective evaluation of different display interfaces and flight phases according to their real feelings.

Since the flight task performance was difficult to measure directly, the reaction time and the accuracy of the embedded subtask were used as performance indicators, which were automatically recorded by MATLAB programming through keyboard operation. The Swedish noncontact infrared eye tracking system Smart Eye Pro 4.5 was used to record the eye movement data, with an accuracy of better than 1 and a 60 Hz sampling rate. National Aeronautics and Space Administration-Task Load Index (NASA-TLX) [5] software was used for subjective evaluation.

3.4Experiment task

This experiment applied the embedded subtask method proposed by Wickens et al. [3], requiring the participant to monitor and identify the flight information on the PFD and navigation display (ND) using the remaining capabilities while ensuring that the manual control task (main task) was completed, including airspeed, altitude, pitch, and heading. During the experiment, the display interface was frozen and covered at any time, and the subject answered questions about the current value or change characteristics of the flight information within the specified time.

Manual control flight task is a simulated dynamic flight task for takeoff and cruise under a specified interface. The participants need to complete the takeoff operation by controlling the landing gear, throttle, flaps, and so on. The simulated cruise tasks consist of three phases. The participants are asked to control the steering wheel so that the aircraft can fly along the prescribed flight path and keep the airspeed, altitude, and attitude within the normal range.

4.Results

4.1Theoretical predictions

First, the takeoff phase lasts 109 s. The operations using accumulated long-term memory related to operation steps through experience and learning are divided into 11 task units. Second, the cruise phase lasts for 240 s and is decomposed into a single task unit. PFD and ND are divided into five AOIs. Then, the IMW value is calculated, as shown in Fig. 2. The TMW and AMW are shown in Table 1.

Table 1

Mental workload prediction and descriptive statistics (mean ± SD)

TMW /bit × sAMW /bitCorrect rate /1Reaction time /sSubjective evaluation /pointBlink frequency /HzAverage gaze time /s
B7371590.925.420.40 ± 0.184.98 ± 1.7166.47 ± 12.670.22 ± 0.140.20 ± 0.15
2876.913.650.60 ± 0.214.44 ± 1.3567.91 ± 7.890.15 ± 0.120.17 ± 0.14
3362.071.510.67 ± 0.164.44 ± 1.3054.97 ± 11.310.15 ± 0.120.18 ± 0.15
41010.024.210.61 ± 0.154.47 ± 1.3670.72 ± 8.480.11 ± 0.090.15 ± 0.12
A3201552.635.070.47 ± 0.165.80 ± 2.1467.08 ± 13.370.23 ± 0.120.17 ± 0.12
2876.603.650.55 ± 0.204.54 ± 1.0867.50 ± 7.780.17 ± 0.110.15 ± 0.10
3362.861.510.67 ± 0.164.74 ± 1.9552.75 ± 11.080.16 ± 0.110.20 ± 0.19
4997.894.160.63 ± 0.204.54 ± 1.4372.20 ± 7.980.12 ± 0.090.14 ± 0.12
ARJ211606.105.560.44 ± 0.175.19 ± 1.3667.25 ± 11.780.22 ± 0.110.24 ± 0.20
2905.023.770.52 ± 0.214.69 ± 1.6370.18 ± 8.450.14 ± 0.110.17 ± 0.12
3370.461.540.63 ± 0.214.33 ± 1.7854.60 ± 10.290.15 ± 0.110.22 ± 0.19
41022.924.260.57 ± 0.204.26 ± 0.9472.46 ± 7.810.13 ± 0.100.18 ± 0.14

Note: MW conditions are 1 (takeoff), 2 (90 turn), 3 (straight), and 4 (180 turn).

Figure 2.

IMW of the three display interfaces during the takeoff phase.

IMW of the three display interfaces during the takeoff phase.

Figure 3.

Distribution of gaze points (left) and correlation between mental workload prediction and some experimental values (right).

Distribution of gaze points (left) and correlation between mental workload prediction and some experimental values (right).

4.2Experiment result

The repeated-measures variance analysis (RMANOVA) of the two-factor design showed that the main effect of the display interface was only significant for the average gaze time (P= 0.005). The main effects of the flight phase on all indicators were significant (P< 0.005). The interaction effect between the two factors was not significant.

The multiple comparisons after single-factor test showed that the performance, subjective evaluation, and blink frequency were not significant among the three interfaces, and the gaze time was partially significant (PB737,𝐴𝑅𝐽21= 0.042, PA320,𝐴𝑅𝐽21= 0.004). Meanwhile, performance, subjective evaluation, and blink frequency were significant between the takeoff and straight phase (P= 0.001, 0.013, 0.001, and 0.001), and only the blink frequency was significant between 90 turn and 180 turn (P= 0.001).

As shown in Fig. 3, the Pearson correlation analysis between the prediction and experimental results indicated a significant positive correlation between TMW and subjective evaluation (r= 0.9, P< 0.001) and a significant negative correlation with mean gaze time (r=-0.681, P= 0.015). The AMW significantly positively correlated with the subjective evaluation (r= 0.809, P= 0.001) and reaction time (r= 0.537, P= 0.072) and significantly negatively correlated with the correct rate (r=-0.839, P= 0.001). The blink frequency was not related to the TMW and the AMW.

5.Discussion and conclusion

The ergonomics simulation experiment verified the sensitivity of each measure indicator to the mental workload under different display interfaces and flight tasks, so the performance of embedded subtask, National Aeronautics and Space Administration-Task Load Index (NASA-TLX) subjective evaluation, and eye tracking could construct a comprehensive evaluation system of pilots’ mental workload in actual flight environment. The low correct rate, long reaction time, high subjective evaluation, low blink frequency, and short average gaze time could mean the overload of the pilots which should be avoided.

Since the main effect of NASA-TLX is significant in different flight phases, and the correlation between the results of NASA-TLX and the TMW compared with the AMW is better, the TMW can be measured by the subjective evaluation of NASA-TLX, which is consistent with the results of dynamic mental workload framework [13, 14]. The main effects of performance in different flight phases are also significant. AMW negatively correlates with accuracy and positively correlates with reaction time, showing that the performance of embedded subtasks is reliable to AMW. Consequently, the research framework with AMW, TMW. and IMW is more accurate than that with a single workload.

On one hand, studies have proved that gaze time not only is a sensitive indicator of task difficulty but also can be used to judge the readability of display interface [2, 5, 8, 10]. The main effect of the gaze time is significant under the display interface and in the flight phase. Furthermore, a significant positive correlation exists with the theoretical prediction under the display interface, while a significant negative correlation is present in the flight phase. This may be due to the fact that, to achieve better performance, participants allocate more mental resources to complete the embedded subtask under the condition of the lower workload of the main flight task, thus showing the opposite trend of the gaze time. On the other hand, related studies have explained that an increase in the perceived workload can lead to a decrease in the blink frequency, and an increase in cognitive workload can lead to a faster blink frequency [2, 5, 8, 17]. The main effect of the blink frequency in the flight phase is significant: takeoff > straight > 90 turn > 180 turn. The possible reason why the theoretical prediction does not correlate with the blink frequency is that during the takeoff, the mental workload is mainly the cognitive workload caused by decoding memory, while during the cruise, the mental workload is mainly due to the perception workload caused by tracking flight information. With the increase in the turning angle, the perception workload increases and the blink frequency decreases, which is consistent with the theoretical prediction and verifies the prediction model.

The significant main effects and correlations in the flight phase indicate that the prediction model has good validity and availability for complex tasks with dynamic time variation. The display interface is only significant at the gaze time because the three interfaces are from active models that meet airworthiness standards and do not significantly increase pilots’ mental workload. Nevertheless, the trend is basically the same under each metric: the ARJ21 interface has the highest subjective evaluation, the lowest correct rate, the longest gaze time, and the lowest blink frequency. The difference between B737 and A320 is small, which is consistent with the theoretical prediction.

Compared to most of the existing mental workload quantitative models, the present study proposed a dynamic prediction model of mental workload based on timeline analysis. Through the timeline task analysis, complex flight tasks were decomposed into several simple task units. The McCracken-Aldrich scale was used to assign the task units to four channels, and the impact of task type and complexity on mental workload was comprehensively analyzed and quantified. Benefiting from the improved time pressure quantification method and the adjusted proportion of visual coding in the model, the accuracy of multi-factor mental workload prediction improved. Besides, the dynamic time conceptual framework of mental workload was introduced to realize real-time monitoring, analysis, and prediction of pilots’ mental workload status. As a result, the model could be applied to reduce aviation accidents by predicting overload of the pilots and proposing real-time and effective solutions throughout the flight phase. Meanwhile, without considering resource interference between multitasking, the model had limited accuracy in the multi-task environment. Follow-up studies may further improve the mental workload prediction model.

Acknowledgments

This study was co-supported by the joint program of the National Natural Science Foundation of China and Civil Aviation Administration of China (grant no. U1733118), as well as the Aeronautical Science Foundation of China (grant no. 201813300002).

Conflict of interest

None to report.

References

[1] 

Grassmann M, Vlemincx E, von Leupoldt A, et al. Individual differences in cardiorespiratory measures of mental workload: An investigation of negative affectivity and cognitive avoidant coping in pilot candidates. Applied Ergonomics, (2017) , 59: : 274–282.

[2] 

Xiaoru W, Damin Z, Shasha L. Effects of perceptual load related to flight task on auditory ERPs. Acta Scientiarum Naturalium Universitatis Pekinensis, (2011) , 47: (6): 995–1002.

[3] 

Wickens CD. Mental workload: assessment, prediction and consequences. In International Symposium on Human Mental Workload: Models and Applications, Springer, Cham, (2017) : pp. 18–29.

[4] 

Reimer B, Mehler B. The impact of cognitive workload on physiological arousal in young adult drivers: A field study and simulation validation. Ergonomics, (2011) , 54: (10): 932–942.

[5] 

Wei Z, Zhuang D, Wanyan X, et al. A model for discrimination and prediction of mental workload of aircraft cockpit display interface. Chinese Journal of Aeronautics, (2014) , 27: (5): 1070–1077.

[6] 

Vidulich MA, Tsang PS. The confluence of situation awareness and mental workload for adaptable human-machine systems. Journal of Cognitive Engineering and Decision Making, (2015) , 9: (1): 95–97.

[7] 

Young MS, Brookhuis KA, Wickens CD, et al. State of science: Mental workload in ergonomics. Ergonomics, (2015) , 58: (1): 1–17.

[8] 

Rusnock CF, Borghetti BJ. Workload profiles: A continuous measure of mental workload. International Journal of Industrial Ergonomics, (2018) , 63: : 49–64.

[9] 

Longo L. A defeasible reasoning framework for human mental workload representation and assessment. Behaviour & Information Technology, (2015) , 34: (8): 758–786.

[10] 

Hsu BW, Wang MJJ, Chen CY, et al. Effective indices for monitoring mental workload while performing multiple tasks. Perceptual and Motor Skills, (2015) , 121: (1): 94–117.

[11] 

Wanyan X, Zhuang D, Lin Y, et al. Influence of mental workload on detecting information varieties revealed by mismatch negativity during flight simulation. International Journal of Industrial Ergonomics, (2018) , 64: : 1–7.

[12] 

Roy RN, Charbonnier S, Campagne A, et al. Efficient mental workload estimation using task-independent EEG features. Journal of Neural Engineering, (2016) , 13: (2): 026019.

[13] 

Bi S, Salvendy G. Analytical modeling and experimental study of human workload in scheduling of advanced manufacturing systems. International Journal of Human Factors in Manufacturing, (1994) , 4: (2): 205–234.

[14] 

Vera J, Jiménez R, García JA, et al. Intraocular pressure is sensitive to cumulative and instantaneous mental workload. Applied Ergonomics, (2017) , 60: : 313–319.

[15] 

Laughery KR, Plott JB, Matessa M, Archer S. Modeling human performance in complex systems. Handbook of Human Factors and Ergonomics, Fourth Edition, (2012) , 931–961.

[16] 

Liang SFM, Rau CL, Tsai PF, et al. Validation of a task demand measure for predicting mental workloads of physical therapists. International Journal of Industrial Ergonomics, (2014) , 44: (5): 747–752.

[17] 

Xiao X, Wanyan X, Zhuang D. Mental workload prediction based on attentional resource allocation and information processing. Bio-medical Materials and Engineering, (2015) , 26: (s1): S871-S879.

[18] 

Marinescu AC, Sharples S, Ritchie AC, et al. Physiological parameter response to variation of mental workload. Human Factors, (2018) , 60: (1): 31–56.