Quantifying and recommending seat belt reminder timing

1

Quantifying and recommending seat belt reminder timing using naturalistic driving video data

*Daniel V. McGehee1,2, Cheryl A. Roe1, Pranaykumar Kasarla1,2, Chao Wang1,2

University of Iowa

1National Advanced Driving Simulator

2Department of Industrial and Systems Engineering

*Corresponding Author: Daniel V. McGehee, National Advanced Driving Simulator, University of Iowa,

2401 Oakdale Blvd, Iowa City, Iowa 52245; [email protected]; Tel +1-319-430-3052

mailto:[email protected]

2

Abstract

Introduction To better understand the timing of when people buckle their seat belt, an analysis of a

naturalistic driving study was used. The study provided a unique perspective inside of the vehicle where

the entire seat belt was visible from the time the driver entered the vehicle to one minute of driving

forward or 32 kph. Method: Thirteen of these drivers were identified for a seat belt sequencing which

identified the points when the vehicle was put into ignition, shifted, when vehicle movement began, and

when the seat belt was buckled. The speed at belt closure was also identified. The timing from ignition to

buckle and to shifting into forward gear were examined in order to identify the speed and appropriate

timing for seat belt reminders. Results: The data shows that drivers were buckled in over 92% of the

3,102 drives. In 70% of those total drives, the drivers were buckled before the vehicle began movement.

Of greater interest for seat belt reminders/interlocks are those drives when drivers buckle after movement.

When considering time from ignition to seat belt closure, the mean was 27.5 seconds. Because higher

speeds are typically reached when travelling forward rather than reverse, it was important to know the

time duration from shifting into drive to buckling. Conclusions: Since higher speeds lead to an increased

risk of injury and/or death, a recommendation of a 30 second time from forward shift and a 25 kph

threshold for reminder systems should be implemented. The regression analysis also validates that most

of the predicted seat belt buckling times are within 30s. Practical Applications: This would reduce

perception of nuisance alerts and protect the driver from higher speed unbuckled crashes. The seat belt

buckling time prediction model also demonstrates good potential for developing tailored buckling

warning system for different drivers.

Keywords: seat belt; nuisance alarm; safety interlock; buckle timing

3

Quantifying and recommending seat belt reminder timing using naturalistic driving video data

1. Introduction

A continued increase in seat belt use is fundamental to creating safer roads and reducing the

societal burden of crash injuries and fatalities. It is estimated that in 2020, seat belt use for front seat

occupants reached 90.3% in the United States (National Center for Statistics and Analysis, 2021), a 9.1%

gain from 2006. Furthermore, seat belts saved 14,955 passenger vehicle occupants aged five and older in

the U.S. during 2017 (NHTSA, 2020). While seat belt use continues to slowly rise, further increasing belt

use is still the most effective way to reduce fatalities and minimize injuries in motor vehicle crashes

(Campbell, 1987; Evans, 1986; Sato, 1987), as millions still do not buckle up. In 2018, nearly 23% of all

passenger vehicle occupants who died in crashes were unrestrained at the time of the crash (NHTSA,

2020).

Over the years, seat belt reminder systems have played a vital role in increasing seat belt use.

While auditory or visual warnings have been shown to be effective in getting occupants to buckle their

seat belt (Fildes, Fitzharris, Koppel, Vulcan, & Brooks, 2003; Krafft, Kullgren, Lie, & Tingvall, 2006;

Williams, Wells, & Farmer, 2002), they haven’t always proven successful. At issue is the timing. If the

reminder alerts the drivers too soon after engaging forward motion, it could be viewed as a nuisance

(Regan et al., 2006). This could result in manually disconnecting the reminder alert. Reminding a driver

too late may lead to increased risk of injury as they may already be traveling at a higher speed without

being restrained. The change of velocity is captured by delta-v, which is directly related to the severity of

injury and increases the risk of fatality (Richards, 2010). Most seriously, a single car crashing into a tree

or infrastructure would decelerate from some initial velocity to zero in a short time, and even relatively

low speed crashes can be fatal. According to 2018 Fatal Analysis Reporting System (FARS) data, 34.4%

of single car crashes occurred at or below 64 kph, and 52.8% of single car crashes occurred below 80 kph

(NHTSA, 2018). Averaging speeds for the top 17 crash types for vehicle-to-vehicle (V2V) applications,

48% of them occur at or below 56 kph, while 45% occur between 56 and 88 kph (Najm et al., 2013).

4

Three of the 17 crash types (Running stop sign, Straight crossing path, Non-signal, and Turn at Non

signal) occurred at less than 56 kph more than 60% of the time.

Another type of reminder related to reminder timing is the seat belt safety interlocks. The timing

of this type of system is crucial to prevent it from becoming a nuisance. Interlocks are designed to limit

some of the systems operating in the vehicle, such as the radio or other entertainment systems, accelerator

pedal (providing resistance), ignition, or shifting capabilities (Mizruchi, 1996). In the 1970s, the National

Highway Traffic Safety Administration (NHTSA) mandated seat belt ignition interlocks, prohibiting

vehicles from being started unless the seat belt was buckled. These interlocks were widely unaccepted by

the general public and were quickly removed from vehicles. Negative perceptions of these interlocks still

linger today (Eby, Molnar, Kostyniuk, Shope, & Miller, 2004; Kidd, McCartt, & Oesch, 2014).

Furthermore, it is widely understood that wearing a seat belt correctly saves lives and mitigates

injuries during crashes. However, a considerable percentage of the population still does not to wear seat

belts. Understanding why people choose not to wear their seat belt is only part of the issue. Reminders

have been effective, but safety interlocks have not shown to have the same success. Understanding how to

effectively design safety interlocks so that they will be accepted could play a role in getting more people

to buckle up.

To remedy these issues, belting behavior is often studied through crash data, surveys,

observation, or with interviews. More recently, naturalistic studies have been used to investigate seat belt

behavior as they provide a unique perspective, capturing drivers in more everyday environments.

Although researchers in previous studies have identified seat belt users, there is no standard for how this

is done. A seat belt user has been identified by frequency (Kidd et al., 2014; Reagan, McClafferty, Berlin,

& Hankey, 2013) or by the time of buckling. The timing is often associated to different actions taking

place in the vehicle, such as seat belt reminders (Bao, Xiong, Buonarosa, & Sayer, 2015) or shifting into

gear (Malenfant & Van Houten, 2008). Although the research includes some of the actions going on in

the vehicle, there is no research that provides a detailed analysis of the seat belt timing sequence from

5

before ignition through the start of driving (McGehee & Roe, 2017). Our study fills this gap and makes

recommendations on seatbelt timing.

2. Methods

In order to investigate seat belt buckling behavior, data from a previous naturalistic driving study

(NDS) with 30 drivers were examined in the seat belt buckling context. Observations during that study,

which was conducted in suburban Iowa, focused on the participants’ entry into their vehicle until

acceleration to a driving speed as well as the parking maneuver at the end of each drive. Each participant

had a four-camera system (interior cabin view, forward view, and two foot-well cameras) along with an

accelerometer, two small infrared lights (one to light the interior cabin and one in the footwell), and GPS

installed into their vehicle for about one month. These data were collected as part of another study

(McGehee et al., 2016). Video and audio data were recorded at the start (door opening to 32 kph or 1

minute) and end of each drive (last minute), and at accelerometer threshold settings of 0.5g or greater.

These settings were chosen based on the guidance and experience of the manufacturer, as well as on those

used in other naturalistic driving studies. Dingus et al. (2006); McGehee, Raby, Carney, Lee, & Reyes

(2007); and Carney, McGehee, Lee, Reyes, & Raby (2010) used -0.5g as the threshold for defining hard

braking and ±0.4g and ±0.55g as the threshold for defining rapid steering maneuvers (McGehee & Roe,

2017). This analysis used the audio, video, and GPS data collected during the NDS to identify seat belt

behaviors.

2.1 Participants

A total of 30 participants from two age groups completed the NDS in Iowa City, Iowa: ten

between 18 to 35 years and twenty over 65 years of age. Although seat belt use was determined for all 30

participants, only 13 of them (4 younger (2 males and 2 females) and 9 older (6 males and 3 females)

were selected for the seat belt timing sequence that will be used to determine the timing of seat belt

reminder system recommendations (see selection criteria in procedure section). Participants were licensed

drivers with normal color vision and a visual acuity of at least 20/40. They had to make at least one round

trip per day (e.g., to the store and back) and drive a 1996 or newer vehicle model.

6

2.2 Procedure

The video data from the NDS provides a camera view of the interior of the cabin that allows for

the identification of drivers’ seat belt buckling behavior as well as the sequences that drivers take when

buckling (Figure 1).

Figure 1. Interior cabin view

Seat belt use was classified into four groups: driver puts seat belt on before car is moving, driver puts

seat belt on during start-up maneuver, driver puts seat belt on during driving, and no seat belt

throughout the drive. To better understand the details regarding buckling after movement, the drivers

that buckled after vehicle movement at least 25% of the time and had at least ten drives in this

category were identified for the seat belt sequencing. This sequencing identified various time points

from entering the vehicle through seat belt closure. Using this criteria, 13 drivers were identified for

which ten drives were randomly selected for the time series for buckling. Time points at ignition, gear

shifts, grabbing the seat belt, and seat belt closure were collected. Gear selection and speed at closure

were also recorded. Seat belt closure was defined as the point at which the male end of the seat belt

connected to the female end. Coding was completed by two trained coders with discrepancies being

reviewed and mediated between them. Time points were identified from the video and/or audio

available from the interior camera, and speed was identified from the GPS device installed in the

vehicle.

7

Descriptive statistics were calculated for seat belt use, timing to closure from ignition, shifting

into initial gear, and shifting into forward gear as well as speed at buckle. Furthermore, data cleaning and

model selection criteria techniques were applied on the data and regression analyses were performed to

build models for determining the drivers’ seat belt bucking behaviors and buckling time.

3. Results

Out of 3,102 drives, drivers were buckled 92% of the time. The individual participant breakdown

showed buckling rates between 74% and 100%, with 21 out of the 30 drivers buckling more than 90% of

the time. Results show that buckling prior to movement occurred in 70% of the drives. Of the drives when

buckling took place prior to movement, there were:

• 11% who buckled prior to ignition,

• 56% who buckled after ignition and prior to shifting, and

• 6% who buckled after both ignition and shifting but prior to vehicle movement.

An additional 107 drives (3%) were confirmed as buckling prior to vehicle movement, however due to

video sampling errors the timing of seat belt closure was unknown. Figure 2 shows the breakdown of the

buckling sequence of the drives.

304(11%)

1603(56%)

167(6%)

673(24%)

0

200

400

600

800

1000

1200

1400

1600

1800

Buckled prior to ignition Buckled after ignition,prior to shift

Buckled after ignition andshift

Buckled after ignition andshfit

Nu

mb

er

of

dri

ves

Ignition Gear Shift Vehicle Movement

8

Figure 2. Summary of buckle sequence

In order to recommend the appropriate requirements for seat belt reminder warnings, drives when

drivers buckled after movement began were identified. The time duration from ignition to seat belt

closure showed a mean of 27.5 seconds with a range from 4.6 to 86.6 seconds. This duration could lead to

nuisance alarms because this duration would include any actions the driver performs before the act of

driving begins. If considering the act of driving to be when the vehicle was put into gear, the amount of

time that elapsed from initial shift to seat belt closure was 22 seconds with a range of 2.8 to 92.4 seconds.

The majority of drives in this category began with the initial shift position into reverse, however 61% of

the drives had belt closure during forward movement. The time duration from “shift to drive to buckle”

showed a mean of 16.2 seconds with a range of 3.7 to 92.4 seconds. Table 1 shows the details of the

various time durations to buckle. Ninety-one percent of buckling after forward movement took place

under 30 seconds, which can be seen in the cumulative distribution in Figure 3.

The mean speed at buckling when travelling forward was 15.3 kph (4.2 m/s) with a range of 7.9

to 50.7 kph (2.2 to 14.1 m/s). Twelve of the thirteen drivers mean speeds were below 21 kph (5.8 m/s).

The cumulative distribution shown in Figure 3 shows that in 90% of the drives, drivers were buckled at

speeds lower than 28.7 kph (8.0 m/s). Note that the horizontal axis represents both the duration (in unit

second) and the speed (in unit kph), where the two units are aligned to share the same scale.

Table 1. Time duration to buckle

Number of

drives with

duration

Mean

Duration

(s)

Median

Duration

(s)

Standard

Deviation

(s)

Min

(s)

Max

(s)

Ignition to buckle 103 27.5 23.7 18.0 4.6 86.6

Initial shift to buckle 110 22.0 19.7 15.4 2.8 92.4

Shift to drive to

buckle 85 16.2 13.3 12.2 3.7 92.4

9

Figure 3. Distribution and comparison of “time for forward shift to buckle” and “speed at buckle”

Identifying speed relative to how much time has elapsed from “forward shift to buckle” provides

an opportunity to see how speed changes as time increases. Error! Reference source not found. 4

identifies each drive with valid speed and time duration (forward shift to buckle). There were only 63

points identified. Speeds of 0 kph are included in the figure. A gray box was added to the figure to

illustrate the drives that would fall inside the recommendations (25 kph and 30 seconds from forward

shift) discussed in the discussion section. Out of our analysis, 14 drives (22%) fell outside of one of these

recommendations: of these, six fell outside due to buckle taking place at speeds greater than 25 kph,

seven after a time duration (forward shift to buckle) greater than 30 seconds, and one drive fell outside the

range for both. For the drives when drivers buckled more than 30 seconds after forward shift, three were

buckled within 31 seconds.

0

10

20

30

40

50

60

70

80

90

100

0 10 20 30 40 50 60

Per

cen

tage

KPH at buckle

Duration from forward shiftto buckle (s)

90

%

Time (s)/Speed (kph)

10

Figure 4. Time from ‘forward shift to buckle’ and speed

3.1 Regression Analysis

To better facilitate the analysis of the factors that influence drivers’ seat belt buckling behavior

and timing, regression was used. Table 2 highlights the four categories used in the analysis, which are:

• Driver puts seat belt on before car is moving: If participant buckled seat belt prior to any

movement of the vehicle, in reverse or forward direction.

• Driver puts seat belt on during start-up maneuver: If participant put the seat belt on during the

start-up sequence. The start of the drive sequence was defined as the moment the car was put into

gear to the point when the vehicle was completely in the roadway and traveling forward.

• Driver puts seat belt on during driving: If participant put on seat belt after the start-up sequence

had been completed.

• No seat belt throughout the drive: If participant did not have the seat belt on at any time during

either the start-up or parking sequence. As previously discussed, the video system did not

continuously record the entire drive. If the video available showed no belt worn at start-up and

0

5

10

15

20

25

30

35

40

45

50

55

0 10 20 30 40 50 60 70 80 90 100

Spee

d a

t b

uck

le (

kph

)

Time from forward shift to buckle (s)

11

parking, these drives were “assumed no seat belt.” However, it is possible that these drivers could

have buckled and unbuckled during the section of the video that wasn’t available.

Table 2. Seat belt usage – participant drive breakdown

In this analysis, the following research questions were set to investigate the seat belt buckling

behavior and buckling time of the drivers by analyzing the data provided:

1. What is an appropriate model that can represent the driver’s seat belt buckling behaviors?

2. What is an appropriate model that can represent the driver’s seat belt buckling time during

driving?

3. What are the significant variables (contributors) which have influence on the driver’s seat belt

buckling behavior and buckling time during driving?

3.2 Regression Modeling

Both qualitative and quantitative regression analysis was performed on the data, and significant

input variables were identified to build models for determining the seat belt bucking behaviors and

buckling time. Note that in the case of buckling time, we only focus on determining the driver’s seat belt

12

buckling time when the driver falls in the Puts seat belt during driving category. This is because the

“during driving” period is when the seat belt functions most significantly. The Puts seat belt during

driving corresponds to the “vehicle movement” portion in Figure 2.

Selecting the regression model depends on the type of the data to be analyzed. Multinomial

Logistic Regression (Hilbe, 2009), denoted as Model (a), was used for the research question 1 and 3, as

the response variable – “buckling behavior” has four categories. Whereas, Linear Regression, denoted as

Model (b), was used for the research question 2 and 3, as the response variable – “buckle time” is a

continuous variable.

Model - a) Input variables:

For all of the input variables, their range, categories, and how they are represented in the

multinomial logistic regression model are shown in Table 3 below:

Table 3. Input variables for buckling behavior

Model - a) Response variable:

The multinomial logistic regression model gives the log odds of a new category to a reference

category of the response variable. In this case, it is 𝑙𝑜𝑔 (𝑃𝑖

𝑃4) for all i = 1,2,3 where P1, P2, P3, P4 are the

probabilities of each categorical membership of buckling behavior.

P1: Probability of being in category No seat belt throughout drive

P2: Probability of being in category Puts seat belt during driving

P3: Probability of being in category Puts seat belt during startup maneuver

P4: Probability of being in category Puts seat belt on before car is moving

Model - a) Multinomial logistic regression model equation:

13

As we have four categories in the response variable, multinomial logistic regression estimates

parameter values for 𝛽0, 𝛽1, ⋯ , 𝛽7 via maximum likelihood method of all other three categories with

respective to the reference category. Applying multinomial logistic regression to our data, by taking Puts

seat belt on before car is moving category as a reference level for response variable, results in the

following equation for all 𝑖 = 1, 2, 3.

𝑙𝑜𝑔 (𝑃𝑖

𝑃4) = 𝛽0

𝑖 + 𝛽1𝑖 𝑋1 + 𝛽2

𝑖 𝑋2 + 𝛽3𝑖 𝑋3 + 𝛽4

𝑖 𝑋4 + 𝛽51𝑖 𝟏(𝑋5 = 𝑋51) +

𝛽52𝑖 𝟏(𝑋5 = 𝑋52) + 𝛽61

𝑖 𝟏(𝑋6 = 𝑋61) + 𝛽62𝑖 𝟏(𝑋6 = 𝑋62) +

𝛽63𝑖 𝟏(𝑋6 = 𝑋63) + 𝛽7

𝑖 𝑋7

(1)

Note: 𝟏(𝒙) is an indicator function, where 𝟏(𝒙) = {1 𝑖𝑓 𝑥 𝑖𝑠 𝑡𝑟𝑢𝑒0 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒

Model - b) Input variables:

For all the input variables, their range, categories, and how they are represented in the linear

regression model are shown in Table 4 below. Weather was not involved in the regression model related

to buckling time due to the model diagnostic that removes outliers and missing values.

Table 4. Input variables for buckling time

Model - b) Response variable:

The response variable “buckle time” is the time taken by the driver to buckle the seat belt from

the moment the car started moving forward after the startup maneuver (e.g., reversing).

14

Model - b) Linear regression model equation:

Multiple models were generated with all the linear terms and two-way interactions terms of the

input variables. The generalization of these models is as follows, where ‘Y’ is the buckle time.

𝐹𝑢𝑙𝑙 𝑀𝑜𝑑𝑒𝑙 𝑌 ~ 𝑋1 + 𝑋2 + 𝑋3 + 𝑋4 + 𝑋5 + 𝑋6 + ∑ 𝑋𝑖𝑋𝑗

1≤𝑖<𝑗≤6

(2)

3.3. Regression analysis results and discussion

Model - a) Multinomial logistic regression model results:

Model selection criteria: Model selection is an important part of any statistical analysis. It is used

to balance the model complexity and prediction accuracy. The Akaike Information Criterion (AIC)

(Sakamoto, Ishiguro, & Kitagawa, 1986) was applied on our models to select the most representative

input variables.

Model results from AIC:

By applying AIC model selection criteria, preferred model with the minimum AIC value of

2882.37 has included all the input variables. The multinomial logistic regression is then applied to

provide detailed analysis results.

To make it easier to understand the results generated by the model, we provide two examples for

explaining the coefficients: continuous input variable (Height) and categorical input variable (Rain/Mist

category of Weather). All other coefficients of continuous input variables can be explained like the

coefficient of the Height. Similarly, all other coefficients of categorical input variables can be explained

like the coefficient of Rain/Mist. Note: An asterisk (*) indicates the significant variables (p-value < 0.05)

in the model equations.

Scenario 1:

𝑙𝑜𝑔 (𝑃1

𝑃4) = −32.079∗ − 0.010𝑋1 − 0.085𝑋2 + 0.111𝑋3

∗ + 0.011𝑋4 +

13.165 𝟏(𝑋5 = 𝑋51)∗ + 11.588 𝟏(𝑋5 = 𝑋52)∗ + 11.297 𝟏(𝑋6 = 𝑋61)∗

+10.523 𝟏(𝑋6 = 𝑋62)∗ + 11.334 𝟏(𝑋6 = 𝑋63)∗ − 0.008𝑋7∗

(3)

15

𝛽31 = 0.111: If a subject were to increase his Height by one inch, the multinomial log-odds of

preferring category No seat belt throughout drive to category Puts seat belt on before car is moving

would be expected to increase by 0.111 units while holding all other variables in the model constant.

𝛽521 = 11.588: If the weather were to change from Snow/sleet/hail to Rain/Mist, the multinomial

log-odds of preferring category No seat belt throughout drive to category Puts seat belt on before car is

moving would be expected to increase by 11.588 units while holding all other variables in the model

constant.

All other coefficients from the model can be explained like the above explained coefficients.

Similarly, the results from scenario 2 and 3 can be explained except that scenario 2 gives the log-odds of

preferring category Puts seat belt during driving to category Puts seat belt on before car is moving and

scenario 3 gives the log-odds of preferring category Puts seat belt during startup maneuver to category

Puts seat belt on before car is moving.

Scenario 2:

𝑙𝑜𝑔 (𝑃2

𝑃4) = −10.017∗ + 0.022𝑋1

∗ + 1.024𝑋2∗ + 0.020𝑋3 + 0.021𝑋4

∗ +

1.590 𝟏(𝑋5 = 𝑋51)∗ + 1.015 𝟏(𝑋5 = 𝑋52)∗ − 0.072 𝟏(𝑋6 = 𝑋61) + +0.325 𝟏(𝑋6 = 𝑋62) + 0.433 𝟏(𝑋6 = 𝑋63) − 0.0002𝑋7

(4)

Scenario 3:

𝑙𝑜𝑔 (𝑃3

𝑃4) = −0.171∗ − 0.010𝑋1 + 1.062𝑋2

∗ + 0.003𝑋3 − 0.009𝑋4∗ +

0.839 𝟏(𝑋5 = 𝑋51)∗ + 0.649 𝟏(𝑋5 = 𝑋52)∗ + 0.224 𝟏(𝑋6 = 𝑋61) − 0.160 𝟏(𝑋6 = 𝑋62) − 0.401 𝟏(𝑋6 = 𝑋63) − 0.0002𝑋7

(5)

The analysis results in this model provide answers for the research questions 1 and 3: The

multinomial logistic regression model can well represent and identify the significant variables for driver’s

seat belt buckling behaviors. The model interpretations in Equations 3–5 provide quantitative impacts on

the buckling behaviors when changing each variable, and the asterisk (*) in Equations 3–5 identify the

significant variables that have impacts on the buckling behaviors. The significance is determined by

16

choosing variable with the p-value smaller than 0.05 in the analysis of variance in multinomial logistic

regression.

Model - b) Linear regression model results:

The performance of all the models from Equation 2, were evaluated by calculating the Root Mean

Squared Error (RMSE), Mean Absolute Error (MAE), and Mean Absolute Percentage Error (MAPE)

values. To reduce variability, ten-fold cross-validation was performed using ten different partitions

(seeds) and then an average of the results was taken for each model. In addition, outliers with a Cook’s

distance greater than 0.5 were identified in every round and were removed from the data. The lower the

error values the better the performance of the model. Hence, the model with minimum mean RMSE,

MAE, and MAPE values was selected.

In addition, Box Cox transformation (Sakia, 1992) was applied on this model and Box Cox

transformed model was generated. Box Cox transformation is a way to relax the “normality” assumption

in response variables. The core of the Box Cox transformation is an exponent, lambda (𝜆), which varies

from -5 to 5. All values of λ are considered and the optimal value for the data is selected. The “optimal

value” is the one which results in the best approximation of a normal distribution curve.

AIC model selection criteria was applied on both these models. Resulted Normal model with a

minimum AIC value of 189.95 and Box Cox model with a minimum AIC value of -253.86 have included

only the Age and Sex input variables. These models will be referred as Normal final model and Box Cox

final model in this report.

In order to check the performance of these two models, twenty-fold cross-validation was

performed using twenty different partitions (seeds) and then average of RMSE, MAE, and MAPE values

were calculated for each model. Normal final model has the average error values of RMSE = 4.87, MAE

= 3.86, and MAPE = 27.86, and Box Cox final model has the average error values of RMSE = 4.89, MAE

= 3.65, and MAPE = 24.45. Minimum RMSE, MAE, and MAPE values were used in determining the best

partition (seed) for both Normal final model and Box Cox final model. These best seed models’ results are

interpreted and explained in the following sections.

17

Normal final model results:

The best partition (seed) of the Normal final model, with minimum error values of RMSE = 3.40,

MAE = 2.60, and MAPE = 17.33 gave the following results.

𝑌 = 14.163∗ + 0.044𝑋1 − 4.072𝑋2 (6)

𝛽1 = 0.044: If a subject were to increase his Age by one year while holding all other variables in

the model constant, the (buckle time) time taken by the driver to buckle the seat belt from the moment the

car started moving forward after the startup maneuver would be expected to increase by 0.044 seconds.

Box Cox final model results:

The best partition (seed) of the Box Cox final model, with minimum error values of RMSE =

3.72, MAE = 2.77, and MAPE = 16.90 gave the following results.

𝑌𝑏𝑜𝑥 𝑐𝑜𝑥 = 1.451∗ + 0.002𝑋1 − 0.091𝑋2∗ (7)

𝛽2 = −0.091: If the Sex were to change from female to male, while holding all other variables in

the model constant, the 𝑌𝑏𝑜𝑥 𝑐𝑜𝑥 would be expected to decrease by 0.091 units.

Also, for the better visualization of the model evaluation metrics, RMSE, MAE, and MAPE

values from the twenty-seeds of both “Normal” to “Box Cox” models were plotted. The box plot in

Figure 5, contains the twenty seeds error values from both these models.

18

Figure 5. Normal and Box Cox models twenty-fold evaluation metrics boxplots

Figure 6 is for the visualization of the Normal and Box Cox final models’ performance on the best

partition of the data, after eliminating outliers. The red circles represent the actual buckle time from the

training dataset on which the models were trained, the blue circles represent the actual buckle time from

the validation dataset on which the model performance was tested. The black asterisks represent the

predicted buckle time by the Normal model, and the purple square represent the predicted buckle time by

the Box Cox model. Every blue circle will have a corresponding black asterisk and purple square, as the

performance of models was tested only on the validation dataset.

Figure 6 shows most of the predictions approximate the real buckling time well, which

demonstrates the capability of the model for predicting the buckling time. This also reveals the potential

for developing a tailored seat belt buckling warning system for drivers with different characteristics, e.g.,

Sex and Age.

19

Figure. 6 Buckle time predictions of Normal and Box Cox models

The analysis results in this model provide answers for the research questions 2 and 3: The linear

regression model can quantify the driver’s seat belt buckling time, and the AIC helps select two important

variables, i.e., Age and Sex, that contribute to the seat belt buckling time.

4. Discussion

As these results show, this study provided new insights into driver buckling behavior from

vehicle entry, ignition, general shifting, forward shifting, and unbuckling at the end of the drive. Of

greatest interest was the timing from the forward shift point and the speed at seat belt closure. These data

can help better inform seat belt reminder designers of the most appropriate time and speed to alert a

driver.

Among 3,102 drives, drivers in our sample were buckled at a very high rate: 92.0%. This number

is nearly identical to 2017 driver buckling rates cited by the Iowa DOT, when 91.4% of drivers were

observed to be buckled (Larson, Fox, & Berg, 2017). In 70% of the drives for the current study, the

drivers were buckled prior to vehicle movement. For this study, buckling took place after ignition in 89%

of the drives, 70% of drivers buckled prior to vehicle movement, and about 31% were after gear selection.

20

This is in contrast to Malenfant and Van Houten (2008), who found that only 68.7% of drivers were

buckled after ignition and about 20% buckled after placing the vehicle in gear. The primary difference

between these studies is that Malenfant and Van Houten (2008) looked at between-subject observations,

and this study looked at within-subjects across repeated drives. This University of Iowa study was able to

capture more sustainable behavioral trends within drivers.

In terms of belt timing, initial shift to buckling duration showed a mean of 22.0 seconds with a

range from 2.8 to 92.4 seconds. Bao, Funkhouser, Sayer, and Toyoda (2016) found that their “late

bucklers” had a mean duration of 35.1 seconds. The difference between Bao et al. (2016) and the current

analysis could be explained in that Bao et al. (2016) had a larger sample size covering a wider age range.

It is also possible that because Bao et al. eliminated shorter drives with slower speeds, that could impact

the time duration observed. In our analysis, we observed that those that buckled after movement often put

the vehicle into reverse initially, however, did not buckle until traveling forward. As buckling in reverse

was rare in our sample and buckling occurred at lower speeds, we concentrated our focus on shifting into

forward gear to buckle rather than just the initial shift. The duration from shifting into forward gear to

buckling showed a mean of 16.2 seconds with a range from 3.7 to 92.4 seconds. When considering the

shift into forward gear, a majority of the drives (81%) were buckled under 20 seconds and, if increasing to

30 seconds, 91% of the sample would be buckled.

Speed at buckling while traveling forward showed a mean of 15 kph; 76% of the drives were

below 25 kph, with a range was from 2 kph to 51 kph. It must be kept in mind that there were seven out

of 130 drives (5%) when speed was unknown due to unavailable video data, and the impact these drives

had on the speed cannot be determined. It is possible the drivers were traveling at lower or higher speeds

than 15 kph, or even stopped when buckle occurred. In this study, drivers were buckled by the time they

reached 28 kph in 90% of the drives. Given that the age of the drivers for this sample was skewed toward

older drivers (nine aged > 65 and four 18 – 35), this may explain slower speeds. One would expect that,

among a larger sample of young and middle-aged drivers, the speeds might be higher.

21

We use the multinomial logistic regression to model the various contributors’ impact on the

drivers’ seat belt buckling behavior, and linear regression to model the various contributors’ impact on the

drivers’ seat belt buckling time. AIC model selection criteria was applied to select the most appropriate

contributors to construct the regression models. The key results from the analysis in Section 3.3 are

summarized below.

In the seat belt behavior analysis, the results in Equations 3-5 demonstrate that:

1) In a comparison between No seat belt throughout drive and Puts seat belt on before car is

moving, the Age, Height, Weight, Duration, Environment, and Weather are significant

contributors;

2) In a comparison between Puts seat belt during driving and Puts seatbelt on before car is moving,

the Age, Sex, Weight, Environment, and Weather are significant contributors;

3) In a comparison between Puts seat belt during startup maneuver and Puts seat belt on before car

is moving, the Age, Sex, Weight, Environment, and Weather are significant contributors.

For predicting the buckle time, the results in Equations 6-7 demonstrate that Age and Sex input

variables are selected as contributors after applying AIC model selection criteria. Meanwhile, the

regression analysis also shows that Sex is a significant input variable in the buckle time

prediction model.

As described, the logistic regression analysis performed to understand the driver’s seat belt

buckling behaviors provided some interesting results (Equations 3 to 5). The input variables Weather,

Height, and Weight have similar impact on the driver’s seat belt buckling behaviors. For example, if the

Weather changes from snow/sleet/hail to clear/cloudy or rain/mist, the log odds of drivers’ changing their

seat belt behavior from Puts seat belt on before car is moving to other three categories (Puts seat belt

during startup maneuver, Puts seat belt during driving, or No seat belt throughout drive) increase. This

implies that drivers tend to wear the seat belt before the car is moving when the weather condition has

snow/sleet/hail. Similarly, if the driver’s height increases the log odds of drivers changing their seat belt

22

behavior from Puts seat belt on before car is moving to one of the other three categories, this implies that

as their Height increases the drivers tend to not wear the seat belt before the car moves. These analysis

results provide solutions for the research questions 1 and 2.

In the linear regression analysis, just the two input variables Age and Sex were selected to predict

the buckle time. From the results obtained, we can infer that the buckle time increases with the increase in

Age, and the buckle time decreases when the driver’s Sex changes from female to male. The regression

analysis resulted a mean buckle time of 14 seconds, which is similar to Malenfant and Van Houten (2008)

who found a mean buckle time of 12 seconds. Moreover, mean buckle time of 14 seconds is less than the

mean buckle time of 22 seconds obtained from the descriptive statistics. This is because some buckling

data are treated as outliers in the regression analysis, which also reveals most of the outliers are for those

taking a long time to buckle. Thus, the regression analysis provides some complement to the observations

in descriptive statistics.

Finally, the results from the regression analysis demonstrate that the models developed can be

used to predict the driver’s seat belt buckling behaviors and buckling time from the driver’s demographics

and drive conditions. This prediction helps to provide customized reminders to individual drivers by

predicting their seat belt behaviors and buckle times. This can reduce the perception of nuisance alerts

and protects the drivers from higher speed unbuckled crashes.

4.1 Practical applications

The data presented in this paper and the primary takeaway suggest that a 30 second threshold

from forward shift and 25 kph could be used in seat belt reminder systems. This recommendation would

reduce perception of nuisance alerts and protect the driver from higher-speed unbuckled crashes. Using

such timing would ensure that drivers are buckled before they reach hire speeds where injuries become

more severe.

5. Limitations

Naturalistic driving studies have a number of advantages and disadvantages. While we have real world

observations, NDS data have no experimental control. What you see is what you get. NDS studies suffer

23

from this limitation in general and as a result, in order to compare like conditions, some drivers must be

excluded. Sample size, age of participants can therefore be an issue, too. However, this is balanced by the

sheer number of events captured. This study was also not specifically designed to examine seat belt

behavior, so more detailed controls were not possible. Although the video data provided the interior view

of the vehicle to identify the key points associated with buckling the seat belt, the data doesn’t allow for

studying unbuckling during the middle of the drive. We only sampled approximately the first and last

minute of driving in this NDS study, so we were not able to capture the speed or time duration at buckle

or how the placement of the shoulder belt changes after the first minute. We had seven instances out of

130 (5%) when drivers buckled after the video stopped recording. The impact of these seven drives is

unknown as we have no way of knowing what speed drivers were traveling at or how soon after that one

minute they buckled. When considering seat belt placement, this study did not examine how the belt

shifted when buckling prior to movement. It is possible that the shifting of the belt may take place when

drivers are turning their torso and traveling in reverse which is missing from our analysis. Additionally,

our sample size was small, with a majority of the population being 65 years of age and older. An

additional examination of a sample including a wider age range of drivers, as well as additional data

throughout the drive, would be useful in helping to quantify the timing and speeds necessary for the seat

belt safety interlocks, as well as identifying seat belt shifting.

24

Funding

This work was supported by the Toyota Collaborative Safety Research Center (CSRC) and Safety

Research using Simulation University Transportation Center (SAFER-SIM) [U.S. Department of

Transportation’s University Transportation Centers Program, grant number 69A3551747131]. However,

the U.S. government assumes no liability for the contents or use thereof.

25

References

Bao, S., Funkhouser, D.S., Sayer, S.R., & Toyoda, H. (2016). An examination and categorization of

unbelted driving behavior: Insights from naturalistic driving data analysis. Submitted by UMTRI to

TRB 2017 Annual Meeting.

Bao, S., Xiong, H., Buonarosa, M.L., & Sayer, J.R. (2015). Using naturalistic driving data to examine

drivers’ seatbelt use behavior: Comparison between teens and adults. Journal of Safety Research,

54, 69–73. https://doi.org/10.1016/j.jsr.2015.06.006

Campbell, B.J. (1987). Safety Belt Injury Reduction Related to Crash Severity and Front Seated Position.

The Journal of Trauma, 27(7), 733–739. https://doi.org/10.1097/00005373-198707000-00007

Carney, C., McGehee, D.V., Lee, J.D., Reyes, M.L., & Raby, M. (2010). Using an event-triggered video

intervention system to expand the supervised learning of newly licensed adolescent drivers.

American Journal of Public Health, 100(6), 1101–1106. https://doi.org/10.2105/AJPH.2009.165829

Dingus, T.A., Klauer, S.G., Neale, V.L., Petersen, A., Lee, S.E., Sudweeks, J., Perez, M.A., Hankey, J.,

Ramsey, D., & Gupta, S. (2006). The 100-car naturalistic driving study, Phase II-results of the 100-

car field experiment. No. DOT-HS-810-593. United States Department of Transportation. National

Highway Traffic Safety Administration.

Eby, D.W., Molnar, L.J., Kostyniuk, L.P., Shope, J.T., & Miller, L.L. (2004). Developing an Optimal In-

Vehicle Safety Belt Promotion System. Report No. UMTRI-2004-29. In University of Michigan,

Ann Arbor, Transportation Research Institute.

Evans, L. (1986). The effectiveness of safety belts in preventing fatalities. Accident Analysis &

Prevention, 18(3), 229–241. https://doi.org/10.1016/0001-4575(86)90007-2

Fildes, B., Fitzharris, M., Koppel, S., Vulcan, P., & Brooks, C. (2003). Benefits of seat belt reminder

systems. Annual Proceedings/Association for the Advancement of Automotive Medicine, 47, 253.

Hilbe, J.M. (2009). Logistic regression models. New York: CRC press.

26

Kidd, D.G., McCartt, A.T., & Oesch, N.J. (2014). Attitudes toward seat belt use and in-vehicle

technologies for encouraging belt use. Traffic Injury Prevention, 15(1), 10–17.

https://doi.org/10.1080/15389588.2013.792111

Krafft, M., Kullgren, A., Lie, A., & Tingvall, C. (2006). The use of seat belts in cars with smart seat belt

reminders—results of an observational study. Traffic Injury Prevention, 7(2), 125–129.

https://doi.org/10.1080/15389580500509278

Larson, J.M., Fox, J.R., & Berg, E. (2017). Iowa Seat Belt Use Survey 2017 Data Collection Methodology

report. http://www.iowadot.gov/mvd/stats/survey17.pdf

Malenfant, J.E.L., & Van Houten, R. (2008). Observations of how drivers fasten their seatbelts in relation

to various startup tasks. Accident Analysis & Prevention, 40(1), 309–314.

https://doi.org/10.1016/j.aap.2007.06.009

McGehee, D.V., & Roe, C. (2017). An Investigational Analysis of Driver Seat Belt Use, Timing, and

Position in Naturalistic Driving. Toyota Technical Report, CSRC. Ann Arobor, MI.

McGehee, D.V., Raby, M., Carney, C., Lee, J.D., & Reyes, M.L. (2007). Extending parental mentoring

using an event-triggered video intervention in rural teen drivers. Journal of Safety Research, 38(2),

215–227. https://doi.org/10.1016/j.jsr.2007.02.009

McGehee, D.V., Roe, C.A., Boyle, L.N., Wu, Y., Ebe, K., Foley, J., & Angell, L. (2016). The wagging

foot of uncertainty: data collection and reduction methods for examining foot pedal behavior in

naturalistic driving. SAE International Journal of Transportation Safety, 4(2), 289–294.

https://doi.org/10.4271/2016-01-1526

Mizruchi, M.S. (1996). What do interlocks do? An analysis, critique, and assessment of research on

interlocking directorates. Annual Review of Sociology, 22(1), 271–298.

https://doi.org/10.1146/annurev.soc.22.1.271

27

Najm, W.G., Ranganathan, R., Srinivasan, G., Smith, J.D., Toma, S., Swanson, E., & Burgett, A. (2013).

Description of light-vehicle pre-crash scenarios for safety applications based on vehicle-to-vehicle

communications (Final Report No. DOT HS 811 731). United States. National Highway Traffic

Safety Administration.

National Center for Statistics and Analysis. (2021). Seat belt use in 2020 – Overall results. (Traffic Safety

Facts Research Note. Report No. DOT HS 813 072). National Highway Traffic Safety

Administration.

NHTSA. (2018). Vehicles Involved by Speed Limit and Crash Type - State : USA, Year : 2018.

https://www-fars.nhtsa.dot.gov/Vehicles/VehiclesLocation.aspx

NHTSA. (2020). 2018 Traffic Safety Facts A Compilation of Motor Vehicle Crash Data. US Department

of Transportation, National Highway Traffic Safety Administration, Washington, D.C. Report No.

DOT HS 812 981.

Reagan, I.J., McClafferty, J.A., Berlin, S.P., & Hankey, J.M. (2013). Using naturalistic driving data to

identify variables associated with infrequent, occasional, and consistent seat belt use. Accident

Analysis & Prevention, 50, 600–607. https://doi.org/10.1016/j.aap.2012.06.008

Regan, M.A., Triggs, T.J., Young, K.L., Tomasevic, N., Mitsopoulos, E., Stephan, K., & Tingvall, C.

(2006). On-road evaluation of intelligent speed adaptation, following distance warning and seatbelt

reminder systems: Final results of the TAG SafeCar project. Monash University Accident Research

Centre Reports, 253, 270.

Richards, D.C. (2010). Relationship between speed and risk of fatal injury: pedestrians and car

occupants. London, UK: Department of Transport, Transport Research Laboratory.

Sakamoto, Y., Ishiguro, M., & Kitagawa, G. (1986). Akaike information criterion statistics. Dordrecht,

The Netherlands: D. Reidel. 1986; 81. Taylor & Francis.

Sakia, R.M. (1992). The Box‐Cox transformation technique: a review. Journal of the Royal Statistical

Society: Series D (The Statistician), 41(2), 169–178. https://doi.org/10.2307/2348250

28

Sato, T.B. (1987). Effects of seat belts and injuries resulting from improper use. The Journal of Trauma,

27(7), 754–758.

Williams, A.F., Wells, J.K., & Farmer, C.M. (2002). Effectiveness of Ford’s belt reminder system in

increasing seat belt use. Injury Prevention, 8(4), 293–296. http://dx.doi.org/10.1136/ip.8.4.293

Documents

Quantifying and recommending seat belt reminder timing