top of page

Spatiotemporal analysis on GIS to identify Crash-prone Road sections

  • Writer: Arpit Shah
    Arpit Shah
  • Jan 31, 2022
  • 13 min read

Updated: Dec 12

1. Introduction


Spatial proximity playing an important role in determining the outcome of a sporting incident - Hotspot technology in Cricket confirms that the ball has established contact with bat
Figure 1: Spatial proximity playing an important role in determining the outcome of a sporting incident - Hotspot technology in Cricket confirms that the ball has established contact with bat

Spatial relationships strongly influence real-world events—whether in sports, safety, or geopolitics. In cricket (Figure 1), for example, Hotspot technology determines whether the ball touched the bat based on thermal signatures and spatial proximity.


I often recall the childhood game Hot & Cold. You hide an object somewhere in the house; the seeker tries to find it, and you guide them with cues—Hot meaning close, Cold meaning far. The thrill lay equally in concealing the object and racing to discover it before time ran out. And of course, the seeker always protested if they felt the clues were misleading 😄.

Adding 'attributes', i.e. information to spatial positions (coordinates), helps make sense of  geopolitical events - this map depicts which entity controls each provincial territory in Afghanistan during the Taliban offensive of 2021. Source: BBC
Figure 2: Adding 'attributes', i.e. information to spatial positions (coordinates), helps make sense of geopolitical events - this map depicts which entity controls each provincial territory in Afghanistan during the Taliban offensive of 2021. Source: BBC

In Geographic Information Systems (GIS), spatial positions acquire meaning only when enriched with attributes. Consider Figure 2, which maps who controlled each province in Afghanistan during the 2021 Taliban offensive—coordinates alone say nothing; the attribute “controlling entity” tells the story.


In this post, I will demonstrate a real-world analogue of the Hot & Cold game—except here, the “clues” arise from statistics, not guesswork. Using Esri's ArcGIS Pro, an advanced Location Analytics platform, I perform Spatiotemporal Analysis of Vehicle Accidents in Brevard County, Florida (USA).


Much thanks to Lauren Scott Griffith & Lixin Huang for developing the original tutorial on Esri Learn ArcGIS.

SECTION HYPERLINKS



If you prefer a visual walkthrough, here is a 16-minute demonstration (I'd recommend you to read this post before watching it):

Video 1: Walkthrough of Location Analytics on Vehicle Crash Records

Spatiotemporal Analysis enables you to study data through both space (location) and time simultaneously. Before diving into the dual-dimension logic, you may benefit from two simpler examples—Spatial (Site Suitability Analysis for Wildlife Habitat) and Temporal (Detecting Ships in the Suez Canal).


For the crash analysis, I will use the following geoprocessing tools:


These two videos explain the methodology well:

Video 2: Space Time Cube in ArcGIS Pro
Video 3: Hot Spot Analysis in ArcGIS Pro

Interesting, isn't it? Now to begin demonstrating its use...

2. Setting Up the Datasets & Initiating Location Analytics


First, I load the dataset containing the coordinates and attributes of 100,000+ vehicle crashes recorded between 2010 and 2015 in Brevard County, Florida.

In addition to positional coordinates, the dataset contains attributes describing each crash—date and time, injuries, fatalities, contributing factors (alcohol, distraction, etc.), and weather conditions at the time of the incident (Figure 3 and 4). These are standard attributes collected by law-enforcement agencies worldwide, making this workflow broadly replicable.


Vehicle Crash Records Attributes Part I
Figure 3: Vehicle Crash Records Attributes Part I
Vehicle Crash Records Attributes Part II
Figure 4: Vehicle Crash Records Attributes Part II

Once loaded, the crash points can be plotted on a 2D map:


Plotting Vehicle Crash Records on a 2D map
Figure 5: Plotting Vehicle Crash Records on a 2D map

Alongside this, I load a second dataset: the digitized Road Network of Brevard County:


 A cross-section of the digitized Road Network layer of Brevard County
Figure 6: A cross-section of the digitized Road Network layer of Brevard County

To prepare the data for spatiotemporal analysis, ArcGIS restructures the crash points into a Space Time Cube—a collection of Bins representing fixed spatial and temporal intervals.


To explain using an analogy, just as we use Pivot Table tool in Microsoft Excel to convert raw data into a more meaningful tabular summary, similarly, ArcGIS Pro is able to restructure the location dataset into manageable spatiotemporal units for analysis, i.e. Bins, using the Create Space Time Cube by Aggregating Points geoprocessing tool.


Pivot Table tabular summary of raw Sales Data in Microsoft Excel
Figure 7: Pivot Table tabular summary of raw Sales Data in Microsoft Excel













Setting the parameters in the Create Space Time Cube by Aggregating Points geoprocessing tool
Figure 8: Setting the parameters in the Create Space Time Cube by Aggregating Points geoprocessing tool

Using the said tool (Figure 8), I specify:

  • Spatial interval: 2 miles

  • Temporal interval: 16 weeks


I choose not to map the cube immediately; instead, I review the output summary (Figure 9 below), which outlines how the crash data was reorganized into bins. The cube is saved on my system for the next step. Also, in the last phase of this demonstration, I will visually render and explain the processed Space Time Cube results in detail.

Output Summary upon running the Space Time Cube geoprocessing tool on Crash Records data
Figure 9: Output Summary upon running the Space Time Cube geoprocessing tool on Crash Records data

With the Space Time Cube ready, I now run the Emerging Hot Spot Analysis tool, also known as Space Time Pattern Mining.

Setting the parameters in the Emerging Hot Spot Analysis Geoprocessing tool
Figure 10: Setting the parameters in the Emerging Hot Spot Analysis Geoprocessing tool

This output does not simply show crash density. Instead, it evaluates the trend of crashes within each bin—a 2 miles cross-section over 16 weeks of time in this case—based on both:

  • patterns within the bin, and

  • patterns in neighboring bins (spatially and temporally).

The trend for each bin falls into one of eight categories (description in Figure 11 below): New, Consecutive, Intensifying, Persistent, Diminishing, Sporadic, Oscillating or Historical Hot/Cold Spots.



Description of the 8 types of Hot Spot categories. Source: ArcGIS Pro Geoprocessing Tool Reference
Figure 11: Description of the 8 types of Hot Spot categories. Source: ArcGIS Pro Geoprocessing Tool Reference

Next up, I will deploy the Emerging Hot Spot Analysis tool on the Count of Total Road Crashes over a single Neighborhood Time Step. Technical note on Neighborhood Time Step below:


Technical Note on Neighborhood Time Step Statistics. Source: ArcGIS Pro Community
Figure 12: Technical Note on Neighborhood Time Step Statistics. Source: ArcGIS Pro Community

Running the Emerging Hot Spot Analysis tool on the Count of Total Road Crashes over a single Neighborhood Time Step dataset yields this map:

Output generated upon running Emerging Hot Spot Analysis tool on Total Count of Road Crash Records over a single Neighborhood Time Step (the parameter)
Figure 13: Output generated upon running Emerging Hot Spot Analysis tool on Total Count of Road Crash Records over a single Neighborhood Time Step (the parameter)

Alongside the map, a Summary of Results table is also generated (Figure 14 below) which breaks down the 221 bins:

  • 2 New Hot Spots

  • 17 Consecutive Hot Spots

  • 59 Sporadic Hot Spots

  • 13 Oscillating Hot Spots

  • 23 Persistent Cold Spots

  • 18 Diminishing Cold Spots

  • 3 Sporadic Cold Spots


These account for 135 Bins - the remaining 86 Bins are neither Hot Spots nor Cold Spots i.e. show no statistically significant pattern. (Figure 11 contains the description of each of these Hot Spot types)

Emerging Hot Spot Analysis tool's Summary of Results table
Figure 14: Emerging Hot Spot Analysis tool's Summary of Results table

The most common category (59 Bins)—Sporadic Hot Spot—indicates inconsistent yet recurring crash activity that isn’t strong enough to reflect a persistent trend.


However, for an analyst trying to find truly risk-prone zones, the focus should be on:

  • New Hot Spots (2 Bins)

  • Persistent Hot Spots (none in this dataset)

  • Intensifying Hot Spots (also none)

You may have observed that I have not factored in Brevard County's Road Network dataset in the Space Time Cube and Emerging Hot Spot Analysis steps. This is because the spatial distance between two bins (2 miles) used in that dataset is Euclidean, i.e., straight-line distance - it does not represent real-world travel distance i.e. using roads.


Figures 15 and 16 illustrate the difference between Euclidean and Road-Network-based service areas:

Euclidean coverage from a defined starting point for given drive times (15, 30 and 50 minutes) assumes the form of concentric circles
Figure 15: Euclidean coverage from a defined starting point for given drive times (15, 30 and 50 minutes) assumes the form of concentric circles
In comparison, Road Network coverage from a defined starting point for a given drive time would be irregular in shape as road lengths and connectivity are not linear in all directions. While not as beautiful as the Euclidean representation, this is an accurate depiction of how far one would get after driving in any direction for a given period of time (45 mins)
Figure 16: In comparison, Road Network coverage from a defined starting point for a given drive time would be irregular in shape as road lengths and connectivity are not linear in all directions. While not as beautiful as the Euclidean representation, this is an accurate depiction of how far one would get after driving in any direction for a given period of time (45 mins)

To make our crash-trend interpretation realistic, the Road Network must now be incorporated into the analysis. Before doing that though, I must fix an important issue.

Some crash points do not lie directly on roads (observe Figure 17).


Reasons could include:

  • the recorded point marks the final position of the vehicle, not the collision point,

  • GPS inaccuracies.


Regardless of cause, these points must be corrected, or the Hot Spot Analysis will be distorted.


Anomalies in Crash locations - certain spots are outside the Road boundary
Figure 17: Anomalies in Crash locations - certain spots are outside the Road boundary
Snap geoprocessing tool will help move the inaccurate records to on top of the Road Network
Figure 18: Snap geoprocessing tool will help move the inaccurate records to on top of the Road Network

Using the Snap geoprocessing tool (Figure 18), I reposition all crashes within 0.25 miles onto the nearest road. Points farther away are assumed erroneous and ignored.









The corrected output (Figure 19) shows all crash locations now properly aligned to the road network:


All Crash sites on top of Roads after running the Snap tool (compare it with Figure 17)
Figure 19: All Crash sites on top of Roads after running the Snap tool (compare it with Figure 17)
Setting the parameters in the Spatial Join geoprocessing tool
Figure 20: Setting the parameters in the Spatial Join geoprocessing tool





Next, I use Spatial Join geoprocessing tool to integrate the Crash locations dataset with the Road Network dataset.





Road Crash data 'joined' with the Road Network data as depicted in this pop-up - total 4 crashes occurred at the highlighted intersection
Figure 21: Road Crash data 'joined' with the Road Network data as depicted in this pop-up - total 4 crashes occurred at the highlighted intersection



As evident in Figure 21, the Crash locations dataset is now linked to the Road Network dataset.


I am now ready to run the Hot Spot Analysis tool again...

Or am I?...


Using the Calculate Field geoprocessing tool, I will compute the Crash Rate per mile per year
Figure 22: Using the Calculate Field geoprocessing tool, I will compute the Crash Rate per mile per year

Actually, no. One final adjustment remains.


Longer roads naturally accumulate more crashes, which would bias the Hot Spot output. To avoid this, I compute a normalized metric:

Crash Rate per mile, per year.


Using the Calculate Field tool (Figure 22), I create a new field (Figure 23) that expresses crash frequency relative to road length—ensuring a fair comparison across the network.


Newly-computed Crash Rate per mile per year column (Crash_Rate) added to the Attribute Table
Figure 23: Newly-computed Crash Rate per mile per year column (Crash_Rate) added to the Attribute Table

3. Performing Advanced Location Analytics using Getis-Ord Gi* Hot Spot method


Now that all irregularities have been resolved, I am finally ready to run Hot Spot Analysis again. This time, instead of using the Emerging Hot Spot Analysis tool, I will use the Hot Spot Analysis (Getis-Ord Gi*) tool. The Getis-Ord Gi* statistic allows me to spatially restrict the Hot Spot Analysis to the Road Network itself (Figure 26), rather than across the 2-mile Euclidean extent of each bin as before (Figure 13).


Before running the tool, I also want to assign weight not only to the exact location where a crash occurred but to the entire stretch of road where the sequence of events would have unfolded— from the point where a driver first recognized an obstacle to the point of collision. This provides a far more realistic representation of a crash-prone road segment.


Description of the Getis-Ord Gi* Hot Spot Analysis geoprocessing tool
Figure 24: Description of the Getis-Ord Gi* Hot Spot Analysis
Getis-Ord Gi* Hot Spot Analysis Geoprocessing tool parameters
Figure 25: Getis-Ord Gi* Hot Spot Analysis geoprocessing tool parameters

The length of this crash sequence—technically the Impedance Distance Cutoff—is set to 110 meters, equivalent to the length of an American football field. This corresponds to the minimum stopping-sight distance for a vehicle traveling at 45 mph, and will be used in the tool’s Conceptualization of Spatial Relationships parameter (Figure 25) to derive spatial weights.

(Technical note on generating spatial weights for network datasets is available here)











Upon running the Getis-Ord Gi* tool, the resulting Hot Spots now sit directly on the Road Network—a major improvement over the Euclidean output of the earlier Emerging Hot Spot Analysis (Figure 13):


Cross-sectional view of the output upon running Getis-Ord Gi* Hot Spot Analysis Geoprocessing tool
Figure 26: Cross-sectional view of Hot Spots derived using Getis-Ord Gi*

While Figure 26 depicts the hot spots derived from statistically analyzing all the Crash sites within Brevard County, let me refine the analysis by running the Hot Spot tool only on crashes that resulted in fatalities. The statistical method remains the same; only the input attribute field changes.


As expected, the Hot Spot output shifts noticeably. ArcGIS Pro allows me to compare both outputs side-by-side:


Comparing the All Crashes Hot Spot output (left) and Crashes that led to Fatalities Hot Spot output (right) - this is a cross-sectional view and not the entire output
Figure 27: Cross-sectional view - Hot Spots for All Crashes (left) vs Fatal Crashes (right)

This comparison is revealing. Several new Hot Spots appear in the fatal-crash output (right) that were diluted within the larger set of all crashes. These represent high-risk road sections that warrant close attention from analysts and safety authorities.

Similarly, I repeat the process for crashes where the driver was under the influence of alcohol, and again compare it to the “All Crashes” output.


Cross-sectional view of Hot Spots of All Crashes (left) and Hot Spots of Crashes where the driver was under the influence of Alcohol (right)
Figure 28: All Crashes (left) vs Alcohol-related Crashes (right)

The alcohol-related Hot Spots cluster along the road running parallel to the river—an insight that could help law enforcement identify patterns of late-night riverside gatherings or drinking activity.


This form of spatiotemporal analysis can therefore support multiple stakeholders. Municipal authorities might prioritize widening fatality-prone segments, while police departments could ramp up checks near riverside pubs.


By cleaning and preparing the data carefully upfront, I also highlight an essential point: high-quality inputs are critical for producing accurate spatial intelligence outputs.

4. Modeling the Workflow to automate the Analysis


Modern GIS platforms are extremely dynamic. Let me now demonstrate how to analyze the crash records faster and in greater depth.


A natural next question is: During which hours of the day do crashes peak?


ArcGIS Pro allows me to generate Charts and Tables directly. Below is a line chart comparing three attributes: crash count, hour of day, and day of week.


Line Chart generated by GIS itself capturing Crash count, Hour of the day, and Day of the Week
Figure 29:  Line Chart of Crash Count vs Hour of Day vs Day of Week

Do you sense any patterns?


Now let me adjust the symbology—


Line Chart with New Symbology depicting Crash count, Hour of the Day and Day of the Week
Figure 30: Line Chart with enhanced symbology

Try once again. What can you assess?

The insight becomes clear: Crash incidents spike between 3 pm and 5 pm, especially on weekdays.

Given this discovery, I now run a new Hot Spot Analysis specifically for this peak crash window.


Instead of manually re-running every individual step, I automate the entire workflow using a ModelBuilder Analysis Model.


Analysis Model to compute the Hot Spot (Getis-Ord Gi*) during the Peak Crash time frame that I've discovered
Figure 31: Analysis Model to compute Gi* Hot Spots for the peak crash period

Although the model (Figure 31) may appear complex at first glance, it is simply a graphical representation of the workflow you have already seen. It automates four major steps:


the manual Geoprocessing Tool - Create Day/Time Hot Spot Map which will be automated in the Geo-Analysis model
Figure 32: The Create Day/Time Hot Spot Map tool which is automated in the Geo-Analysis model

  1. Selecting the crash attribute of interest

  2. Snapping outlier crash points onto the correct road centerlines

  3. Standardizing crash counts by computing crash rate per mile per year

  4. Running the Getis-Ord Gi* Hot Spot Analysis


Such models are not difficult to configure, and often do not require any programming background. Their true power lies in enabling analysts to rapidly replicate workflows accurately and error-free, saving enormous time and effort.


After running the model for the peak crash window, I compare the results with the “All Crashes” Hot Spot output:








Cross-sectional view of All Crashes Hot Spots (left) compared to the Peak Crashes time frame Hot Spots (right)
Figure 33: Cross-sectional view of All Crashes Hot Spots (left) vs Peak-hour Hot Spots (right)

Once again, new Hot Spots emerge, and these locations deserve detailed investigation. They may correspond to busy commercial corridors or high-traffic commuter routes. Safety interventions such as signals, signage, speed breakers, or traffic controllers may be appropriate.

5. Visualizing the Results of Spatiotemporal Analysis in a 3D Scene


Now for my favorite part—and perhaps the most striking segment of this study: visualizing spatiotemporal patterns in 3D.


And no, you don’t need 3D glasses 😎.


So far, you have seen Hot Spots on a flat 2D map. While this reveals where crashes cluster, it does not show how these Hot Spots evolve across the six-year period of the dataset. A 3D scene allows us to visualize both space and time simultaneously.


First, I create Yearly Hot Spot Maps for each year from 2010 to 2015 (temporal analysis). To avoid manually repeating the workflow six times, I generate it using the Analysis Model (Figure 34):


This Analysis Model will allow me to create Yearly Crash Hot Spot Maps for each of the six years using any Crash Attribute as the parameter
Figure 34: Model to generate Yearly Crash Hot Spot Maps for any selected crash attribute

Next, I build another model (Figure 35 below) to perform Hot Spot Analysis specifically for the peak crash time window (3–5 pm on weekdays). The Yearly Hot Spot Maps produced earlier serve as inputs to this model (see the yellow input boxes).


This Model will allow me to perform Hot Spot Analysis for the Peak Crash time frame using the Yearly Hot Spot Maps generated in the previous Model as inputs
Figure 35: This Analysis Model allows me to perform Hot Spot Analysis for the Peak Crash times using the previously generated Yearly Hot Spot Maps as inputs

Finally, I render the spatiotemporal output—year-on-year Hot Spots for weekday crashes between 3-5 pm—in a 3D Map Scene in ArcGIS Pro.


Spatiotemporal rendition of the Hotspots for All Crashes during Peak time frame for each of the six years on a 3D Map Scene in ArcGIS Pro
Figure 36: Spatiotemporal rendition of the Hot Spots for All Crashes during the peak time frame for each of the six years on a 3D Map Scene in ArcGIS Pro

The output in Figure 36 may appear daunting at first glance. Let me walk you through what it depicts and how to interpret it. Refer to the highlighted portion in Figure 37 below-


This highlighted stack is representative of a 'Sporadic' Hot Spot
Figure 37: This highlighted stack is representative of a 'Sporadic' Hot Spot

At the highlighted intersection on Prospect Ave, you are seeing a Sporadic Hot Spot (refer to the earlier description). Recall that this Hot Spot type was the most prevalent in Brevard County when we ran the Emerging Hot Spot Analysis tool.


The interpretive pattern is as follows:

  • The first year (2010) displays a Hot Spot at the bottom of the stack. The dark red shade indicates a highly statistically significant Hot Spot (99% confidence).

  • In the second year (2011), the Hot Spot disappears completely—classified as not statistically significant, as shown in the legend.

  • In the third year (2012), the Hot Spot reappears with weaker statistical significance (light red shade representing 95% confidence).

  • In the fourth year (2013), it disappears again.

  • In the fifth year (2014), it reappears with maximum intensity.

  • And finally, in the sixth year (2015), the Hot Spot disappears once more.


This alternating pattern—appearing, disappearing, and reappearing with varying confidence levels—is characteristic of a Sporadic Hot Spot. I hope this clarifies how to interpret a spatiotemporal stack and appreciate its analytical value.

Now consider the next Hot Spot stack:


This Hot Spot stack represents a Persistent type as it is highly Statistically Significant (99% confidence) across all the six years (2010-2015) - this is a prime example of a Crash-prone Road Section and should be resolved on priority
Figure 38: This Hot Spot stack represents a 'Persistent' type as it is highly statistically significant (99% confidence) across all six years (2010–2015). This is a prime example of a crash-prone road section and should be addressed on priority.

This is the classic pattern of a Persistent Hot Spot—consistently significant, consistently risky, and carrying a clear call for intervention.

And finally, this one-

This Hot Spot stack represents a  No Pattern Detected type as Hot Spot appears only in the fourth year (2013) where it has a moderately-high Statistical Significance (95% confidence)
Figure 39: This Hot Spot stack represents a 'No Pattern Detected' type as a Hot Spot appears only in the fourth year (2013), with moderately high statistical significance (95% confidence).

Because the Hot Spot occurs only once across the entire six-year series and shows no recurring or intensifying trend, it falls under No Pattern Detected.


Thank you for taking the time to explore this detailed demonstration. I hope you enjoyed navigating through the spatiotemporal interpretation of crash Hot Spots. As always, feel free to share your feedback.


Spatiotemporal rendition of Road Crash Hotspots over a period of six years during Peak time frame. Cross-sectional view of a 3D Scene in ArcGIS Pro.
Figure 40: Spatiotemporal rendition of Road Crash Hot Spots across six years during the peak time frame. Cross-sectional view of a 3D Scene in ArcGIS Pro.

ABOUT US - OPERATIONS MAPPING SOLUTIONS FOR ORGANIZATIONS


Intelloc Mapping Services, Kolkata | Mapmyops.com offers a suite of Mapping and Analytics solutions that seamlessly integrate with Operations Planning, Design, and Audit workflows. Our capabilities include — but are not limited to — Drone Services, Location Analytics & GIS Applications, Satellite Imagery Analytics, Supply Chain Network Design, Subsurface Mapping and Wastewater Treatment. Projects are executed pan-India, delivering actionable insights and operational efficiency across sectors.


My firm's services can be split into two categories - Geographic Mapping and Operations Mapping. Our range of offerings are listed in the infographic below-

Range of solutions that Intelloc Mapping Services (Mapmyops.com) offers
Range of solutions that Intelloc Mapping Services (Mapmyops.com) offers

A majority of our Mapping for Operations-themed workflows (50+) can be accessed from this website's landing page. We respond well to documented queries/requirements. Demonstrations/PoC can be facilitated, on a paid-basis. Looking forward to being of service.


Regards,

Mapmyops I Intelloc Mapping Services

Mapmyops
  • LinkedIn Social Icon
  • Facebook
  • Twitter
  • YouTube
Intelloc Mapping Services - Mapmyops.com
bottom of page