TWI786902B - Device, method and computer program product for exploration of potential event hotspot - Google Patents

Device, method and computer program product for exploration of potential event hotspot Download PDF

Info

Publication number
TWI786902B
TWI786902B TW110139664A TW110139664A TWI786902B TW I786902 B TWI786902 B TW I786902B TW 110139664 A TW110139664 A TW 110139664A TW 110139664 A TW110139664 A TW 110139664A TW I786902 B TWI786902 B TW I786902B
Authority
TW
Taiwan
Prior art keywords
event
target
prospecting
exploration
objects
Prior art date
Application number
TW110139664A
Other languages
Chinese (zh)
Other versions
TW202318220A (en
Inventor
廖士權
謝欣翰
林佳宏
Original Assignee
中華電信股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 中華電信股份有限公司 filed Critical 中華電信股份有限公司
Priority to TW110139664A priority Critical patent/TWI786902B/en
Application granted granted Critical
Publication of TWI786902B publication Critical patent/TWI786902B/en
Publication of TW202318220A publication Critical patent/TW202318220A/en

Links

Images

Abstract

A device, a method, a computer program product for exploration of potential event hotspot are provided. A plurality of exploration target objects, respective characteristics of the plurality of exploration target objects, and respective events of the plurality of exploration target objects are collected. The plurality of exploration target objects are grouped according to the respective characteristics of the plurality of exploration target objects. Exploration target objects in each group are divided into areas according to the respective events of the plurality of the exploration target objects. Exploration target objects in each area in the each group each group in each group are explored according to a risk index and time of occurrence, so as to obtain representative event characteristics of the exploration target objects in the each area in the each group. The representative event characteristics of the exploration target objects in the each area in the each group are compared with each other such that an exploration target object having potential event hotspot is obtained.

Description

用於潛在事件熱點探勘之設備、方法以及電腦程式產品 Apparatus, method and computer program product for potential event hot spot detection

本發明係關於一種資料處理探勘技術,詳而言之,係關於潛在事件熱點探勘之設備以及其執行之方法以及電腦程式產品。 The present invention relates to a data processing and exploration technology, and in particular, relates to equipment for potential event hotspot exploration, a method for its execution, and a computer program product.

隨著科技進步,利用數據來分析道路事故發生的趨勢隨之增加,習知方法是針對已發生事故的路段擷取時間、天氣、地點等等,來分析危險程度,藉此提醒車輛駕駛者那些路段是事故發生熱點。然而,對於完全沒有發生過事故的路段卻無法發出預警。 With the advancement of technology, the use of data to analyze the trend of road accidents has increased. The conventional method is to capture the time, weather, location, etc. of the road where the accident has occurred to analyze the degree of danger, thereby reminding the driver of those accidents. Road sections are accident hotspots. However, early warning cannot be issued for road sections where accidents have not occurred at all.

因此,如何進行資料探勘以快速且精準地找到潛在事件,為業界待解決的課題。 Therefore, how to conduct data mining to quickly and accurately find potential events is a problem to be solved in the industry.

為解決上述問題及其他問題,本發明揭示一種潛在事件熱點探勘之設備及其執行之方法以及電腦程式產品。 In order to solve the above problems and other problems, the present invention discloses a potential event hotspot detection device, its execution method and computer program product.

本發明之潛在事件熱點探勘之設備,係包括:探勘目標資料蒐集模組,蒐集複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探勘目標對象之各自的事件;分群模組,根據該複數探勘目標對象之各自的特徵,對該複數探勘目標對象進行分群;事件特徵探勘模組,根據該複數探勘目標對象之各自的事件,對每一群組中的探勘目標對象進行分區,以根據危險指標及發生時間,對該每一群組中每一區的探勘目標對象進行探勘,進而獲得該每一群組中該每一區的探勘目標對象的具代表性之事件特徵;以及事件特徵比對模組,將該每一群組中該每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。 The equipment for potential event hotspot exploration of the present invention includes: an exploration target data collection module, which collects multiple exploration target objects, the respective characteristics of the multiple exploration target objects, and the respective events of the multiple exploration target objects; According to the respective characteristics of the plurality of prospecting objects, the plurality of prospecting objects are grouped; the event feature mining module partitions the prospecting objects in each group according to the respective events of the plurality of prospecting objects, According to the risk indicators and time of occurrence, survey the exploration target objects in each area in each group, and then obtain the representative event characteristics of the exploration target objects in each area in each group; and The event feature comparison module compares the representative event features of the prospecting targets in each region in each group to obtain prospecting targets with potential event hotspots.

於一實施例中,該分群模組係自該複數探勘目標對象中選取多個探勘目標對象作為中心點,根據該複數探勘目標對象的特徵,計算未被選取的探勘目標對象與該多個中心點之間各自的距離,以將該未被選取的探勘目標對象與距離最近的中心點劃分為同一群,接著計算該同一群中每一探勘目標對象與該同一群中其他探勘目標對象之間的距離和,將具有最小距離和之探勘目標對象選取為該同一群中的新中心點,再以該新中心點重新進行計算直到該新中心點不再更動為止。 In one embodiment, the grouping module selects a plurality of prospecting objects from the plurality of prospecting objects as center points, and calculates the relationship between the unselected prospecting objects and the plurality of centers according to the characteristics of the plurality of prospecting objects. The respective distances between the points, so as to divide the unselected prospecting target object and the nearest central point into the same group, and then calculate the distance between each prospecting target object in the same group and other prospecting target objects in the same group The distance sum of , select the prospecting target object with the minimum distance sum as the new center point in the same group, and then recalculate with the new center point until the new center point does not change.

於一實施例中,該事件特徵探勘模組係根據該複數探勘目標對象之各自的事件是否已發生目標事件,對該每一群組中的探勘目標對象進行分區,以發生該目標事件前的一時間區間為窗口對已發生目標事件的探勘目標對象之各自的事件進行取樣,且以該時間區間作為滑動窗口對未發生目標事件的探勘目標對象之各自的事件進行取樣。 In one embodiment, the event feature mining module partitions the mining target objects in each group according to whether the respective events of the plurality of mining target objects have occurred the target event, so as to obtain the number of events before the target event occurs A time interval is used as a window to sample the respective events of the survey target objects that have occurred the target event, and the respective events of the survey target objects that have not occurred the target event are sampled using the time interval as a sliding window.

於一實施例中,該事件特徵探勘模組係以長度為1的危險指標及其對應的時間區間對該已發生目標事件的探勘目標對象之經取樣的事件進行探勘,獲得該已發生目標事件的探勘目標對象的有效事件特徵,再逐漸增加該危險指標的長度,獲得該已發生目標事件的探勘目標對象的具代表性之事件特徵,且以長度為1的危險指標及其對應的時間區間對該未發生目標事件的探勘目標對象之經取樣的事件進行探勘,獲得該未發生目標事件的探勘目標對象之各自的有效事件特徵,再逐漸增加該危險指標的長度,獲得該未發生目標事件的探勘目標對象之各自的具代表性之事件特徵。 In one embodiment, the event feature detection module uses the risk index with a length of 1 and its corresponding time interval to detect the sampled events of the detection target object of the target event that has occurred, and obtain the target event that has occurred The effective event characteristics of the exploration target object, and then gradually increase the length of the risk index, to obtain the representative event characteristics of the exploration target object that has occurred the target event, and the length of the risk index is 1 and its corresponding time interval Prospecting for the sampled events of the exploration target object that has not occurred the target event, obtaining the respective effective event characteristics of the exploration target object that has not occurred the target event, and then gradually increasing the length of the risk index to obtain the non-occurrence target event The respective representative event characteristics of the prospecting target objects.

於一實施例中,該事件特徵比對模組係針對該具潛在事件熱點之探勘目標對象,評估該具潛在事件熱點之探勘目標對象發生該目標事件的風險。 In one embodiment, the event feature comparison module evaluates the risk of the target event occurring on the prospecting target object with potential event hotspots for the prospecting target object with potential event hotspots.

於一實施例中,該複數探勘目標對象係為複數道路資料,該複數探勘目標對象之各自的特徵係為該複數道路資料之各自的道路特性,而該複數探勘目標對象之各自的事件係包括由車機編號、危險指標、回報時間、是否發生事故所組成的集合。 In one embodiment, the plurality of prospecting objects are plural road data, the respective characteristics of the plurality of prospecting objects are the respective road characteristics of the plurality of road data, and the respective events of the plurality of prospecting objects include A set consisting of vehicle number, hazard indicator, return time, and whether an accident occurred.

於一實施例中,本發明之設備進一步包括:事件熱點預警模組,根據對應於已發生事故以及潛在事故發生熱點的道路資料之基地台識別碼,對車機發出預警。 In one embodiment, the device of the present invention further includes: an event hotspot early warning module, which issues an early warning to the vehicle according to the base station identification code corresponding to the road data of the hot spots of the accidents and potential accidents.

本發明之由設備所執行之用於潛在事件熱點探勘之方法,係包括:蒐集複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探勘目標對象之各自的事件;根據該複數探勘目標對象之各自的特徵,對該複數探勘目標對象進行分群;根據該複數探勘目標對象之各自的事件,對每一群組中 的探勘目標對象進行分區;根據危險指標及發生時間,對該每一群組中每一區的探勘目標對象進行探勘,獲得該每一群組中該每一區的探勘目標對象的具代表性之事件特徵;以及將該每一群組中該每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。 The method for potential event hotspot detection performed by the device of the present invention includes: collecting a plurality of exploration target objects, respective characteristics of the plurality of exploration target objects, and respective events of the plurality of exploration target objects; according to the plurality of exploration target objects According to the respective characteristics of the target objects, the plurality of prospecting target objects are grouped; according to the respective events of the plurality of prospecting target objects, the According to the risk index and occurrence time, the exploration target objects in each area in each group are explored, and the representative data of the exploration target objects in each area in each group are obtained. and comparing the representative event characteristics of the exploration target objects in each area in each group to obtain exploration target objects with potential event hotspots.

於一實施例中,所述分群包括:(1)自該複數探勘目標對象中選取多個探勘目標對象作為中心點;(2)根據該複數探勘目標對象之各自的特徵,計算未被選取的探勘目標對象與該多個中心點之間各自的距離,以將該未被選取的探勘目標對象與距離最近的中心點劃分為同一群;(3)計算該同一群中每一探勘目標對象與該同一群中其他探勘目標對象之間的距離和,以將具有最小距離和的探勘目標對象選取為該同一群中的新中心點;以及(4)以該新中心點重複步驟(2)和(3),直到該新中心點不再更動為止。 In one embodiment, the grouping includes: (1) selecting a plurality of prospecting objects from the plurality of prospecting objects as center points; (2) calculating the unselected The respective distances between the exploration target object and the plurality of center points, so that the unselected exploration target object and the nearest center point are divided into the same group; (3) calculate the distance between each exploration target object in the same group and The sum of distances between other prospecting target objects in the same group, to select the prospecting target object with the minimum distance sum as the new central point in the same group; and (4) repeat steps (2) and (3), until the new center point no longer changes.

於一實施例中,所述分區包括:根據該複數探勘目標對象之各自的事件是否已發生目標事件,對每一群組中的探勘目標對象進行分區;針對已發生目標事件的探勘目標對象,以發生該目標事件前的一預定時間區間為窗口,對該已發生目標事件的探勘目標對象的各自的事件進行取樣;以及針對未發生目標事件的探勘目標對象,以該預定時間區間作為滑動窗口,對該未發生目標事件的探勘目標對象的各自的事件進行取樣。 In one embodiment, the partitioning includes: partitioning the prospecting target objects in each group according to whether the respective events of the plurality of prospecting target objects have occurred a target event; for the prospecting target objects that have occurred a target event, Taking a predetermined time interval before the occurrence of the target event as a window, sampling the respective events of the prospecting target objects that have occurred the target event; and using the predetermined time interval as a sliding window for the prospecting target objects that have not occurred the target event , to sample the respective events of the prospecting target object for which no target event occurs.

於一實施例中,所述探勘包括:針對該已發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得該已發生目標事件的探勘目標對象的有效事件特徵,再逐漸增加該危險指標的長度,獲得該已發生目標事件的探勘目標對象的具代表性之事件特徵;以及針對該未發生目標事件的探勘目標對象之經取樣的事件,以長度為1 的危險指標及其對應的時間區間進行探勘,獲得該未發生目標事件的探勘目標對象之各自的有效事件特徵,再逐漸增加該危險指標的長度,獲得該未發生目標事件的探勘目標對象之各自的具代表性之事件特徵。 In one embodiment, the prospecting includes: for the sampled events of the prospecting target object of the target event that has occurred, perform prospecting with a risk indicator with a length of 1 and its corresponding time interval, and obtain the target event that has occurred Exploring the effective event characteristics of the target object, and then gradually increasing the length of the risk index, to obtain the representative event characteristics of the exploration target object that has occurred the target event; event, with length 1 The risk indicators and their corresponding time intervals are explored to obtain the respective effective event characteristics of the exploration target objects that have not occurred the target event, and then gradually increase the length of the risk index to obtain the respective effective event characteristics of the exploration target objects that have not occurred the target event. representative event features.

於一實施例中,所述比對包括:針對該具潛在事件熱點之探勘目標對象,評估該具潛在事件熱點之探勘目標對象發生目標事件的風險。 In one embodiment, the comparison includes: with respect to the prospecting target object with potential event hotspots, evaluating the risk of occurrence of a target event for the prospecting target object with potential event hotspots.

於一實施例中,該複數探勘目標對象係為複數道路資料,該複數探勘目標對象之各自的特徵係為該複數道路資料之各自的道路特性,而該複數探勘目標對象之各自的事件係包括由車機編號、危險指標、回報時間、是否發生事故所組成的集合。 In one embodiment, the plurality of prospecting objects are plural road data, the respective characteristics of the plurality of prospecting objects are the respective road characteristics of the plurality of road data, and the respective events of the plurality of prospecting objects include A set consisting of vehicle number, hazard indicator, return time, and whether an accident occurred.

於一實施例中,該方法包括:根據對應於已發生事故以及潛在事故發生熱點的道路資料之基地台識別碼,對車機發出預警。 In one embodiment, the method includes: issuing an early warning to the vehicle according to the base station identification code corresponding to the road data corresponding to the hotspots of the accidents and potential accidents.

本發明之電腦程式產品,經由電腦載入程式後執行上述之用於潛在事件熱點探勘之方法。 The computer program product of the present invention executes the above-mentioned method for potential event hotspot detection after being loaded into the program by the computer.

換言之,本發明之潛在事件熱點探勘之設備及其執行之方法以及電腦程式產品能依據探勘目標對象之事件發生頻率、發生時間相互關係、發生時間間隔及範圍等,藉此探勘出事件特徵,再依據已發生目標事件之探勘目標對象之具代表性之事件特徵,找出未發生目標事件但具潛在事件熱點之探勘目標對象,另針對已發生目標事件和具潛在事件熱點之探勘目標對象進行警示,俾有效降低目標事件發生的頻率。 In other words, the equipment for detecting potential event hotspots and its execution method, as well as the computer program product of the present invention can detect the event characteristics based on the event occurrence frequency, occurrence time relationship, occurrence time interval and range of the target objects of the detection, and then According to the representative event characteristics of the exploration target objects that have occurred target events, find out the exploration target objects that have not occurred target events but have potential event hotspots, and give warnings to the exploration target objects that have occurred target events and have potential event hot spots , so as to effectively reduce the frequency of target events.

2:潛在事件熱點探勘之設備 2: Equipment for hotspot detection of potential events

21:探勘目標資料蒐集模組 21: Exploration target data collection module

22:分群模組 22: Grouping module

23:事件特徵探勘模組 23:Event Feature Exploration Module

24:事件特徵比對模組 24: Event feature comparison module

25:事件熱點預警模組 25:Event hotspot early warning module

S201~S205:步驟 S201~S205: steps

S301~S306:步驟 S301~S306: steps

S401~S408:步驟 S401~S408: steps

圖1係為本發明之潛在事件熱點探勘之設備之架構示意圖。 FIG. 1 is a schematic diagram of the structure of the equipment for potential event hotspot detection according to the present invention.

圖2係為本發明之潛在事件熱點探勘之方法之流程示意圖。 FIG. 2 is a schematic flow chart of the method for potential event hotspot detection according to the present invention.

圖3係為本發明之潛在事件熱點探勘之方法的分群之流程示意圖。 FIG. 3 is a schematic flow chart of the grouping of the method for potential event hotspot detection of the present invention.

圖4係為本發明之潛在事件熱點探勘之方法的事件特徵探勘比對之流程示意圖。 FIG. 4 is a schematic flow chart of event feature detection and comparison of the method for potential event hotspot detection in the present invention.

以下藉由特定的實施例說明本案之實施方式,熟習此項技藝之人士可由本文所揭示之內容輕易地瞭解本案之其他優點及功效。本說明書所附圖式所繪示之結構、比例、大小等均僅用於配合說明書所揭示之內容,以供熟悉此技藝之人士之瞭解與閱讀,非用於限定本案可實施之限定條件,故任何修飾、改變或調整,在不影響本案所能產生之功效及所能達成之目的下,均應仍落在本案所揭示之技術內容得能涵蓋之範圍內。 The implementation of this case is described below through specific examples, and those skilled in this art can easily understand other advantages and effects of this case from the content disclosed herein. The structures, proportions, sizes, etc. shown in the drawings attached to this manual are only used to match the content disclosed in the manual, for the understanding and reading of those who are familiar with this technology, and are not used to limit the conditions that can be implemented in this case. Therefore, any modifications, changes or adjustments should still fall within the scope covered by the technical content disclosed in this case without affecting the functions and goals that can be achieved in this case.

請參閱圖1,本發明之潛在事件熱點探勘之設備(如伺服器)2包括探勘目標資料蒐集模組21、分群模組22、事件特徵探勘模組23、事件特徵比對模組24、事件熱點預警模組25。 Please refer to Fig. 1, the equipment (such as server) 2 of the potential event hotspot exploration of the present invention comprises exploration target data collection module 21, grouping module 22, event feature exploration module 23, event feature comparison module 24, event Hot spot warning module 25.

在一實施例中,圖1中的各模組均可為軟體、硬體或韌體;若為硬體,則可為具有資料處理與運算能力之處理單元、處理器、電腦或伺服器;若為軟體或韌體,則可包括處理單元、處理器、電腦或伺服器可執行之指令。 In one embodiment, each module in FIG. 1 can be software, hardware or firmware; if it is hardware, it can be a processing unit, processor, computer or server with data processing and computing capabilities; In the case of software or firmware, it may include instructions executable by a processing unit, processor, computer or server.

探勘目標資料蒐集模組21用於蒐集複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探勘目標對象之各自的事件,以將複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探勘目標對象之各自 的事件輸入至資料庫(未圖示)。此外,蒐集方法係例如為手動輸入或介接外部系統以自動納入。 The prospecting target data collection module 21 is used to collect multiple prospecting target objects, respective features of the plurality of prospecting target objects, and respective events of the plurality of prospecting target objects, so as to collect the plurality of prospecting target objects, the respective features of the plurality of prospecting target objects, Features, each of the multiple prospecting target objects The events of are input to a database (not shown). In addition, the collection method is, for example, manual input or interfaced with an external system for automatic inclusion.

分群模組22用於根據該複數探勘目標對象之各自的特徵,對該複數探勘目標對象進行分群。詳言之,分群模組22可自該複數探勘目標對象中選取多個探勘目標對象作為中心點,根據該複數探勘目標對象的特徵,計算未被選取的探勘目標對象與該多個中心點之間各自的距離,以將該未被選取的探勘目標對象與距離較近或最近的中心點劃分為同一群,接著計算該同一群中每一探勘目標對象與該同一群中其他探勘目標對象之間的距離和,進而將具有最小距離和之探勘目標對象選取為該同一群中的新中心點,再以該新中心點重新進行計算直到所得到的中心點不更動為止,即完成分群。 The grouping module 22 is used for grouping the plurality of prospecting objects according to their respective characteristics. In detail, the grouping module 22 can select a plurality of prospecting objects from the plurality of prospecting objects as center points, and calculate the distance between the unselected prospecting objects and the plurality of center points according to the characteristics of the plurality of prospecting objects. to divide the unselected prospecting target object into the same group with the closer or nearest central point, and then calculate the distance between each prospecting target object in the same group and other prospecting target objects in the same group The sum of the distances between them, and then select the prospecting target object with the smallest sum of distances as the new center point in the same group, and then recalculate with the new center point until the obtained center point does not change, that is, the grouping is completed.

事件特徵探勘模組23根據該複數探勘目標對象之各自的事件,對每一群組中的探勘目標對象進行分區,以根據危險指標及發生時間,對該每一群組中每一區的探勘目標對象進行探勘,進而獲得該每一群組中該每一區的探勘目標對象的具代表性之事件特徵。詳言之,事件特徵探勘模組23可根據該複數探勘目標對象之各自的事件是否已發生目標事件,對該每一群組中的探勘目標對象進行分區,以發生該目標事件前的一時間區間為窗口對已發生目標事件的探勘目標對象之各自的事件進行取樣,且以該時間區間作為滑動窗口對未發生目標事件的探勘目標對象之各自的事件進行取樣。此外,事件特徵探勘模組23進一步以長度為1的危險指標及其對應的時間區間對該已發生目標事件的探勘目標對象之經取樣的事件進行探勘,獲得該已發生目標事件的探勘目標對象的有效事件特徵,再逐漸增加該危險指標的長度,獲得該已發生目標事件的探勘目標對象的具代表性之事件特徵,且以長度為1的危險指標及其對應的時 間區間對該未發生目標事件的探勘目標對象之經取樣的事件進行探勘,獲得該未發生目標事件的探勘目標對象之各自的有效事件特徵,再逐漸增加該危險指標的長度,獲得該未發生目標事件的探勘目標對象之各自的具代表性之事件特徵。 The event feature prospecting module 23 partitions the prospecting target objects in each group according to the respective events of the plurality of prospecting target objects, so that the prospecting of each area in each group is The target object is surveyed, and then the representative event characteristics of the survey target object in each area in each group are obtained. Specifically, the event feature exploration module 23 can partition the exploration target objects in each group according to whether the respective events of the plurality of exploration target objects have occurred the target event, so that a time before the target event occurs The interval is a window to sample the respective events of the prospecting target objects that have occurred the target event, and use the time interval as a sliding window to sample the respective events of the prospecting target objects that have not occurred the target event. In addition, the event feature exploration module 23 further uses the risk index with a length of 1 and its corresponding time interval to explore the sampled events of the exploration target object of the target event that has occurred, and obtains the detection target object of the target event that has occurred Then gradually increase the length of the hazard index to obtain the representative event characteristics of the exploration target object that has occurred the target event, and the hazard index with a length of 1 and its corresponding time Prospecting for the sampled events of the exploration target object that has not occurred the target event, and obtaining the respective effective event characteristics of the exploration target object that has not occurred the target event, and then gradually increasing the length of the risk index, to obtain the non-occurrence Prospecting for Target Events The respective representative event characteristics of the target objects.

事件特徵比對模組24用於將該每一群組中該每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。此外,事件特徵比對模組24更針對該具潛在事件熱點之探勘目標對象,評估該具潛在事件熱點之探勘目標對象發生該目標事件的風險。 The event feature comparison module 24 is used for comparing the representative event features of the survey target objects in each region in each group, so as to obtain the survey target objects with potential event hotspots. In addition, the event feature comparison module 24 further evaluates the risk of the target event occurring on the prospecting target object with potential event hotspots for the prospecting target object with potential event hotspots.

事件熱點預警模組25用於根據對應於已發生事故以及潛在事故發生熱點的道路資料之基地台識別碼,對車機發出預警。詳言之,該複數探勘目標對象係為複數道路資料,該複數探勘目標對象之各自的特徵係為該複數道路資料之各自的道路特性,而該複數探勘目標對象之各自的事件係包括由車機編號、危險指標、回報時間、是否發生事故所組成的集合。是以,事件熱點預警模組25可利用基地台識別碼,對在已發生(曾經發生)事故的道路或潛在事故發生熱點的道路附近的車輛發出預警。 The event hotspot early warning module 25 is used to issue an early warning to the vehicle according to the base station identification code corresponding to the road data corresponding to the hot spots of accidents and potential accidents. Specifically, the plurality of prospecting objects are plural road data, the respective characteristics of the plurality of prospecting objects are the respective road characteristics of the plurality of road data, and the respective events of the plurality of prospecting objects include vehicles A set consisting of machine number, hazard indicator, return time, and whether an accident occurred. Therefore, the event hotspot early warning module 25 can use the base station identification code to issue early warnings to vehicles near roads where accidents have occurred (have occurred) or roads where potential accident hotspots have occurred.

請參閱圖2,本發明之潛在事件熱點探勘之方法,乃為電腦或伺服器或資訊處理裝置或類似的裝置設備,藉由電腦軟體與硬體資源的協同運作,在其中運作程式或指令以執行之方法,包括步驟S201~S205。另外,提供一種電腦程式產品,經由電腦載入程式後執行用於潛在事件熱點探勘之方法。須說明的是,電腦軟體可儲存於硬碟、軟碟、光碟、USB隨身碟等電腦可讀取記錄媒體,亦可在網路上直接傳輸提供,電腦軟體例如電腦程式或電腦程式產品。 Please refer to Fig. 2, the method for potential event hotspot exploration of the present invention is a computer or server or information processing device or similar device equipment, through the coordinated operation of computer software and hardware resources, in which the program or instruction is operated to The execution method includes steps S201-S205. In addition, a computer program product is provided, and the method for potential event hotspot detection is executed after the program is loaded into the computer. It should be noted that computer software can be stored in computer-readable recording media such as hard disks, floppy disks, optical disks, and USB flash drives, and can also be directly transmitted and provided on the Internet, such as computer programs or computer program products.

於步驟S201中,蒐集複數探勘目標對象、複數探勘目標對象之各自的特徵、複數探勘目標對象之各自的事件,接著進入步驟S202。 In step S201, collect the plurality of mining target objects, the respective features of the plurality of mining target objects, and the respective events of the plurality of mining target objects, and then proceed to step S202.

於步驟S202中,根據複數探勘目標對象之各自的特徵,對複數探勘目標對象進行分群,接著進入步驟S203。 In step S202, group the plurality of prospecting objects according to their respective characteristics, and then proceed to step S203.

於步驟S203中,根據複數探勘目標對象之各自的事件,對每一群組中的探勘目標對象進行分區,接著進入步驟S204。 In step S203, according to the respective events of the plurality of prospecting target objects, the prospecting target objects in each group are partitioned, and then step S204 is entered.

於步驟S204中,根據危險指標及發生時間,對每一群組中每一區的探勘目標對象進行探勘,獲得每一群組中每一區的探勘目標對象的具代表性之事件特徵,接著進入步驟S205。 In step S204, according to the risk index and the time of occurrence, the exploration target object in each area in each group is surveyed, and the representative event characteristics of the exploration target object in each area in each group are obtained, and then Go to step S205.

於步驟S205中,將每一群組中每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。 In step S205, the representative event features of the survey targets in each area in each group are compared with each other to obtain survey targets with potential event hotspots.

請參閱圖3,本發明之潛在事件熱點探勘之方法中之分群方法,包括步驟S301~S306。 Please refer to FIG. 3 , the grouping method in the method for detecting potential event hotspots of the present invention includes steps S301-S306.

於步驟S301中,自複數探勘目標對象中選取多個探勘目標對象作為中心點,接著進至步驟S302。 In step S301, a plurality of prospecting target objects are selected as center points from the plurality of prospecting target objects, and then proceed to step S302.

於步驟S302中,根據複數探勘目標對象之各自的特徵,計算未被選取的探勘目標對象與多個中心點之間各自的距離,以將未被選取的探勘目標對象與距離較近或最近的中心點劃分為同一群,接著進至步驟S303。 In step S302, according to the respective characteristics of the plurality of prospecting objects, the respective distances between the unselected prospecting objects and a plurality of center points are calculated, so as to associate the unselected prospecting objects with the closer or closest distances The central points are divided into the same group, and then proceed to step S303.

於步驟S303中,計算同一群中每一探勘目標對象與同一群中其他探勘目標對象之間的距離和,以將具有最小距離和的探勘目標對象選取為同一群中的新中心點,接著進至步驟S304。 In step S303, calculate the distance sum between each prospecting target object in the same group and other prospecting target objects in the same group, so as to select the prospecting target object with the minimum distance sum as the new center point in the same group, and then carry out Go to step S304.

於步驟S304中,判斷新中心點與舊中心點是否相同,若相同,即進至步驟S306,完成分群,反之,若不相同,則進至步驟S305。 In step S304, it is judged whether the new center point is the same as the old center point, if they are the same, then proceed to step S306 to complete the grouping, otherwise, if not, then proceed to step S305.

於步驟S305中,以新中心點代替舊中心點,接著返回步驟S302,直到中心點不再變動為止。 In step S305, replace the old center point with the new center point, and then return to step S302 until the center point does not change any more.

請參閱圖4,本發明之潛在事件熱點探勘之方法中之事件特徵探勘比對方法,包括步驟S401~S408。 Please refer to FIG. 4 , the event feature detection and comparison method in the method for potential event hotspot detection of the present invention includes steps S401-S408.

於步驟S401中,根據複數探勘目標對象之各自的事件是否已發生目標事件,對每一群組中的探勘目標對象進行分區,其中,一區為已(曾經)發生目標事件的探勘目標對象,進行步驟S402~S404,而另一區為未發生目標事件的探勘目標對象,進行步驟S405~S407。 In step S401, according to whether the respective events of the plurality of prospecting target objects have occurred the target event, the prospecting target objects in each group are partitioned, wherein, one zone is the prospecting target object that has (once) occurred the target event, Proceed to steps S402-S404, and the other area is the prospecting target object where no target event occurs, then proceed to steps S405-S407.

於步驟S402中,針對已發生目標事件的探勘目標對象,以發生目標事件前的一預定時間區間為窗口,對已發生目標事件的探勘目標對象的各自的事件進行取樣,接著進至步驟S403。 In step S402, for the prospecting target objects where the target event has occurred, a predetermined time interval before the target event occurs is used as a window to sample the respective events of the prospecting target objects where the target event has occurred, and then proceed to step S403.

於步驟S403中,針對已發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得已發生目標事件的探勘目標對象的有效事件特徵,接著進至步驟S404。 In step S403, for the sampled events of the prospecting target objects in which the target event has occurred, the risk index with a length of 1 and its corresponding time interval are used for prospecting to obtain the effective event characteristics of the prospecting target objects in which the target event has occurred, Then go to step S404.

於步驟S404中,逐漸增加危險指標的長度,藉此獲得已發生目標事件的探勘目標對象的具代表性之事件特徵,接著進至步驟S408。 In step S404, the length of the risk indicator is gradually increased, thereby obtaining representative event characteristics of the prospecting target object that has occurred the target event, and then proceeding to step S408.

於步驟S405中,針對未發生目標事件的探勘目標對象,以預定時間區間作為滑動窗口,對未發生目標事件的探勘目標對象的各自的事件進行取樣,接著進至步驟S406。 In step S405 , for the prospecting target objects without target events, the predetermined time interval is used as a sliding window to sample the respective events of the prospecting target objects without target events, and then proceed to step S406 .

於步驟S406中,針對未發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得未發生目標事件的探勘目標對象的有效事件特徵,接著進至步驟S407。 In step S406, for the sampled events of the prospecting target objects without the target event, the risk index with a length of 1 and the corresponding time interval are used for prospecting to obtain the effective event characteristics of the prospecting target objects without the target event, Then proceed to step S407.

於步驟S407中,逐漸增加危險指標的長度,藉此獲得未發生目標事件的探勘目標對象的具代表性之事件特徵,接著進至步驟S408。 In step S407, the length of the risk indicator is gradually increased, thereby obtaining representative event characteristics of the surveying target objects without target events, and then proceeding to step S408.

於步驟S408中,比對已發生目標事件的探勘目標對象的具代表性之事件特徵與未發生目標事件的探勘目標對象的具代表性之事件特徵,獲得具潛在事件熱點之探勘目標對象,並評估具潛在事件熱點之探勘目標對象發生目標事件的風險。 In step S408, comparing the representative event characteristics of the prospecting target objects that have occurred the target event with the representative event characteristics of the prospecting target objects that have not occurred the target event, to obtain the prospecting target objects with potential event hotspots, and Assess the risk of target events occurring in prospecting targets with potential event hotspots.

以下舉出一具體實施例,其為本發明之潛在事件熱點探勘之設備(如伺服器)及方法在道路交通安全方面的應用。 A specific embodiment is listed below, which is the application of the equipment (such as server) and method for potential event hotspot detection of the present invention in road traffic safety.

首先,複數探勘目標對象係一道路資料集,其由n筆道路資料組成,表示為Ri,i=1,...,n,每一道路資料包含縣市鄉鎮名稱和道路名稱,如表一所示。 First, the complex prospecting target object is a road data set, which is composed of n road data, expressed as R i , i=1,...,n, each road data includes the name of the county, city, town and road, as shown in Table one shown.

其次,複數探勘目標對象之各自的特徵係一道路特性資料集,其由複數道路特性集合組成,如表一所示。 Secondly, the respective characteristics of the plurality of exploration target objects are a road characteristic data set, which is composed of a plurality of road characteristic sets, as shown in Table 1.

每一道路特性由道路特性項目與道路特性項目所屬類別組成,表示為(Ii,Ci’),Ci

Figure 110139664-A0101-12-0011-24
Ci。這些道路特性項目共有r種,表示為Ii,i=1,...,r。第i種道路特性項目之道路特性項目所屬類別共有s種,表示為cij,j=1,...s,,並以集合表示為Ci={ci1,ci2,...,cis}。每一道路特性集合係由r筆道路特性組成,表示為{(I1,C1’),(I2,C2’),...,(Ir,Cr’)}。 Each road characteristic is composed of road characteristic items and categories of road characteristic items, expressed as (I i ,C i '), C i '
Figure 110139664-A0101-12-0011-24
C i . There are r types of these road characteristic items, expressed as I i , where i=1,...,r. There are s types of road characteristic items of the i-th road characteristic item, expressed as c ij , j=1,...s, and expressed as a set of C i ={c i1 ,c i2 ,..., c is }. Each road characteristic set is composed of r road characteristics, expressed as {(I 1 ,C 1 '),(I 2 ,C 2 '),...,(I r ,C r ')}.

這些道路特性項目至少包含道路種類、道路線數、道路指示標誌,該些道路特性項目分述如下: These road characteristic items at least include road type, road line number, and road indication signs, and these road characteristic items are described as follows:

道路種類表示為I1,其所屬類別共有3種,並以集合表示為C1={國道,快速道路,省縣道},為便於後續分群,針對該集合進行數值化編碼,依序編碼後集合表示為C1={1,2,3}。道路線數表示為I2,其所屬類別共有4種,並以集合表示為C2={四線道,三線道,二線道,一線道},為便於後續分群,針對該集合進行數值化編碼,依序編碼後集合表示為C2={1,2,3,4}。道路指示標誌表示為I3,其所屬類別共有4種,並以集合表示為C3={無,路面顛簸,學校,醫院},為便於後續分群,針對該集合進行數值化編碼,依序編碼後集合表示為C3={1,2,3,4}。例如,道路資料R1的道路特性集合為{(I1,1),(I2,1),(I3,2)},表示此道路之道路特性有:道路種類為國道、道路線數為四線道、道路指示標誌為道路路面顛簸。 The type of road is expressed as I 1 , and there are 3 types of it, and the set is expressed as C 1 = {national road, express road, provincial and county road}. In order to facilitate subsequent grouping, numerical coding is carried out for this set, and after sequential coding A set is denoted as C 1 ={1,2,3}. The number of road lines is expressed as I 2 , which belongs to 4 categories, and is expressed as a set of C 2 ={four-lane road, three-lane road, second-lane road, and one-lane road}. In order to facilitate subsequent grouping, the set is digitized Encoding, the set after sequential encoding is expressed as C 2 ={1,2,3,4}. The road sign is expressed as I 3 , which belongs to 4 categories, and is expressed as a set of C 3 ={none, bumpy road, school, hospital}. In order to facilitate subsequent grouping, the set is numerically coded and coded sequentially The back set is denoted as C 3 ={1,2,3,4}. For example, the set of road characteristics of road data R1 is {(I 1 ,1),(I 2 ,1),(I 3 ,2)}, which means that the road characteristics of this road are: the road type is national road, and the number of road lines is Four-lane roads and road signs are road bumps.

再次,複數探勘目標對象之事件係一道路事件資料集,其由複數道路事件集合組成,如表一所示。 Thirdly, the events of the multiple prospecting target objects are a road event data set, which is composed of multiple road event sets, as shown in Table 1.

每一道路事件係由車機編號、車機回報之危險指標、回報時間與事故註記所組成,表示為(O’,E’,Tn,N’),O’

Figure 110139664-A0101-12-0012-25
O,E’
Figure 110139664-A0101-12-0012-26
E,N’
Figure 110139664-A0101-12-0012-28
N。車機編號共有3筆,並以集合表示為O={OBU1,OBU2,OBU3}。車機回報之危險指標共有6種,並以集合表示為E={超速,急加速,急減速,急過彎,前方防碰撞,車道偏移},為便於後續事件特徵之探勘,針對該集合進行類別化編碼,依序編碼後集合表示為E={a,b,c,d,e}。回報時間共有t筆,表示為Tn,n=1,...,t。事故註記共有2種,並以集合表示為N={已發生事故,未發生事故},為便於後續事件特徵之探勘,針對該集合進行類別化編碼,依序編碼後集合表示為N={T,F}。每一道路事件 集合係由u筆道路事件組成,表示為{(O’,E’,T1,N’),(O’,E’,T2,N’),...,(O’,E’,Tu,N’)}。 Each road event is composed of the vehicle number, the risk indicator reported by the vehicle, the return time and the accident note, expressed as (O',E',T n ,N'), O'
Figure 110139664-A0101-12-0012-25
O, E'
Figure 110139664-A0101-12-0012-26
E, N'
Figure 110139664-A0101-12-0012-28
N. There are 3 vehicle numbers, and they are expressed as O={OBU1,OBU2,OBU3} in aggregate. There are 6 types of risk indicators reported by vehicles, and they are expressed as a set of E={speeding, rapid acceleration, rapid deceleration, sharp cornering, front collision avoidance, lane deviation}. Carry out categorical encoding, and the set after sequential encoding is expressed as E={a,b,c,d,e}. There are t times of return time, denoted as T n , n=1,...,t. There are 2 types of accident notes, and they are represented by a set as N={Accidents have occurred, No accidents}. In order to facilitate the exploration of the characteristics of subsequent events, the set is classified and coded, and the set is represented as N={T after sequential coding ,F}. Each road event set is composed of u road events, expressed as {(O',E',T 1 ,N'),(O',E',T 2 ,N'),...,(O ',E',T u ,N')}.

表一:道路資料集。

Figure 110139664-A0101-12-0013-2
Table 1: Road dataset.
Figure 110139664-A0101-12-0013-2

接著,依據複數探勘目標對象之特徵,針對該複數探勘目標對象進行分群。 Then, according to the characteristics of the complex prospecting target objects, the complex prospecting target objects are grouped.

在一實施例中,分群方法例如為分割式分群法中之k-medoids,包含四步驟: In one embodiment, the clustering method is, for example, k-medoids in the divisional clustering method, which includes four steps:

在第一步驟中,依據欲分群數量,於探勘目標資料集中隨機選取k個探勘目標對象當作中心點;在第二步驟中,依據這些探勘目標對象之特徵值計算這些探勘目標對象與k個中心點間之距離,並與最近之中心點劃分為同一群;在第三步驟中,依據每一群組內該些探勘目標對象之特徵值計算該些探勘目標對象間之距離,選取讓所有距離和最小之探勘目標對象當作新中心點;以及在第四步驟中,依據第三步驟之新中心點取代舊中心點,並重複上述第二與第三步驟,直到中心點不再變動。 In the first step, according to the number of clusters to be grouped, randomly select k prospecting target objects in the prospecting target data set as the center point; in the second step, calculate the relationship between these prospecting target objects and k The distance between the center points is divided into the same group with the nearest center point; in the third step, the distance between these exploration target objects is calculated according to the characteristic values of the exploration target objects in each group, and all The prospecting target object with the minimum distance and minimum is taken as the new center point; and in the fourth step, the old center point is replaced by the new center point according to the third step, and the above-mentioned second and third steps are repeated until the center point does not change any more.

分群方法中之距離計算方法例如為歐式距離計算法,可選取兩探勘目標對象,並依據兩探勘目標對象之特徵值,計算出兩探勘目標對象之距離,距離D運用下列公式計算: The distance calculation method in the clustering method is, for example, the Euclidean distance calculation method. Two prospecting targets can be selected, and the distance between the two prospecting targets can be calculated according to the characteristic values of the two prospecting targets. The distance D can be calculated using the following formula:

Figure 110139664-A0101-12-0014-4
Figure 110139664-A0101-12-0014-4

換言之,運用上述公式,依據該些道路特性項目所屬類別計算該些道路資料之距離,以作為道路資料間相似度依據。 In other words, the above formula is used to calculate the distance of the road data according to the categories of the road characteristic items as the basis for the similarity between the road data.

在第一步驟中,將道路資料R1~R7分為兩群,隨機選取兩道路資料R1、R5當作中心點;在第二步驟中,依據道路特性項目所屬類別計算道路資料R1~R7和兩中心點R1、R5之距離,並與最近之中心點劃分為同一群。以道路資料R2為例,將道路資料R2之道路特性集合{(I1,1),(I2,3),(I3,1)}和中心點R1之道路特性集合{(I1,1),(I2,1),(I3,2)}中之些道路特性項目所屬類別,輸入歐式距離計 算方法,可得兩點間距離為2.236067977,並將道路資料R2之道路特性集合{(I1,1),(I2,3),(I3,1)}和中心點R5之道路特性集合{(I1,3),(I2,4),(I3,4)}中之些道路特性項目所屬類別,輸入歐式距離計算方法,可得兩點間距離為3.741657387。 In the first step, the road data R 1 ~ R 7 are divided into two groups, and the two road data R 1 and R 5 are randomly selected as the center point; in the second step, the road data R is calculated according to the category of the road characteristic item. 1 ~ R 7 and the distance between two central points R 1 and R 5 , and they are divided into the same group as the nearest central point. Taking the road data R 2 as an example, the road characteristic set {(I 1 ,1),(I 2 ,3),(I 3 , 1 )} of the road data R 2 and the road characteristic set {( I 1 ,1),(I 2 ,1),(I 3 ,2)} belong to the categories of road characteristic items, input the Euclidean distance calculation method, the distance between two points can be obtained as 2.236067977, and the road data R 2 The road characteristic set {(I 1 ,1),(I 2 ,3),(I 3 , 1 )} and the road characteristic set {(I 1 ,3),(I 2 ,4), (I 3 ,4)} The categories of road characteristic items belong to, input the Euclidean distance calculation method, and the distance between two points can be obtained as 3.741657387.

以此類推,可得道路資料R2~R4、R6~R7和兩中心點R1、R5之距離(相似度),如表二所示。 By analogy, the distance (similarity) between the road data R 2 ~R 4 , R 6 ~R 7 and the two center points R 1 , R 5 can be obtained, as shown in Table 2.

表二:道路資料和中心點之距離。

Figure 110139664-A0101-12-0015-3
Table 2: Road data and the distance from the center point.
Figure 110139664-A0101-12-0015-3

於表二中,道路資料R2與中心點R1之距離最近,故劃分為同一群組,以此類推,第一次分群結果為:群組一以道路資料R1為中心點,依最近距離進行分群,分群結果屬於群組一的道路資料包含R1、R2、R3;群組二以道路資料R5為中心點,依最近距離進行分群,分群結果屬於群組二的道路資料包含R4、R5、R6、R7In Table 2, the distance between road data R 2 and center point R 1 is the shortest, so they are divided into the same group, and so on, the result of the first grouping is: Group 1 takes road data R 1 as the center point, according to the closest Grouping by distance, the grouping result belongs to the road data of group 1 including R 1 , R 2 , R 3 ; group 2 takes the road data R 5 as the center point, and performs grouping according to the shortest distance, and the grouping result belongs to the road data of group 2 Contains R 4 , R 5 , R 6 , R 7 .

在第三步驟中,將第二步驟分群結果中各自成一群組之道路資料,依據道路特性項目所屬類別計算道路資料間之距離,選取讓所有距離和最小之道路資料當作新中心點。 In the third step, the distance between the road data is calculated according to the category of the road characteristic items of the road data grouped in the grouping result of the second step, and the road data with the minimum sum of all distances is selected as the new center point.

以群組一中之道路資料R1為例,將道路資料R1之道路特性集合{(I1,1),(I2,1),(I3,2)}和道路資料R2之道路特性集合{(I1,1),(I2,3),(I3,1)}中之道路特性項目所屬類別,輸入歐式距離計算方法,可得兩點間距離為2.236067977,再將道路資料R1之道路特性集合{(I1,1),(I2,1),(I3,2)}和道路資料R3之道路特性集合 {(I1,1),(I2,2),(I3,2)}中之道路特性項目所屬類別,輸入歐式距離計算方法,可得兩點間距離是1,其距離和是3.236067977。以此類推,可得該些群組內,該些道路資料間之距離和,如表三與表四所示。 Taking the road data R 1 in group 1 as an example, the road characteristic set {(I 1 ,1),(I 2 ,1),(I 3 ,2)} of the road data R 1 and the road data R 2 The category of the road characteristic items in the road characteristic set {(I 1 ,1),(I 2 ,3),(I 3 ,1)} belongs to the category, input the Euclidean distance calculation method, the distance between two points can be obtained as 2.236067977, and then Road characteristic set {(I 1 ,1), (I 2 ,1), (I 3 ,2)} of road data R 1 and road characteristic set {(I 1 ,1), (I 2 , 2),(I 3 ,2)} The category of the road characteristic item belongs to, input the Euclidean distance calculation method, the distance between two points is 1, and the distance sum is 3.236067977. By analogy, the sum of the distances between the road data in the groups can be obtained, as shown in Table 3 and Table 4.

表三:群組一中道路資料間之距離。

Figure 110139664-A0101-12-0016-5
Table 3: Distances between road data in Group 1.
Figure 110139664-A0101-12-0016-5

表四:群組二中道路資料間之距離。

Figure 110139664-A0101-12-0016-7
Table 4: Distance between road data in group 2.
Figure 110139664-A0101-12-0016-7

在表三中,群組一中之道路資料R3與其他道路資料距離和最小,故選取R3為新中心點;在表二中,群組二中之道路資料R6與其他道路資料距離和最小,故選取R6為新中心點。 In Table 3, the road data R 3 in group 1 has the smallest distance from other road data, so R 3 is selected as the new center point; in Table 2, the distance between road data R 6 in group 2 and other road data is the smallest. and the minimum, so choose R 6 as the new center point.

在第四步驟中,依據第三步驟結果以新中心點取代舊中心點,並重複第二與第三步驟,直到中心點不再變動,其最終分群結果如下:群組一以道路資料R3為中心點,依最近距離進行分群,分群結果屬於群組一的道路資料包含R1、R2、R3;群組二以道路資料R6為中心點,依最近距離進行分群,分群結果屬於群組二的道路資料包含R4、R5、R6、R7In the fourth step, the old center point is replaced by the new center point according to the result of the third step, and the second and third steps are repeated until the center point does not change any more. The final grouping result is as follows: Group 1 uses road data R The center point is grouped according to the shortest distance, and the grouping result belongs to the road data of group 1 including R 1 , R 2 , R 3 ; the group 2 takes the road data R 6 as the center point, and is grouped according to the shortest distance, and the grouping result belongs to The road data of group 2 includes R 4 , R 5 , R 6 , and R 7 .

接著,針對每一群組中複數探勘目標對象之事件探勘出複數具代表性之事件特徵,包括自已發生目標事件之探勘目標對象之事件探勘出之複數具代表性之事件特徵,以及自每一未發生目標事件之探勘目標對象之事件探勘出之複數具代表性之事件特徵。在一實施例中,該目標事件係一事故,即依據表一中道路事件之事故註記,區分每一道路資料是否已發生事故。 Then, a plurality of representative event features are mined out for the events of the plurality of target objects in each group, including a plurality of representative event features mined from the events of the target objects where the target event has occurred, and from each A plurality of representative event characteristics detected by the event detection of the target object without the occurrence of the target event. In one embodiment, the target event is an accident, that is, according to the accident note of the road event in Table 1, it is distinguished whether each road data has an accident or not.

以群組二為例,道路資料R6的道路事件之事故註記包含T,故區分成已發生事故之道路資料;以此類推,群組二區分結果屬於已發生事故之道路資料包含R6、R7,未發生事故之道路資料包含R4、R5Taking group 2 as an example, the accident note of the road event in road data R 6 contains T, so it is divided into road data that has occurred in accidents; and so on, the road data that group 2 distinguishes results in accidents that have occurred includes R 6 , R 7 , road data without accidents include R 4 and R 5 .

此外,針對已發生目標事件之探勘目標對象之事件而言,取樣方法係為目標事件發生前一時間區間,對已發生事故之道路資料之道路事件而言,取樣之時間區間係事故發生當下至發生前5分鐘。 In addition, for the event of the target object of the target event that has occurred, the sampling method is the time interval before the occurrence of the target event. 5 minutes before it happened.

以群組二中之道路資料R6為例,先分析其道路事件之事故註記是否為T,若為T則表示該道路事件係事故發生點,經分析後屬於事故發生點之道路事件包含(OBU1,e,0924,T)、(OBU2,e,0934,T)、(OBU3,e,0954,T)。 Taking the road data R6 in group 2 as an example, first analyze whether the accident note of the road event is T, if it is T, it means that the road event is an accident occurrence point, after analysis, the road event belonging to the accident occurrence point includes ( OBU1, e, 0924, T), (OBU2, e, 0934, T), (OBU3, e, 0954, T).

依據上述分析結果,針對這些發生事故之車機分別取樣事故發生點至發生前一段時間(例如五分鐘)之道路事件,以組成一道路事件序列。以道路事件(OBU3,e,0954,T)為例,取樣後該道路事件序列係{(OBU3,b,0951,F)、(OBU3,d,0952,F)、(OBU3,c,0953,F)、(OBU3,e,0954,T)}。以此類推,可得該群組內該些事故發生點之道路事件序列。取樣結果如表五所示。 According to the above analysis results, the road events from the point of accident occurrence to a period of time (for example, five minutes) before the occurrence of the accident are sampled respectively for these accident-prone vehicles to form a road event sequence. Taking the road event (OBU3,e,0954,T) as an example, the road event sequence after sampling is {(OBU3,b,0951,F), (OBU3,d,0952,F), (OBU3,c,0953, F), (OBU3,e,0954,T)}. By analogy, the road event sequences of the accident occurrence points in the group can be obtained. The sampling results are shown in Table 5.

表五:群組二中已發生事故之道路資料之道路事件取樣結果。

Figure 110139664-A0101-12-0017-8
Table 5: The road event sampling results of the road data of the accidents in group 2.
Figure 110139664-A0101-12-0017-8

Figure 110139664-A0101-12-0018-9
Figure 110139664-A0101-12-0018-9

對於未發生目標事件之探勘目標對象之事件資料,取樣方法係一滑動窗口方法,且依時間序逐一取樣其中,該滑動窗口方法之窗口範圍為一時間區間,該滑動窗口方法之窗口平移距離則為一可調參數。例如,每一道路資料針對道路事件中之車機編號,依時間序逐一擷取道路事件,滑動窗口方法之窗口範圍之時間區間係5分鐘,即窗口平移距離以時間為單位,且該單位係5分鐘。 For the event data of the exploration target object that has not occurred the target event, the sampling method is a sliding window method, and one by one is sampled according to the time sequence. The window range of the sliding window method is a time interval, and the window translation distance of the sliding window method is is an adjustable parameter. For example, each road data captures the road events one by one according to the time sequence for the number of vehicles in the road event. The time interval of the window range of the sliding window method is 5 minutes, that is, the window translation distance is in time, and the unit is 5 minutes.

以群組二中之道路資料R4為例,針對該些道路事件之車機編號為OBU1進行取樣。從整點開始逐一取樣,滑動窗口第一次涵蓋時間範圍從12:00至12:05,屬於該時間範圍內之道路事件包含(OBU1,f,1203,F)、(OBU1,c,1205,F)。滑動窗口經第一次平移後,第二次涵蓋時間範圍從12:05至12:10,並無道路事件屬於該時間範圍內。滑動窗口經第二次平移後,第三次涵蓋時間範圍從12:10至12:15,屬於該時間範圍內之道路事件包含(OBU1,a,1211,F)、(OBU1,c,1213,F)、(OBU1,e,1214,F)、(OBU1,f,1215,F)。以此類推,可得該群組內每一道路資料取樣結果,如表六所示。 Taking the road data R4 in group 2 as an example, the vehicle number of these road events is OBU1 for sampling. Sampled one by one from the whole point, the sliding window covers the time range from 12:00 to 12:05 for the first time, and the road events belonging to this time range include (OBU1, f, 1203, F), (OBU1, c, 1205, F). After the first shift of the sliding window, the second time covers the time range from 12:05 to 12:10, and no road event falls within this time range. After the sliding window is shifted for the second time, the third time covers the time range from 12:10 to 12:15. The road events within this time range include (OBU1,a,1211,F), (OBU1,c,1213, F), (OBU1, e, 1214, F), (OBU1, f, 1215, F). By analogy, the sampling results of each road data in the group can be obtained, as shown in Table 6.

表六:群組二中未發生事故之道路資料之道路事件取樣結果。

Figure 110139664-A0101-12-0018-10
Table 6: Sampling results of road incidents on road data without accidents in Group 2.
Figure 110139664-A0101-12-0018-10

Figure 110139664-A0101-12-0019-11
Figure 110139664-A0101-12-0019-11

接著,依據探勘目標對象之事件資料間發生頻率、發生時間相互關係、發生時間間隔及範圍,以探勘出事件特徵。該事件特徵探勘方法係一頻繁情節探勘分析方法,該方法探勘出之頻繁情節包含有序情節(serial episode)、並行情節(parallel episode)和複合情節(composite episode),其中,有序情節表示為事件特徵間有時間先後順序關係,並行情節表示為事件特徵間之時間關係並不重要,及複合情節由有序情節與並行情節組合而成。 Then, according to the occurrence frequency, the relationship between occurrence time, the occurrence time interval and the scope of the event data of the exploration target object, the event characteristics are detected. The event feature mining method is a frequent episode mining analysis method, and the frequent episodes explored by this method include serial episodes, parallel episodes and composite episodes, where the serial episodes are expressed as There is a chronological sequence relationship between event features, parallel plots indicate that the time relationship between event features is not important, and compound plots are composed of ordered plots and parallel plots.

在一實施例中,事件特徵探勘方法係例如為一頻繁情節探勘分析方法,包含四步驟: In one embodiment, the event feature mining method is, for example, a frequent scenario mining analysis method, which includes four steps:

在第一步驟中,設定探勘條件相關參數,除minimal-occurrences用於避免事件特徵重複計算外,其餘探勘條件相關參數包含max-gap、min-gap、max-during、min-sup;其中,max-gap表示為兩事件特徵發生時間之最大時間單位;min-gap表示為兩事件特徵間發生時間之最小時間單位;max-during表示為事件特徵所涵蓋時間區間之最大時間單位;min-sup表示為事件特徵發生之最小支持度;在第二步驟中,針對取樣後之探勘目標對象之事件資料進行掃描,取得長度為1之事件特徵與其發生時間區間集合,並依序檢查是否符合min-sup,若符合,則放入有效事件特徵集;在第三步驟中,從有效事件特徵集中取出一事件特徵作為前綴,並取出其發生時間區間集合,再依據時間區間集合,針對取樣後之探勘目標對象之事件資料進行掃描,取得以該事件特徵為前綴之 事件特徵與其發生時間區間集合,依序檢查是否符合探勘條件相關參數;若有符合,則將以該事件特徵為前綴之事件特徵與其發生時間區間放入一新的有效事件特徵集,並針對此新的有效事件特徵集重複執行本步驟,直到此新的有事件特徵集為空;若無符合,則將該事件特徵視為具代表性之事件特徵並回傳;在第四步驟中,重複第三步驟,直到取得所有具代表性之事件特徵。 In the first step, parameters related to exploration conditions are set. Except minimal-occurrences is used to avoid repeated calculation of event characteristics, other parameters related to exploration conditions include max-gap, min-gap, max-during, and min-sup; among them, max -gap indicates the maximum time unit of the occurrence time of two event features; min-gap indicates the minimum time unit of the occurrence time between two event features; max-during indicates the maximum time unit of the time interval covered by the event feature; min-sup indicates is the minimum support for the occurrence of event features; in the second step, scan the event data of the survey target object after sampling, obtain the set of event features with a length of 1 and their occurrence time intervals, and check whether they meet the min-sup , if it matches, put it into the effective event feature set; in the third step, take out an event feature from the effective event feature set as a prefix, and take out its occurrence time interval set, and then according to the time interval set, for the exploration target after sampling Scan the event data of the object to obtain the A set of event features and their occurrence time intervals are checked in order to see if they meet the relevant parameters of the exploration conditions; Repeat this step for the new valid event feature set until the new event feature set is empty; if there is no match, the event feature is regarded as a representative event feature and returned; in the fourth step, repeat The third step, until all the representative event features are obtained.

運用上述探勘方法,針對已發生事故與每一未發生事故之道路資料之道路事件取樣結果,探勘出複數以危險指標組成之具代表性之事件特徵。 Using the above-mentioned exploration method, according to the road event sampling results of the road data that has occurred accidents and each road data that has not occurred accidents, a plurality of representative event characteristics composed of hazard indicators are explored.

以群組二中已發生事故之道路資料之道路事件取樣結果為例,運用上述探勘方法,探勘出複數以危險指標組成之具代表性之事件特徵。 Taking the road incident sampling results of the road accident data in Group 2 as an example, using the above-mentioned exploration method, a plurality of representative event characteristics composed of hazard indicators are explored.

在第一步驟中,設定探勘條件相關參數,除minimal-occurrences用於避免事件特徵重複計算外,其餘探勘條件相關參數分別為min-gap係1分鐘、max-gap係5分鐘、max-duiing係5分鐘、min-sup係0.5;在第二步驟中,針對該些道路事件取樣結果進行掃描,取得長度為1之危險指標與其發生時間區間集合,並依序檢查是否符合min-sup,若符合,則放入有效事件特徵集。 In the first step, the parameters related to exploration conditions are set. Except minimal-occurrences is used to avoid repeated calculation of event characteristics, the other parameters related to exploration conditions are 1 minute for min-gap system, 5 minutes for max-gap system, and 5 minutes for max-duiing system. 5 minutes, min-sup is 0.5; in the second step, scan the sampling results of these road incidents, obtain a set of risk indicators with a length of 1 and their occurrence time intervals, and check in order whether they meet min-sup, if they meet , put into the valid event feature set.

以已發生事故之道路資料之道路事件取樣結果為例,即表五的R6、R7,取得長度為1之危險指標與其發生時間區間集合,請如表七所示。 Take the road event sampling results of road accidents that have occurred as an example, that is, R 6 and R 7 in Table 5, to obtain a set of risk indicators with a length of 1 and their occurrence time intervals, as shown in Table 7.

表七:群組二中已發生事故之道路資料之道路事件取樣結果之以長度為1之危險指標探勘資訊。

Figure 110139664-A0101-12-0020-12
Table 7: Exploration information of hazard indicators whose length is 1 in the road event sampling results of the road data that have had accidents in Group 2.
Figure 110139664-A0101-12-0020-12

Figure 110139664-A0101-12-0021-13
Figure 110139664-A0101-12-0021-13

表七中,危險指標a之min-sup係3/6>=0.5,故放入有效事件特徵集;危險指標d之min-sup係2/6<0.5,故不放入有效事件特徵集。以此類推,依序檢查是否符合min-sup,檢查後有效事件特徵集包含a、b、c、e、f。 In Table 7, the min-sup of risk indicator a is 3/6>=0.5, so it is included in the effective event feature set; the min-sup of risk indicator d is 2/6<0.5, so it is not included in the effective event feature set. By analogy, check whether min-sup is met in sequence, and after checking, the effective event feature set includes a, b, c, e, f.

在第三步驟中,從有效事件特徵集中取出一危險指標作為前綴,並取出其發生時間區間集合,再依據時間區間集合,針對該些道路事件取樣結果進行掃描,取得以該危險指標為前綴之危險指標與其發生時間區間集合,以依序檢查是否符合探勘條件相關參數;若符合,則將以該危險指標為前綴之危險指標放入一新的有效事件特徵集,並針對此新的有效事件特徵集重複執行本步驟,直到此新的有事件特徵集為空;若不符合,則將該危險指標視為具代表性之事件特徵並回傳。 In the third step, a dangerous indicator is taken out from the effective event feature set as a prefix, and its occurrence time interval set is taken out, and then according to the time interval set, the sampling results of these road events are scanned to obtain the risk index as a prefix. A set of dangerous indicators and their occurrence time intervals is used to check whether the relevant parameters of the exploration conditions are met in sequence; The feature set repeats this step until the new feature set with events is empty; if not, the risk indicator is regarded as a representative event feature and returned.

以危險指標a為例,從有效事件特徵集中取出危險指標a作為前綴,針對已發生事故之道路資料之道路事件取樣結果進行掃描,取得以危險指標a為前綴之危險指標與其發生時間區間集合,如表八所示。以危險指標a為前綴對危險指標b而言沒有符合的時間區間。又,表八係以危險指標a為前綴,另能以危險指標b、c、e、f來取樣,例如(b,a)、(b,c)、(b,e)、(b,f)、(c,a)、(c,b)、(c,e)、(c,f)、(f,a)、(f,b)、(f,c)、(f,e)。 Take the danger index a as an example, take the danger index a as a prefix from the effective event feature set, scan the road event sampling results of the road data that have occurred accidents, and obtain the danger index with the danger index a as the prefix and the set of its occurrence time interval, As shown in Table 8. Prefixing hazard index a has no matching time interval for hazard index b. Also, Table 8 is prefixed with risk indicator a, and can be sampled with risk indicators b, c, e, f in addition, such as (b, a), (b, c), (b, e), (b, f ), (c,a), (c,b), (c,e), (c,f), (f,a), (f,b), (f,c), (f,e).

表八:群組二中已發生事故之道路資料之道路事件取樣結果之前綴為a之危險指標探勘資訊。

Figure 110139664-A0101-12-0021-14
Table 8: The road event sampling results of road accidents in group 2. The hazard index exploration information prefixed with a.
Figure 110139664-A0101-12-0021-14

Figure 110139664-A0101-12-0022-15
Figure 110139664-A0101-12-0022-15

在表八中,危險指標(a,e)之時間區間間隔之時間單位分別係1、2、2,皆符合max-gap、min-gap、max-during之限制,且min-sup係3/6>=0.5,故放入一新的有效事件特徵集。以此類推,依序檢查是否符合探勘條件相關參數,則新的有效事件特徵集包含(a,e),並針對此新的事件特徵集,取出危險指標(a,e)做為前綴,針對已發生事故之道路事件取樣結果,取得以危險指標(a,e)為前綴之危險指標與其發生時間區間集合,探勘後並無以危險指標(a,e)為前綴之危險指標與其發生時間區間集合,故將危險指標(a,e)視為具代表性之事件特徵並回傳。以此類推,重複本步驟,直到新的事件特徵集為空。 In Table 8, the time units of the time intervals of the risk indicators (a, e) are 1, 2, and 2 respectively, all of which meet the limits of max-gap, min-gap, and max-during, and min-sup is 3/ 6>=0.5, so put in a new effective event feature set. By analogy, check whether the relevant parameters of the exploration conditions are met in sequence, then the new effective event feature set contains (a, e), and for this new event feature set, take out the danger indicator (a, e) as a prefix, for Sampling results of road accidents that have occurred, and obtain the set of hazard indicators prefixed with hazard indicators (a, e) and their occurrence time intervals. After exploration, there are no hazard indicators prefixed with hazard indicators (a, e) and their occurrence time intervals Therefore, the hazard indicators (a, e) are regarded as representative event characteristics and returned. By analogy, repeat this step until the new event feature set is empty.

在第四步驟中,重複第三步驟,直到取得所有具代表性之事件特徵。 In the fourth step, the third step is repeated until all representative event features are obtained.

以群組二中已發生事故之道路資料之道路事件取樣結果為例,即表七,具代表性之事件特徵集包含(a,e)、(b,e),如表九所示。 Take the road incident sampling results of the road accident data in Group 2 as an example, that is, Table 7. The representative event feature set includes (a, e), (b, e), as shown in Table 9.

表九:群組二中已發生事故之道路資料之具代表性之事件特徵集。

Figure 110139664-A0101-12-0022-16
Table 9: Representative event feature set of the road data of the accidents in group 2.
Figure 110139664-A0101-12-0022-16

以此類推,可得群組二中每一未發生事故之道路資料之道路事件取樣結果之具代表性之事件特徵集,如表十所示。 By analogy, the representative event feature set of the road event sampling results of each accident-free road data in Group 2 can be obtained, as shown in Table 10.

表十:群組二中未發生事故之道路資料之具代表性之事件特徵集。

Figure 110139664-A0101-12-0023-17
Table 10: Representative event feature set of road data without accidents in Group 2.
Figure 110139664-A0101-12-0023-17

接著,針對每一群組之複數具代表性之事件特徵,執行一事件特徵比對,找出具潛在事件熱點之探勘目標對象,並且針對該些具潛在事件熱點之探勘目標對象計算出一事件特徵支持度,以據之評估該些具潛在事件熱點之探勘目標對象發生目標事件之好發程度。該事件特徵比對方法係一完全比對方法,例如針對每一群組中每一未發生目標事件之探勘目標對象之事件資料探勘出之複數具代表性之事件特徵,若該些具代表性之事件特徵存在任一事件特徵與已發生目標事件之探勘目標對象之事件資料探勘出之複數具代表性之事件特徵完全相同,則將該探勘目標對象視為具潛在事件熱點之探勘目標對象。 Then, for the plurality of representative event features of each group, perform an event feature comparison to find out the prospecting target objects with potential event hotspots, and calculate an event feature for these prospecting target objects with potential event hotspots The degree of support is used to evaluate the frequency of occurrence of target events for those prospecting targets with potential event hotspots. The event feature comparison method is a complete comparison method, for example, a plurality of representative event features are extracted from the event data of each target object in each group that has not occurred the target event, if these representative If there is any event feature that is exactly the same as a plurality of representative event features mined from the event data of the prospecting target object that has occurred the target event, then the prospecting target object is regarded as a prospecting target object with potential event hotspots.

換言之,針對每一群組中每一未發生事故之道路資料之道路事件探勘出之複數具代表性之事件特徵,若該些具代表性之事件特徵存在任一事件特徵與已發生事故之道路資料之道路事件探勘出之複數具代表性之事件特徵完全相同,表示該些未發生事故之道路資料為潛在事故發生熱點,並可進一步計算出該些潛在事故發生熱點之事件特徵支持度,以評估發生事故之好發程度。 In other words, a plurality of representative event features are detected for each road event of road data without accidents in each group, if there is any event feature in these representative event features The characteristics of multiple representative events extracted from the road event data are exactly the same, indicating that these road data without accidents are potential accident hotspots, and the event feature support degree of these potential accident hotspots can be further calculated. Assess the likelihood of accidents occurring.

以群組二為例,設定一事故好發程度門檻值係例如0.5,其未發生事故之道路資料R5之道路事件取樣結果之具代表性之事件特徵包含(a,c)、(a,e),該些事件特徵存在任一與已發生事故之道路資料之道路事件取樣結果之具代表性之事件特徵(a,e)、(b,e),其(a,e)完全相同,故道路資料R5為潛在事故 發生熱點;再進一步計算其事件特徵支持度係0.5,且其該事件特徵支持度>=0.5,故屬於高好發潛在事故發生熱點。 Taking group 2 as an example, set an accident occurrence threshold value such as 0.5, and the representative event characteristics of the road event sampling results of the road data R 5 without accidents include (a, c), (a, e) These event features have any representative event features (a, e), (b, e) of the road event sampling results of the road data that have occurred accidents, and (a, e) are exactly the same, Therefore, the road data R 5 is a potential accident hotspot; further calculation of its event feature support degree is 0.5, and its event feature support degree >=0.5, so it belongs to a high-prone potential accident hotspot.

以此類推,可得群組二內該些未發生事故之道路資料比對結果,如表十一所示。 By analogy, the road data comparison results of the accident-free roads in Group 2 can be obtained, as shown in Table 11.

表十一:群組二中未發生事故之道路資料之事件特徵比對結果。

Figure 110139664-A0101-12-0024-18
Table 11: Comparison results of event characteristics of road data without accidents in Group 2.
Figure 110139664-A0101-12-0024-18

接著,針對已發生目標事件和具潛在事件熱點之探勘目標對象進行警示。伺服器中儲存有已發生事故和潛在事故發生熱點之道路資料與鄰近基地台對應關係,藉由分析車機回報所屬基地台識別碼,將警示資訊傳送給車機。 Then, alerts are issued for the target events that have occurred and the prospecting target objects with potential event hotspots. The server stores the corresponding relationship between the road data of the accidents and potential accident hotspots and the adjacent base stations, and sends the warning information to the vehicle by analyzing the identification code of the base station reported by the vehicle.

以群組二為例,已發生事故之道路資料(例如R6、R7)和潛在事故發生熱點之道路資料(例如R5)與鄰近基地台對應關係,如表十二所示。 Taking group 2 as an example, the corresponding relationship between the road data (such as R 6 and R 7 ) of accidents and the road data of potential accident hotspots (such as R 5 ) and adjacent base stations is shown in Table 12.

表十二:群組二中需預警之道路資料與基地台識別碼對應表。

Figure 110139664-A0101-12-0024-19
Table 12: Correspondence table between road data requiring early warning and base station identification codes in Group 2.
Figure 110139664-A0101-12-0024-19

綜上所述,本發明之潛在事件熱點探勘之設備及其執行之方法以及電腦程式產品係透過資料蒐集、資料分群、事件探勘、事件特徵比對等手段,找出具潛在事件熱點之探勘目標對象,藉此在交通安全能有效找到未發生 目標事件但具潛在事件熱點的探勘目標對象,並提供預警,以有效降低目標事件發生的頻率。 To sum up, the equipment for potential event hotspot detection and its implementation method and computer program product of the present invention are to find out the prospecting targets with potential event hotspots by means of data collection, data grouping, event detection, event feature comparison, etc. , so that in traffic safety can effectively find the unoccurring Prospecting target objects with potential event hotspots for target events, and providing early warning to effectively reduce the frequency of target events.

上述實施例僅例示性說明本案之功效,而非用於限制本案,任何熟習此項技藝之人士均可在不違背本案之精神及範疇下對上述該些實施態樣進行修飾與改變。因此本案之權利保護範圍,應如後述之申請專利範圍所列。 The above-mentioned embodiments are only illustrative of the effects of this case, and are not intended to limit this case. Any person familiar with this technology can modify and change the above-mentioned implementations without violating the spirit and scope of this case. Therefore, the scope of protection of rights in this case should be listed in the scope of patent application described later.

S201~S205:步驟 S201~S205: steps

Claims (9)

一種用於潛在事件熱點探勘之設備,係包括:探勘目標資料蒐集模組,蒐集複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探勘目標對象之各自的事件;分群模組,根據該複數探勘目標對象之各自的特徵,對該複數探勘目標對象進行分群;事件特徵探勘模組,係根據該複數探勘目標對象之各自的事件是否已發生目標事件,對每一群組中的探勘目標對象進行分區,其中,針對已發生目標事件的探勘目標對象,以發生該目標事件前的一預定時間區間為窗口,對該已發生目標事件的探勘目標對象之各自的事件進行取樣;以及針對未發生目標事件的探勘目標對象,以該預定時間區間作為滑動窗口,對該未發生目標事件的探勘目標對象之各自的事件進行取樣,該事件特徵探勘模組係根據危險指標及發生時間,對該每一群組中每一區的探勘目標對象進行探勘,進而獲得該每一群組中該每一區的探勘目標對象的具代表性之事件特徵,其中,針對該已發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得該已發生目標事件的探勘目標對象的有效事件特徵,再逐漸增加該危險指標的長度,獲得該已發生目標事件的探勘目標對象的具代表性之事件特徵;以及針對該未發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得該未發生目標事件的探勘目標對象之各自的有效事件特徵,再逐漸增加該危險指標的長度,獲得該未發生目標事件的探勘目標對象之各自的具代表性之事件特徵;以及 事件特徵比對模組,將該每一群組中該每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。 A device for potential event hotspot exploration, including: a data collection module for exploration targets, which collects multiple exploration targets, their respective characteristics, and their respective events; a grouping module, According to the respective characteristics of the plurality of exploration target objects, the plurality of exploration target objects are grouped; the event feature detection module is based on whether a target event has occurred in each of the plurality of exploration target objects. Partitioning the prospecting target objects, wherein, for the prospecting target objects where the target event has occurred, using a predetermined time interval before the target event occurs as a window, sampling the respective events of the prospecting target objects where the target event has occurred; and For the prospecting target object without the target event, the predetermined time interval is used as the sliding window to sample the respective events of the prospecting target object without the target event. The event feature exploration module is based on the risk index and the occurrence time, Prospecting the exploration target object in each area in each group, and then obtaining the representative event characteristics of the exploration target object in each area in each group, wherein, for the target event that has occurred The sampled events of the exploration target object are explored with the risk index of length 1 and its corresponding time interval to obtain the effective event characteristics of the exploration target object that has occurred the target event, and then gradually increase the length of the risk index to obtain The representative event characteristics of the exploration target object where the target event has occurred; and for the sampled event of the exploration target object where the target event has not occurred, the risk index with a length of 1 and its corresponding time interval are used for exploration, Obtain the respective effective event characteristics of the prospecting target objects that have not occurred the target event, and then gradually increase the length of the risk index to obtain the respective representative event characteristics of the prospecting target objects that have not occurred the target event; and The event feature comparison module compares the representative event features of the prospecting targets in each region in each group to obtain prospecting targets with potential event hotspots. 如請求項1所述之設備,其中,該分群模組係自該複數探勘目標對象中選取多個探勘目標對象作為中心點,根據該複數探勘目標對象的特徵,計算未被選取的探勘目標對象與該多個中心點之間各自的距離,以將該未被選取的探勘目標對象與距離最近的中心點劃分為同一群,接著計算該同一群中每一探勘目標對象與該同一群中其他探勘目標對象之間的距離和,將具有最小距離和之探勘目標對象選取為該同一群中的新中心點,再以該新中心點重新進行計算直到該新中心點不再更動為止。 The device according to claim 1, wherein the grouping module selects a plurality of prospecting objects from the plurality of prospecting objects as center points, and calculates the unselected prospecting objects according to the characteristics of the plurality of prospecting objects Respective distances between the plurality of center points, so as to divide the unselected prospecting target object and the nearest center point into the same group, and then calculate the relationship between each prospecting target object in the same group and other survey target objects in the same group For the sum of distances between prospecting target objects, select the prospecting target object with the smallest distance sum as the new center point in the same group, and then recalculate with the new center point until the new center point does not change. 如請求項1所述之設備,其中,該事件特徵比對模組係針對該具潛在事件熱點之探勘目標對象,評估該具潛在事件熱點之探勘目標對象發生該目標事件的風險。 The device according to claim 1, wherein the event feature comparison module evaluates the risk of the target event occurring on the prospecting target object with potential event hotspots for the prospecting target object with potential event hotspots. 如請求項1所述之設備,其中,該複數探勘目標對象係為複數道路資料,該複數探勘目標對象之各自的特徵係為該複數道路資料之各自的道路特性,而該複數探勘目標對象之各自的事件係包括由車機編號、危險指標、回報時間、是否發生事故所組成的集合。 The device as described in Claim 1, wherein the plurality of prospecting objects are plural road data, the respective characteristics of the plurality of prospecting objects are the respective road characteristics of the plurality of road data, and the plurality of prospecting objects are Each event system includes a set consisting of vehicle serial number, danger indicator, return time, and whether an accident occurs. 如請求項4所述之設備,進一步包括:事件熱點預警模組,根據對應於已發生事故以及潛在事故發生熱點的道路資料之基地台識別碼,對車機發出預警。 The device as described in claim 4 further includes: an event hotspot early warning module, which sends an early warning to the vehicle according to the base station identification code corresponding to the road data of the hot spots of the accidents that have occurred and potential accidents. 一種由伺服器所執行之用於潛在事件熱點探勘之方法,係包括:蒐集複數探勘目標對象、該複數探勘目標對象之各自的特徵、該複數探 勘目標對象之各自的事件;根據該複數探勘目標對象之各自的特徵,對該複數探勘目標對象進行分群;根據該複數探勘目標對象之各自的事件是否已發生目標事件,對每一群組中的探勘目標對象進行分區,其中,針對已發生目標事件的探勘目標對象,以發生該目標事件前的一預定時間區間為窗口,對該已發生目標事件的探勘目標對象的各自的事件進行取樣;以及針對未發生目標事件的探勘目標對象,以該預定時間區間作為滑動窗口,對該未發生目標事件的探勘目標對象的各自的事件進行取樣;根據危險指標及發生時間,對該每一群組中每一區的探勘目標對象進行探勘,獲得該每一群組中該每一區的探勘目標對象的具代表性之事件特徵,其中,針對該已發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得該已發生目標事件的探勘目標對象的有效事件特徵,再逐漸增加該危險指標的長度,獲得該已發生目標事件的探勘目標對象的具代表性之事件特徵;以及針對該未發生目標事件的探勘目標對象之經取樣的事件,以長度為1的危險指標及其對應的時間區間進行探勘,獲得該未發生目標事件的探勘目標對象之各自的有效事件特徵,再逐漸增加該危險指標的長度,獲得該未發生目標事件的探勘目標對象之各自的具代表性之事件特徵;以及將該每一群組中該每一區的探勘目標對象的該具代表性之事件特徵相互比對,以獲得具潛在事件熱點之探勘目標對象。 A method for potential event hotspot detection performed by a server, comprising: collecting a plurality of detection objects, respective characteristics of the plurality of detection objects, the plurality of detection According to the respective events of the plurality of exploration target objects; according to the respective characteristics of the plurality of exploration target objects, group the plurality of exploration target objects; Partitioning the prospecting target objects, wherein, for the prospecting target objects where the target events have occurred, the respective events of the prospecting target objects where the target events have occurred are sampled using a predetermined time interval before the target event occurs as a window; And for the prospecting target objects that have not occurred target events, use the predetermined time interval as a sliding window to sample the respective events of the prospecting target objects that have not occurred target events; Prospecting for the exploration target object in each area in each group, to obtain the representative event characteristics of the exploration target object in each area in each group, wherein, for the exploration target object that has occurred in the target event, the sampled Events, the risk index with a length of 1 and its corresponding time interval are used for exploration to obtain the effective event characteristics of the exploration target object of the target event that has occurred, and then gradually increase the length of the risk index to obtain the exploration of the target event that has occurred The representative event characteristics of the target object; and for the sampled events of the exploration target object where the target event has not occurred, the risk index with a length of 1 and its corresponding time interval are used for exploration to obtain the Exploring the respective effective event characteristics of the target object, and then gradually increasing the length of the risk indicator, to obtain the respective representative event characteristics of the exploration target object that has not occurred the target event; The representative event characteristics of the exploration target objects in the area are compared with each other to obtain the exploration target objects with potential event hotspots. 如請求項6所述之方法,其中,所述分群之方式包括:(1)自該複數探勘目標對象中選取多個探勘目標對象作為中心點;(2)根據該複數探勘目標對象之各自的特徵,計算未被選取的探勘目標對象與該多個中心點之間各自的距離,以將該未被選取的探勘目標對象與距離最近的中心點劃分為同一群;(3)計算該同一群中每一探勘目標對象與該同一群中其他探勘目標對象之間的距離和,以將具有最小距離和的探勘目標對象選取為該同一群中的新中心點;以及(4)以該新中心點重複步驟(2)和(3),直到該新中心點不再更動為止。 The method as described in claim 6, wherein the grouping method includes: (1) selecting a plurality of prospecting objects from the plurality of prospecting objects as center points; (2) selecting multiple prospecting objects according to the respective feature, calculate the respective distances between the unselected prospecting target object and the plurality of center points, so as to divide the unselected prospecting target object and the nearest center point into the same group; (3) calculate the same group The sum of distances between each prospecting target object and other prospecting target objects in the same group, so as to select the prospecting target object with the smallest distance sum as the new center point in the same group; and (4) use the new center Repeat steps (2) and (3) until the new center point does not change. 如請求項6所述之方法,其中,所述比對之方式包括:針對該具潛在事件熱點之探勘目標對象,評估該具潛在事件熱點之探勘目標對象發生目標事件的風險。 The method as described in claim 6, wherein the comparison method includes: for the prospecting target object with potential event hotspots, assessing the risk of a target event occurring at the prospecting target object with potential event hotspots. 一種電腦程式產品,經由電腦載入程式後執行如請求項6-8任一項所述之方法。 A computer program product, which executes the method described in any one of claims 6-8 after the program is loaded into the computer.
TW110139664A 2021-10-26 2021-10-26 Device, method and computer program product for exploration of potential event hotspot TWI786902B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
TW110139664A TWI786902B (en) 2021-10-26 2021-10-26 Device, method and computer program product for exploration of potential event hotspot

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
TW110139664A TWI786902B (en) 2021-10-26 2021-10-26 Device, method and computer program product for exploration of potential event hotspot

Publications (2)

Publication Number Publication Date
TWI786902B true TWI786902B (en) 2022-12-11
TW202318220A TW202318220A (en) 2023-05-01

Family

ID=85794970

Family Applications (1)

Application Number Title Priority Date Filing Date
TW110139664A TWI786902B (en) 2021-10-26 2021-10-26 Device, method and computer program product for exploration of potential event hotspot

Country Status (1)

Country Link
TW (1) TWI786902B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201508680A (en) * 2013-08-22 2015-03-01 Microsoft Corp Realtime activity suggestion from social and event data
CN110555568A (en) * 2019-09-12 2019-12-10 重庆交通大学 Road traffic running state real-time perception method based on social network information
CN111353637A (en) * 2020-02-24 2020-06-30 北京工业大学 Space-time sequence-based large-scale activity emergency prediction layered framework and method
US20200257992A1 (en) * 2014-05-23 2020-08-13 DataRobot, Inc. Systems for time-series predictive data analytics, and related methods and apparatus
CN112287118A (en) * 2020-10-30 2021-01-29 西南电子技术研究所(中国电子科技集团公司第十研究所) Event pattern frequent subgraph mining and predicting method
CN112418269A (en) * 2020-10-23 2021-02-26 西安电子科技大学 Method, system and medium for predicting social media network event propagation key time
WO2021178385A1 (en) * 2020-03-02 2021-09-10 Strong Force Intellectual Capital, Llc Intelligent transportation systems including digital twin interface for a passenger vehicle

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW201508680A (en) * 2013-08-22 2015-03-01 Microsoft Corp Realtime activity suggestion from social and event data
US20200257992A1 (en) * 2014-05-23 2020-08-13 DataRobot, Inc. Systems for time-series predictive data analytics, and related methods and apparatus
CN110555568A (en) * 2019-09-12 2019-12-10 重庆交通大学 Road traffic running state real-time perception method based on social network information
CN111353637A (en) * 2020-02-24 2020-06-30 北京工业大学 Space-time sequence-based large-scale activity emergency prediction layered framework and method
WO2021178385A1 (en) * 2020-03-02 2021-09-10 Strong Force Intellectual Capital, Llc Intelligent transportation systems including digital twin interface for a passenger vehicle
CN112418269A (en) * 2020-10-23 2021-02-26 西安电子科技大学 Method, system and medium for predicting social media network event propagation key time
CN112287118A (en) * 2020-10-30 2021-01-29 西南电子技术研究所(中国电子科技集团公司第十研究所) Event pattern frequent subgraph mining and predicting method

Also Published As

Publication number Publication date
TW202318220A (en) 2023-05-01

Similar Documents

Publication Publication Date Title
WO2020244288A1 (en) Method and apparatus for evaluating truck driving behaviour based on gps trajectory data
CN103971523B (en) A kind of mountain road traffic safety dynamic early-warning system
CN116644373B (en) Automobile flow data analysis management system based on artificial intelligence
CN110929939B (en) Landslide hazard susceptibility spatial prediction method based on clustering-information coupling model
CN102982081A (en) Method and system for screening potential traffic safety hazard road sections
Weng et al. Real-time bus travel speed estimation model based on bus GPS data
Ma et al. Modeling crash risk of horizontal curves using large-scale auto-extracted roadway geometry data
Raju et al. Examining smoothening techniques for developing vehicular trajectory data under heterogeneous conditions
Qaseem Ghadi et al. Comparison of different road segmentation methods
Ou et al. Systematic clustering method to identify and characterise spatiotemporal congestion on freeway corridors
Sousa et al. Violence in public transportation: an approach based on spatial analysis
TWI786902B (en) Device, method and computer program product for exploration of potential event hotspot
CN114299742A (en) Dynamic recognition and updating recommendation method for speed limit information of expressway
Bhattarai et al. Proactive Safety Analysis Using Roadside LiDAR Based Vehicle Trajectory Data: A Study of Rear-End Crashes
Farooq The Effects of Inaccurate and Missing Highway-Rail Grade Crossing Inventory Data on Crash and Severity Model Estimation and Prediction
Imprialou et al. Multilevel logistic regression modeling for crash mapping in metropolitan areas
CN110020686A (en) A kind of road surface method for detecting abnormality based on intelligent perception sensing data
CN113379334B (en) Road section bicycle riding quality identification method based on noisy track data
CN115565373A (en) Real-time risk prediction method, device, equipment and medium for highway tunnel accident
Patil et al. Road accident analysis and hotspot prediction using clustering
CN114360735A (en) Infectious disease time-space aggregation detection and analysis method and system and electronic equipment
CN108986542B (en) Automatic distinguishing method for urban intersection accident potential black points
Feng et al. Formulating an Innovative Spatial-Autocorrelation-based Method for Identifying Road Accident Hot Zones
Andrášik et al. Identification of Curves and Straight Sections on Road Networks from Digital Vector Data
Gong The road traffic safety risk projection based on improved random forest.