CN102855272B - A kind of micro-blog contains the D-S evidence theory method of traffic information fusion - Google Patents
A kind of micro-blog contains the D-S evidence theory method of traffic information fusion Download PDFInfo
- Publication number
- CN102855272B CN102855272B CN201210243199.1A CN201210243199A CN102855272B CN 102855272 B CN102855272 B CN 102855272B CN 201210243199 A CN201210243199 A CN 201210243199A CN 102855272 B CN102855272 B CN 102855272B
- Authority
- CN
- China
- Prior art keywords
- micro
- blog
- traffic
- microblog
- traffic information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000034 method Methods 0.000 title claims abstract description 42
- 230000004927 fusion Effects 0.000 title claims abstract description 33
- 238000011156 evaluation Methods 0.000 claims abstract description 12
- 238000004364 calculation method Methods 0.000 claims abstract description 7
- 230000010354 integration Effects 0.000 claims abstract description 3
- 230000006870 function Effects 0.000 claims description 5
- 230000005284 excitation Effects 0.000 claims 1
- 230000015572 biosynthetic process Effects 0.000 abstract description 8
- 238000003786 synthesis reaction Methods 0.000 abstract description 8
- 238000005315 distribution function Methods 0.000 abstract description 6
- 238000001035 drying Methods 0.000 description 4
- 238000005516 engineering process Methods 0.000 description 4
- 230000011218 segmentation Effects 0.000 description 4
- 238000007781 pre-processing Methods 0.000 description 3
- 238000000605 extraction Methods 0.000 description 2
- 244000097202 Rathbunia alamosensis Species 0.000 description 1
- 235000009776 Rathbunia alamosensis Nutrition 0.000 description 1
- 238000013473 artificial intelligence Methods 0.000 description 1
- 230000033228 biological regulation Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000001514 detection method Methods 0.000 description 1
- 238000013210 evaluation model Methods 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 238000007500 overflow downdraw method Methods 0.000 description 1
Landscapes
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明公开了一种微博客蕴含交通信息融合的D‑S证据理论方法,所述方法包括:根据交通拥挤程度,确定证据理论辨识框架及命题空间;根据时间窗口及道路路段,抓取与交通信息主题相关的微博客内容,组成需要融合的微博客数据集;微博客数据集预处理;计算微博客数据集交通状态描述词汇的词义相似度,引入中文语料库资源,提高词义相似度计算精度;计算微博客词义相似度加权评价,构建证据理论基本概率分配函数;根据Dempster组合规则对多条微博客证据源,进行证据合成,确定辨识空间内各命题的信任区间,选取信任区间中信任函数最大的命题作为融合结果。利用本发明可以实现微博客蕴含交通信息的融合,为城市交通信息的采集提供一种重要数据源。
The invention discloses a D-S evidence theory method for microblog containing traffic information fusion. The method includes: determining the identification framework and proposition space of the evidence theory according to the degree of traffic congestion; The microblog content related to the information theme is composed of the microblog data set that needs to be integrated; the microblog data set is preprocessed; the word meaning similarity of the traffic state description vocabulary of the microblog data set is calculated, and the Chinese corpus resources are introduced to improve the calculation accuracy of the word meaning similarity; Calculate the weighted evaluation of microblog word meaning similarity, construct the basic probability distribution function of evidence theory; perform evidence synthesis on multiple microblog evidence sources according to the Dempster combination rule, determine the trust interval of each proposition in the identification space, and select the trust interval with the largest trust function propositions as a fusion result. The invention can realize the integration of traffic information contained in microblogs, and provide an important data source for the collection of urban traffic information.
Description
技术领域 technical field
本发明涉及移动位置服务、互联网空间信息搜索,移动互联网络技术,具体涉及一种微博客蕴含交通信息融合的D-S证据理论方法。 The invention relates to mobile location service, Internet space information search, and mobile Internet technology, in particular to a D-S evidence theory method for fusion of traffic information contained in microblogs.
背景技术 Background technique
实时交通信息能够缓解交通拥堵、提高交通运输效率,保障交通安全,方便公众出行,现有的交通信息获取方式主要包括固定传感器技术(感应线圈、视频监控和微波探测),安装GPS和无线通讯设备的浮动车技术、移动通讯终端信令分析技术等,但这些采集手段在获取临时交通管制限制信息以及应对突发性交通事件等方面仍存在很大局限。微博客中蕴含着丰富的时效性很高的实时交通信息,涵盖各种交通信息类型,例如包括道路交通流、道路畅通度和行驶速度、交通限制、临时交通管制、突发性交通事件、针对特定地点的交通状态描述信息等,获取微博客中蕴含的高动态的实时交通信息能够弥补现有交通信息采集手段的不足。 Real-time traffic information can alleviate traffic congestion, improve transportation efficiency, ensure traffic safety, and facilitate public travel. The existing methods of obtaining traffic information mainly include fixed sensor technology (induction coils, video surveillance, and microwave detection), and the installation of GPS and wireless communication equipment. However, these collection methods still have great limitations in obtaining temporary traffic control restriction information and responding to sudden traffic incidents. Microblogs contain a wealth of real-time traffic information with high timeliness, covering various types of traffic information, such as road traffic flow, road smoothness and driving speed, traffic restrictions, temporary traffic control, sudden traffic incidents, The traffic status description information of a specific location, etc., and the acquisition of highly dynamic real-time traffic information contained in microblogs can make up for the shortcomings of existing traffic information collection methods.
然而,微博客消息的高动态性、模糊性及其不同微博客用户发布消息的描述差异性使得信息融合成为信息提取的瓶颈问题,直接影响了微博客蕴含实时交通信息的利用。微博客交通信息融合是对不同微博客消息所蕴含的交通信息内容进行推理决策,获取准确的交通状态描述信息,更好地服务于交通管理与出行服务。微博客交通信息融合的难点在于:(1)微博客消息的非结构化特征造成语义理解困难:由于微博客消息内容精简,仅140字左右,且口语化特征明显,含有较多冗余内容,给自动化的语义判断与提取造成很大压力;(2)不同微博客用户发布消息对交通状态的描述差异造成信息汇集矛盾:在一定时间段内,可能存在描述同一路段交通状态的多条微博客消息。针对同一路况,不同用户的描述可能差异很大,有些描述甚至语义相斥。 However, the high dynamics and ambiguity of microblog messages and the differences in the descriptions of messages posted by different microblog users make information fusion a bottleneck in information extraction, which directly affects the utilization of real-time traffic information contained in microblogs. Microblog traffic information fusion is to reason and make decisions on traffic information content contained in different microblog messages, obtain accurate description information of traffic status, and better serve traffic management and travel services. The difficulty of microblog traffic information fusion lies in: (1) The unstructured characteristics of microblog messages cause semantic understanding difficulties: because the content of microblog messages is simplified, only about 140 characters, and the characteristics of colloquial language are obvious, it contains more redundant content. It puts a lot of pressure on automatic semantic judgment and extraction; (2) Different microblog users post messages that describe traffic conditions differently, causing information collection contradictions: within a certain period of time, there may be multiple microblogs describing the traffic conditions of the same road segment information. For the same road condition, the descriptions of different users may be very different, and some descriptions are even semantically mutually exclusive.
为解决该问题,目前采用的技术为文本聚类方法,当文本具有一定词汇数量时,文本聚类过程才能够确定准确的文本主题描述。但微博客消息内容短小,在经过分词、词义消歧、词义干化等过程后,可以利用的交通状态描述关键词汇很少。因此,文本聚类并不能很好地解决微博客消息描述的模糊性以及不同微博客用户消息描述的差异性。 To solve this problem, the technology currently used is the text clustering method. When the text has a certain number of words, the text clustering process can determine the accurate text topic description. However, the content of microblog messages is short, and after the process of word segmentation, word meaning disambiguation, and word meaning drying, there are very few key words that can be used to describe traffic conditions. Therefore, text clustering cannot well solve the ambiguity of microblog message descriptions and the differences in message descriptions of different microblog users.
为此,本专利针对上述信息融合问题,提出一种基于D-S证据理论的微博客交通信息融合方法。该方法通过引入中文语料库资源,来丰富微博客的语义信息,解决微博客描述的模糊性问题。在中文语料库知识基础之上,实现了对微博客内容进行词义相似度加权评价,接着利用证据理论来处理由于不同微博用户的差异性造成的信息融合的不确定性推理问题,从而确定融合结果。给动态交通信息收集提供了一种新的解决方案,弥补传统的动态交通信息采集技术相对比较薄弱的环节。 Therefore, this patent proposes a microblog traffic information fusion method based on the D-S evidence theory for the above information fusion problem. This method enriches the semantic information of microblogs by introducing Chinese corpus resources, and solves the fuzzy problem of microblog descriptions. Based on the knowledge of the Chinese corpus, the weighted evaluation of word meaning similarity to the microblog content is realized, and then the evidence theory is used to deal with the uncertainty reasoning problem of information fusion caused by the differences of different microblog users, so as to determine the fusion result . It provides a new solution for dynamic traffic information collection and makes up for the relatively weak link of traditional dynamic traffic information collection technology.
发明内容 Contents of the invention
本发明要解决的技术问题是:针对目前微博客中蕴含的大量交通信息难以充分融合利用,传统动态交通信息收集方法,很难及时反应突发性的路况信息的现状。本发明提出一种微博客蕴含交通信息融合的D-S证据理论方法,为动态交通信息的收集提供又一重要的数据源。该方法解决了微博客消息的非结构化特征造成语义理解困难以及不同微博客用户发布消息对交通状态的描述差异造成信息汇集矛盾,可直接应用于个人及车载导航、移动位置服务、地图网站、专业的出行信息服务平台、物流调度以及交通应急预案。 The technical problem to be solved by the present invention is: it is difficult to fully integrate and utilize a large amount of traffic information contained in the current microblog, and it is difficult for the traditional dynamic traffic information collection method to respond to the current situation of sudden road condition information in a timely manner. The invention proposes a D-S evidence theory method of microblog containing traffic information fusion, which provides another important data source for the collection of dynamic traffic information. This method solves the difficulty in semantic understanding caused by the unstructured features of microblog messages and the contradictions in information collection caused by the differences in the description of traffic status in the messages posted by different microblog users. It can be directly applied to personal and vehicle navigation, mobile location services, map websites, Professional travel information service platform, logistics dispatch and traffic emergency plan.
本发明的技术解决方案为: Technical solution of the present invention is:
一种微博客蕴含交通信息融合的D-S证据理论方法,包括: A D-S evidence theory method for the fusion of traffic information in microblog, including:
根据交通拥挤程度,确定证据理论辨识框架Θ及命题空间2Θ; According to the degree of traffic congestion, determine the evidence theory identification framework Θ and the proposition space 2 Θ ;
根据有效时间窗Tinterval和路网路段road,抓取与交通信息主题相关的微博客内容,组成需要融合的微博客数据集V; According to the effective time window T interval and the road network section road, capture the microblog content related to the topic of traffic information, and form the microblog data set V that needs to be fused;
微博客信息Vi预处理操作,包括自然语言分词、词义消歧、词义干化,得到微博客交通状态描述词汇集Wi; The microblog information V i preprocessing operation includes natural language word segmentation, word meaning disambiguation, and word meaning drying to obtain the microblog traffic state description vocabulary set W i ;
引入中文语料库资源Corpus={Cwikipedia,Chownet,...,Cnum},计算微博客交通状态词汇集Wi与命题空间中词汇的词义相似度Sim; Introduce the Chinese corpus resource Corpus={C wikipedia , C hownet ,..., C num }, and calculate the word sense similarity Sim between the microblog traffic state vocabulary set W i and the words in the proposition space;
计算微博客消息Vi的词义相似度加权评价Scorei,确定证据理论基本概率分配函数m(Vi); Calculate the word meaning similarity weighted evaluation Score i of the microblog message V i , and determine the basic probability distribution function m(V i ) of the evidence theory;
通过Dempster合成法则进行证据合成及证据决策,确定该路段road微博客蕴含交通信息融合结果TStateroad; Carry out evidence synthesis and evidence decision-making through the Dempster synthesis rule, and determine that the road microblog of this road section contains the traffic information fusion result TState road ;
优选地,所述根据交通拥挤程度确定证据理论辨识框架包括: Preferably, the identification framework for determining the evidence theory according to the degree of traffic congestion includes:
交通拥挤程度的确定可参照部分国家标准,如公安部2002年公布的《城市交通管理评价指标体系》对路网交通拥挤程度分类; The determination of traffic congestion can refer to some national standards, such as the "Urban Traffic Management Evaluation Index System" published by the Ministry of Public Security in 2002 to classify road network traffic congestion;
交通拥挤程度的确定与实际融合需求相关; The determination of the degree of traffic congestion is related to the actual integration needs;
证据理论辨识框架及命题空间不限制大小; There is no limit to the size of evidence theory identification framework and proposition space;
优选地,所述根据有效时间窗口及路网路段,抓取交通信息主题相关微博客,组成需要融合微博客数据集V包括: Preferably, according to the effective time window and the road network section, grabbing the relevant microblogs of traffic information topics, forming the microblog data set V that needs to be fused includes:
有效时间窗口T定义为交通信息时间Tcurrent进行扩充而形成的时间段,即T=[Tcurrent-Δta,Tcurrent+Δtb]其中Δta与Δtb用户定义参数; The effective time window T is defined as the time period formed by expanding the traffic information time T current , that is, T=[T current -Δt a , T current +Δt b ] where Δt a and Δt b are user-defined parameters;
路网中路段以路网道路名称为研究对象或者以导航路网中的道路分段为研究对象; The road section in the road network takes the road name of the road network as the research object or takes the road segment in the navigation road network as the research object;
微博客数据集V与实际选取的路网路段相关; The microblog data set V is related to the actual selected road network section;
微博客数据集V的构建过程是本专利不涉及的; The construction process of the microblog dataset V is not involved in this patent;
微博客数据集V不限制其存储形式,可以是数据库或者数据文件; Microblog dataset V does not limit its storage form, it can be a database or a data file;
优选地,所述对引入中文语料库资源Corpus,计算微博客交通状态词汇集Wi与命题空间中词汇的词义相似度Sim包括: Preferably, the introduction of the Chinese corpus resource Corpus to calculate the word sense similarity Sim between the microblog traffic state vocabulary set Wi and the vocabulary in the proposition space includes:
中文语料库资源Corpus不限制语料库类型,可以为维基百科,知网等; The Chinese corpus resource Corpus does not limit the type of corpus, it can be Wikipedia, HowNet, etc.;
词义相似度Sim的计算过程是本专利不涉及的; The calculation process of word meaning similarity Sim is not involved in this patent;
优选地,所述微博客内容的词义相似度相似度加权评价Score,确定微博客的基本概率分配函数m(Vi)包括: Preferably, the word meaning similarity similarity weighted evaluation Score of described micro-blog content, the basic probability distribution function m (V i ) that determines micro-blog comprises:
微博客内容词义相似度加权评价计算,具体为: The weighted evaluation calculation of word meaning similarity in microblog content is as follows:
其中,k为证据理论命题空间中的命题,term微博客交通状态描述词汇集Wi词汇,sum为微博客的数量,num(k)为包含命题k的微博客数量,boost(user)为该微博客用户的激励函数,反映了该用户的重要程度,默认值为1,该值越大说明该用户越重要。 Among them, k is a proposition in the proposition space of evidence theory, term microblog traffic state description vocabulary set W i vocabulary, sum is the number of microblogs, num(k) is the number of microblogs containing proposition k, boost(user) is the The incentive function of a microblog user reflects the importance of the user. The default value is 1. The larger the value, the more important the user is.
微博客内容的基本概率分配函数计算,具体为: The calculation of the basic probability distribution function of microblog content is as follows:
优选地,所述方法还包括: Preferably, the method also includes:
根据用户提供的所要融合路段及融合时间条件,完成所述微博客蕴含交通信息融合; According to the desired fusion road section and fusion time conditions provided by the user, complete the fusion of traffic information contained in the microblog;
根据用户提供的所要融合区域内路段及融合时间条件,完成所述微博客蕴 含交通信息融合; According to the road section and fusion time condition in the desired fusion area provided by the user, complete the fusion of traffic information contained in the microblog;
本发明与现有技术相比的优点在于:本发明克服了微博客蕴含交通信息难以充分融合利用的缺点;通过引入中文语料库资源,丰富微博客语义理解过程,有效解决了微博客消息的非结构化特征造成语义理解困难;利用D-S证据理论在处理不确定信息的表达和合成方面的优势,解决了不同微博客用户发布消息对交通状态的描述差异造成信息汇集矛盾。 Compared with the prior art, the present invention has the advantages that: the present invention overcomes the shortcoming that the traffic information contained in the micro-blog is difficult to be fully integrated and utilized; by introducing Chinese corpus resources, the semantic understanding process of the micro-blog is enriched, and the unstructured information of the micro-blog is effectively solved. Difficulties in understanding semantics due to culturalized features; using the advantages of D-S evidence theory in dealing with the expression and synthesis of uncertain information, it solves the contradictions in information collection caused by the differences in the descriptions of traffic states in the messages posted by different microblog users.
附图说明 Description of drawings
图1为本发明实施例微博客蕴含交通信息融合D-S证据理论方法流程图; Fig. 1 is the flow chart of the theoretical method of D-S evidence fusion of traffic information contained in the microblog of the embodiment of the present invention;
图2为本发明实施例案例实施方式流程图 Fig. 2 is the flow chart of the implementation of the case of the embodiment of the present invention
图3为本发明实施例案例微博客信息记录集 Fig. 3 is the case microblog information record collection of the embodiment of the present invention
图4为本发明实施例案例维基百科数据源 Fig. 4 is the Wikipedia data source of the case of the embodiment of the present invention
具体实施方式 detailed description
为了使本技术领域的人员更好地理解本发明实施例的方案,下面结合附图和实施方式对本发明实施例作进一步的详细说明。 In order to enable those skilled in the art to better understand the solutions of the embodiments of the present invention, the embodiments of the present invention will be further described in detail below in conjunction with the drawings and implementations.
如图1所示,是本发明实施例一种微博客蕴含交通信息融合的D-S证据理论方法流程图,包括以下步骤: As shown in Figure 1, it is a flow chart of a D-S evidence theory method for the fusion of traffic information contained in a microblog according to an embodiment of the present invention, including the following steps:
步骤101,根据交通拥挤程度,确定证据理论辨识框架Θ及命题空间2Θ; Step 101, according to the degree of traffic congestion, determine evidence theory identification framework Θ and proposition space 2 Θ ;
步骤102,根据有效时间窗Tinterval和路网路段road,抓取与交通信息主题相关的微博客内容,组成需要融合的微博客数据集V; Step 102, according to the effective time window T interval and the road network section road, capture the microblog content related to the traffic information topic, and form the microblog data set V that needs to be fused;
步骤103,微博客信息Vi预处理操作,包括自然语言分词、词义消歧、词义干化,得到微博客交通状态描述词汇集Wi; Step 103, the microblog information V i preprocessing operation, including natural language word segmentation, word meaning disambiguation, and word meaning drying, to obtain the microblog traffic state description vocabulary set W i ;
步骤104,引入中文语料库资源Corpus={Cwikipedia,Chownet,...,Cnum},计算微博客交通状态词汇集Wi与命题空间中词汇的词义相似度Sim; Step 104, introducing the Chinese corpus resource Corpus={C wikipedia , C hownet , ..., C num }, calculating the word sense similarity Sim between the microblog traffic state vocabulary set W i and the vocabulary in the proposition space;
步骤105,计算微博客消息Vi的词义相似度加权评价Scorei,确定证据理论基本概率分配函数m(Vi); Step 105, calculating the word meaning similarity weighted evaluation Score i of the microblog message V i , and determining the basic probability distribution function m(V i ) of the evidence theory;
步骤106,通过Dempster合成法则进行证据合成及证据决策,确定该路段road微博客蕴含交通信息融合结果TStateroad; Step 106, carry out evidence synthesis and evidence decision-making through Dempster synthesis rule, and determine that the road section road microblog contains traffic information fusion result TState road ;
下面进一步以北京市道路网举例详细说明本发明实施例的实际应用过程。 The actual application process of the embodiment of the present invention will be further described in detail below by taking the road network of Beijing as an example.
如图2所示,是本发明一种微博客蕴含交通信息融合的D-S证据理论方法实施例案例实施方式流程图。本实施例在以本实施方式为前提下进行实施,给 出了详细的实施方式和具体的操作过程,但本发明的保护范围不限于下述的实施例,本实施例的具体过程包括以下的步骤: As shown in FIG. 2 , it is a flow chart of a case implementation of a D-S evidence theory method embodiment of a microblog containing traffic information fusion in the present invention. This embodiment is implemented under the premise of this embodiment, and detailed implementation and specific operation process are provided, but the scope of protection of the present invention is not limited to the following embodiments, and the specific process of this embodiment includes the following step:
步骤201,借鉴我国公安部2002年公布的《城市交通管理评价系标体系》中规定,其交通拥挤程度主要包含以下四种:畅通(机动车平均行驶速度大于30km/h)、轻度拥挤(机动车平均行驶速度在20~30km/h)、拥挤(机动车平均行驶速度在10~20km/h),拥堵(机动车平均行驶速度低于10km/h)。由于微博客内容是对交通状况的概况描述,不难以精确到速度级别,为了保证微博客信息融合的精度,将轻度拥挤与拥挤合并,定义辨识框架为; Step 201, referring to the regulations in the "Urban Traffic Management Evaluation System" promulgated by the Ministry of Public Security of my country in 2002, its traffic congestion mainly includes the following four types: smooth (the average speed of motor vehicles is greater than 30km/h), mild congestion ( The average speed of motor vehicles is 20-30km/h), congestion (the average speed of motor vehicles is 10-20km/h), congestion (the average speed of motor vehicles is less than 10km/h). Since the microblog content is an overview description of traffic conditions, it is not difficult to be accurate to the speed level. In order to ensure the accuracy of microblog information fusion, mild congestion and congestion are combined, and the identification framework is defined as;
Θ={畅通,拥挤,拥堵} Θ = {smooth, congested, congested}
则命题空间2Θ为: Then the proposition space 2 Θ is:
步骤202,循环遍历北京市道路网中的所有路段,选定处理路网中东五环路段; Step 202, loop through all road sections in the Beijing road network, and select and process the Middle East Fifth Ring Road section of the road network;
步骤203,判断路网中所有路段是否都已经处理完毕,如果是,执行步骤217;如果否,则执行步骤204; Step 203, judging whether all road sections in the road network have been processed, if yes, execute step 217; if not, execute step 204;
步骤204,构建时间窗口,通过搜索新浪微博客、网易微博客、搜狐微博客以及腾讯微博客等关于北京市动态交通信息的微博客内容建立与与东五环相关交通信息主题的微博客信息记录集,如图3所示; Step 204, constructing a time window, and establishing a microblog information record related to traffic information topics related to the East Fifth Ring Road by searching microblog content such as Sina microblog, Netease microblog, Sohu microblog, and Tencent microblog about dynamic traffic information in Beijing set, as shown in Figure 3;
步骤205,微博客信息预处理,自然语言分词、词义消歧、词义干化,得到微博客交通状态描述词汇,然后执行步骤206; Step 205, microblog information preprocessing, natural language word segmentation, word meaning disambiguation, word meaning drying, to obtain microblog traffic state description vocabulary, and then perform step 206;
步骤206,下载并解析维基百科数据源(如图4所示),采用文献(Milne D,Witten IH.An effective low-cost measure of semantic relatedness obtained from Wikipedia links.In Proceedings of the AAAI 2008 Workshop on Wikipedia and Artificial Intelligence)的方法建立语义模型,计算微博客交通状态词汇的语义相似度,然后执行步骤207; Step 206, download and analyze the Wikipedia data source (as shown in Figure 4), using the literature (Milne D, Witten IH.An effective low-cost measure of semantic relatedness obtained from Wikipedia links.In Proceedings of the AAAI 2008 Workshop on Wikipedia and Artificial Intelligence) method sets up semantic model, calculates the semantic similarity of micro-blog traffic state vocabulary, then executes step 207;
步骤207,计算微博客消息词义相似度加权评价,确定证据理论的基本概率分配函数,则执行步骤208; Step 207, calculate the weighted evaluation of the word meaning similarity of the microblog message, determine the basic probability distribution function of the evidence theory, then execute step 208;
步骤208,通过Dempster合成法则进行证据合成及证据决策,确定该路段微博客蕴含交通信息融合结果,则执行步骤209; Step 208, carry out evidence synthesis and evidence decision-making through Dempster synthesis rule, and determine that the microblog of this road section contains traffic information fusion results, then execute step 209;
步骤209,判断北京市所有路段是否都已经处理完毕,如果是,执行步骤 217;如果否,则执行步骤210; Step 209, judge whether all road sections in Beijing have been processed, if yes, execute step 217; if not, execute step 210;
步骤210,结束本次计算过程; Step 210, end the calculation process;
可见,本发明实施例一种微博客蕴含交通信息融合的D-S证据理论方法,可以解决微博客蕴含交通信息融合过程中的语义理解困难与多微博用户之间的描述差异性问题,为动态交通信息收集提供一种新的信息采集方案,准确快速反应突发路况信息。本发明可以直接应用于动态交通信息的发布,服务于地图网站系统、公共出行信息平台和移动位置服务。 It can be seen that in the embodiment of the present invention, a D-S evidence theory method for the fusion of traffic information contained in microblogs can solve the problem of semantic understanding difficulties in the process of fusion of traffic information contained in microblogs and the problem of description differences between multiple microblog users, providing dynamic traffic information. Information collection provides a new information collection scheme to accurately and quickly respond to unexpected road condition information. The invention can be directly applied to release of dynamic traffic information, and serves map website system, public travel information platform and mobile location service.
需要说明的是,本发明实施例的方法适用于所有城市道路网络的动态信息收集;本发明不限制微博客内容的抓取方式及微博客内容的提供网站;本发明不只局限于具体实施方式中所采用的词义相似度计算方法以及微博用户的词义相似度加权评价模型。 It should be noted that the method of the embodiment of the present invention is applicable to the dynamic information collection of all urban road networks; the present invention does not limit the grabbing method of microblog content and the website providing microblog content; the present invention is not limited to the specific implementation The word-sense similarity calculation method used and the weighted evaluation model of word-sense similarity for Weibo users.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分步骤是可以通过程序来指令相关的硬件来完成,所述的程序可以存储于一计算机可读取存储介质中,所述的存储介质,如:ROM/RAM、磁碟、光盘等。 Those of ordinary skill in the art can understand that all or part of the steps in the method of the above-mentioned embodiments can be completed by instructing related hardware through a program, and the program can be stored in a computer-readable storage medium, and the storage Media, such as: ROM/RAM, disk, CD, etc.
以上对本发明实施例进行了详细介绍,本文中应用了具体实施方式对本发明进行了阐述,以上实施例的说明只是用于帮助理解本发明的方法;同时,对于本领域的一般技术人员,依据本发明的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本发明的限制。 The embodiments of the present invention have been described in detail above, and the present invention has been described by using specific embodiments herein. The description of the above embodiments is only used to help understand the method of the present invention; meanwhile, for those of ordinary skill in the art, according to this The idea of the invention will have changes in the specific implementation and scope of application. To sum up, the contents of this specification should not be construed as limiting the present invention.
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210243199.1A CN102855272B (en) | 2012-07-16 | 2012-07-16 | A kind of micro-blog contains the D-S evidence theory method of traffic information fusion |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210243199.1A CN102855272B (en) | 2012-07-16 | 2012-07-16 | A kind of micro-blog contains the D-S evidence theory method of traffic information fusion |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102855272A CN102855272A (en) | 2013-01-02 |
CN102855272B true CN102855272B (en) | 2016-08-17 |
Family
ID=47401860
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210243199.1A Expired - Fee Related CN102855272B (en) | 2012-07-16 | 2012-07-16 | A kind of micro-blog contains the D-S evidence theory method of traffic information fusion |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102855272B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105512166B (en) * | 2015-10-30 | 2019-07-02 | 青岛智能产业技术研究院 | A kind of traffic parallel mode of microblogging public sentiment and urban traffic situation phase mapping |
CN107622309B (en) * | 2017-08-18 | 2021-01-08 | 长安大学 | Road congestion detection method based on VANETs and improved D-S evidence theory |
CN110967678A (en) * | 2019-12-20 | 2020-04-07 | 安徽博微长安电子有限公司 | Data fusion algorithm and system for multiband radar target identification |
CN112906472A (en) * | 2021-01-19 | 2021-06-04 | 中国矿业大学(北京) | Circuit breaker defect identification method based on self-service sampling method |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101308487A (en) * | 2008-06-25 | 2008-11-19 | 中国科学院地理科学与资源研究所 | A spatio-temporal fusion method for expressing dynamic traffic information in natural language |
CN101794508A (en) * | 2009-12-30 | 2010-08-04 | 北京世纪高通科技有限公司 | Traffic information filling method, device and system |
CN102163225A (en) * | 2011-04-11 | 2011-08-24 | 中国科学院地理科学与资源研究所 | A fusion evaluation method of traffic information collected based on micro blogs |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7831538B2 (en) * | 2007-05-23 | 2010-11-09 | Nec Laboratories America, Inc. | Evolutionary spectral clustering by incorporating temporal smoothness |
-
2012
- 2012-07-16 CN CN201210243199.1A patent/CN102855272B/en not_active Expired - Fee Related
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101308487A (en) * | 2008-06-25 | 2008-11-19 | 中国科学院地理科学与资源研究所 | A spatio-temporal fusion method for expressing dynamic traffic information in natural language |
CN101794508A (en) * | 2009-12-30 | 2010-08-04 | 北京世纪高通科技有限公司 | Traffic information filling method, device and system |
CN102163225A (en) * | 2011-04-11 | 2011-08-24 | 中国科学院地理科学与资源研究所 | A fusion evaluation method of traffic information collected based on micro blogs |
Non-Patent Citations (2)
Title |
---|
城市多模式交通网络特征连通关系表达模型;熊丽音,陆锋,陈传彬;《武汉大学学报(信息科学版)》;20080405;393-396 * |
自然语言表达实时路况信息的路网匹配融合技术;陈传彬,陆锋,励惠国,王钦敏;《中国图像图形学报》;20090915;1669-1675 * |
Also Published As
Publication number | Publication date |
---|---|
CN102855272A (en) | 2013-01-02 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Arooj et al. | Big data processing and analysis in internet of vehicles: architecture, taxonomy, and open research challenges | |
CN102163225B (en) | A traffic information fusion method based on microblog collection | |
Li et al. | Transportation mode identification with GPS trajectory data and GIS information | |
Alomari et al. | Iktishaf: A big data road-traffic event detection tool using Twitter and spark machine learning | |
CN110555568B (en) | A real-time perception method of road traffic operation status based on social network information | |
WO2019085807A1 (en) | Road condition information acquisition method and device thereof, and storage medium | |
CN112418696B (en) | Construction method and device of urban traffic dynamic knowledge graph | |
Nasrollahi et al. | Identification and prioritization of connected vehicle technologies for sustainable development in Iran | |
Suma et al. | Automatic detection and validation of smart city events using hpc and apache spark platforms | |
CN102855272B (en) | A kind of micro-blog contains the D-S evidence theory method of traffic information fusion | |
WO2024139291A1 (en) | Multi-level classification model classification method, training method and apparatus, device, and medium | |
CN109410576A (en) | Road condition analyzing method, apparatus, storage medium and the system of multisource data fusion | |
Wang et al. | Traffic information mining from social media based on the MC-LSTM-Conv model | |
Wang et al. | The level of delay caused by crashes (LDC) in metropolitan and non-metropolitan areas: a comparative analysis of improved Random Forests and LightGBM | |
Du et al. | Learning‐Based Lane‐Change Behaviour Detection for Intelligent and Connected Vehicles | |
Noaeen et al. | The efficacy of using social media data for designing traffic management systems | |
Yang et al. | Applying masked language model for transport mode choice behavior prediction | |
Zhang et al. | Research on taxi driver strategy game evolution with carpooling detour | |
Jain et al. | Twitter mining and analysis of traffic conditions in India | |
Jiang et al. | Intelligent transportation | |
Wan et al. | Leveraging personal navigation assistant systems using automated social media traffic reporting | |
CN111523034B (en) | Application processing method, device, equipment and medium | |
Xiong et al. | Exploring the Citywide Human Mobility Patterns of Taxi Trips through a Topic‐Modeling Analysis | |
da Costa Albuquerque et al. | A proactive application to monitor truck fleets | |
Jones et al. | Real-time traffic event detection using Twitter data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160817 Termination date: 20180716 |