CN107562814A - A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system - Google Patents

A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system Download PDF

Info

Publication number
CN107562814A
CN107562814A CN201710692655.3A CN201710692655A CN107562814A CN 107562814 A CN107562814 A CN 107562814A CN 201710692655 A CN201710692655 A CN 201710692655A CN 107562814 A CN107562814 A CN 107562814A
Authority
CN
China
Prior art keywords
information
network information
segmented
text
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710692655.3A
Other languages
Chinese (zh)
Inventor
李林
柏雪松
彭帆
张蓓芸
苏伟
张晓东
苏晓慧
陆书涵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Agricultural University
Original Assignee
China Agricultural University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Agricultural University filed Critical China Agricultural University
Priority to CN201710692655.3A priority Critical patent/CN107562814A/en
Publication of CN107562814A publication Critical patent/CN107562814A/en
Pending legal-status Critical Current

Links

Abstract

The present invention provides a kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system.Methods described includes:By data reptile instrument, objective network information is captured;The objective network information grabbed is segmented, and marks part of speech, obtains text participle information;Information is segmented according to the initial weight of preset keyword and the text, calculates the scale value that refers generally to of the objective network information, and to information type corresponding to objective network information matching.Method provided by the invention, according to the characteristics of Earthquake emergency information different type, by being segmented to the network information, the accuracy of grouped data is ensure that according to Keyword Weight progress text classification.The present invention on the whole, constructs the emergency information index system of public participation, realizes emergency information Quick Acquisition and extracting tool, and extends the information source of earthquake emergency, improves the ageing and accuracy of earthquake disaster emergency service.

Description

A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system
Technical field
The present invention relates to Computer Science and Technology field, is obtained more particularly, to a kind of earthquake emergency and the condition of a disaster information Take sorting technique and system.
Background technology
China is the more country of the accident such as natural calamity, an accident, and disaster species is more, disaster occurs Frequency is high, casualty loss is serious.At the same time, and in the world most strong and earthquake disaster most serious the country of seismic activity One of.Wenchuan earthquake in 2008, Lushan earthquake in 2013 all bring huge economic loss.And earthquake emergency commanding arrives Position and the achievement for being related to the disaster relief in time.
7.2 grades of earthquakes of nineteen ninety-five Japan's slope god, the 7.8 grades of earthquakes of Turkey's Izmit in 1999 and India's Gu in 2001 Why the post-earthquake emergency response of 7.9 grades of earthquakes of Ji Latebang is slow in action, commanding and decision-making is not in place, even be delayed the disaster relief when Machine etc., it is particularly since and lacks timely and accurate, comprehensive Informational support.On the contrary, Los Angeles,U.S in 1994- During the northern 7.1 grades of earthquakes in ridge, the earthquake emergency commanding technological system under being supported due to Earthquake emergency information has played important function, Information and the technical support of key are provided for Emergency decision, effectively alleviates earthquake disaster damage.Can be with by above-mentioned example Find out, Earthquake emergency information occupies core status in earthquake emergency mechanism.In recent years, China pays much attention to earthquake emergency work Make, set up within 2000 State Council's earthquake relief headquarter, set up within 2001 national antidetonation rescue group and embodied country Attention for earthquake emergency Mechanism establishing.Earthquake emergency refers to the various Emergency Preparedness that ruinous earthquake is done before and after occurring And the urgent rescue and relief work action taken after the earthquake.
After the earthquake, both implied in substantial amounts of micro-blog information studied and judged available for the condition of a disaster, rescue guarantee etc. it is valuable Emergency information, while there is also the redundancy of many, deceptive information, this category information to shake rescue and play interference effect over the ground, On the other hand there is also the problems such as data volume is huge, true and information category is various so that target information is extracted and produces obstruction.
The content of the invention
For the present invention to overcome in the prior art, can not be directed to earthquake information data huge in network after earthquake can not The problem of effectively being classified, propose a kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system.
According to the first aspect of the invention, there is provided a kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique, including:
S1, by data reptile instrument, capture objective network information;;
S2, the objective network information grabbed is segmented, and mark part of speech, obtain text participle information;
S3, information is segmented according to the initial weight of preset keyword and the text, calculates the objective network information Scale value is referred generally to, and to information type corresponding to objective network information matching.
Wherein, the network information is included at least one in the mhkc network information, news network information and micro blog network information The kind network information.
Wherein, also include after the S1:Based on the training text for completion of classifying under manual intervention, keyword is calculated Weight.
Wherein, the data based on completion of classifying under manual intervention, the step of calculating the weight of preset keyword, are specific Including:
According to the training text, descriptor is extracted;
The word frequency and inverse document frequency of preset keyword in the training text are calculated using TF-IDF algorithms, is led The classified weight table of epigraph.
Wherein, participle is carried out to the network information of the crawl in the S2 to specifically include:
S21, segmented using the network information captured described in NLPIR Words partition systems;
S22, part-of-speech information is marked, extract location information and temporal information, and the part-of-speech information is saved into data Storehouse.
Wherein, it is described to be segmented specially using the network information captured described in NLPIR Words partition systems:Pass through N member texts Method model algorithm, the network information is segmented.
Wherein, the mark part-of-speech information is specially:By HMM part-of-speech tagging algorithm, described in mark Part of speech of the network information after participle.
According to the second aspect of the invention, there is provided a kind of earthquake emergency and the condition of a disaster acquisition of information categorizing system, including:
Data acquisition module, for by data reptile instrument, capturing objective network information;
Word-dividing mode, the objective network information grabbed is segmented, and mark part of speech, obtain text participle Information
Sort module is calculated, for the initial weight according to preset keyword and text participle information, described in calculating Objective network information refers generally to scale value, and to information type corresponding to objective network information matching.
According to the third aspect of the invention we, there is provided a kind of computer-readable recording medium, be stored thereon with computer journey Sequence, a kind of earthquake emergency that the various possible implementations that the program is executed by processor above-mentioned first aspect are provided and The condition of a disaster acquisition of information sorting technique.
According to the fourth aspect of the invention, there is provided a kind of earthquake emergency and the condition of a disaster acquisition of information sorting device, including:
At least one processor;And
At least one memory being connected with the processor, wherein:
The memory storage has and can call described program by the programmed instruction of the computing device, the processor Instruction is able to carry out operating as follows:
By data reptile instrument, objective network information is captured;
The objective network information grabbed is segmented, and marks part of speech, obtains text participle information;
According to the initial weight of preset keyword and text participle information, the total of the objective network information is calculated Body desired value, and to information type corresponding to objective network information matching.
Method provided by the invention, according to the characteristics of Earthquake emergency information different type, by dividing the network information Word, the accuracy of grouped data is ensure that according to Keyword Weight progress text classification.On the whole, public participation is constructed Emergency information index system, realize emergency information Quick Acquisition and extracting tool, and extend the information of earthquake emergency Source, improve the ageing and accuracy of earthquake disaster emergency service.
Brief description of the drawings
Fig. 1 is the bulk flow of a kind of earthquake emergency that one embodiment of the invention provides and the condition of a disaster acquisition of information sorting technique Journey schematic diagram;
Fig. 2 is that descriptor is weighed in a kind of earthquake emergency provided in an embodiment of the present invention and the condition of a disaster acquisition of information sorting technique Re-computation schematic flow sheet;
Fig. 3 is participle and word in a kind of earthquake emergency provided in an embodiment of the present invention and the condition of a disaster acquisition of information sorting technique Property annotation step schematic flow sheet;
Fig. 4 is a kind of earthquake emergency provided in an embodiment of the present invention and the structural representation of the condition of a disaster acquisition of information categorizing system Figure;
A kind of Fig. 5 earthquake emergencies provided in an embodiment of the present invention and the structure chart of the condition of a disaster acquisition of information sorting device.
Embodiment
With reference to the accompanying drawings and examples, the embodiment of the present invention is described in further detail.Implement below Example is used to illustrate the present invention, but is not limited to the scope of the present invention.
With reference to figure 1, Fig. 1 is a kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique that one embodiment of the invention provides Flow chart, methods described includes:
S1, by data reptile instrument, capture objective network information.
Specifically, turn-on data reptile instrument, and descriptor is set and preserves database information;By monitoring descriptor, The Webpage for possessing related subject word is captured, obtains the network information related to earthquake, and periodic refreshing interface, If fresh information is then grabbed into database.
Web crawlers is a kind of program for capturing info web automatically according to established rule, and it is from an initial link Start to access, the link included in the webpage having access to or network documentation is put into queue to be visited, afterwards from team New connection is taken out in row to continue to access, and activity above is then repeated, untill termination condition is met.
By the method, by web crawlers instrument, quickly the information in network can be captured, and passes through Descriptor is set, so as to more accurately capture news and various information about earthquake in network.
S2, the objective network information grabbed is segmented, and mark part of speech, obtain text participle information.
Specifically, being segmented to the information crawled, and mark part of speech.Wherein part of speech mark table is as shown in table 1.
Table 1:Part of speech marks table
The text data of crawl is segmented using NLPIR participle instruments, so as to which the network information is split as into multiple words The text participle information of composition, while each word to splitting out carries out the mark of part of speech, and the network information will be carried out Text message after word segmentation processing is stored in database.
By the method, the word composition in the earthquake information obtained from network can be quickly distinguished, so as to fast Speed, targetedly extraction includes the information for specifying part of speech.
S3, information is segmented according to the initial weight of preset keyword and the text, calculates the objective network information Scale value is referred generally to, and to information type corresponding to objective network information matching.
Specifically, using descriptor classified weight table, term weighing matching is carried out to the word of upper step participle, it is accumulative to ask With obtain total weight of text, the principle based on statistics, a word or phrase are characterized for a text with TF-IDF values Important degree size.Some word is for the significance level of a text and its frequency occurred in this text into just Than, but be inversely proportional with its frequency in whole corpus.
By selecting the theme of weight limit to be used as the affiliated classification of Seismic net information, by Seismic net information and its Affiliated classification matched.
By the method, attempt to believe earthquake emergency using the reverse text matches algorithm based on descriptor weight Breath and the condition of a disaster information carry out text classification, and combine practical application with web crawlers, ensure that the accurate of grouped data Property.Emergency information Quick Acquisition and extracting tool are realized, improves the ageing and accuracy of earthquake disaster emergency service.
On the basis of above-described embodiment, the network information includes the mhkc network information, news network information and microblogging At least one of network information network information.
Specifically, pass through microblog data reptile module, mhkc data reptile module and news website data reptile module point The other information in microblogging, the information in information and news website in mhkc are extracted, wherein, microblog data reptile mould Block captures the page info, and periodic refreshing interface, if fresh information is then grabbed by monitoring s.weibo.com/ descriptor Enter database;Mhkc data reptile module is scanned to all models of mhkc, content of pages is captured, by model theme, note Son is replied, and information-reply deposit database;News website data reptile module is scanned to news information in website, is grabbed Content of pages is taken, headline, is given a news briefing the time at news spokesman, and news content deposit database.
By the method, three kinds of Network Information Sources have chosen according to network information feature, ensure that the various of information source Property.
Also include the training text based on completion of classifying under manual intervention on the basis of above-described embodiment, after the S1 This, calculates the weight of keyword.
Wherein, the data based on completion of classifying under manual intervention, the step of calculating the weight of preset keyword, are specific Including:
According to the training text, descriptor is extracted;
The word frequency and inverse document frequency of keyword in the training text are calculated using TF-IDF algorithms, obtains descriptor Classified weight table.
Specifically, algorithm flow to part earthquake emergency and the condition of a disaster information as shown in Fig. 2 first carry out manual sort, preservation For training text, the descriptor in training text is extracted, descriptor classification chart is as shown in table 2.
Table 2:Descriptor classification chart
Using TF-IDF algorithms, the word frequency and inverse document frequency of descriptor in training text are calculated, it is each so as to calculate The weight of descriptor, obtain the classified weight table of descriptor.
Wherein TF-IDF calculation formula is:
TF-IDF=TF × IDF
Wherein TF represents the frequency that entry occurs in classification based training text, i.e. word frequency, and IDF represents inverse document frequency, i.e., Number of files comprising entry is fewer, and the entry is stronger to the separating capacity of document.Using TF-IDF algorithms, training text is calculated The word frequency and inverse document frequency of middle keyword, so as to which the classified weight of descriptor be calculated, descriptor weight table is formed, is protected Save as text.
On the basis of the various embodiments described above, with reference to figure 3, the network information of the crawl is segmented in the S2 Specifically include:
S21, segmented using the network information captured described in NLPIR Words partition systems;
S22, part-of-speech information is marked, extract location information and temporal information, and the part-of-speech information is saved into data Storehouse.
Wherein, it is described to be segmented specially using the network information captured described in NLPIR Words partition systems:Pass through N member texts Method model algorithm, the network information is segmented.
Wherein, the mark part-of-speech information is specially:By HMM part-of-speech tagging algorithm, described in mark Part of speech of the network information after participle.
Specifically, language material object is pre-processed:Just it is cut into according to the progress such as punctuation mark, space, carriage return character short Sentence;Short sentence is carried out to carry out atom cutting by rule:Numeral, hyphen and Chinese character are all the rules of atom cutting;With N- shortest paths carry out thick cutting:Part-of-speech tagging (POS...) is carried out for each atom, after having marked, to each atom Matched backward according to dictionary maximum possible into word, initial position of these words in sentence and end position are entered Row record, initial participle is completed, and the result of record is formed into 2 fork tabular forms;From y-bend list in word segmentation result, meter Calculate be likely to become front and rear word two-by-two between smoothness (i.e. twice may be independently into the possibility of word);Utilize N- shortest paths Segmentation methods (are similar to Dijkstra shortest path first thoughts, simply need the beeline calculated to top n node) N number of possible 2 fork participle path is calculated;To this word segmentation result, according to unknown word identification regular (name, place name, machine Structure name recognition rule) identification of unregistered word is carried out, and carry out part-of-speech tagging using HMM;To whole point Word result carries out part-of-speech tagging using HMM;The carry out repeated to other such as folded words, part of speech is further Merging treatment;Obtain final word segmentation result.
Specifically, the N-gram model algorithm formula being related to during described text participle is
P(W)≈P(W1)P(W2/W1I=3 ... n P(Wi/IWi-2Wi-1)
Here by taking N=3 as an example, wherein P represents the probability that W occurs, w1,w2,w3,...,wnIt is the character string that length is n, P (A/B) represents the probability that A occurs under B, it is specified that any word wiIt is only related to its first two.
After being segmented again to information and marking part of speech, place and temporal information can further be extracted, on ground Point information extraction in HMM part-of-speech tagging algorithm be:
First, W=w is made1w2…wnThe word string being made up of n word, T=t1t2…tnIt is mark string corresponding to word string W, Wherein tkIt is wkPart-of-speech tagging;Further according to HMM, calculate and cause conditional probability P (T | W) maximum situation of value T '=argmaxP (T | W).
Then according to Bayesian formula:P (T | W)=P (T) * P (W | T)/P (W).Because word string is constant, p (W) does not influence Total probable value, therefore continue to be reduced to:P (T | W)=P (T) * P (W | T).Wherein p (T)=p (t1|t0)*p(t2|t1)…p (ti|ti-1).According to single order HMM independence assumption, can obtain:P (T)=p (t1|t0)*p(t2|t1)…p(ti| ti-1), i.e. P (ti|ti-1T in)=training corpusiAppear in ti-1T in number/training corpus afterwardsi-1The total degree of appearance.
According to Bayesian formula:P (W | T)=p (w1|t1)*p(w2|t2, t1)…p(wi|ti,ti-1,…,t1) and according to one Rank HMM independence assumption:P (W | T)=p (w1|t1)*p(w2|t2)…p(wi|ti).It can obtain:P(wi|ti)= W in training corpusiPart of speech be marked as tiNumber/training corpus in ti occur total degree.
Then, part-of-speech information is marked, extraction belongs to the information of place (ns) and time (t), is saved into database.
By the method, analyzed by Text Feature Extraction, obtain the contents such as scene, the time of origin of earthquake, by information It is divided into multiple classifications and is stored in database;By exporting Excel file, derived information content, affiliated classification, scene etc. Function, emergency information Quick Acquisition and extracting tool are realized, improve the ageing and accuracy of earthquake disaster emergency service.
With reference to figure 4, Fig. 4 is a kind of earthquake emergency provided in an embodiment of the present invention and the condition of a disaster acquisition of information categorizing system Mechanism map, the system include:Data acquisition module 41, word-dividing mode 42 and calculating sort module 43.
Wherein data acquisition module 41 is used to pass through data reptile instrument, captures objective network information.
Specifically, turn-on data reptile instrument, and descriptor is set and preserves database information;By monitoring descriptor, The Webpage for possessing related subject word is captured, obtains the network information related to earthquake, and periodic refreshing interface, If fresh information is then grabbed into database.
Web crawlers is a kind of program for capturing info web automatically according to established rule, and it is from an initial link Start to access, the link included in the webpage having access to or network documentation is put into queue to be visited, afterwards from team New connection is taken out in row to continue to access, and activity above is then repeated, untill termination condition is met.
By this system, by web crawlers instrument, quickly the information in network can be captured, and passes through Descriptor is set, so as to more accurately capture news and various information about earthquake in network.
Wherein, word-dividing mode 42 is used to segment the objective network information grabbed, and marks part of speech, obtains Text is taken to segment information.
Specifically, specifically, segmented to the information crawled, and part of speech is marked, utilize NLPIR participle instruments will The text data participle of crawl, information is segmented so as to which the network information to be split as to the text of multiple word compositions, while to tearing open Each word for branching away carries out the mark of part of speech, and is stored in the text message after word segmentation processing is carried out to the network information In database.
By this system, the word composition in the earthquake information obtained from network can be quickly distinguished, so as to fast Speed, targetedly extraction includes the information for specifying part of speech.
Wherein, sort module 43 is calculated to be used to segment information, meter according to the initial weight of preset keyword and the text The scale value that refers generally to of the objective network information is calculated, and to information type corresponding to objective network information matching.
Specifically, using descriptor classified weight table, term weighing matching is carried out to the word of upper step participle, it is accumulative to ask With obtain total weight of text, the principle based on statistics, a word or phrase are characterized for a text with TF-IDF values Important degree size.Some word is for the significance level of a text and its frequency occurred in this text into just Than, but be inversely proportional with its frequency in whole corpus.
By selecting the theme of weight limit to be used as the affiliated classification of Seismic net information, by Seismic net information and its Affiliated classification matched.
By this system, attempt to believe earthquake emergency using the reverse text matches algorithm based on descriptor weight Breath and the condition of a disaster information carry out text classification, and combine practical application with web crawlers, ensure that the accurate of grouped data Property.Emergency information Quick Acquisition and extracting tool are realized, improves the ageing and accuracy of earthquake disaster emergency service.
On the basis of above-described embodiment, the system also includes:Keyword Weight computing module, for based on artificial Intervene the training text that lower classification is completed, calculate the weight of preset keyword.
Specifically, according to the training text, descriptor is extracted;
The word frequency and inverse document frequency of keyword in the training text are calculated using TF-IDF algorithms, obtains descriptor Classified weight table.
Using TF-IDF algorithms, the word frequency and inverse document frequency of descriptor in training text are calculated, it is each so as to calculate The weight of descriptor, obtain the classified weight table of descriptor.
Wherein TF-IDF calculation formula is:
TF-IDF=TF × IDF
Wherein TF represents the frequency that entry occurs in classification based training text, i.e. word frequency, and IDF represents inverse document frequency, i.e., Number of files comprising entry is fewer, and the entry is stronger to the separating capacity of document.Using TF-IDF algorithms, training text is calculated The word frequency and inverse document frequency of middle keyword, so as to which the classified weight of descriptor be calculated, form descriptor weight table and protect Save as text.
By this system, using the reverse text matches algorithm based on descriptor weight come to Earthquake emergency information and calamity Feelings information carries out text classification, and combines practical application with data obtaining module, ensure that the accuracy of grouped data, Improve the ageing and accuracy of earthquake disaster emergency service.
With reference to figure 5, Fig. 5 is a kind of earthquake emergency provided in an embodiment of the present invention and the condition of a disaster acquisition of information sorting device knot Composition, as shown in figure 5, the equipment includes:Processor 501, memory 502 and bus 503.
The processor 501 is used to call the programmed instruction in the memory 502, is implemented with performing above-mentioned each method The method that example is provided, such as including by data reptile instrument, capturing objective network information;To the target grabbed The network information is segmented, and marks part of speech, obtains text participle information;According to the initial weight of preset keyword and described Text segments information, calculates the scale value that refers generally to of the objective network information, and to letter corresponding to the network information matching Cease type.
Through the above description of the embodiments, those skilled in the art can be understood that each embodiment The mode of required general hardware platform can be added by software to realize, naturally it is also possible to pass through hardware.Based on such reason Solution, the part that above-mentioned technical proposal substantially contributes to prior art in other words can be embodied in the form of software product Out, the computer software product can store in a computer-readable storage medium, such as ROM/RAM, magnetic disc, CD, Including some instructions causing a computer equipment (can be personal computer, server, or network equipment etc.) to hold Method described in some parts of each embodiment of row or embodiment.
Finally it should be noted that:The above embodiments are merely illustrative of the technical solutions of the present invention, rather than its limitations;To the greatest extent The present invention is described in detail with reference to the foregoing embodiments for pipe, it will be understood by those within the art that:It is still Technical scheme described in foregoing embodiments can be modified, or which part technical characteristic is equally replaced Change;And these modifications or replacement, the essence of appropriate technical solution is departed from various embodiments of the present invention technical scheme Spirit and scope.

Claims (10)

1. a kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique, it is characterised in that including:
S1, by data reptile instrument, capture objective network information;
S2, the objective network information grabbed is segmented, and mark part of speech, obtain text participle information;
S3, information is segmented according to the initial weight of preset keyword and the text, calculates the totality of the objective network information Desired value, and to information type corresponding to objective network information matching.
2. according to the method for claim 1, it is characterised in that the network information includes the mhkc network information, News Network At least one of network information and micro blog network the information network information.
3. according to the method for claim 1, it is characterised in that also include after the S1:Based on classifying under manual intervention The training text of completion, calculate the weight of preset keyword.
4. according to the method for claim 3, it is characterised in that the data based on completion of classifying under manual intervention, meter The step of weight for calculating keyword, specifically includes:
According to the training text, descriptor is extracted;
The word frequency and inverse document frequency of preset keyword in the training text are calculated using TF-IDF algorithms, obtains descriptor point Class weight table.
5. according to the method for claim 1, it is characterised in that the network information of the crawl is segmented in the S2 Specifically include:
S21, segmented using the network information captured described in NLPIR Words partition systems;
S22, part-of-speech information is marked, extract location information and temporal information, and the part-of-speech information is saved into database.
6. according to the method for claim 5, it is characterised in that described to utilize the network captured described in NLPIR Words partition systems Information is segmented specially:By N-gram model algorithm, the network information is segmented.
7. according to the method for claim 5, it is characterised in that it is described mark part-of-speech information be specially:Pass through hidden Ma Erke Husband's model part-of-speech tagging algorithm, marks part of speech of the network information after participle.
8. a kind of earthquake emergency and the condition of a disaster acquisition of information categorizing system, it is characterised in that including:
Data acquisition module, for by data reptile instrument, capturing objective network information;
Word-dividing mode, for being segmented to the objective network information grabbed, and part of speech is marked, obtain text participle letter Breath
Sort module is calculated, for the initial weight according to preset keyword and text participle information, calculates the target The network information refers generally to scale value, and to information type corresponding to objective network information matching.
9. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is held by processor The step of any methods described in such as claim 1-7 is realized during row.
10. a kind of earthquake emergency and the condition of a disaster acquisition of information sorting device, it is characterised in that including:
At least one processor;And
At least one memory being connected with the processor, wherein:
The memory storage has can be by the programmed instruction of the computing device, and the processor calls described program instruction energy Enough perform the method as described in any in claim 1-7.
CN201710692655.3A 2017-08-14 2017-08-14 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system Pending CN107562814A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710692655.3A CN107562814A (en) 2017-08-14 2017-08-14 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710692655.3A CN107562814A (en) 2017-08-14 2017-08-14 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system

Publications (1)

Publication Number Publication Date
CN107562814A true CN107562814A (en) 2018-01-09

Family

ID=60975487

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710692655.3A Pending CN107562814A (en) 2017-08-14 2017-08-14 A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system

Country Status (1)

Country Link
CN (1) CN107562814A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108391177A (en) * 2018-02-07 2018-08-10 广州视源电子科技股份有限公司 TV method for early warning and system
CN108694247A (en) * 2018-05-08 2018-10-23 北京师范大学 A kind of typhoon disaster the condition of a disaster analysis method based on microblog topic temperature
CN108897792A (en) * 2018-06-11 2018-11-27 中国科学院遥感与数字地球研究所 Extract the disaster surveillance analysis method that internet multidimensional relates to calamity information
CN109145117A (en) * 2018-09-05 2019-01-04 杭州安恒信息技术股份有限公司 Bonus system recognition methods, device and the electronic equipment of multiple level marketing project
CN110377680A (en) * 2019-07-11 2019-10-25 中国水利水电科学研究院 The method of mountain flood database sharing and update based on web crawlers and semantics recognition
CN111382332A (en) * 2019-04-02 2020-07-07 江苏省地震局 Earthquake disaster information processing method and system
CN111626568A (en) * 2020-05-06 2020-09-04 厦门理工学院 Knowledge base construction method and device and knowledge search method and system
CN112069814A (en) * 2020-09-01 2020-12-11 应急管理部沈阳消防研究所 Fire-fighting plan classification method based on deep learning
CN112966169A (en) * 2021-04-13 2021-06-15 四川省广播电视科学技术研究所 Internet emergency information capturing method
CN113468320A (en) * 2021-07-22 2021-10-01 中国地震台网中心 Method and system for quickly visualizing earthquake emergency information
CN114201959A (en) * 2021-11-16 2022-03-18 湖南长泰工业科技有限公司 Mobile emergency command method
CN116361463A (en) * 2023-03-27 2023-06-30 应急管理部国家减灾中心(应急管理部卫星减灾应用中心) Earthquake disaster information extraction method, device, equipment and medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102289522A (en) * 2011-09-19 2011-12-21 北京金和软件股份有限公司 Method of intelligently classifying texts
CN102622373A (en) * 2011-01-31 2012-08-01 中国科学院声学研究所 Statistic text classification system and statistic text classification method based on term frequency-inverse document frequency (TF*IDF) algorithm
CN103744981A (en) * 2014-01-14 2014-04-23 南京汇吉递特网络科技有限公司 System for automatic classification analysis for website based on website content
CN105045857A (en) * 2015-07-09 2015-11-11 中国科学院计算技术研究所 Social network rumor recognition method and system
CN105468713A (en) * 2015-11-19 2016-04-06 西安交通大学 Multi-model fused short text classification method
CN105701084A (en) * 2015-12-28 2016-06-22 广东顺德中山大学卡内基梅隆大学国际联合研究院 Characteristic extraction method of text classification on the basis of mutual information
CN106202372A (en) * 2016-07-08 2016-12-07 中国电子科技网络信息安全有限公司 A kind of method of network text information emotional semantic classification

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102622373A (en) * 2011-01-31 2012-08-01 中国科学院声学研究所 Statistic text classification system and statistic text classification method based on term frequency-inverse document frequency (TF*IDF) algorithm
CN102289522A (en) * 2011-09-19 2011-12-21 北京金和软件股份有限公司 Method of intelligently classifying texts
CN103744981A (en) * 2014-01-14 2014-04-23 南京汇吉递特网络科技有限公司 System for automatic classification analysis for website based on website content
CN105045857A (en) * 2015-07-09 2015-11-11 中国科学院计算技术研究所 Social network rumor recognition method and system
CN105468713A (en) * 2015-11-19 2016-04-06 西安交通大学 Multi-model fused short text classification method
CN105701084A (en) * 2015-12-28 2016-06-22 广东顺德中山大学卡内基梅隆大学国际联合研究院 Characteristic extraction method of text classification on the basis of mutual information
CN106202372A (en) * 2016-07-08 2016-12-07 中国电子科技网络信息安全有限公司 A kind of method of network text information emotional semantic classification

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108391177A (en) * 2018-02-07 2018-08-10 广州视源电子科技股份有限公司 TV method for early warning and system
CN108694247B (en) * 2018-05-08 2020-11-20 北京师范大学 Typhoon disaster analysis method based on microblog topic popularity
CN108694247A (en) * 2018-05-08 2018-10-23 北京师范大学 A kind of typhoon disaster the condition of a disaster analysis method based on microblog topic temperature
CN108897792A (en) * 2018-06-11 2018-11-27 中国科学院遥感与数字地球研究所 Extract the disaster surveillance analysis method that internet multidimensional relates to calamity information
CN108897792B (en) * 2018-06-11 2022-05-03 中国科学院遥感与数字地球研究所 Disaster monitoring and analyzing method for extracting multi-dimensional disaster-related information of Internet
CN109145117A (en) * 2018-09-05 2019-01-04 杭州安恒信息技术股份有限公司 Bonus system recognition methods, device and the electronic equipment of multiple level marketing project
CN111382332B (en) * 2019-04-02 2021-12-17 江苏省地震局 Earthquake disaster information processing method and system
CN111382332A (en) * 2019-04-02 2020-07-07 江苏省地震局 Earthquake disaster information processing method and system
CN110377680B (en) * 2019-07-11 2020-04-28 中国水利水电科学研究院 Method for constructing and updating mountain torrent disaster database based on web crawler and semantic recognition
CN110377680A (en) * 2019-07-11 2019-10-25 中国水利水电科学研究院 The method of mountain flood database sharing and update based on web crawlers and semantics recognition
CN111626568A (en) * 2020-05-06 2020-09-04 厦门理工学院 Knowledge base construction method and device and knowledge search method and system
CN111626568B (en) * 2020-05-06 2024-02-20 厦门理工学院 Knowledge base construction method and knowledge search method and system in natural disaster field
CN112069814A (en) * 2020-09-01 2020-12-11 应急管理部沈阳消防研究所 Fire-fighting plan classification method based on deep learning
CN112966169A (en) * 2021-04-13 2021-06-15 四川省广播电视科学技术研究所 Internet emergency information capturing method
CN113468320A (en) * 2021-07-22 2021-10-01 中国地震台网中心 Method and system for quickly visualizing earthquake emergency information
CN114201959A (en) * 2021-11-16 2022-03-18 湖南长泰工业科技有限公司 Mobile emergency command method
CN116361463A (en) * 2023-03-27 2023-06-30 应急管理部国家减灾中心(应急管理部卫星减灾应用中心) Earthquake disaster information extraction method, device, equipment and medium
CN116361463B (en) * 2023-03-27 2023-12-08 应急管理部国家减灾中心(应急管理部卫星减灾应用中心) Earthquake disaster information extraction method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN107562814A (en) A kind of earthquake emergency and the condition of a disaster acquisition of information sorting technique and system
CN107204184B (en) Audio recognition method and system
CN103744905B (en) Method for judging rubbish mail and device
CN107808011B (en) Information classification extraction method and device, computer equipment and storage medium
CN109582704B (en) Recruitment information and the matched method of job seeker resume
CN103914494B (en) Method and system for identifying identity of microblog user
CN104991891B (en) A kind of short text feature extracting method
US10169331B2 (en) Text mining for automatically determining semantic relatedness
CN108268668B (en) Topic diversity-based text data viewpoint abstract mining method
CN107391614A (en) A kind of Chinese question and answer matching process based on WMD
CN108763510A (en) Intension recognizing method, device, equipment and storage medium
CN103425777B (en) A kind of based on the short message intelligent classification and the searching method that improve Bayes's classification
CN107273359A (en) A kind of text similarity determines method
CN103324666A (en) Topic tracing method and device based on micro-blog data
CN110083832B (en) Article reprint relation identification method, device, equipment and readable storage medium
JP2006293767A (en) Sentence categorizing device, sentence categorizing method, and categorization dictionary creating device
CN109325124B (en) Emotion classification method, device, server and storage medium
CN106844349A (en) Comment spam recognition methods based on coorinated training
CN107577671A (en) A kind of key phrases extraction method based on multi-feature fusion
CN103282903A (en) Topic extraction device and program
US11048934B2 (en) Identifying augmented features based on a bayesian analysis of a text document
CN105550359B (en) Webpage sorting method and device based on vertical search and server
CN106484797A (en) Accident summary abstracting method based on sparse study
CN106557777A (en) It is a kind of to be based on the improved Kmeans clustering methods of SimHash
CN111753082A (en) Text classification method and device based on comment data, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20180109

RJ01 Rejection of invention patent application after publication