CN110020052A - Food and medicine data processing method and device - Google Patents
Food and medicine data processing method and device Download PDFInfo
- Publication number
- CN110020052A CN110020052A CN201711344267.2A CN201711344267A CN110020052A CN 110020052 A CN110020052 A CN 110020052A CN 201711344267 A CN201711344267 A CN 201711344267A CN 110020052 A CN110020052 A CN 110020052A
- Authority
- CN
- China
- Prior art keywords
- word
- food
- medicine
- information
- matched
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 239000003814 drug Substances 0.000 title claims abstract description 169
- 235000013305 food Nutrition 0.000 title claims abstract description 168
- 238000003672 processing method Methods 0.000 title claims abstract description 13
- 238000001514 detection method Methods 0.000 claims abstract description 25
- 238000004458 analytical method Methods 0.000 claims abstract description 20
- 239000000284 extract Substances 0.000 claims abstract description 10
- 238000000034 method Methods 0.000 claims description 11
- 238000005516 engineering process Methods 0.000 claims description 8
- 238000007726 management method Methods 0.000 claims description 8
- 230000002123 temporal effect Effects 0.000 claims description 8
- 238000012545 processing Methods 0.000 claims description 7
- 238000000605 extraction Methods 0.000 claims description 6
- 230000011218 segmentation Effects 0.000 claims description 5
- 238000012544 monitoring process Methods 0.000 abstract 1
- 230000002354 daily effect Effects 0.000 description 14
- 230000000694 effects Effects 0.000 description 6
- 238000012360 testing method Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 5
- 206010016952 Food poisoning Diseases 0.000 description 4
- 208000019331 Foodborne disease Diseases 0.000 description 4
- 229940079593 drug Drugs 0.000 description 3
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000003044 adaptive effect Effects 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 235000013365 dairy product Nutrition 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000007423 decrease Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 230000003203 everyday effect Effects 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 239000002574 poison Substances 0.000 description 1
- 231100000614 poison Toxicity 0.000 description 1
- 229940126532 prescription medicine Drugs 0.000 description 1
- 238000005070 sampling Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/242—Query formulation
- G06F16/243—Natural language query formulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/332—Query formulation
- G06F16/3329—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
Abstract
The disclosure provides a kind of food and medicine data processing method and device, is related to food and medicine supervision field, is able to solve the not comprehensive enough problem of food and medicine quality monitoring.The specific technical proposal is: the detection food medicine hot word in the website of the presets list instruction or application APP;The presets list instruction website or APP in detect food medicine hot word when, obtain detect food medicine hot word where message segment as raw information;Raw information is segmented to obtain word to be matched, and extracts word to be matched from raw information;Word to be matched is matched with the food medicine keyword in database;When the successful quantity of word match to be matched is greater than or equal to preset quantity, semantic analysis is carried out to raw information, determines the event content and event body for including in raw information;Information to be stored is generated according to the event content and event body that include in raw information, information to be stored is uploaded into food medicine supervisory systems.The present invention is supervised for food and medicine.
Description
Technical field
This disclosure relates to food and medicine supervision field more particularly to food and medicine data processing method and device.
Background technique
The safety of food and medicine is the hot spot of society always, it is related to that masses' is healthy and safe.Currently, to food medicine
The supervision of quality safety of product is sampled inspection dependent on relevant departments, consumes a large amount of manpower and material resources, and cannot be to entirety
Quality condition accurately evaluated, can only determine whether its quality qualified to the commodity of sampling, to product quality
It evaluates not comprehensive enough.
Summary of the invention
The embodiment of the present disclosure provides a kind of food and medicine data processing method and device, is able to solve food and medicine quality prison
Manage not comprehensive enough problem.The technical solution is as follows:
According to the first aspect of the embodiments of the present disclosure, a kind of food and medicine data processing method is provided, this method comprises:
Detection food medicine hot word, the presets list include at least one and food in the website of the presets list instruction or application APP
The relevant website of medicine industry or APP, food medicine hot word are the common words that food medicine supervises industry in network;
When detecting food medicine hot word in the website of the presets list instruction or APP, the food medicine hot word place detected is obtained
Message segment as raw information;
Raw information is segmented to obtain word to be matched, and extracts word to be matched from raw information;
Word to be matched is matched with the food medicine keyword in database, if word to be matched and food medicine keyword
Identical then successful match;
When the successful quantity of word match to be matched is greater than or equal to preset quantity, semantic point is carried out to raw information
Analysis, determines the event content and event body for including in raw information;
Information to be stored is generated according to the event content and event body that include in raw information, information to be stored is uploaded
To food medicine supervisory systems.
The raw information comprising food medicine hot word is obtained from website relevant to food and medicine or APP, and the information is carried out
Matching and analysis, in this way, the largely description information about food and medicine can be obtained from network, passes through these
Information can reflect the event that many supervisors can not supervise because of limited personnel, more fully hereinafter to food and medicine into
Row supervision.
In one embodiment, to raw information carry out semantic analysis, determine the event content for including in raw information and
Event body, comprising:
Word to be matched is matched with management body list, name of product list respectively, determines event body;
Extracting time information and address information in word to be matched, and by word to be matched and event type keyword into
Row matching, determines event content according to the temporal information, address information and event type keyword that extract.
Word to be matched is matched with multiple lists respectively, while extracting temporal information and address information, it can
The event for including in raw information is accurately analyzed, more accurate reference is also capable of providing.
In one embodiment, obtain detect food medicine hot word where message segment as raw information, comprising:
Original letter is grabbed by the data source of website or APP where web crawlers technology from the food medicine hot word detected
Breath.
Raw information is grabbed from the data source of website or APP by web crawlers technology, can guarantee to obtain in time original
Information, and guarantee the accuracy of raw information.
In one embodiment, information to be stored is uploaded into food medicine supervisory systems, comprising:
According to wait store the event body and event content that include in information, information to be stored is uploaded into food medicine supervision system
Corresponding subsystem in system, food medicine supervisory systems include current check subsystem, check handle a case subsystem, complaints and denunciation subsystem
System.
According to event body and event content, information to be stored is uploaded in corresponding subsystem, is convenient for user management
And inquiry.
In one embodiment, food medicine hot word includes that enterprise name, food and medicine title, food and medicine event, quality are retouched
At least one of in stating.
According to the second aspect of an embodiment of the present disclosure, a kind of food and medicine data processing equipment is provided, comprising: detection mould
Block, handling module, word segmentation module, extraction module, matching module, analysis module, memory module;
Wherein, detection module presets column for the detection food medicine hot word in the website that the presets list indicates or application APP
Table includes at least one website relevant to food medicine industry or APP, and food medicine hot word is the everyday words that food medicine supervises industry in network
It converges;
Handling module is obtained and is detected when for detecting in the website that the presets list indicates or APP food medicine hot word
Message segment where food medicine hot word is as raw information;
Word segmentation module obtains word to be matched for being segmented to raw information;
Extraction module, for extracting word to be matched from raw information;
Matching module, for matching word to be matched with the food medicine keyword in database, if word to be matched
Language then successful match identical with food medicine keyword;
Analysis module is used for when the successful quantity of word match to be matched is greater than or equal to preset quantity, to original letter
Breath carries out semantic analysis, determines the event content and event body for including in raw information;
Memory module will for generating information to be stored according to the event content and event body that include in raw information
Information to be stored uploads to food medicine supervisory systems.
In one embodiment, analysis module includes main body submodule and content submodule;
Main body submodule, for matching word to be matched with management body list, name of product list respectively, really
Determine event body;
Content submodule, in word to be matched extracting time information and address information, and by word to be matched with
Event type keyword is matched, and determines thing according to the temporal information, address information and event type keyword that extract
Part content.
In one embodiment, handling module is also used to by where web crawlers technology from the food medicine hot word detected
Website or APP data source grab raw information.
In one embodiment, memory module is also used to according to wait store in the event body for including in information and event
Hold, information to be stored is uploaded into corresponding subsystem in food medicine supervisory systems, food medicine supervisory systems includes current check subsystem
System checks handle a case subsystem, complaints and denunciation subsystem.
In one embodiment, food medicine hot word includes that enterprise name, food and medicine title, food and medicine event, quality are retouched
At least one of in stating.
It should be understood that above general description and following detailed description be only it is exemplary and explanatory, not
The disclosure can be limited.
Detailed description of the invention
The drawings herein are incorporated into the specification and forms part of this specification, and shows the implementation for meeting the disclosure
Example, and together with specification for explaining the principles of this disclosure.
Fig. 1 is a kind of flow chart for food and medicine data processing method that the embodiment of the present disclosure provides;
Fig. 2 is a kind of detection effect schematic diagram that the embodiment of the present disclosure provides;
Fig. 3 is a kind of participle effect diagram that the embodiment of the present disclosure provides;
Fig. 4 is a kind of food medicine Keywords matching effect diagram that the embodiment of the present disclosure provides;
Fig. 5 is a kind of structure chart for food and medicine data processing equipment that the embodiment of the present disclosure provides;
Fig. 6 is a kind of structure chart for food and medicine data processing equipment that the embodiment of the present disclosure provides.
Specific embodiment
Example embodiments are described in detail here, and the example is illustrated in the accompanying drawings.Following description is related to
When attached drawing, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements.Following exemplary embodiment
Described in embodiment do not represent all implementations consistent with this disclosure.On the contrary, they be only with it is such as appended
The example of the consistent device and method of some aspects be described in detail in claims, the disclosure.
The embodiment of the present disclosure provides a kind of food and medicine data processing method, as shown in Figure 1, the food and medicine data processing
Method the following steps are included:
101, the detection food medicine hot word in the website of the presets list instruction or application APP.
The presets list includes at least one website relevant to food medicine industry or application (English: Application, APP),
It is food and medicine public platform in wechat, micro- for example, the presets list may include: 12331 platforms to food and medicine complaints and denunciation
The webpage etc. under food and medicine classification in rich.
Eating medicine hot word is the common words that food medicine supervises industry in network.In one embodiment, food medicine hot word can wrap
Include enterprise name, food and medicine title, food and medicine event, quality description at least one of.For example, poison by food, is expired,
The words such as rotten, health can belong to food medicine hot word.
102, when detecting food medicine hot word in the website of the presets list instruction or APP, the food medicine hot word detected is obtained
The message segment at place is as raw information.
In one embodiment, it can detect by shortest food medicine hot word, successively increase according to the length of food medicine hot word
Add, until the longest food medicine hot word detection of length finishes, it is of course also possible to be detected since the longest word of length.For example, with food
For medicine hot word includes food poisoning, is unhygienic, expired, certainly, only exemplary illustration, does not represent disclosure limitation herein
In this, it may include more for eating medicine hot word.Firstly, detecting the word of word two of text information two in webpage, detect whether to wrap
Containing " expired " this word, as shown in Fig. 2, Fig. 2 is a kind of detection effect schematic diagram that the embodiment of the present disclosure provides, in Fig. 2, with length
For degree is a word of 10 words, a square represents a word, the 1st and the 2nd word is first formed a word detection, so
The 2nd and the 3rd word a word is formed afterwards to detect, and so on, the 9th and the 10th word are finally formed into a word
It is detected;If detect a certain segment information include " expired ", directly this segment information can be extracted, then after
Continuous next section of detection;Do not include " expired " this word if detected, detects whether that method can comprising " unhygienic " this word
With according to the detection method of the word of two word lengths shown in Fig. 2, if do not included equally, continue to test whether include
" food poisoning " this word.Certainly, only exemplary illustration herein, because word length is shorter, it is easier to detect, as long as judgement
Whether less word matches, so that it may determine whether comprising this word, and word length is longer, efficiency is lower, if including length
Shorter word is spent, then illustrates that this segment information can be used as raw information, needs not continue to the detection longer word of length, in this way
Detection efficiency can be improved.
For another example, for eating medicine hot word and include food poisoning, is unhygienic, expired, it can detecte first of all words
Whether word, herein, the first character of food medicine hot word are " eating, or not excessively " after detecting first character, detect the food medicine hot word
Second word then continues to test whether next word is " object ", for another example, detects " mistake " word, then for example, detecting " food " word
Continue to test whether next word is " phase ", until detecting that food medicine hot word, or the complete segment information of detection all do not detect
Medicine hot word is eaten, then continues to test next section.
Certainly, above is exemplary illustration, does not represent the disclosure and is confined to this.
In one embodiment, obtain detect food medicine hot word where message segment as raw information, comprising: pass through
The data source of website or APP where web crawlers technology from the food medicine hot word detected grabs raw information.
Raw information is grabbed from the data source of website or APP by web crawlers technology, can guarantee to obtain in time original
Information, and guarantee the accuracy of raw information.
103, raw information is segmented to obtain word to be matched, and extracts word to be matched from raw information.
It should be noted that can use daily term database when being segmented to raw information and segmented, from original letter
First character in breath starts, and is matched according to the longest word of number of words in daily term database, if matched, from
Next word of the word prepared continues to start to match.For example, as shown in figure 3, Fig. 3 is a kind of participle that the embodiment of the present disclosure provides
Effect diagram segments a word of 10 word lengths in Fig. 3, for example, the word length that daily term database includes
It is 1 word to 8 words, then, the daily word whether the 1st word in raw information belongs to 8 words to the 8th word is first detected,
If the 1st word is a daily word to the 8th word, continue to test the 9th word to the 16th word whether be daily word;
If the 1st word is not daily word to the 8th word, whether the 1st word of detection to the 7th word is daily word, if also
It is not whether then continue to test the 1st word to the 6th word be daily word, so successively successively decreases, until detecting daily word
Until, or detect that the word of the 1st word and the 2nd word composition is not daily word, then it is the 1st word is daily as one
Word is detected since the 2nd word, that is, detect the 2nd word to the 9th word whether be daily word, subsequent information according to
Such method is segmented.It should be noted that detected since the longest word of length, until the shortest word of length,
It in this way can be to avoid participle mistake, for example, " food safety is unqualified " is a daily word, if from the shortest word of length
Start to detect, be then divided into " food ", " safety ", " unqualified ", can not determine this segment information on earth in this way is to say that food is pacified
Entirely, still say that food is dangerous;Moreover, detecting participle since the longest word of length, operand can be reduced, because of length
Long once matches, so that it may since next word of the word matched;Meanwhile because since the long word of length
Detection ensure that the length of the word after participle is as long as possible, just reduce the quantity of word to be matched after participle.
Certainly, only exemplary illustration herein, does not represent the disclosure and is confined to this.
104, word to be matched is matched with the food medicine keyword in database.
The successful match if word to be matched is identical with food medicine keyword.
105, when the successful quantity of word match to be matched is greater than or equal to preset quantity, raw information is carried out semantic
Analysis, determines the event content and event body for including in raw information.
If more with the word of food medicine Keywords matching in word to be matched, illustrate raw information and Food and drug administration
Correlation it is higher, information is more important, if in word to be matched with food medicine Keywords matching word it is fewer, illustrate original letter
It ceases lower with the correlation of Food and drug administration.
It should be noted that, according to the quantity of the word with food medicine Keywords matching, can be counted more for multiple information
The matching degree of a information, is ranked up according to matching degree, and the high information of matching degree is come before the low information of matching degree, benefit
It is checked in user.For example, as shown in figure 4, Fig. 4 is a kind of food medicine Keywords matching effect signal that the embodiment of the present disclosure provides
Figure;In Fig. 4, matching degree is set according to the word quantity in a segment information with food medicine Keywords matching, for example, matched word number
Amount is 1~20, then matching degree is 1;Matched word quantity is 21~40, matching degree 2;Matched word quantity is 41
~60, matching degree 3, and so on.Certainly, only exemplary illustration herein, the calculating of matching degree can there are many, this public affairs
It opens without limitation.As shown in figure 4, user can preferentially see that sequence is forward when checking after sorting according to matching degree
The high information of matching degree, the i.e. information high with the food medicine supervision degree of correlation, improve the convenience that user checks.
In one embodiment, to raw information carry out semantic analysis, determine the event content for including in raw information and
Event body, comprising: word to be matched is matched with management body list, name of product list respectively, determines event master
Body;Extracting time information and address information in word to be matched, and word to be matched and event type keyword are carried out
Match, event content is determined according to the temporal information, address information and event type keyword that extract.For example, event type
Keyword may include: food poisoning, expired, rotten, dangerous etc..
Word to be matched is matched with multiple lists respectively, while extracting temporal information and address information, it can
The event for including in raw information is accurately analyzed, more accurate reference is also capable of providing.
106, information to be stored is generated according to the event content and event body that include in raw information, it will information be stored
Upload to food medicine supervisory systems.
In one embodiment, information to be stored is uploaded into food medicine supervisory systems, comprising:
According to wait store the event body and event content that include in information, information to be stored is uploaded into food medicine supervision system
Corresponding subsystem in system, food medicine supervisory systems include current check subsystem, check handle a case subsystem, complaints and denunciation subsystem
System.
According to event body and event content, information to be stored is uploaded in corresponding subsystem, is convenient for user management
And inquiry.
Certainly, herein, the subsystem that food medicine supervisory systems is included is exemplary illustration, and the classification of subsystem can have
It is a variety of, for example, according to area, food medicine supervisory systems is divided into different subsystems, such as food medicine supervisory systems of a province can be with
The subsystem in different urban districts is classified as according to urban district, the food medicine supervisory systems in a city can be classified as not according to district
With the subsystem in district;For another example, classify according to food and medicine, food medicine supervisory systems is divided into different subsystems, such as health care medicinal
Subsystem, prescription medicine subsystem, grain subsystem, dairy produce subsystem etc..
The food and medicine data processing method that the embodiment of the present disclosure provides, is obtained from website relevant to food and medicine or APP
The raw information comprising food medicine hot word is taken, and the information is matched and analyzed, in this way, can be obtained from network
The description information largely about food and medicine is taken, can reflect many supervisors by these information because of limited personnel
The event that can not be supervised, more fully hereinafter supervises food and medicine.
It is following to be filled for the disclosure based on food and medicine data processing method described in the corresponding embodiment of above-mentioned Fig. 1
Embodiment is set, can be used for executing embodiments of the present disclosure.
The embodiment of the present disclosure provides a kind of food and medicine data processing equipment, as shown in figure 5, the food and medicine data processing
Device 50 includes: detection module 501, handling module 502, word segmentation module 503, extraction module 504, matching module 505, analysis mould
Block 506, memory module 507;
Wherein, it is preset for the detection food medicine hot word in the website that the presets list indicates or application APP detection module 501
List includes at least one website relevant to food medicine industry or APP, and food medicine hot word is that food medicine supervises the common of industry in network
Vocabulary;
Handling module 502 obtains detection when for detecting food medicine hot word in the website that the presets list indicates or APP
Message segment where the food medicine hot word arrived is as raw information;
Word segmentation module 503 obtains word to be matched for being segmented to raw information;
Extraction module 504, for extracting word to be matched from raw information;
Matching module 505, for matching word to be matched with the food medicine keyword in database, if to be matched
Word then successful match identical with food medicine keyword;
Analysis module 506 is used for when the successful quantity of word match to be matched is greater than or equal to preset quantity, to original
Information carries out semantic analysis, determines the event content and event body for including in raw information;
Memory module 507, for generating information to be stored according to the event content and event body that include in raw information,
Information to be stored is uploaded into food medicine supervisory systems.
In one embodiment, as shown in fig. 6, analysis module 506 includes main body submodule 5061 and content submodule
5062;
Main body submodule 5061, for by word to be matched respectively with management body list, name of product list carry out
Match, determines event body;
Content submodule 5062, for extracting time information and address information in word to be matched, and by word to be matched
Language is matched with event type keyword, true according to the temporal information, address information and event type keyword that extract
Determine event content.
In one embodiment, handling module 502 are also used to through web crawlers technology from the food medicine hot word institute detected
Website or APP data source grab raw information.
In one embodiment, memory module 507 are also used to according to wait store the event body for including in information and event
Information to be stored is uploaded to corresponding subsystem in food medicine supervisory systems by content, and food medicine supervisory systems includes current check
System checks handle a case subsystem, complaints and denunciation subsystem.
In one embodiment, food medicine hot word includes that enterprise name, food and medicine title, food and medicine event, quality are retouched
At least one of in stating.
The food and medicine data processing equipment that the embodiment of the present disclosure provides, is obtained from website relevant to food and medicine or APP
The raw information comprising food medicine hot word is taken, and the information is matched and analyzed, in this way, can be obtained from network
The description information largely about food and medicine is taken, can reflect many supervisors by these information because of limited personnel
The event that can not be supervised, more fully hereinafter supervises food and medicine.
Based on food and medicine data processing method described in the corresponding embodiment of above-mentioned Fig. 1, the embodiment of the present disclosure is also
A kind of computer readable storage medium is provided, for example, non-transitorycomputer readable storage medium can be read-only memory (English
Text: Read Only Memory, ROM), random access memory (English: Random Access Memory, RAM), CD-
ROM, tape, floppy disk and optical data storage devices etc..It is stored with computer instruction on the storage medium, for executing above-mentioned Fig. 1
Data transmission method described in corresponding embodiment, details are not described herein again.
Those skilled in the art will readily occur to its of the disclosure after considering specification and practicing disclosure disclosed herein
Its embodiment.This application is intended to cover any variations, uses, or adaptations of the disclosure, these modifications, purposes or
Person's adaptive change follows the general principles of this disclosure and including the undocumented common knowledge in the art of the disclosure
Or conventional techniques.The description and examples are only to be considered as illustrative, and the true scope and spirit of the disclosure are by following
Claim is pointed out.
It should be understood that the present disclosure is not limited to the precise structures that have been described above and shown in the drawings, and
And various modifications and changes may be made without departing from the scope thereof.The scope of the present disclosure is only limited by the accompanying claims.
Claims (10)
1. a kind of food and medicine data processing method, which is characterized in that the described method includes:
Detection food medicine hot word, described the presets list include at least one and food in the website of the presets list instruction or application APP
The relevant website of medicine industry or APP, the food medicine hot word are the common words that food medicine supervises industry in network;
When detecting the food medicine hot word in the website of described the presets list instruction or APP, the food medicine hot word detected is obtained
The message segment at place is as raw information;
The raw information is segmented to obtain word to be matched, and extracts the word to be matched from the raw information
Language;
The word to be matched is matched with the food medicine keyword in database, if the word to be matched and the food
The identical then successful match of medicine keyword;
When the successful quantity of word match to be matched is greater than or equal to preset quantity, the raw information is carried out semantic
Analysis, determines the event content and event body for including in the raw information;
Information to be stored is generated according to the event content and event body that include in the raw information, by the information to be stored
Upload to food medicine supervisory systems.
2. being determined the method according to claim 1, wherein described carry out semantic analysis to the raw information
The event content and event body for including in the raw information, comprising:
The word to be matched is matched with management body list, name of product list respectively, determines event body;
Extracting time information and address information in the word to be matched, and the word to be matched and event type is crucial
Word is matched, and determines event content according to the temporal information, address information and event type keyword that extract.
3. the method according to claim 1, wherein the message segment obtained where the food medicine hot word detected
As raw information, comprising:
The original letter is grabbed by the data source of website or APP where web crawlers technology from the food medicine hot word detected
Breath.
4. medicine supervisory systems is eaten the method according to claim 1, wherein the information to be stored is uploaded to,
Include:
According to described wait store the event body and event content that include in information, the information to be stored is uploaded into the food
Corresponding subsystem in medicine supervisory systems, the food medicine supervisory systems include current check subsystem, check subsystem of handling a case, throw
Tell report subsystem.
5. method according to claim 1-4, which is characterized in that
It is described food medicine hot word include enterprise name, food and medicine title, food and medicine event, quality description at least one of.
6. a kind of food and medicine data processing equipment, which is characterized in that described device includes detection module, handling module, participle
Module, extraction module, matching module, analysis module, memory module;
Wherein, the detection module, it is described pre- for the detection food medicine hot word in the website that the presets list indicates or application APP
If list includes at least one website relevant to food medicine industry or APP, the food medicine hot word is that food medicine supervises industry in network
Common words;
The handling module when for detecting the food medicine hot word in the website that described the presets list indicates or APP, obtains
Message segment where the food medicine hot word detected is as raw information;
The word segmentation module obtains word to be matched for being segmented to the raw information;
The extraction module, for extracting the word to be matched from the raw information;
The matching module, for matching the word to be matched with the food medicine keyword in database, if described
The identical then successful match of word to be matched and the food medicine keyword;
The analysis module is used for when the successful quantity of word match to be matched is greater than or equal to preset quantity, to institute
It states raw information and carries out semantic analysis, determine the event content and event body for including in the raw information;
The memory module, for generating letter to be stored according to the event content and event body that include in the raw information
The information to be stored is uploaded to food medicine supervisory systems by breath.
7. device according to claim 6, which is characterized in that the analysis module includes main body submodule and content submodule
Block;
The main body submodule, for will the word to be matched respectively with management body list, name of product list progress
Match, determines event body;
The content submodule, for extracting time information and address information in the word to be matched, and will it is described to
It is matched with word with event type keyword, it is crucial according to the temporal information, address information and event type that extract
Word determines event content.
8. device according to claim 6, which is characterized in that
The handling module is also used to the number by website or APP where web crawlers technology from the food medicine hot word detected
The raw information is grabbed according to source.
9. device according to claim 6, which is characterized in that
The memory module is also used to according to described wait store the event body and event content that include in information, will it is described to
Storage information uploads to corresponding subsystem in the food medicine supervisory systems, and the food medicine supervisory systems includes current check subsystem
System checks handle a case subsystem, complaints and denunciation subsystem.
10. according to the described in any item devices of claim 6-9, which is characterized in that
It is described food medicine hot word include enterprise name, food and medicine title, food and medicine event, quality description at least one of.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711344267.2A CN110020052A (en) | 2017-12-15 | 2017-12-15 | Food and medicine data processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711344267.2A CN110020052A (en) | 2017-12-15 | 2017-12-15 | Food and medicine data processing method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110020052A true CN110020052A (en) | 2019-07-16 |
Family
ID=67186954
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711344267.2A Pending CN110020052A (en) | 2017-12-15 | 2017-12-15 | Food and medicine data processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110020052A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111552775A (en) * | 2020-04-24 | 2020-08-18 | 山东瑞银农业科技有限公司 | Food data processing method and system |
Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103823792A (en) * | 2014-03-07 | 2014-05-28 | 网易(杭州)网络有限公司 | Method and equipment for detecting hotspot events from text document |
CN105426509A (en) * | 2015-11-30 | 2016-03-23 | 用友网络科技股份有限公司 | Abnormal quality event fast response system applied to enterprise |
CN106055658A (en) * | 2016-06-02 | 2016-10-26 | 中国人民解放军国防科学技术大学 | Extraction method aiming at Twitter text event |
CN106294619A (en) * | 2016-08-01 | 2017-01-04 | 上海交通大学 | Public sentiment intelligent supervision method |
CN106383905A (en) * | 2016-09-30 | 2017-02-08 | 南京索酷信息科技股份有限公司 | Public sentiment network monitoring method and device of smart home care big data |
-
2017
- 2017-12-15 CN CN201711344267.2A patent/CN110020052A/en active Pending
Patent Citations (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103823792A (en) * | 2014-03-07 | 2014-05-28 | 网易(杭州)网络有限公司 | Method and equipment for detecting hotspot events from text document |
CN105426509A (en) * | 2015-11-30 | 2016-03-23 | 用友网络科技股份有限公司 | Abnormal quality event fast response system applied to enterprise |
CN106055658A (en) * | 2016-06-02 | 2016-10-26 | 中国人民解放军国防科学技术大学 | Extraction method aiming at Twitter text event |
CN106294619A (en) * | 2016-08-01 | 2017-01-04 | 上海交通大学 | Public sentiment intelligent supervision method |
CN106383905A (en) * | 2016-09-30 | 2017-02-08 | 南京索酷信息科技股份有限公司 | Public sentiment network monitoring method and device of smart home care big data |
Non-Patent Citations (2)
Title |
---|
中国人工智能学会, 北京邮电大学出版社 * |
闫亮等: "基于网页特征关键词的近似检测算法", 《科学技术与工程》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111552775A (en) * | 2020-04-24 | 2020-08-18 | 山东瑞银农业科技有限公司 | Food data processing method and system |
CN111552775B (en) * | 2020-04-24 | 2023-11-03 | 上海薄荷健康科技股份有限公司 | Food data processing method and system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20200143289A1 (en) | Systems and method for performing contextual classification using supervised and unsupervised training | |
Imran et al. | Practical extraction of disaster-relevant information from social media | |
Zubiaga et al. | Exploiting context for rumour detection in social media | |
Imran et al. | Coordinating human and machine intelligence to classify microblog communications in crises. | |
CN112765442A (en) | Network emotion fluctuation index monitoring and analyzing method and system based on news big data | |
KR102353545B1 (en) | Method and Apparatus for Recommending Disaster Response | |
US9672475B2 (en) | Automated opinion prediction based on indirect information | |
CN108153732A (en) | The checking method and device of a kind of hearing record | |
US20150142780A1 (en) | Apparatus and method for analyzing event time-space correlation in social web media | |
Katragadda et al. | Detection of event onset using twitter | |
Sciandra | COVID-19 outbreak through Tweeters’ words: Monitoring Italian social media communication about COVID-19 with text mining and word embeddings | |
TW201640383A (en) | Internet events automatic collection and analysis method and system thereof | |
CN107506349A (en) | A kind of user's negative emotions Forecasting Methodology and system based on network log | |
Bhowmik et al. | A novel three-level voting model for detecting misleading information on covid-19 | |
KR102280490B1 (en) | Training data construction method for automatically generating training data for artificial intelligence model for counseling intention classification | |
Mahmoudi et al. | Arabic language modeling based on supervised machine learning | |
Carducci et al. | Semantically aware text categorisation for metadata annotation | |
Ruposh et al. | A computational approach of recognizing emotion from Bengali texts | |
CN112464036B (en) | Method and device for auditing violation data | |
CN109660621A (en) | A kind of content delivery method and service equipment | |
CN110020052A (en) | Food and medicine data processing method and device | |
KR102126911B1 (en) | Key player detection method in social media using KeyplayerRank | |
Khademi Habibabadi et al. | Vaccine adverse event mining of Twitter conversations: 2-phase classification study | |
Oo | Comparing accuracy between svm, random forest, k-nn text classifier algorithms for detecting syntactic ambiguity in software requirements | |
CN116976321A (en) | Text processing method, apparatus, computer device, storage medium, and program product |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190716 |