CN110347702A - A kind of data processing method and device - Google Patents
A kind of data processing method and device Download PDFInfo
- Publication number
- CN110347702A CN110347702A CN201910655989.2A CN201910655989A CN110347702A CN 110347702 A CN110347702 A CN 110347702A CN 201910655989 A CN201910655989 A CN 201910655989A CN 110347702 A CN110347702 A CN 110347702A
- Authority
- CN
- China
- Prior art keywords
- data
- result
- participle
- stored
- word segmentation
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/24—Querying
- G06F16/245—Query processing
- G06F16/2452—Query translation
- G06F16/24522—Translation of natural language queries to structured queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/279—Recognition of textual entities
- G06F40/289—Phrasal analysis, e.g. finite state techniques or chunking
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- General Health & Medical Sciences (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The embodiment of the present application provides a kind of data processing method and device, after getting to the data storage request of data to be stored, data to be stored can be segmented, obtain first participle result, judge to whether there is and the matched target data of equipment to be stored in target database according to first participle result, target data has the second word segmentation result, the matching value of second word segmentation result and first participle result is greater than or equal to the first preset value, if, illustrate that data to be stored is the repeated data of target data, data storage request can then be refused to respond, since repeated data is determined according to the matching value of word segmentation result, with certain accuracy, to effectively prevent the increase of repeated data in target database, improve the utilization rate of database.
Description
Technical field
The present invention relates to computer fields, more particularly to a kind of data processing method and device.
Background technique
With the arrival of information age, people face more and more data, can be carried out to data by database
Storage and management, user can into database storing data, relevant to term data can also be inquired by term,
Such as the data including term can be searched as lookup result.
Currently, can be limited, be prevented by uniqueness of the database to data in user's storing data into database
The increase of the data of exact matching, for example, the title of data to be stored and the title of data with existing it is consistent, then can be without this
The storage of data.However, this mode not can effectively prevent the increase of repeated data, it is easy to cause in database that there are redundancies
Data.
Summary of the invention
In order to solve the above technical problems, the embodiment of the present application provides a kind of data processing method and device, database is reduced
In repeated data, improve the utilization rate of database.
The embodiment of the present application provides a kind of data processing method, comprising:
The data storage request to data to be stored is obtained, the data to be stored is deposited in the data storage request instruction
It stores up to target database;
The data to be stored is segmented, first participle result is obtained;
It is matched as a result, judging to whether there is in the target database with the data to be stored according to the first participle
Target data, the target data has the second word segmentation result, second word segmentation result and the first participle result
Matching value is greater than or equal to the first preset value;
If so, refusing to respond the data storage request.
Optionally, the first participle is the result is that segment the data name of the data to be stored, institute
Stating the second word segmentation result is segmented to the data name of the target data;Or, the first participle the result is that
What data name and data content to the data to be stored were segmented, second word segmentation result is to the mesh
What the data name and data content for marking data were segmented.
Optionally, the first participle result includes multiple first words, and second word segmentation result includes multiple second words;
Then,
The matching value of the first participle result and second word segmentation result according to second word matched first
The quantity of word determines;Or,
Each first word has a weight, the matching value of the first participle result and second word segmentation result according to
It is determined with the quantity and weight of matched first word of second word.
Optionally, the method also includes:
If it is not, then storing the data to be stored to the target database.
It is optionally, described to refuse to respond the data storage request, comprising:
Show the target data;
Request is stored according to the cancellation to the data to be stored of user's triggering, the data storage is refused to respond and asks
It asks.
Optionally, the display target data, comprising:
Determining the matched data in the target database, the matched data has the 4th word segmentation result, and the described 4th
The matching value of word segmentation result and the first participle result is greater than or equal to the second preset value, and second preset value is less than or waits
In first preset value;
From high to low according to the matching value of the 4th word segmentation result and the first participle result, the coupling number is shown
According to.
The embodiment of the present application provides a kind of data processing equipment, and described device includes:
Request unit, for obtaining the data storage request to data to be stored, the data storage request instruction
The data to be stored is stored to target database;
Participle unit obtains first participle result for segmenting to the data to be stored;
Judging unit, for according to the first participle as a result, judge in the target database with the presence or absence of with it is described
The matched target data of data to be stored, the target data have the second word segmentation result, second word segmentation result with it is described
The matching value of first participle result is greater than or equal to the first preset value;If the determination result is YES, then refusal unit is activated;
The refusal unit, for refusing to respond the data storage request.
Optionally, the first participle is the result is that segment the data name of the data to be stored, institute
Stating the second word segmentation result is segmented to the data name of the target data;Or, the first participle the result is that
What data name and data content to the data to be stored were segmented, second word segmentation result is to the mesh
What the data name and data content for marking data were segmented.
Optionally, the first participle result includes multiple first words, and second word segmentation result includes multiple second words,
Then,
The matching value of the first participle result and second word segmentation result according to second word matched first
The quantity of word determines;Or,
Each first word has a weight, the matching value of the first participle result and second word segmentation result according to
It is determined with the quantity and weight of matched first word of second word.
Optionally, described device further include:
Storage unit, for storing the data to be stored to the target database;
The judging unit is also used to, if judging result be it is no, activate the storage unit.
Optionally, the refusal unit, comprising:
Display unit, for showing the target data;
Refuse subelement, the cancellation storage request to the data to be stored for triggering according to user refuses to respond
The data storage request.
Optionally, the display unit, comprising:
Data determination unit, for determining that the matched data in the target database, the matched data have the 4th
The matching value of word segmentation result, the 4th word segmentation result and the first participle result is greater than or equal to the second preset value, described
Second preset value is less than or equal to first preset value;
Show subelement, for the matching value according to the 4th word segmentation result and the first participle result from height to
It is low, show the matched data.
The embodiment of the present application provides a kind of data processing method and device, deposits getting the data to data to be stored
After storage request, data to be stored can be segmented, obtain the first participle as a result, judging number of targets according to first participle result
There is the second word segmentation result, the second participle knot with the matched target data of equipment to be stored, target data according to whether there is in library
The matching value of fruit and first participle result is greater than or equal to the first preset value, if so, illustrating that data to be stored is target data
Repeated data can then refuse to respond data storage request, since repeated data is determined according to the matching value of word segmentation result,
With certain accuracy, to effectively prevent the increase of repeated data in target database, the utilization rate of database is improved.
Detailed description of the invention
In order to more clearly explain the technical solutions in the embodiments of the present application, make required in being described below to embodiment
Attached drawing is briefly described, it should be apparent that, the accompanying drawings in the following description is only some implementations as described in this application
Example, for those of ordinary skill in the art, is also possible to obtain other drawings based on these drawings.
Fig. 1 is a kind of flow chart of data processing method provided by the embodiments of the present application;
Fig. 2 is a kind of structural block diagram of data processing equipment provided by the embodiments of the present application.
Specific embodiment
In order to make those skilled in the art more fully understand application scheme, below in conjunction in the embodiment of the present application
Attached drawing, the technical scheme in the embodiment of the application is clearly and completely described, it is clear that described embodiment is only this
Apply for a part of the embodiment, instead of all the embodiments.Based on the embodiment in the application, those of ordinary skill in the art exist
Every other embodiment obtained under the premise of creative work is not made, shall fall in the protection scope of this application.
Currently, can be limited, be prevented by uniqueness of the database to data in user's storing data into database
The increase of the data of exact matching, for example, the title of data to be stored and the title of data with existing it is consistent, at this time, it is believed that wait deposit
It stores up data and data with existing is repeated data, then it can be without the storage of the data.
However, this mode is only capable of preventing the increase of completely the same data, if the character express between two data is not
It is completely the same, but meaning is consistent, then cannot identify that the two data are repeated data, for example, " Local Tax Bureau, Shanxi Province " and " Shanxi
Land tax " indicates same company, is repeated data, but is limited by uniqueness of the database to data, cannot recognize that this
Repeated data, therefore this mode not can effectively prevent the increase of repeated data, is easy to cause in database that there are redundant digits
According to.
In order to solve the above-mentioned technical problem, the embodiment of the present application provides a kind of data processing method and device, is obtaining
To after the data storage request to data to be stored, data to be stored can be segmented, obtain the first participle as a result, according to
First participle result judge in target database with the presence or absence of with the matched target data of equipment to be stored, target data has the
The matching value of two word segmentation results, the second word segmentation result and first participle result is greater than or equal to the first preset value, if so, explanation to
Storing data is the repeated data of target data, then can refuse to respond data storage request, since repeated data is basis point
What the matching value of word result determined, there is certain accuracy, so that it is effectively prevent the increase of repeated data in target database,
Improve the utilization rate of database.
With reference to the accompanying drawing, be described in detail by embodiment a kind of data processing method provided by the embodiments of the present application and
The specific implementation of device.
It may include following refering to what is shown in Fig. 1, being a kind of flow chart of data processing method provided by the embodiments of the present application
Step:
S101 obtains the data storage request to data to be stored.
In the embodiment of the present application, user can into database storing data, need in user into target database
When storing data to be stored, the data storage request to data to be stored can be triggered, instruction stores data to be stored to mesh
Mark database.Wherein, data to be stored can be text information, including data name and data content, such as may include visitor
Name in an account book claims and customer profile etc., and by taking " Shanxi land tax " as an example, customer profile may include address, contact person, registion time
Deng;Target database can be PostgreSQL, MySQL, Oracle, SQL Server database etc., in target database
Data can be presented in the form of a web page, can also be presented in a text form in display.
Specifically, user can pass through the corresponding browser page of target database, trigger data storage request, such as point
It hits and increases a data " Shanxi land tax ", browser can be sent by network protocol to the corresponding server of target database
Data storage request, so that server be made to get the data storage request to data to be stored.
S102 segments data to be stored, obtains first participle result.
After getting to the data storage request of data to be stored, in order to judge whether data to be stored is repeat number
According to, first data to be stored can be segmented, obtain the first participle as a result, participle mode can be according to speech habits will
Content of text is decomposed into multiple words or word, as first participle result.In the embodiment of the present application, the movement of participle can be with
By calling full text engine tool to carry out, for example, by using Lucene full-text search engine tool.
Data to be stored is segmented, can be with customized word segmentation regulation, such as Chinese everyday expressions can be pre-defined
Dictionary, data to be stored is decomposed after obtaining multiple words, the obtained word of participle and dictionary can be matched,
The word of successful match can be used as first participle result.The dictionary of Chinese everyday expressions can regularly update, calibrated to obtain
True first participle result.
As a kind of possible implementation, the data name of data to be stored can be segmented, obtain first point
Word result.This is because the data name of data to be stored tends to represent the main meaning of data to be stored, if data name
Title be it is duplicate, then data content is largely also duplicate, such as the brief introduction of the same client is usually similar.
For example, the data name for data to be stored is " Shanxi land tax ", can segment to obtain " Shanxi " and " land tax ", as first
Word segmentation result.
As alternatively possible implementation, can data name to data to be stored and data content divide
Word obtains first participle result.It can be covered in this way by first participle result in the data name and data of data to be stored
Hold, more accurately to determine whether data to be stored is repetition according to the data name and data content of data to be stored
Data.
The data name of data to be stored is segmented the first participle as a result, be may include multiple first words,
Each first word can have weight, those skilled in the art can sets itself according to actual needs, the weight of the first word can
To be stored in the dictionary of Chinese everyday expressions.Such as the first participle result obtained after segmenting to data name is " mountain
When west " and " land tax ", can enable the weight in " Shanxi " is 0.9, and the weight of " land tax " is 0.1, can also enable the weight in " Shanxi "
It is 0.2, the weight of " land tax " is 0.8.
In the embodiment of the present application, the setting of weight can also be related to the location of the first word, such as to data name
The first word segmented is claimed to can have higher weight, the first word segmented to data content can have
There is lower weight.
S103, according to the first participle as a result, judging to whether there is and the matched target of data to be stored in target database
Data.
It, can be according to the obtained first participle as a result, judging in target database after being segmented to data to be stored
With the presence or absence of with the matched target data of data to be stored.Wherein, each data in target database can have third point
Word judges whether to sieve from third word segmentation result as a result, according to the matching degree of first participle result and third word segmentation result
The second word segmentation result for being greater than or equal to the first preset value with the matching degree of first participle result is selected, if it exists the second participle knot
The matching degree of fruit, the second word segmentation result and first participle result is higher, then the degree of correlation is higher, illustrates that the second word segmentation result is corresponding
The degree of correlation of data and data to be stored is higher, thus can using the corresponding data of the second word segmentation result as with data to be stored
Matched target data.
Specifically, in the first participle the result is that in the case that the data name to data to be stored is segmented to obtain, class
The first participle is similar to as a result, third word segmentation result can be segments to obtain to the data name of each data in target database
, the data name of each data can correspond to a third word segmentation result, and third word segmentation result may include multiple third words,
The second word segmentation result filtered out from third word segmentation result, is segmented for the data name to target data.?
That is in the embodiment of the present application can according only to data data name as judge data whether be repeated data according to
According to.
For example, the data name of data to be stored is " Shanxi land tax ", and corresponding first participle result includes multiple
First word: " Shanxi " and " land tax ".The data name of the first data in target database is " industrial and commercial bureau, Shanxi Province ", corresponding
Third word segmentation result includes multiple third words: " Shanxi Province ", " Shanxi ", " industrial and commercial bureau ", " industry and commerce ";Second in target database
The data name of data is " Local Tax Bureau, Shanxi Province ", and corresponding third word segmentation result includes multiple third words: " Shanxi Province ", " mountain
West ", " saving ground ", " Local Tax Bureau ", " land tax ", " tax office ";The data names of third data in target database is " Hebei province
Tax office ", corresponding third word segmentation result include multiple third words: " Hebei province ", " Hebei ", " saving ground ", " Local Tax Bureau ", "
Tax ", " tax office ".
As a kind of possible implementation, the first participle can be determined according to the quantity with matched first word of third word
As a result more with the quantity of matched first word of third word with the matching value of third word segmentation result, then first participle result and
The matching value of three word segmentation results is higher.It for example, include " Shanxi " with matched first word of third word for the first data,
Then the matching degree of first participle result and third word segmentation result can be 0.5;For the second data, with third word matched first
Word includes " Shanxi " and " land tax ", then the matching degree of first participle result and third word segmentation result can be 1;For third number
According to, with matched first word of third word include " land tax ", then the matching degree of first participle result and third word segmentation result can be
0.5.It can be seen that the corresponding third word segmentation result of the second data and first participle result matching degree highest.
As alternatively possible implementation, the first word has weight, then can according to third word matched first
The quantity and weight of word, determine the matching value of first participle result and third word segmentation result, with matched first word of third word
Quantity it is more, weight is bigger, then the matching value of first participle result and third word segmentation result is higher.It for example, can be with
The weight for enabling " Shanxi " is 0.2, and the weight of " land tax " is 0.8, then for the first data, first participle result and third participle knot
The matching degree of fruit can be 0.1, and for the second data, the matching degree of first participle result and third word segmentation result can be 0.5,
For third data, the matching degree of first participle result and third word segmentation result can be 0.4.
Specifically, in the first participle the result is that data name and data content to data to be stored were segmented
In the case of, similar to the first participle as a result, third word segmentation result can be the data name to each data in target database
What title and data content segmented, each data can correspond to a third word segmentation result, and third word segmentation result may include
Multiple third words, the second word segmentation result filtered out from third word segmentation result, for the data name and data to target data
What content was segmented.That is, can comprehensively consider in the data name and data of data in the embodiment of the present application
Hold, to judge whether data are repeated data.
The method of determination of the matching value of first participle result and third word segmentation result can refer to above two possible reality
Existing mode.
In addition to above-mentioned implementation, the data name of data to be stored can also be segmented to obtain the 4th participle, it is right
The data content of data to be stored is segmented to obtain the 5th participle, i.e. first participle result includes the 4th participle and the 5th point
Word segments the data name of each data in target database to obtain the 6th participle, to each in target database
The data content of a data is segmented to obtain the 7th participle, i.e. third word segmentation result includes that the 6th participle and the 7th segment, point
Not Ji Suan the 4th participle and the 6th participle name-matches value and the 5th participle and the 7th participle content matching value, in turn
According to above-mentioned name-matches value and content matching value, the matching value of first participle result and third word segmentation result is calculated.From
The second word segmentation result filtered out in third word segmentation result may include the 8th participle and the 9th participle, wherein the 8th participle is
The data name of target data is segmented to obtain, the 9th participle is to be segmented to obtain to the data content of target data.
After the matching degree that third word segmentation result and first participle result is calculated, can by with first participle result
The third word segmentation result that matching degree is greater than or equal to the first preset value screens, as the second word segmentation result, the second participle knot
Fruit includes multiple second words.That is, the matching value of first participle result and the second word segmentation result, can be basis and second
What the data of matched first word of word determined, it is also possible to determine according to the quantity and weight of matched first word of the second word
's.It is understood that different matching value calculations, can correspond to the first different preset values.For example, according to data name
When claiming to determine matching value, the first preset value can be 0.8, then the third word segmentation result of only the second data and first participle result
Matching degree be greater than the first preset value, the third word segmentation result of the second data can be screened as the second word segmentation result.
Since the matching degree of the second word segmentation result and first participle result is higher, then the second word segmentation result and first participle knot
The degree of correlation of fruit is higher, and the degree of correlation of the corresponding data of the second word segmentation result and data to be stored is also higher, can be used as with to
A possibility that matched target data of storing data, target data and data to be stored are repeated data is higher.Such as second number
According to word segmentation result be the second word segmentation result, the second data can be used as target data, and the second data and data to be stored are
The data matched.
If judge in target database exist with the matched target data of data to be stored, S104 can be executed, if judgement
In target database there is no with the matched target data of data to be stored, S105 can be executed.
S104 refuses to respond the data storage request.
If judge in target database exist with the matched target data of data to be stored, illustrate data to be stored and target
Data are repeated datas, at this time in order to prevent the increase of repeated data, therefore can refuse to respond data storage request, i.e., not into
The storage of row data to be stored improves the utilization rate of database to reduce the redundant data in database.
If judge in target database exist with the matched target data of data to be stored, can also with displaying target data,
User can judge whether to continue the storage of data to be stored according to the target data of display, if so, can according to
Family triggering continues storage request, and data to be stored is stored into target database, if it is not, can then be taken according to what user triggered
Disappear storage request, the data storage request to data to be stored is refused to respond, to reduce the redundant data in database.
Specifically, in displaying target data, can displaying target data, can also be with including displaying target data
With data, matched data is the data in target database, wherein matched data has the 4th word segmentation result, the 4th participle knot
The matching value of fruit and first participle result is greater than or equal to the second preset value, and it is default that the second preset value can be less than or equal to first
Value can be shown more in this way, may include target data in matched data, and for only displaying target data
Data relevant to data to be stored.4th word segmentation result of matched data is screened from third word segmentation result, the
The calculation of the matching value of four word segmentation results and first participle result can be with reference to third word segmentation result and first participle result
Matching value calculation.
When showing matched data, can according to the 4th word segmentation result and first participle result matching value from high to low,
Show matched data, thus make user get with the higher multiple data of the data to be stored degree of correlation, and then judge whether after
The continuous storage for carrying out data to be stored.
S105 stores data to be stored to target database.
If judge in target database there is no with the matched target data of data to be stored, illustrate that data to be stored is
Newly-increased data can store data to be stored to target database according to data storage request at this time, to realize to target
The maintenance and update of database.
The embodiment of the present application provides a kind of data processing method, is getting the data storage request to data to be stored
Afterwards, data to be stored can be segmented, obtains the first participle as a result, judging in target database according to first participle result
With the presence or absence of with the matched target data of equipment to be stored, target data has the second word segmentation result, the second word segmentation result and the
The matching value of one word segmentation result is greater than or equal to the first preset value, if so, illustrating that data to be stored is the repeat number of target data
According to, then can refuse to respond data storage request, due to repeated data be according to the matching value of word segmentation result determine, have one
Fixed accuracy improves the utilization rate of database to effectively prevent the increase of repeated data in target database.
Based on one of the above data processing method, the embodiment of the present application also provides a kind of data processing equipments, with reference to Fig. 2
It is shown, it is a kind of structural block diagram of data processing equipment provided by the embodiments of the present application, described device includes:
Request unit 110, for obtaining the data storage request to data to be stored, the data storage request refers to
Show and stores the data to be stored to target database;
Participle unit 120 obtains first participle result for segmenting to the data to be stored;
Judging unit 130, for according to the first participle as a result, judge in the target database whether there is and institute
The matched target data of data to be stored is stated, the target data has the second word segmentation result, second word segmentation result and institute
The matching value for stating first participle result is greater than or equal to the first preset value;If the determination result is YES, then refusal unit 140 is activated;
The refusal unit 140, for refusing to respond the data storage request.
Optionally, the first participle is the result is that segment the data name of the data to be stored, institute
Stating the second word segmentation result is segmented to the data name of the target data;Or, the first participle the result is that
What data name and data content to the data to be stored were segmented, second word segmentation result is to the mesh
What the data name and data content for marking data were segmented.
Optionally, the first participle result includes multiple first words, and second word segmentation result includes multiple second words,
Then,
The matching value of the first participle result and second word segmentation result according to second word matched first
The quantity of word determines;Or,
Each first word has a weight, the matching value of the first participle result and second word segmentation result according to
It is determined with the quantity and weight of matched first word of second word.
Optionally, described device further include:
Storage unit, for storing the data to be stored to the target database;
The judging unit is also used to, if judging result be it is no, activate the storage unit.
Optionally, the refusal unit, comprising:
Display unit, for showing the target data;
Refuse subelement, the cancellation storage request to the data to be stored for triggering according to user refuses to respond
The data storage request.
Optionally, the display unit, comprising:
Data determination unit, for determining that the matched data in the target database, the matched data have the 4th
The matching value of word segmentation result, the 4th word segmentation result and the first participle result is greater than or equal to the second preset value, described
Second preset value is less than or equal to the first preset value;
Show subelement, for the matching value according to the 4th word segmentation result and the first participle result from height to
It is low, show the matched data.
The embodiment of the present application provides a kind of data processing method and device, deposits getting the data to data to be stored
After storage request, data to be stored can be segmented, obtain the first participle as a result, judging number of targets according to first participle result
There is the second word segmentation result, the second participle knot with the matched target data of equipment to be stored, target data according to whether there is in library
The matching value of fruit and first participle result is greater than or equal to the first preset value, if so, illustrating that data to be stored is target data
Repeated data can then refuse to respond data storage request, since repeated data is determined according to the matching value of word segmentation result,
With certain accuracy, to effectively prevent the increase of repeated data in target database, the utilization rate of database is improved.
" first " in the titles such as " first ... " mentioned in the embodiment of the present application, " first ... " is used only to do name
Word mark, does not represent first sequentially.The rule is equally applicable to " second " etc..
As seen through the above description of the embodiments, those skilled in the art can be understood that above-mentioned implementation
All or part of the steps in example method can add the mode of general hardware platform to realize by software.Based on this understanding,
The technical solution of the application can be embodied in the form of software products, which can store is situated between in storage
In matter, such as read-only memory (English: read-only memory, ROM)/RAM, magnetic disk, CD etc., including some instructions to
So that a computer equipment (can be the network communication equipments such as personal computer, server, or router) executes
Method described in certain parts of each embodiment of the application or embodiment.
All the embodiments in this specification are described in a progressive manner, same and similar portion between each embodiment
Dividing may refer to each other, and each embodiment focuses on the differences from other embodiments.Especially for equipment reality
For applying example, since it is substantially similar to the method embodiment, so describing fairly simple, related place is referring to embodiment of the method
Part explanation.Apparatus embodiments described above are merely indicative, wherein mould as illustrated by the separation member
Block may or may not be physically separated, and the component shown as module may or may not be physics
Module, it can it is in one place, or may be distributed over multiple network units.It can select according to the actual needs
Some or all of the modules therein achieves the purpose of the solution of this embodiment.Those of ordinary skill in the art are not paying creation
Property labour in the case where, it can understand and implement.
The above is only the preferred embodiment of the application, is not intended to limit the protection scope of the application.It should refer to
Out, for those skilled in the art, it under the premise of not departing from the application, can also make several improvements
And retouching, these improvements and modifications also should be regarded as the protection scope of the application.
Claims (12)
1. a kind of data processing method, which is characterized in that the described method includes:
Obtain to the data storage request of data to be stored, the data storage request instruction by the data to be stored store to
Target database;
The data to be stored is segmented, first participle result is obtained;
According to the first participle as a result, judging to whether there is and the matched mesh of the data to be stored in the target database
Data are marked, the target data has the second word segmentation result, the matching of second word segmentation result and the first participle result
Value is greater than or equal to the first preset value;
If so, refusing to respond the data storage request.
2. the method according to claim 1, wherein the first participle is the result is that the data to be stored
What data name was segmented, second word segmentation result is to be segmented to obtain to the data name of the target data
's;Or, the first participle is the result is that data name and data content to the data to be stored were segmented, institute
Stating the second word segmentation result is segmented to the data name and data content of the target data.
3. described the method according to claim 1, wherein the first participle result includes multiple first words
Second word segmentation result includes multiple second words;Then,
The matching value of the first participle result and second word segmentation result according to matched first word of the second word
Quantity determines;Or,
Each first word has a weight, the matching value of the first participle result and second word segmentation result according to institute
The quantity and weight for stating matched first word of the second word determine.
4. method according to claim 1 to 3, which is characterized in that the method also includes:
If it is not, then storing the data to be stored to the target database.
5. method according to claim 1 to 3, which is characterized in that described to refuse to respond the data storage and ask
It asks, comprising:
Show the target data;
Request is stored according to the cancellation to the data to be stored of user's triggering, refuses to respond the data storage request.
6. according to the method described in claim 5, it is characterized in that, the display target data, comprising:
Determine that the matched data in the target database, the matched data have the 4th word segmentation result, the 4th participle
As a result it is greater than or equal to the second preset value with the matching value of the first participle result, second preset value is less than or equal to institute
State the first preset value;
From high to low according to the matching value of the 4th word segmentation result and the first participle result, the matched data is shown.
7. a kind of data processing equipment, which is characterized in that described device includes:
Request unit, for obtaining the data storage request to data to be stored, the data storage request is indicated institute
Data to be stored is stated to store to target database;
Participle unit obtains first participle result for segmenting to the data to be stored;
Judging unit, for according to the first participle as a result, judge in the target database with the presence or absence of with described wait deposit
The target data of Data Matching is stored up, the target data has the second word segmentation result, second word segmentation result and described first
The matching value of word segmentation result is greater than or equal to the first preset value;If the determination result is YES, then refusal unit is activated;
The refusal unit, for refusing to respond the data storage request.
8. device according to claim 7, which is characterized in that the first participle is the result is that the data to be stored
What data name was segmented, second word segmentation result is to be segmented to obtain to the data name of the target data
's;Or, the first participle is the result is that data name and data content to the data to be stored were segmented, institute
Stating the second word segmentation result is segmented to the data name and data content of the target data.
9. device according to claim 7, which is characterized in that the first participle result includes multiple first words, described
Second word segmentation result includes multiple second words, then,
The matching value of the first participle result and second word segmentation result according to matched first word of the second word
Quantity determines;Or,
Each first word has a weight, the matching value of the first participle result and second word segmentation result according to institute
The quantity and weight for stating matched first word of the second word determine.
10. according to device described in claim 7-9 any one, which is characterized in that described device further include:
Storage unit, for storing the data to be stored to the target database;
The judging unit is also used to, if judging result be it is no, activate the storage unit.
11. according to device described in claim 7-9 any one, which is characterized in that the refusal unit, comprising:
Display unit, for showing the target data;
Refuse subelement, the cancellation storage request to the data to be stored for triggering according to user refuses to respond described
Data storage request.
12. device according to claim 11, which is characterized in that the display unit, comprising:
Data determination unit, for determining that the matched data in the target database, the matched data have the 4th participle
As a result, the matching value of the 4th word segmentation result and the first participle result is greater than or equal to the second preset value, described second
Preset value is less than or equal to first preset value;
It shows subelement, from high to low for the matching value according to the 4th word segmentation result and the first participle result, shows
Show the matched data.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910655989.2A CN110347702A (en) | 2019-07-19 | 2019-07-19 | A kind of data processing method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910655989.2A CN110347702A (en) | 2019-07-19 | 2019-07-19 | A kind of data processing method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110347702A true CN110347702A (en) | 2019-10-18 |
Family
ID=68179428
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201910655989.2A Pending CN110347702A (en) | 2019-07-19 | 2019-07-19 | A kind of data processing method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110347702A (en) |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104881503A (en) * | 2015-06-24 | 2015-09-02 | 郑州悉知信息技术有限公司 | Data processing method and device |
US20150278300A1 (en) * | 2006-12-22 | 2015-10-01 | Emc Corporation | Query translation for searching complex structures of objects |
CN109785919A (en) * | 2018-11-30 | 2019-05-21 | 平安科技(深圳)有限公司 | Noun matching process, device, equipment and computer readable storage medium |
-
2019
- 2019-07-19 CN CN201910655989.2A patent/CN110347702A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20150278300A1 (en) * | 2006-12-22 | 2015-10-01 | Emc Corporation | Query translation for searching complex structures of objects |
CN104881503A (en) * | 2015-06-24 | 2015-09-02 | 郑州悉知信息技术有限公司 | Data processing method and device |
CN109785919A (en) * | 2018-11-30 | 2019-05-21 | 平安科技(深圳)有限公司 | Noun matching process, device, equipment and computer readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9639579B2 (en) | Determination of a desired repository for retrieving search results | |
CN107992514B (en) | Structured information card search and retrieval | |
US8615516B2 (en) | Grouping similar values for a specific attribute type of an entity to determine relevance and best values | |
US10621462B2 (en) | Density sampling map data | |
US20130218620A1 (en) | Method and system for skill extraction, analysis and recommendation in competency management | |
WO2012092196A1 (en) | Recommendation of search keywords based on indication of user intention | |
CN104021125B (en) | A kind of method, system and a kind of search engine of search engine sequence | |
CN105095231A (en) | Method and device for presenting search result | |
EP1890257A2 (en) | Clustering for structured data | |
CN105894183A (en) | Project evaluation method and apparatus | |
US20160070984A1 (en) | Density sampling map labels | |
CN110851729A (en) | Resource information recommendation method, device, equipment and computer storage medium | |
CN110191183A (en) | Accurate intelligent method for pushing, system, device and computer readable storage medium | |
CN108304112A (en) | Data processing method and device | |
CN111932308A (en) | Data recommendation method, device and equipment | |
US10846462B2 (en) | Web page output selection | |
CN108415748A (en) | Method for information display and system, computer storage media and equipment | |
CN108182200A (en) | Keyword expanding method and device based on semantic similarity | |
CN106156275A (en) | A kind of method and apparatus of singulated inquiry | |
CN106033444A (en) | Method and device for clustering text content | |
CN110347702A (en) | A kind of data processing method and device | |
US20200402125A1 (en) | Guide word recommendation | |
CN108334522B (en) | Method for determining customs code, and method and system for determining type information | |
CN109656954A (en) | Trade mark inquiry method, apparatus and computer equipment | |
CN112966176B (en) | Object display method and device, electronic equipment and readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20191018 |
|
RJ01 | Rejection of invention patent application after publication |