CN110347702A - Data processing method and device - Google Patents

Data processing method and device

Info

Publication number
CN110347702A
Authority
CN
China
Prior art keywords
data
result
participle
stored
word segmentation
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910655989.2A
Other languages
Chinese (zh)
Inventor
孟宾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Qi Polytron Technologies Inc
Original Assignee
Zhejiang Qi Polytron Technologies Inc
Application filed by Zhejiang Qi Polytron Technologies Inc
Priority to CN201910655989.2A
Publication of CN110347702A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20 Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24 Querying
    • G06F16/245 Query processing
    • G06F16/2452 Query translation
    • G06F16/24522 Translation of natural language queries to structured queries
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiments of the present application provide a data processing method and device. After a data storage request for data to be stored is obtained, the data to be stored is segmented into words to obtain a first word segmentation result. Based on the first word segmentation result, it is judged whether the target database contains target data that matches the data to be stored, where the target data has a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value. If so, the data to be stored duplicates the target data, and the data storage request can be refused. Because duplicate data is identified from the matching value of word segmentation results, the judgment has a certain accuracy, which effectively prevents duplicate data from accumulating in the target database and improves database utilization.

Description

Data processing method and device
Technical field
The present invention relates to the computer field, and in particular to a data processing method and device.
Background
With the arrival of the information age, people face more and more data. Data can be stored and managed in a database: users can store data in the database and can also query data related to a search term, for example by retrieving the data that contains the term as the search result.
Currently, when a user stores data in a database, the database's uniqueness constraint on data can prevent exactly matching data from being added. For example, if the name of the data to be stored is identical to the name of existing data, the data is not stored. However, this approach cannot effectively prevent duplicate data from being added, so redundant data tends to accumulate in the database.
Summary of the invention
To solve the above technical problem, the embodiments of the present application provide a data processing method and device that reduce duplicate data in a database and improve database utilization.
An embodiment of the present application provides a data processing method, comprising:
obtaining a data storage request for data to be stored, the data storage request indicating that the data to be stored is to be stored in a target database;
segmenting the data to be stored to obtain a first word segmentation result;
judging, according to the first word segmentation result, whether the target database contains target data matching the data to be stored, the target data having a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value;
if so, refusing to respond to the data storage request.
Optionally, the first word segmentation result is obtained by segmenting the data name of the data to be stored, and the second word segmentation result is obtained by segmenting the data name of the target data; or, the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, and the second word segmentation result is obtained by segmenting the data name and data content of the target data.
Optionally, the first word segmentation result includes multiple first words, and the second word segmentation result includes multiple second words; then,
the matching value of the first word segmentation result and the second word segmentation result is determined according to the number of first words that match the second words; or,
each first word has a weight, and the matching value of the first word segmentation result and the second word segmentation result is determined according to the number and weights of the first words that match the second words.
Optionally, the method further comprises:
if not, storing the data to be stored in the target database.
Optionally, refusing to respond to the data storage request comprises:
displaying the target data;
refusing to respond to the data storage request according to a cancel-storage request for the data to be stored triggered by the user.
Optionally, displaying the target data comprises:
determining matched data in the target database, the matched data having a fourth word segmentation result whose matching value with the first word segmentation result is greater than or equal to a second preset value, the second preset value being less than or equal to the first preset value;
displaying the matched data in descending order of the matching value between the fourth word segmentation result and the first word segmentation result.
An embodiment of the present application provides a data processing device, the device comprising:
a request obtaining unit, configured to obtain a data storage request for data to be stored, the data storage request indicating that the data to be stored is to be stored in a target database;
a word segmentation unit, configured to segment the data to be stored to obtain a first word segmentation result;
a judging unit, configured to judge, according to the first word segmentation result, whether the target database contains target data matching the data to be stored, the target data having a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value, and to activate a refusal unit if the judgment result is yes;
the refusal unit, configured to refuse to respond to the data storage request.
Optionally, the first word segmentation result is obtained by segmenting the data name of the data to be stored, and the second word segmentation result is obtained by segmenting the data name of the target data; or, the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, and the second word segmentation result is obtained by segmenting the data name and data content of the target data.
Optionally, the first word segmentation result includes multiple first words, and the second word segmentation result includes multiple second words; then,
the matching value of the first word segmentation result and the second word segmentation result is determined according to the number of first words that match the second words; or,
each first word has a weight, and the matching value of the first word segmentation result and the second word segmentation result is determined according to the number and weights of the first words that match the second words.
Optionally, the device further comprises:
a storage unit, configured to store the data to be stored in the target database;
the judging unit being further configured to activate the storage unit if the judgment result is no.
Optionally, the refusal unit comprises:
a display unit, configured to display the target data;
a refusal subunit, configured to refuse to respond to the data storage request according to a cancel-storage request for the data to be stored triggered by the user.
Optionally, the display unit comprises:
a data determination unit, configured to determine matched data in the target database, the matched data having a fourth word segmentation result whose matching value with the first word segmentation result is greater than or equal to a second preset value, the second preset value being less than or equal to the first preset value;
a display subunit, configured to display the matched data in descending order of the matching value between the fourth word segmentation result and the first word segmentation result.
The embodiments of the present application provide a data processing method and device. After a data storage request for data to be stored is obtained, the data to be stored is segmented to obtain a first word segmentation result. Based on the first word segmentation result, it is judged whether the target database contains target data matching the data to be stored, where the target data has a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value. If so, the data to be stored duplicates the target data, and the data storage request can be refused. Because duplicate data is identified from the matching value of word segmentation results, the judgment has a certain accuracy, which effectively prevents duplicate data from accumulating in the target database and improves database utilization.
Brief description of the drawings
To explain the technical solutions in the embodiments of the present application more clearly, the drawings needed to describe the embodiments are briefly introduced below. The drawings described below show only some embodiments of the present application; those of ordinary skill in the art can derive other drawings from them.
Fig. 1 is a flowchart of a data processing method provided by an embodiment of the present application;
Fig. 2 is a structural block diagram of a data processing device provided by an embodiment of the present application.
Detailed description of the embodiments
To help those skilled in the art better understand the solution of the present application, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. The described embodiments are only some of the embodiments of the present application, not all of them. All other embodiments obtained by those of ordinary skill in the art from the embodiments in the present application without creative effort fall within the protection scope of the present application.
Currently, when a user stores data in a database, the database's uniqueness constraint on data can prevent exactly matching data from being added. For example, if the name of the data to be stored is identical to the name of existing data, the data to be stored and the existing data are considered duplicates, and the data is not stored.
However, this approach only prevents fully identical data from being added. If two data items are worded differently but mean the same thing, they cannot be identified as duplicates. For example, "Local Tax Bureau, Shanxi Province" and "Shanxi land tax" denote the same organization and are duplicate data, but a uniqueness constraint cannot recognize this. This approach therefore cannot effectively prevent duplicate data from being added, and redundant data tends to accumulate in the database.
To solve the above technical problem, the embodiments of the present application provide a data processing method and device. After a data storage request for data to be stored is obtained, the data to be stored is segmented to obtain a first word segmentation result. Based on the first word segmentation result, it is judged whether the target database contains target data matching the data to be stored, where the target data has a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value. If so, the data to be stored duplicates the target data, and the data storage request can be refused. Because duplicate data is identified from the matching value of word segmentation results, the judgment has a certain accuracy, which effectively prevents duplicate data from accumulating in the target database and improves database utilization.
Specific implementations of the data processing method and device provided by the embodiments of the present application are described in detail below with reference to the drawings.
Fig. 1 is a flowchart of a data processing method provided by an embodiment of the present application. The method may include the following steps:
S101: obtain a data storage request for data to be stored.
In the embodiments of the present application, a user can store data in a database. When the user needs to store data to be stored in the target database, the user can trigger a data storage request for that data, indicating that it is to be stored in the target database. The data to be stored can be text information comprising a data name and data content, for example a customer name and a customer profile. Taking "Shanxi land tax" as an example, the customer profile may include an address, a contact person, a registration time, and so on. The target database can be a PostgreSQL, MySQL, Oracle, or SQL Server database, among others. Data in the target database can be presented as a web page or displayed as text.
Specifically, the user can trigger the data storage request through the browser page corresponding to the target database, for example by clicking to add a data item "Shanxi land tax". The browser can send the data storage request over a network protocol to the server corresponding to the target database, so that the server obtains the data storage request for the data to be stored.
S102: segment the data to be stored to obtain a first word segmentation result.
After the data storage request is obtained, to judge whether the data to be stored is duplicate data, the data to be stored can first be segmented to obtain a first word segmentation result. Segmentation can decompose the text content into multiple words or characters according to language conventions, and these form the first word segmentation result. In the embodiments of the present application, segmentation can be performed by calling a full-text engine tool, for example the Lucene full-text search engine.
Segmentation rules can be customized when segmenting the data to be stored. For example, a dictionary of common Chinese words can be defined in advance; after the data to be stored is decomposed into multiple words, those words are matched against the dictionary, and the words that match successfully form the first word segmentation result. The dictionary of common Chinese words can be updated regularly to obtain a more accurate first word segmentation result.
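The dictionary-based rule described above can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the function name `segment`, the sample dictionary, and the Chinese strings (assumed originals of the translated example "Shanxi land tax") are all hypothetical, and enumerating every substring is only one simple way to produce candidate words.

```python
def segment(text, dictionary):
    """Return the dictionary words found in text, ordered by first occurrence."""
    found = []
    # Decompose the text into every contiguous substring (the candidate words).
    for start in range(len(text)):
        for end in range(start + 1, len(text) + 1):
            candidate = text[start:end]
            # Keep only candidates that match the predefined dictionary.
            if candidate in dictionary and candidate not in found:
                found.append(candidate)
    return found


# Hypothetical dictionary of common words; in practice it would be updated regularly.
common_words = {"山西", "地税", "河北", "税务局"}

# "山西地税" is assumed to be the original of "Shanxi land tax".
first_result = segment("山西地税", common_words)
print(first_result)
```

A production system would more likely delegate this step to a full-text engine's analyzer, as the text suggests, and load a custom dictionary into it.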
In one possible implementation, the data name of the data to be stored is segmented to obtain the first word segmentation result. The data name usually conveys the main meaning of the data to be stored; if the data names are duplicates, the data content is very likely duplicated as well, since, for example, profiles of the same customer are usually similar. For example, if the data name of the data to be stored is "Shanxi land tax", segmentation yields "Shanxi" and "land tax" as the first word segmentation result.
In another possible implementation, both the data name and the data content of the data to be stored are segmented to obtain the first word segmentation result. The first word segmentation result then covers both the data name and the data content, so that whether the data to be stored is duplicate data can be determined more accurately from both.
The first word segmentation result obtained by segmenting the data name of the data to be stored can include multiple first words. Each first word can have a weight, which those skilled in the art can set according to actual needs, and the weights can be stored in the dictionary of common Chinese words. For example, when the first word segmentation result obtained from the data name is "Shanxi" and "land tax", the weight of "Shanxi" can be set to 0.9 and the weight of "land tax" to 0.1, or the weight of "Shanxi" can be set to 0.2 and the weight of "land tax" to 0.8.
In the embodiments of the present application, the weight can also depend on where the first word comes from. For example, first words obtained by segmenting the data name can have a higher weight, and first words obtained by segmenting the data content can have a lower weight.
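A small sketch of position-dependent weights with per-word overrides follows. The default values 0.8 and 0.2, the function name `assign_weights`, and the content word "contact person" are assumptions made for illustration; the text only says that name words can weigh more than content words and that weights are set as needed.

```python
NAME_WEIGHT = 0.8     # hypothetical default for words taken from the data name
CONTENT_WEIGHT = 0.2  # hypothetical default for words taken from the data content


def assign_weights(name_words, content_words, overrides=None):
    """Map each first word to a weight based on where it was found."""
    weights = {}
    for word in content_words:
        weights[word] = CONTENT_WEIGHT
    # Name words are written last so they win if a word appears in both places.
    for word in name_words:
        weights[word] = NAME_WEIGHT
    # Explicit per-word weights (for example, stored in the dictionary) take priority.
    weights.update(overrides or {})
    return weights


weights = assign_weights(["Shanxi", "land tax"], ["contact person"],
                         overrides={"Shanxi": 0.2})
print(weights)
```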
S103: judge, according to the first word segmentation result, whether the target database contains target data matching the data to be stored.
After the data to be stored is segmented, the resulting first word segmentation result is used to judge whether the target database contains target data matching the data to be stored. Each data item in the target database can have a third word segmentation result. Based on the matching degree between the first word segmentation result and each third word segmentation result, it is judged whether a second word segmentation result can be screened out, that is, a third word segmentation result whose matching degree with the first word segmentation result is greater than or equal to the first preset value. If a second word segmentation result exists, its high matching degree with the first word segmentation result indicates a high correlation between the corresponding data and the data to be stored, so the data corresponding to the second word segmentation result can be taken as the target data matching the data to be stored.
Specifically, when the first word segmentation result is obtained by segmenting the data name of the data to be stored, the third word segmentation result can likewise be obtained by segmenting the data name of each data item in the target database. The data name of each data item corresponds to one third word segmentation result, which can include multiple third words. The second word segmentation result screened out from the third word segmentation results is then the segmentation of the target data's data name. In other words, in the embodiments of the present application, the data name alone can serve as the basis for judging whether data is duplicate data.
For example, the data name of the data to be stored is "Shanxi land tax", and the corresponding first word segmentation result includes the first words "Shanxi" and "land tax". The data name of the first data item in the target database is "Industrial and Commercial Bureau, Shanxi Province", and its third word segmentation result includes the third words "Shanxi Province", "Shanxi", "industrial and commercial bureau", and "industry and commerce". The data name of the second data item is "Local Tax Bureau, Shanxi Province", and its third word segmentation result includes the third words "Shanxi Province", "Shanxi", "saving ground", "Local Tax Bureau", "land tax", and "tax office". The data name of the third data item is "Local Tax Bureau, Hebei Province", and its third word segmentation result includes the third words "Hebei Province", "Hebei", "saving ground", "Local Tax Bureau", "land tax", and "tax office".
In one possible implementation, the matching value of the first word segmentation result and the third word segmentation result is determined according to the number of first words that match third words: the more first words that match, the higher the matching value. For example, for the first data item, the first words matching third words include "Shanxi", so the matching degree can be 0.5. For the second data item, the matching first words include "Shanxi" and "land tax", so the matching degree can be 1. For the third data item, the matching first words include "land tax", so the matching degree can be 0.5. The third word segmentation result of the second data item therefore has the highest matching degree with the first word segmentation result.
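The count-based matching value can be sketched as the fraction of first words that also appear among the third words, which reproduces the 0.5, 1, and 0.5 figures in the example. The function name `count_matching_value` is hypothetical, and the normalisation by the number of first words is inferred from those figures rather than stated in the text.

```python
def count_matching_value(first_words, third_words):
    """Fraction of first words that also occur among the third words."""
    if not first_words:
        return 0.0
    third = set(third_words)
    matched = sum(1 for word in first_words if word in third)
    return matched / len(first_words)


first_words = ["Shanxi", "land tax"]
first_data = ["Shanxi Province", "Shanxi", "industrial and commercial bureau",
              "industry and commerce"]
second_data = ["Shanxi Province", "Shanxi", "saving ground", "Local Tax Bureau",
               "land tax", "tax office"]
third_data = ["Hebei Province", "Hebei", "saving ground", "Local Tax Bureau",
              "land tax", "tax office"]

for words in (first_data, second_data, third_data):
    print(count_matching_value(first_words, words))
```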
In another possible implementation, each first word has a weight, and the matching value of the first word segmentation result and the third word segmentation result is determined according to both the number and the weights of the first words that match third words: the more first words that match and the larger their weights, the higher the matching value. For example, with the weight of "Shanxi" set to 0.2 and the weight of "land tax" set to 0.8, the matching degree can be 0.1 for the first data item, 0.5 for the second data item, and 0.4 for the third data item.
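The weighted figures above (0.1, 0.5, 0.4) are consistent with summing the weights of the matched first words and dividing by the number of first words. That formula is an inference from the numbers, not something the text states, and the name `weighted_matching_value` is hypothetical.

```python
def weighted_matching_value(first_weights, third_words):
    """Sum of weights of matched first words, divided by the number of first words.

    first_weights maps each first word to its weight.
    """
    if not first_weights:
        return 0.0
    third = set(third_words)
    matched_weight = sum(w for word, w in first_weights.items() if word in third)
    return matched_weight / len(first_weights)


first_weights = {"Shanxi": 0.2, "land tax": 0.8}
first_data = ["Shanxi Province", "Shanxi", "industrial and commercial bureau",
              "industry and commerce"]
second_data = ["Shanxi Province", "Shanxi", "saving ground", "Local Tax Bureau",
               "land tax", "tax office"]
third_data = ["Hebei Province", "Hebei", "saving ground", "Local Tax Bureau",
              "land tax", "tax office"]

for words in (first_data, second_data, third_data):
    print(round(weighted_matching_value(first_weights, words), 3))
```

Other normalisations (for example, dividing by the total weight) would rank the three data items the same way but would call for a different first preset value.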
Specifically, when the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, the third word segmentation result can likewise be obtained by segmenting the data name and data content of each data item in the target database. Each data item corresponds to one third word segmentation result, which can include multiple third words. The second word segmentation result screened out from the third word segmentation results is then the segmentation of the target data's data name and data content. In other words, in the embodiments of the present application, the data name and data content can be considered together to judge whether data is duplicate data.
The matching value of the first word segmentation result and the third word segmentation result can be determined using either of the two possible implementations above.
In addition to the above implementations, the data name of the data to be stored can be segmented to obtain a fourth segmentation and its data content segmented to obtain a fifth segmentation, so that the first word segmentation result includes the fourth and fifth segmentations. The data name of each data item in the target database is segmented to obtain a sixth segmentation and its data content segmented to obtain a seventh segmentation, so that the third word segmentation result includes the sixth and seventh segmentations. A name matching value is calculated between the fourth and sixth segmentations, and a content matching value between the fifth and seventh segmentations; the matching value of the first word segmentation result and the third word segmentation result is then calculated from the name matching value and the content matching value. The second word segmentation result screened out from the third word segmentation results can include an eighth segmentation and a ninth segmentation, where the eighth segmentation is obtained by segmenting the data name of the target data and the ninth segmentation by segmenting its data content.
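The text does not say how the name matching value and the content matching value are combined. One plausible choice, shown here purely as an assumption, is a weighted sum in which the name contributes more; `name_share=0.7`, the function names, and the content words are hypothetical.

```python
def overlap(first_words, other_words):
    """Fraction of first_words that also occur in other_words."""
    if not first_words:
        return 0.0
    other = set(other_words)
    return sum(1 for word in first_words if word in other) / len(first_words)


def combined_matching_value(name_first, content_first, name_third, content_third,
                            name_share=0.7):
    """Blend the name and content matching values (assumed weighted sum)."""
    name_value = overlap(name_first, name_third)           # fourth vs. sixth segmentation
    content_value = overlap(content_first, content_third)  # fifth vs. seventh segmentation
    return name_share * name_value + (1.0 - name_share) * content_value


value = combined_matching_value(
    name_first=["Shanxi", "land tax"],
    content_first=["address", "contact person"],   # hypothetical content words
    name_third=["Shanxi Province", "Shanxi", "Local Tax Bureau", "land tax"],
    content_third=["address"],
)
print(round(value, 2))
```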
After the matching degree between each third word segmentation result and the first word segmentation result is calculated, the third word segmentation results whose matching degree with the first word segmentation result is greater than or equal to the first preset value are screened out as second word segmentation results, each of which includes multiple second words. That is, the matching value of the first word segmentation result and the second word segmentation result can be determined according to the number of first words matching second words, or according to the number and weights of the first words matching second words. Different ways of calculating the matching value can correspond to different first preset values. For example, when the matching value is determined from the data name by counting matched words, the first preset value can be 0.8; only the third word segmentation result of the second data item then has a matching degree with the first word segmentation result that exceeds the first preset value, so it is screened out as the second word segmentation result.
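The screening step amounts to a threshold filter over the computed matching values. The sketch below uses the count-based values from the running example and the first preset value of 0.8 mentioned above; the function name `screen` and the dictionary layout are illustrative assumptions.

```python
FIRST_PRESET_VALUE = 0.8  # example value from the text for the count-based method


def screen(matching_values, preset_value):
    """Return the data names whose matching value reaches the preset value."""
    return [name for name, value in matching_values.items() if value >= preset_value]


# Count-based matching values from the running example.
matching_values = {
    "Industrial and Commercial Bureau, Shanxi Province": 0.5,
    "Local Tax Bureau, Shanxi Province": 1.0,
    "Local Tax Bureau, Hebei Province": 0.5,
}

targets = screen(matching_values, FIRST_PRESET_VALUE)
print(targets)
```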
Because the second word segmentation result has a high matching degree with the first word segmentation result, the two are highly correlated, and so are the data corresponding to the second word segmentation result and the data to be stored. That data can therefore be taken as the target data matching the data to be stored, and the target data and the data to be stored are likely to be duplicates. For example, since the word segmentation result of the second data item is the second word segmentation result, the second data item can serve as the target data, and the second data item and the data to be stored are matched data.
If it is judged that the target database contains target data matching the data to be stored, S104 can be executed; if it is judged that the target database contains no such target data, S105 can be executed.
S104: refuse to respond to the data storage request.
If it is judged that the target database contains target data matching the data to be stored, the data to be stored and the target data are duplicates. To prevent duplicate data from being added, the data storage request can be refused, that is, the data to be stored is not stored, which reduces redundant data in the database and improves database utilization.
If it is judged that the target database contains target data matching the data to be stored, the target data can also be displayed, and the user can decide from the displayed target data whether to continue storing the data to be stored. If so, the data to be stored is stored in the target database according to a continue-storage request triggered by the user. If not, the data storage request is refused according to a cancel-storage request triggered by the user, which reduces redundant data in the database.
Specifically, when displaying the target data, only the target data may be displayed, or matched data that includes the target data may be displayed. The matched data is data in the target database that has a fourth word segmentation result whose matching value with the first word segmentation result is greater than or equal to a second preset value, and the second preset value can be less than or equal to the first preset value. The matched data can therefore include the target data, and compared with displaying only the target data, more data related to the data to be stored can be shown. The fourth word segmentation result of the matched data is screened out from the third word segmentation results, and its matching value with the first word segmentation result can be calculated in the same way as for the third word segmentation result.
When displaying the matched data, it can be displayed in descending order of the matching value between the fourth word segmentation result and the first word segmentation result, so that the user sees multiple data items highly correlated with the data to be stored and can then decide whether to continue storing it.
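Selecting and ordering the matched data for display can be sketched as a filter on the second preset value followed by a descending sort. The second preset value of 0.5 and the function name `rank_matched_data` are assumptions; the matching values are the count-based figures from the running example.

```python
def rank_matched_data(matching_values, second_preset_value):
    """Keep data at or above the second preset value, highest matching value first."""
    kept = [(name, value) for name, value in matching_values.items()
            if value >= second_preset_value]
    # sorted() is stable, so items with equal values keep their original order.
    return sorted(kept, key=lambda item: item[1], reverse=True)


matching_values = {
    "Industrial and Commercial Bureau, Shanxi Province": 0.5,
    "Local Tax Bureau, Shanxi Province": 1.0,
    "Local Tax Bureau, Hebei Province": 0.5,
}

# Hypothetical second preset value, chosen to be below the first preset value of 0.8.
for name, value in rank_matched_data(matching_values, second_preset_value=0.5):
    print(f"{value:.1f}  {name}")
```

With a second preset value equal to the first preset value, the displayed list collapses to the target data alone.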
S105: store the data to be stored in the target database.
If it is judged that the target database contains no target data matching the data to be stored, the data to be stored is new data. It can then be stored in the target database according to the data storage request, thereby maintaining and updating the target database.
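Steps S101 to S105 can be put together in a short sketch. It uses an in-memory list as a stand-in for the target database and a lookup table of the example's word lists in place of a real segmenter; both, along with the names `handle_storage_request` and `TOKENS`, are assumptions made so the example is self-contained.

```python
# Word lists from the running example; a real system would call a segmenter instead.
TOKENS = {
    "Shanxi land tax": ["Shanxi", "land tax"],
    "Local Tax Bureau, Shanxi Province": [
        "Shanxi Province", "Shanxi", "saving ground",
        "Local Tax Bureau", "land tax", "tax office"],
    "Industrial and Commercial Bureau, Shanxi Province": [
        "Shanxi Province", "Shanxi", "industrial and commercial bureau",
        "industry and commerce"],
}


def matching_value(first_words, other_words):
    """Count-based matching value: fraction of first words found in other_words."""
    other = set(other_words)
    return sum(1 for word in first_words if word in other) / len(first_words)


def handle_storage_request(name, database, first_preset_value=0.8):
    """Return (stored, target): refuse when a matching target exists, else store."""
    first_result = TOKENS[name]                        # S102: segment the data name
    for stored_name in database:                       # S103: look for target data
        if matching_value(first_result, TOKENS[stored_name]) >= first_preset_value:
            return False, stored_name                  # S104: refuse the request
    database.append(name)                              # S105: store the new data
    return True, None


database = ["Local Tax Bureau, Shanxi Province"]       # stand-in target database
print(handle_storage_request("Shanxi land tax", database))
print(handle_storage_request("Industrial and Commercial Bureau, Shanxi Province", database))
```

The first request is refused because every first word matches the stored entry; the second is stored because only half of its words match, which is below the first preset value.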
The embodiments of the present application provide a data processing method. After a data storage request for data to be stored is obtained, the data to be stored is segmented to obtain a first word segmentation result. Based on the first word segmentation result, it is judged whether the target database contains target data matching the data to be stored, where the target data has a second word segmentation result whose matching value with the first word segmentation result is greater than or equal to a first preset value. If so, the data to be stored duplicates the target data, and the data storage request can be refused. Because duplicate data is identified from the matching value of word segmentation results, the judgment has a certain accuracy, which effectively prevents duplicate data from accumulating in the target database and improves database utilization.
Based on the above data processing method, the embodiments of the present application further provide a data processing device. Referring to Fig. 2, which is a structural block diagram of a data processing device provided by the embodiments of the present application, the device includes:
A request unit 110, configured to obtain a data storage request for data to be stored, the data storage request indicating that the data to be stored is to be stored in a target database;
A word segmentation unit 120, configured to segment the data to be stored to obtain a first word segmentation result;
A judging unit 130, configured to judge, according to the first word segmentation result, whether target data matching the data to be stored exists in the target database, where the target data has a second word segmentation result and the matching value between the second word segmentation result and the first word segmentation result is greater than or equal to a first preset value; and, if the judgment result is yes, to activate the refusal unit 140;
The refusal unit 140, configured to refuse to respond to the data storage request.
Optionally, the first word segmentation result is obtained by segmenting the data name of the data to be stored, and the second word segmentation result is obtained by segmenting the data name of the target data; or, the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, and the second word segmentation result is obtained by segmenting the data name and data content of the target data.
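The two options above differ only in which text is segmented. A small sketch, assuming a stand-in tokenizer and a hypothetical helper `segmentation_result` (neither is named in the application), might look like this:

```python
import re
from typing import Set


def segment(text: str) -> Set[str]:
    """Stand-in segmenter (assumption): lowercase alphanumeric tokens."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def segmentation_result(name: str, content: str = "", include_content: bool = False) -> Set[str]:
    """Segment the data name alone, or the data name together with the data content."""
    text = f"{name} {content}" if include_content else name
    return segment(text)


name_only = segmentation_result("Order table", "Stores customer orders")
name_and_content = segmentation_result(
    "Order table", "Stores customer orders", include_content=True
)
```

Whichever option is chosen, the results being compared should be built the same way for the data to be stored and for the data in the target database, as the paragraph above pairs them.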
Optionally, the first word segmentation result includes multiple first words and the second word segmentation result includes multiple second words. In that case,
The matching value between the first word segmentation result and the second word segmentation result is determined according to the number of first words that match the second words; or,
Each first word has a weight, and the matching value between the first word segmentation result and the second word segmentation result is determined according to the number and weights of the first words that match the second words.
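The two ways of determining the matching value can be illustrated as follows. The application does not state the exact formula, so this sketch makes assumptions: the count-based value is the raw number of matched first words, the weighted value is the sum of their weights, and the function names, sample weights, and default weight are hypothetical.

```python
from typing import Dict, List


def count_match_value(first_words: List[str], second_words: List[str]) -> int:
    """Number of distinct first words that also appear among the second words."""
    second = set(second_words)
    return sum(1 for word in set(first_words) if word in second)


def weighted_match_value(
    first_words: List[str],
    second_words: List[str],
    weights: Dict[str, float],
    default_weight: float = 1.0,
) -> float:
    """Sum of the weights of the distinct first words that match a second word."""
    second = set(second_words)
    return sum(weights.get(word, default_weight) for word in set(first_words) if word in second)


first_words = ["customer", "order", "amount"]
second_words = ["customer", "order", "count"]
# Hypothetical weights: the distinguishing word "amount" counts more than common words.
weights = {"customer": 0.5, "order": 0.5, "amount": 2.0}

by_count = count_match_value(first_words, second_words)
by_weight = weighted_match_value(first_words, second_words, weights)
```

In this example two of the three first words match, but the weighted value stays low because the heavily weighted word `amount` is missing from the second result, which is one reason weights might be preferred when some words are more distinctive than others.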
Optionally, the device further includes:
A storage unit, configured to store the data to be stored in the target database;
The judging unit is further configured to activate the storage unit if the judgment result is no.
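One way to picture how the request, word segmentation, judging, refusal, and storage units cooperate is a single class whose methods stand in for the units. This is a hypothetical arrangement for illustration only: the class name, method names, the injected `segment` callable, and the matching metric are assumptions, and the application does not prescribe this structure.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Set


@dataclass
class DataProcessingDevice:
    """Hypothetical wiring of the units; all names here are illustrative."""

    segment: Callable[[str], Set[str]]  # word segmentation unit
    first_preset: float                 # threshold applied by the judging unit
    database: Dict[str, Set[str]] = field(default_factory=dict)
    refused: List[str] = field(default_factory=list)

    def handle_request(self, data: str) -> None:
        """Request unit: accept a storage request and route it through the other units."""
        first_result = self.segment(data)
        if self.judge(first_result):
            self.refuse(data)               # judgment result is yes
        else:
            self.store(data, first_result)  # judgment result is no

    def judge(self, first_result: Set[str]) -> bool:
        """Judging unit: does any stored result reach the first preset value?"""
        return any(
            self._match_value(first_result, second_result) >= self.first_preset
            for second_result in self.database.values()
        )

    def refuse(self, data: str) -> None:
        """Refusal unit: record the refusal instead of storing the data."""
        self.refused.append(data)

    def store(self, data: str, first_result: Set[str]) -> None:
        """Storage unit: add the data and its segmentation result to the database."""
        self.database[data] = first_result

    @staticmethod
    def _match_value(first: Set[str], second: Set[str]) -> float:
        # Assumed metric: share of first words that also occur in the second result.
        return len(first & second) / len(first) if first else 0.0


device = DataProcessingDevice(segment=lambda s: set(s.lower().split()), first_preset=1.0)
for item in ("sales report", "Sales Report", "sales forecast"):
    device.handle_request(item)
```

Passing the segmenter in as a callable keeps the sketch independent of any particular word segmentation library.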
Optionally, the refusal unit includes:
A display unit, configured to display the target data;
A refusal subunit, configured to refuse to respond to the data storage request according to a user-triggered request to cancel storage of the data to be stored.
Optionally, the display unit includes:
A data determination unit, configured to determine matched data in the target database, where the matched data has a fourth word segmentation result, the matching value between the fourth word segmentation result and the first word segmentation result is greater than or equal to a second preset value, and the second preset value is less than or equal to the first preset value;
A display subunit, configured to display the matched data in descending order of the matching value between the fourth word segmentation result and the first word segmentation result.
The embodiments of the present application provide a data processing method and device. After a data storage request for data to be stored is obtained, the data to be stored may be segmented to obtain a first word segmentation result, and whether target data matching the data to be stored exists in the target database is judged according to the first word segmentation result. The target data has a second word segmentation result, and the matching value between the second word segmentation result and the first word segmentation result is greater than or equal to a first preset value. If such target data exists, the data to be stored duplicates the target data, and the data storage request may be refused. Because duplicate data is identified according to the matching value of the word segmentation results, the identification has a certain accuracy, which effectively prevents duplicate data from accumulating in the target database and improves the utilization of the database.
The word "first" in names such as "first ..." mentioned in the embodiments of the present application serves only as a label and does not indicate order. The same rule applies to "second" and so on.
From the above description of the embodiments, those skilled in the art can clearly understand that all or part of the steps in the methods of the above embodiments may be implemented by software plus a general-purpose hardware platform. Based on this understanding, the technical solution of the present application may be embodied in the form of a software product. The computer software product may be stored in a storage medium, such as read-only memory (ROM)/RAM, a magnetic disk, or an optical disc, and includes several instructions that cause a computer device (which may be a personal computer, a server, or a network communication device such as a router) to execute the methods described in the embodiments, or in certain parts of the embodiments, of the present application.
The embodiments in this specification are described in a progressive manner; for identical or similar parts, the embodiments may refer to one another, and each embodiment focuses on its differences from the others. In particular, because the device embodiment is substantially similar to the method embodiment, it is described relatively briefly; for related details, see the description of the method embodiment. The device embodiments described above are merely illustrative: modules described as separate components may or may not be physically separate, and components shown as modules may or may not be physical modules, that is, they may be located in one place or distributed over multiple network units. Some or all of the modules may be selected according to actual needs to achieve the purpose of the solution of this embodiment. Those of ordinary skill in the art can understand and implement it without creative effort.
The above are only preferred embodiments of the present application and are not intended to limit its scope of protection. It should be noted that those of ordinary skill in the art may make several improvements and refinements without departing from the present application, and these improvements and refinements shall also be regarded as falling within the scope of protection of the present application.

Claims (12)

1. A data processing method, characterized in that the method comprises:
Obtaining a data storage request for data to be stored, the data storage request indicating that the data to be stored is to be stored in a target database;
Segmenting the data to be stored to obtain a first word segmentation result;
Judging, according to the first word segmentation result, whether target data matching the data to be stored exists in the target database, where the target data has a second word segmentation result and the matching value between the second word segmentation result and the first word segmentation result is greater than or equal to a first preset value;
If so, refusing to respond to the data storage request.
2. The method according to claim 1, characterized in that the first word segmentation result is obtained by segmenting the data name of the data to be stored, and the second word segmentation result is obtained by segmenting the data name of the target data; or, the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, and the second word segmentation result is obtained by segmenting the data name and data content of the target data.
3. The method according to claim 1, characterized in that the first word segmentation result includes multiple first words and the second word segmentation result includes multiple second words; in that case,
The matching value between the first word segmentation result and the second word segmentation result is determined according to the number of first words that match the second words; or,
Each first word has a weight, and the matching value between the first word segmentation result and the second word segmentation result is determined according to the number and weights of the first words that match the second words.
4. The method according to any one of claims 1 to 3, characterized in that the method further comprises:
If not, storing the data to be stored in the target database.
5. The method according to any one of claims 1 to 3, characterized in that refusing to respond to the data storage request comprises:
Displaying the target data;
Refusing to respond to the data storage request according to a user-triggered request to cancel storage of the data to be stored.
6. The method according to claim 5, characterized in that displaying the target data comprises:
Determining matched data in the target database, where the matched data has a fourth word segmentation result, the matching value between the fourth word segmentation result and the first word segmentation result is greater than or equal to a second preset value, and the second preset value is less than or equal to the first preset value;
Displaying the matched data in descending order of the matching value between the fourth word segmentation result and the first word segmentation result.
7. A data processing device, characterized in that the device comprises:
A request unit, configured to obtain a data storage request for data to be stored, the data storage request indicating that the data to be stored is to be stored in a target database;
A word segmentation unit, configured to segment the data to be stored to obtain a first word segmentation result;
A judging unit, configured to judge, according to the first word segmentation result, whether target data matching the data to be stored exists in the target database, where the target data has a second word segmentation result and the matching value between the second word segmentation result and the first word segmentation result is greater than or equal to a first preset value; and, if the judgment result is yes, to activate a refusal unit;
The refusal unit, configured to refuse to respond to the data storage request.
8. The device according to claim 7, characterized in that the first word segmentation result is obtained by segmenting the data name of the data to be stored, and the second word segmentation result is obtained by segmenting the data name of the target data; or, the first word segmentation result is obtained by segmenting the data name and data content of the data to be stored, and the second word segmentation result is obtained by segmenting the data name and data content of the target data.
9. The device according to claim 7, characterized in that the first word segmentation result includes multiple first words and the second word segmentation result includes multiple second words; in that case,
The matching value between the first word segmentation result and the second word segmentation result is determined according to the number of first words that match the second words; or,
Each first word has a weight, and the matching value between the first word segmentation result and the second word segmentation result is determined according to the number and weights of the first words that match the second words.
10. The device according to any one of claims 7 to 9, characterized in that the device further comprises:
A storage unit, configured to store the data to be stored in the target database;
The judging unit is further configured to activate the storage unit if the judgment result is no.
11. The device according to any one of claims 7 to 9, characterized in that the refusal unit comprises:
A display unit, configured to display the target data;
A refusal subunit, configured to refuse to respond to the data storage request according to a user-triggered request to cancel storage of the data to be stored.
12. The device according to claim 11, characterized in that the display unit comprises:
A data determination unit, configured to determine matched data in the target database, where the matched data has a fourth word segmentation result, the matching value between the fourth word segmentation result and the first word segmentation result is greater than or equal to a second preset value, and the second preset value is less than or equal to the first preset value;
A display subunit, configured to display the matched data in descending order of the matching value between the fourth word segmentation result and the first word segmentation result.
CN201910655989.2A 2019-07-19 2019-07-19 A kind of data processing method and device Pending CN110347702A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910655989.2A CN110347702A (en) 2019-07-19 2019-07-19 A kind of data processing method and device

Publications (1)

Publication Number Publication Date
CN110347702A true CN110347702A (en) 2019-10-18

Family

ID=68179428

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910655989.2A Pending CN110347702A (en) 2019-07-19 2019-07-19 A kind of data processing method and device

Country Status (1)

Country Link
CN (1) CN110347702A (en)

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150278300A1 (en) * 2006-12-22 2015-10-01 Emc Corporation Query translation for searching complex structures of objects
CN104881503A (en) * 2015-06-24 2015-09-02 郑州悉知信息技术有限公司 Data processing method and device
CN109785919A (en) * 2018-11-30 2019-05-21 平安科技(深圳)有限公司 Noun matching process, device, equipment and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191018
