CN106681524A - Method and device for processing information - Google Patents
Method and device for processing information Download PDFInfo
- Publication number
- CN106681524A CN106681524A CN201510763353.1A CN201510763353A CN106681524A CN 106681524 A CN106681524 A CN 106681524A CN 201510763353 A CN201510763353 A CN 201510763353A CN 106681524 A CN106681524 A CN 106681524A
- Authority
- CN
- China
- Prior art keywords
- information
- standard
- potential
- input
- input information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/02—Input arrangements using manually operated switches, e.g. using keyboards or dials
- G06F3/023—Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
- G06F3/0233—Character input methods
- G06F3/0237—Character input methods using prediction or retrieval techniques
Landscapes
- Engineering & Computer Science (AREA)
- General Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a method and device for processing information. The method comprises the steps of obtaining input information of a user, determining each potential standard information which corresponds to the input information in a preset information standard list according to the input information, determining the similarity between the input information and each potential standard information hereby, determining final standard information in each potential standard information according to the determined similarity, and processing the final standard information. Through the method, a corresponding relationship between wrong information and standard information does not need to be established, no matter what kind of wrong information input by the user is, the final standard information which corresponds to the wrong input information can be determined according to the similarity between the wrong input information and the standard information so as to process the final standard information, and thus the accuracy of information processing can be effectively improved.
Description
Technical field
The application is related to field of computer technology, more particularly to a kind of method and device of information processing.
Background technology
With the continuous development of network technology, internet has become not retrievable part in people's life,
After the information for receiving user's offer, the various services that can be provided the user are use e.g. for Internet service provider
Family provides download service etc., and at the same time, Internet service provider preferably services to provide the user with, generally
The information of user input will be processed, e.g., air control process of statistics and analysis etc. be carried out to information.
In actual applications, before processing the input information of user, the input letter for verifying user is needed
Whether breath is standard information, if not standard information, then needs for input information to be modified to standard information,
The standard information is processed again.
In the prior art, input information is modified into standard information has two ways, first kind of way:Root
According to default regular expression, for the input information for not meeting standard information form, according to standard information lattice
Formula, by the form of input information the form of standard information is modified to.The second way:For not being inconsistent standardization
The input information of information, according to the corresponding relation of the error message and standard information for pre-building, by mistake
Input information is modified to corresponding standard information.
But, for first kind of way, the form of the input information of user can only be modified to standard letter
The form of breath, it is impossible to the which whether content for determining input information meets the content of standard information, such as, it is assumed that use
The input information at family is mailbox, but "@163.com " is entered as "@164.com " by user's hand by mistake, then
"@164.com " meets the form of standard information, but does not meet the content of standard information, is subsequently based on
"@164.com " carries out the accuracy of such as air control process will be reduced.
It is first when the corresponding relation of error message and standard information is pre-build for the second way
The error message for obtaining history is first needed, then to the corresponding standard information of error message setting of each history.
Such as, the corresponding relation of error message "@164.com " and standard information "@163.com " is set up.But,
In actual applications, the species of the information of mistake is a lot, and is unpredictable, it is difficult to collect
The information of all of mistake, such as, it is impossible to predict whether user can be input into standard information "@163.com "
Into the error message such as "@165.com ", "@166.com " so that cannot subsequently correct into error message
Standard information, reduces so as to also result in the follow-up accuracy for carrying out such as air control process based on input information,
At the same time, the second way needs frequently the wrong information not occurred to be added into error message with mark
In the corresponding relation of calibration information, the cost of information processing is also increased.
The content of the invention
The embodiment of the present application provides a kind of method and device of information processing, to solve prior art in be based on
The relatively low problem of accuracy that input information is processed.
A kind of method of information processing that the embodiment of the present application is provided, methods described includes:
Obtain the input information of user;
According to the input information, in default information standard list, the input information correspondence is determined
Each potential standard information;
According to the input information and each potential standard information, determine that the input information is believed with each potential standard
Similarity between breath;
According to the similarity between the input information and each potential standard information, in each potential standard information,
Determine ultimate criterion information;
The ultimate criterion information is processed.
The method that a kind of air control that the embodiment of the present application is provided is processed, methods described includes:
Air control system obtains the input information of user;
According to the input information, in default information standard list, the input information correspondence is determined
Each potential standard information;
According to the input information and each potential standard information, determine that the input information is believed with each potential standard
Similarity between breath;
According to the similarity between the input information and each potential standard information, in each potential standard information,
Determine ultimate criterion information;
Risk prevention system process is carried out to the ultimate criterion information.
A kind of device of information processing that the embodiment of the present application is provided, described device includes:
Acquisition module, for obtaining the input information of user;
First determining module, for according to the input information, in default information standard list, it is determined that
Go out the corresponding each potential standard information of the input information;
Second determining module, for according to the input information and each potential standard information, determining the input
Similarity between information and each potential standard information;
3rd determining module, for according to the similarity between the input information and each potential standard information,
In each potential standard information, ultimate criterion information is determined;
Processing module, for processing the ultimate criterion information.
The device that a kind of air control that the embodiment of the present application is provided is processed, described device includes:
Acquisition module, for obtaining the input information of user;
First determining module, for according to the input information, in default information standard list, it is determined that
Go out the corresponding each potential standard information of the input information;
Second determining module, for according to the input information and each potential standard information, determining the input
Similarity between information and each potential standard information;
3rd determining module, for according to the similarity between the input information and each potential standard information,
In each potential standard information, ultimate criterion information is determined;
Air control module, for carrying out risk prevention system process to the ultimate criterion information.
The embodiment of the present application provides a kind of method and device of information processing, and the method obtains the input letter of user
Breath, according to the input information, in default information standard list, determines that the input information is corresponding each
Potential standard information, and the similarity between the input information and each potential standard information is determined therefrom that, according to
The similarity determined, in each potential standard information, determines ultimate criterion information, and to the final mark
Calibration information is processed.By said method, the corresponding relation without the need for setting up error message and standard information,
No matter which kind of error message is the input information of user be, all can according to its similarity with standard information, it is determined that
The corresponding ultimate criterion information of the wrong input information, is processed the ultimate criterion information, you can have
Effect improves the accuracy of information processing.
Description of the drawings
Accompanying drawing described herein is used for providing further understanding of the present application, constitutes the part of the application,
The schematic description and description of the application does not constitute the improper limit to the application for explaining the application
It is fixed.In the accompanying drawings:
The process of the information processing that Fig. 1 is provided for the embodiment of the present application;
The process that Fig. 2 is processed for the air control that the embodiment of the present application is provided;
The apparatus structure schematic diagram of the information processing that Fig. 3 is provided for the embodiment of the present application;
The apparatus structure schematic diagram that Fig. 4 is processed for the air control that the embodiment of the present application is provided.
Specific embodiment
It is specifically real below in conjunction with the application to make purpose, technical scheme and the advantage of the application clearer
Apply example and corresponding accompanying drawing is clearly and completely described to technical scheme.Obviously, it is described
Embodiment is only some embodiments of the present application, rather than the embodiment of whole.Based on the enforcement in the application
Example, the every other enforcement that those of ordinary skill in the art are obtained under the premise of creative work is not made
Example, belongs to the scope of the application protection.
The process of the information processing that Fig. 1 is provided for the embodiment of the present application, specifically includes following steps:
S101:Obtain the input information of user.
In actual applications, the various services that user generally can be by input information to be provided using server,
Such as, user is by being input into E-mail address information come register account number, so as to the purchase provided using certain commodity website
Commodity and service.And server is also required to be processed based on the input information of user, e.g., by the input of user
Information logs in the account etc. as account.Therefore, in the embodiment of the present application, server needs first to obtain
The input information of user.
Specifically, server can receiving terminal send user input information, as the user's for getting
Input information.Also partial information can be extracted from the input information according to default rule, will be extracted
Partial information again as the input information for getting.
For example, it is assumed that user wants to be logged in by the hotmail.co.uk of its E-mail address 37927397, but
Its mistake is entered as into 37927397@hoymail.couk in terminal, then server receives what terminal was sent
After 37927397@hoymail.couk, can be according to default default rule, will
"@" in 37927397@hoymail.couk and all characters after "@" are extracted,
As the input information for getting, as@hoymail.couk.
S102:According to the input information, in default information standard list, the input letter is determined
Cease corresponding each potential standard information.
In the embodiment of the present application, described information standard list can be server previously according to a large amount of in history
Correctly enter information generation, e.g., can in advance by it is historical correctly enter information "@163.com ",
During "@hotmail.co.uk " is added to information standard list.
It is determined that during the corresponding each potential standard information of input information, specifically can be according to default search fractionation side
Formula, by input information corresponding search word is split into, further according to splitting the search word that obtains and pre-build
Inverted index, in each standard information that default information standard list is included, determine the input information
Corresponding each potential standard information.
Continuation of the previous cases, it is assumed that default search splits mode and is:By the part before ". " as a search
Word, by the part after ". " as a search word.The input information that then server will can be obtained
"@hoymail.couk " is split as search word " hoymail " and " couk ", further according to the two search words
And the inverted index for pre-building, by the way of searching for generally, in each mark that information standard list is included
In calibration information, each potential standard information is searched out.
Assume that the standard information included in information standard list is:@163.com、@sohu.com、
@hotmail.co.uk、@hormail.co.uk、@htmail.co.uk.Then according to search word " hoymail " and
" couk ", it may be determined that going out each potential standard information corresponding with input information "@hoymail.couk " is:
@hotmail.co.uk、@hormail.co.uk、@htmail.co.uk。
In addition, the mode of setting up of the inverted index is specifically as follows:Tear open according to above-mentioned same search in advance
Point mode, by each standard information included in standard information list corresponding search word is split into, and according to searching
Rope word and the standard information comprising the search word, set up inverted index.
S103:According to the input information and each potential standard information, determine that the input information is potential with each
Similarity between standard information.
In actual applications, user due to hand by mistake the input information of mistake input and standard information on character not
Can differ greatly, therefore, in this application, can using calculate input information and each potential standard information it
Between similarity come to determine which potential standard information on earth be the information that user really wants to be input into.
Similarity characterization input information between the input information and potential standard information is believed with potential standard
Similar degree between breath, if similarity is bigger, illustrates between input information and potential standard information more
It is similar, that is to say, that the potential standard information is that the possibility of the information that user really thinks input is bigger, such as
Fruit similarity is less, then illustrate more dissimilar between input information and potential standard information, that is to say, that should
Potential standard information is that the possibility of the information that user really thinks input is less.
Therefore, server determined after the corresponding each potential information of input information by step S102, can be counted
Calculate the similarity between input information and each potential information, the application can be with when similarity is calculated, specifically
For arbitrary potential standard information, determine and the input information is modified to needed for the potential standard information most
Few number of operations (that is, editing distance), then determine therefrom that out the input information and the potential standard information
Between similarity.Wherein, described minimal action number of times is bigger, then similarity is less, and that what is counted is minimum
Number of operations is less, then similarity is bigger.
Here needs to say that the above is pair similarity for determining input information and each potential standard information
The exemplary illustration of method, the method for above-mentioned determination similarity is not unique method, e.g., can also be led to
The method for crossing calculating Euclidean distance determines the similarity of input information and each potential standard information, as long as calculating side
The practical significance that method is embodied is input information and each potential standard information similarity degree.
Continuation of the previous cases, server is determined and is modified to "@hoymail.couk " "@hotmail.co.uk "
Required minimal action number of times is 2 times, i.e. only need to be modified to " y " " t ", and at " o " and " u "
Centre addition one ". ", then determine therefrom that out " hoymail.couk " and " hotmail.co.uk " it
Between similarity be:3.432.Similar, determine and be modified to "@hoymail.couk "
Minimal action number of times needed for "@hormail.co.uk " is 2 times, then is determined therefrom that out "@hoymail.couk "
Similarity between "@hormail.co.uk " is:3.432;Determine "@hoymail.couk " more
The minimal action number of times made into needed for "@htmail.co.uk " is 3 times, is determined therefrom that out "@hoymail.couk "
Similarity between "@htmail.co.uk " is:3.099.
S104:According to the similarity between the input information and each potential standard information, in each potential standard
In information, ultimate criterion information is determined.
In the embodiment of the present application, it is between input information and potential standard information similar due to similarity characterization
Degree, that is to say, that similarity is bigger, then illustrate more similar between input information and potential standard information,
Therefore, in this application can according to the size of input information and the similarity of each potential standard information by height to
It is low to be ranked up, and the potential standard information of similarity highest is defined as into ultimate criterion information, it is described final
Standard information is the information that user really wants to be input into.
In actual applications, it is possible to there are the feelings that the potential standard information of similarity highest has at least two
Condition, in this case, can directly according to the priority of default each potential standard information, by priority
The potential standard information of highest, is defined as ultimate criterion information.
Continuation of the previous cases, server determines that the potential standard information of similarity highest is:@hotmail.co.uk、
@hormail.co.uk, the similarity of the two is all 3.432, it is assumed that default each potential standard information it is preferential
Level is followed successively by from high to low:@163.com、@sohu.com、@hotmail.co.uk、@hormail.co.uk、
@htmail.co.uk, then according to default priority, by "@hotmail.co.uk " ultimate criterion is defined as
Information.
In addition, preset priority being specifically as follows:Count each standard information in standard information list
The quantity for occurring in history, quantity is more, then priority is higher, and quantity is lower, then priority is lower.
S105:The ultimate criterion information is processed.
Continue to use the example above, server determine "@hotmail.co.uk " for ultimate criterion information (i.e.,
The correct suffix in E-mail address of user be "@hotmail.co.uk ") after, will " 37927397 " and
" hotmail.co.uk " is combined into " 37927397 hotmail.co.uk ", and carries out login process, is
User provides follow-up service.
By said method, the corresponding relation without the need for setting up error message and standard information, no matter user's is defeated
Enter which kind of error message is information be, all the input of the mistake can be determined according to its similarity with standard information
The corresponding ultimate criterion information of information, is processed the ultimate criterion information, you can effectively improve at information
The accuracy of reason.
Here it should be noted that it is above-mentioned be with by server by method as shown in Figure 1 come processing information
As a example by illustrate, method certainly as shown in Figure 1 can also be completed by terminal, when by terminal processes information
When, in step S101, terminal can directly receive the input information of user, and according to the input information,
In subsequent step, terminal then can determine defeated in the information standard list of the terminal local is pre-stored in
Enter the corresponding each potential standard information of information, then determine the similarity between input information and each potential standard,
And according to the similarity, ultimate criterion information is determined, finally the ultimate criterion information is processed, e.g.,
The ultimate criterion information for obtaining is sent into server etc..
In actual applications, whether input information is standard information, mainly with the form of input information and defeated
The content for entering information is defined, that is to say, that the reason for input information is not standard information is possible to simply enter letter
The form of breath occurs in that mistake, it is also possible to which the content for simply entering information occurs in that mistake, it is also possible to be
The format and content of input information all occurs in that mistake.If the form of input information there is a problem,
It is determined that when input information is modified into the minimal action number of times needed for the potential standard information, inherently increasing and repairing
Change the operation of form, such minimal action number of times will increase, so that the similarity finally determined
Can increase, therefore, in the embodiment of the present application, can be first defeated by this after the input information for getting user
Enter information to be adjusted according to default standard information form.
Here it should be noted that the application can with by the way of regular expression come preset standard information
Form, e.g., default regular expression is:@w+([-.]w+)*.w+([-.]w+)*.
In addition, in actual applications, it is possible to which user occurs the information of subject intent input error, i.e.
The information of user input is inherently unrelated with standard information, so after step S102~S104, also necessarily
A ultimate criterion information is can determine whether out, it is so follow-up when processing the ultimate criterion information, reduce
The accuracy that follow-up is processed, therefore, it can preset a similarity threshold, if similarity
The similarity of the potential standard information of highest is more than or equal to default similarity threshold, then can be by similarity highest
Potential standard information be defined as ultimate criterion information, if the potential standard information of similarity highest is similar
Degree is less than default similarity threshold, then can give up to fall the potential standard information of similarity highest, i.e. follow-up
Any process is not carried out to the potential standard information of similarity highest.
Finally, here is it should be noted that it is all the letter of mistake that the above is all the input information of the user for obtaining
Breath, i.e. be not standard information, in actual applications, the input information of the user of acquisition it could also be possible that
Standard information, when the input information of the user for obtaining is standard information, without the need for passing through step S102~S104,
Directly the input information is processed, if this is because directly by standard information also by step
S102~S104, will certainly reduce the efficiency of information processing.
Below as a example by carrying out air control process to information, the information processing method that the application is provided is described in detail.
The process that Fig. 2 is processed for the air control that the application is provided, specifically includes following steps:
S201:Obtain the input information of user.
In the embodiment of the present application, the input information of the user that air control system sends receiving terminal, as obtaining
The input information of the user for getting, and being processed accordingly the input information, the input information can be with
Be E-mail address information, or other there is the information of set form and immobilized substance, e.g., interconnection
Net address, because for the process of air control system, E-mail address information is an important air control dimension,
Therefore, it is following in the application all to be illustrated with Email Information.
S202:According to the input information, in default information standard list, the input letter is determined
Cease corresponding each potential standard information.
In the embodiment of the present application, air control system receives again E-mail address information (that is, the input letter of user
Breath) after, it is determined that during the corresponding each potential standard electronic mailbox message of E-mail address information, step may also be employed
The inverted index set up in rapid 102, by E-mail address information according to default search fractionation mode, by electricity
Sub-voice mailbox information splits into corresponding search word, and according to the search word in inverted index, determines each latent
In standard electronic mailbox message.
S203:According to the input information and each potential standard information, determine that the input information is potential with each
Similarity between standard information.
Further, air control system is after each potential E-mail address standard information is determined, for arbitrary potential
For E-mail address standard information, determine and the E-mail address information is modified into the potential E-mail address standard
Minimal action number of times (that is, editing distance) needed for information, then determine therefrom that out the E-mail address information
With the similarity between the potential E-mail address standard information.
S204:According to the similarity between the input information and each potential standard information, in each potential standard
In information, ultimate criterion information is determined.
In the embodiment of the present application, air control system is in the similar of each potential E-mail address standard information determined
In degree, the potential E-mail address standard information of similarity highest is determined, the potential E-mail address standard is believed
Breath is defined as ultimate criterion information.
In actual applications, have at least two if there is the potential E-mail address standard information of similarity highest
It is individual, directly according to the priority of default each potential E-mail address standard information, by the potential of highest priority
Standard information, is defined as final E-mail address standard information.
S205:Risk prevention system process is carried out to the ultimate criterion information.
Finally, in this application, air control system is after the final E-mail address standard information determined, subsequently
Statistics and analysis can be carried out to E-mail address standard information, and according to the result of statistics and analysis, makes corresponding
Measure, e.g., statistics and analysis are carried out by the E-mail address information being input into a large number of users, which is determined
A little E-mail addresses are the malice mailboxes that high-volume is generated, subsequently the E-mail address determined can be tracked or
Control.
For example, it is assumed that user's first is done shopping on certain commodity website, in primary transactional operation, use
The E-mail address information of family first input is " abx@164.com ", in secondary transactional operation, user
The E-mail address information of first input is " abx@163.com ", and the air control system of the follow-up commodity website is obtained
To the E-mail address information " abx@164.com " and " abx@163.com " of the input of user's first, if air control
The E-mail address information of system of users first input not by the process of above-mentioned step S201~S204, then wind
E-mail address information is " abx 164.com " corresponding Transaction Information and electronics postal in rear extended meeting by control system
Case carries out respectively statistics and analysis for " abx@163.com " corresponding Transaction Information, and actually the two
The corresponding Transaction Information in E-mail address is all user's first, so as to reduce the accuracy of ultimate risk process.
But, by the process of above-mentioned steps S201~S204, air control system is determined " abx@164.com "
Ultimate criterion information be " abx@163.com ", and user's first input " abx@163.com " inherently
It is standard information, it is therefore not necessary to pass through the process of step S201~S204, follow-up air control system is by electronics postal
Case information is counted on together for " abx@163.com " corresponding Transaction Information, and makes corresponding data point
Analysis, it is follow-up according to the analysis result for obtaining, make corresponding risk and process.
The method and air control processing method of the information processing for providing for the embodiment of the present application above, based on same
Thinking, the embodiment of the present application also provides the device that a kind of device of information processing and air control are processed, such as Fig. 3,
Shown in Fig. 4.
The apparatus structure schematic diagram of the information processing that Fig. 3 is provided for the embodiment of the present application, described device includes:
Acquisition module 301, for obtaining the input information of user;
First determining module 302, for according to the input information, in default information standard list,
Determine the corresponding each potential standard information of the input information;
Second determining module 303, for according to the input information and each potential standard information, it is determined that described
Similarity between input information and each potential standard information;
3rd determining module 304, for according to similar between the input information and each potential standard information
Degree, in each potential standard information, determines ultimate criterion information;
Processing module 305, for processing the ultimate criterion information.
Described device also includes:
Adjusting module 306, for determining that the input information is corresponding in first determining module 302
Before each potential standard information, the input information is adjusted according to default standard information form.
First determining module 302 is specifically for according to default search fractionation mode, by the input
Information splits into corresponding search word, according to the search word and the inverted index for pre-building, default
Each standard information for including of information standard list in, determine the corresponding each potential standard of the input information
Information.
Second determining module 303 is specifically for for arbitrary potential standard information, determining will be described
Input information is modified to the minimal action number of times needed for the potential standard information, according to the minimal action number of times,
Determine the similarity between the input information and the potential standard information.
3rd determining module 304 is specifically for the potential standard information of similarity highest is defined as most
Whole standard information, or when the potential standard information of similarity highest has at least two, according to pre-building
Potential standard information priority, the potential standard information of highest priority is defined as into ultimate criterion information.
3rd determining module 304 more than each potential standard of predetermined threshold value in similarity specifically for believing
In breath, ultimate criterion information is determined.
Described device also includes:
4th determining module 307, for determining the input information pair in first determining module 302
Before each potential standard information answered, determine that the input information is non-standard information.
The apparatus structure schematic diagram that Fig. 4 is processed for the air control that the embodiment of the present application is provided, described device includes:
Acquisition module 401, for obtaining the input information of user;
First determining module 402, for according to the input information, in default information standard list,
Determine the corresponding each potential standard information of the input information;
Second determining module 403, for according to the input information and each potential standard information, it is determined that described
Similarity between input information and each potential standard information;
3rd determining module 404, for according to similar between the input information and each potential standard information
Degree, in each potential standard information, determines ultimate criterion information;
Air control module 405, for carrying out risk prevention system process to the ultimate criterion information.
The input information includes E-mail address information.
In a typical configuration, computing device includes one or more processors (CPU), input/defeated
Outgoing interface, network interface and internal memory.
Internal memory potentially includes the volatile memory in computer-readable medium, random access memory
And/or the form, such as read-only storage (ROM) or flash memory (flash RAM) such as Nonvolatile memory (RAM).
Internal memory is the example of computer-readable medium.
Computer-readable medium includes that permanent and non-permanent, removable and non-removable media can be by appointing
What method or technique is realizing information Store.Information can be computer-readable instruction, data structure, program
Module or other data.The example of the storage medium of computer include, but are not limited to phase transition internal memory (PRAM),
Static RAM (SRAM), dynamic random access memory (DRAM), it is other kinds of with
Machine access memory (RAM), read-only storage (ROM), Electrically Erasable Read Only Memory
(EEPROM), fast flash memory bank or other memory techniques, read-only optical disc read-only storage (CD-ROM),
Digital versatile disc (DVD) or other optical storages, magnetic cassette tape, tape magnetic rigid disk is stored or it
His magnetic storage apparatus or any other non-transmission medium, can be used to store the letter that can be accessed by a computing device
Breath.Define according to herein, computer-readable medium does not include temporary computer readable media (transitory
Media), such as the data-signal and carrier wave of modulation.
Also, it should be noted that term " including ", "comprising" or its any other variant are intended to non-row
His property is included, so that a series of process, method, commodity or equipment including key elements not only includes
Those key elements, but also including other key elements being not expressly set out, or also include for this process,
The intrinsic key element of method, commodity or equipment.In the absence of more restrictions, by sentence " including
One ... " key element that limits, it is not excluded that including the process of the key element, method, commodity or setting
Also there is other identical element in standby.
It will be understood by those skilled in the art that embodiments herein can be provided as method, system or computer journey
Sequence product.Therefore, the application can using complete hardware embodiment, complete software embodiment or with reference to software and
The form of the embodiment of hardware aspect.And, the application can be adopted and wherein include calculating at one or more
Machine usable program code computer-usable storage medium (including but not limited to magnetic disc store, CD-ROM,
Optical memory etc.) on implement computer program form.
Embodiments herein is the foregoing is only, the application is not limited to.For this area skill
For art personnel, the application can have various modifications and variations.All institutes within spirit herein and principle
Any modification, equivalent substitution and improvements of work etc., within the scope of should be included in claims hereof.
Claims (18)
1. a kind of method of information processing, it is characterised in that methods described includes:
Obtain the input information of user;
According to the input information, in default information standard list, the input information correspondence is determined
Each potential standard information;
According to the input information and each potential standard information, determine that the input information is believed with each potential standard
Similarity between breath;
According to the similarity between the input information and each potential standard information, in each potential standard information,
Determine ultimate criterion information;
The ultimate criterion information is processed.
2. the method for claim 1, it is characterised in that determining the input information correspondence
Each potential standard information before, methods described also includes:
The input information is adjusted according to default standard information form.
3. the method for claim 1, it is characterised in that according to the input information of user, pre-
If information standard list in, determine the corresponding each potential standard information of the input information, specifically include:
According to default search fractionation mode, the input information is split into corresponding search word;
According to the search word and the inverted index for pre-building, include in default information standard list
In each standard information, the corresponding each potential standard information of the input information is determined.
4. the method for claim 1, it is characterised in that determine that the input information is potential with each
Similarity between standard information, specifically includes:
For arbitrary potential standard information, determine and the input information is modified into the potential standard information institute
The minimal action number of times for needing;
According to the minimal action number of times, the phase between the input information and the potential standard information is determined
Like degree.
5. the method for claim 1, it is characterised in that potential with each according to the input information
Similarity between standard information, in each potential standard information, determines ultimate criterion information, concrete bag
Include:
The potential standard information of similarity highest is defined as into ultimate criterion information;Or
When the potential standard information of similarity highest has at least two, according to the potential standard for pre-building
Information priorities, by the potential standard information of highest priority ultimate criterion information is defined as.
6. the method as described in Claims 1 to 5 is arbitrary, it is characterised in that in each potential standard information,
Ultimate criterion information is determined, is specifically included:
In each potential standard information of the similarity more than predetermined threshold value, ultimate criterion information is determined.
7. the method for claim 1, it is characterised in that determining the input information correspondence
Each potential standard information before, methods described also includes:
Determine that the input information is non-standard information.
8. a kind of method that air control is processed, it is characterised in that methods described includes:
Air control system obtains the input information of user;
According to the input information, in default information standard list, the input information correspondence is determined
Each potential standard information;
According to the input information and each potential standard information, determine that the input information is believed with each potential standard
Similarity between breath;
According to the similarity between the input information and each potential standard information, in each potential standard information,
Determine ultimate criterion information;
Risk prevention system process is carried out to the ultimate criterion information.
9. method as claimed in claim 8, it is characterised in that the input information includes E-mail address
Information.
10. a kind of device of information processing, it is characterised in that described device includes:
Acquisition module, for obtaining the input information of user;
First determining module, for according to the input information, in default information standard list, it is determined that
Go out the corresponding each potential standard information of the input information;
Second determining module, for according to the input information and each potential standard information, determining the input
Similarity between information and each potential standard information;
3rd determining module, for according to the similarity between the input information and each potential standard information,
In each potential standard information, ultimate criterion information is determined;
Processing module, for processing the ultimate criterion information.
11. devices as claimed in claim 10, it is characterised in that described device also includes:
Adjusting module, for determining the corresponding each potential mark of the input information in first determining module
Before calibration information, the input information is adjusted according to default standard information form.
12. devices as claimed in claim 10, it is characterised in that first determining module is specifically used
In, fractionation mode is searched for according to default, the input information is split into corresponding search word, according to institute
Search word and the inverted index for pre-building are stated, in each standard information that default information standard list is included
In, determine the corresponding each potential standard information of the input information.
13. devices as claimed in claim 10, it is characterised in that second determining module is specifically used
In for arbitrary potential standard information, determining and for the input information to be modified to the potential standard information institute
The minimal action number of times for needing, according to the minimal action number of times, determines the input information and the potential mark
Similarity between calibration information.
14. devices as claimed in claim 10, it is characterised in that the 3rd determining module is specifically used
In, the potential standard information of similarity highest is defined as into ultimate criterion information, or when similarity highest is latent
When standard information has at least two, according to the potential standard information priority for pre-building, by priority
The potential standard information of highest is defined as ultimate criterion information.
15. devices as described in claim 10~14 is arbitrary, it is characterised in that the 3rd determining module
Specifically in each potential standard information of the similarity more than predetermined threshold value, determining ultimate criterion information.
16. devices as claimed in claim 10, it is characterised in that described device also includes:
4th determining module, for determining that the input information is corresponding each latent in first determining module
Before standard information, determine that the input information is non-standard information.
The device that a kind of 17. air controls are processed, it is characterised in that described device includes:
Acquisition module, for obtaining the input information of user;
First determining module, for according to the input information, in default information standard list, it is determined that
Go out the corresponding each potential standard information of the input information;
Second determining module, for according to the input information and each potential standard information, determining the input
Similarity between information and each potential standard information;
3rd determining module, for according to the similarity between the input information and each potential standard information,
In each potential standard information, ultimate criterion information is determined;
Air control module, for carrying out risk prevention system process to the ultimate criterion information.
18. devices as claimed in claim 17, it is characterised in that the input information includes electronics postal
Case information.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510763353.1A CN106681524A (en) | 2015-11-10 | 2015-11-10 | Method and device for processing information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510763353.1A CN106681524A (en) | 2015-11-10 | 2015-11-10 | Method and device for processing information |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106681524A true CN106681524A (en) | 2017-05-17 |
Family
ID=58865621
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510763353.1A Pending CN106681524A (en) | 2015-11-10 | 2015-11-10 | Method and device for processing information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106681524A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109118353A (en) * | 2018-07-20 | 2019-01-01 | 中国邮政储蓄银行股份有限公司 | The data processing method and device of air control model |
CN111930809A (en) * | 2020-09-17 | 2020-11-13 | 支付宝(杭州)信息技术有限公司 | Data processing method, device and equipment |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040148280A1 (en) * | 2002-12-30 | 2004-07-29 | Moriyuki Chimura | Management information processing method and keyword determination method |
CN102663016A (en) * | 2012-03-21 | 2012-09-12 | 上海汉翔信息技术有限公司 | System and method for implementing input information extension on input candidate box on electronic device |
CN103207878A (en) * | 2012-01-17 | 2013-07-17 | 阿里巴巴集团控股有限公司 | Inspection method and device of published information |
CN103237018A (en) * | 2013-03-29 | 2013-08-07 | 东莞宇龙通信科技有限公司 | Method, server and communication system for matching clients |
CN103412947A (en) * | 2013-08-26 | 2013-11-27 | 浙江大学 | Polygon search method for big space data |
CN103473373A (en) * | 2013-09-29 | 2013-12-25 | 方正国际软件有限公司 | Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method |
CN103489013A (en) * | 2013-09-18 | 2014-01-01 | 航天科工深圳(集团)有限公司 | Image recognition method for electrical equipment monitoring |
CN103530334A (en) * | 2013-09-29 | 2014-01-22 | 方正国际软件有限公司 | System and method for data matching based on comparison module |
CN104111977A (en) * | 2014-06-24 | 2014-10-22 | 小米科技有限责任公司 | Information matching method and device and terminal |
-
2015
- 2015-11-10 CN CN201510763353.1A patent/CN106681524A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040148280A1 (en) * | 2002-12-30 | 2004-07-29 | Moriyuki Chimura | Management information processing method and keyword determination method |
CN103207878A (en) * | 2012-01-17 | 2013-07-17 | 阿里巴巴集团控股有限公司 | Inspection method and device of published information |
CN102663016A (en) * | 2012-03-21 | 2012-09-12 | 上海汉翔信息技术有限公司 | System and method for implementing input information extension on input candidate box on electronic device |
CN103237018A (en) * | 2013-03-29 | 2013-08-07 | 东莞宇龙通信科技有限公司 | Method, server and communication system for matching clients |
CN103412947A (en) * | 2013-08-26 | 2013-11-27 | 浙江大学 | Polygon search method for big space data |
CN103489013A (en) * | 2013-09-18 | 2014-01-01 | 航天科工深圳(集团)有限公司 | Image recognition method for electrical equipment monitoring |
CN103473373A (en) * | 2013-09-29 | 2013-12-25 | 方正国际软件有限公司 | Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method |
CN103530334A (en) * | 2013-09-29 | 2014-01-22 | 方正国际软件有限公司 | System and method for data matching based on comparison module |
CN104111977A (en) * | 2014-06-24 | 2014-10-22 | 小米科技有限责任公司 | Information matching method and device and terminal |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109118353A (en) * | 2018-07-20 | 2019-01-01 | 中国邮政储蓄银行股份有限公司 | The data processing method and device of air control model |
CN109118353B (en) * | 2018-07-20 | 2022-03-15 | 中国邮政储蓄银行股份有限公司 | Data processing method and device of wind control model |
CN111930809A (en) * | 2020-09-17 | 2020-11-13 | 支付宝(杭州)信息技术有限公司 | Data processing method, device and equipment |
US11436252B2 (en) | 2020-09-17 | 2022-09-06 | Alipay (Hangzhou) Information Technology Co., Ltd. | Data processing methods, apparatuses, and devices |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US10257187B2 (en) | Prompting login account | |
US10915706B2 (en) | Sorting text report categories | |
EP2803031B1 (en) | Machine-learning based classification of user accounts based on email addresses and other account information | |
CN109255564B (en) | Pick-up point address recommendation method and device | |
US20120136812A1 (en) | Method and system for machine-learning based optimization and customization of document similarities calculation | |
US20140172415A1 (en) | Apparatus, system, and method of providing sentiment analysis result based on text | |
US20160117328A1 (en) | Influence score of a social media domain | |
CN114143216A (en) | System and method for network-based advertisement data traffic latency reduction | |
CN109388634B (en) | Address information processing method, terminal device and computer readable storage medium | |
WO2016101811A1 (en) | Information arrangement method and apparatus | |
WO2022134829A1 (en) | Method and apparatus for identifying same user, and computer device and storage medium | |
CN103544150B (en) | For browser of mobile terminal provides the method and system of recommendation information | |
JP6664585B2 (en) | Information processing apparatus, information processing method, and information processing program | |
WO2019056496A1 (en) | Method for generating picture review probability interval and method for picture review determination | |
CN106991090A (en) | The analysis method and device of public sentiment event entity | |
CN112650858A (en) | Method and device for acquiring emergency assistance information, computer equipment and medium | |
CN111861733B (en) | Fraud prevention and control system and method based on address fuzzy matching | |
CN106681524A (en) | Method and device for processing information | |
CN106469182A (en) | A kind of information recommendation method based on mapping relations and device | |
CN110674383B (en) | Public opinion query method, device and equipment | |
CN107220260A (en) | The method and device that a kind of page is shown | |
CN105677677A (en) | Information classification and device | |
CN113220949B (en) | Construction method and device of private data identification system | |
CN113656466B (en) | Policy data query method, device, equipment and storage medium | |
US10970341B2 (en) | Predictive modeling in event processing systems for big data processing in cloud |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170517 |