CN106681524A - Method and device for processing information - Google Patents

Info

Publication number
CN106681524A
Authority
CN
China
Prior art keywords
information
standard
potential
input
input information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510763353.1A
Other languages
Chinese (zh)
Inventor
钱宣统
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba Group Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd
Priority to CN201510763353.1A
Publication of CN106681524A
Legal status: Pending

Classifications

    • G - PHYSICS
    • G06 - COMPUTING; CALCULATING OR COUNTING
    • G06F - ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 - Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 - Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02 - Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023 - Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • G06F3/0233 - Character input methods
    • G06F3/0237 - Character input methods using prediction or retrieval techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a method and device for processing information. The method comprises: obtaining input information of a user; determining, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list; determining the similarity between the input information and each piece of potential standard information; determining final standard information among the pieces of potential standard information according to the determined similarity; and processing the final standard information. With this method, no correspondence between erroneous information and standard information needs to be established. Whatever kind of erroneous information the user enters, the final standard information corresponding to the erroneous input can be determined from the similarity between that input and the standard information and then processed, which effectively improves the accuracy of information processing.

Description

Method and device for information processing
Technical Field
The present application relates to the field of computer technology, and in particular to a method and device for information processing.
Background
With the continuous development of network technology, the Internet has become an indispensable part of people's lives. After receiving information provided by a user, an Internet service provider can use it to provide the user with various services, such as a download service. At the same time, in order to serve users better, the provider usually processes the information that users enter, for example by performing statistics, analysis, and other risk-control processing on it.
In practice, before a user's input information is processed, it must be checked to see whether it is standard information. If it is not, the input information must first be corrected into standard information, and the standard information is then processed.
In the prior art, input information is corrected into standard information in one of two ways. In the first way, a preset regular expression is used: for input information that does not conform to the standard information format, the format of the input information is corrected to the standard format. In the second way, for input information that does not conform to the standard information, the erroneous input is corrected to the corresponding standard information according to a pre-established correspondence between erroneous information and standard information.
The first way, however, can only correct the format of the user's input to the standard format; it cannot determine whether the content of the input matches the content of the standard information. For example, suppose the input is an e-mail address and the user mistypes "@163.com" as "@164.com". "@164.com" conforms to the standard format but not to the standard content, so subsequent processing based on "@164.com", such as risk-control processing, becomes less accurate.
For the second way, establishing the correspondence in advance requires first collecting historical erroneous information and then assigning the corresponding standard information to each historical error, for example mapping the erroneous "@164.com" to the standard "@163.com". In practice, however, erroneous information takes many forms and cannot be predicted, so it is difficult to collect every possible error. For example, it cannot be predicted whether a user will mistype the standard "@163.com" as "@165.com", "@166.com", and so on. Such errors then cannot be corrected into standard information, which again reduces the accuracy of subsequent processing, such as risk-control processing, based on the input information. Moreover, the second way requires newly observed errors to be added to the correspondence frequently, which increases the cost of information processing.
Summary of the Invention
The embodiments of the present application provide a method and device for information processing, to solve the prior-art problem that processing based on input information has low accuracy.
An information processing method provided by an embodiment of the present application includes:
obtaining input information of a user;
determining, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list;
determining, according to the input information and each piece of potential standard information, the similarity between the input information and each piece of potential standard information;
determining final standard information among the pieces of potential standard information according to the similarity between the input information and each piece of potential standard information; and
processing the final standard information.
A risk-control processing method provided by an embodiment of the present application includes:
obtaining, by a risk-control system, input information of a user;
determining, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list;
determining, according to the input information and each piece of potential standard information, the similarity between the input information and each piece of potential standard information;
determining final standard information among the pieces of potential standard information according to the similarity between the input information and each piece of potential standard information; and
performing risk prevention and control processing on the final standard information.
An information processing device provided by an embodiment of the present application includes:
an acquisition module, configured to obtain input information of a user;
a first determining module, configured to determine, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list;
a second determining module, configured to determine, according to the input information and each piece of potential standard information, the similarity between the input information and each piece of potential standard information;
a third determining module, configured to determine final standard information among the pieces of potential standard information according to the similarity between the input information and each piece of potential standard information; and
a processing module, configured to process the final standard information.
A risk-control processing device provided by an embodiment of the present application includes:
an acquisition module, configured to obtain input information of a user;
a first determining module, configured to determine, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list;
a second determining module, configured to determine, according to the input information and each piece of potential standard information, the similarity between the input information and each piece of potential standard information;
a third determining module, configured to determine final standard information among the pieces of potential standard information according to the similarity between the input information and each piece of potential standard information; and
a risk-control module, configured to perform risk prevention and control processing on the final standard information.
The embodiments of the present application provide a method and device for information processing. The method obtains input information of a user; determines, according to the input information, each piece of potential standard information corresponding to the input information in a preset information standard list; determines the similarity between the input information and each piece of potential standard information; determines final standard information among the pieces of potential standard information according to the determined similarity; and processes the final standard information. With this method, no correspondence between erroneous information and standard information needs to be established. Whatever kind of error the user's input contains, the final standard information corresponding to the erroneous input can be determined from its similarity to the standard information and then processed, which effectively improves the accuracy of information processing.
Brief Description of the Drawings
The drawings described here are provided for a further understanding of the present application and form a part of it. The illustrative embodiments of the application and their descriptions are used to explain the application and do not unduly limit it. In the drawings:
Fig. 1 shows the information processing procedure provided by an embodiment of the present application;
Fig. 2 shows the risk-control processing procedure provided by an embodiment of the present application;
Fig. 3 is a schematic structural diagram of the information processing device provided by an embodiment of the present application;
Fig. 4 is a schematic structural diagram of the risk-control processing device provided by an embodiment of the present application.
Detailed Description
To make the purpose, technical solutions, and advantages of the present application clearer, the technical solutions of the application are described clearly and completely below with reference to specific embodiments and the corresponding drawings. The described embodiments are obviously only some, not all, of the embodiments of the application. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the scope of protection of the application.
Fig. 1 shows the information processing procedure provided by an embodiment of the present application, which specifically includes the following steps:
S101: Obtain the input information of a user.
In practice, a user usually enters information in order to use the services that a server provides. For example, a user enters an e-mail address to register an account and then uses it to buy the goods and services offered by a shopping website. The server, in turn, needs to process the user's input information, for example by using it as the account name when logging the user in. In the embodiments of the present application, therefore, the server first needs to obtain the user's input information.
Specifically, the server may receive the user's input information sent by a terminal and treat it as the obtained input information. It may also extract part of that information according to a preset rule and treat the extracted part as the obtained input information.
For example, suppose a user wants to log in with the e-mail address 37927397@hotmail.co.uk but mistypes it on the terminal as 37927397@hoymail.couk. After receiving 37927397@hoymail.couk from the terminal, the server may, according to a preset rule, extract "@" and all the characters after it, and take the result, @hoymail.couk, as the obtained input information.
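The extraction rule in this example can be sketched in a few lines of Python. The function name `extract_suffix` is hypothetical and not part of the application; the sketch assumes the preset rule is simply "keep the first @ and everything after it".

```python
def extract_suffix(address: str) -> str:
    """Return the first "@" and every character after it.

    Hypothetical helper: the application only says that a preset rule
    extracts "@" and the characters that follow it.
    """
    at = address.find("@")
    if at == -1:
        raise ValueError(f"no '@' found in {address!r}")
    return address[at:]


# The mistyped address from the example above.
print(extract_suffix("37927397@hoymail.couk"))
```

The part before "@" is kept aside by the caller so that it can be recombined with the corrected suffix in step S105.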
S102: According to the input information, determine each piece of potential standard information corresponding to the input information in a preset information standard list.
In the embodiments of the present application, the information standard list may be generated in advance by the server from a large amount of historical correct input information. For example, historical correct inputs such as "@163.com" and "@hotmail.co.uk" may be added to the information standard list in advance.
To determine the potential standard information corresponding to the input information, the input information may be split into search terms according to a preset splitting rule. Then, using those search terms and a pre-built inverted index, the potential standard information corresponding to the input information is determined from among the standard information contained in the preset information standard list.
Continuing the example above, suppose the preset splitting rule is: the part before "." is one search term and the part after "." is another. The server then splits the obtained input information "@hoymail.couk" into the search terms "hoymail" and "couk" and, using these two terms and the pre-built inverted index, performs a fuzzy search over the standard information contained in the information standard list to find the potential standard information.
Suppose the information standard list contains @163.com, @sohu.com, @hotmail.co.uk, @hormail.co.uk, and @htmail.co.uk. From the search terms "hoymail" and "couk", the potential standard information corresponding to the input "@hoymail.couk" is determined to be @hotmail.co.uk, @hormail.co.uk, and @htmail.co.uk.
The inverted index may be built as follows: in advance, each piece of standard information in the information standard list is split into search terms using the same splitting rule, and the inverted index is built from each search term and the standard information that contains it.
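A minimal sketch of step S102 follows. The application does not specify the fuzzy-search algorithm, so this sketch substitutes Python's `difflib.get_close_matches` for it; the split at the first ".", the 0.6 cutoff, the requirement that every search term find a match, and all function names are assumptions made for illustration.

```python
import difflib
from collections import defaultdict

STANDARD_LIST = ["@163.com", "@sohu.com", "@hotmail.co.uk",
                 "@hormail.co.uk", "@htmail.co.uk"]


def split_terms(info):
    # Assumed reading of the splitting rule: drop the leading "@",
    # then split once at the first ".".
    before, _, after = info.lstrip("@").partition(".")
    return [term for term in (before, after) if term]


def build_index(standards):
    # Inverted index: search term -> set of standard entries containing it.
    index = defaultdict(set)
    for standard in standards:
        for term in split_terms(standard):
            index[term].add(standard)
    return index


def find_candidates(info, index, cutoff=0.6):
    # Fuzzy-match each search term against the indexed terms and keep the
    # standard entries that are hit by every term (an assumption; the
    # application does not say how hits for several terms are combined).
    candidates = None
    for term in split_terms(info):
        hits = set()
        close = difflib.get_close_matches(
            term, list(index), n=max(1, len(index)), cutoff=cutoff)
        for matched_term in close:
            hits |= index[matched_term]
        candidates = hits if candidates is None else candidates & hits
    return candidates or set()


index = build_index(STANDARD_LIST)
print(sorted(find_candidates("@hoymail.couk", index)))
```

Building the index once and reusing it for every lookup is what keeps the candidate search cheap; only the few candidates it returns need the more expensive similarity calculation in step S103.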
S103: According to the input information and each piece of potential standard information, determine the similarity between the input information and each piece of potential standard information.
In practice, input that a user has mistyped does not differ greatly, character by character, from the standard information. The application can therefore calculate the similarity between the input information and each piece of potential standard information to determine which piece the user actually meant to enter.
The similarity between the input information and a piece of potential standard information characterizes how alike the two are. The greater the similarity, the more alike they are, and the more likely it is that this potential standard information is what the user actually meant to enter; the smaller the similarity, the less alike they are, and the less likely that is.
Therefore, after determining the potential standard information corresponding to the input information in step S102, the server can calculate the similarity between the input information and each piece of potential standard information. Specifically, for any piece of potential standard information, the server determines the minimum number of operations (that is, the edit distance) needed to change the input information into that potential standard information, and derives the similarity between the two from it. The larger the minimum number of operations, the smaller the similarity; the smaller the minimum number of operations, the larger the similarity.
Note that the above is only an example of how the similarity between the input information and each piece of potential standard information may be determined, and it is not the only method. For example, the similarity may also be determined by calculating a Euclidean distance, as long as what the calculation expresses in practice is the degree of similarity between the input information and each piece of potential standard information.
Continuing the example, the server determines that the minimum number of operations needed to change "@hoymail.couk" into "@hotmail.co.uk" is 2: change "y" to "t" and insert a "." between "o" and "u". From this it determines that the similarity between "@hoymail.couk" and "@hotmail.co.uk" is 3.432. Similarly, the minimum number of operations needed to change "@hoymail.couk" into "@hormail.co.uk" is 2, giving a similarity of 3.432; and the minimum number needed to change "@hoymail.couk" into "@htmail.co.uk" is 3, giving a similarity of 3.099.
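The minimum number of operations in this example is the Levenshtein edit distance, which can be computed with the standard dynamic-programming recurrence sketched below. The application does not state how the scores 3.432 and 3.099 are derived from the distance, so the `similarity` function here uses 1 / (1 + distance) purely as an assumed stand-in; it preserves the ordering (fewer operations, higher score) but does not reproduce those figures.

```python
def edit_distance(source, target):
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    previous = list(range(len(target) + 1))
    for i, s_char in enumerate(source, start=1):
        current = [i]
        for j, t_char in enumerate(target, start=1):
            current.append(min(
                previous[j] + 1,                       # delete s_char
                current[j - 1] + 1,                    # insert t_char
                previous[j - 1] + (s_char != t_char),  # substitute or keep
            ))
        previous = current
    return previous[-1]


def similarity(source, target):
    # Assumed formula, not the one used in the application.
    return 1.0 / (1.0 + edit_distance(source, target))


for candidate in ("@hotmail.co.uk", "@hormail.co.uk", "@htmail.co.uk"):
    print(candidate, edit_distance("@hoymail.couk", candidate))
```

Any monotonically decreasing function of the distance would serve the same purpose in step S104, since only the ranking of the candidates matters there.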
S104: According to the similarity between the input information and each piece of potential standard information, determine final standard information among the pieces of potential standard information.
Because the similarity characterizes how alike the input information and a piece of potential standard information are (the greater the similarity, the more alike they are), the potential standard information can be sorted by similarity to the input information from high to low, and the piece with the highest similarity is determined to be the final standard information. The final standard information is the information the user actually meant to enter.
In practice, at least two pieces of potential standard information may share the highest similarity. In that case, the piece with the highest priority, according to preset priorities of the potential standard information, can be determined to be the final standard information.
Continuing the example, the server determines that the potential standard information with the highest similarity is @hotmail.co.uk and @hormail.co.uk, both with a similarity of 3.432. Suppose the preset priorities, from high to low, are @163.com, @sohu.com, @hotmail.co.uk, @hormail.co.uk, @htmail.co.uk. According to these priorities, "@hotmail.co.uk" is determined to be the final standard information.
The priorities may be preset as follows: count how many times each piece of standard information in the information standard list has appeared historically; the larger the count, the higher the priority, and the smaller the count, the lower the priority.
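The selection rule of step S104, including the priority tie-break, can be sketched as follows. The similarity scores are the ones quoted in the example; the function names and the sample history used to derive a priority order are hypothetical.

```python
from collections import Counter


def priority_from_history(history):
    # More historical occurrences means higher priority (earlier in the list).
    return [entry for entry, _ in Counter(history).most_common()]


def pick_final(similarities, priority):
    """Return the most similar candidate, breaking ties by priority order.

    similarities: dict mapping candidate -> similarity score
    priority: list of standard entries, highest priority first
    """
    best_score = max(similarities.values())
    tied = [c for c, score in similarities.items() if score == best_score]
    rank = {entry: position for position, entry in enumerate(priority)}
    # Candidates missing from the priority list sort last.
    return min(tied, key=lambda c: rank.get(c, len(rank)))


scores = {"@hotmail.co.uk": 3.432, "@hormail.co.uk": 3.432, "@htmail.co.uk": 3.099}
priority = ["@163.com", "@sohu.com", "@hotmail.co.uk", "@hormail.co.uk", "@htmail.co.uk"]
print(pick_final(scores, priority))
```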
S105: Process the final standard information.
Continuing the example, after determining that "@hotmail.co.uk" is the final standard information (that is, the correct suffix of the user's e-mail address is "@hotmail.co.uk"), the server combines "37927397" and "@hotmail.co.uk" into "37927397@hotmail.co.uk", performs the login, and provides the user with subsequent services.
With this method, no correspondence between erroneous information and standard information needs to be established. Whatever kind of error the user's input contains, the final standard information corresponding to the erroneous input can be determined from its similarity to the standard information and then processed, which effectively improves the accuracy of information processing.
Note that the description above takes as its example a server that processes information by the method shown in Fig. 1. The method shown in Fig. 1 can, of course, also be carried out by a terminal. In that case, in step S101 the terminal directly receives the user's input information. In the subsequent steps, the terminal determines the potential standard information corresponding to the input information in an information standard list stored locally on the terminal in advance, determines the similarity between the input information and each piece of potential standard information, determines the final standard information according to the similarity, and finally processes the final standard information, for example by sending it to the server.
In practice, whether input information is standard information depends mainly on its format and its content. That is, input information may fail to be standard information because only its format is wrong, because only its content is wrong, or because both are wrong. If the format of the input information is wrong, then determining the minimum number of operations needed to change the input information into a piece of potential standard information necessarily includes the operations that fix the format. The minimum number of operations therefore increases, and the similarity finally determined decreases. For this reason, in the embodiments of the present application, after the user's input information is obtained, it may first be adjusted according to a preset standard information format.
Note that the application may use a regular expression to preset the standard information format. For example, the preset regular expression may be: @\w+([-.]\w+)*\.\w+([-.]\w+)*
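As a rough illustration, the format check can be written with Python's `re` module. This sketch assumes the expression is meant to be read with its backslashes in place, that is, as `@\w+([-.]\w+)*\.\w+([-.]\w+)*`. The application does not describe how a non-conforming input is adjusted, so the hypothetical helper below only reports whether an input already conforms.

```python
import re

# Assumed reading of the preset expression, with backslashes restored.
SUFFIX_FORMAT = re.compile(r"@\w+([-.]\w+)*\.\w+([-.]\w+)*")


def has_standard_format(suffix):
    """True if the whole suffix matches the preset format."""
    return SUFFIX_FORMAT.fullmatch(suffix) is not None


for suffix in ("@hotmail.co.uk", "@164.com", "@hoymail", "hotmail.co.uk"):
    print(suffix, has_standard_format(suffix))
```

Note that "@164.com" passes this check even though its content is wrong, which is exactly the limitation of format-only correction described in the background section.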
In addition, in practice a user may deliberately enter wrong information, that is, information that has nothing to do with any standard information. Steps S102 to S104 would nevertheless still produce some final standard information, and processing it would reduce the accuracy of subsequent processing. A similarity threshold can therefore be preset. If the similarity of the potential standard information with the highest similarity is greater than or equal to the preset threshold, that potential standard information is determined to be the final standard information. If it is below the threshold, that potential standard information is discarded, that is, no further processing is performed on it.
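The threshold rule can be sketched as a small guard around the selection step. The threshold value 3.2 and the function name are hypothetical; the application does not give a concrete threshold.

```python
def pick_with_threshold(similarities, threshold):
    """Return the most similar candidate, or None if it scores below threshold.

    similarities: dict mapping candidate -> similarity score
    """
    if not similarities:
        return None
    best = max(similarities, key=similarities.get)
    return best if similarities[best] >= threshold else None


THRESHOLD = 3.2  # hypothetical value chosen only for this illustration

print(pick_with_threshold({"@hotmail.co.uk": 3.432}, THRESHOLD))
print(pick_with_threshold({"@htmail.co.uk": 3.099}, THRESHOLD))
```

Returning `None` lets the caller skip further processing for inputs that resemble nothing in the information standard list.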
Finally, note that everything above assumes that the obtained input information is erroneous, that is, not standard information. In practice, the obtained input information may itself be standard information. In that case it is processed directly, without steps S102 to S104, because passing standard information through steps S102 to S104 as well would reduce the efficiency of information processing.
The information processing method provided by the application is described in detail below, taking risk-control processing of information as an example.
Fig. 2 shows the risk-control processing procedure provided by the application, which specifically includes the following steps:
S201: Obtain the input information of a user.
In the embodiments of the present application, the risk-control system receives the user's input information sent by a terminal, treats it as the obtained input information, and processes it accordingly. The input information may be an e-mail address, or other information with a fixed format and fixed content, such as an Internet address. Because the e-mail address is an important risk-control dimension for the risk-control system, the rest of the application uses e-mail addresses as the example.
S202: According to the input information, determine each piece of potential standard information corresponding to the input information in a preset information standard list.
In the embodiments of the present application, after receiving the user's e-mail address (that is, the input information), the risk-control system may also use the inverted index built in step S102 to determine the potential standard e-mail information corresponding to the e-mail address: it splits the e-mail address into search terms according to the preset splitting rule and determines the potential standard e-mail information from those search terms and the inverted index.
S203: According to the input information and each piece of potential standard information, determine the similarity between the input information and each piece of potential standard information.
Further, after determining the potential standard e-mail information, the risk-control system determines, for any piece of potential standard e-mail information, the minimum number of operations (that is, the edit distance) needed to change the e-mail address into that potential standard e-mail information, and derives from it the similarity between the two.
S204: According to the similarity between the input information and each piece of potential standard information, determine final standard information among the pieces of potential standard information.
In the embodiments of the present application, the risk-control system identifies, among the similarities determined for the potential standard e-mail information, the piece with the highest similarity and determines it to be the final standard information.
In practice, if at least two pieces of potential standard e-mail information share the highest similarity, the piece with the highest priority, according to preset priorities of the potential standard e-mail information, is determined to be the final standard e-mail information.
S205:Risk prevention system process is carried out to the ultimate criterion information.
Finally, in this application, air control system is after the final E-mail address standard information determined, subsequently Statistics and analysis can be carried out to E-mail address standard information, and according to the result of statistics and analysis, makes corresponding Measure, e.g., statistics and analysis are carried out by the E-mail address information being input into a large number of users, which is determined A little E-mail addresses are the malice mailboxes that high-volume is generated, subsequently the E-mail address determined can be tracked or Control.
For example, it is assumed that user's first is done shopping on certain commodity website, in primary transactional operation, use The E-mail address information of family first input is " abx@164.com ", in secondary transactional operation, user The E-mail address information of first input is " abx@163.com ", and the air control system of the follow-up commodity website is obtained To the E-mail address information " abx@164.com " and " abx@163.com " of the input of user's first, if air control The E-mail address information of system of users first input not by the process of above-mentioned step S201~S204, then wind E-mail address information is " abx 164.com " corresponding Transaction Information and electronics postal in rear extended meeting by control system Case carries out respectively statistics and analysis for " abx@163.com " corresponding Transaction Information, and actually the two The corresponding Transaction Information in E-mail address is all user's first, so as to reduce the accuracy of ultimate risk process.
With the processing of steps S201 to S204, however, the risk control system determines that the final standard information of "abx@164.com" is "abx@163.com". The "abx@163.com" input by User A is itself standard information and therefore does not need to go through steps S201 to S204. The risk control system then counts together the transaction information whose e-mail address information is "abx@163.com", performs the corresponding data analysis, and carries out the corresponding risk processing according to the analysis result.
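The effect of this example on the later statistics can be illustrated with a short sketch. The function `aggregate_by_standard`, the `(email, transaction)` tuple layout, and the order identifiers are hypothetical and serve only to show how records keyed by a mistyped address end up grouped under the final standard information.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def aggregate_by_standard(
    transactions: List[Tuple[str, str]],
    to_standard: Dict[str, str],
) -> Dict[str, List[str]]:
    """Group transaction records by the final standard e-mail address.

    to_standard maps a non-standard input to its final standard information
    (the result of steps S201 to S204); inputs that are already standard
    are absent from the mapping and are used unchanged.
    """
    grouped: Dict[str, List[str]] = defaultdict(list)
    for email, record in transactions:
        grouped[to_standard.get(email, email)].append(record)
    return dict(grouped)


# Hypothetical data mirroring the User A example.
transactions = [("abx@164.com", "order-1"), ("abx@163.com", "order-2")]
to_standard = {"abx@164.com": "abx@163.com"}
grouped = aggregate_by_standard(transactions, to_standard)
```

Without the mapping, the two orders would be counted under two different addresses; with it, both fall under "abx@163.com".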
The above describes the information processing method and the risk control processing method provided by the embodiments of this application. Based on the same idea, the embodiments of this application also provide an information processing device and a risk control processing device, as shown in Fig. 3 and Fig. 4.
Fig. 3 is a schematic structural diagram of the information processing device provided by an embodiment of this application. The device includes:
an acquisition module 301, configured to obtain the input information of a user;
a first determining module 302, configured to determine, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
a second determining module 303, configured to determine, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
a third determining module 304, configured to determine the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
a processing module 305, configured to process the final standard information.
The device further includes:
an adjusting module 306, configured to adjust the input information according to a preset standard information format before the first determining module 302 determines each item of potential standard information corresponding to the input information.
The first determining module 302 is specifically configured to split the input information into corresponding search terms according to a preset search-splitting scheme, and to determine, according to the search terms and a pre-built inverted index, each item of potential standard information corresponding to the input information among the items of standard information contained in the preset information standard list.
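The lookup performed by the first determining module can be sketched as below. The source does not specify the search-splitting scheme, so splitting into character bigrams is an assumption, as are the function names and the sample list of mailbox domains used as standard information.

```python
from collections import defaultdict
from typing import Dict, List, Set


def split_terms(text: str, n: int = 2) -> Set[str]:
    """Split text into character n-grams (assumed splitting scheme)."""
    if len(text) < n:
        return {text} if text else set()
    return {text[i:i + n] for i in range(len(text) - n + 1)}


def build_inverted_index(standards: List[str]) -> Dict[str, Set[str]]:
    """Map every search term to the standard entries that contain it."""
    index: Dict[str, Set[str]] = defaultdict(set)
    for standard in standards:
        for term in split_terms(standard):
            index[term].add(standard)
    return index


def potential_standards(user_input: str, index: Dict[str, Set[str]]) -> Set[str]:
    """Collect every standard entry sharing at least one term with the input."""
    found: Set[str] = set()
    for term in split_terms(user_input):
        found |= index.get(term, set())
    return found


# Hypothetical standard list; "example.org" shares no bigram with "164.com".
index = build_inverted_index(["163.com", "126.com", "qq.com", "example.org"])
candidates = potential_standards("164.com", index)
```

The index only narrows the standard list to entries that overlap with the input; ranking the remaining candidates is left to the similarity step.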
The second determining module 303 is specifically configured to determine, for any item of potential standard information, the minimum number of operations required to modify the input information into that potential standard information, and to determine the similarity between the input information and that potential standard information according to the minimum number of operations.
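One common way to compute such a minimum number of operations is the Levenshtein edit distance, sketched below. Treating single-character insertion, deletion, and substitution as the allowed operations is an assumption, and so is the normalization `1 - distance / max_length` used to turn the count into a similarity; the source does not fix either choice.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-character insertions, deletions, or
    substitutions needed to turn a into b (Levenshtein distance)."""
    prev = list(range(len(b) + 1))  # distances from "" to each prefix of b
    for i, ca in enumerate(a, 1):
        cur = [i]  # distance from a[:i] to ""
        for j, cb in enumerate(b, 1):
            cur.append(min(
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
                prev[j - 1] + (ca != cb),  # substitute, or keep if equal
            ))
        prev = cur
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Assumed normalization: 1.0 for identical strings, lower for more edits."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest
```

Under these assumptions, "abx@164.com" and "abx@163.com" differ by one substitution, so fewer operations yield a higher similarity.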
The third determining module 304 is specifically configured to determine the potential standard information with the highest similarity as the final standard information, or, when at least two items of potential standard information share the highest similarity, to determine the potential standard information with the highest priority as the final standard information according to a pre-established priority of the potential standard information.
The third determining module 304 is specifically configured to determine the final standard information among the items of potential standard information whose similarity is greater than a preset threshold.
The device further includes:
a fourth determining module 307, configured to determine that the input information is non-standard information before the first determining module 302 determines each item of potential standard information corresponding to the input information.
Fig. 4 is a schematic structural diagram of the risk control processing device provided by an embodiment of this application. The device includes:
an acquisition module 401, configured to obtain the input information of a user;
a first determining module 402, configured to determine, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
a second determining module 403, configured to determine, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
a third determining module 404, configured to determine the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
a risk control module 405, configured to perform risk prevention and control processing on the final standard information.
The input information includes e-mail address information.
In a typical configuration, a computing device includes one or more processors (CPUs), input/output interfaces, network interfaces, and memory.
Memory may include volatile memory in computer-readable media, in forms such as random access memory (RAM), and/or non-volatile memory, such as read-only memory (ROM) or flash memory (flash RAM). Memory is an example of a computer-readable medium.
Computer-readable media include permanent and non-permanent, removable and non-removable media, which can store information by any method or technology. The information may be computer-readable instructions, data structures, program modules, or other data. Examples of computer storage media include, but are not limited to, phase-change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technologies, compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, magnetic tape or magnetic disk storage or other magnetic storage devices, or any other non-transmission medium that can be used to store information accessible by a computing device. As defined herein, computer-readable media do not include transitory computer-readable media (transitory media), such as modulated data signals and carrier waves.
It should also be noted that the terms "include", "comprise", or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, commodity, or device that includes a series of elements includes not only those elements but also other elements not expressly listed, or elements inherent to such a process, method, commodity, or device. Without further limitation, an element defined by the phrase "including a ..." does not exclude the existence of other identical elements in the process, method, commodity, or device that includes the element.
Those skilled in the art will understand that the embodiments of this application may be provided as a method, a system, or a computer program product. Therefore, this application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, this application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM, optical storage, and the like) containing computer-usable program code.
The above are merely embodiments of this application and are not intended to limit this application. For those skilled in the art, this application may have various modifications and variations. Any modification, equivalent replacement, or improvement made within the spirit and principle of this application shall fall within the scope of the claims of this application.

Claims (18)

1. An information processing method, characterized in that the method comprises:
obtaining the input information of a user;
determining, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
determining, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
determining the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
processing the final standard information.
2. The method according to claim 1, characterized in that, before each item of potential standard information corresponding to the input information is determined, the method further comprises:
adjusting the input information according to a preset standard information format.
3. The method according to claim 1, characterized in that determining, according to the input information of the user, each item of potential standard information corresponding to the input information in the preset information standard list specifically comprises:
splitting the input information into corresponding search terms according to a preset search-splitting scheme;
determining, according to the search terms and a pre-built inverted index, each item of potential standard information corresponding to the input information among the items of standard information contained in the preset information standard list.
4. The method according to claim 1, characterized in that determining the similarity between the input information and each item of potential standard information specifically comprises:
for any item of potential standard information, determining the minimum number of operations required to modify the input information into that potential standard information;
determining the similarity between the input information and that potential standard information according to the minimum number of operations.
5. The method according to claim 1, characterized in that determining the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information specifically comprises:
determining the potential standard information with the highest similarity as the final standard information; or
when at least two items of potential standard information share the highest similarity, determining the potential standard information with the highest priority as the final standard information according to a pre-established priority of the potential standard information.
6. The method according to any one of claims 1 to 5, characterized in that determining the final standard information among the items of potential standard information specifically comprises:
determining the final standard information among the items of potential standard information whose similarity is greater than a preset threshold.
7. The method according to claim 1, characterized in that, before each item of potential standard information corresponding to the input information is determined, the method further comprises:
determining that the input information is non-standard information.
8. A risk control processing method, characterized in that the method comprises:
obtaining, by a risk control system, the input information of a user;
determining, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
determining, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
determining the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
performing risk prevention and control processing on the final standard information.
9. The method according to claim 8, characterized in that the input information includes e-mail address information.
10. An information processing device, characterized in that the device comprises:
an acquisition module, configured to obtain the input information of a user;
a first determining module, configured to determine, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
a second determining module, configured to determine, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
a third determining module, configured to determine the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
a processing module, configured to process the final standard information.
11. The device according to claim 10, characterized in that the device further comprises:
an adjusting module, configured to adjust the input information according to a preset standard information format before the first determining module determines each item of potential standard information corresponding to the input information.
12. The device according to claim 10, characterized in that the first determining module is specifically configured to split the input information into corresponding search terms according to a preset search-splitting scheme, and to determine, according to the search terms and a pre-built inverted index, each item of potential standard information corresponding to the input information among the items of standard information contained in the preset information standard list.
13. The device according to claim 10, characterized in that the second determining module is specifically configured to determine, for any item of potential standard information, the minimum number of operations required to modify the input information into that potential standard information, and to determine the similarity between the input information and that potential standard information according to the minimum number of operations.
14. The device according to claim 10, characterized in that the third determining module is specifically configured to determine the potential standard information with the highest similarity as the final standard information, or, when at least two items of potential standard information share the highest similarity, to determine the potential standard information with the highest priority as the final standard information according to a pre-established priority of the potential standard information.
15. The device according to any one of claims 10 to 14, characterized in that the third determining module is specifically configured to determine the final standard information among the items of potential standard information whose similarity is greater than a preset threshold.
16. The device according to claim 10, characterized in that the device further comprises:
a fourth determining module, configured to determine that the input information is non-standard information before the first determining module determines each item of potential standard information corresponding to the input information.
17. A risk control processing device, characterized in that the device comprises:
an acquisition module, configured to obtain the input information of a user;
a first determining module, configured to determine, according to the input information, each item of potential standard information corresponding to the input information in a preset information standard list;
a second determining module, configured to determine, according to the input information and each item of potential standard information, the similarity between the input information and each item of potential standard information;
a third determining module, configured to determine the final standard information among the items of potential standard information according to the similarity between the input information and each item of potential standard information;
a risk control module, configured to perform risk prevention and control processing on the final standard information.
18. The device according to claim 17, characterized in that the input information includes e-mail address information.
CN201510763353.1A 2015-11-10 2015-11-10 Method and device for processing information Pending CN106681524A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510763353.1A CN106681524A (en) 2015-11-10 2015-11-10 Method and device for processing information

Publications (1)

Publication Number Publication Date
CN106681524A true CN106681524A (en) 2017-05-17

Family

ID=58865621

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510763353.1A Pending CN106681524A (en) 2015-11-10 2015-11-10 Method and device for processing information

Country Status (1)

Country Link
CN (1) CN106681524A (en)

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040148280A1 (en) * 2002-12-30 2004-07-29 Moriyuki Chimura Management information processing method and keyword determination method
CN102663016A (en) * 2012-03-21 2012-09-12 上海汉翔信息技术有限公司 System and method for implementing input information extension on input candidate box on electronic device
CN103207878A (en) * 2012-01-17 2013-07-17 阿里巴巴集团控股有限公司 Inspection method and device of published information
CN103237018A (en) * 2013-03-29 2013-08-07 东莞宇龙通信科技有限公司 Method, server and communication system for matching clients
CN103412947A (en) * 2013-08-26 2013-11-27 浙江大学 Polygon search method for big space data
CN103473373A (en) * 2013-09-29 2013-12-25 方正国际软件有限公司 Threshold matching model-based similarity analysis system and threshold matching model-based similarity analysis method
CN103489013A (en) * 2013-09-18 2014-01-01 航天科工深圳(集团)有限公司 Image recognition method for electrical equipment monitoring
CN103530334A (en) * 2013-09-29 2014-01-22 方正国际软件有限公司 System and method for data matching based on comparison module
CN104111977A (en) * 2014-06-24 2014-10-22 小米科技有限责任公司 Information matching method and device and terminal

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109118353A (en) * 2018-07-20 2019-01-01 中国邮政储蓄银行股份有限公司 The data processing method and device of air control model
CN109118353B (en) * 2018-07-20 2022-03-15 中国邮政储蓄银行股份有限公司 Data processing method and device of wind control model
CN111930809A (en) * 2020-09-17 2020-11-13 支付宝(杭州)信息技术有限公司 Data processing method, device and equipment
US11436252B2 (en) 2020-09-17 2022-09-06 Alipay (Hangzhou) Information Technology Co., Ltd. Data processing methods, apparatuses, and devices

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170517