CN105430654B - The recognition methods of the attaching information of number and device - Google Patents

The recognition methods of the attaching information of number and device Download PDF

Info

Publication number
CN105430654B
CN105430654B CN201510728723.8A CN201510728723A CN105430654B CN 105430654 B CN105430654 B CN 105430654B CN 201510728723 A CN201510728723 A CN 201510728723A CN 105430654 B CN105430654 B CN 105430654B
Authority
CN
China
Prior art keywords
short message
title
sender
sample
subsample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201510728723.8A
Other languages
Chinese (zh)
Other versions
CN105430654A (en
Inventor
汪平仄
张涛
陈志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiaomi Inc
Original Assignee
Xiaomi Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiaomi Inc filed Critical Xiaomi Inc
Priority to CN201510728723.8A priority Critical patent/CN105430654B/en
Publication of CN105430654A publication Critical patent/CN105430654A/en
Application granted granted Critical
Publication of CN105430654B publication Critical patent/CN105430654B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W12/00Security arrangements; Authentication; Protecting privacy or anonymity
    • H04W12/12Detection or prevention of fraud
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/12Messaging; Mailboxes; Announcements
    • H04W4/14Short messaging services, e.g. short message services [SMS] or unstructured supplementary service data [USSD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W4/00Services specially adapted for wireless communication networks; Facilities therefor
    • H04W4/18Information format or content conversion, e.g. adaptation by the network of the transmitted or received information for the purpose of wireless delivery to users or terminals
    • H04W4/185Information format or content conversion, e.g. adaptation by the network of the transmitted or received information for the purpose of wireless delivery to users or terminals by embedding added-value information into content, e.g. geo-tagging
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W8/00Network data management
    • H04W8/02Processing of mobility data, e.g. registration information at HLR [Home Location Register] or VLR [Visitor Location Register]; Transfer of mobility data, e.g. between HLR, VLR or external networks
    • H04W8/04Registration at HLR or HSS [Home Subscriber Server]

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The disclosure is directed to the recognition methods of the attaching information of number and devices, which comprises obtain sample short message collection;The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection;The title of the corresponding sample short message of same number of sender is merged;The attaching information of the number of sender is identified according to pooling information, is realized and is automatically identified the attaching information that sample short message concentrates number of sender, avoids waste of human resource caused by manual identified number, while improving recognition efficiency.

Description

The recognition methods of the attaching information of number and device
Technical field
This application involves the recognition methods of field of communication technology more particularly to the attaching information of number and devices.
Background technique
With the rapid development of mobile communication technology, terminal has become the necessary article of many modern's work and lifes, Terminal is brought while facilitate, and also brings hidden danger to people's lives.Such as receive fraudulent call and rubbish is short The undesirable communication information received of the users such as letter.
In the related technology, manually each telephone number is identified and is marked, establish telephone number and attaching information Corresponding relationship.In the telephone number for receiving caller or when for the dial instruction of telephone number, identified by corresponding relationship The corresponding attaching information of telephone number out, and reminded according to attaching information.But it due to number diversification, manually needs to identify Number it is relatively more, expend a large amount of human resources, and identify the low efficiency of the corresponding attaching information of telephone number.
Summary of the invention
To overcome the problems in correlation technique, present disclose provides the recognition methods of the attaching information of number and dresses It sets.
According to the first aspect of the embodiments of the present disclosure, a kind of recognition methods of the attaching information of number, the method are provided It include: to obtain sample short message collection;
The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection;
The title of the corresponding sample short message of same number of sender is merged;
The attaching information of the number of sender is identified according to pooling information.
Optionally, the acquisition sample short message collection includes:
Obtain the history short message in preset time period;
The sender number of the history short message is identified;
It is that the history short message of class note number is notified to be determined as sample short message by described sender number, obtains sample short message Collection.
Optionally, described that the mark for being used for identification number attaching information is extracted from the sample short message of the sample short message collection Topic, comprising:
When in the sample short message of the sample short message collection including special symbol group, extracted from the sample short message specific Information between set of symbols determines the title of the sample short message according to the information of extraction;
When not including the special symbol group in the sample short message of the sample short message collection, by the mark of the sample short message Topic is determined as sky information.
Optionally, the method also includes:
The corresponding sender number of sample short message comprising the special symbol group is concentrated to be determined as the sample short message Number of sender;
The corresponding subsample short message collection of each number of sender is filtered out respectively from sample short message concentration, it is described It includes number of sender, short message receiver number and title that subsample short message, which concentrates each sample short message,.
Optionally, the title by the corresponding sample short message of same number of sender merges, comprising:
It calculates subsample short message and concentrates the corresponding short message receiver number number of each title, obtain subsample short message and merge Collection, it includes number of sender, title, short message receiver number that the subsample short message, which merges each sample short message of concentration, Number, the subsample short message collection includes the corresponding sample short message of same number of sender.
Optionally, the attaching information that the number of sender is identified according to pooling information, comprising:
Subsample short message, which is calculated, using following formula merges the probability for concentrating each title to concentrate in the merging of subsample short message Value:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate Subsample short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message is closed Title title in unionkCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;
The title that the probability value is greater than probability threshold value is determined as to the attaching information of the number of sender.
Optionally, described to calculate the subsample short message each title of merging concentration in the merging of subsample short message using following formula Before the probability value of concentration, further includes:
Judge that the subsample short message merges and concentrates whether the corresponding short message receiver number number of title is less than number threshold Value;
The corresponding title of short message receiver number number for being less than number threshold value is deleted.
Optionally, the method also includes:
The incidence relation of number of sender and attaching information is determined according to the attaching information of each number of sender;
Destination number to be identified is identified according to the incidence relation of described sender number and attaching information, is determined The attaching information of the destination number, the destination number includes calling party's number to be transferred to, the received number of callee, short Believe sender's received number of number or short message receiver to be sent.
According to the second aspect of an embodiment of the present disclosure, a kind of identification device of the attaching information of number is provided, comprising:
Short message collection obtains module, is configured as obtaining sample short message collection;
Title abstraction module is configured as being extracted from the sample short message of the sample short message collection for identification number ownership The title of information;
Title merging module is configured as closing the title of the corresponding sample short message of same number of sender And;
First attaching information identification module is configured as identifying returning for the number of sender according to pooling information Belong to information.
Optionally, the short message collection acquisition module includes:
Short message acquisition submodule is configured to obtain the history short message in preset time period;
Number Reorganization submodule is configured to identify the sender number of the history short message;
Sample short message collection determines submodule, is configured to be the history short message for notifying class note number by described sender number It is determined as sample short message, obtains sample short message collection.
Optionally, the title abstraction module includes:
Title extracts submodule, is configured as when in the sample short message of the sample short message collection including special symbol group, From the information extracted between special symbol group in the sample short message, the mark of the sample short message is determined according to the information of extraction Topic;It is when not including the special symbol group in the sample short message of the sample short message collection, the title of the sample short message is true It is set to sky information.
Optionally, described device further include:
Number of sender determining module is configured as concentrating the sample short message comprising the special symbol group The corresponding sender number of sample short message is determined as number of sender;
Subsample short message collection determining module is configured as filtering out each short message transmission respectively from sample short message concentration The corresponding subsample short message collection of square number, it includes number of sender that the subsample short message, which concentrates each sample short message, short Believe recipient's number and title.
Optionally, the title merging module includes:
Merge to collect and determine submodule, is configured as calculating the corresponding short message receiver number of each title of subsample short message concentration Code number obtains subsample short message and merges collection, and it includes SMS sender that each sample short message is concentrated in the subsample short message merging Number, title, short message receiver number number, the subsample short message collection include the corresponding sample of same number of sender Short message.
Optionally, the first attaching information identification module includes:
Probability value computational submodule is configured as being merged using following formula calculating subsample short message and each title is concentrated to exist Subsample short message merges the probability value concentrated:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate Subsample short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message is closed Title title in unionkCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;
Attaching information determines submodule, is configured as the title that the probability value is greater than probability threshold value being determined as described short Believe the attaching information of sender number.
Optionally, the first attaching information identification module further include:
Attaching information filter submodule judges that the subsample short message merges and concentrates the corresponding short message receiver number of title Whether number is less than number threshold value;The corresponding title of short message receiver number number for being less than number threshold value is deleted.
Optionally, described device further include:
Incidence relation determining module is configured as determining SMS sender according to the attaching information of each number of sender The incidence relation of number and attaching information;
Second attaching information identification module is configured as the incidence relation pair according to described sender number and attaching information Destination number to be identified is identified, determines that the attaching information of the destination number, the destination number include that calling party waits for The number of dial-out, the received number of callee, the SMS sender received number of number or short message receiver to be sent.
According to the third aspect of an embodiment of the present disclosure, a kind of identification device of the attaching information of number is provided, comprising:
Processor;
Memory for storage processor executable instruction;
Wherein, the processor is configured to:
Obtain sample short message collection;
The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection;
The title of the corresponding sample short message of same number of sender is merged;
The attaching information of the number of sender is identified according to pooling information.
The technical scheme provided by this disclosed embodiment can include the following benefits:
The disclosure obtains sample short message collection, then extracts from the sample short message of sample short message collection and belongs to for identification number The title of the corresponding sample short message of same number of sender is merged, is identified according to pooling information by the title of information The attaching information of number of sender out is realized and automatically identifies the ownership letter that sample short message concentrates number of sender Breath, avoids waste of human resource caused by manual identified number, while improving recognition efficiency.
In the disclosure, since the sender number of notice class short message is different from other conventional sender numbers of short message, because This present embodiment can identify whether history short message is notice class short message by sender number, to class short message will be notified true It is set to sample short message, sample short message collection is obtained, to improve the efficiency of the attaching information of subsequent identification number.
In the disclosure, when in sample short message including the special symbol group, specific symbol can be extracted from sample short message Information between number group, the title of sample short message is determined according to the information of extraction, when not including special symbol group in sample short message When, the title of this sample short message can be determined as sky information, to improve the efficiency of the title of determining sample short message.
The disclosure can concentrate sample short message the corresponding sender number of the sample short message comprising special symbol group to determine For number of sender, so that avoiding pooling information is empty situation;In addition, it includes special symbol group that sample short message, which is concentrated, The corresponding sender number of sample short message be determined as number of sender;It is filtered out respectively from sample short message concentration each short The corresponding subsample short message collection of sender number is believed, to all including the corresponding sample short message of number of sender in son Sample short message is concentrated, and short message concentration in subsample both includes the sample short message comprising special symbol group, can also include not including The sample short message of special symbol group avoids to improve the accuracy of the subsequent attaching information for identifying number of sender When sample short message quantity not comprising special symbol group is larger, only determine that short message is sent out according to the sample short message comprising special symbol group Error caused by the corresponding attaching information of the side's of sending number.
The disclosure calculates title by the corresponding short message receiver number number of title and merges concentration in subsample short message Probability, and the biggish title of probability is determined as to the attaching information of number of sender, improve the standard of determining attaching information True property, and reduce the quantity of attaching information, convenience is brought to user.
The disclosure calculate subsample short message merge concentrate each title subsample short message merge the probability value concentrated it Before, further includes: judge that the subsample short message merges and concentrates whether the corresponding short message receiver number number of title is less than number Threshold value;The corresponding title of short message receiver number number for being less than number threshold value is deleted.Merge collection calculating subsample short message In each title when subsample short message merges the probability value concentrated, calculate the subsample short message after deleting title merge concentrate it is every A title merges the probability value concentrated in subsample short message, to reduce the calculation amount for calculating probability.
It should be understood that above general description and following detailed description be only it is exemplary and explanatory, not The disclosure can be limited.
Detailed description of the invention
The drawings herein are incorporated into the specification and forms part of this specification, and shows the implementation for meeting the disclosure Example, and together with specification for explaining the principles of this disclosure.
Fig. 1 is a kind of flow chart of the recognition methods of the attaching information of number shown in one exemplary embodiment of the disclosure.
Fig. 2 is a kind of disclosure application of the recognition methods of the attaching information of number shown according to an exemplary embodiment Scene figure.
Fig. 3 is a kind of disclosure frame of the identification device of the attaching information of number shown according to an exemplary embodiment Figure.
Fig. 4 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Fig. 5 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Fig. 6 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Fig. 7 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Fig. 8 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Fig. 9 is the frame of the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Figure.
Figure 10 is the identification device of the attaching information of the disclosure another number shown according to an exemplary embodiment Block diagram.
A kind of Figure 11 disclosure identification device of the attaching information for number shown according to an exemplary embodiment Block diagram.
Specific embodiment
Example embodiments are described in detail here, and the example is illustrated in the accompanying drawings.Following description is related to When attached drawing, unless otherwise indicated, the same numbers in different drawings indicate the same or similar elements.Following exemplary embodiment Described in embodiment do not represent all implementations consistent with this disclosure.On the contrary, they be only with it is such as appended The example of the consistent device and method of some aspects be described in detail in claims, the disclosure.
It is only to be not intended to be limiting the disclosure merely for for the purpose of describing particular embodiments in the term that the disclosure uses. The "an" of the singular used in disclosure and the accompanying claims book, " described " and "the" are also intended to including majority Form, unless the context clearly indicates other meaning.It is also understood that term "and/or" used herein refers to and wraps It may be combined containing one or more associated any or all of project listed.
It will be appreciated that though various information, but this may be described using term first, second, third, etc. in the disclosure A little information should not necessarily be limited by these terms.These terms are only used to for same type of information being distinguished from each other out.For example, not departing from In the case where disclosure range, the first information can also be referred to as the second information, and similarly, the second information can also be referred to as One information.Depending on context, word as used in this " if " can be construed to " ... when " or " when ... When " or " in response to determination ".
As shown in Figure 1, Fig. 1 is a kind of disclosure identification of the attaching information of number shown according to an exemplary embodiment The flow chart of method, comprising the following steps:
In a step 101, sample short message collection is obtained.
In a step 102, the mark for being used for identification number attaching information is extracted from the sample short message of the sample short message collection Topic.
In step 103, the title of the corresponding sample short message of same number of sender is merged.
At step 104, the attaching information of the number of sender is identified according to pooling information.
The embodiment of the present disclosure can be used in terminal, and related terminal can be intelligent terminal, such as can be and have Smart phone, Intelligent bracelet, smartwatch of communication function etc..Intelligent terminal can obtain sample short message collection from local, can To obtain sample short message collection from server, then extracted from the sample short message of sample short message collection for identification number ownership letter The title of the corresponding sample short message of same number of sender is merged, is identified according to pooling information by the title of breath The attaching information of number of sender.
The embodiment of the present disclosure can be used in server, and related server can be individual server, can also be with It is server cluster, can also be Cloud Server etc..Server obtains sample short message collection, then short from the sample of sample short message collection In letter extract be used for identification number attaching information title, by the title of the corresponding sample short message of same number of sender into Row merges, and the attaching information of number of sender is identified according to pooling information.
As seen from the above-described embodiment, then available sample short message collection is extracted from the sample short message of sample short message collection For the title of identification number attaching information, the title of the corresponding sample short message of same number of sender is merged, The attaching information of number of sender is identified according to pooling information, realization automatically identifies sample short message and short message is concentrated to send The attaching information of square number avoids waste of human resource caused by manual identified number, while improving recognition efficiency.
In an optional implementation, the embodiment of the present disclosure can be used in terminal, and terminal recognition goes out sample short message After the attaching information for concentrating number of sender, the incidence relation of number of sender and attaching information can be obtained, it will Incidence relation is stored.It then can be according to the incidence relation of described sender number and attaching information to target to be identified Number is identified, determines that the attaching information of the destination number, the destination number may include calling party's number to be transferred to Code, the received number of callee, the SMS sender received number of number or short message receiver to be sent.
For the application of sender number and the corresponding relationship of attaching information, terminal can be in the electricity for receiving caller When talking about number, the corresponding attaching information of telephone number is identified by corresponding relationship, and reminded according to attaching information.It reminds Mode can be shows attaching information on a display screen, can also be voice broadcast attaching information etc., so as to avoid because answering Loss caused by dangerous phone.For another example, terminal is identified before receiving the dial instruction for telephone number by corresponding relationship The corresponding attaching information of telephone number out, and reminded according to attaching information, avoiding causes because dialing dangerous telephone number Loss, save call cost.For another example, terminal identifies the sender number of short message by corresponding relationship when receiving short message The corresponding attaching information of code, and SMS interception or prompting are carried out according to attaching information, convenience is brought to user.Another can In the implementation of choosing, the embodiment of the present disclosure is in server, server to identify that sample short message concentrates SMS sender number After the attaching information of code, the incidence relation of number of sender and attaching information can be obtained;Incidence relation is sent to again Terminal determines the ownership letter of the destination number so that terminal identifies destination number to be identified according to incidence relation Breath.
As shown in Fig. 2, Fig. 2 is a kind of disclosure identification of the attaching information of number shown according to an exemplary embodiment The application scenario diagram of method.Disclosure scheme can execute in server beyond the clouds, and cloud server is determining SMS sender After the incidence relation of number and attaching information, incidence relation can be pushed into terminal, so that terminal is treated according to incidence relation The destination number of identification is identified, determines the attaching information of destination number.Terminal can be smart phone, Intelligent bracelet, Ipad etc..
Wherein, it the time that incidence relation is sent for server, can be after determining incidence relation, incidence relation is wide Each terminal is cast to, incidence relation can be stored in local by each terminal;It is also possible to receive terminal initiation in server When attaching information acquisition request, the incidence relation of number of sender and attaching information is sent to by terminal according to request.
Wherein, how according to the incidence relation of number of sender and attaching information to be identified and be marked for terminal Note, no longer limits herein.
In this embodiment, identify that sample short message concentrates the ownership of number of sender to believe by server centered Breath, determines the incidence relation of number of sender and attaching information, and incidence relation is pushed to each terminal, keeps each terminal total The incidence relation is enjoyed, has more the incidence relation of number of sender and attaching information comprehensive, while by server set Middle determining incidence relation avoids each terminal from all determining the wasting of resources caused by incidence relation.
Then, the disclosure is respectively illustrated each step in Fig. 1 respectively.
About step 101, sample short message collection can be the set of the short message obtained in certain time.In order to save meter Calculation amount, sample short message collection can be the set of the notice class short message obtained in certain time.An optional realization side In formula, sample short message collection can be obtained using following manner:
A1: the history short message in preset time period is obtained.
Wherein, preset time period can be preset, for example, preset time period can be set as in one month, in one week Deng.History short message can be the short message that different senders issue different recipients.In this step, history short message, which includes at least, to be sent Square number.It further, can also include short message content and recipient's number.
A2: the sender number of the history short message is identified.
A3: it is that the history short message of class note number is notified to be determined as sample short message by described sender number, obtains sample Short message collection.
Since the sender number of notice class short message is different from other conventional sender numbers of short message, the present embodiment Can identify whether history short message is notice class short message by sender number, so that it is short that notice class short message is determined as sample Letter obtains sample short message collection, to improve the efficiency of subsequent determining number and attaching information corresponding relationship.
It should be understood that, in addition to the above method, can also be used when whether judge history short message is notice class short message Judgment method in the related technology, no longer limits herein.
About step 102, the mark for identification number attaching information can be extracted from the sample short message of sample short message collection Topic.The attaching information of number can be number source side title, for example, it may be the machine of the Business Name of number home or ownership Structure title etc., the attaching information of number is also possible to the purpose information of the number communication content, for example, the purpose of this short message is " prompting ".
There are many kinds of the methods of extracting header, for example, word segmentation processing, keyword match etc..The disclosure is enumerated one kind and is passed through The method of special symbol group extracting header, which comprises
B1: it when in the sample short message of the sample short message collection including special symbol group, is extracted from the sample short message Information between special symbol group determines the title of the sample short message according to the information of extraction.
B2: when not including the special symbol group in the sample short message of the sample short message collection, by the sample short message Title be determined as sky information.
Wherein, special symbol group can be with annotated set of symbols, such as bracket, and bracket may include braces The forms such as " { } ", bracket " [] ", round bracket " () ", hexagonal bracket " () ", angle brackets "<>" and square toes bracket " [] ".
The present embodiment specifically describe it is a kind of extract title method, when in sample short message include the special symbol group When, it can be from the information extracted in the short message content of sample short message between special symbol group, for example, regular expression can be used Mode extract the information between special symbol group.It is understood that the information extracted is in the short message content of sample short message Information, for example, when special symbol group be square brackets when, then extract the content in short message content in square brackets.When sample short message In do not include special symbol group when, the title of this sample short message can be determined as sky information.So-called sky information, can be not There is character, be also possible to specific character, which indicates that attaching information is sky.
About step 103, after extracting header, every sample short message that sample short message is concentrated has corresponding title, can To merge the title of the corresponding sample short message of same number of sender, pooling information is obtained.
When merging, all titles that subsample short message can be concentrated are merged, and obtain the short of subsample short message collection Believe the pooling information of sender number, the subsample short message collection includes the corresponding sample short message of same number of sender; Subsample short message can also be calculated and concentrate the corresponding short message receiver number number of each title, subsample short message is obtained and merge Collection, it includes number of sender, title, short message receiver number that the subsample short message, which merges each sample short message of concentration, Number, the subsample short message collection includes the corresponding sample short message of same number of sender.
In addition, being directed to subsample short message collection, number of sender can be sample short message and concentrate all sample short messages The corresponding sample short message group of same number of sender then can be combined into subsample short message collection, for every by sender number A sender number has corresponding subsample short message collection, the title of the sample short message that can directly concentrate subsample short message into Row merges, and obtains the corresponding pooling information of each sender number.However due to being concentrated in sample short message, the short message of sample short message There may be titles in content, it is also possible to title be not present, then the not necessarily corresponding subsample short message collection of each sender number In all there is title, then use the above method there may be pooling informations as empty situation, increase calculation amount.
In order to avoid such case, the title of the corresponding sample short message of same number of sender is merged it Before, can also include C1 step and C2 step:
C1: the corresponding sender number of the sample short message comprising the special symbol group is concentrated to determine the sample short message For number of sender.
C2: filtering out the corresponding subsample short message collection of each number of sender from sample short message concentration respectively, It includes number of sender, short message receiver number and title that the subsample short message, which concentrates each sample short message,.
Because having the annotated information of the short message in special symbol group, the short message of special symbol group can will be had Sender number be considered that headed number of sender may be associated with.
It is concentrated in sample short message, there may be special symbol groups in the short message content of sample short message, it is also possible to which there is no spies Determine set of symbols, the present embodiment filters out the sample short message comprising special symbol group, can be by the transmission of the sample short message filtered out Square number is determined as number of sender, and multiple number of sender can form number of sender collection, then needle Number of sender is carried out to divide subsample short message collection, guarantees that number of sender is number at least with a title Code is empty situation so as to avoid pooling information.
Wherein, from sample short message concentration filter out respectively the corresponding subsample short message collection of each number of sender be from Sample short message concentrates the sample short message for filtering out that sender number is number of sender, obtains the number of sender pair The subsample short message collection answered.When number of sender is multiple, then the corresponding son of each number of sender is filtered out Sample short message collection realizes the corresponding subsample short message collection of a number of sender.It, can for each subsample short message collection The corresponding subsample short message merging collection of the number of sender is obtained to execute step 103.
Wherein, subsample short message concentrates every sample short message there are corresponding title, the quantity of title can for one or It is multiple, can also for sky.Since same subsample short message is concentrated, the title under different sample short messages may be different, therefore can be with The incidence relation of every sample short message and title is established, such as can establish the triplet information of every sample short message, triple Information may include number of sender, short message receiver number, title.It is understood that the same subsample short message It concentrates, the number of sender in the triple of every sample short message is identical.
The sample short message is concentrated the corresponding sender number of sample short message comprising the special symbol group by this step It is determined as number of sender, can is empty situation to avoid pooling information;In addition, being filtered out respectively from sample short message concentration The corresponding subsample short message collection of each number of sender, thus by the corresponding sample short message whole capsule of number of sender It includes and is concentrated in subsample short message, short message concentration in subsample had both included the sample short message comprising special symbol group, can also include Sample short message not comprising special symbol group, to improve the accurate of the subsequent attaching information for identifying number of sender Property, it avoids when the sample short message quantity for not including special symbol group is larger, only according to the sample short message comprising special symbol group Determine error caused by the corresponding attaching information of number of sender.
Further it will be understood that about step 102, step C1 and step C2, can first sample drawn short message concentrate it is every The title of sample short message divides subsample short message collection further according to result is extracted;It can also first divide subsample short message collection, then from Subsample short message concentrates the title for extracting each sample short message, and it includes SMS sender number that final purpose, which is provided to obtain, The subsample short message set of code, short message receiver number and title.
For example, the title in all sample short messages can be extracted first, each sample short message includes sender number, recipient Number, title.Then the corresponding sender number of sample short message comprising special symbol group is concentrated to be determined as sample short message short Believe sender number;The corresponding subsample short message collection of each number of sender, institute are filtered out respectively from sample short message concentration Stating subsample short message and concentrating each sample short message includes number of sender, short message receiver number and title.
For another example, it after acquisition includes the sample short message collection of short message content, sender number, recipient's number, can incite somebody to action Sample short message concentrates the corresponding sender number of sample short message comprising special symbol group to be determined as number of sender;From sample This short message concentration filters out the corresponding initial sample short message collection of each number of sender respectively, and initial sample short message is concentrated every Sample short message includes short message content, sender number, recipient's number.Then from the sample short message of initial sample short message collection The title for being used for identification number attaching information is extracted in short message content, obtains the corresponding final subsample of the number of sender Short message collection, it includes number of sender, short message receiver number and mark which, which concentrates each sample short message, Topic.
Based on this, the increment including number of sender, short message receiver number and title is obtained using the above method After this short message collection, the title of the corresponding sample short message of same number of sender can be closed according to subsample short message collection And.
It include number of sender, short message receiver number and title obtaining in an optional implementation Subsample short message collection after, the title by the corresponding sample short message of same number of sender merges, and can wrap It includes: calculating the subsample short message and concentrate the corresponding short message receiver number number of each title, obtain subsample short message and merge Collection, it includes number of sender, title, short message receiver number that the subsample short message, which merges each sample short message of concentration, Number.
Due to subsample short message centralized recording have the number of sender of every sample short message, short message receiver number and Title can then count subsample short message and concentrate the corresponding short message receiver number number of each title, so as to obtain Subsample short message comprising number of sender, title, short message receiver number number incidence relation merges collection.
It include number of sender, short message receiver number and mark obtaining in another optional implementation After the subsample short message collection of topic, all titles that subsample short message can be concentrated are merged, and obtain subsample short message collection The pooling information of number of sender.
SMS sender directly can be determined according to pooling information in an optional implementation about step 104 The attaching information of number.For example, when directly merging the title that subsample short message is concentrated in step 103, by merging Attaching information of the title as the number of sender of the subsample short message collection.The feelings that this mode is suitble to title fewer Condition, and the situation that empty information is fewer, the mode efficiency of this determining attaching information are higher.
It include number of sender, title, short message when being obtained in step 103 in another optional implementation When the subsample short message of recipient's number number merges collection, it can be determined that subsample short message, which merges, concentrates the corresponding short message of title to connect Whether debit's number number is greater than number threshold value;It will be greater than the corresponding title conduct of short message receiver number number of number threshold value The attaching information of the number of sender.This mode can reduce the amount of attaching information.
It include number of sender, title, short message when being obtained in step 103 in another optional implementation When the subsample short message of recipient's number number merges collection, it can be concentrated using the calculating subsample short message merging of following formula each Title merges the probability value concentrated in subsample short message:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate Subsample short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message is closed Title title in unionkCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;By institute State the attaching information that probability value is determined as the number of sender greater than the title of probability threshold value.
After determining the corresponding short message receiver number number of each title, subsample short message is calculated using above-mentioned formula and is closed And each title is concentrated to merge the probability value concentrated in subsample short message, it is sent so that the biggish title of probability is determined as short message The attaching information of square number.
The present embodiment calculates title by the corresponding short message receiver number number of title and merges concentration in subsample short message Probability, and the biggish title of probability is determined as to the attaching information of number of sender, improves determining attaching information Accuracy, and reduce the quantity of attaching information, convenience is brought to user.
Further, the title that the probability value is greater than probability threshold value is determined as to the ownership of the number of sender Information Step may include: to filter out most probable value from determining probability value, when most probable value is greater than probability threshold value When, the corresponding title of most probable value is determined as to the attaching information of number of sender, so as to by each number Attaching information is limited to one, further brings convenience to user.
Further, the subsample short message each title of merging concentration is being calculated in the probability value of subsample short message merging concentration Before, further includes: it is a to judge that the subsample short message merging concentrates the corresponding short message receiver number number of title whether to be less than Number threshold value;The corresponding title of short message receiver number number for being less than number threshold value is deleted.It calculates subsample short message and merges collection In each title when subsample short message merges the probability value concentrated, calculate the subsample short message after deleting and merge and concentrate each mark It inscribes and merges the probability value concentrated in subsample short message, to reduce the calculation amount for calculating probability.
Various technical characteristics in embodiment of above can be arbitrarily combined, as long as the combination between feature is not present Conflict or contradiction, but as space is limited, it is not described one by one, therefore the various technical characteristics in above embodiment is any It is combined the range for also belonging to this disclosure.
The disclosure also lists one of them specific example and is illustrated.In this example, the knowledge of the attaching information of number Other method includes:
S1: notice class sample short message collection S is obtained.
S2: the sample short message comprising special symbol group is filtered out from notice class sample short message concentration, obtains short message collection Ssub。 Enable short message collection SsubIn sender number be number of sender, obtain number of sender collection N (number (1), number(2)……number(t)…)。
The operation of following S3 to S7 is executed for each number of sender.
S3: integrate the initial subsample short message collection that sender number is filtered out in S as number (t) from notice class sample short message Snumber, initial subsample short message collection SnumberIn every sample short message may include triplet information: < number (t), short message connect Debit's number, short message content >.Such as:
Triple 1: < 106988888888,13488888888, " [Tentent Science] [mail reminder of QQ mailbox] sender: Mr. Zhang, Taobao's theme: ... " >.
Triple 2: < 106988888888,13444444444, " your identifying code of this operation is 5889 (in 20 minutes Effectively), please complete to verify, [Tentent Science] [warm tip] " >.
Triple 3: < 106988888888,13455555555, " Mr. Zhang pays the bill 150.00 yuan to your 134*5555.Horse On check and accept.[Alipay] " >.
Triple 4: < 106988888888,13466666666, " you are good by the client respected!Mr. Zhang was 10 days 10: 30 May Give you to send a telegram here, please reply in time " >.
S4: when initial subsample short message concentrates sample short message to include special symbol group, by regular expression from sample The information between special symbol group is extracted in short message, and the title of sample short message is determined according to the information of extraction;When in sample short message When not comprising special symbol group, the title of sample short message is determined as sky information.Every galley proof is established according to identified each title The incidence relation of number of sender, short message receiver number and title in this short message, then can will be in above-mentioned example Triple replaces with new triple<number (t), short message receiver number, and title>, final subsample short message collection is obtained, point It is not as follows:
New triple 1:<106988888888,13488888888, { Tentent Science, the mail reminder of QQ mailbox }>.
New triple 2:<106988888888,13444444444, { Tentent Science, warm prompting }>.
New triple 3:<106988888888,13455555555, { Alipay }>.
New triple 4:<106988888888,13466666666, " { }>.
S5: it calculates subsample short message and concentrates the corresponding short message receiver number number of each title, is i.e. calculating subsample is short Letter concentrates each title to be received by how many a numbers.For example, in above-mentioned example " Tentent Science " quilt " 13488888888 " and " 13444444444 " receive, then " Tentent Science " corresponding recipient's number number is 2.In this regard, short message transmission can be generated Square number, title, recipient's number number incidence relation, obtain subsample short message merge collection, it is as follows:
<106988888888, " Tentent Science ", 2>;
<106988888888, " mail reminder of QQ mailbox ", 1>;
<106988888888, " warmth is reminded ", 1>;
<106988888888, " Alipay ", 1>;
<106988888888, " ", 1>.
S6: can preset and count threshold value one by one, and the title that recipient's number number is less than number threshold value is deleted.Example Such as, number threshold value is set as 2, then the title that number number is less than a 2 is deleted, is left following information after deletion:
<106988888888, " Tentent Science ", 2>.
It is understood that after carrying out the processing of S6 step, it, can be directly true by title if only remaining next title It is set to attaching information.If there remains multiple titles, S7 step can be executed and screened again.
S7: it uses following formula to calculate subsample short message and merges each title of concentration in the general of subsample short message merging concentration Rate value:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate Subsample short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message is closed Title title in unionkCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number.It will be general The title that rate value is greater than probability threshold value is determined as the attaching information of the number of sender.
Corresponding with the embodiment of the recognition methods of the attaching information of aforementioned number, the disclosure additionally provides the ownership of number The embodiment of the identification device of information and its applied terminal.
As shown in figure 3, Fig. 3 is a kind of disclosure identification of the attaching information of number shown according to an exemplary embodiment The block diagram of device, described device include: that short message collection obtains module 31, title abstraction module 32, title merging module 33 and first Attaching information identification module 34.
Wherein, short message collection obtains module 31, is configured as obtaining sample short message collection.
Title abstraction module 32 is configured as extracting from the sample short message of the sample short message collection and return for identification number Belong to the title of information.
Title merging module 33 is configured as closing the title of the corresponding sample short message of same number of sender And.
First attaching information identification module 34 is configured as identifying the number of sender according to pooling information Attaching information.
As seen from the above-described embodiment, sample short message collection is obtained, then extracts and is used for from the sample short message of sample short message collection The title of identification number attaching information merges the title of the corresponding sample short message of same number of sender, according to Pooling information identifies the attaching information of number of sender, and realization automatically identifies sample short message and concentrates SMS sender number The attaching information of code, avoids waste of human resource caused by manual identified number, while improving recognition efficiency.
As shown in figure 4, Fig. 4 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, for the embodiment on the basis of aforementioned embodiment illustrated in fig. 3, it includes: short that the short message collection, which obtains module 31, Letter acquisition submodule 311, Number Reorganization submodule 312 and sample short message collection determine submodule 313.
Wherein, short message acquisition submodule 311 is configured to obtain the history short message in preset time period.
Number Reorganization submodule 312 is configured to identify the sender number of the history short message.
Sample short message collection determines submodule 313, is configured to be the history for notifying class note number by described sender number Short message is determined as sample short message, obtains sample short message collection.
As seen from the above-described embodiment, due to the sender number of the sender number of notice class short message and other conventional short messages Difference, therefore the present embodiment can identify whether history short message is notice class short message by sender number, thus will notice Class short message is determined as sample short message, sample short message collection is obtained, to improve the efficiency of the attaching information of subsequent identification number.
As shown in figure 5, Fig. 5 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, for the embodiment on the basis of aforementioned embodiment illustrated in fig. 3, the title abstraction module 32 includes: title Extract submodule 321.
Wherein, title extracts submodule 321, is configured as in the sample short message for working as the sample short message collection comprising specific symbol When number group, from the information extracted between special symbol group in the sample short message, determine that the sample is short according to the information of extraction The title of letter;When not including the special symbol group in the sample short message of the sample short message collection, by the sample short message Title is determined as sky information.
As seen from the above-described embodiment, it when in sample short message including the special symbol group, can be taken out from sample short message The information between special symbol group is taken, the title of sample short message is determined according to the information of extraction, it is special when not including in sample short message When determining set of symbols, the title of this sample short message can be determined as sky information, to improve the title of determining sample short message Efficiency.
As shown in fig. 6, Fig. 6 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, the embodiment is on the basis of aforementioned embodiment illustrated in fig. 5, described device further include: SMS sender number Code determining module 35 and subsample short message collection determining module 36.
Wherein, number of sender determining module 35 is configured as concentrating the sample short message comprising described specific The corresponding sender number of sample short message of set of symbols is determined as number of sender.
Subsample short message collection determining module 36 is configured as filtering out each short message hair respectively from sample short message concentration The corresponding subsample short message collection of the side's of sending number, the subsample short message concentrate each sample short message include number of sender, Short message receiver number and title.
As seen from the above-described embodiment, sample short message can be concentrated into the corresponding transmission of sample short message comprising special symbol group Square number is determined as number of sender;The corresponding son of each number of sender is filtered out respectively from sample short message concentration Sample short message collection is concentrated to all including the corresponding sample short message of number of sender in subsample short message, subsample Short message concentration both includes the sample short message comprising special symbol group, can also include that the sample not comprising special symbol group is short Letter avoids not including special symbol group to improve the accuracy of the subsequent attaching information for identifying number of sender Sample short message quantity it is larger when, only determine that number of sender is corresponding according to the sample short message comprising special symbol group and return Belong to error caused by information.
As shown in fig. 7, Fig. 7 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, for the embodiment on the basis of aforementioned embodiment illustrated in fig. 3, the title merging module 33 includes: to merge Collect and determines submodule 331.
Wherein, merge to collect and determine submodule 331, be configured as calculating the corresponding short message of each title of subsample short message concentration Recipient's number number obtains subsample short message and merges collection, and it includes short that the subsample short message, which merges each sample short message of concentration, Believe sender number, title, short message receiver number number, the subsample short message collection includes same number of sender pair The sample short message answered.
As shown in figure 8, Fig. 8 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, on the basis of aforementioned embodiment illustrated in fig. 7, the first attaching information identification module 34 wraps the embodiment Include: probability value computational submodule 341 and attaching information determine submodule 342.
Wherein, probability value computational submodule 341 is configured as calculating subsample short message merging concentration using following formula often A title merges the probability value concentrated in subsample short message:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate Subsample short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message is closed Title title in unionkCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number.
Attaching information determines submodule 342, and the title for being configured as the probability value being greater than probability threshold value is determined as institute State the attaching information of number of sender.
As seen from the above-described embodiment, title is calculated in subsample short message by the corresponding short message receiver number number of title Merge the probability concentrated, and the biggish title of probability is determined as to the attaching information of number of sender, improves determination and return Belong to the accuracy of information, and reduce the quantity of attaching information, brings convenience to user.
As shown in figure 9, Fig. 9 is the knowledge of the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of other device, for the embodiment on the basis of aforementioned embodiment illustrated in fig. 8, the first attaching information identification module 34 is also It include: attaching information filter submodule 343.
Wherein, attaching information filter submodule 343 judges that the subsample short message merges and the corresponding short message of title is concentrated to connect Whether debit's number number is less than number threshold value;The corresponding title of short message receiver number number for being less than number threshold value is deleted It removes.
As seen from the above-described embodiment, the subsample short message each title of merging concentration is being calculated in subsample short message merging concentration Probability value before, further includes: judge that the subsample short message merges and concentrate the corresponding short message receiver number number of title to be It is no to be less than number threshold value;The corresponding title of short message receiver number number for being less than number threshold value is deleted.It is short to calculate subsample Letter, which merges, concentrates each title when subsample short message merges the probability value concentrated, and calculates the subsample short message after deleting and merges collection In each title subsample short message merge concentrate probability value, thus reduce calculate probability calculation amount.
As shown in Figure 10, Figure 10 is the attaching information of the disclosure another number shown according to an exemplary embodiment The block diagram of identification device, the embodiment is on the basis of aforementioned embodiment illustrated in fig. 3, described device further include: incidence relation is true Cover half block 37 and the second attaching information identification module 38.
Wherein, incidence relation determining module 37 is configured as being determined according to the attaching information of each number of sender short Believe the incidence relation of sender number and attaching information.
Second attaching information identification module 38 is configured as the incidence relation according to described sender number and attaching information Destination number to be identified is identified, determines that the attaching information of the destination number, the destination number include calling party Number to be transferred to, the received number of callee, the SMS sender received number of number or short message receiver to be sent.
Correspondingly, the disclosure also provides a kind of identification device of the attaching information of number, described device includes processor; Memory for storage processor executable instruction;Wherein, the processor is configured to:
Obtain sample short message collection.
The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection.
The title of the corresponding sample short message of same number of sender is merged.
The attaching information of the number of sender is identified according to pooling information.
The function of each unit and the realization process of effect are specifically detailed in the above method and correspond to step in above-mentioned apparatus Realization process, details are not described herein.
For device embodiment, since it corresponds essentially to embodiment of the method, so related place is referring to method reality Apply the part explanation of example.The apparatus embodiments described above are merely exemplary, wherein described be used as separation unit The unit of explanation may or may not be physically separated, and component shown as a unit can be or can also be with It is not physical unit, it can it is in one place, or may be distributed over multiple network units.It can be according to actual The purpose for needing to select some or all of the modules therein to realize disclosure scheme.Those of ordinary skill in the art are not paying Out in the case where creative work, it can understand and implement.
As shown in figure 11, Figure 11 is a kind of identification of attaching information for number shown according to an exemplary embodiment One structural schematic diagram of device 1100.For example, device 1100 may be provided as a server.Referring to Fig.1 1, device 1100 wraps Processing component 1122 is included, further comprises one or more processors, and the money of the memory as representated by memory 1132 Source, can be by the instruction of the execution of processing component 1122, such as application program for storing.The application journey stored in memory 1132 Sequence may include it is one or more each correspond to one group of instruction module.In addition, processing component 1122 is configured To execute instruction, to execute the recognition methods of the attaching information of above-mentioned number.
Device 1100 can also include that a power supply module 1126 be configured as the power management of executive device 1100, and one Wired or wireless network interface 1150 is configured as device 1100 being connected to network and input and output (I/O) interface 1158.Device 1100 can be operated based on the operating system for being stored in memory 1132, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or similar.
Those skilled in the art after considering the specification and implementing the invention disclosed here, will readily occur to its of the disclosure Its embodiment.The disclosure is intended to cover any variations, uses, or adaptations of the disclosure, these modifications, purposes or Person's adaptive change follows the general principles of this disclosure and including the undocumented common knowledge in the art of the disclosure Or conventional techniques.The description and examples are only to be considered as illustrative, and the true scope and spirit of the disclosure are by following Claim is pointed out.
It should be understood that the present disclosure is not limited to the precise structures that have been described above and shown in the drawings, and And various modifications and changes may be made without departing from the scope thereof.The scope of the present disclosure is only limited by the accompanying claims.
The foregoing is merely the preferred embodiments of the disclosure, not to limit the disclosure, all essences in the disclosure Within mind and principle, any modification, equivalent substitution, improvement and etc. done be should be included within the scope of disclosure protection.

Claims (15)

1. a kind of recognition methods of the attaching information of number, which is characterized in that the described method includes:
Obtain sample short message collection;
The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection;
The title of the corresponding sample short message of same number of sender is merged;
The attaching information of the number of sender is identified according to pooling information, comprising:
Subsample short message, which is calculated, using following formula merges the probability value for concentrating each title to concentrate in the merging of subsample short message:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate increment This short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message merges collection Middle title titlekCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;Subsample is short Letter merges the incidence relation that collection includes number of sender, title, short message receiver number number;
The title that the probability value is greater than probability threshold value is determined as to the attaching information of the number of sender.
2. the method according to claim 1, wherein the acquisition sample short message collection includes:
Obtain the history short message in preset time period;
The sender number of the history short message is identified;
It is that the history short message of class note number is notified to be determined as sample short message by described sender number, obtains sample short message collection.
3. the method according to claim 1, wherein described extract from the sample short message of the sample short message collection Title for identification number attaching information, comprising:
When in the sample short message of the sample short message collection including special symbol group, special symbol is extracted from the sample short message Information between group determines the title of the sample short message according to the information of extraction;
It is when not including the special symbol group in the sample short message of the sample short message collection, the title of the sample short message is true It is set to sky information.
4. according to the method described in claim 3, it is characterized in that, the method also includes:
The corresponding sender number of sample short message comprising the special symbol group is concentrated to be determined as short message the sample short message Sender number;
The corresponding subsample short message collection of each number of sender, the increment are filtered out respectively from sample short message concentration It includes number of sender, short message receiver number and title that this short message, which concentrates each sample short message,.
5. the method according to claim 1, wherein described that the corresponding sample of same number of sender is short The title of letter merges, comprising:
It calculates subsample short message and concentrates the corresponding short message receiver number number of each title, obtain subsample short message and merge collection, The subsample short message merges the incidence relation that collection includes number of sender, title, short message receiver number number, described Subsample short message collection includes the corresponding sample short message of same number of sender.
6. the method according to claim 1, wherein described calculate subsample short message merging collection using following formula In each title subsample short message merge concentrate probability value before, further includes:
Judge that the subsample short message merges and concentrates whether the corresponding short message receiver number number of title is less than number threshold value;
The corresponding title of short message receiver number number for being less than number threshold value is deleted.
7. the method according to claim 1, wherein the method also includes:
The incidence relation of number of sender and attaching information is determined according to the attaching information of each number of sender;
Destination number to be identified is identified according to the incidence relation of described sender number and attaching information, described in determination The attaching information of destination number, the destination number include calling party's number to be transferred to, the received number of callee, short message hair The side's of sending number to be sent or the received number of short message receiver.
8. a kind of identification device of the attaching information of number, which is characterized in that described device includes:
Short message collection obtains module, is configured as obtaining sample short message collection;
Title abstraction module is configured as extracting from the sample short message of the sample short message collection for identification number attaching information Title;
Title merging module is configured as merging the title of the corresponding sample short message of same number of sender;
First attaching information identification module is configured as identifying the ownership letter of the number of sender according to pooling information Breath, comprising:
Subsample short message, which is calculated, using following formula merges the probability value for concentrating each title to concentrate in the merging of subsample short message:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate increment This short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message merges collection Middle title titlekCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;Subsample is short Letter merges the incidence relation that collection includes number of sender, title, short message receiver number number;
The title that the probability value is greater than probability threshold value is determined as to the attaching information of the number of sender.
9. device according to claim 8, which is characterized in that the short message collection obtains module and includes:
Short message acquisition submodule is configured to obtain the history short message in preset time period;
Number Reorganization submodule is configured to identify the sender number of the history short message;
Sample short message collection determines submodule, is configured to be that the history short message of class note number is notified to determine by described sender number For sample short message, sample short message collection is obtained.
10. device according to claim 8, which is characterized in that the title abstraction module includes:
Title extracts submodule, is configured as when in the sample short message of the sample short message collection including special symbol group, from institute The information extracted between special symbol group in sample short message is stated, the title of the sample short message is determined according to the information of extraction;When When not including the special symbol group in the sample short message of the sample short message collection, the title of the sample short message is determined as sky Information.
11. device according to claim 10, which is characterized in that described device further include:
Number of sender determining module is configured as the sample short message concentrating the sample comprising the special symbol group The corresponding sender number of short message is determined as number of sender;
Subsample short message collection determining module is configured as filtering out each SMS sender number respectively from sample short message concentration The corresponding subsample short message collection of code, it includes that number of sender, short message connect that the subsample short message, which concentrates each sample short message, Debit's number and title.
12. device according to claim 8, which is characterized in that the title merging module includes:
Merge to collect and determine submodule, is configured as calculating the corresponding short message receiver number of each title of subsample short message concentration Number obtains subsample short message and merges collection, and the subsample short message merges collection and receives comprising number of sender, title, short message The incidence relation of square number number, the subsample short message collection include the corresponding sample short message of same number of sender.
13. device according to claim 8, which is characterized in that the first attaching information identification module further include:
Attaching information filter submodule judges that the subsample short message merges and concentrates the corresponding short message receiver number number of title Whether number threshold value is less than;The corresponding title of short message receiver number number for being less than number threshold value is deleted.
14. device according to claim 8, which is characterized in that described device further include:
Incidence relation determining module is configured as determining number of sender according to the attaching information of each number of sender With the incidence relation of attaching information;
Second attaching information identification module is configured as treating knowledge according to the incidence relation of described sender number and attaching information Other destination number is identified, determines that the attaching information of the destination number, the destination number include that calling party waits transfering to Number, the received number of callee, the SMS sender received number of number or short message receiver to be sent.
15. a kind of identification device of the attaching information of number characterized by comprising
Processor;
Memory for storage processor executable instruction;
Wherein, the processor is configured to:
Obtain sample short message collection;
The title for being used for identification number attaching information is extracted from the sample short message of the sample short message collection;
The title of the corresponding sample short message of same number of sender is merged;
The attaching information of the number of sender is identified according to pooling information, comprising:
Subsample short message, which is calculated, using following formula merges the probability value for concentrating each title to concentrate in the merging of subsample short message:
Wherein, P (titlei) indicate title titleiIn subsample, short message merges the probability value concentrated, C (titlei) indicate increment This short message, which merges, concentrates title titleiCorresponding short message receiver number number, C (titlek) indicate that subsample short message merges collection Middle title titlekCorresponding short message receiver number number, n indicate that subsample short message merges and concentrate title number;Subsample is short Letter merges the incidence relation that collection includes number of sender, title, short message receiver number number;
The title that the probability value is greater than probability threshold value is determined as to the attaching information of the number of sender.
CN201510728723.8A 2015-10-30 2015-10-30 The recognition methods of the attaching information of number and device Active CN105430654B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510728723.8A CN105430654B (en) 2015-10-30 2015-10-30 The recognition methods of the attaching information of number and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510728723.8A CN105430654B (en) 2015-10-30 2015-10-30 The recognition methods of the attaching information of number and device

Publications (2)

Publication Number Publication Date
CN105430654A CN105430654A (en) 2016-03-23
CN105430654B true CN105430654B (en) 2018-12-11

Family

ID=55508523

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510728723.8A Active CN105430654B (en) 2015-10-30 2015-10-30 The recognition methods of the attaching information of number and device

Country Status (1)

Country Link
CN (1) CN105430654B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106101464A (en) * 2016-05-26 2016-11-09 北京小米移动软件有限公司 Number mark method and device
CN109561402A (en) * 2017-09-26 2019-04-02 中国电信股份有限公司 Information acquisition method, device and mobile terminal
CN108494977B (en) * 2018-02-09 2020-12-29 北京泰迪熊移动科技有限公司 Method, device and system for identifying short signal code
CN113810547B (en) * 2020-06-16 2023-12-15 中国移动通信集团重庆有限公司 Voice call safety protection method and device and computing equipment

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103369095A (en) * 2012-03-30 2013-10-23 北京千橡网景科技发展有限公司 Method and device for type identification of incoming call or text message
CN104618877A (en) * 2015-01-30 2015-05-13 广东欧珀移动通信有限公司 Short message arranging method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8704863B2 (en) * 2010-04-07 2014-04-22 Apple Inc. Transitioning between circuit switched calls and video calls

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103369095A (en) * 2012-03-30 2013-10-23 北京千橡网景科技发展有限公司 Method and device for type identification of incoming call or text message
CN104618877A (en) * 2015-01-30 2015-05-13 广东欧珀移动通信有限公司 Short message arranging method and device

Also Published As

Publication number Publication date
CN105430654A (en) 2016-03-23

Similar Documents

Publication Publication Date Title
CN106330685B (en) Group message reminding method and terminal
CN105430654B (en) The recognition methods of the attaching information of number and device
US20130294594A1 (en) Automating the identification of meeting attendees
WO2013166922A1 (en) Information processing method and terminal
CN104883671B (en) A kind of judgment method and system of refuse messages
CN103517228A (en) Contact information prompting method of mobile terminal and system of mobile terminal and mobile terminal
WO2013127359A1 (en) Method and device for processing contact persons, and mobile terminal
CN103501374A (en) Telephone book sequencing method and device as well as terminal
CN105049608B (en) Short message verification code processing method, device and mobile terminal
CN101877737A (en) Communication device and image sharing method thereof
CN104219399B (en) Method for selecting number, terminal, strategic server and the system of one-card multi-number
CN103442140B (en) Method and system for cooperation of input method and address list, and mobile terminal
CN105072238A (en) Method and apparatus for creating contact list according to note information of newly-added number
WO2014201872A1 (en) Method and device for processing short messages
CN103369100A (en) Mobile terminal and method for generating head portrait of contact person
CN101996030A (en) Mobile device and common text inserting method thereof
CN102883291B (en) Method, client and the system of bulk SMS breath
CN103916526B (en) Contact person information processing method, device and mobile terminal
WO2015123923A1 (en) Mobile terminal, and method and storage medium thereof for control of a message service
CN105208179A (en) Telephone number recognition system and method, and electronic product
WO2013167015A2 (en) Method, apparatus and mobile terminal for implementing profile mode setting
CN105472144A (en) Situation management method and system, and electronic equipment
CN108206893A (en) call processing method and device
CN106485520A (en) Across channel communicating control method and server
CN105808568B (en) Context distributed reasoning method and device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant