CN109857773A - Method and apparatus for automatically analyzing service numbers


Info

Publication number
CN109857773A
Authority
CN
China
Prior art keywords
liaison
communications
service number
judging result
service
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811573549.4A
Other languages
Chinese (zh)
Other versions
CN109857773B (en)
Inventor
林文楷
周成祖
周宏�
刘源
杜新胜
温若辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Meiya Pico Information Co Ltd
Original Assignee
Xiamen Meiya Pico Information Co Ltd
Application filed by Xiamen Meiya Pico Information Co Ltd
Priority to CN201811573549.4A
Publication of CN109857773A
Application granted
Publication of CN109857773B
Legal status: Active

Landscapes

  • Telephonic Communication Services (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention provides a method and apparatus for automatically analyzing service numbers. The method comprises: extracting communication object features from communication data to obtain a data set of communication objects; classifying the data set of communication objects according to predetermined conditions to obtain an analysis set; performing a Gaussian distribution calculation on the communication object counts in the analysis set to obtain the normal distribution of the analysis set; and judging, according to the position of a communication object count of the analysis set within that normal distribution, whether the data set corresponds to a service number. The method and apparatus of the present invention overcome the problem that service numbers cannot be matched because of characteristics such as time differences and individual differences, and enable automatic analysis and extraction of service numbers.

Description

Method and apparatus for automatically analyzing service numbers
Technical field
The present invention relates to the field of computer technology, and more specifically to a method for analyzing service numbers.
Background
To further strengthen brand promotion and build a good corporate image, enterprises have set up unified service channels, which has produced thousands of service numbers of all kinds, such as carrier service hotlines, bank service hotlines, QQ Doctor, and WeChat official accounts. These service numbers are numerous, have no obvious number features, and communicate frequently, so mobile phone forensic data is flooded with interference of this kind. How to automatically analyze and filter out the constantly growing set of service numbers has become a focus for improving the capability and efficiency of mobile phone forensic data analysis. Because service numbers keep increasing as enterprises and application types grow, and have no obvious number features, comparable analysis tools currently on the market mainly rely on manual review to judge numbers one by one, for example by filtering out service numbers by hand. These techniques cannot meet the complex analysis needs of real-world practice.
The prior art therefore has the following deficiencies. It relies entirely on manual review, which takes a great deal of time and is prone to misjudgments and omissions, greatly affecting the efficiency and quality of data analysis. Because associated numbers are all displayed first and then filtered manually, the presence of service numbers seriously degrades system responsiveness and user experience and increases the system's resource overhead and development cost. And because of service numbers, loading mobile phone forensic data into the database easily turns a large number of originally unrelated objects into related ones; storing these newly added relationships consumes a large amount of storage space and greatly increases the construction cost of the system.
Summary of the invention
The present invention is proposed in view of the above problems. It provides a method and apparatus for automatically analyzing service numbers that use a Gaussian distribution to analyze service numbers of various types accurately, reducing the interference of service numbers in the analysis of mobile phone forensic data, improving analysis efficiency, and helping staff quickly locate key leads for investigation and evidence collection.
According to one aspect of the present invention, a method for automatically analyzing service numbers is provided, comprising:
obtaining communication data;
extracting communication object features from the communication data to obtain a data set of communication objects;
classifying the data set of communication objects according to predetermined conditions to obtain an analysis set;
performing a Gaussian distribution calculation on the communication object counts in the analysis set to obtain the normal distribution of the analysis set;
judging, according to the position of a communication object count of the analysis set within the normal distribution of the analysis set, whether the data set corresponds to a service number.
Optionally, the communication object has attributes, the attributes comprising communication region, communication direction, communication period, and communication type.
Optionally, obtaining the data set of communication objects comprises: extracting communication object features from the communication data based on the attributes of the communication objects, to obtain the data set of communication objects.
Optionally, the analysis set comprises at least one of: a set of local service numbers active during normal working hours, a set of local around-the-clock service numbers, a set of harassing service numbers, and a set of nationwide service numbers.
Optionally, judging whether the data set corresponds to a service number comprises: if the communication object count in the analysis set is positioned, in the normal distribution, in the region to the right of the range within n standard deviations of the mean, where n is a natural number, determining that the communication object is a service number.
Optionally, the method further comprises: verifying the judgment result.
Optionally, verifying the judgment result comprises:
extracting the remark information and/or communication content of the communication objects judged to be service numbers;
verifying, based on the remark information and a previously built name library of different service types, whether a communication object judged to be a service number is a service number; and/or
verifying, based on the communication content of a communication object judged to be a service number and a previously built keyword library of communication content for different service types, whether that communication object is a service number.
Optionally, verifying the judgment result further comprises: if it cannot be verified, based on the name library and the keyword library, whether a communication object judged to be a service number is a service number, sending it to the user for judgment.
Optionally, extracting the remark information of the communication objects judged to be service numbers comprises: building a remark data set from the extracted remark information of those communication objects.
Optionally, extracting the communication content of the communication objects judged to be service numbers comprises: building a communication content set from the extracted communication content of those communication objects.
Optionally, verifying, based on the remark information and the name library, whether a communication object judged to be a service number is a service number comprises:
comparing the remark data set with the name library; if the two intersect, the verification result is that the communication object is a service number.
Optionally, verifying, based on the communication content of a communication object judged to be a service number and the keyword library, whether that communication object is a service number comprises:
comparing the communication content set with the keyword library; if the communication content set contains content from the keyword library, the verification result is that the communication object is a service number.
Optionally, the method further comprises: saving the data set of the communication object and the corresponding verification result, wherein the verification result comprises whether the communication object is a service number and/or the reliability of the verification result.
Optionally, saving the data set of the communication object and the corresponding judgment result or verification result comprises: storing individual communication records in a full-text database and storing association relationships in a graph database.
Optionally, saving the data set of the communication object and the corresponding judgment result or verification result comprises: when the judgment result or verification result is that the communication object is a service number, tagging the communication object with the label "service number".
According to another aspect of the present invention, an apparatus for automatically analyzing service numbers is provided, the apparatus comprising:
a data acquisition module for obtaining communication data;
a data set module for extracting communication object features from the communication data to obtain a data set of communication objects;
an analysis set module for classifying the data set of communication objects according to predetermined conditions to obtain an analysis set;
a calculation module for performing a Gaussian distribution calculation on the communication object counts in the analysis set to obtain the normal distribution of the analysis set;
a judgment module for judging, according to the position of a communication object count of the analysis set within the normal distribution of the analysis set, whether the data set corresponds to a service number.
Optionally, the data acquisition module may be further configured to: obtain the communication data from the historical communication data of a communication tool.
Optionally, the communication object has attributes, the attributes comprising communication region, communication direction, communication period, and communication type.
Optionally, the data set module may be further configured to: extract communication object features from the communication data based on the attributes of the communication objects, to obtain the data set of communication objects.
Optionally, the analysis set comprises at least one of: a set of local service numbers active during normal working hours, a set of local around-the-clock service numbers, a set of harassing service numbers, and a set of nationwide service numbers.
Optionally, the judgment module may be further configured to: if the communication object count in the analysis set is positioned, in the normal distribution, in the region to the right of the range within n standard deviations of the mean, where n is a natural number, determine that the communication object is a service number.
Optionally, the apparatus further comprises: a storage module for saving the data set of the communication object and the corresponding judgment result, wherein the judgment result comprises whether the communication object is a service number and/or the reliability of the judgment result.
Optionally, the apparatus further comprises: a verification module for verifying the judgment result.
Optionally, verifying the judgment result comprises:
extracting the remark information and/or communication content of the communication objects judged to be service numbers;
verifying, based on the remark information and a previously built name library of different service types, whether a communication object judged to be a service number is a service number; and/or
verifying, based on the communication content of a communication object judged to be a service number and a previously built keyword library of communication content for different service types, whether that communication object is a service number.
Optionally, verifying the judgment result further comprises: if it cannot be verified, based on the name library and the keyword library, whether a communication object judged to be a service number is a service number, sending it to the user for judgment.
Optionally, extracting the remark information of the communication objects judged to be service numbers comprises: building a remark data set from the extracted remark information of those communication objects.
Optionally, extracting the communication content of the communication objects judged to be service numbers comprises: building a communication content set from the extracted communication content of those communication objects.
Optionally, verifying, based on the remark information and the name library, whether a communication object judged to be a service number is a service number comprises:
comparing the remark data set with the name library; if the two intersect, the verification result is that the communication object is a service number.
Optionally, verifying, based on the communication content of a communication object judged to be a service number and the keyword library, whether that communication object is a service number comprises:
comparing the communication content set with the keyword library; if the communication content set contains content from the keyword library, the verification result is that the communication object is a service number.
Optionally, the storage module is configured to: save the data set of the communication object and the corresponding verification result, wherein the verification result comprises whether the communication object is a service number and/or the reliability of the verification result.
Optionally, the storage module is further configured to: save the data set of the communication object and the corresponding judgment result or verification result.
Optionally, the storage module is further configured to: store individual communication records in a full-text database and store association relationships in a graph database.
Optionally, saving the data set of the communication object and the corresponding judgment result or verification result comprises: when the judgment result or verification result is that the communication object is a service number, tagging the communication object with the label "service number".
According to another aspect of the present invention, a system for automatically analyzing service numbers is provided, comprising a memory, a processor, and a computer program stored on the memory and run on the processor; the processor implements the steps of the above method when executing the computer program.
According to another aspect of the present invention, a computer-readable storage medium is provided, on which a computer program is stored; the steps of the above method are implemented when the computer program is executed by a computer.
The method, system, and storage medium for automatically analyzing service numbers according to embodiments of the present invention overcome the problem that service numbers cannot be matched because of characteristics such as time differences and individual differences, improve the accuracy of the analysis results with the help of base libraries such as remarks and number home regions, and enable automatic analysis and extraction of service numbers.
Brief description of the drawings
The above and other objects, features, and advantages of the present invention will become more apparent from the following more detailed description of embodiments of the present invention in conjunction with the accompanying drawings. The drawings provide a further understanding of the embodiments of the present invention and form part of the specification; together with the embodiments they serve to explain the present invention and do not limit it. In the drawings, identical reference labels generally denote identical components or steps.
Fig. 1 is a schematic flowchart of a method for automatically analyzing service numbers according to an embodiment of the present invention;
Fig. 2 is an example of a normal distribution according to an embodiment of the present invention;
Fig. 3 is a schematic flowchart of an example of verifying the judgment result according to an embodiment of the present invention;
Fig. 4 is a schematic flowchart of an example of the method for automatically analyzing service numbers according to an embodiment of the present invention;
Fig. 5 is a schematic flowchart of the apparatus for automatically analyzing service numbers according to an embodiment of the present invention.
Detailed description of embodiments
To make the objects, technical solutions, and advantages of the present invention clearer, example embodiments of the present invention are described in detail below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention, and it should be understood that the present invention is not limited by the example embodiments described here. All other embodiments obtained by those skilled in the art on the basis of the embodiments described herein, without creative effort, fall within the scope of protection of the present invention.
An embodiment of the present invention proposes a method for automatically analyzing service numbers. A method 100 for automatically analyzing service numbers according to an embodiment of the present invention is described with reference to Fig. 1. The method 100 comprises:
first, in step S110, obtaining communication data;
in step S120, extracting communication object features from the communication data to obtain a data set of communication objects;
in step S130, classifying the data set of communication objects according to predetermined conditions to obtain an analysis set;
in step S140, performing a Gaussian distribution calculation on the communication object counts in the analysis set to obtain the normal distribution of the analysis set;
in step S150, judging, according to the position of a communication object count of the analysis set within the normal distribution of the analysis set, whether the data set corresponds to a service number.
The method for automatically analyzing service numbers proposed by the embodiment of the present invention overcomes the problem that service numbers cannot be matched because of characteristics such as time differences and individual differences, improves the accuracy of the analysis results with the help of base libraries such as remarks and number home regions, and enables automatic analysis and extraction of service numbers.
According to an embodiment of the present invention, step S110 may further comprise: obtaining the communication data from the historical communication data of a communication tool.
The historical communication data of the communication tool includes call records and SMS records on a mobile phone, and the communication and call information of instant messaging applications.
In one embodiment, the historical communication data of the communication tool includes the call records and/or SMS records of communication objects in a mobile phone communication library. A communication object is an object that has communicated with the user (for example, a mobile phone number or a virtual communication account that has communicated with the user). As shown in Tables 1 and 2, Table 1 shows the communication data obtained from call records, and Table 2 shows the communication data obtained from SMS records.
Table 1: Call record table
Table 2: SMS record table
In another embodiment, the historical communication data of the communication tool includes the communication records of communication objects in a virtual identity library. As shown in Tables 3 and 4, both tables show the communication data obtained from instant messaging communication records.
Table 3: IM chat information
Table 4: IM call information
Optionally, the communication object has attributes, the attributes comprising communication region, communication direction, communication period, and communication type.
According to an embodiment of the present invention, step S120 may further comprise: extracting communication object features from the communication data based on the attributes of the communication objects, to obtain the data set of communication objects.
The data set S of communication objects comprises n subsets {S1, S2, ..., Sn}. The data items in each subset are: communication object count, home region, communication direction, communication period, number type, and communication type. The data items of each subset are aggregated around a single communication object from all of its historical data. For example, the communication object subset Sn (communication object 13022334455) contains: communication object count: 231; home region: Fuzhou, Fujian; communication direction: incoming | outgoing; communication period: daytime | evening | early morning; number type: mobile phone; communication type: temporary | friend.
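As a minimal sketch of how such a per-object data set might be assembled, the Python below folds raw communication records into one entry per communication object. The record field names (`owner`, `object`, `region`, `direction`, `period`, `comm_type`) and the reading of the communication object count as the number of distinct parties seen communicating with the object are assumptions made for illustration; the source specifies neither.

```python
from collections import defaultdict


def build_data_set(records):
    """Aggregate raw records into one feature entry per communication object.

    Each record is a dict with hypothetical keys:
    owner, object, region, direction, period, comm_type.
    """
    data_set = defaultdict(lambda: {
        "peers": set(),      # distinct parties seen communicating with the object
        "region": set(),
        "direction": set(),  # "in" / "out"
        "period": set(),     # "daytime" / "evening" / "early_morning"
        "type": set(),       # "temporary" / "friend"
    })
    for r in records:
        entry = data_set[r["object"]]
        entry["peers"].add(r["owner"])
        entry["region"].add(r["region"])
        entry["direction"].add(r["direction"])
        entry["period"].add(r["period"])
        entry["type"].add(r["comm_type"])
    # Assumption: the communication object count is the number of distinct peers.
    return {obj: {**e, "count": len(e["peers"])} for obj, e in data_set.items()}


records = [
    {"owner": "A", "object": "13022334455", "region": "Fuzhou, Fujian",
     "direction": "in", "period": "daytime", "comm_type": "temporary"},
    {"owner": "B", "object": "13022334455", "region": "Fuzhou, Fujian",
     "direction": "out", "period": "evening", "comm_type": "friend"},
]
data_set = build_data_set(records)
```

Keeping each attribute as a set of observed values mirrors the "incoming | outgoing" style of the example subset, where one object can carry several values per attribute.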
According to an embodiment of the present invention, in step S130 the analysis set may comprise at least one of: a set of local service numbers active during normal working hours, a set of local around-the-clock service numbers, a set of harassing service numbers, and a set of nationwide service numbers.
Local service numbers active during normal working hours: for example, local logistics and courier numbers. Numbers of this type are local numbers that communicate only at normal times of day, so a new analysis set Sa is generated by setting the subset condition (home region: local; communication period: daytime | evening);
Local around-the-clock service numbers: for example, local ride-hailing numbers (such as Didi). Numbers of this type are not saved as friends by ordinary users and have a local home region, so a new analysis set Sb is generated by setting the subset condition (home region: local; communication type: temporary);
Harassing service numbers: for example, stock-tip or fraud numbers. Numbers of this type are not saved as friends by ordinary users and communicate in one direction only, so a new analysis set Sc is generated by setting the subset condition (communication direction: incoming; communication type: temporary);
Nationwide service numbers: for example, bank service numbers. Numbers of this type are not saved as friends by ordinary users, so a new analysis set Sd is generated by setting the subset condition (communication type: temporary).
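The four subset conditions above can be sketched as a small rule table. This is an illustration under stated assumptions, not the patented implementation: the attribute labels, the value of `LOCAL`, and the choice to treat a condition as satisfied when every observed value of the attribute falls inside the allowed values (so "direction: incoming" means incoming only) are all assumptions.

```python
LOCAL = "Fuzhou, Fujian"  # hypothetical "local" region

# One rule per analysis set; attribute names and values are illustrative.
CONDITIONS = {
    "Sa": {"region": {LOCAL}, "period": {"daytime", "evening"}},
    "Sb": {"region": {LOCAL}, "type": {"temporary"}},
    "Sc": {"direction": {"in"}, "type": {"temporary"}},
    "Sd": {"type": {"temporary"}},
}


def satisfies(features, condition):
    """True if, for each attribute in the condition, the object has at least
    one observed value and all observed values are among the allowed ones."""
    return all(
        features[attr] and features[attr] <= allowed
        for attr, allowed in condition.items()
    )


def classify(data_set):
    """Place every communication object into each analysis set it satisfies."""
    analysis_sets = {name: {} for name in CONDITIONS}
    for obj, features in data_set.items():
        for name, condition in CONDITIONS.items():
            if satisfies(features, condition):
                analysis_sets[name][obj] = features
    return analysis_sets


data_set = {
    "courier": {"region": {LOCAL}, "direction": {"in"},
                "period": {"daytime"}, "type": {"temporary"}},
    "friend_phone": {"region": {LOCAL}, "direction": {"in", "out"},
                     "period": {"daytime", "early_morning"}, "type": {"friend"}},
    "95588": {"region": {"Beijing"}, "direction": {"in", "out"},
              "period": {"daytime"}, "type": {"temporary"}},
}
analysis_sets = classify(data_set)
```

An object can land in more than one analysis set under this reading; whether the sets are meant to be disjoint is not stated in the source.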
According to an embodiment of the present invention, step S150 may further comprise: if the communication object count in the analysis set is positioned, in the normal distribution, in the region to the right of the range within n standard deviations of the mean, where n is a natural number, determining that the communication object is a service number.
In one embodiment, Fig. 2 shows an example of a normal distribution according to an embodiment of the present invention. Taking n = 3, as shown in Fig. 2, when the communication object count is positioned, in the normal distribution, in the region to the right of the range within 3 standard deviations of the mean, the communication object is determined to be a service number.
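A hedged sketch of this judgment: fit the mean μ and standard deviation σ of the communication object counts in one analysis set, then flag any object whose count exceeds μ + nσ. Reading "the region to the right of the range within n standard deviations" as count > μ + nσ, and using the population standard deviation, are assumptions; the function and variable names are hypothetical.

```python
from statistics import mean, pstdev


def flag_service_numbers(counts, n=3):
    """Return the objects whose count lies more than n standard deviations
    above the mean of the analysis set.

    counts: dict mapping communication object -> communication object count.
    """
    values = list(counts.values())
    mu = mean(values)
    sigma = pstdev(values)  # population standard deviation (assumption)
    if sigma == 0:
        return set()        # all counts identical: nothing stands out
    threshold = mu + n * sigma
    return {obj for obj, count in counts.items() if count > threshold}


# 50 ordinary objects with small counts plus one heavily contacted number.
counts = {f"user{i}": 3 + i % 5 for i in range(50)}
counts["10086"] = 500
flagged = flag_service_numbers(counts, n=3)
```

One design caveat: with the population standard deviation, a single value among N can lie at most sqrt(N - 1) standard deviations from the mean, so with n = 3 an analysis set needs more than 10 objects before anything can be flagged.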
Optionally, the method 100 further comprises: saving the data set of the communication object and the corresponding judgment result, wherein the judgment result comprises whether the communication object is a service number and/or the reliability of the judgment result.
Optionally, Fig. 3 shows a schematic flowchart of an example of verifying the judgment result. The method 100 further comprises: verifying the judgment result.
Optionally, verifying the judgment result comprises:
extracting the remark information and/or communication content of the communication objects judged to be service numbers;
verifying, based on the remark information and a previously built name library of different service types, whether a communication object judged to be a service number is a service number; and/or
verifying, based on the communication content of a communication object judged to be a service number and a previously built keyword library of communication content for different service types, whether that communication object is a service number.
Optionally, verifying the judgment result further comprises: if it cannot be verified, based on the name library and the keyword library, whether a communication object judged to be a service number is a service number, sending it to the user for judgment.
Here, the keyword library Gn of communication content for different service types covers, for example, carrier notices, bank notices, and system messages; the name library Mn of different service types covers, for example, taxi-hailing, logistics, and property management.
Optionally, extracting the remark information of the communication objects judged to be service numbers comprises: building a remark data set Bn from the extracted remark information of those communication objects.
Optionally, extracting the communication content of the communication objects judged to be service numbers comprises: building a communication content set Tn from the extracted communication content of those communication objects.
Optionally, verifying, based on the remark information and the name library, whether a communication object judged to be a service number is a service number comprises:
comparing the remark data set with the name library; if the two intersect, the verification result is that the communication object is a service number.
Optionally, verifying, based on the communication content of a communication object judged to be a service number and the keyword library, whether that communication object is a service number comprises:
comparing the communication content set with the keyword library; if the communication content set contains content from the keyword library, the verification result is that the communication object is a service number.
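The verification steps above can be sketched as follows, assuming the remark data set Bn and the name library Mn are sets of strings compared by exact intersection, and the communication content set Tn is a list of message texts searched for keyword substrings from Gn. The return labels and the sample library entries are hypothetical.

```python
def verify(remarks, contents, name_library, keyword_library):
    """Verify one communication object that was judged to be a service number.

    remarks:  set of remark strings saved for the object (Bn)
    contents: list of message texts exchanged with the object (Tn)
    Returns "service_number" if either library confirms the judgment,
    otherwise "needs_user_review" so the case is passed to the user.
    """
    # Name-library check: the remark set and the name library intersect.
    if remarks & name_library:
        return "service_number"
    # Keyword-library check: some message contains a library keyword.
    if any(keyword in text for text in contents for keyword in keyword_library):
        return "service_number"
    return "needs_user_review"


name_library = {"logistics", "taxi-hailing", "property management"}
keyword_library = {"bank notice", "system message"}

by_remark = verify({"logistics"}, [], name_library, keyword_library)
by_content = verify({"Uncle Li"}, ["[system message] your parcel has arrived"],
                    name_library, keyword_library)
undecided = verify({"Uncle Li"}, ["see you at eight"],
                   name_library, keyword_library)
```

Exact intersection is the literal reading of the source; a real remark such as "SF logistics courier" would only match under a looser substring comparison, which the source does not describe.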
Optionally, the method 100 further comprises: saving the data set of the communication object and the corresponding verification result, wherein the verification result comprises whether the communication object is a service number and/or the reliability of the verification result.
Optionally, saving the data set of the communication object and the corresponding judgment result or verification result comprises: storing individual communication records in a full-text database and storing association relationships in a graph database.
Optionally, saving the data set of the communication object and the corresponding judgment result or verification result comprises: when the judgment result or verification result is that the communication object is a service number, tagging the communication object with the label "service number".
In this storage scheme, individual communication records are stored in a full-text database (Elasticsearch) and association relationships are stored in a graph database (Titan); service numbers are only tagged, and their association relationships are not stored, which reduces storage space.
In one embodiment, the saved data set of the communications and liaison objects and the corresponding judging result or verification result are as shown in Table 5:
Table 5
The method for automatically analyzing service numbers according to the embodiment of the present invention relies on the observation that the number of communications and liaison objects of normal objects follows a normal distribution, whereas the number of communications and liaison objects of a service number deviates from the mean by significantly more than the standard deviation. Furthermore, the communications and liaison features of a service number depend on its holder: service numbers for different regions, businesses, and communications and liaison purposes have different service features. By constructing and analyzing data sets for the different types of service numbers, the accuracy of service number analysis is substantially improved.
In one embodiment, referring to Fig. 4, Fig. 4 shows an exemplary schematic flow chart for implementing the method for automatically analyzing service numbers according to an embodiment of the present invention. Specifically:
First, the communications and liaison data are obtained based on the historical communication data of communication tools. Specifically, these data include mobile phone call records, SMS messages, instant-messaging communications and liaison records, and instant-messaging call records extracted through mobile phone forensics.
Then, communications and liaison object features are extracted based on the communications and liaison data to obtain the data set of the communications and liaison objects. Specifically, features are extracted from the communications and liaison data along four dimensions (communications and liaison region, communications and liaison period, communications and liaison direction, and communications and liaison type), and the corresponding communications and liaison object data set is constructed.
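The four-dimension feature extraction for a single record might look like the sketch below. Everything concrete here is an assumption for illustration: the record keys (`peer`, `area_code`, `hour`, `incoming`, `in_contacts`), the hour cut-offs for the periods, and the use of an area code to decide whether a number is local are not specified in the source.

```python
def extract_features(record, local_area_code):
    """Map one raw communication record to the four feature dimensions.

    `record` is a hypothetical dict produced by a forensic extraction step.
    """
    hour = record["hour"]
    # Assumed period boundaries; the source names the periods but not their limits.
    if hour < 6:
        period = "early morning"
    elif hour < 18:
        period = "daytime"
    else:
        period = "night"
    return {
        "object": record["peer"],
        "region": "local" if record["area_code"] == local_area_code else "non-local",
        "period": period,
        "direction": "in" if record["incoming"] else "out",
        # A number saved in the address book counts as "good friend", otherwise "interim".
        "type": "good friend" if record["in_contacts"] else "interim",
    }
```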
Then, the data set of the communications and liaison objects is classified according to predetermined conditions to obtain analysis sets. Specifically:
For local numbers that communicate only during normal hours, the predetermined condition is set to include: affiliated area is local; communications and liaison period is daytime | night. This generates the service number set Sa of local normal working hours;
For local numbers that a normal person would not save as a good friend, the predetermined condition is set to include: affiliated area is local; communications and liaison type is interim. This generates the service number set Sb of local all-weather;
For numbers that a normal person would not save as a good friend and that communicate in one direction only, the predetermined condition is set to include: communications and liaison direction is in; communications and liaison type is interim. This generates the service number set Sc of harassing property;
For numbers that a normal person would not save as a good friend, the predetermined condition is set to include: communications and liaison type is interim. This generates the national service number set Sd.
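The four predetermined conditions can be written as predicates over the per-object data items, as in the sketch below. The dictionary layout and the reading of "daytime | night" as "no communication outside daytime and night" are assumptions of this sketch; the source lists the conditions but not how they are evaluated.

```python
# Each condition takes one object's data items (a hypothetical dict layout).
CONDITIONS = {
    # Sa: local, and only seen during daytime or night (assumed interpretation).
    "Sa": lambda s: s["region"] == "local" and s["periods"] <= {"daytime", "night"},
    # Sb: local, never saved as a good friend.
    "Sb": lambda s: s["region"] == "local" and s["types"] == {"interim"},
    # Sc: inbound only, never saved as a good friend.
    "Sc": lambda s: s["directions"] == {"in"} and s["types"] == {"interim"},
    # Sd: never saved as a good friend, regardless of region.
    "Sd": lambda s: s["types"] == {"interim"},
}


def classify(dataset):
    """Return {set name: {number: data items}}; one object may fall into several sets."""
    return {
        name: {num: s for num, s in dataset.items() if cond(s)}
        for name, cond in CONDITIONS.items()
    }
```

Keeping the conditions in a table makes it straightforward to add a new analysis set without touching the classification loop.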
Then, a Gaussian distribution calculation is performed on the number of communications and liaison objects in the analysis sets to obtain the normal distribution of each analysis set. Specifically, the Gaussian distribution method is used to calculate the normal distribution curve of the number of communications and liaison objects in each of the above sets Sa, Sb, Sc, and Sd.
Then, based on the characteristic that a service number has far more communications and liaison objects than a normal number, the region to the right of the range within 3 standard deviations of the mean is taken, and the data sets falling in that region are judged to be service numbers; the judging result is saved.
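A minimal sketch of the distribution step and the 3-standard-deviation rule follows. It assumes that "Gaussian distribution calculation" means estimating the mean and standard deviation of the object counts in one analysis set; the function name and the choice of the population standard deviation are assumptions made here.

```python
from statistics import mean, pstdev


def flag_service_numbers(object_counts, n_sigma=3.0):
    """Return the numbers whose object count lies right of mean + n_sigma * std.

    `object_counts` maps each number in one analysis set to its number of
    communications and liaison objects.
    """
    counts = list(object_counts.values())
    if len(counts) < 2:
        return set()  # no meaningful spread can be estimated
    mu = mean(counts)
    sigma = pstdev(counts)  # population standard deviation (assumed choice)
    threshold = mu + n_sigma * sigma
    return {num for num, count in object_counts.items() if count > threshold}
```

One practical caveat: with the population standard deviation, no value in a sample of m points can lie more than sqrt(m - 1) standard deviations from the mean, so an analysis set needs more than 10 objects before anything can exceed the 3-standard-deviation threshold.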
Then, the judging result is verified. Specifically: the remark information of the communications and liaison object whose judging result is service number is extracted; based on the remark information and the previously built namebase of different service types, it is verified whether that communications and liaison object is a service number; if the verification result is service number, the communications and liaison object is tagged with the "service object" label and the verification result is saved;
If the verification result is not service number, the communications and liaison content of the communications and liaison object whose judging result is service number is extracted; based on that communications and liaison content and the previously built keywords database of the communications and liaison content of different service types, it is verified whether the communications and liaison object is a service number; if the verification result is service number, the communications and liaison object is tagged with the "service object" label and the verification result is saved;
If the verification result is still not service number, the communications and liaison object is sent to the user for manual review.
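The verification order described above (namebase first, keywords database second, manual review last) can be sketched as a short cascade. The function name, the return convention, and the sample data are assumptions for illustration only.

```python
def verify_candidate(remarks, messages, namebase, keywords):
    """Return (label, stage) for one candidate flagged by the distribution step.

    label is "service object" when a stage confirms the candidate, else None;
    stage records which check decided the outcome.
    """
    # Stage 1: remark information against the namebase.
    if set(remarks) & set(namebase):
        return "service object", "namebase"
    # Stage 2: communication content against the keywords database.
    if any(kw in msg for msg in messages for kw in keywords):
        return "service object", "keywords"
    # Stage 3: neither check confirmed the candidate, so hand it to a person.
    return None, "manual review"
```

Returning the deciding stage alongside the label is a design choice of this sketch; it would let a caller record how each verification result was reached.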
With the method for automatically analyzing service numbers according to the embodiment of the present invention, a business rule base for service numbers is constructed and combined with the large communications and liaison libraries and virtual identity libraries found on mobile phones, so that a computer program automatically and accurately analyzes the various types of service numbers. This reduces the interference of service numbers with the analysis of mobile phone forensic data, improves analysis efficiency, and helps staff quickly locate key clues for investigation and evidence collection. The communications and liaison libraries and virtual identity libraries of different mobile phones can also be processed and analyzed effectively to identify service numbers intelligently, which reduces construction cost and improves the response performance of the system.
Referring to Fig. 5, Fig. 5 shows the device for automatically analyzing service numbers according to an embodiment of the present invention. The device 500 includes:
Data acquisition module 510, for obtaining communications and liaison data;
Data set module 520, for extracting communications and liaison object features based on the communications and liaison data to obtain the data set of the communications and liaison objects;
Analysis set module 530, for classifying the data set of the communications and liaison objects according to a predetermined condition to obtain an analysis set;
Computing module 540, for performing a Gaussian distribution calculation on the number of communications and liaison objects in the analysis set to obtain the normal distribution of the analysis set;
Judgment module 550, for judging whether the data set is a service number according to the position of the number of communications and liaison objects in the analysis set within the normal distribution of the analysis set.
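The five modules form a linear pipeline, which could be wired together as in the sketch below. The class name, the callable signatures, and the `run` method are hypothetical; the source defines the modules' responsibilities but not their interfaces.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class ServiceNumberAnalyzer:
    acquire: Callable[[], Any]           # data acquisition module 510
    build_dataset: Callable[[Any], Any]  # data set module 520
    classify: Callable[[Any], Any]       # analysis set module 530
    fit: Callable[[Any], Any]            # computing module 540
    judge: Callable[[Any, Any], Any]     # judgment module 550

    def run(self):
        """Pass the output of each module to the next one in order."""
        data = self.acquire()
        dataset = self.build_dataset(data)
        analysis_sets = self.classify(dataset)
        distributions = self.fit(analysis_sets)
        return self.judge(analysis_sets, distributions)
```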
The device for automatically analyzing service numbers proposed by the embodiment of the present invention overcomes the problem that service numbers cannot be matched because of characteristics such as time differences and individual differences, reduces the interference of service numbers with the analysis of mobile phone forensic data, improves analysis efficiency, reduces construction cost, and improves the response performance of the system.
According to an embodiment of the present invention, the data acquisition module 510 can be further used for: obtaining the communications and liaison data based on the historical communication data of communication tools.
Here, the historical communication data of the communication tools include call records and SMS records on the mobile phone, as well as communications and liaison records and call information of instant-messaging applications.
Optionally, the communications and liaison object has attributes, and the attributes include communications and liaison region, communications and liaison direction, communications and liaison period, and communications and liaison type.
According to an embodiment of the present invention, the data set module 520 can be further used for: extracting communications and liaison object features from the communications and liaison data based on the attributes of the communications and liaison object to obtain the data set of the communications and liaison objects.
Here, the data set S of the communications and liaison objects includes n subsets {S1, S2, ..., Sn}. The data items of each subset are: number of communications and liaison objects, affiliated area, communications and liaison direction, communications and liaison period, affiliated type, and communications and liaison type. The data items of each subset are centered on one communications and liaison object and are superimposed from all historical data. For example, the communications and liaison object subset Sn (communications and liaison object 13022334455) includes: number of communications and liaison objects: 231; affiliated area: Fuzhou, Fujian; communications and liaison direction: in | out; communications and liaison period: daytime | night | early morning; affiliated type: mobile phone; communications and liaison type: interim | good friend.
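The "superimposed from all historical data" behavior can be illustrated with a small data class whose partial views of the same object are merged by set union. The class, its field names, and the idea of deriving the object count from a set of distinct counterparts are assumptions of this sketch.

```python
from dataclasses import dataclass, field


@dataclass
class ObjectSubset:
    number: str                                        # the communications and liaison object
    region: str = ""                                   # affiliated area
    number_type: str = ""                              # affiliated type, e.g. "mobile phone"
    peers: set = field(default_factory=set)            # distinct counterparts seen so far
    directions: set = field(default_factory=set)       # subset of {"in", "out"}
    periods: set = field(default_factory=set)          # e.g. {"daytime", "night"}
    contact_types: set = field(default_factory=set)    # subset of {"interim", "good friend"}

    @property
    def object_count(self):
        """Number of communications and liaison objects for this number."""
        return len(self.peers)

    def merge(self, other):
        """Superimpose another partial view of the same object onto this one."""
        if other.number != self.number:
            raise ValueError("subsets describe different objects")
        return ObjectSubset(
            number=self.number,
            region=self.region or other.region,
            number_type=self.number_type or other.number_type,
            peers=self.peers | other.peers,
            directions=self.directions | other.directions,
            periods=self.periods | other.periods,
            contact_types=self.contact_types | other.contact_types,
        )
```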
According to an embodiment of the present invention, the analysis set includes at least one of: the service number set of local normal working hours, the service number set of local all-weather, the service number set of harassing property, and the national service number set.
Here, service numbers of local normal working hours are, for example, local logistics and express delivery numbers. The characteristic of this type of number is that it is a local number that communicates only during normal hours, so a new analysis set Sa is generated by setting the subset condition (affiliated area: local; communications and liaison period: daytime | night);
Service numbers of local all-weather are, for example, local DiDi ride-hailing service numbers. The characteristic of this type of number is that a normal person would not save it as a good friend and that the number is attributed to the local area, so a new analysis set Sb is generated by setting the subset condition (affiliated area: local; communications and liaison type: interim);
Service numbers of harassing property are, for example, stock recommendation or fraud numbers. The characteristic of this type of number is that a normal person would not save it as a good friend and that it communicates in one direction only, so a new analysis set Sc is generated by setting the subset condition (communications and liaison direction: in; communications and liaison type: interim);
National service numbers are, for example, bank service numbers. The characteristic of this type of number is that a normal person would not save it as a good friend, so a new analysis set Sd is generated by setting the subset condition (communications and liaison type: interim).
According to an embodiment of the present invention, the judgment module 550 can be further used for: if the position, in the normal distribution, of the number of communications and liaison objects in the analysis set lies in the region to the right of the range within n standard deviations of the mean, n being a natural number, determining that the communications and liaison object is a service number.
Optionally, the device 500 further includes: memory module 560, for saving the data set of the communications and liaison objects and the corresponding judging result. Here, the judging result includes whether the communications and liaison object is a service number and/or the reliability of the judging result.
Optionally, the device 500 further includes: authentication module 570, for verifying the judging result.
Optionally, verifying the judging result includes:
Extracting the remark information and/or communications and liaison content of the communications and liaison object whose judging result is service number;
Verifying, based on the remark information and the previously built namebase of different service types, whether the communications and liaison object whose judging result is service number is a service number; and/or
Verifying, based on the communications and liaison content of the communications and liaison object whose judging result is service number and the previously built keywords database of the communications and liaison content of different service types, whether that communications and liaison object is a service number.
Optionally, verifying the judging result further includes: if it cannot be verified, based on the namebase and the keywords database, whether the communications and liaison object whose judging result is service number is a service number, sending the communications and liaison object to the user for judgment.
Here, the keywords database Gn of the communications and liaison content of different service types contains entries such as carrier announcements, bank notices, and system messages; the namebase Mn of different service types contains entries such as taxi-hailing, logistics, and property management.
Optionally, extracting the remark information of the communications and liaison object whose judging result is service number includes: extracting the remark information of each such communications and liaison object and constructing a remarks data set Bn.
Optionally, extracting the communications and liaison content of the communications and liaison object whose judging result is service number includes: extracting the communications and liaison content of each such communications and liaison object and constructing a communications and liaison content set Tn.
Optionally, verifying, based on the remark information and the namebase, whether the communications and liaison object whose judging result is service number is a service number includes:
Comparing the remarks data set with the namebase; if the two intersect, the verification result is that the communications and liaison object is a service number.
Optionally, verifying, based on the keywords database and the communications and liaison content of the communications and liaison object whose judging result is service number, whether that communications and liaison object is a service number includes:
Comparing the communications and liaison content set with the keywords database; if the communications and liaison content set contains content from the keywords database, the verification result is that the communications and liaison object is a service number.
Optionally, the memory module 560 is used for: saving the data set of the communications and liaison objects and the corresponding verification result. Here, the verification result includes whether the communications and liaison object is a service number and/or the reliability of the verification result.
Optionally, the memory module 560 is also used for: saving the data set of the communications and liaison objects and the corresponding judging result or verification result.
Optionally, the memory module 560 is also used for: storing each individual communications and liaison record in a full-text search database, and storing the association relationships in a graph database.
Optionally, saving the data set of the communications and liaison objects and the corresponding judging result or verification result includes: when the judging result or verification result is that the communications and liaison object is a service number, tagging the communications and liaison object with the "service number" label.
Here, storing each individual communications and liaison record in a full-text search database (Elasticsearch) and the association relationships in a graph database (Titan), while only tagging service numbers and not storing their association relationships, reduces the storage space required.
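The storage policy can be sketched with in-memory stand-ins. No real Elasticsearch or Titan client calls are shown: the list, set, and dict below are placeholders chosen for this illustration, and the function name and arguments are hypothetical.

```python
def save_results(records, relations, service_numbers, fulltext_store, graph_store, tags):
    """Apply the storage policy: index every record, tag service numbers,
    and skip association relationships that involve a service number.

    fulltext_store (list) stands in for the full-text search database,
    graph_store (set of pairs) for the graph database, tags (dict) for labels.
    """
    for record in records:
        fulltext_store.append(record)
    for a, b in relations:
        if a in service_numbers or b in service_numbers:
            continue  # service numbers are tagged only; their edges are not stored
        graph_store.add((a, b))
    for number in service_numbers:
        tags[number] = "service number"
```

Because a service number is typically connected to a very large number of objects, leaving its edges out of the graph store is where the storage saving would come from.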
In one embodiment, the memory module 560 saves the data set of the communications and liaison objects and the corresponding judging result or verification result as shown in Table 5:
Property Name  | Attribute description | Remarks
SrcAccountId   | Account ID            |
AccountType    | Account type          | 1 - service number; 2 - normal number
AccountNum     | Account               |
NickName       | Nickname              |
AreaCode       | Home location         |
Table 5
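The record layout of Table 5 could be represented as follows. The Python class names, the snake_case field names, and the choice of string types are assumptions; only the property names and the two AccountType codes come from the table.

```python
from dataclasses import dataclass
from enum import IntEnum


class AccountType(IntEnum):
    SERVICE_NUMBER = 1
    NORMAL_NUMBER = 2


@dataclass
class AccountRecord:
    src_account_id: str        # SrcAccountId
    account_type: AccountType  # AccountType
    account_num: str           # AccountNum
    nick_name: str = ""        # NickName
    area_code: str = ""        # AreaCode

    def to_row(self):
        """Return the record keyed by the property names used in Table 5."""
        return {
            "SrcAccountId": self.src_account_id,
            "AccountType": int(self.account_type),
            "AccountNum": self.account_num,
            "NickName": self.nick_name,
            "AreaCode": self.area_code,
        }
```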
The device for automatically analyzing service numbers according to the embodiment of the present invention relies on the observation that the number of communications and liaison objects of normal objects follows a normal distribution, whereas the number of communications and liaison objects of a service number deviates from the mean by significantly more than the standard deviation. Furthermore, the communications and liaison features of a service number depend on its holder: service numbers for different regions, businesses, and communications and liaison purposes have different service features. By constructing and analyzing data sets for the different types of service numbers, the accuracy of service number analysis is substantially improved.
Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. Skilled professionals may use different methods to implement the described functions for each specific application, but such implementations should not be considered to go beyond the scope of the present invention.
According to an embodiment of the present invention, a system for automatically analyzing service numbers is also provided, including a memory, a processor, and a computer program stored on the memory and run on the processor, characterized in that the processor implements the steps of the above method when executing the computer program.
In addition, according to an embodiment of the present invention, a computer readable storage medium is also provided. Program instructions are stored on the storage medium, and when the program instructions are run by a computer or processor, they execute the corresponding steps of the method for automatically analyzing service numbers of the embodiment of the present invention. The storage medium may include read-only memory, erasable programmable read-only memory, other types of memory, or any combination of the above storage media.
The method, device, system, and storage medium for automatically analyzing service numbers according to the embodiments of the present invention rely on the observation that the number of communications and liaison objects of normal objects follows a normal distribution, whereas the number of communications and liaison objects of a service number deviates from the mean by significantly more than the standard deviation. Furthermore, the communications and liaison features of a service number depend on its holder: service numbers for different regions, businesses, and communications and liaison purposes have different service features, and constructing and analyzing data sets for the different types of service numbers substantially improves the accuracy of service number analysis. The embodiments also overcome the problem that service numbers cannot be matched because of characteristics such as time differences and individual differences, reduce the interference of service numbers with the analysis of mobile phone forensic data, improve analysis efficiency, reduce construction cost, and improve the response performance of the system. Service numbers are identified with high accuracy, the analysis adapts automatically to the service numbers of newly added application types, and the ability to analyze communications and liaison data is improved.
Although example embodiments have been described here with reference to the drawings, it should be understood that the above example embodiments are merely illustrative and are not intended to limit the scope of the present invention to them. Those of ordinary skill in the art can make various changes and modifications without departing from the scope and spirit of the present invention. All such changes and modifications are intended to be included within the scope of the present invention as claimed in the appended claims. Those of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are implemented in hardware or software depends on the specific application and the design constraints of the technical solution. Skilled professionals may use different methods to implement the described functions for each specific application, but such implementations should not be considered to go beyond the scope of the present invention.
In addition, those skilled in the art will understand that although some embodiments described herein include certain features that are included in other embodiments rather than other features, combinations of features of different embodiments are meant to fall within the scope of the present invention and to form different embodiments. For example, in the claims, any one of the claimed embodiments can be used in any combination.
The above is merely a description of specific embodiments of the present invention, and the protection scope of the present invention is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention should be covered by the protection scope of the present invention. The protection scope of the present invention should be subject to the protection scope of the claims.

Claims (11)

1. A method for automatically analyzing service numbers, characterized in that the method comprises:
Obtaining communications and liaison data;
Extracting communications and liaison object features based on the communications and liaison data to obtain a data set of communications and liaison objects;
Classifying the data set of the communications and liaison objects according to a predetermined condition to obtain an analysis set;
Performing a Gaussian distribution calculation on the number of communications and liaison objects in the analysis set to obtain the normal distribution of the analysis set;
Judging whether the data set is a service number according to the position of the number of communications and liaison objects in the analysis set within the normal distribution of the analysis set.
2. The method according to claim 1, characterized in that obtaining the data set of communications and liaison objects comprises: extracting communications and liaison object features from the communications and liaison data based on the attributes of the communications and liaison object to obtain the data set of the communications and liaison objects.
3. The method according to claim 1, characterized in that the analysis set comprises at least one of: the service number set of local normal working hours, the service number set of local all-weather, the service number set of harassing property, and the national service number set.
4. The method according to claim 3, characterized in that judging whether the data set is a service number comprises: if the position, in the normal distribution, of the number of communications and liaison objects in the analysis set lies in the region to the right of the range within n standard deviations of the mean, n being a natural number, determining that the communications and liaison object is a service number.
5. The method according to claim 1, characterized in that the method further comprises: verifying the judging result.
6. The method according to claim 5, characterized in that verifying the judging result comprises:
Extracting the remark information and/or communications and liaison content of the communications and liaison object whose judging result is service number;
Verifying, based on the remark information and the previously built namebase of different service types, whether the communications and liaison object whose judging result is service number is a service number; and/or
Verifying, based on the communications and liaison content of the communications and liaison object whose judging result is service number and the previously built keywords database of the communications and liaison content of different service types, whether that communications and liaison object is a service number.
7. The method according to claim 6, characterized in that verifying the judging result further comprises: if it cannot be verified, based on the namebase and the keywords database, whether the communications and liaison object whose judging result is service number is a service number, sending the communications and liaison object to the user for judgment.
8. The method according to claim 5, characterized in that the method further comprises saving the data set of the communications and liaison objects and the corresponding judging result and/or verification result.
9. A device for automatically analyzing service numbers, characterized in that the device comprises:
A data acquisition module, for obtaining communications and liaison data;
A data set module, for extracting communications and liaison object features based on the communications and liaison data to obtain a data set of communications and liaison objects;
An analysis set module, for classifying the data set of the communications and liaison objects according to a predetermined condition to obtain an analysis set;
A computing module, for performing a Gaussian distribution calculation on the number of communications and liaison objects in the analysis set to obtain the normal distribution of the analysis set;
A judgment module, for judging whether the data set is a service number according to the position of the number of communications and liaison objects in the analysis set within the normal distribution of the analysis set.
10. A system for automatically analyzing service numbers, including a memory, a processor, and a computer program stored on the memory and run on the processor, characterized in that the processor implements the steps of the method according to any one of claims 1 to 8 when executing the computer program.
11. A computer readable storage medium on which a computer program is stored, characterized in that the computer program implements the steps of the method according to any one of claims 1 to 8 when executed by a computer.
CN201811573549.4A 2018-12-21 2018-12-21 Method and device for automatically analyzing service number Active CN109857773B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811573549.4A CN109857773B (en) 2018-12-21 2018-12-21 Method and device for automatically analyzing service number

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811573549.4A CN109857773B (en) 2018-12-21 2018-12-21 Method and device for automatically analyzing service number

Publications (2)

Publication Number Publication Date
CN109857773A true CN109857773A (en) 2019-06-07
CN109857773B CN109857773B (en) 2022-03-01

Family

ID=66891972

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811573549.4A Active CN109857773B (en) 2018-12-21 2018-12-21 Method and device for automatically analyzing service number

Country Status (1)

Country Link
CN (1) CN109857773B (en)

Also Published As

Publication number Publication date
CN109857773B (en) 2022-03-01

Similar Documents

Publication Publication Date Title
CN106384273B (en) Malicious bill-swiping detection system and method
CN107248082B (en) Card maintenance identification method and device
CN102945366A (en) Method and device for face recognition
CN109740155A (en) Method and system for a self-induction model of artificial intelligence quality inspection rules for a customer service system
CN110443120A (en) Face recognition method and device
CN112199530B (en) Multi-dimensional face library picture automatic updating method, system, equipment and medium
CN102438205B (en) Method and system for pushing service based on action of mobile user
CN114742477B (en) Enterprise order data processing method, device, equipment and storage medium
CN106789292A (en) Abnormal behaviour monitoring method and device
CN108596559A (en) Automated task review method, device, equipment and storage medium
CN110609908A (en) Case serial-parallel method and device
CN110909129B (en) Abnormal complaint event identification method and device
CN108388672A (en) Video lookup method, device and computer-readable storage medium
CN110796014A (en) Garbage throwing habit analysis method, system and device and storage medium
CN109948489A (en) Face recognition system and method based on fusion of multi-frame video face features
CN109801394B (en) Staff attendance checking method and device, electronic equipment and readable storage medium
CN112508626A (en) Information processing method and device, electronic equipment and storage medium
CN108921433B (en) Risk quantitative analysis system based on business continuity
CN109857773A (en) A kind of method and apparatus automatically analyzing service number
CN115083004B (en) Identity recognition method and device and computer readable storage medium
CN114817518B (en) License handling method, system and medium based on big data archive identification
CN110162572A (en) Policy execution method, server and computer storage medium
CN111091047A (en) Living body detection method and device, server and face recognition equipment
CN107332806A (en) Method and device for setting a mobile device identifier
CN115527241A (en) Fingerprint template updating method and device, embedded equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant