CN109873908B - Junk call identification recognition method and device, computer equipment and storage medium - Google Patents

Junk call identification recognition method and device, computer equipment and storage medium Download PDF

Info

Publication number
CN109873908B
CN109873908B CN201711259287.XA CN201711259287A CN109873908B CN 109873908 B CN109873908 B CN 109873908B CN 201711259287 A CN201711259287 A CN 201711259287A CN 109873908 B CN109873908 B CN 109873908B
Authority
CN
China
Prior art keywords
marking
type
mark
calling number
record
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201711259287.XA
Other languages
Chinese (zh)
Other versions
CN109873908A (en
Inventor
王冼
廖振松
周小翠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
China Mobile Group Hubei Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
China Mobile Group Hubei Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd, China Mobile Group Hubei Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN201711259287.XA priority Critical patent/CN109873908B/en
Publication of CN109873908A publication Critical patent/CN109873908A/en
Application granted granted Critical
Publication of CN109873908B publication Critical patent/CN109873908B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Telephonic Communication Services (AREA)
  • Telephone Function (AREA)

Abstract

The invention discloses a junk phone identification recognition method and device, computer equipment and a storage medium. The method comprises the following steps: receiving a spam call identification from a terminal; acquiring the number marking content of a calling number from a junk telephone marking platform; if the calling number is marked as a plurality of mark record types by different called numbers, the marking times of each mark record type is adjusted, and the final mark type of the calling number is determined. The method can fully utilize Internet netizens to classify and identify the received strange numbers based on a crowdsourcing mechanism, establish a junk telephone marking platform to record the contents of the classified identifications, and perform type consistency processing on calling numbers with conflicting marking types, thereby determining a marking recording type which can reflect the number type most.

Description

Junk call identification recognition method and device, computer equipment and storage medium
Technical Field
The invention belongs to the technical field of data processing, and particularly relates to a junk phone identification recognition method, a junk phone identification recognition device, computer equipment and a computer storage medium.
Background
Along with the popularization of smart phones, junk calls and harassing calls tend to be in a blowout trend, the speed is increased obviously, the threat of user information security in the era of mobile internet after being used as a relay brain virus and phishing website brings great trouble to public life, and huge loss is caused to the privacy and economy of the public.
In the related technology, a user acquires a download link of an unknown number from a cloud, calls a telephone address book expansion interface to update a system after the download is finished, and can reduce the harassment probability of a called user by timely prompting or intercepting a new malicious number; or acquiring the number of times that the unknown incoming call number is marked as a harassing call, and judging whether the number of times is greater than or equal to an interception threshold value preset by a user to intercept the harassing call. In addition, due to the periodicity of the junk calls and the harassing calls, a recovery and updating mechanism of the marked numbers is not involved, and once the numbers are marked, the numbers cannot be connected. Therefore, how to realize the harassing call marking and simultaneously solve the influence of disordered number marking sources on the accuracy of harassing call type marking becomes an urgent problem to be solved.
Disclosure of Invention
The embodiment of the invention provides a junk phone identification method, a junk phone identification device, computer equipment and a storage medium, which can fully utilize Internet netizens to carry out classified identification on received strange numbers based on a crowdsourcing mechanism, establish a junk phone marking platform to record the contents of the classified identifications, and carry out type consistency processing on calling numbers with conflicting marking types, thereby determining a marking recording type which can embody the number type most, so that a user downloads a final marking type to know the type of the number when the user is called by the strange numbers, and the accuracy of type judgment is improved.
In one aspect, an embodiment of the present invention provides a method for identifying a spam call identifier, where the method includes: receiving a junk telephone identifier from a terminal, wherein the junk telephone identifier comprises a calling number; acquiring number marking contents of a calling number from a junk telephone marking platform, wherein the number marking contents comprise marking record types, marking times of each marking record type and a called number for marking the calling number; if the calling number is marked as a plurality of mark record types by different called numbers, the marking times of each mark record type is adjusted, and the final mark type of the calling number is determined.
In the above technical solution, preferably, the method further includes: and when a downloading instruction of the final mark type from the terminal is received, sending the final mark type to the terminal.
In the above technical solution, preferably, the method further includes: and within the first preset number of days, not receiving the junk phone identification of the calling number and not acquiring new number marking content of the calling number in the junk phone marking platform, and marking the final marking type of the calling number as overdue release.
In the above technical solution, preferably, the method further includes: if the number evaluation information is acquired from the junk telephone marking platform but the number marking content is not acquired, splitting the number evaluation information into a plurality of word segments; determining a mark record type corresponding to each participle according to the mark concept tree; the marked concept tree comprises the corresponding relation between each participle and the type of the marked record.
In the above technical solution, preferably, if the calling number is marked as a plurality of mark record types by different called numbers, the step of adjusting the number of marks of each mark record type and determining the final mark type of the calling number specifically includes: if the calling number is marked as a plurality of mark record types by different called numbers, the marking times of each mark record type is adjusted according to a first formula; updating the number marking content to generate a marking sequence; determining the final marking type of the calling number according to the marking sequence; wherein the first formula is
Figure BDA0001493100320000021
Wherein, M'icmNumber of marking times of the type recorded for the cm-th mark of the adjusted calling number i, MicmNumber of marking times of the cm-th marking record type for the calling number i before adjustment, MikcmThe marking times of the type of the cm-th marked record of the calling number i marked by the k-th called number are recorded, N is the number of the called numbers, and k belongs to N.
In the above technical solution, preferably, the step of updating the number mark content to generate the mark sequence specifically includes: updating the number mark content; acquiring the number marking content of the second preset number of days j to form a marking sequence { M }ijcm},Where j is an integer greater than 0, MijcmAnd recording the marking times of the type for the cm marks of the calling number i in the second preset number of days j.
In the foregoing technical solution, preferably, the step of determining the final type of the calling number according to the tag sequence specifically includes: according to the mark sequence { MijcmAnd calculating a secondary difference sequence { D' of each mark record type of the calling number according to a second formulaicm}; second order difference sequence { D' for each mark record typeicmCarrying out positive sequence sorting; judging whether a third formula is established; if the third formula is established, taking the first mark record type in positive sequence as the final mark type of the calling number; wherein the second formula is
Figure BDA0001493100320000031
The third formula is
F(D″icm)-S(D″icm)>S(D″icm)/4,
Wherein, D ″)icmRecording the quadratic difference of the type for the cm symbols of the calling number i, MijcmMarking times of marking record types for cm marks of the calling number i in the second preset number of days j, j belongs to B, B is an integer larger than 0, and F (D ″)icm) Second order difference of mark record type for positive order first, S (D ″)icm) The second mark recording type quadratic difference is arranged in positive order.
In the above technical solution, preferably, the method further includes: if the third formula does not hold, the second difference sequence { D ″')icmComparing each secondary difference with other secondary differences to obtain a plurality of comparison differences; if the comparison differences are smaller than a preset threshold value, the mark record type is sent to the terminal for type re-marking according to the damage degree of the mark record type; the type of the marking record type marked again by the receiving terminal is used as the final marking type of the calling number.
In the above technical solution, preferably, the method further includes: storing the spam call identification to a spam call marking platform; the spam telephone identification also comprises a current called number, a current mark type and mark time.
In the above technical solution, preferably, the mark record type and the current mark type include at least one of: fraud calls, a sound call, an advertisement call, a broker call, a courier call, a suspected fraud call, other calls.
On the other hand, the embodiment of the invention provides a device for identifying a spam telephone identifier, which comprises: the receiving module is used for receiving the junk telephone identification from the terminal, and the junk telephone identification comprises a calling number; the system comprises an acquisition module, a display module and a display module, wherein the acquisition module is used for acquiring the number marking content of a calling number from a junk telephone marking platform, and the number marking content comprises marking record types, marking times of each marking record type and a called number for marking the calling number; and the type determining module is used for adjusting the marking times of each mark recording type and determining the final mark type of the calling number if the calling number is marked as a plurality of mark recording types by different called numbers.
In another aspect, an embodiment of the present invention provides a computer device, where the computer device includes: a processor and a memory storing computer program instructions; the processor, when executing the computer program instructions, implements the spam call identification recognition method as described in any of the above.
In yet another aspect, an embodiment of the present invention provides a computer storage medium, where computer program instructions are stored on the computer storage medium, and when executed by a processor, the computer program instructions implement any one of the above spam call identification recognition methods.
The junk call identification recognition method, the junk call identification recognition device, the computer equipment and the storage medium can fully utilize Internet netizens to classify and identify received unfamiliar numbers based on a crowdsourcing mechanism, and establish a junk call marking platform to record the contents of the classified identifications. In some embodiments, the calling numbers with conflicting number types are processed for type consistency, so as to determine a mark record type which can best embody the number type, so that a user downloads a final mark type to know the number type when being called by a strange number, and the accuracy of type judgment is improved. In some embodiments, under the condition that the number marking content is incomplete, the marking record type corresponding to the unknown number is analyzed by combining the number evaluation information, so that the data acquisition range for determining the final marking type is richer. In some embodiments, the number mark type is periodically updated and released according to the life cycle of the telephone number, so that the defect that the telephone cannot be connected once the number is marked is avoided, and the memory pressure of a junk telephone mark platform is relieved.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings required to be used in the embodiments of the present invention will be briefly described below, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flow chart of a spam call identification recognition method according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of a spam call identification recognition method according to another embodiment of the present invention;
FIG. 3 is a schematic diagram of a single instance of a tagged-rating concept tree, as provided by an embodiment of the present invention;
FIG. 4 is a flowchart illustrating a spam identification recognizing method according to still another embodiment of the present invention;
fig. 5 is a flowchart illustrating a spam call identification recognizing method according to another embodiment of the present invention;
fig. 6 is a schematic structural diagram of a spam call identification recognizing apparatus according to another embodiment of the present invention;
FIG. 7 shows a schematic structural diagram of a computer device of an embodiment of the present invention;
fig. 8 shows a schematic structural diagram of a computer device according to another embodiment of the present invention.
Detailed Description
Features and exemplary embodiments of various aspects of the present invention will be described in detail below, and in order to make objects, technical solutions and advantages of the present invention more apparent, the present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not to be construed as limiting the invention. It will be apparent to one skilled in the art that the present invention may be practiced without some of these specific details. The following description of the embodiments is merely intended to provide a better understanding of the present invention by illustrating examples of the present invention.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element.
In order to solve the problem of the prior art, embodiments of the present invention provide a method and an apparatus for identifying a spam call identifier, a computer device, and a storage medium. The method for identifying spam call identifiers provided by the embodiment of the invention is introduced below.
Fig. 1 is a flowchart illustrating a spam call identification recognition method according to an embodiment of the present invention. As shown in fig. 1, the method includes:
102, receiving a spam telephone identification from a terminal;
104, acquiring the number marking content of the calling number from the junk phone marking platform;
and step 106, if the calling number is marked as a plurality of mark record types by different called numbers, adjusting the marking times of each mark record type, and determining the final mark type of the calling number.
The junk phone identification comprises a calling number; the number mark content comprises mark record types, mark times of each mark record type and called numbers for marking calling numbers.
In the embodiment, the internet netizens can be fully utilized to carry out classification identification on the received strange numbers based on a crowdsourcing mechanism, a junk phone marking platform is established to record the contents of the classification identifications, and type consistency processing is carried out on calling numbers with conflicting marking types, so that a marking recording type which can reflect the number type most is determined, a user can download a final marking type to know the type of the number when the user is called by the strange number, and the accuracy of type judgment is improved.
In specific implementation, the server periodically obtains the number marking content of the calling number from a spam telephone marking platform (for example, a third party marking platform) or a spam telephone database, and specifically includes: number, type of label record, number of labels, date of label, and source of label. A number can be marked into a plurality of mark record types, the marked times of each mark record type are not necessarily the same, the judgment of different called parties on the calling number type is different, the time or the length of marking the calling number type each time is shorter or longer, the number mark types are collected and packaged into number mark content, and the further processing of data is facilitated.
Fig. 2 is a flowchart illustrating a spam call identification recognizing method according to another embodiment of the present invention. As shown in fig. 2, the method includes:
step 202, receiving a spam call identifier from a terminal;
step 204, acquiring the number marking content of the calling number from the junk phone marking platform;
step 206, judging whether the number marking content of the calling number is acquired, if so, executing step 212, otherwise, executing step 208;
step 208, acquiring number evaluation information from the junk phone marking platform, and splitting the number evaluation information into a plurality of participles;
step 210, determining a marked record type corresponding to each participle according to the marked concept tree; the marked concept tree comprises a corresponding relation between each participle and a marked record type;
step 212, if the calling number is marked as a plurality of mark record types by different called numbers, adjusting the marking times of each mark record type and determining the final mark type of the calling number;
step 214, when receiving a downloading instruction of the final mark type from the terminal, sending the final mark type to the terminal;
step 216, within a first preset number of days, if the junk call identifier of the calling number is not received and new number marking content of the calling number is not acquired in the junk call marking platform, marking the final marking type of the calling number as overdue release.
The junk phone identification comprises a calling number; the number mark content comprises mark record types, mark times of each mark record type and called numbers for marking calling numbers.
In this embodiment, even in the case that the number marking content of the calling number is not obtained from the spammed telephone marking platform, number evaluation information such as comments, complaints, and the like of other people on the calling number may be obtained from the spammed telephone marking platform or a related website, and when the number evaluation information is a relatively long sentence, the sentence is divided into a plurality of participles/keywords, and the participles are identified to obtain the meaning represented by each participle, thereby determining the marking record type of the number in the number evaluation information.
In one example embodiment, a tree of marking concepts related to marks such as harassment, fraud, a sound, advertising, an intermediary, express, and suspected fraud is built. And the Internet friends are divided into a plurality of participles for the comment sentences of the unknown numbers, wherein some participles exist in the marked concept tree, and are subjected to normalized marking according to the marked evaluation semantics to obtain the conclusion of the unknown number types. As an example shown in fig. 3, the unknown number type may be defined as a nuisance number if the number evaluation information relates to a segmentation/keyword of malicious, early morning, frequent, stock, etc., and a fraud number if the number evaluation information relates to a segmentation of telephone fee, game, high speed, work, etc. Under the condition that the number marking content is incomplete, the marking record type corresponding to the unknown number is analyzed by combining the number evaluation information, so that the data acquisition range for determining the final marking type is richer.
In some embodiments, a spam telephone marking platform and a personalized number marking library can be further established, wherein the spam telephone marking platform belongs to a public number marking library, number marking contents for a large number of calling numbers are stored in the public number marking library, and a final marking type determined according to comments and complaint information semantics through word segmentation can be stored in the personalized number marking library.
In some embodiments, numbers that have been marked as spam may also be periodically checked for continued harassment. If any mark of the net friend on the junk telephone number is not detected within a certain time, or new number mark content of the junk telephone number is not updated in the junk telephone mark platform, the number mark is periodically updated and released according to the life cycle of harassing/fraud calls by releasing the final mark type mark which does not disturb the calling number of the user any more for a long time, the defect that the number cannot be connected once marked is avoided, and the memory pressure of the junk telephone mark platform is also facilitated to be relieved.
Fig. 4 is a flowchart illustrating a spam call identification recognizing method according to an embodiment of the present invention. As shown in fig. 4, the method includes:
step 402, receiving a spam telephone identification from a terminal;
step 404, obtaining number marking content of the calling number from the junk phone marking platform;
step 406, determining whether the number marking content of the calling number is acquired, if so, executing step 412, otherwise, executing step 408;
step 408, acquiring number evaluation information from the junk phone marking platform, and splitting the number evaluation information into a plurality of participles;
step 410, determining a marked record type corresponding to each participle according to the marked concept tree; the marked concept tree comprises a corresponding relation between each participle and a marked record type;
step 412, if the calling number is marked as a plurality of mark record types by different called numbers, adjusting the marking times of each mark record type according to a formula 1;
Figure BDA0001493100320000081
wherein, M'icmNumber of marking times of the type recorded for the cm-th mark of the adjusted calling number i, MicmNumber of marking times of the cm-th marking record type for the calling number i before adjustment, MikcmThe marking times of the type of the cm-th marked record of the calling number i marked by the k-th called number are recorded, N is the number of the called numbers, and k belongs to N.
Step 414, updating the number mark content;
step 416, obtaining the number marking content of the second preset number of days j, and forming a marking sequence { M }ijcmWhere j is an integer greater than 0, MijcmRecording the marking times of the types of the cm marks of the calling number i within a second preset number of days j;
step 418, determining the final marking type of the calling number according to the marking sequence;
step 420, when receiving a download instruction of the final mark type from the terminal, sending the final mark type to the terminal;
step 422, in the first preset number of days, if the junk phone identification of the calling number is not received and new number marking content of the calling number is not acquired in the junk phone marking platform, marking the final marking type of the calling number as overdue release.
The junk phone identification comprises a calling number; the number mark content comprises mark record types, mark times of each mark record type and called numbers for marking calling numbers.
In the embodiment, the internet netizens can be fully utilized to carry out classification identification on the received unfamiliar numbers based on a crowdsourcing mechanism, a junk phone marking platform is established to record the contents of the classification identifications, and consistency processing is carried out on calling numbers with conflicting number types, so that a marking recording type which can reflect the number type most is determined, a user can download a final marking type to know the number type when being called by the unfamiliar numbers, the unfamiliar number type is acquired in real time, and meanwhile accuracy of type judgment is improved.
The same calling number is marked as different types by different users, in order to avoid the influence of malicious marks on actual marks, the times that one calling number is marked as any type by one called number needs to be planned according to the formula 1 again, and the marking times of each type of one calling number are sequentially brought into the formula 1
Figure BDA0001493100320000091
Wherein, M'icmNumber of marking times of the type recorded for the cm-th mark of the adjusted calling number i, MicmNumber of marking times of the cm-th marking record type for the calling number i before adjustment, MikcmThe marking times of the type of the cm-th marked record of the calling number i marked by the k-th called number are recorded, N is the number of the called numbers, and k belongs to N. Finally, the adjusted marking times of each type are obtained, the authenticity of the marking times can be reflected, a marking sequence within a second preset number of days j is defined by users, and the final marking type with a real and accurate calling number is judged according to the marking sequence.
Fig. 5 is a flowchart illustrating a spam call identification recognizing method according to an embodiment of the present invention. As shown in fig. 5, the method includes:
step 502, receiving a junk phone identifier from a terminal, and acquiring number marking content of a calling number from a junk phone marking platform;
step 504, judging whether the number marking content of the calling number is acquired, if so, executing step 508, otherwise, executing step 506;
step 506, acquiring number evaluation information from the junk phone marking platform, dividing the number evaluation information into a plurality of participles, and determining a marking record type corresponding to each participle according to a marking concept tree; the marked concept tree comprises a corresponding relation between each participle and a marked record type;
step 508, if the calling number is marked as a plurality of mark record types by different called numbers, adjusting the marking times of each mark record type according to a first formula, updating the number mark content, obtaining the number mark content of a second preset number of days j, and forming a mark sequence { M }ijcmWhere j is an integer greater than 0, MijcmRecording the marking times of the types of the cm marks of the i calling numbers within a second preset number of days j;
Figure BDA0001493100320000101
wherein, M'icmNumber of marking times of the type recorded for the cm-th mark of the adjusted calling number i, MicmNumber of marking times of the cm-th marking record type for the calling number i before adjustment, MikcmThe marking times of the type of the cm-th marked record of the calling number i marked by the k-th called number are recorded, N is the number of the called numbers, and k belongs to N.
Step 510, according to the mark sequence { MijcmAnd calculating a secondary difference sequence { D' of each mark record type of the calling number according to a formula 2icmRecord the quadratic difference sequence { D') of the type for each markicmCarrying out positive sequence sorting;
Figure BDA0001493100320000102
wherein, D ″)icmRecording the quadratic difference of the type for the cm symbols of the calling number i, MijcmAnd marking times of the mark record types of the cm marks of the calling number i in the second preset number of days j, wherein j belongs to B, and B is an integer larger than 0.
Step 512, determining whether the formula 3 is true, if yes, executing step 514, otherwise, executing step 516;
F(D″icm)-S(D″icm)>S(D″icm) /4 (formula 3)
Wherein, F (D ″)icm) Second order difference of mark record type for positive order first, S (D ″)icm) The second mark recording type quadratic difference is arranged in positive order.
Step 514, using the first mark record type in the positive sequence as the final mark type of the calling number;
step 516, the quadratic difference sequence { D ″ ]icmComparing each secondary difference with other secondary differences to obtain a plurality of comparison differences;
step 518, if the comparison differences are smaller than a preset threshold value, sending the mark record type to the terminal for type re-marking according to the damage degree of the mark record type; the type marked again by the marking record type by the receiving terminal is used as the final marking type of the calling number;
step 520, when receiving a download instruction of the final mark type from the terminal, sending the final mark type to the terminal;
step 522, within a first preset number of days, if the junk call identification of the calling number is not received and new number marking content of the calling number is not acquired in a junk call marking platform, marking the final marking type of the calling number as overdue release;
step 524, the spam call identification is stored to a spam call marking platform.
The junk phone identification comprises a calling number; the number marking content comprises marking record types, marking times of each marking record type and called numbers for marking calling numbers; the spam telephone identification also comprises a current called number, a current mark type and mark time; the mark record type and the current mark type include at least one of: fraud calls, a sound call, an advertisement call, a broker call, a courier call, a suspected fraud call, other calls.
In this embodiment, the obtained tag sequence is substituted into the second formula to obtain a secondary difference sequence related to the type of the calling number, the secondary difference sequence is sorted in a positive order, for example, the number cm of the tag record types of the calling number is 7, the secondary difference sequence includes secondary difference values of 7 tag record types, the 7 secondary difference values are arranged in a descending order, the tag record type corresponding to the second difference value arranged in the front can embody the true type of the unknown number, and the secondary difference value arranged in the first place (with the largest value) is marked as F (D ″)icm) The second order difference value ranked next to the second order (numerical order is greater) is denoted as S (D ″)icm),F(D″icm) And S (D ″)icm) Once the judgment condition of formula 3 is satisfied, F (D') of the first bit is arranged in positive ordericm) The corresponding tag record type may be the final tag type of the calling number.
If the judgment condition of the formula 3 is not met, the secondary difference sequence can be further processed, namely, each secondary difference is compared with other secondary differences to obtain a plurality of comparison differences, when any one comparison difference is not larger than a preset threshold value, the times of each mark record type representing the calling number are closer, a certain mark record type is sent to a terminal according to the damage degree of each mark record type, for example, a mark record type with the minimum or maximum damage degree is distributed to the terminal for a user to confirm the mark record type, and the mark record type is used as a final mark type after the user confirms, so that the type mark of the calling number is more accurate. Therein, various types of harm levels are ranked in advance, such as fraud phone harm level > suspected fraud phone > advertising phone > intermediary phone > express phone > one sound phone > other phone.
Fig. 6 shows a schematic structural diagram of a spam call identification recognizing apparatus 600 according to another embodiment of the present invention. As shown in fig. 6, the apparatus includes:
a receiving module 602, configured to receive a spam call identifier from a terminal, where the spam call identifier includes a calling number;
an obtaining module 604, configured to obtain number marking content of a calling number from a junk phone marking platform, where the number marking content includes a mark recording type, a marking frequency of each mark recording type, and a called number for marking the calling number;
a type determining module 606, configured to adjust the marking times of each mark record type and determine a final mark type of the calling number if the calling number is marked as a plurality of mark record types by different called numbers.
In the embodiment, the internet netizens can be fully utilized to carry out classification identification on the received strange numbers based on a crowdsourcing mechanism, a junk phone marking platform is established to record the contents of the classification identifications, and type consistency processing is carried out on calling numbers with conflicting number types, so that a marking recording type which can reflect the number type most is determined, a user can download a final marking type to know the type of the number when the user is called by the strange number, and the accuracy of type judgment is improved.
Fig. 7 shows a hardware structure diagram of a computer device 700 according to an embodiment of the present invention. As shown in fig. 7, the apparatus includes: a processor 702 and a memory 704 in which computer program instructions are stored. Specifically, the processor 702 may include a Central Processing Unit (CPU), or an Application Specific Integrated Circuit (ASIC), or may be configured as one or more Integrated circuits implementing embodiments of the present invention.
Memory 704 may include mass storage for data or instructions. By way of example, and not limitation, memory 704 may include a Hard Disk Drive (HDD), floppy Disk Drive, flash memory, optical Disk, magneto-optical Disk, tape, or Universal Serial Bus (USB) Drive or a combination of two or more of these. Memory 704 may include removable or non-removable (or fixed) media, where appropriate. The memory 704 may be internal or external to the integrated gateway disaster recovery device, where appropriate. In a particular embodiment, the memory 704 is a non-volatile solid-state memory. In certain embodiments, memory 704 comprises Read Only Memory (ROM). Where appropriate, the ROM may be mask-programmed ROM, Programmable ROM (PROM), Erasable PROM (EPROM), Electrically Erasable PROM (EEPROM), electrically rewritable ROM (EAROM), or flash memory, or a combination of two or more of these.
The processor 702 may implement any of the above embodiments of spam identification recognition methods by reading and executing computer program instructions stored in the memory 704.
In one example, the computer device may also include a communication interface 806 and a bus 808. As shown in fig. 8, the processor 802, the memory 804, and the communication interface 806 are connected via a bus 808 to complete communication therebetween.
The communication interface 806 is mainly used for implementing communication between modules, apparatuses, units and/or devices in the embodiments of the present invention.
Bus 808 comprises hardware, software, or both coupling the components of the online data traffic billing device to one another. By way of example, and not limitation, a bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a Hypertransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an infiniband interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a video electronics standards association local (VLB) bus, or other suitable bus or a combination of two or more of these. Bus 808 may include one or more buses, where appropriate. Although specific buses have been described and shown in the embodiments of the invention, any suitable buses or interconnects are contemplated by the invention.
In addition, in combination with the spam call identification recognition method in the above embodiment, the embodiment of the present invention may provide a computer storage medium to implement. The computer storage medium having computer program instructions stored thereon; the computer program instructions, when executed by a processor, implement any of the above embodiments of the spam call identification recognition method.
It is to be understood that the invention is not limited to the specific arrangements and instrumentality described above and shown in the drawings. A detailed description of known methods is omitted herein for the sake of brevity. In the above embodiments, several specific steps are described and shown as examples. However, the method processes of the present invention are not limited to the specific steps described and illustrated, and those skilled in the art can make various changes, modifications and additions or change the order between the steps after comprehending the spirit of the present invention.
The functional blocks shown in the above-described structural block diagrams may be implemented as hardware, software, firmware, or a combination thereof. When implemented in hardware, it may be, for example, an electronic circuit, an Application Specific Integrated Circuit (ASIC), suitable firmware, plug-in, function card, or the like. When implemented in software, the elements of the invention are the programs or code segments used to perform the required tasks. The program or code segments may be stored in a machine-readable medium or transmitted by a data signal carried in a carrier wave over a transmission medium or a communication link. A "machine-readable medium" may include any medium that can store or transfer information. Examples of a machine-readable medium include electronic circuits, semiconductor memory devices, ROM, flash memory, Erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, and so forth. The code segments may be downloaded via computer networks such as the internet, intranet, etc.
It should also be noted that the exemplary embodiments mentioned in this patent describe some methods or systems based on a series of steps or devices. However, the present invention is not limited to the order of the above-described steps, that is, the steps may be performed in the order mentioned in the embodiments, may be performed in an order different from the order in the embodiments, or may be performed simultaneously.
As described above, only the specific embodiments of the present invention are provided, and it can be clearly understood by those skilled in the art that, for convenience and brevity of description, the specific working processes of the system, the module and the unit described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again. It should be understood that the scope of the present invention is not limited thereto, and any person skilled in the art can easily conceive various equivalent modifications or substitutions within the technical scope of the present invention, and these modifications or substitutions should be covered within the scope of the present invention.

Claims (12)

1. A spam call identification recognition method, comprising:
receiving a junk telephone identifier from a terminal, wherein the junk telephone identifier comprises a calling number;
acquiring the number marking content of the calling number from a junk telephone marking platform, wherein the number marking content comprises marking record types, marking times of each marking record type and a called number for marking the calling number;
if the calling number is marked as a plurality of mark record types by different called numbers, adjusting the marking times of each mark record type according to a first formula;
updating the number marking content to generate a marking sequence;
determining the final marking type of the calling number according to the marking sequence;
wherein the first formula is
Figure FDA0002609932980000011
Wherein, M'icmThe marking times of the mark recording type for the cm-th mark of the adjusted calling number i, MicmFor the cm-th mark record type of the calling number i before adjustmentNumber of marking of (D), MikcmAnd the marking times of the type of the mark record for the cm th mark of the calling number i marked by the kth called number are recorded, N is the number of the called numbers, and k belongs to N.
2. The spam call identification recognition method of claim 1, further comprising:
and when a downloading instruction of the final mark type from the terminal is received, sending the final mark type to the terminal.
3. The spam call identification recognition method of claim 1, further comprising:
and within a first preset number of days, if the junk phone identification of the calling number is not received and new number marking content of the calling number is not acquired in the junk phone marking platform, marking the final marking type of the calling number as overdue release.
4. The spam call identification recognition method of claim 1, further comprising:
if the number evaluation information is acquired from the junk telephone marking platform and the number marking content is not acquired, dividing the number evaluation information into a plurality of word segments;
determining the mark record type corresponding to each word segmentation according to the mark concept tree;
wherein, the marked concept tree comprises the corresponding relation between each word segmentation and the marked record type.
5. The spam call id identification method according to claim 1, wherein the step of updating the number tag content to generate the tag sequence specifically comprises:
updating the number mark content;
acquiring the number mark content of a second preset number of days j to form a markSequence of note { MijcmWhere j is an integer greater than 0, MijcmAnd recording the marking times of the types of the marks for the cm numbers of the calling number i in the second preset number of days j.
6. The spam call id identification method according to claim 5, wherein the step of determining the final type of the label of the calling number according to the label sequence specifically comprises:
according to the mark sequence { MijcmCalculating a second difference sequence { D' of each mark record type of the calling number according to a second formulaicm};
A quadratic difference sequence { D ″' for each of the mark recording typesicmCarrying out positive sequence sorting;
judging whether a third formula is established;
if the third formula is established, taking the first mark record type in positive sequence as the final mark type of the calling number;
wherein the second formula is
Figure FDA0002609932980000021
The third formula is
F(D″icm)-S(D″icm)>S(D″icm)/4,
Wherein, D ″)icmRecording the quadratic difference of the type for cm of said marks of said calling number i, MijcmMarking times of the mark record types for cm of the calling number i in the second preset number of days j, wherein j belongs to B, B is an integer larger than 0, and F (D ″)icm) Second order difference of mark record type for positive order first, S (D ″)icm) The second mark recording type quadratic difference is arranged in positive order.
7. The spam call identification recognition method of claim 6, further comprising:
if the third formula does not hold, the quadratic difference sequence { D ″')icmComparing each secondary difference with other secondary differences to obtain a plurality of comparison differences;
if the comparison differences are smaller than a preset threshold value, sending the mark record type to the terminal for type re-marking according to the hazard degree of the mark record type;
and receiving the type marked again by the marking record type by the terminal as the final marking type of the calling number.
8. The spam call identification recognition method according to any one of claims 1 to 7, further comprising:
storing the spam call identification to the spam call tagging platform;
the spam telephone identification also comprises a current called number, a current mark type and mark time.
9. The spam identification recognition method of claim 8, wherein the token record type and the current token type comprise at least one of: fraud calls, a sound call, an advertisement call, a broker call, a courier call, a suspected fraud call, other calls.
10. A spam identification recognizing device, comprising:
the system comprises a receiving module, a sending module and a receiving module, wherein the receiving module is used for receiving a junk telephone identifier from a terminal, and the junk telephone identifier comprises a calling number;
the acquiring module is used for acquiring the number marking content of the calling number from a junk telephone marking platform, wherein the number marking content comprises marking record types, marking times of each marking record type and a called number for marking the calling number;
the type determining module is used for adjusting the marking times of each marking record type according to a first formula, updating the number marking content to generate a marking sequence and determining the final marking type of the calling number according to the marking sequence if the calling number is marked as a plurality of marking record types by different called numbers;
wherein the first formula is
Figure FDA0002609932980000031
Wherein, M'icmThe marking times of the mark recording type for the cm-th mark of the adjusted calling number i, MicmNumber of times of marking for the cm-th marking record type of the calling number i before adjustment, MikcmAnd the marking times of the type of the mark record for the cm th mark of the calling number i marked by the kth called number are recorded, N is the number of the called numbers, and k belongs to N.
11. A computer device, the device comprising: a processor and a memory storing computer program instructions;
the processor, when executing the computer program instructions, implements the spam call identification method as recited in any of claims 1-9.
12. A computer storage medium having computer program instructions stored thereon which, when executed by a processor, implement a spam call identification recognition method as claimed in any one of claims 1 to 9.
CN201711259287.XA 2017-12-04 2017-12-04 Junk call identification recognition method and device, computer equipment and storage medium Active CN109873908B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711259287.XA CN109873908B (en) 2017-12-04 2017-12-04 Junk call identification recognition method and device, computer equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711259287.XA CN109873908B (en) 2017-12-04 2017-12-04 Junk call identification recognition method and device, computer equipment and storage medium

Publications (2)

Publication Number Publication Date
CN109873908A CN109873908A (en) 2019-06-11
CN109873908B true CN109873908B (en) 2020-11-20

Family

ID=66915445

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711259287.XA Active CN109873908B (en) 2017-12-04 2017-12-04 Junk call identification recognition method and device, computer equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109873908B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114048339B (en) * 2021-08-19 2022-08-05 广西东信易通科技有限公司 Intelligent AI (Artificial intelligence) query method and system for privacy number marking

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104253908A (en) * 2014-09-18 2014-12-31 北京奇虎科技有限公司 Unknown number grading method, unknown number marking method and unknown number marking device
CN104506731A (en) * 2014-12-26 2015-04-08 北京奇虎科技有限公司 Unfamiliar call reminding method and unfamiliar call reminding device
CN104683537A (en) * 2015-01-28 2015-06-03 北京羽乐创新科技有限公司 Number marking method and device
CN104836924A (en) * 2015-03-31 2015-08-12 小米科技有限责任公司 Number marking method and device
CN105162931A (en) * 2015-07-30 2015-12-16 小米科技有限责任公司 Method and device for classifying communication numbers
CN106791229A (en) * 2012-04-13 2017-05-31 北京搜狗科技发展有限公司 The identification method and device of number
CN106954218A (en) * 2017-03-15 2017-07-14 中国联合网络通信集团有限公司 The number sorted methods, devices and systems of one kind harassing and wrecking

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6782083B2 (en) * 2001-06-06 2004-08-24 Mci, Inc. ISN call interrupt

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106791229A (en) * 2012-04-13 2017-05-31 北京搜狗科技发展有限公司 The identification method and device of number
CN104253908A (en) * 2014-09-18 2014-12-31 北京奇虎科技有限公司 Unknown number grading method, unknown number marking method and unknown number marking device
CN104506731A (en) * 2014-12-26 2015-04-08 北京奇虎科技有限公司 Unfamiliar call reminding method and unfamiliar call reminding device
CN104683537A (en) * 2015-01-28 2015-06-03 北京羽乐创新科技有限公司 Number marking method and device
CN104836924A (en) * 2015-03-31 2015-08-12 小米科技有限责任公司 Number marking method and device
CN105162931A (en) * 2015-07-30 2015-12-16 小米科技有限责任公司 Method and device for classifying communication numbers
CN106954218A (en) * 2017-03-15 2017-07-14 中国联合网络通信集团有限公司 The number sorted methods, devices and systems of one kind harassing and wrecking

Also Published As

Publication number Publication date
CN109873908A (en) 2019-06-11

Similar Documents

Publication Publication Date Title
CN111061874B (en) Sensitive information detection method and device
US9130778B2 (en) Systems and methods for spam detection using frequency spectra of character strings
CN103428183B (en) Method and device for identifying malicious website
CN109976995B (en) Method and apparatus for testing
CN103440458B (en) A kind of method of heuristic static identification Android system malicious code
CN104270521A (en) Method for processing incoming call number and mobile terminal
CN108337153A (en) A kind of monitoring method of mail, system and device
CN112491864A (en) Method, device, equipment and medium for detecting phishing deep victim user
CN113328994B (en) Malicious domain name processing method, device, equipment and machine readable storage medium
CN106383862B (en) Illegal short message detection method and system
CN109905524B (en) Telephone number identification method and device, computer equipment and computer storage medium
CN111338692A (en) Vulnerability classification method and device based on vulnerability codes and electronic equipment
US11929969B2 (en) System and method for identifying spam email
CN106453062A (en) Application notification management method and terminal
CN113554049A (en) Method, device, equipment and storage medium for identifying different network broadband users
CN109873908B (en) Junk call identification recognition method and device, computer equipment and storage medium
CN109726398B (en) Entity identification and attribute judgment method, system, equipment and medium
CN108322912B (en) Method and device for distinguishing short messages
CN111461118B (en) Interest feature determining method, device, equipment and storage medium
CN113158686A (en) Network culture management compliance detection method and device, readable medium and electronic equipment
CN112784015A (en) Information recognition method and apparatus, device, medium, and program
CN110990570A (en) Mail drop extraction method based on deep learning
CN109426666B (en) Junk short message identification method, system, readable storage medium and mobile terminal
CN111428251B (en) Data processing method and device
CN112449062B (en) Malicious fee deduction identification method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant