CN108959646B - Method, system, device and storage medium for automatically verifying communication number - Google Patents

Info

Publication number
CN108959646B
Authority
CN
China
Prior art keywords
communication number
voice
communication
matching
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810853313.XA
Other languages
Chinese (zh)
Other versions
CN108959646A (en)
Inventor
华吉春
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Ctrip Travel Information Technology Shanghai Co Ltd
Original Assignee
Ctrip Travel Information Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Ctrip Travel Information Technology Shanghai Co Ltd
Priority to CN201810853313.XA
Publication of CN108959646A
Application granted
Publication of CN108959646B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/26 Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a method, a system, a device, and a storage medium for automatically verifying a communication number. A crawler searches a target website using keywords and crawls each communication number in the search results together with a plurality of number feature fields associated with it, the number feature fields including text fields. The crawled communication number is called and the response voice is acquired. The response voice is converted into a voice text, and the text fields in the number feature fields are matched against the voice text. If the matching succeeds, the communication number is judged to be correct; if the matching fails, the communication number is judged to be abnormal and the judgment result is output. The communication number is thereby verified automatically, which improves efficiency, surfaces problems quickly, and reduces labor cost.

Description

Method, system, device and storage medium for automatically verifying communication number
Technical Field
The present invention relates to the field of information processing technologies, and in particular, to a method, a system, a device, and a storage medium for automatically verifying a communication number.
Background
At present, many companies publish the customer service numbers for their services on various well-known websites. However, as the business of a large company expands, more and more numbers have to be published, and problems arise: some numbers are not updated in time, some published numbers are wrong, and some numbers do not match their published descriptions. Such problems are often not discovered promptly and are frequently found by customers first, which results in a poor user experience. In the prior art, communication numbers are usually verified manually; as the number of publishing sites and numbers grows, manual verification becomes inefficient and wastes considerable manpower and material resources.
Disclosure of Invention
In view of the above technical problems, the invention provides a method, a system, a device, and a storage medium for automatically verifying a communication number, which crawl the communication number by means of a crawler and match the published content against the actual content of the number through automatic calling and voice recognition, so that problematic customer service numbers are detected automatically and relevant personnel are notified in time to update the published content or the number.
A first aspect of the present invention provides a method of automatically verifying a communication number, comprising the steps of: S10, searching a target website with keywords by means of a crawler, and crawling the communication number in each search result and a plurality of number feature fields associated with the communication number, wherein the number feature fields include text fields; S30, calling the crawled communication number and acquiring a response voice; S40, converting the response voice into a voice text and matching the text fields in the number feature fields against the voice text, executing S50 if the matching succeeds and S60 if it fails; S50, judging that the communication number is correct; and S60, judging that the communication number is abnormal and outputting the judgment result.
Preferably, step S10 includes: S11, searching the target website using the keywords; S12, judging the type of each search result according to the tag semantics of the hypertext markup language of the search result; and S13, extracting the communication number and the number feature fields from the search result using the preset content crawling mode that corresponds to the type of the search result.
Preferably, step S11 includes: acquiring the keywords from a web page delivery system.
Preferably, step S50 includes: judging that the communication number is correct, and storing the communication number and the number feature fields that were successfully matched in step S40.
Preferably, the number feature fields include a timestamp, and after step S10 the method further includes the steps of: S21, searching the stored data for the crawled communication number, executing step S30 if the communication number is not found and step S22 if it is found; and S22, calculating whether the difference between the crawled timestamp and the timestamp corresponding to the communication number in the stored data exceeds a threshold, executing step S50 if it does not and step S30 if it does.
Preferably, step S30 includes: reading the crawled communication number, calling the communication number using voice over IP communication, starting recording, and stopping recording after an on-hook prompt tone is detected.
Preferably, step S40 includes: S41, judging through audio spectrum analysis whether the response voice contains the human voice frequency band, executing step S42 if it does and step S60 if it does not; S42, converting the response voice into text to obtain the voice text; and S43, extracting the feature nouns from the voice text and matching the text fields in the number feature fields against the feature nouns.
A second aspect of the present invention provides a system for automatically verifying a communication number, comprising: a crawler module, which searches a target website with keywords by means of a crawler and crawls the communication number in each search result and a plurality of number feature fields associated with the communication number, the number feature fields including text fields; a calling module for calling the crawled communication number and acquiring a response voice; a voice analysis module for converting the response voice into a voice text; and a matching analysis module for matching the text fields in the number feature fields against the voice text, judging that the communication number is correct if the matching succeeds, judging that the communication number is abnormal if the matching fails, and outputting the judgment result.
A third aspect of the present invention also provides an apparatus for automatically verifying a communication number, comprising: a processor; a memory having stored therein executable instructions of the processor; wherein the processor is configured to perform the steps of the method of automatically verifying a communication number of the first aspect described above via execution of executable instructions.
The fourth aspect of the present invention also provides a computer-readable storage medium storing a program which, when executed, implements the steps of the method of automatically verifying a communication number of the first aspect described above.
The method, the system, the equipment and the storage medium for automatically verifying the communication number provided by the invention crawl the communication number through a crawler technology, match the delivered content with the actual content of the number through automatic calling and voice recognition, and judge whether the communication number is correct, thereby automatically verifying the communication number, improving the efficiency, quickly finding the problem and reducing the labor cost.
Drawings
Other features, objects and advantages of the present invention will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, with reference to the accompanying drawings.
Fig. 1 is a flowchart of a method of automatically verifying a communication number according to a first embodiment of the present invention;
Fig. 2 is a detailed flowchart of step S10 in Fig. 1;
Fig. 3 is a flowchart of a method of automatically verifying a communication number according to a second embodiment of the present invention;
Fig. 4 is a block diagram of a system for automatically verifying a communication number according to one embodiment of the present invention;
Fig. 5 is a schematic structural diagram of an apparatus for automatically verifying a communication number according to an embodiment of the present invention; and
Fig. 6 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals in the drawings denote the same or similar structures, and thus their repetitive description will be omitted.
Aiming at customer service number delivery inspection, the prior art usually uses manual verification, however, along with the increase of delivery sites and numbers, the prior art has the problems of low efficiency, untimely discovery, waste of human resources and the like.
According to the invention, the communication number is crawled through a crawler technology, and the delivered content is matched with the actual content of the number through automatic calling and voice recognition, so that the problem customer service number is automatically detected, and related personnel can be timely notified to update the delivered content or the number, therefore, the detection efficiency is improved, the problem can be quickly found, and the labor cost is reduced.
Fig. 1 is a flowchart of a method for automatically verifying a communication number according to a first embodiment of the present invention. As shown in Fig. 1, the method for automatically verifying a communication number of the present invention includes the steps of:
S10, searching a target website with keywords by means of a crawler, and crawling the communication number in each search result and a plurality of number feature fields associated with the communication number, wherein the number feature fields include text fields.
The target website is a predetermined website, typically a search engine such as Baidu (www.baidu.com). The keywords are service descriptors related to the communication number to be verified, such as "service telephone". The keywords may be preset manually. In this embodiment, the keywords are obtained from a web page delivery system, which is used to publish the web pages of a specific service. The web page delivery system presets keyword marks when a web page is published, and when the communication number of a specific service needs to be verified, the keywords are fetched from the web page delivery system by means of those keyword marks. Taking Baidu as the target website, a search with the keywords returns a plurality of search results.
After the search with the keywords in the target website returns its results, the communication number and a plurality of number feature fields associated with the communication number are extracted from each search result. One way to extract the communication number is to detect numeric character strings and take a string that conforms to a preset format as the communication number. Another way combines numeric string detection with hypertext markup language (HTML) tags, for example reading the numeric string between the tags <dd></dd> as the communication number. The two ways may also be combined. Other embodiments may extract the communication number in other ways.
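The two extraction approaches described above can be combined as in the following minimal Python sketch. It is an illustration under assumptions rather than the patented implementation: the number format in NUMBER_PATTERN and the names DdTextCollector and extract_numbers are hypothetical, and only the <dd> tag example comes from the description.

```python
import re
from html.parser import HTMLParser

# Hypothetical preset format: 400/800 service numbers or area-code landlines.
NUMBER_PATTERN = re.compile(
    r"(?<!\d)(?:[48]00[- ]?\d{3}[- ]?\d{4}|0\d{2,3}[- ]?\d{7,8})(?!\d)"
)


class DdTextCollector(HTMLParser):
    """Collects the text found between <dd> and </dd> tags."""

    def __init__(self):
        super().__init__()
        self._depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == "dd":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "dd" and self._depth > 0:
            self._depth -= 1

    def handle_data(self, data):
        if self._depth > 0:
            self.chunks.append(data)


def extract_numbers(html):
    """Return numbers that sit inside <dd> tags and match the preset format."""
    collector = DdTextCollector()
    collector.feed(html)
    found = []
    for chunk in collector.chunks:
        for match in NUMBER_PATTERN.findall(chunk):
            normalized = re.sub(r"[- ]", "", match)  # drop separators
            if normalized not in found:
                found.append(normalized)
    return found
```

A numeric string outside the <dd> tags, or a string inside them that does not fit the preset format, is ignored, which is the point of combining the two checks.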
The number feature fields describe the features of the crawled communication number and include text fields. In this embodiment the text fields include the crawled keywords, the web page name, and the web page content. The web page name and the web page content need to be filtered further; the filtering removes irrelevant information, erroneous information, and illegal or non-compliant information from them.
In other embodiments, crawling the communication number and the number feature fields associated with the communication number may be accomplished using other crawler technologies as well.
Preferably, a plurality of content crawling modes are preset, and different content crawling modes are respectively used according to different search result types, so that the method is suitable for crawling of all webpage types of search results of the target website, has high universality and improves crawling accuracy.
Fig. 2 is a detailed flowchart of step S10 in Fig. 1, and step S10 is described in detail below with reference to Fig. 2. As shown in Fig. 2, before step S11 is executed, a plurality of content crawling modes are preset according to the types of web pages that carry communication numbers. In this example the content crawling modes include a boxed-frame mode, for results such as the Baidu security-certified number box; a telephone-icon mode, for results such as online web page customer service; a question-and-answer web page mode, for results such as Baidu Zhidao (Baidu Knows); and other modes. Each content crawling mode obtains the communication number and the number feature fields associated with it by reading different HTML tags and tag semantics.
In step S11, the target website is searched using the keywords to obtain the search results. In step S12, the type of each search result is judged according to the tag semantics of the hypertext markup language of the search result. In step S13, the communication number and the number feature fields are extracted using the preset content crawling mode that corresponds to the type of the search result, and are stored in a message queue.
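Steps S12 and S13 amount to classifying a result and dispatching to a matching extractor. The Python sketch below shows one way this could look. It is only an assumption-laden illustration: the class-name markers in TYPE_MARKERS, the number pattern, and all function names are invented here, since the description does not specify the actual tags used by the target website.

```python
import queue
import re
import time

NUMBER = re.compile(r"(?<!\d)\d{3,4}-\d{3,4}-\d{4}(?!\d)")  # assumed format
TAGS = re.compile(r"<[^>]+>")

# Hypothetical tag-semantics markers, one per preset content crawling mode.
TYPE_MARKERS = [
    ("boxed", 'class="official-number-box"'),
    ("phone_icon", 'class="icon-phone"'),
    ("qa", 'class="qa-answer"'),
]


def classify_result(html):
    """S12: pick a result type from markers in the result's HTML."""
    for result_type, marker in TYPE_MARKERS:
        if marker in html:
            return result_type
    return "other"


def crawl_generic(html):
    """Fallback mode: first number-like string anywhere in the visible text."""
    match = NUMBER.search(TAGS.sub(" ", html))
    return match.group(0) if match else None


def crawl_qa(html):
    """Question-and-answer mode: only trust numbers inside the answer block."""
    answer = re.search(r'class="qa-answer"[^>]*>(.*?)</div>', html, re.S)
    return crawl_generic(answer.group(1)) if answer else None


CRAWL_MODES = {"boxed": crawl_generic, "phone_icon": crawl_generic,
               "qa": crawl_qa, "other": crawl_generic}


def crawl_result(html, keyword, page_name, message_queue, now=None):
    """S12 + S13: classify, extract, and enqueue the number with its fields."""
    result_type = classify_result(html)
    number = CRAWL_MODES[result_type](html)
    if number is None:
        return None
    record = {
        "number": number,
        "keyword": keyword,
        "page_name": page_name,
        "page_content": " ".join(TAGS.sub(" ", html).split()),
        "timestamp": time.time() if now is None else now,
        "type": result_type,
    }
    message_queue.put(record)
    return record
```

Keeping the modes in a dictionary means a new page type only needs a new marker and a new extractor, without touching the dispatch logic.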
With continued reference to Fig. 1, in step S30 the crawled communication number is called and the response voice is acquired.
Specifically, a communication number is read from the message queue and called using voice over IP (VoIP) communication. The VoIP call may use an existing communication method, for example a client connecting to a switch that converts to the analog telephone communication system, thereby calling the communication number. Recording of the call content starts at the same time as the outgoing call. The on-hook prompt tone is monitored, and recording stops once it is detected; the resulting recording file is the response voice.
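The description does not say how the on-hook prompt tone is detected. One common option is the Goertzel algorithm, which measures the energy of a signal at a single frequency. The Python sketch below assumes a 450 Hz prompt tone, 8 kHz audio delivered in fixed-size frames, and a rule of two consecutive tone frames; all of these values and the function names are assumptions for illustration, not details from the patent.

```python
import math


def tone_ratio(frame, freq_hz, sample_rate):
    """Share of the frame's energy at freq_hz, via the Goertzel algorithm."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq_hz / sample_rate)
    s1 = s2 = 0.0
    energy = 0.0
    for x in frame:
        s0 = x + coeff * s1 - s2
        s2, s1 = s1, s0
        energy += x * x
    if energy == 0.0:
        return 0.0  # silence carries no tone
    power = s1 * s1 + s2 * s2 - coeff * s1 * s2  # |X(f)|^2
    return power / (energy * len(frame) / 2.0)   # close to 1.0 for a pure tone


def record_until_hangup(frames, tone_hz=450.0, sample_rate=8000,
                        threshold=0.8, needed=2):
    """Keep frames until `needed` consecutive frames are dominated by the tone."""
    recorded, streak = [], 0
    for frame in frames:
        if tone_ratio(frame, tone_hz, sample_rate) >= threshold:
            streak += 1
        else:
            streak = 0
        if streak >= needed:
            # Drop the tone frames that were already buffered, then stop.
            del recorded[len(recorded) - (needed - 1):]
            break
        recorded.append(frame)
    return recorded
```

Requiring consecutive tone frames is a guard against a single speech frame that happens to carry strong energy near the tone frequency.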
S40, converting the response voice into a voice text, matching the text fields in the number feature fields against the voice text, executing S50 if the matching succeeds, and executing S60 if the matching fails. Specifically, step S41 is executed first: audio spectrum analysis determines whether the response voice contains the human voice frequency band. If it does not, that is, if the response voice contains only telephone prompt tones, the matching is judged to have failed and step S60 is executed directly. If the human voice frequency band is present, step S42 is executed: speech recognition is performed on the response voice to convert it into text, yielding the voice text. Existing speech recognition techniques may be used for this. Step S43 is then executed to extract the feature nouns from the voice text, that is, preset irrelevant information such as "welcome" is filtered out and the remaining nouns are extracted as feature nouns. The text fields in the number feature fields are then matched against the feature nouns. In this embodiment, the matching is judged successful when the feature nouns match all three items of the text fields, namely the crawled keywords, the web page name, and the web page content; otherwise the matching is judged to have failed.
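The matching in step S43 can be pictured with the following Python sketch. It makes two simplifying assumptions that are not in the description: real noun extraction would need a part-of-speech tagger or word segmenter, so a preset filler-word list stands in for it here, and "matching" is taken to mean that each of the three text fields shares at least one term with the voice text. The names IRRELEVANT, feature_terms, and fields_match are hypothetical.

```python
import re

# Hypothetical preset list of irrelevant greeting words to filter out.
IRRELEVANT = {"welcome", "hello", "please", "thank", "you", "for", "calling",
              "to", "the", "a", "an", "and", "of", "press"}

WORD = re.compile(r"[a-z0-9]+")


def feature_terms(voice_text):
    """Approximate S43: drop preset filler words and keep the remaining terms."""
    return {w for w in WORD.findall(voice_text.lower()) if w not in IRRELEVANT}


def fields_match(text_fields, voice_text):
    """Succeed only if every text field shares a term with the voice text."""
    terms = feature_terms(voice_text)
    if not terms:
        return False
    for field in text_fields.values():
        if not terms & set(WORD.findall(field.lower())):
            return False
    return True
```

For example, with the text fields "Acme hotel service", "Acme Travel official hotline", and "Call Acme customer service for hotel bookings", a greeting of "Welcome to Acme hotel booking" matches all three, while a greeting from an unrelated business matches none and would lead to step S60.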
S50, judging that the communication number is correct.
S60, judging that the communication number is abnormal and outputting the judgment result.
According to the matching result of step S40, the communication number is judged to be correct if the matching succeeds and abnormal if it fails. In this embodiment, the judgment result is output only when the communication number is judged to be abnormal, so that relevant personnel can update the published content or the number; no result is output when the number is judged to be correct, which reduces redundant information and improves efficiency. In other embodiments, the judgment result may also be output in step S50.
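The control flow of steps S30 to S60 in this first embodiment can be summarized in a short Python sketch. The telephony, voice detection, speech recognition, matching, and notification steps are passed in as callables because the description leaves their implementation to existing techniques; the function and parameter names are hypothetical.

```python
def verify_number(record, place_call, has_voice, speech_to_text, is_match, report):
    """Run S30 to S60 for one crawled record; return True if judged correct."""
    audio = place_call(record["number"])                   # S30: call and record
    if not has_voice(audio):                               # S41: prompt tones only
        report(record, "no human voice in the response")   # S60
        return False
    voice_text = speech_to_text(audio)                     # S42
    if is_match(record, voice_text):                       # S43
        return True                                        # S50: nothing is output
    report(record, "response does not match the published description")  # S60
    return False
```

Only the abnormal branches call report, mirroring the choice in this embodiment to output a result only when a number is judged abnormal.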
Fig. 3 is a flowchart of a method of automatically verifying a communication number according to another embodiment of the present invention. This embodiment is substantially the same as the embodiment described above. The difference is that in step S50, in addition to judging that the communication number is correct, the communication number and the number feature fields that were successfully matched in step S40 are stored; that is, when the matching in step S40 succeeds and the communication number is judged to be correct, the communication number and the number feature fields are read from the message queue and stored. The number feature fields include a timestamp. In this embodiment, only communication numbers verified as correct are stored, and communication numbers verified as abnormal are not, which saves storage space and improves efficiency. Preferably, only one record is kept per communication number, so that the number feature fields stored later for a communication number overwrite those stored earlier, which further saves storage space and improves efficiency.
Preferably, steps S21 and S22 are further included after step S10.
As shown in Fig. 3, after step S10 is executed, step S21 searches the stored data for the crawled communication number, that is, it determines whether the currently crawled communication number has already been verified as correct. If the communication number is not found, it has not yet been verified as correct, and step S30 is executed to verify it. If the communication number is found, it has previously been verified as correct, and step S22 is entered for a further check.
Step S22 calculates whether the difference between the crawled timestamp and the timestamp corresponding to the communication number in the stored data exceeds a threshold, that is, whether the time elapsed between the last successful verification and the current crawl exceeds a preset threshold. If the threshold is not exceeded, the currently crawled communication number is not verified again, and the flow jumps directly to step S50 to judge that the communication number is correct. Note that in this case the currently crawled communication number and number feature fields are not stored, so the timestamp corresponding to the communication number in the stored data is not updated, because the matching of step S40 has not been performed. If the threshold is exceeded, step S30 is executed to verify the currently crawled communication number. This arrangement preserves data accuracy while ensuring that a communication number that has already been checked is not checked a second time within a certain time range, which improves operational reliability.
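The decision made in steps S21 and S22 can be sketched in Python as follows. This is a minimal illustration that assumes the stored data is a dictionary mapping each verified number to the timestamp of its last successful verification; the function names and the use of seconds are assumptions.

```python
def next_step(number, crawled_ts, verified, threshold_seconds):
    """Return "S30" to verify by calling, or "S50" to accept without calling."""
    stored_ts = verified.get(number)                 # S21: search the stored data
    if stored_ts is None:
        return "S30"                                 # never verified as correct
    if crawled_ts - stored_ts > threshold_seconds:   # S22: verification too old
        return "S30"
    return "S50"                                     # recent enough, skip the call


def record_verified(number, crawled_ts, verified):
    """Store a number only after a successful S40 match; one record per number."""
    verified[number] = crawled_ts                    # overwrites any older record
```

Because next_step never writes to the store, a skipped verification leaves the stored timestamp untouched, so the number is called again once the original verification becomes older than the threshold.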
The embodiment further improves the efficiency of automatic verification of the communication number on the premise of ensuring the accuracy by avoiding verification of the verified communication number within a certain time threshold.
It can be known from the above description of the method for automatically verifying a communication number of the present invention that the method, system, device and storage medium for automatically verifying a communication number provided by the present invention crawl the communication number by a crawler technology, match the delivered content with the actual content of the number by automatic calling and voice recognition, and determine whether the communication number is correct, thereby automatically verifying the communication number, finding in time the problem of delivering customer service numbers on various websites at present, and improving the user experience.
The method judges the type of the search result according to the label semantics of the hypertext markup language of the search result, uses different content crawling modes for different types of search results, is suitable for crawling of all webpage types of the search results of the target website, has higher universality and improves the crawling accuracy.
The invention stores the communication number which is verified to be correct, avoids the repeated verification of the communication number within a certain time threshold range by searching the stored data and judging the time stamp, and further improves the efficiency of the automatic verification of the communication number on the premise of ensuring the accuracy.
The invention also provides a system for automatically verifying the communication number, which is used for solving the problem that the prior art usually uses manual verification for communication number detection and cannot meet the application of multi-site and multi-number detection.
Fig. 4 is a block diagram of a system for automatically verifying a communication number according to one embodiment of the present invention. As shown in Fig. 4, the system 10 for automatically verifying a communication number according to the present invention includes a crawler module 11, a calling module 12, a voice analysis module 13, and a matching analysis module 14.
The crawler module 11 uses a crawler technology to search in a target website by using keywords, and crawls a communication number in a search result and a plurality of number characteristic fields associated with the communication number, wherein the number characteristic fields comprise text fields.
The target website is a predetermined website, typically a search engine such as Baidu (www.baidu.com). The keywords are service descriptors related to the communication number to be verified, such as "service telephone". The keywords may be preset manually. In this example, the crawler module 11 is connected to an external web page delivery system, from which the keywords are obtained. The web page delivery system is used to publish the web pages of a specific service and presets keyword marks when a web page is published; when the communication number of a specific service needs to be verified, the keywords are fetched from the web page delivery system by means of those keyword marks. Taking Baidu as the target website, a search with the keywords returns a plurality of search results.
The calling module 12 is used for calling the crawled communication number and acquiring response voice.
The voice analysis module 13 is configured to convert the response voice into a voice text.
The matching analysis module 14 is configured to match a text field in the number feature field with the voice text, determine that the communication number is correct if the matching is successful, determine that the communication number is abnormal if the matching is failed, and output a determination result. In this embodiment, the matching analysis module 14 is connected to an external communication number processing system, and the matching analysis module 14 notifies the customer service staff of the occurrence of an abnormality of the communication number through the communication number processing system, so that the customer service staff can process the abnormal communication number in time.
It will be appreciated that the present system 10 for automatically verifying communication numbers also includes other existing functional modules that support the operation of the system 10 for automatically verifying communication numbers. The system 10 for automatically verifying communication numbers shown in fig. 4 is merely an example and should not impose any limitation on the functionality or scope of use of embodiments of the present invention.
The system 10 for automatically verifying a communication number in this embodiment is used to implement the method for automatically verifying a communication number, so for the specific implementation steps of the system 10 for automatically verifying a communication number, reference may be made to the description of the method for automatically verifying a communication number, and details are not described here again.
The system for automatically verifying a communication number provided by the invention crawls the communication number by means of a crawler, matches the published content against the actual content of the number through automatic calling and voice recognition, and judges whether the communication number is correct. The communication number is thereby verified automatically, problems with customer service numbers published on various websites are found in time, and the user experience is improved. In addition, automatic crawling and checking of numbers removes the workload of manual checking, which improves working efficiency, reduces labor cost, and safeguards the quality of the service the enterprise provides to its users.
An embodiment of the invention also provides a device for automatically verifying a communication number, which comprises a processor and a memory having stored therein executable instructions of the processor, wherein the processor is configured to perform the steps of the above method of automatically verifying a communication number by executing the executable instructions.
As above, this embodiment crawls the communication number by means of a crawler and matches the published content against the actual content of the number through automatic calling and voice recognition, so that problematic customer service numbers are detected automatically and relevant personnel can be notified in time to update the published content or the number. This improves detection efficiency, surfaces problems quickly, and reduces labor cost.
As will be appreciated by one skilled in the art, aspects of the present invention may be embodied as a system, method or program product. Thus, various aspects of the invention may be embodied in the form of: an entirely hardware embodiment, an entirely software embodiment (including firmware, microcode, etc.), or an embodiment combining hardware and software aspects, all of which may generally be referred to herein as a "circuit," "module," or "platform."
Fig. 5 is a schematic structural diagram of an apparatus for automatically verifying a communication number according to an embodiment of the present invention. An automatic verification communication number device 600 according to this embodiment of the present invention is described below with reference to fig. 5. The automatic verification communication number device 600 shown in fig. 5 is only an example and should not impose any limitation on the functionality and scope of use of embodiments of the present invention.
As shown in Fig. 5, the automatic verification communication number device 600 is in the form of a general purpose computing device. The components of the automatic verification communication number device 600 may include, but are not limited to: at least one processing unit 610, at least one storage unit 620, a bus 630 connecting the different platform components (including the storage unit 620 and the processing unit 610), a display unit 640, etc.
The storage unit 620 stores program code executable by the processing unit 610 to cause the processing unit 610 to perform the steps according to various exemplary embodiments of the present invention described in the above section of this specification on the method of automatically verifying a communication number. For example, the processing unit 610 may perform the steps shown in Fig. 1.
The storage unit 620 may include readable media in the form of volatile memory units, such as a random access memory unit (RAM) 6201 and/or a cache memory unit 6202, and may further include a read-only memory unit (ROM) 6203.
The storage unit 620 may also include a program/utility 6204 having a set (at least one) of program modules 6205, such program modules 6205 including, but not limited to: an operating system, one or more application programs, other program modules, and program data, each of which, or some combination thereof, may comprise an implementation of a network environment.
Bus 630 may be one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, an accelerated graphics port, a processing unit, or a local bus using any of a variety of bus architectures.
The automatic verification communication number device 600 may also communicate with one or more external devices 700 (e.g., keyboard, pointing device, bluetooth device, etc.), with one or more devices that enable a user to interact with the automatic verification communication number device 600, and/or with any devices (e.g., router, modem, etc.) that enable the automatic verification communication number device 600 to communicate with one or more other computing devices. Such communication may occur via an input/output (I/O) interface 650. Also, the automatic verification communication number device 600 may also communicate with one or more networks (e.g., a Local Area Network (LAN), a Wide Area Network (WAN), and/or a public network, such as the internet) through the network adapter 660. The network adapter 660 may communicate with other modules of the automatic verification communication number device 600 via the bus 630. It should be appreciated that although not shown in the figures, other hardware and/or software modules may be used in conjunction with the automatic verification communication number device 600, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives, and data backup storage platforms, to name a few.
An embodiment of the present invention further provides a computer-readable storage medium for storing a program which, when executed, implements the steps of the method for automatically verifying a communication number in the foregoing embodiment. In some possible embodiments, aspects of the present invention may also be implemented in the form of a program product comprising program code which, when the program product is run on a terminal device, causes the terminal device to perform the steps according to various exemplary embodiments of the present invention described in the above section of this specification on the method for automatically verifying a communication number.
As described above, when the program of the computer-readable storage medium of this embodiment is executed, the communication number is crawled by using a crawler technology, and the delivered content is matched with the actual content of the number by automatic calling and voice recognition, so that the customer service number with problems is automatically detected, and then relevant persons can be timely notified to update the delivered content or number, thereby improving the detection efficiency, quickly finding the problem, and reducing the labor cost.
Fig. 6 is a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention. Referring to fig. 6, a program product 800 for implementing the above method according to an embodiment of the present invention is described, which may employ a portable compact disc read only memory (CD-ROM) and include program code, and may be run on a terminal device, such as a personal computer. However, the program product of the present invention is not limited in this regard and, in the present document, a readable storage medium may be any tangible medium that can contain, or store a program for use by or in connection with an instruction execution system, apparatus, or device.
The program product may employ any combination of one or more readable media. The readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the foregoing. More specific examples (a non-exhaustive list) of the readable storage medium include: an electrical connection having one or more wires, a portable disk, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the foregoing.
A computer readable signal medium may include a propagated data signal with readable program code embodied therein, for example, in baseband or as part of a carrier wave. Such a propagated data signal may take many forms, including, but not limited to, electro-magnetic, optical, or any suitable combination thereof. A readable signal medium may also be any readable medium that is not a readable storage medium and that can communicate, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code embodied on a readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wireline, optical fiber cable, RF, etc., or any suitable combination of the foregoing.
Program code for carrying out operations for aspects of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++ or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The program code may execute entirely on the user's computing device, partly on the user's device, as a stand-alone software package, partly on the user's computing device and partly on a remote computing device, or entirely on the remote computing device or server. In the case of a remote computing device, the remote computing device may be connected to the user computing device through any kind of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or may be connected to an external computing device (e.g., through the internet using an internet service provider).
The method, system, device and storage medium for automatically verifying a communication number crawl the communication number using crawler technology and, through automatic calling and voice recognition, match the delivered content against the content actually heard at the number. Problematic customer service numbers are thereby detected automatically, so that the relevant personnel can be notified in time to update the delivered content or the number, which improves detection efficiency, surfaces problems quickly, and reduces labor cost.
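The call-and-match flow summarized above (steps S30 to S60 of the claims) can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the names `CrawledNumber`, `verify_number`, `fake_call` and `fake_transcribe` are hypothetical, the telephony and speech recognition back ends are passed in as plain callables, and a simple substring test stands in for whatever matching rule a real system would use.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class CrawledNumber:
    number: str      # crawled communication number
    text_field: str  # text field taken from the number feature fields


def verify_number(
    record: CrawledNumber,
    place_call: Callable[[str], bytes],
    transcribe: Callable[[bytes], str],
) -> str:
    """Return "correct" or "abnormal" for one crawled number."""
    audio = place_call(record.number)  # S30: call the number, record the response
    voice_text = transcribe(audio)     # S40: convert the response voice to text
    # Assumed matching rule: the text field must occur in the voice text.
    if record.text_field and record.text_field in voice_text:
        return "correct"               # S50
    return "abnormal"                  # S60


# Stand-in back ends so the sketch runs without telephony or speech services.
def fake_call(number: str) -> bytes:
    return b"audio-for-" + number.encode()


def fake_transcribe(audio: bytes) -> str:
    return "welcome to the Example Hotel service line"
```

Passing the back ends in as callables keeps the decision logic testable on its own; a deployment would substitute its own VoIP client and speech recognizer.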
The method judges the type of each search result according to the semantics of its hypertext markup language (HTML) tags and applies a different content crawling mode to each type of search result, so it is suitable for crawling all webpage types among the search results of the target website, has higher universality, and improves crawling accuracy.
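As a rough illustration of choosing a crawling mode from HTML tag semantics, the sketch below classifies a result fragment by the tags it contains and then applies a per-type extraction pattern. The three result types, the `SEGMENTS` patterns and the `PHONE` number pattern are assumptions made for this example; the specification does not enumerate the types or the extraction rules.

```python
import re
from html.parser import HTMLParser

PHONE = re.compile(r"\d{3,4}-\d{3,4}-\d{3,4}")  # illustrative number pattern only


class TagCollector(HTMLParser):
    """Collect the names of all start tags seen in an HTML fragment."""

    def __init__(self):
        super().__init__()
        self.tags = set()

    def handle_starttag(self, tag, attrs):
        self.tags.add(tag)


def result_type(html):
    """Judge the result type from the tags present (assumed three types)."""
    collector = TagCollector()
    collector.feed(html)
    if "table" in collector.tags:
        return "table"
    if "li" in collector.tags:
        return "list"
    return "text"


# One preset content crawling mode per type: which segments to scan.
SEGMENTS = {
    "table": re.compile(r"<td[^>]*>(.*?)</td>", re.S),
    "list": re.compile(r"<li[^>]*>(.*?)</li>", re.S),
    "text": re.compile(r"(.+)", re.S),
}


def crawl_numbers(html):
    """Return the detected type and the numbers found by that type's mode."""
    kind = result_type(html)
    numbers = []
    for segment in SEGMENTS[kind].findall(html):
        numbers.extend(PHONE.findall(segment))
    return kind, numbers
```

Keeping the modes in a lookup table means a new result type can be supported by adding one entry rather than changing the control flow.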
The invention stores the communication number which is verified to be correct, avoids the repeated verification of the communication number within a certain time threshold range by searching the stored data and judging the time stamp, and further improves the efficiency of the automatic verification of the communication number on the premise of ensuring the accuracy.
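The timestamp check described above (steps S21 and S22 of claim 1) can be sketched as follows. The in-memory dictionary `verified`, the function names and the use of seconds as the time unit are assumptions for illustration; the specification does not fix the storage back end or the threshold value.

```python
def needs_call(number, crawled_ts, verified, threshold_seconds):
    """Return True when the number must be called (S30), False to reuse S50.

    `verified` maps a communication number to the timestamp stored when it
    was last judged correct.
    """
    stored_ts = verified.get(number)
    if stored_ts is None:
        # S21: the number is not in the stored data, so it must be called.
        return True
    # S22: call again only when the time difference exceeds the threshold.
    return crawled_ts - stored_ts > threshold_seconds


def record_verified(number, crawled_ts, verified):
    """Store a number that was judged correct together with its timestamp."""
    verified[number] = crawled_ts
```

For example, with a threshold of one day (86400 seconds), a number verified an hour ago is treated as correct without a new call, while one verified more than a day ago is called again.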
The foregoing is a more detailed description of the invention in connection with specific preferred embodiments and it is not intended that the invention be limited to these specific details. For those skilled in the art to which the invention pertains, several simple deductions or substitutions can be made without departing from the spirit of the invention, and all shall be considered as belonging to the protection scope of the invention.

Claims (8)

1. A method of automatically validating a communication number, comprising the steps of:
S10, searching in a target website by using keywords through a crawler technology, and crawling a communication number in a search result and a plurality of number characteristic fields related to the communication number, wherein the number characteristic fields comprise text fields;
S30, calling the crawled communication number and acquiring response voice;
S40, converting the response voice into a voice text, matching the text field in the number feature field with the voice text, if the matching is successful, executing S50, and if the matching is failed, executing S60;
S50, judging that the communication number is correct;
S60, judging the communication number is abnormal, and outputting a judgment result;
the step S50 includes judging the communication number is correct, storing the communication number and number characteristic field after matching successfully in the step S40;
the number feature field includes a timestamp, and after step S10, the method further includes the steps of:
S21, searching the crawled communication number in the stored data, executing S30 if the communication number is not searched, and executing S22 if the communication number is searched;
S22, calculating whether the difference between the crawled time stamp and the time stamp corresponding to the communication number in the stored data exceeds a threshold value, if not, executing the step S50, and if so, executing the step S30.
2. The method of automatically verifying a communication number as recited in claim 1, wherein the step S10 includes:
S11, searching in the target website by using the keywords;
S12, judging the type of the search result according to the tag semantics of the hypertext markup language of the search result;
and S13, extracting the communication number and the number characteristic field in the search result by using a corresponding preset content crawling mode according to the type of the search result.
3. The method of automatically verifying a communication number as recited in claim 2, wherein the step S11 includes: acquiring the keywords from a webpage delivery system.
4. The method of automatically verifying a communication number as recited in claim 1, wherein the step S30 includes: reading the crawled communication number, calling the communication number by using IP voice communication, starting recording, and stopping recording after detecting an on-hook prompt tone.
5. The method of automatically verifying a communication number as recited in claim 1, wherein step S40 includes:
S41, judging whether the response voice contains the human voice frequency band through audio frequency spectrum analysis, if so, executing a step S42, and if not, executing a step S60;
S42, performing character conversion on the response voice to obtain a voice text;
S43, extracting the characteristic nouns in the voice text, and matching the text fields in the number characteristic fields with the characteristic nouns.
6. A system for automatically verifying a communication number, implementing the steps of the method for automatically verifying a communication number according to any one of claims 1 to 5, comprising:
the crawler module searches in a target website by using keywords through a crawler technology, crawls a communication number in a search result and a plurality of number characteristic fields related to the communication number, and the number characteristic fields comprise text fields;
the calling module is used for calling the crawled communication number and acquiring response voice;
the voice analysis module is used for converting the response voice into a voice text;
and the matching analysis module is used for matching the text field in the number characteristic field with the voice text, judging that the communication number is correct if the matching is successful, judging that the communication number is abnormal if the matching is failed, and outputting a judgment result.
7. An apparatus for automatically verifying a communication number, comprising:
a processor;
a memory having stored therein executable instructions of the processor;
wherein the processor is configured to perform the steps of the method of automatically verifying a communication number of any of claims 1 to 5 via execution of the executable instructions.
8. A computer-readable storage medium storing a program which, when executed, performs the steps of the method of automatically verifying a communication number of any one of claims 1 to 5.
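The spectrum check of step S41 in claim 5 can be illustrated with a small band-energy test. This is only a sketch under stated assumptions: the 300 to 3400 Hz limits are the conventional narrowband telephony voice band rather than values given in the claims, the 0.5 threshold is arbitrary, and a naive discrete Fourier transform is used so that the example needs only the standard library. Energy in the voice band is a crude proxy and does not by itself prove that a person or recorded greeting is speaking.

```python
import cmath
import math


def band_energy_ratio(samples, sample_rate, low_hz=300.0, high_hz=3400.0):
    """Fraction of spectral energy that lies inside [low_hz, high_hz]."""
    n = len(samples)
    total = 0.0
    in_band = 0.0
    for k in range(1, n // 2):  # skip DC, scan positive-frequency bins
        coeff = sum(
            s * cmath.exp(-2j * math.pi * k * i / n)
            for i, s in enumerate(samples)
        )
        energy = abs(coeff) ** 2
        total += energy
        if low_hz <= k * sample_rate / n <= high_hz:
            in_band += energy
    return in_band / total if total > 0 else 0.0


def contains_voice_band(samples, sample_rate, threshold=0.5):
    """Assumed decision rule: most of the energy sits in the voice band."""
    return band_energy_ratio(samples, sample_rate) >= threshold


def sine(freq_hz, sample_rate, n):
    """Generate n samples of a unit sine wave, used here as test input."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]
```

With 400 samples at 8000 Hz, a 1000 Hz tone passes the check, while a 60 Hz hum or pure silence does not, so such recordings would go straight to step S60 without speech recognition.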

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810853313.XA CN108959646B (en) 2018-07-30 2018-07-30 Method, system, device and storage medium for automatically verifying communication number

Publications (2)

Publication Number Publication Date
CN108959646A CN108959646A (en) 2018-12-07
CN108959646B true CN108959646B (en) 2021-03-12

Family

ID=64466522

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810853313.XA Active CN108959646B (en) 2018-07-30 2018-07-30 Method, system, device and storage medium for automatically verifying communication number

Country Status (1)

Country Link
CN (1) CN108959646B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112615965B (en) * 2020-12-01 2023-04-11 北京皮尔布莱尼软件有限公司 Communication number verification method and system and computing device
CN113139384A (en) * 2021-04-28 2021-07-20 北京百度网讯科技有限公司 Telephone verification, map processing and knowledge graph processing method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104199851A (en) * 2014-08-11 2014-12-10 北京奇虎科技有限公司 Method for extracting telephone numbers according to yellow page information and cloud server
US10027767B2 (en) * 2015-08-25 2018-07-17 Myung Bean Song Method for providing SNS-based file aging service
CN105426675A (en) * 2015-11-13 2016-03-23 江苏大学 Full-automatic hospital telephone follow-up method and telephone device thereof
CN106021439A (en) * 2016-05-16 2016-10-12 腾讯科技(深圳)有限公司 Communication number processing method and device
CN107493353A (en) * 2017-10-11 2017-12-19 宁波感微知著机器人科技有限公司 A kind of intelligent robot cloud computing method based on contextual information

Also Published As

Publication number Publication date
CN108959646A (en) 2018-12-07

Similar Documents

Publication Publication Date Title
CN107577947B (en) Vulnerability detection method and system for information system, storage medium and electronic equipment
US9215245B1 (en) Exploration system and method for analyzing behavior of binary executable programs
US9667644B2 (en) Risk identification
CN112183782B (en) Fault work order processing method and equipment
US7913233B2 (en) Performance analyzer
CN110798445B (en) Public gateway interface testing method and device, computer equipment and storage medium
CN110909229A (en) Webpage data acquisition and storage system based on simulated browser access
CN111104579A (en) Identification method and device for public network assets and storage medium
CN111563257B (en) Data detection method and device, computer readable medium and terminal equipment
CN107332765A (en) Method and apparatus for repairing router failure
CN108848276A (en) Telephone number method for detecting availability, system, equipment and storage medium
CN109684863B (en) Data leakage prevention method, device, equipment and storage medium
CN108959646B (en) Method, system, device and storage medium for automatically verifying communication number
CN110347573B (en) Application program analysis method, device, electronic equipment and computer readable medium
CN113032834A (en) Database table processing method, device, equipment and storage medium
CN113434400A (en) Test case execution method and device, computer equipment and storage medium
CN109657462B (en) Data detection method, system, electronic device and storage medium
CN103001934A (en) Terminal application login method and terminal application login system
CN111243580B (en) Voice control method, device and computer readable storage medium
CN110245059B (en) Data processing method, device and storage medium
CN116186716A (en) Security analysis method and device for continuous integrated deployment
CN111367531A (en) Code processing method and device
CN108804501B (en) Method and device for detecting effective information
CN111966630A (en) File type detection method, device, equipment and medium
CN110688558B (en) Webpage searching method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant