CN110784591A - Intelligent voice automatic detection method, device and system - Google Patents

Intelligent voice automatic detection method, device and system Download PDF

Info

Publication number
CN110784591A
CN110784591A CN201910908123.8A CN201910908123A CN110784591A CN 110784591 A CN110784591 A CN 110784591A CN 201910908123 A CN201910908123 A CN 201910908123A CN 110784591 A CN110784591 A CN 110784591A
Authority
CN
China
Prior art keywords
task
recording
module
voice recognition
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910908123.8A
Other languages
Chinese (zh)
Inventor
李钻达
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
FUJIAN NEW LAND SOFTWARE ENGINEERING Co Ltd
Original Assignee
FUJIAN NEW LAND SOFTWARE ENGINEERING Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by FUJIAN NEW LAND SOFTWARE ENGINEERING Co Ltd filed Critical FUJIAN NEW LAND SOFTWARE ENGINEERING Co Ltd
Priority to CN201910908123.8A priority Critical patent/CN110784591A/en
Publication of CN110784591A publication Critical patent/CN110784591A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/64Automatic arrangements for answering calls; Automatic arrangements for recording messages for absent subscribers; Arrangements for recording conversations
    • H04M1/65Recording arrangements for recording a message from the calling party
    • H04M1/656Recording arrangements for recording a message from the calling party for recording conversations
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72403User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality
    • H04M1/7243User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages
    • H04M1/72433User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality with interactive means for internal management of messages for voice messaging, e.g. dictaphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04MTELEPHONIC COMMUNICATION
    • H04M1/00Substation equipment, e.g. for use by subscribers
    • H04M1/72Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72484User interfaces specially adapted for cordless or mobile telephones wherein functions are triggered by incoming communication events

Landscapes

  • Engineering & Computer Science (AREA)
  • Human Computer Interaction (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Business, Economics & Management (AREA)
  • General Business, Economics & Management (AREA)
  • Telephone Function (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention discloses an intelligent voice automatic detection method, a device and a system, wherein a target number of a detection task is obtained according to task information issued by a user, and a system call module is called to dial the target number; judging whether the call is connected or not, and muting and recording the call; judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording through an automatic testing tool; converting the format of the generated recording file, calling a voice recognition module and obtaining a voice recognition result; calling a semantic analysis module to intelligently analyze the voice recognition result and obtain an analysis result; judging whether the task is finished or not according to a result returned by the semantic analysis module; clicking a target key when the information returned by the semantic analysis module contains key information; the recognized voice recognition result, the key and the starting and ending time of the step task. The invention can automatically identify and record the conversation content and the result of the key recording file so as to determine whether the task execution is successful.

Description

Intelligent voice automatic detection method, device and system
Technical Field
The invention relates to the technical field of intelligent voice recognition, in particular to an intelligent voice automatic detection method, device and system.
Background
With the rapid development of artificial intelligence technology, speech recognition technology is becoming more and more powerful, wherein the smart platform (science news) is one of the strongest speech recognition platforms at present and is widely used. The problem to be solved by the Auto Speech recognition technology (ASR for short) in science and science news is to make the device Recognize the human Speech and "extract" the text information contained in the Speech. ASR technology plays an important role in current electronic devices, making electronic devices have the function of recognizing human speech, and making human-computer communication and interaction more convenient.
However, the automatic speech recognition technology of science and technology news only provides functions such as speech recognition, and can only operate real-time audio streams of existing audio files or microphones of equipment, and cannot perform speech recognition operation under the Android mobile phone call condition, and cannot perform subsequent detection. In addition, the automatic speech recognition technology of science news does not provide the record of the related results of task execution, and cannot provide detailed detection results.
Disclosure of Invention
The invention aims to solve the technical problem of how to provide an intelligent voice automatic detection method and system which can perform voice recognition operation under the condition of mobile phone conversation and can provide detailed detection results.
In order to solve the technical problems, the technical scheme of the invention is as follows:
an intelligent voice automatic detection method comprises the following steps:
acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
judging whether the call is connected, if so, muting and recording the call;
judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording through an automatic testing tool;
converting the format of the generated recording file, transmitting the converted file to a voice recognition module, and calling the voice recognition module to obtain a voice recognition result;
calling a semantic analysis module to intelligently analyze the voice recognition result and obtain an analysis result;
judging whether the task is finished or not according to a result returned by the semantic analysis module;
clicking a target key when the information returned by the semantic analysis module contains key information;
and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
Preferably, the step of calling the semantic analysis module to intelligently analyze the speech recognition result and obtain the analysis result includes:
calling a semantic analysis module through a network request, and transmitting task information and a voice recognition result of the current step to the semantic analysis module;
the semantic analysis module judges whether a successful target keyword exists or not, and if so, returns end information;
the semantic analysis module checks the task information association library to acquire menu association information;
the semantic analysis module divides the speech recognition result of the current step into words and phrases;
the semantic analysis module analyzes the task information and acquires a target of the current task information;
the semantic analysis module compares the phrases with the target by screening the phrases to obtain the similarity of each phrase, and then returns the key with the highest similarity.
Preferably, the step of converting the format of the generated recording file, transmitting the converted file to a voice recognition module, and calling the voice recognition module to obtain the voice recognition result includes:
searching a recording file meeting the conditions in a recording folder of the system according to the recording time during recording;
after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
Preferably, the recording of the task result, the recognized voice recognition result, the key, and the start and end time of the step task includes:
if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service;
the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone;
and the system terminates the detection task by applying a background service and records the voice content, the key-press process and the step execution time of each step.
Preferably, the step of obtaining a target number of the detection task according to task information issued by the user, and the calling the system call module to dial the target number includes:
configuring task information and task scripts by a user, and issuing the task information;
the system regularly acquires task information issued by a user, and notifies a background service to execute a task through a message communication mechanism after acquiring the task;
and the system application calls a target number in the task information by calling a call module of the system.
The invention also provides an intelligent voice automatic detection device, which comprises:
a background task module: acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
a recording module: judging whether the call is connected, if so, muting and recording the call;
automated test tool: judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording;
the audio conversion module: converting the format of the generated recording file, and transmitting the converted file to a voice recognition module;
a voice recognition module: carrying out voice recognition on the audio to obtain a voice recognition result;
a semantic analysis module: carrying out intelligent analysis on the voice recognition result to obtain an analysis result, and clicking a target key when the returned information contains key information;
a judging module: judging whether the task is finished or not according to a result returned by the semantic analysis module;
a recording module: and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
Preferably, the semantic analysis module includes:
the information receiving unit is used for receiving the task information and the voice recognition result of the current step;
keyword unit: judging whether a successful target keyword exists or not, and if so, returning to end information;
a viewing unit: checking the task information association library to acquire menu association information;
word segmentation unit: performing word segmentation on the voice recognition result of the current step, and dividing the voice recognition result into word groups;
an analysis unit: analyzing the task information to obtain a target of the current task information;
an alignment unit: and through screening the phrases, comparing the phrases with the target to obtain the similarity of each phrase, and then returning the key with the highest similarity.
Preferably, the audio conversion module includes:
a file searching unit: searching a recording file meeting the conditions in a recording folder of the system according to the recording time during recording;
a conversion unit: after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
a task acquisition unit: and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
Preferably, the recording module: if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service; the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone; the system application background service terminates the detection task and records the voice content, the key process and the step execution time of each step;
the background task module comprises: a configuration unit: configuring task information and task scripts by a user, and issuing the task information; a system application unit: and regularly acquiring task information issued by a user, and informing a background service to execute a task through a message communication mechanism after the task is acquired.
An intelligent voice automated detection system comprising:
the background service module regularly accesses the task server and checks whether a task needs to be executed;
the task analysis module is used for analyzing the task information to obtain required task information and scripts;
the message communication module is used for carrying out communication transmission for the background service;
the automatic testing tool module is used for acquiring the content on the mobile phone page and simulating the clicking and sliding operations of a user;
the system application obtains the call recording file and then converts the existing format into an audio stream through the audio conversion module;
the database processing module is used for realizing the operations of adding, modifying, deleting and inquiring the database and recording the task information and the task result;
the network communication module requests the voice recognition service to recognize the audio stream of the call recording to obtain the recognition result of the audio stream; requesting semantic recognition service, transmitting the result of the voice recognition service to a semantic recognition module, and obtaining a recognition result;
the voice recognition module is used for recognizing the audio stream transmitted by the Android application and returning a character recognition result;
and the semantic analysis module analyzes the key with the highest similarity through the task information and the voice recognition character result transmitted by the system application, obtains the key result with the highest similarity through the comparison result of the association library and the task information and the voice recognition character result, and returns the result to the application to perform key operation.
By adopting the technical scheme, the mobile phone can automatically record sound, call voice recognition service, intelligently judge the key according to task requirements, automatically end the detection task, terminate the call, record the detection result and the like during the call. The method is used for automatic detection, and the mobile phones are controlled by the application to carry out operations such as automatic call making, automatic recording, automatic call content recognition, automatic key pressing and the like, and a plurality of mobile phones can be controlled to carry out intelligent voice automatic detection. In addition, by adopting the scheme, the result information such as the approximate conversation text content, the key, the recording file and the like of each step in the task can be recorded, and whether the task is successfully executed is determined according to the menu association library and the successful keyword.
Drawings
FIG. 1 is a flow chart of an embodiment of an intelligent voice automated detection method of the present invention;
FIG. 2 is a detailed flowchart of step S50 in FIG. 1;
FIG. 3 is a block diagram of an embodiment of an intelligent voice automated detection system according to the present invention.
In the figure, the system comprises a background service module 10, a task analysis module 20, a message communication module 30, an automatic test tool module 40, an audio conversion module 50, a database processing module 60, a network communication module 70, a voice recognition module 80 and a semantic analysis module 90.
Detailed Description
The following further describes embodiments of the present invention with reference to the drawings. It should be noted that the description of the embodiments is provided to help understanding of the present invention, but the present invention is not limited thereto. In addition, the technical features involved in the embodiments of the present invention described below may be combined with each other as long as they do not conflict with each other.
Referring to fig. 1, an embodiment of the present invention provides an intelligent voice automatic detection method, including:
s10: acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
specifically, step S10 includes:
configuring necessary task information and task scripts by a user, and issuing the task information;
the system regularly acquires task information issued by a user, and notifies a background service to execute a task through a message communication mechanism after acquiring the task;
and the system application calls a target number in the task information by calling a call module of the system.
S20: judging whether the call is connected, if so, muting and recording the call;
monitoring the mobile phone page through an automatic testing tool, judging that the call is connected when the mobile phone page is subjected to specific change, and continuing subsequent operation; automatic microphone muting and automatic recording starting operation are carried out through an automatic testing tool, and the automatic microphone muting avoids other external interference so as to obtain a better recording effect;
s30: judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording through an automatic testing tool;
s40: converting the format of the generated recording file, transmitting the converted file to a voice recognition module, and calling the voice recognition module to obtain a voice recognition result;
specifically, a recording file meeting the conditions is searched in a recording folder of the system according to the recording time during recording;
after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
S50: calling a semantic analysis module to intelligently analyze the voice recognition result and obtain an analysis result;
referring to fig. 2, specifically, step S50 includes:
s51: calling a semantic analysis module through a network request, and transmitting task information and a voice recognition result of the current step to the semantic analysis module;
s52: the semantic analysis module judges whether a successful target keyword exists or not, and if so, returns end information;
s53: the semantic analysis module checks the task information association library to acquire menu association information;
s54: the semantic analysis module divides the speech recognition result of the current step into words and phrases;
s55: the semantic analysis module analyzes the task information and acquires a target of the current task information;
s56: the semantic analysis module compares the phrases with the target by screening the phrases to obtain the similarity of each phrase, and then returns the key with the highest similarity.
S60: judging whether the task is finished or not according to a result returned by the semantic analysis module;
s70: clicking a target key when the information returned by the semantic analysis module contains key information;
if the result returned by the semantic analysis service does not contain a task termination instruction, the Android application background service is notified to click a target key; and the Android application background service calls an automatic test tool to simulate a user to click a target key.
S80: and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
Specifically, step S80 includes:
if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service;
the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone;
and the system terminates the detection task by applying a background service and records the voice content, the key-press process and the step execution time of each step.
By adopting the technical scheme, the mobile phone can automatically record sound, call voice recognition service, intelligently judge the key according to task requirements, automatically end the detection task, terminate the call, record the detection result and the like during the call. The method is used for automatic detection, and the mobile phones are controlled by the application to carry out operations such as automatic call making, automatic recording, automatic call content recognition, automatic key pressing and the like, and a plurality of mobile phones can be controlled to carry out intelligent voice automatic detection. In addition, by adopting the scheme, the result information such as the approximate conversation text content, the key, the recording file and the like of each step in the task can be recorded, and whether the task is successfully executed is determined according to the menu association library and the successful keyword.
The invention also provides an intelligent voice automatic detection device which comprises the following components: the method comprises the following steps:
a background task module: acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
the background task module comprises: a configuration unit: configuring necessary task information and task scripts by a user, and issuing the task information; a system application unit: and regularly acquiring task information issued by a user, and informing a background service to execute a task through a message communication mechanism after the task is acquired.
A recording module: judging whether the call is connected, if so, muting and recording the call;
automated test tool: judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording;
the audio conversion module: converting the format of the generated recording file, and transmitting the converted file to a voice recognition module;
specifically, the audio conversion module includes:
a file searching unit: searching a recording file meeting the conditions in a recording folder of the system according to the recording time during recording;
a conversion unit: after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
a task acquisition unit: and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
A voice recognition module: carrying out voice recognition on the audio to obtain a voice recognition result;
a semantic analysis module: carrying out intelligent analysis on the voice recognition result to obtain an analysis result, and clicking a target key when the returned information contains key information;
specifically, the semantic analysis module includes:
the information receiving unit is used for receiving the task information and the voice recognition result of the current step;
keyword unit: judging whether a successful target keyword exists or not, and if so, returning to end information;
a viewing unit: checking the task information association library to acquire menu association information;
word segmentation unit: performing word segmentation on the voice recognition result of the current step, and dividing the voice recognition result into word groups;
an analysis unit: analyzing the task information to obtain a target of the current task information;
an alignment unit: and through screening the phrases, comparing the phrases with the target to obtain the similarity of each phrase, and then returning the key with the highest similarity.
A judging module: judging whether the task is finished or not according to a result returned by the semantic analysis module;
a recording module: and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
Specifically, the recording module: if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service; the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone; the system application background service terminates the detection task and records the voice content, the key process and the step execution time of each step;
referring to fig. 3, another embodiment of the present invention further provides an intelligent voice automatic detection system, including:
a background service module 10, which can access the task server regularly to check whether there is any task to be executed;
a task analysis module 20, which analyzes the task information to obtain the required task information and script;
a message communication module 30 through which communication between background services is transmitted;
the automatic test tool module 40 can acquire the content on the mobile phone page and can simulate the basic operations of clicking, sliding and the like of a user;
the audio conversion module 50 is used for converting the existing format into an audio stream through the Android application after the Android application acquires the call recording file when the system generates the call recording file;
the database processing module 60 is used for realizing the operations of adding, modifying, deleting and inquiring the database and recording the task information and the task result;
a network communication module 70, which is mainly used for network communication and can request a voice recognition service to recognize the audio stream of the call recording to obtain the recognition result of the audio stream; semantic recognition service can be requested, the result of the voice recognition service is transmitted to a semantic recognition module, and a recognition result is obtained;
a voice recognition module 80, which is mainly used for recognizing the audio stream transmitted by the Android application and returning a character recognition result;
and the semantic analysis module 90 analyzes the key with the highest similarity through the task information and the voice recognition character result transmitted by the Android application, acquires the key result with the highest similarity through a comparison result of the association library and the task information and the voice recognition character result, and returns the result to the Android application to perform key operation.
The scheme provides a block diagram of a software architecture of the system, solves the problem that recording and call content recognition cannot be carried out in a call state, and realizes call recording, voice recognition and intelligent key detection in the call state; if the existing universal voice recognition platform is adopted, the call content recognition can not be carried out in the call state; if the manual voice detection is adopted, the labor cost is very high, and the manual voice detection tasks of a plurality of mobile phones cannot be simultaneously carried out, and occasionally some errors occur. By adopting the scheme, automatic detection can be carried out through the method, the mobile phones are controlled through application to carry out operations such as automatic dialing, automatic recording, automatic call content identification, automatic key pressing and the like, multiple mobile phones can be controlled to carry out intelligent voice automatic detection, and the success rate is more than nine-five percent under the condition that the content of the configuration file is correct.
The problem of recording the task information is solved through the scheme, and the recording of information such as the task information, the task result, the conversation content and the like is realized. If the current universal voice recognition platform is adopted, the task information cannot be recorded. If the voice detection is carried out manually, the labor cost is very high, and common workers cannot carry out operations such as call content recording at normal speed for a long time, by adopting the scheme, result information such as rough call text content, keys, recording files and the like of each step in the task can be recorded, and whether the task is successfully executed is determined according to the menu association library and the success keywords.
The embodiments of the present invention have been described in detail with reference to the accompanying drawings, but the present invention is not limited to the described embodiments. It will be apparent to those skilled in the art that various changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, and the scope of protection is still within the scope of the invention.

Claims (10)

1. An intelligent voice automatic detection method is characterized in that: the method comprises the following steps:
acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
judging whether the call is connected, if so, muting and recording the call;
judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording through an automatic testing tool;
converting the format of the generated recording file, transmitting the converted file to a voice recognition module, and calling the voice recognition module to obtain a voice recognition result;
calling a semantic analysis module to intelligently analyze the voice recognition result and obtain an analysis result;
judging whether the task is finished or not according to a result returned by the semantic analysis module;
clicking a target key when the information returned by the semantic analysis module contains key information;
and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
2. The intelligent voice automation detection method of claim 1, characterized in that: calling a semantic analysis module, carrying out intelligent analysis on a voice recognition result, and acquiring an analysis result, wherein the method comprises the following steps:
calling a semantic analysis module through a network request, and transmitting task information and a voice recognition result of the current step to the semantic analysis module;
the semantic analysis module judges whether a successful target keyword exists or not, and if so, returns end information;
the semantic analysis module checks the task information association library to acquire menu association information;
the semantic analysis module divides the speech recognition result of the current step into words and phrases;
the semantic analysis module analyzes the task information and acquires a target of the current task information;
the semantic analysis module compares the phrases with the target by screening the phrases to obtain the similarity of each phrase, and then returns the key with the highest similarity.
3. The intelligent voice automation detection method of claim 1, characterized in that: converting the format of the generated recording file, transmitting the converted file to a voice recognition module, calling the voice recognition module, and acquiring a voice recognition result, wherein the voice recognition result comprises the following steps:
searching a recording file meeting the conditions in a recording folder of the system according to the recording time during recording;
after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
4. The intelligent voice automation detection method of claim 1, characterized in that: recording a task result, wherein the recognized voice recognition result, the key and the start and end time of the step task comprise:
if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service;
the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone;
and the system terminates the detection task by applying a background service and records the voice content, the key-press process and the step execution time of each step.
5. The intelligent voice automatic detection method according to claim 1, wherein the step of obtaining a target number of the detection task according to task information issued by a user, and the calling the system call module to dial the target number comprises:
configuring task information and task scripts by a user, and issuing the task information;
the system regularly acquires task information issued by a user, and notifies a background service to execute a task through a message communication mechanism after acquiring the task;
and the system application calls a target number in the task information by calling a call module of the system.
6. The utility model provides an automatic detection device of intelligence pronunciation which characterized in that: the method comprises the following steps:
a background task module: acquiring a target number of a detection task according to task information issued by a user, and calling a system call module to dial the target number;
a recording module: judging whether the call is connected, if so, muting and recording the call;
automated test tool: judging whether the recording can be finished or not through the configuration file, and if the recording can be finished, finishing the recording;
the audio conversion module: converting the format of the generated recording file, and transmitting the converted file to a voice recognition module;
a voice recognition module: carrying out voice recognition on the audio to obtain a voice recognition result;
a semantic analysis module: carrying out intelligent analysis on the voice recognition result to obtain an analysis result, and clicking a target key when the returned information contains key information;
a judging module: judging whether the task is finished or not according to a result returned by the semantic analysis module;
a recording module: and recording a task result, a recognized voice recognition result, a key and the start and end time of the step task.
7. The intelligent voice automation detection device of claim 6, wherein: the semantic analysis module comprises:
the information receiving unit is used for receiving the task information and the voice recognition result of the current step;
keyword unit: judging whether a successful target keyword exists or not, and if so, returning to end information;
a viewing unit: checking the task information association library to acquire menu association information;
word segmentation unit: performing word segmentation on the voice recognition result of the current step, and dividing the voice recognition result into word groups;
an analysis unit: analyzing the task information to obtain a target of the current task information;
an alignment unit: and through screening the phrases, comparing the phrases with the target to obtain the similarity of each phrase, and then returning the key with the highest similarity.
8. The intelligent voice automation detection device of claim 6, wherein: the audio conversion module includes:
a file searching unit: searching a recording file meeting the conditions in a recording folder of the system according to the recording time during recording;
a conversion unit: after finding out a sound recording file with a format suffix name of mp3 which meets the conditions, calling an open source audio conversion program integrated in the sound recording file, and converting the sound recording file into an audio stream;
a task acquisition unit: and transmitting the audio stream file obtained by conversion to a voice recognition service, and acquiring a task result returned by the voice recognition service.
9. The intelligent voice automation detection device of claim 6,
a recording module: if the result returned by the semantic analysis module contains task ending information, notifying the system to apply background service; the system uses background service to call an automatic testing tool, and simulates a user to click and hang up the phone; the system application background service terminates the detection task and records the voice content, the key process and the step execution time of each step;
the background task module comprises:
a configuration unit: configuring task information and task scripts by a user, and issuing the task information;
a system application unit: and regularly acquiring task information issued by a user, and informing a background service to execute a task through a message communication mechanism after the task is acquired.
10. An intelligent voice automated detection system, comprising:
the background service module regularly accesses the task server and checks whether a task needs to be executed;
the task analysis module is used for analyzing the task information to obtain required task information and scripts;
the message communication module is used for carrying out communication transmission for the background service;
the automatic testing tool module is used for acquiring the content on the mobile phone page and simulating the clicking and sliding operations of a user;
the system application obtains the call recording file and then converts the existing format into an audio stream through the audio conversion module;
the database processing module is used for realizing the operations of adding, modifying, deleting and inquiring the database and recording the task information and the task result;
the network communication module requests the voice recognition service to recognize the audio stream of the call recording to obtain the recognition result of the audio stream; requesting semantic recognition service, transmitting the result of the voice recognition service to a semantic recognition module, and obtaining a recognition result;
the voice recognition module is used for recognizing the audio stream transmitted by the Android application and returning a character recognition result;
and the semantic analysis module analyzes the key with the highest similarity through the task information and the voice recognition character result transmitted by the system application, obtains the key result with the highest similarity through the comparison result of the association library and the task information and the voice recognition character result, and returns the result to the application to perform key operation.
CN201910908123.8A 2019-09-25 2019-09-25 Intelligent voice automatic detection method, device and system Pending CN110784591A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910908123.8A CN110784591A (en) 2019-09-25 2019-09-25 Intelligent voice automatic detection method, device and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910908123.8A CN110784591A (en) 2019-09-25 2019-09-25 Intelligent voice automatic detection method, device and system

Publications (1)

Publication Number Publication Date
CN110784591A true CN110784591A (en) 2020-02-11

Family

ID=69384345

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910908123.8A Pending CN110784591A (en) 2019-09-25 2019-09-25 Intelligent voice automatic detection method, device and system

Country Status (1)

Country Link
CN (1) CN110784591A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112802475A (en) * 2020-12-30 2021-05-14 厦门市美亚柏科信息股份有限公司 Automatic application login method based on voice recognition and computer readable storage medium
CN112802477A (en) * 2020-12-30 2021-05-14 科大国创云网科技有限公司 Customer service assistant tool service method and system based on voice-to-text conversion
CN113314152A (en) * 2021-07-07 2021-08-27 上海中通吉网络技术有限公司 Method and equipment for judging whether call is effectively dialed out
CN115982000A (en) * 2022-11-28 2023-04-18 上海浦东发展银行股份有限公司 Whole scene voice robot testing system, method and medium
CN117544719A (en) * 2023-11-09 2024-02-09 深圳市恩泰世科技有限公司 Automatic dialing system and method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103856642A (en) * 2014-01-22 2014-06-11 中国移动(深圳)有限公司 Detection method and system
CN106131350A (en) * 2016-09-20 2016-11-16 深圳市同行者科技有限公司 Control method and the device thereof of electricity call is discarded based on speech recognition active release
CN107230478A (en) * 2017-05-03 2017-10-03 上海斐讯数据通信技术有限公司 A kind of voice information processing method and system
WO2018086219A1 (en) * 2016-11-10 2018-05-17 宇龙计算机通信科技(深圳)有限公司 Method, device and terminal for storing call information

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103856642A (en) * 2014-01-22 2014-06-11 中国移动(深圳)有限公司 Detection method and system
CN106131350A (en) * 2016-09-20 2016-11-16 深圳市同行者科技有限公司 Control method and the device thereof of electricity call is discarded based on speech recognition active release
WO2018086219A1 (en) * 2016-11-10 2018-05-17 宇龙计算机通信科技(深圳)有限公司 Method, device and terminal for storing call information
CN107230478A (en) * 2017-05-03 2017-10-03 上海斐讯数据通信技术有限公司 A kind of voice information processing method and system

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112802475A (en) * 2020-12-30 2021-05-14 厦门市美亚柏科信息股份有限公司 Automatic application login method based on voice recognition and computer readable storage medium
CN112802477A (en) * 2020-12-30 2021-05-14 科大国创云网科技有限公司 Customer service assistant tool service method and system based on voice-to-text conversion
CN113314152A (en) * 2021-07-07 2021-08-27 上海中通吉网络技术有限公司 Method and equipment for judging whether call is effectively dialed out
CN115982000A (en) * 2022-11-28 2023-04-18 上海浦东发展银行股份有限公司 Whole scene voice robot testing system, method and medium
CN117544719A (en) * 2023-11-09 2024-02-09 深圳市恩泰世科技有限公司 Automatic dialing system and method

Similar Documents

Publication Publication Date Title
CN110784591A (en) Intelligent voice automatic detection method, device and system
CN107038220B (en) Method, intelligent robot and system for generating memorandum
US11762629B2 (en) System and method for providing a response to a user query using a visual assistant
US8050918B2 (en) Quality evaluation tool for dynamic voice portals
CN105391730A (en) Information feedback method, device and system
CN110992955A (en) Voice operation method, device, equipment and storage medium of intelligent equipment
CN105760399A (en) Data retrieval method and device
CN111210842A (en) Voice quality inspection method, device, terminal and computer readable storage medium
CN111949255A (en) Script compiling method, device, equipment and storage medium based on voice
KR20060050966A (en) Verb error recovery in speech recognition
KR100669948B1 (en) System and method for copying ring back tone transfer sound
US10803861B2 (en) Method and apparatus for identifying information
WO2022142031A1 (en) Invalid call determination method and apparatus, computer device, and storage medium
US8130916B2 (en) Dynamically improving performance of an interactive voice response (IVR) system using a complex events processor (CEP)
CN111563182A (en) Voice conference record storage processing method and device
CN113782026A (en) Information processing method, device, medium and equipment
CN107680592A (en) A kind of mobile terminal sound recognition methods and mobile terminal and storage medium
CN108597499A (en) Method of speech processing and voice processing apparatus
EP3059731A1 (en) Method and apparatus for automatically sending multimedia file, mobile terminal, and storage medium
KR20220121455A (en) Speaker classification system using STT
CN114067842B (en) Customer satisfaction degree identification method and device, storage medium and electronic equipment
CN112714058B (en) Method, system and electronic device for immediately interrupting AI voice
CN114155845A (en) Service determination method and device, electronic equipment and storage medium
CN114297229A (en) Data query method and device, electronic equipment and storage medium
CN111274828B (en) Language translation method, system, computer program and handheld terminal based on message leaving

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200211

RJ01 Rejection of invention patent application after publication