CN107393530B - Service guiding method and device

Info

Publication number: CN107393530B
Application number: CN201710579589.9A
Authority: CN
Inventors: 许传祺; 王静; 杨占孟; 宋豪; 王仁让; 吕潇; 吕媛; 李刚; 王辉; 李航; 王玉华; 薛真; 焦学华; 矫宏; 孙文豪; 张茂伟; 张伟; 韩博文; 陈里军; 胡耀文
Original assignee: State Grid Corp of China SGCC; Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Current assignee: State Grid Corp of China SGCC; Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Priority date: 2017-07-18
Filing date: 2017-07-18
Publication date: 2020-08-25
Anticipated expiration: 2037-07-18
Also published as: CN107393530A

Abstract

The invention provides a service guiding method and device, relating to the technical field of guiding machines. The method comprises: outputting prompt information containing a preset calibration word so that a user reads the calibration word aloud; after receiving a first voice signal acquired by the voice acquisition device, recognizing the first voice signal to obtain a first recognition word; determining, among a plurality of dialect words corresponding to the calibration word, a dialect word whose similarity to the first recognition word exceeds a preset threshold as a target dialect word; and outputting service prompt content in the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt. This solves the technical problem that existing guiding machines cannot provide guiding service to dialect-speaking clients: the guiding machine can 'understand' the dialect spoken by the client and respond in a dialect the client understands, which improves the communication efficiency between the guiding machine and the client.

Description

Service guiding method and device

Technical Field

The invention relates to the technical field of guiding machines, and in particular to a service guiding method and a service guiding device.

Background

At present, during peak payment periods in electric power business halls, customers crowd in to pay at the same time, queues grow long, working and service efficiency is low, and customers are inconvenienced. On the basis of standardized service, most electric power business halls therefore actively offer more user-friendly on-site service and promote the use of 'guiding machines'. During business hours each day, several guiding machines greet customers, direct them to different areas to transact different types of business, answer their inquiries, and guide and help them to use self-service equipment.

However, when a user speaks Mandarin poorly or cannot communicate in Mandarin, the guiding machine cannot accurately recognize the user's questions, let alone provide high-quality guiding service based on them. A lot of time is wasted in the exchange between the user and the guiding machine, which reduces the efficiency of the guiding service.

Disclosure of Invention

In view of the above, an object of the present invention is to provide a service guiding method and device, so as to alleviate the technical problem that guiding machines in the prior art cannot efficiently provide guiding service to clients who speak with a heavy dialect.

In a first aspect, an embodiment of the present invention provides a service guiding method, which is applied to a guiding machine including a voice acquisition device and a voice output device, and the method includes:

outputting prompt information containing preset calibration words so that a user reads the calibration words;

after receiving a first voice signal acquired by the voice acquisition device, recognizing the first voice signal to obtain a first recognition word;

determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words;

and outputting service prompt content by using the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt.

With reference to the first aspect, an embodiment of the present invention provides a first possible implementation manner of the first aspect, where the recognizing the first speech signal to obtain a first recognized word includes:

performing voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal;

performing tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal;

and determining a word formed by the pinyin information and the tone information as the first recognition word.

With reference to the first aspect, an embodiment of the present invention provides a second possible implementation manner of the first aspect, where the determining, among a plurality of dialect words corresponding to the calibration word, a dialect word whose similarity with the first recognition word exceeds a preset threshold as a target dialect word includes:

acquiring a plurality of dialect words corresponding to the calibration words, wherein the dialect words comprise dialect word pinyin and dialect word tone;

determining at least one dialect word with the similarity to the pinyin information exceeding a preset first similarity threshold value as a candidate word according to the pinyin of the dialect words;

and determining a reference word with the similarity degree with the tone information exceeding a preset second similarity threshold value as a target dialect word according to the tone of the dialect word of at least one candidate word.

With reference to the first aspect, an embodiment of the present invention provides a third possible implementation manner of the first aspect, where the outputting service prompt content by using the dialect category corresponding to the target dialect word includes:

acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;

acquiring service prompt content to be output;

searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;

and outputting the plurality of dialect words found, in the order in which the keywords are arranged in the service prompt content.

With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation manner of the first aspect, where the method further includes:

detecting whether a user is located in a preset service area;

when a user is located in the preset service area, outputting prompt content containing service items so that the user can select the service items;

after receiving a second voice signal acquired by the voice acquisition equipment, recognizing the second voice signal to obtain a second recognition word;

and when the second recognition word matches none of the service keywords in a preset service word bank, outputting prompt information containing a preset calibration word.

In a second aspect, an embodiment of the present invention further provides a service guiding apparatus, where the apparatus includes:

the first output module is used for outputting prompt information containing preset calibration words so that a user can read the calibration words aloud;

the first recognition module is used for recognizing a first voice signal after receiving the first voice signal collected by the voice collection equipment to obtain a first recognition word;

the determining module is used for determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words;

and the second output module is used for outputting service prompt contents by using the dialect category corresponding to the target dialect word so as to enable the user to select service items according to the service prompt.

With reference to the second aspect, an embodiment of the present invention provides a first possible implementation manner of the second aspect, where the first recognition module includes:

the first recognition unit is used for carrying out voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal;

the second recognition unit is used for carrying out tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal;

and the first determining unit is used for determining a word consisting of the pinyin information and the tone information as the first recognition word.

With reference to the second aspect, an embodiment of the present invention provides a second possible implementation manner of the second aspect, where the determining module includes:

the first obtaining unit is used for obtaining a plurality of dialect words corresponding to the calibration words, and the dialect words comprise dialect word pinyin and dialect word tone;

the second determining unit is used for determining at least one dialect word with the similarity exceeding a preset first similarity threshold value with the pinyin information as a candidate word according to the pinyin of the dialect words;

and the third determining unit is used for determining a reference word with the similarity degree with the tone information exceeding a preset second similarity threshold value as a target dialect word according to the tone of the dialect word of at least one candidate word.

With reference to the second aspect, an embodiment of the present invention provides a third possible implementation manner of the second aspect, where the second output module includes:

the second acquisition unit is used for acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;

a third obtaining unit, configured to obtain service prompt content to be output;

the searching unit is used for searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;

and the output unit is used for outputting the plurality of dialect words found, in the order in which the keywords are arranged in the service prompt content.

With reference to the second aspect, an embodiment of the present invention provides a fourth possible implementation manner of the second aspect, where the apparatus further includes:

the detection module is used for detecting whether a user is located in a preset service area;

the third output module is used for outputting prompt contents containing service items when a user is located in the preset service area so that the user can select the service items;

the second recognition module is used for recognizing a second voice signal after receiving the second voice signal collected by the voice collection equipment to obtain a second recognition word;

and the fourth output module is used for outputting prompt information containing a preset calibration word when the second recognition word matches none of the service keywords in a preset service word bank.

The embodiments of the invention have the following beneficial effects. Prompt information containing a preset calibration word is first output so that the user reads the calibration word aloud. After a first voice signal acquired by the voice acquisition device is received, the first voice signal is recognized to obtain a first recognition word. A dialect word whose similarity to the first recognition word exceeds a preset threshold is then determined, among the plurality of dialect words corresponding to the calibration word, as the target dialect word. Finally, service prompt content is output in the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt.

In this way, the dialect category used by the user can be identified from the first voice signal produced when the user reads the calibration word aloud, and the service prompt content is then output in that dialect category. The guiding machine thus both understands the dialect spoken by the client and responds in a dialect the client can understand, which improves the communication efficiency between the guiding machine and the client.

Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.

In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.

FIG. 1 is a flowchart of a service guiding method according to an embodiment of the present invention;

FIG. 2 is a flowchart of step S102 in FIG. 1;

FIG. 3 is a flowchart of step S103 in FIG. 1;

FIG. 4 is a flowchart of step S104 in FIG. 1;

FIG. 5 is another flowchart of a service guiding method according to an embodiment of the present invention;

FIG. 6 is a structural diagram of a service guiding apparatus according to an embodiment of the present invention.

Reference numerals: 11-first output module; 12-first recognition module; 13-determination module; 14-second output module.

Detailed Description

To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Existing guiding machines cannot accurately recognize the questions of clients who speak Mandarin poorly or cannot communicate in Mandarin, and therefore cannot provide high-quality guiding service based on those questions, so the efficiency of the guiding service is low. To address this, the service guiding method and device provided by the embodiments of the invention output prompt information containing a preset calibration word so that the user reads the calibration word aloud; recognize the first voice signal, after it is received from the voice acquisition device, to obtain a first recognition word; determine, among a plurality of dialect words corresponding to the calibration word, a dialect word whose similarity to the first recognition word exceeds a preset threshold as the target dialect word; and output service prompt content in the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt. In this way, the dialect category used by the user can be identified from the first voice signal produced when the user reads the calibration word aloud, and the service prompt content is then output in that dialect category. The guiding machine thus both understands the dialect spoken by the client and responds in a dialect the client can understand, which improves the communication efficiency between the guiding machine and the client.

To facilitate understanding of the present embodiment, the service guiding method disclosed in the embodiment of the present invention is first described in detail. The method can be applied to a guiding machine that includes a voice acquisition device, such as a microphone, and a voice output device, such as a speaker. As shown in FIG. 1, the method includes the following steps.

In step S101, a prompt message including a preset calibration word is output.

In the embodiment of the invention, a number of speakers of different dialects may be recruited in advance and each asked to read a set of words aloud. A preset number of the words whose pinyin and tone differ most across the dialects are then chosen as the preset calibration words. The calibration words are chosen this way so that the dialect category of a client can be identified more quickly.
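By way of illustration only, the selection of calibration words can be pictured as scoring each word by how many distinct readings it has across the recorded dialects and keeping the highest-scoring words. The embodiment does not state how the "largest difference" is measured, so the scoring rule below is an assumption, and the function name `select_calibration_words`, the dialect labels, and the sample readings are hypothetical.

```python
from typing import Dict, List, Tuple

# A reading is (pinyin syllables, tone numbers); tone 0 stands for the neutral tone.
Reading = Tuple[Tuple[str, ...], Tuple[int, ...]]


def select_calibration_words(
    readings: Dict[str, Dict[str, Reading]], count: int
) -> List[str]:
    """Return the `count` words whose readings differ most across dialects.

    `readings` maps word -> dialect label -> reading. The score (number of
    distinct readings) is an assumed stand-in for "largest difference between
    pinyin and tone"; the source does not define the measure.
    """
    scores = {
        word: len(set(by_dialect.values())) for word, by_dialect in readings.items()
    }
    # Highest score first; ties are broken alphabetically so the result is stable.
    ranked = sorted(scores, key=lambda w: (-scores[w], w))
    return ranked[:count]


# Hypothetical sample data, not taken from the embodiment.
sample = {
    "shoe": {
        "dialect_a": (("hai", "zi"), (2, 0)),
        "dialect_b": (("xie", "zi"), (4, 0)),
        "dialect_c": (("xie", "zi"), (2, 0)),
    },
    "water": {
        "dialect_a": (("shui",), (3,)),
        "dialect_b": (("shui",), (3,)),
        "dialect_c": (("fei",), (3,)),
    },
    "person": {
        "dialect_a": (("ren",), (2,)),
        "dialect_b": (("ren",), (2,)),
        "dialect_c": (("ren",), (2,)),
    },
}

print(select_calibration_words(sample, 2))  # ['shoe', 'water']
```

A word that every dialect reads the same way scores 1 and is the least useful for telling dialects apart, which is why it is ranked last.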

In this step, the prompt information may be output by displaying text on the display interface of the guiding machine and/or by voice through the voice output device.

Step S101 prompts the user to read the calibration word aloud, so that the voice acquisition device can collect the first voice signal, that is, the voice signal produced when the user reads the calibration word aloud.

In step S102, after receiving a first voice signal acquired by the voice acquisition device, the first voice signal is recognized to obtain a first recognized word.

In the embodiment of the present invention, the first voice signal collected by the voice acquisition device may be recognized by speech recognition technology to obtain the first recognition word. The first recognition word may include pinyin and tone; for example, for the word "shoe", the pinyin is "xiezi" and the tones are "second tone, neutral tone".

In step S103, a dialect word whose similarity to the first recognition word exceeds a preset threshold is determined as the target dialect word among the plurality of dialect words corresponding to the calibration word.

In the embodiment of the invention, the calibration words are determined from the readings of groups speaking different dialects. Each group's reading of a calibration word therefore yields the dialect version of that word, and each calibration word is stored together with its dialect words, giving a plurality of correspondences between calibration words and dialect words.

In this step, the plurality of dialect words corresponding to the current calibration word may be obtained from the pre-stored correspondences. The first recognition word is then compared with each dialect word in terms of pinyin and tone, and the proportion of pinyin and tone that the two have in common is taken as the similarity. For example, suppose the calibration word is "shoe" and the first recognition word is "hai zi" with the tones "second tone, neutral tone". If one dialect word corresponding to the calibration word is "hai zi" with "second tone, neutral tone", the matching part is "hai zi", "second tone, neutral tone", and the similarity is 100%. If another dialect word corresponding to the calibration word is "xie zi" with "fourth tone, neutral tone", the similarity is 75%.
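The comparison in this step can be sketched as a ratio of matching components. The embodiment does not specify the unit of comparison or how pinyin is weighted against tone; the sketch below assumes that each syllable's pinyin and each syllable's tone count as one component of equal weight. The function name `similarity` and the tone encoding (0 for the neutral tone) are hypothetical, and the second test word is an invented variant rather than an example from the text.

```python
def similarity(recognized, dialect_word):
    """Fraction of matching components between two (pinyin, tones) pairs.

    Each syllable's pinyin and each syllable's tone is one component.
    Equal weighting is an assumption; the source does not define it.
    """
    rec_pinyin, rec_tones = recognized
    dia_pinyin, dia_tones = dialect_word
    total = max(len(rec_pinyin), len(dia_pinyin)) + max(len(rec_tones), len(dia_tones))
    if total == 0:
        return 0.0
    pairs = list(zip(rec_pinyin, dia_pinyin)) + list(zip(rec_tones, dia_tones))
    matches = sum(1 for a, b in pairs if a == b)
    return matches / total


recognized = (("hai", "zi"), (2, 0))      # "hai zi", second tone + neutral tone
same_reading = (("hai", "zi"), (2, 0))    # identical dialect word
tone_variant = (("hai", "zi"), (4, 0))    # hypothetical word differing in one tone

print(similarity(recognized, same_reading))  # 1.0
print(similarity(recognized, tone_variant))  # 0.75
```

Under this assumed granularity a two-syllable word has four components, so a single mismatch lowers the similarity by 25 percentage points.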

In step S104, service prompt content is output by using the dialect category corresponding to the target dialect word, so that the user selects a service item according to the service prompt.

In the embodiment of the present invention, voice prompt signals in the various dialect categories may be stored in advance for each item of service prompt content. For example, if an item of service prompt content is "please select a service item", voice prompt signals for it may be stored in a Shaanxi dialect version, a Henan dialect version, a Shenyang dialect version, and so on.

Assuming that the dialect category corresponding to the target dialect word is the Shaanxi dialect, the service prompt content can be output by using the Shaanxi dialect.
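A minimal sketch of such pre-stored prompts is a two-level lookup from prompt to dialect category to recorded clip. The table name `PROMPT_AUDIO`, the file paths, the label `dialect_b`, and the fallback to a Mandarin clip when no dialect version exists are all assumptions made for illustration; the embodiment only states that dialect versions of each prompt may be stored in advance.

```python
# Hypothetical store: prompt id -> dialect category -> path of a pre-recorded clip.
PROMPT_AUDIO = {
    "select_service": {
        "mandarin": "prompts/select_service.mandarin.wav",
        "shaanxi": "prompts/select_service.shaanxi.wav",
        "dialect_b": "prompts/select_service.dialect_b.wav",
    }
}


def prompt_audio_for(prompt_id: str, dialect: str, fallback: str = "mandarin") -> str:
    """Return the clip recorded in `dialect`, or the fallback clip if none is stored.

    The fallback behaviour is an assumption, not part of the described method.
    """
    versions = PROMPT_AUDIO[prompt_id]
    return versions.get(dialect, versions[fallback])


print(prompt_audio_for("select_service", "shaanxi"))
print(prompt_audio_for("select_service", "unknown_dialect"))
```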

In this embodiment, prompt information containing a preset calibration word is first output so that the user reads the calibration word aloud. After a first voice signal acquired by the voice acquisition device is received, the first voice signal is recognized to obtain a first recognition word. A dialect word whose similarity to the first recognition word exceeds a preset threshold is then determined, among the plurality of dialect words corresponding to the calibration word, as the target dialect word. Finally, service prompt content is output in the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt.

Because dialects differ from Mandarin mostly in pinyin and tone, in another embodiment of the present invention, as shown in FIG. 2, step S102 includes the following steps.

In step S1021, performing speech recognition on the first speech signal to obtain pinyin information corresponding to the first speech signal.

In step S1022, tone recognition is performed on the first voice signal, so as to obtain tone information corresponding to the first voice signal.

In step S1023, a word composed of the pinyin information and the tone information is determined as the first recognized word.
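Steps S1021 to S1023 can be pictured as combining two recognizer outputs into one record. The sketch below covers only that combination; the speech and tone recognizers themselves are outside its scope. The class name `RecognizedWord`, the helper `build_recognized_word`, and the encoding of the neutral tone as 0 are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class RecognizedWord:
    pinyin: Tuple[str, ...]  # one entry per syllable
    tones: Tuple[int, ...]   # one entry per syllable; 0 = neutral tone (assumed encoding)


def build_recognized_word(pinyin: Sequence[str], tones: Sequence[int]) -> RecognizedWord:
    """Combine pinyin information (S1021) and tone information (S1022) into one word (S1023)."""
    if len(pinyin) != len(tones):
        raise ValueError("each syllable needs exactly one tone")
    return RecognizedWord(tuple(pinyin), tuple(tones))


# The "shoe" example from the description: "xie zi", second tone + neutral tone.
word = build_recognized_word(["xie", "zi"], [2, 0])
print(word)
```

Keeping pinyin and tone as separate fields lets the later matching step compare them independently against different thresholds.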

On the basis of the foregoing embodiment, in another embodiment of the present invention, as shown in fig. 3, the step S103 includes the following steps.

In step S1031, a plurality of dialect words corresponding to the calibration word are obtained, where the dialect words include dialect word pinyin and dialect word tone.

In step S1032, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold is determined as a candidate word according to the pinyin of the dialect words.

When there is no dialect word whose similarity exceeds the preset first similarity threshold, step S101 may be executed again to obtain the first recognized word again, until a dialect word whose similarity exceeds the preset first similarity threshold is found.

In step S1033, according to the tone of the dialect word of at least one of the candidate words, a reference word whose similarity to the tone information exceeds a preset second similarity threshold is determined as the target dialect word.

When there is no dialect word with the similarity exceeding the preset second similarity threshold, step S101 may be executed again to obtain the first recognized word again until the dialect word with the similarity exceeding the preset second similarity threshold is found.

In practical applications, the tone information may instead be compared first and the pinyin information second; the order of comparison can be set according to actual needs.
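The two-stage matching of steps S1031 to S1033 can be sketched as a pinyin filter followed by a tone filter. The per-syllable ratio used as the similarity measure, the function names `ratio_equal` and `find_target_dialect`, the dialect labels, and the threshold values are assumptions; a return value of `None` stands for the case in which step S101 is executed again.

```python
def ratio_equal(a, b):
    """Fraction of positions at which two sequences agree (assumed similarity measure)."""
    length = max(len(a), len(b))
    if length == 0:
        return 0.0
    return sum(1 for x, y in zip(a, b) if x == y) / length


def find_target_dialect(recognized, dialect_words, pinyin_threshold, tone_threshold):
    """Return the label of the first dialect word passing both thresholds, else None.

    `dialect_words` maps a dialect label to a (pinyin, tones) pair.
    """
    rec_pinyin, rec_tones = recognized
    # S1032: keep the words whose pinyin similarity exceeds the first threshold.
    candidates = {
        label: word
        for label, word in dialect_words.items()
        if ratio_equal(rec_pinyin, word[0]) > pinyin_threshold
    }
    # S1033: among the candidates, pick one whose tone similarity exceeds the second threshold.
    for label, (_, tones) in candidates.items():
        if ratio_equal(rec_tones, tones) > tone_threshold:
            return label
    return None  # no match: the calibration prompt would be repeated


# Hypothetical dialect words for the calibration word "shoe".
dialect_words = {
    "dialect_a": (("hai", "zi"), (2, 0)),
    "dialect_b": (("xie", "zi"), (4, 0)),
    "dialect_c": (("hai", "zi"), (4, 0)),
}

print(find_target_dialect((("hai", "zi"), (2, 0)), dialect_words, 0.9, 0.9))  # dialect_a
print(find_target_dialect((("lu", "zi"), (1, 0)), dialect_words, 0.9, 0.9))   # None
```

Swapping the two filters gives the tone-first order; with the same thresholds the set of words that pass both filters is unchanged.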

In still another embodiment of the present invention, as shown in fig. 4, the step S104 includes the following steps.

In step S1041, the dialect category corresponding to the target dialect word and the dialect word set corresponding to that dialect category are obtained, where the dialect word set includes a plurality of dialect words.

In step S1042, the service prompt content to be output is obtained.

In step S1043, the dialect word corresponding to each keyword of the service prompt content is searched for in the dialect word set.

The service prompt content may first be segmented according to a preset word segmentation rule. For example, prepositions in the service prompt content, such as "yes" and/or "on", may first be removed. The content is then segmented from its first character, initially in units of two characters: the two-character word that is cut out is compared with the words in a preset word library; if the same word exists there, segmentation continues with the next word; if it does not, a three-character word is cut out instead and compared with the words in the preset word library, and so on, until the whole of the service prompt content has been segmented.

In step S1044, the found plurality of dialect words are output according to the arrangement order of the keywords in the service prompt content.

In this step, after the dialect words have been arranged in the order of the keywords in the service prompt content, the service prompt content may be output, for example as a voice signal.
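Steps S1043 and S1044 can be sketched as a segmentation pass followed by a word-by-word lookup. The sketch follows the rule described above (try two characters, then three, and so on), but several details are assumptions: the maximum word length, emitting a single character when no library word starts at the current position, and keeping the original keyword when the dialect word set has no entry for it. The Chinese strings, the library contents, and the placeholder dialect words are illustrative and are not taken from the embodiment.

```python
def segment(text, lexicon, stop_words=(), max_len=4):
    """Cut `text` into words, trying 2 characters first, then 3, up to `max_len`."""
    chars = [ch for ch in text if ch not in stop_words]  # drop single-character stop words
    words, i = [], 0
    while i < len(chars):
        for size in range(2, max_len + 1):
            piece = "".join(chars[i:i + size])
            if len(piece) == size and piece in lexicon:
                words.append(piece)
                i += size
                break
        else:
            # Assumption: no library word starts here, so emit one character and move on.
            words.append(chars[i])
            i += 1
    return words


def to_dialect(words, dialect_word_set):
    """Replace each keyword with its dialect word, preserving the original order."""
    return [dialect_word_set.get(word, word) for word in words]


LEXICON = {"选择", "服务", "项目"}  # hypothetical preset word library

keywords = segment("请选择服务项目", LEXICON)
print(keywords)  # ['请', '选择', '服务', '项目']

# Placeholder dialect words; a real set would hold recorded dialect pronunciations.
print(to_dialect(keywords, {"选择": "DIALECT_SELECT", "服务": "DIALECT_SERVICE"}))
```

Because the lookup walks the keyword list from left to right, the dialect words come out in the same order as the keywords in the service prompt content.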

In a further embodiment of the invention, as shown in fig. 5, the method further comprises the following steps.

In step S201, it is detected whether a user is located in a preset service area.

In an embodiment of the present invention, the preset service area may be an area within a preset distance of the guiding machine, for example within 0.5 m of the front of the guiding machine (the side with the display screen).

In step S202, when a user is located in the preset service area, a prompt content including a service item is output, so that the user can select the service item.

In this step, the prompt content or the like may be output by displaying the prompt content on the display screen or by voice playing.

In step S203, after receiving the second voice signal acquired by the voice acquisition device, recognizing the second voice signal to obtain a second recognized word.

This step is similar to the processing in step S102, and is not described here again.

In step S204, when the second recognized word matches none of the service keywords in the preset service lexicon, prompt information including a preset calibration word is output.

This step distinguishes users who speak Mandarin from users who speak a dialect: if the user speaks Mandarin, the second recognized word will match at least one service keyword; if the user speaks a dialect, the second recognized word will match none of the service keywords.
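The screening flow of steps S201 to S204 can be sketched as a small decision function. The function name `next_action`, the action labels, the use of a measured distance as the presence check, and the sample keywords are assumptions; the 0.5 m radius is the example value given in the description.

```python
from typing import Optional, Set


def next_action(
    distance_m: float,
    second_recognized_word: Optional[str],
    service_keywords: Set[str],
    radius_m: float = 0.5,
) -> str:
    """Decide what the guiding machine does next in steps S201 to S204."""
    if distance_m > radius_m:
        return "wait"                      # S201: nobody is in the preset service area
    if second_recognized_word is None:
        return "prompt_service_items"      # S202: ask the user to select a service item
    if second_recognized_word in service_keywords:
        return "handle_service"            # a keyword matched: treated as Mandarin
    return "prompt_calibration_word"       # S204: no keyword matched, start calibration


keywords = {"payment", "inquiry"}  # hypothetical service keywords

print(next_action(1.2, None, keywords))        # wait
print(next_action(0.4, None, keywords))        # prompt_service_items
print(next_action(0.4, "payment", keywords))   # handle_service
print(next_action(0.4, "unknown", keywords))   # prompt_calibration_word
```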

In still another embodiment of the present invention, as shown in fig. 6, a service guide apparatus includes: a first output module 11, a first recognition module 12, a determination module 13 and a second output module 14;

the first output module 11 is configured to output prompt information including a preset calibration word, so that a user reads the calibration word aloud;

the first recognition module 12 is configured to, after receiving a first voice signal collected by the voice collection device, recognize the first voice signal to obtain a first recognition word;

the determining module 13 is configured to determine, among the plurality of dialect words corresponding to the calibration word, a dialect word with a similarity exceeding a preset threshold with the first recognition word as a target dialect word;

and a second output module 14, configured to output service prompt content by using the dialect category corresponding to the target dialect term, so that the user selects a service item according to the service prompt.
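One way to picture how the four modules cooperate is a small class that holds each module as a callable and runs them in sequence. This is a sketch under stated assumptions, not the structure of the apparatus itself: the class name `ServiceGuideApparatus`, the method `guide`, and the stub modules are hypothetical, and for brevity the determination module is assumed to return the dialect category directly rather than the target dialect word.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class ServiceGuideApparatus:
    first_output: Callable[[], None]                 # module 11: output the calibration prompt
    first_recognition: Callable[[bytes], tuple]      # module 12: voice signal -> first recognition word
    determination: Callable[[tuple], Optional[str]]  # module 13: word -> dialect category, or None
    second_output: Callable[[str], str]              # module 14: dialect category -> service prompt

    def guide(self, voice_signal: bytes) -> Optional[str]:
        """Run the modules in order; None means no dialect word passed the threshold."""
        self.first_output()
        word = self.first_recognition(voice_signal)
        dialect = self.determination(word)
        if dialect is None:
            return None
        return self.second_output(dialect)


# Stub modules standing in for real recognizers and audio output.
log = []
apparatus = ServiceGuideApparatus(
    first_output=lambda: log.append("calibration prompt"),
    first_recognition=lambda signal: (("hai", "zi"), (2, 0)),
    determination=lambda word: "dialect_a" if word[0][0] == "hai" else None,
    second_output=lambda dialect: f"service prompt in {dialect}",
)

result = apparatus.guide(b"")
print(result)  # service prompt in dialect_a
```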

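In another embodiment of the present invention, the first recognition module includes the first recognition unit, the second recognition unit and the first determining unit described above for the first possible implementation manner of the second aspect.

In another embodiment of the present invention, the determining module includes the first obtaining unit, the second determining unit and the third determining unit described above for the second possible implementation manner of the second aspect.

In yet another embodiment of the present invention, the second output module includes the second acquisition unit, the third obtaining unit, the searching unit and the output unit described above for the third possible implementation manner of the second aspect.

In yet another embodiment of the present invention, the apparatus further includes the detection module, the third output module, the second recognition module and the fourth output module described above for the fourth possible implementation manner of the second aspect.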

The computer program product of the service guiding method and device provided by the embodiment of the present invention includes a computer readable storage medium storing a program code, where instructions included in the program code may be used to execute the method described in the foregoing method embodiment, and specific implementation may refer to the method embodiment, which is not described herein again.

It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the system and the apparatus described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.

In addition, in the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "coupled," and "connected" are to be construed broadly: a connection may be fixed, removable, or integral; it may be mechanical or electrical; and it may be direct, indirect through an intervening medium, or an internal communication between two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific case.

The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.

In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on the orientations or positional relationships shown in the drawings, and are only for convenience of description and simplicity of description, but do not indicate or imply that the device or element being referred to must have a particular orientation, be constructed and operated in a particular orientation, and thus, should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.

Finally, it should be noted that the above embodiments are only specific embodiments of the present invention, used to illustrate its technical solutions rather than to limit them, and the protection scope of the present invention is not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that, within the technical scope disclosed herein, anyone skilled in the art can still modify the technical solutions described in the foregoing embodiments, readily conceive of changes to them, or substitute equivalents for some of their technical features. Such modifications, changes or substitutions do not depart from the spirit and scope of the embodiments of the present invention and should be construed as being included therein. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims

1. A service guide method, applied to a guide machine comprising a voice acquisition device and a voice output device, the method comprising the following steps:

outputting service prompt content by utilizing the dialect category corresponding to the target dialect word so as to enable the user to select a service item according to the service prompt;

wherein recognizing the first voice signal to obtain the first recognition word comprises: performing voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal; performing tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal; and determining a word composed of the pinyin information and the tone information as the first recognition word;

wherein determining, among the plurality of dialect words corresponding to the calibration words, the dialect word whose similarity to the first recognition word exceeds a preset threshold as the target dialect word comprises: acquiring the plurality of dialect words corresponding to the calibration words, wherein each dialect word comprises a dialect word pinyin and a dialect word tone; determining, according to the dialect word pinyin, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold as a candidate word; and determining, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity to the tone information exceeds a preset second similarity threshold as the target dialect word.
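The two-stage matching recited in claim 1 (pinyin filtering followed by tone filtering) can be illustrated with a minimal sketch. It is not the patented implementation: the claim does not specify how similarity is computed, so the `difflib.SequenceMatcher` ratio for pinyin, the position-wise tone comparison, the threshold values, the `DialectWord` type, and all sample data are assumptions made for illustration. Where several candidates pass the second threshold, the sketch returns the best-scoring one, which is also an assumption.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher
from typing import List, Optional, Sequence, Tuple


@dataclass(frozen=True)
class DialectWord:
    """One dialect rendering of a calibration word (hypothetical structure)."""
    dialect: str             # dialect category label
    pinyin: str              # toneless pinyin of the dialect pronunciation
    tones: Tuple[int, ...]   # one tone number per syllable


def pinyin_similarity(a: str, b: str) -> float:
    """String similarity in [0, 1]; SequenceMatcher is a stand-in measure."""
    return SequenceMatcher(None, a, b).ratio()


def tone_similarity(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of syllable positions whose tone numbers agree."""
    if not a or not b:
        return 0.0
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / max(len(a), len(b))


def find_target_dialect_word(
    pinyin: str,
    tones: Sequence[int],
    dialect_words: List[DialectWord],
    first_threshold: float = 0.8,   # assumed value
    second_threshold: float = 0.5,  # assumed value
) -> Optional[DialectWord]:
    # Stage 1: keep dialect words whose pinyin similarity exceeds the first threshold.
    candidates = [
        w for w in dialect_words
        if pinyin_similarity(pinyin, w.pinyin) > first_threshold
    ]
    # Stage 2: among the candidates, keep the one whose tone similarity
    # exceeds the second threshold by the widest margin.
    best: Optional[DialectWord] = None
    best_score = second_threshold
    for w in candidates:
        score = tone_similarity(tones, w.tones)
        if score > best_score:
            best, best_score = w, score
    return best


# Hypothetical dialect renderings of one calibration word.
words = [
    DialectWord("dialect_a", "nihao", (3, 3)),
    DialectWord("dialect_b", "nihao", (2, 4)),
    DialectWord("dialect_c", "lihou", (3, 3)),
]

# First recognition word: pinyin information plus tone information.
target = find_target_dialect_word("nihao", (2, 4), words)
print(target.dialect if target else None)  # prints "dialect_b"
```

Filtering on pinyin first keeps the tone comparison limited to words that already sound alike, which matches the order of the steps in the claim. A return value of `None` corresponds to the case in which no dialect word passes both thresholds.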

2. The service guiding method according to claim 1, wherein outputting the service prompt content using the dialect category corresponding to the target dialect term comprises:

acquiring service prompt content to be output;

3. The service guiding method according to claim 2, further comprising:

detecting whether a user is located in a preset service area;

4. A service guide apparatus, characterized in that the apparatus comprises:

the second output module is used for outputting service prompt contents by utilizing the dialect category corresponding to the target dialect word so as to enable the user to select service items according to the service prompt;

the identification module comprises: a first recognition unit, configured to perform voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal; a second recognition unit, configured to perform tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal; and a first determining unit, configured to determine a word composed of the pinyin information and the tone information as the first recognition word;

the determining module comprises: a first obtaining unit, configured to obtain a plurality of dialect words corresponding to the calibration words, wherein each dialect word comprises a dialect word pinyin and a dialect word tone; a second determining unit, configured to determine, according to the dialect word pinyin, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold as a candidate word; and a third determining unit, configured to determine, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity to the tone information exceeds a preset second similarity threshold as the target dialect word.

5. The service guide apparatus of claim 4, wherein the second output module comprises:

6. The service guide apparatus of claim 5, wherein the apparatus further comprises: