CN107393530B - Service guiding method and device - Google Patents

Service guiding method and device

Info

Publication number
CN107393530B
Authority
CN
China
Prior art keywords
dialect
word
service
words
voice signal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201710579589.9A
Other languages
Chinese (zh)
Other versions
CN107393530A (en)
Inventor
许传祺
王静
杨占孟
宋豪
王仁让
吕潇
吕媛
李刚
王辉
李航
王玉华
薛真
焦学华
矫宏
孙文豪
张茂伟
张伟
韩博文
陈里军
胡耀文
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC and Qingdao Power Supply Co of State Grid Shandong Electric Power Co Ltd
Priority to CN201710579589.9A
Publication of CN107393530A
Application granted
Publication of CN107393530B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/005 Language recognition
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/10 Speech classification or search using distance or distortion measures between unknown speech and reference templates
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/221 Announcement of recognition results
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225 Feedback of the input speech

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Machine Translation (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a service guiding method and device, relating to the technical field of guiding machines. The method comprises: outputting prompt information containing a preset calibration word so that a user reads the calibration word aloud; after receiving a first voice signal acquired by the voice acquisition device, recognizing the first voice signal to obtain a first recognition word; determining, among a plurality of dialect words corresponding to the calibration word, the dialect word whose similarity to the first recognition word exceeds a preset threshold as the target dialect word; and outputting service prompt content in the dialect category corresponding to the target dialect word, so that the user can select a service item according to the service prompt. This addresses the technical problem that existing guiding machines cannot provide guiding service to clients who speak a dialect: the guiding machine can "understand" the dialect spoken by the client and respond in a dialect the client understands, which improves communication efficiency between the guiding machine and the client.

Description

Service guiding method and device
Technical Field
The invention relates to the technical field of guiding machines, and in particular to a service guiding method and a service guiding device.
Background
At present, during peak payment periods in electric power business halls, clients tend to crowd in at the same time: payment queues are long, working and service efficiency is low, and clients are inconvenienced. On top of standardized service, most electric power business halls therefore actively offer more personal, situation-specific service and have introduced "guiding machines". During business hours each day, several guiding machines greet clients, direct them to the appropriate areas for different types of business, answer their inquiries, and guide and help them to use self-service equipment.
However, when a user speaks Mandarin poorly and cannot communicate in Mandarin, the guiding machine cannot accurately recognize the user's query, let alone provide high-quality guiding service based on it. A great deal of time is wasted in the exchange between the user and the guiding machine, which reduces the efficiency of the guiding service.
Disclosure of Invention
In view of the above, an object of the present invention is to provide a service guiding method and a service guiding device, so as to alleviate the technical problem that prior-art guiding machines cannot efficiently provide guiding service to clients who speak with a heavy dialect.
In a first aspect, an embodiment of the present invention provides a service guiding method, which is applied to a guiding machine including a voice acquisition device and a voice output device, and the method includes:
outputting prompt information containing a preset calibration word so that a user reads the calibration word aloud;
after receiving a first voice signal acquired by the voice acquisition device, recognizing the first voice signal to obtain a first recognition word;
determining, among a plurality of dialect words corresponding to the calibration word, a dialect word whose similarity to the first recognition word exceeds a preset threshold as a target dialect word;
and outputting service prompt content by using the dialect category corresponding to the target dialect word so that the user can select a service item according to the service prompt.
With reference to the first aspect, an embodiment of the present invention provides a first possible implementation manner of the first aspect, where the recognizing the first speech signal to obtain a first recognized word includes:
performing voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal;
performing tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal;
and determining a word formed by the pinyin information and the tone information as the first recognition word.
With reference to the first aspect, an embodiment of the present invention provides a second possible implementation manner of the first aspect, where the determining, among a plurality of dialect words corresponding to the calibration word, a dialect word whose similarity with the first recognition word exceeds a preset threshold as a target dialect word includes:
acquiring a plurality of dialect words corresponding to the calibration words, wherein the dialect words comprise dialect word pinyin and dialect word tone;
determining, according to the dialect word pinyin, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold as a candidate word;
and determining, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity to the tone information exceeds a preset second similarity threshold as the target dialect word.
With reference to the first aspect, an embodiment of the present invention provides a third possible implementation manner of the first aspect, where the outputting service prompt content by using the dialect category corresponding to the target dialect word includes:
acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;
acquiring service prompt content to be output;
searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;
and outputting the found dialect words according to the order of the keywords in the service prompt content.
With reference to the first aspect, an embodiment of the present invention provides a fourth possible implementation manner of the first aspect, where the method further includes:
detecting whether a user is located in a preset service area;
when a user is located in the preset service area, outputting prompt content containing service items so that the user can select the service items;
after receiving a second voice signal acquired by the voice acquisition equipment, recognizing the second voice signal to obtain a second recognition word;
and when the second recognition word is different from any service keyword in a preset service word bank, outputting prompt information containing a preset calibration word.
In a second aspect, an embodiment of the present invention further provides a service guiding apparatus, where the apparatus includes:
the first output module is used for outputting prompt information containing preset calibration words so that a user can read the calibration words aloud;
the first recognition module is used for recognizing a first voice signal after receiving the first voice signal collected by the voice collection equipment to obtain a first recognition word;
the determining module is used for determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words;
and the second output module is used for outputting service prompt contents by using the dialect category corresponding to the target dialect word so as to enable the user to select service items according to the service prompt.
With reference to the second aspect, an embodiment of the present invention provides a first possible implementation manner of the second aspect, where the first recognition module includes:
the first recognition unit is used for carrying out voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal;
the second recognition unit is used for carrying out tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal;
and the first determining unit is used for determining a word consisting of the pinyin information and the tone information as the first recognition word.
With reference to the second aspect, an embodiment of the present invention provides a second possible implementation manner of the second aspect, where the determining module includes:
the first obtaining unit is used for obtaining a plurality of dialect words corresponding to the calibration words, and the dialect words comprise dialect word pinyin and dialect word tone;
the second determining unit is used for determining, according to the dialect word pinyin, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold as a candidate word;
and the third determining unit is used for determining, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity to the tone information exceeds a preset second similarity threshold as the target dialect word.
With reference to the second aspect, an embodiment of the present invention provides a third possible implementation manner of the second aspect, where the second output module includes:
the second acquisition unit is used for acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;
a third obtaining unit, configured to obtain service prompt content to be output;
the searching unit is used for searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;
and the output unit is used for outputting the found dialect words according to the order of the keywords in the service prompt content.
With reference to the second aspect, an embodiment of the present invention provides a fourth possible implementation manner of the second aspect, where the apparatus further includes:
the detection module is used for detecting whether a user is located in a preset service area;
the third output module is used for outputting prompt contents containing service items when a user is located in the preset service area so that the user can select the service items;
the second recognition module is used for recognizing a second voice signal after receiving the second voice signal collected by the voice collection equipment to obtain a second recognition word;
and the fourth output module is used for outputting prompt information containing a preset calibration word when the second recognition word is different from any service keyword in a preset service word bank.
The embodiment of the invention has the following beneficial effects. Prompt information containing a preset calibration word is first output so that the user reads the calibration word aloud. After a first voice signal acquired by the voice acquisition device is received, it is recognized to obtain a first recognition word. Among the plurality of dialect words corresponding to the calibration word, the dialect word whose similarity to the first recognition word exceeds a preset threshold is then determined as the target dialect word. Finally, service prompt content is output in the dialect category corresponding to the target dialect word, so that the user can select a service item according to the service prompt.
In this way, the dialect category used by the user can be identified from the first voice signal produced when the user reads the calibration word aloud, and the service prompt content is then output in that dialect category. The guiding machine both understands the dialect spoken by the client and answers in a dialect the client can understand, which improves communication efficiency between the guiding machine and the client.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
In order to make the aforementioned and other objects, features and advantages of the present invention comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a flowchart of a service guiding method according to an embodiment of the present invention;
Fig. 2 is a flowchart of step S102 in Fig. 1;
Fig. 3 is a flowchart of step S103 in Fig. 1;
Fig. 4 is a flowchart of step S104 in Fig. 1;
Fig. 5 is another flowchart of a service guiding method according to an embodiment of the present invention;
Fig. 6 is a structural diagram of a service guiding apparatus according to an embodiment of the present invention.
Reference numerals: 11 - first output module; 12 - first recognition module; 13 - determining module; 14 - second output module.
Detailed Description
To make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions of the present invention will be clearly and completely described below with reference to the accompanying drawings, and it is apparent that the described embodiments are some, but not all embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Existing guiding machines cannot accurately recognize the queries of clients who speak Mandarin poorly and cannot communicate in Mandarin, and therefore cannot provide high-quality guiding service based on those queries, so guiding service efficiency is low. To address this, the service guiding method and device provided by the embodiments of the invention output prompt information containing a preset calibration word so that the user reads the calibration word aloud; recognize the first voice signal acquired by the voice acquisition device to obtain a first recognition word; determine, among a plurality of dialect words corresponding to the calibration word, the dialect word whose similarity to the first recognition word exceeds a preset threshold as the target dialect word; and output service prompt content in the dialect category corresponding to the target dialect word, so that the user can select a service item according to the service prompt. The dialect category used by the user can thus be identified from the first voice signal produced when the user reads the calibration word aloud, and the service prompt content is output in that dialect category. The guiding machine both understands the dialect spoken by the client and answers in a dialect the client can understand, which improves communication efficiency between the guiding machine and the client.
To facilitate understanding of the present embodiment, a service guiding method disclosed in the embodiment of the present invention is first described in detail, where the service guiding method can be applied to a guiding machine including a voice collecting device and a voice outputting device, where the voice collecting device may refer to a microphone and the like, and the voice outputting device may refer to a speaker and the like, and as shown in fig. 1, the method includes the following steps.
In step S101, a prompt message including a preset calibration word is output.
In the embodiment of the invention, a number of speakers of different dialects can be recruited in advance and each asked to read a set of words aloud. A preset number of words whose pinyin and tone differ most across the dialects are then chosen as the preset calibration words. Calibration words are chosen this way so that the client's dialect category can be distinguished more quickly.
In this step, the prompt information may be output as text on the display interface of the guiding machine and/or as speech through the voice output device.
Through step S101, the user is prompted to read the calibration word aloud, so that the voice acquisition device can collect the first voice signal, that is, the voice signal the user produces when reading the calibration word aloud.
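The selection of calibration words described for step S101 can be sketched as follows. This is only an illustration under stated assumptions: each reading is stored as pinyin syllables plus tone numbers (0 standing for the neutral tone), "difference" is taken to be the mean pairwise fraction of differing elements, and the function names and sample readings are hypothetical rather than taken from the patent.

```python
from itertools import combinations

# A reading is (pinyin syllables, tone numbers); 0 stands for the neutral tone.
Reading = tuple[tuple[str, ...], tuple[int, ...]]


def difference(a: Reading, b: Reading) -> float:
    """Fraction of pinyin/tone elements that differ between two readings."""
    elems_a = a[0] + a[1]
    elems_b = b[0] + b[1]
    length = max(len(elems_a), len(elems_b))
    same = sum(x == y for x, y in zip(elems_a, elems_b))
    return 1.0 - same / length


def spread(readings: list[Reading]) -> float:
    """Mean pairwise difference among the dialect readings of one word."""
    pairs = list(combinations(readings, 2))
    if not pairs:  # fewer than two readings: nothing to compare
        return 0.0
    return sum(difference(a, b) for a, b in pairs) / len(pairs)


def pick_calibration_words(samples: dict[str, list[Reading]], count: int) -> list[str]:
    """Return the `count` words whose dialect readings differ the most."""
    return sorted(samples, key=lambda w: spread(samples[w]), reverse=True)[:count]


# Hypothetical sample data: two candidate words, each read by three speakers.
samples = {
    "shoe": [(("xie", "zi"), (2, 0)), (("hai", "zi"), (2, 0)), (("hai", "zi"), (4, 0))],
    "water": [(("shui",), (3,)), (("shui",), (3,)), (("shui",), (3,))],
}
calibration_words = pick_calibration_words(samples, 1)
```

With this sample data the word read identically by every speaker scores zero and is never chosen, which matches the stated goal of picking words that separate dialects quickly.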
In step S102, after receiving a first voice signal acquired by the voice acquisition device, the first voice signal is recognized to obtain a first recognized word.
In the embodiment of the present invention, the first voice signal collected by the voice acquisition device may be recognized by speech recognition technology to obtain the first recognition word. The first recognition word may include pinyin and tone; for example, for the word "shoe" the pinyin is "xiezi" and the tones are "second tone, neutral tone".
In step S103, a dialect word having a similarity exceeding a preset threshold with the first recognition word is determined as a target dialect word among the dialects corresponding to the calibration word.
In the embodiment of the invention, the calibration words are chosen after groups speaking different dialects have read them aloud, so each group's reading yields a dialect version of each calibration word (a dialect word). Each calibration word is stored together with its dialect words, giving a plurality of correspondences between calibration words and dialect words.
In this step, the plurality of dialect words corresponding to the current calibration word may be obtained from the pre-stored correspondences. The first recognition word is then compared with each dialect word in pinyin and tone, and the proportion of pinyin and tone elements that are identical is taken as the similarity. For example, suppose the calibration word is "shoe" and the first recognition word is "hai zi" with tones "second tone, neutral tone". If one dialect word corresponding to the calibration word is "hai zi" with tones "second tone, neutral tone", the identical part is "hai zi", "second tone, neutral tone", and the similarity is 100%. If another dialect word corresponding to the calibration word is "xie zi" with tones "fourth tone, neutral tone", the similarity is 75%.
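One way to read the "same proportion" similarity of step S103 is as the share of matching syllables and tones. The sketch below assumes that reading; the element-by-element counting, the dialect labels, the second dialect entry and the 0.8 threshold are all hypothetical choices made for illustration.

```python
from typing import Optional

# A word is (pinyin syllables, tone numbers); 0 stands for the neutral tone.
Word = tuple[tuple[str, ...], tuple[int, ...]]


def similarity(a: Word, b: Word) -> float:
    """Share of pinyin syllables and tones that are identical in both words."""
    pinyin_a, tones_a = a
    pinyin_b, tones_b = b
    total = max(len(pinyin_a), len(pinyin_b)) + max(len(tones_a), len(tones_b))
    if total == 0:
        return 0.0
    same = sum(x == y for x, y in zip(pinyin_a, pinyin_b))
    same += sum(x == y for x, y in zip(tones_a, tones_b))
    return same / total


def best_dialect(recognized: Word, dialect_words: dict[str, Word],
                 threshold: float) -> Optional[str]:
    """Return the dialect whose word is most similar, if it exceeds the threshold."""
    if not dialect_words:
        return None
    score, name = max((similarity(recognized, w), n) for n, w in dialect_words.items())
    return name if score > threshold else None


recognized = (("hai", "zi"), (2, 0))
dialect_words = {
    "dialect_a": (("hai", "zi"), (2, 0)),  # identical reading
    "dialect_b": (("hai", "zi"), (4, 0)),  # hypothetical entry: first tone differs
}
target = best_dialect(recognized, dialect_words, threshold=0.8)
```

Under this counting an identical reading scores 1.0, and a reading that differs in one of the four elements scores 0.75.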
In step S104, service prompt content is output by using the dialect category corresponding to the target dialect word, so that the user selects a service item according to the service prompt.
In the embodiment of the present invention, voice prompt signals in various dialect categories may be stored in advance for each item of service prompt content. For example, if a certain service prompt content is "please select a service item", voice prompt signals in a Shaanxi dialect version, a Henan dialect version, a Shenyang dialect version, and so on may be stored for it.
Assuming that the dialect category corresponding to the target dialect word is the Shaanxi dialect, the service prompt content can be output by using the Shaanxi dialect.
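Pre-stored prompts of this kind could be kept in a simple lookup table keyed by prompt and dialect category. The following is a minimal sketch; the prompt identifier, file paths and the fallback to a Mandarin recording are assumptions for illustration and are not specified in the text.

```python
# Hypothetical table: (prompt id, dialect category) -> stored voice prompt file.
PROMPT_AUDIO = {
    ("select_service", "mandarin"): "prompts/select_service.mandarin.wav",
    ("select_service", "shaanxi"): "prompts/select_service.shaanxi.wav",
    ("select_service", "henan"): "prompts/select_service.henan.wav",
}


def prompt_audio_for(prompt_id: str, dialect: str, default: str = "mandarin") -> str:
    """Return the stored recording for a prompt in the requested dialect."""
    key = (prompt_id, dialect)
    if key in PROMPT_AUDIO:
        return PROMPT_AUDIO[key]
    # Assumption: fall back to the default recording when no dialect version exists.
    return PROMPT_AUDIO[(prompt_id, default)]


audio = prompt_audio_for("select_service", "shaanxi")
```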
The embodiment of the invention first outputs prompt information containing a preset calibration word so that the user reads the calibration word aloud. After a first voice signal acquired by the voice acquisition device is received, it is recognized to obtain a first recognition word. Among the plurality of dialect words corresponding to the calibration word, the dialect word whose similarity to the first recognition word exceeds a preset threshold is then determined as the target dialect word. Finally, service prompt content is output in the dialect category corresponding to the target dialect word, so that the user can select a service item according to the service prompt.
In this way, the dialect category used by the user can be identified from the first voice signal produced when the user reads the calibration word aloud, and the service prompt content is then output in that dialect category. The guiding machine both understands the dialect spoken by the client and answers in a dialect the client can understand, which improves communication efficiency between the guiding machine and the client.
Since dialects are mostly different in pinyin and tone from mandarin, in another embodiment of the present invention, as shown in fig. 2, the step S102 includes the following steps.
In step S1021, performing speech recognition on the first speech signal to obtain pinyin information corresponding to the first speech signal.
In step S1022, tone recognition is performed on the first voice signal, so as to obtain tone information corresponding to the first voice signal.
In step S1023, a word composed of the pinyin information and the tone information is determined as the first recognized word.
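Steps S1021 to S1023 can be pictured as two recognizers whose outputs are combined into one record. In the sketch below the two recognizer functions are placeholders that return fixed values; a real system would call speech and tone recognition engines, which the text does not name. The class and function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RecognizedWord:
    pinyin: tuple[str, ...]  # one entry per syllable
    tones: tuple[int, ...]   # 1 to 4, with 0 for the neutral tone


def recognize_pinyin(signal: bytes) -> tuple[str, ...]:
    """Placeholder for S1021: a real system would call a speech recognizer here."""
    return ("xie", "zi")


def recognize_tones(signal: bytes) -> tuple[int, ...]:
    """Placeholder for S1022: a real system would analyze the pitch contour here."""
    return (2, 0)


def recognize_word(signal: bytes) -> RecognizedWord:
    """S1023: combine pinyin and tone information into one recognized word."""
    pinyin = recognize_pinyin(signal)
    tones = recognize_tones(signal)
    if len(pinyin) != len(tones):
        raise ValueError("pinyin and tone sequences must align syllable by syllable")
    return RecognizedWord(pinyin, tones)


first_word = recognize_word(b"\x00\x01")  # dummy audio bytes
```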
On the basis of the foregoing embodiment, in another embodiment of the present invention, as shown in fig. 3, the step S103 includes the following steps.
In step S1031, a plurality of dialect words corresponding to the calibration word are obtained, where each dialect word includes dialect word pinyin and dialect word tone.
In step S1032, at least one dialect word whose similarity to the pinyin information exceeds a preset first similarity threshold is determined as a candidate word according to the dialect word pinyin.
When no dialect word has a similarity exceeding the preset first similarity threshold, step S101 may be executed again to obtain a new first recognition word, until a dialect word whose similarity exceeds the preset first similarity threshold is found.
In step S1033, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity to the tone information exceeds a preset second similarity threshold is determined as the target dialect word.
When no candidate word has a similarity exceeding the preset second similarity threshold, step S101 may be executed again to obtain a new first recognition word, until a dialect word whose similarity exceeds the preset second similarity threshold is found.
In practical applications, the tone information may instead be compared first and the pinyin information second; the order can be set according to actual needs.
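The two-stage comparison of steps S1031 to S1033, together with the retry from step S101, can be sketched as below. Several details are assumptions: similarity is taken as the fraction of aligned elements that match, the thresholds default to 0.5, the candidate with the highest tone similarity is chosen, and the retries are capped at three attempts, whereas the text simply repeats until a match is found.

```python
from typing import Callable, Optional

# A word is (pinyin syllables, tone numbers); 0 stands for the neutral tone.
Word = tuple[tuple[str, ...], tuple[int, ...]]


def match_ratio(a: tuple, b: tuple) -> float:
    """Fraction of aligned positions at which two sequences agree."""
    length = max(len(a), len(b))
    return sum(x == y for x, y in zip(a, b)) / length if length else 0.0


def find_target(recognized: Word, dialect_words: dict[str, Word],
                pinyin_threshold: float, tone_threshold: float) -> Optional[str]:
    # S1032: keep dialect words whose pinyin is similar enough.
    candidates = {name: w for name, w in dialect_words.items()
                  if match_ratio(recognized[0], w[0]) > pinyin_threshold}
    # S1033: among the candidates, keep those whose tones are similar enough.
    scored = [(match_ratio(recognized[1], w[1]), name) for name, w in candidates.items()]
    scored = [item for item in scored if item[0] > tone_threshold]
    return max(scored)[1] if scored else None


def calibrate(read_word: Callable[[], Word], dialect_words: dict[str, Word],
              pinyin_threshold: float = 0.5, tone_threshold: float = 0.5,
              max_attempts: int = 3) -> Optional[str]:
    """Repeat S101/S102 (via `read_word`) until a target dialect word is found."""
    for _ in range(max_attempts):
        target = find_target(read_word(), dialect_words, pinyin_threshold, tone_threshold)
        if target is not None:
            return target
    return None


dialect_words = {
    "dialect_a": (("hai", "zi"), (2, 0)),
    "dialect_b": (("xie", "zi"), (4, 0)),
}
# Hypothetical readings: the first attempt matches nothing, the second succeeds.
readings = iter([(("mao", "yi"), (1, 1)), (("hai", "zi"), (2, 0))])
target = calibrate(lambda: next(readings), dialect_words)
```

Swapping the two filters inside `find_target` gives the tone-first ordering that the text also allows.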
In still another embodiment of the present invention, as shown in fig. 4, the step S104 includes the following steps.
In step S1041, the dialect category corresponding to the target dialect word and the dialect word set corresponding to that dialect category are obtained, where the dialect word set includes a plurality of dialect words.
In step S1042, the service prompt content to be output is obtained.
In step S1043, the dialect word corresponding to each keyword of the service prompt content is searched for in the dialect word set.
The service prompt content may first be segmented according to a preset word segmentation rule. For example, prepositions in the service prompt content, such as "yes" and/or "on", may first be removed. The content is then cut from the beginning, trying a two-character word first, and the cut word is compared with the words in a preset word library. If an identical word exists, cutting continues with the second word; if not, a three-character word is cut instead and compared with the words in the preset word library, and so on, until the whole service prompt content has been segmented.
In step S1044, the found dialect words are output according to the order of the keywords in the service prompt content.
In this step, after the found dialect words are arranged in the order of the keywords in the service prompt content, the service prompt content may be output as a voice signal or the like.
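Steps S1043 and S1044 can be sketched as a forward segmentation followed by an ordered lookup. The sketch makes several assumptions not stated in the text: the word library is taken to be the keys of the dialect word set, words longer than four characters are not tried, a character that starts no known word is skipped, and function words are removed by plain string replacement. The sample prompt, the function word and the file names standing in for dialect recordings are hypothetical.

```python
def segment(content: str, lexicon: set[str], stop_words: set[str],
            max_len: int = 4) -> list[str]:
    """Cut `content` into lexicon words, trying two characters first, then longer."""
    for stop in stop_words:
        content = content.replace(stop, "")  # naive removal of function words
    words = []
    pos = 0
    while pos < len(content):
        for size in range(2, max_len + 1):
            piece = content[pos:pos + size]
            if len(piece) == size and piece in lexicon:
                words.append(piece)
                pos += size
                break
        else:
            pos += 1  # assumption: skip a character that starts no lexicon word
    return words


def render_in_dialect(content: str, dialect_word_set: dict[str, str],
                      stop_words: set[str]) -> list[str]:
    """Look up each keyword and keep the keyword order of the prompt (S1044)."""
    keywords = segment(content, set(dialect_word_set), stop_words)
    return [dialect_word_set[word] for word in keywords]


# Hypothetical dialect word set: keyword -> stored dialect recording.
dialect_word_set = {
    "营业厅": "hall.dialect.wav",
    "选择": "select.dialect.wav",
    "业务": "service.dialect.wav",
}
playlist = render_in_dialect("在营业厅选择业务", dialect_word_set, {"在"})
```

In this example the two-character cut "营业" is not in the library, so the three-character word "营业厅" is tried and matched before segmentation moves on.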
In a further embodiment of the invention, as shown in fig. 5, the method further comprises the following steps.
In step S201, it is detected whether a user is located in a preset service area.
In an embodiment of the present invention, the preset service area may be an area within a preset distance of the guiding machine, for example within 0.5 m of the front of the guiding machine (the side with the display screen).
In step S202, when a user is located in the preset service area, a prompt content including a service item is output, so that the user can select the service item.
In this step, the prompt content may be output by displaying it on the display screen or by playing it as speech.
In step S203, after receiving the second voice signal acquired by the voice acquisition device, recognizing the second voice signal to obtain a second recognized word.
This step is similar to the processing in step S102, and is not described here again.
In step S204, when the second recognized word is different from any service keyword in the preset service lexicon, a prompt message including a preset calibration word is output.
This step distinguishes users who speak Mandarin from users who speak a dialect. If the user speaks Mandarin, the second recognized word will match at least one service keyword; if the user speaks a dialect, the second recognized word will differ from every service keyword.
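The decision made across steps S201 to S204 can be sketched as a small function. The service keywords, the action names and the use of a measured distance as the presence test are assumptions for illustration; the 0.5 m radius follows the example given for the preset service area.

```python
from typing import Optional

# Hypothetical service lexicon; the real keywords are not given in the text.
SERVICE_KEYWORDS = {"payment", "inquiry", "new account"}


def next_action(distance_m: float, second_word: Optional[str],
                service_radius_m: float = 0.5) -> str:
    """Decide what the guiding machine does next (steps S201 to S204)."""
    # S201: is a user inside the preset service area?
    if distance_m > service_radius_m:
        return "idle"
    # S202: a user is present but has not answered yet, so offer the service items.
    if second_word is None:
        return "prompt_service_items"
    # S203/S204: a recognized word that matches a service keyword is handled directly.
    if second_word in SERVICE_KEYWORDS:
        return "handle_service"
    # Otherwise assume a dialect speaker and start calibration (step S101).
    return "prompt_calibration_word"


action = next_action(0.3, "pei men")  # hypothetical unrecognized reply
```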
In still another embodiment of the present invention, as shown in fig. 6, a service guide apparatus includes: a first output module 11, a first recognition module 12, a determination module 13 and a second output module 14;
the first output module 11 is configured to output prompt information including a preset calibration word, so that a user reads the calibration word aloud;
the first recognition module 12 is configured to, after receiving a first voice signal collected by the voice collection device, recognize the first voice signal to obtain a first recognition word;
the determining module 13 is configured to determine, among the plurality of dialect words corresponding to the calibration word, a dialect word with a similarity exceeding a preset threshold with the first recognition word as a target dialect word;
and a second output module 14, configured to output service prompt content by using the dialect category corresponding to the target dialect term, so that the user selects a service item according to the service prompt.
In another embodiment of the present invention, the first recognition module includes:
the first recognition unit is used for carrying out voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal;
the second recognition unit is used for carrying out tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal;
and the first determining unit is used for determining a word consisting of the pinyin information and the tone information as the first recognition word.
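As a non-limiting illustration, the first recognition word may be represented as a pair of pinyin syllables and tone values. The embodiments do not specify a data format; the numbered-tone representation (1 to 4) and all names below are assumptions made for this sketch only.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class RecognizedWord:
    pinyin: Tuple[str, ...]  # one toneless syllable per character (assumed format)
    tones: Tuple[int, ...]   # one tone number per syllable (assumed format)


def compose_recognized_word(pinyin, tones):
    """Combine pinyin information and tone information into one recognized word."""
    if len(pinyin) != len(tones):
        raise ValueError("each syllable needs exactly one tone")
    return RecognizedWord(tuple(pinyin), tuple(tones))


def to_numbered_pinyin(word):
    """Render the word in numbered-pinyin form, e.g. 'dian4 fei4'."""
    return " ".join(f"{s}{t}" for s, t in zip(word.pinyin, word.tones))
```

Keeping the pinyin and the tones as separate fields allows each to be compared independently against the stored dialect words.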
In another embodiment of the present invention, the determining module includes:
the first obtaining unit is used for obtaining a plurality of dialect words corresponding to the calibration words, and the dialect words comprise dialect word pinyin and dialect word tone;
the second determining unit is used for determining, according to the pinyin of the dialect words, at least one dialect word whose similarity with the pinyin information exceeds a preset first similarity threshold as a candidate word;
and the third determining unit is used for determining, according to the dialect word tone of the at least one candidate word, a candidate word whose similarity with the tone information exceeds a preset second similarity threshold as the target dialect word.
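As a non-limiting illustration, the two-stage determination may be sketched as a pinyin filter followed by a tone filter. The embodiments do not define the similarity measures or threshold values; this sketch assumes a character-level ratio from Python's difflib for pinyin, the fraction of matching tone positions for tone, and arbitrary thresholds. The dialect entries are invented placeholders, and returning the best-scoring match when several pass is also an assumption.

```python
from difflib import SequenceMatcher

# Hypothetical dialect words for one calibration word (placeholder data).
DIALECT_WORDS = [
    {"category": "dialect_a", "pinyin": "dianfei", "tones": (4, 4)},
    {"category": "dialect_b", "pinyin": "dianfei", "tones": (2, 1)},
    {"category": "dialect_c", "pinyin": "tianhui", "tones": (1, 1)},
]


def pinyin_similarity(a, b):
    """Assumed measure: character-level similarity ratio in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def tone_similarity(a, b):
    """Assumed measure: fraction of positions with identical tone numbers."""
    if not a or len(a) != len(b):
        return 0.0
    return sum(x == y for x, y in zip(a, b)) / len(a)


def find_target_dialect_word(pinyin, tones, dialect_words,
                             first_threshold=0.7, second_threshold=0.5):
    # Stage 1: keep dialect words whose pinyin is similar enough (candidates).
    candidates = [w for w in dialect_words
                  if pinyin_similarity(pinyin, w["pinyin"]) > first_threshold]
    # Stage 2: keep candidates whose tones are similar enough.
    matches = [w for w in candidates
               if tone_similarity(tones, w["tones"]) > second_threshold]
    if not matches:
        return None
    # Assumption: prefer the closest tone match when several candidates pass.
    return max(matches, key=lambda w: tone_similarity(tones, w["tones"]))
```

Filtering on pinyin first narrows the set cheaply, so the tone comparison only runs on words that already sound similar.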
In yet another embodiment of the present invention, the second output module includes:
the second acquisition unit is used for acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;
a third obtaining unit, configured to obtain service prompt content to be output;
the searching unit is used for searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;
and the output unit is used for outputting the plurality of dialect words found, according to the arrangement sequence of the keywords in the service prompt content.
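As a non-limiting illustration, the search and ordered output may be sketched as a dictionary lookup that preserves keyword order. The mapping contents and names below are hypothetical. The embodiments do not state what happens when a keyword has no dialect word; falling back to the original keyword is an assumption of this sketch.

```python
def render_prompt_in_dialect(prompt_keywords, dialect_word_set):
    """Look up each keyword's dialect word and return them in keyword order."""
    output = []
    for keyword in prompt_keywords:
        dialect_word = dialect_word_set.get(keyword)
        # Assumption: keep the original keyword when no dialect word exists.
        output.append(dialect_word if dialect_word is not None else keyword)
    return output
```

Because the loop iterates over the keywords rather than over the dialect word set, the output follows the arrangement sequence of the service prompt content.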
In yet another embodiment of the present invention, the apparatus further comprises:
the detection module is used for detecting whether a user is located in a preset service area;
the third output module is used for outputting prompt contents containing service items when a user is located in the preset service area so that the user can select the service items;
the second recognition module is used for recognizing a second voice signal after receiving the second voice signal collected by the voice collection equipment to obtain a second recognition word;
and the fourth output module is used for outputting prompt information containing a preset calibration word when the second recognition word is different from any service keyword in a preset service word bank.
The computer program product of the service guiding method and device provided by the embodiment of the present invention includes a computer readable storage medium storing a program code, where instructions included in the program code may be used to execute the method described in the foregoing method embodiment, and specific implementation may refer to the method embodiment, which is not described herein again.
It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the system and the apparatus described above may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again.
In addition, in the description of the embodiments of the present invention, unless otherwise explicitly specified or limited, the terms "mounted," "coupled," and "connected" are to be construed broadly: a connection may be fixed, removable, or integral; it may be mechanical or electrical; and it may be direct, indirect through an intervening medium, or an internal communication between two elements. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to the specific case.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. The aforementioned storage medium includes a USB flash drive, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, an optical disk, and various other media capable of storing program code.
In the description of the present invention, it should be noted that the terms "center", "upper", "lower", "left", "right", "vertical", "horizontal", "inner", "outer", etc., indicate orientations or positional relationships based on those shown in the drawings. They are used only for convenience and simplicity of description, do not indicate or imply that the device or element referred to must have a particular orientation or be constructed and operated in a particular orientation, and thus should not be construed as limiting the present invention. Furthermore, the terms "first," "second," and "third" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance.
Finally, it should be noted that the above embodiments are only specific embodiments of the present invention, used to illustrate its technical solutions and not to limit them, and the protection scope of the present invention is not limited thereto. Although the present invention has been described in detail with reference to the foregoing embodiments, those skilled in the art should understand that anyone skilled in the art may, within the technical scope disclosed by the present invention, still modify the technical solutions described in the foregoing embodiments, readily conceive of changes to them, or make equivalent substitutions for some of their technical features. Such modifications, changes, or substitutions do not depart from the spirit and scope of the embodiments of the present invention, and they shall be covered by the protection scope of the present invention. Therefore, the protection scope of the present invention shall be subject to the protection scope of the claims.

Claims (6)

1. A service guide method is applied to a guide machine comprising a voice acquisition device and a voice output device, and comprises the following steps:
outputting prompt information containing preset calibration words so that a user reads the calibration words;
after receiving a first voice signal acquired by the voice acquisition equipment, identifying the first voice signal to obtain a first identification word;
determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words;
outputting service prompt content by utilizing the dialect category corresponding to the target dialect word so as to enable the user to select a service item according to the service prompt;
the recognizing the first voice signal to obtain a first recognized word, including: performing voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal; performing tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal; determining a word composed of the pinyin information and the tone information as the first recognition word;
determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words, wherein the determining comprises the following steps: acquiring a plurality of dialect words corresponding to the calibration words, wherein the dialect words comprise dialect word pinyin and dialect word tone; determining at least one dialect word with the similarity to the pinyin information exceeding a preset first similarity threshold value as a candidate word according to the pinyin of the dialect words; and determining a reference word with the similarity degree with the tone information exceeding a preset second similarity threshold value as a target dialect word according to the tone of the dialect word of at least one candidate word.
2. The service guiding method according to claim 1, wherein outputting the service prompt content using the dialect category corresponding to the target dialect term comprises:
acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;
acquiring service prompt content to be output;
searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;
and outputting the plurality of searched dialects according to the arrangement sequence of the keywords in the service prompt content.
3. The service guiding method of claim 2, further comprising:
detecting whether a user is located in a preset service area;
when a user is located in the preset service area, outputting prompt content containing service items so that the user can select the service items;
after receiving a second voice signal acquired by the voice acquisition equipment, recognizing the second voice signal to obtain a second recognition word;
and when the second recognition word is different from any service keyword in a preset service word bank, outputting prompt information containing a preset calibration word.
4. A service guide apparatus, characterized in that the apparatus comprises:
the first output module is used for outputting prompt information containing preset calibration words so that a user can read the calibration words aloud;
the first recognition module is used for recognizing a first voice signal after receiving the first voice signal collected by the voice collection equipment to obtain a first recognition word;
the determining module is used for determining the dialect words with the similarity exceeding a preset threshold value with the first recognition word as target dialect words in the plurality of dialect words corresponding to the calibration words;
the second output module is used for outputting service prompt contents by utilizing the dialect category corresponding to the target dialect word so as to enable the user to select service items according to the service prompt;
the identification module comprises: the first recognition unit is used for carrying out voice recognition on the first voice signal to obtain pinyin information corresponding to the first voice signal; the second recognition unit is used for carrying out tone recognition on the first voice signal to obtain tone information corresponding to the first voice signal; a first determining unit, configured to determine that a word composed of the pinyin information and the tone information is the first recognized word;
the determining module includes: the first obtaining unit is used for obtaining a plurality of dialect words corresponding to the calibration words, and the dialect words comprise dialect word pinyin and dialect word tone; the second determining unit is used for determining at least one dialect word with the similarity exceeding a preset first similarity threshold value with the pinyin information as a candidate word according to the pinyin of the dialect words; and the third determining unit is used for determining a reference word with the similarity degree with the tone information exceeding a preset second similarity threshold value as a target dialect word according to the tone of the dialect word of at least one candidate word.
5. The service guide apparatus of claim 4, wherein the second output module comprises:
the second acquisition unit is used for acquiring a dialect category corresponding to the target dialect term and a dialect term set corresponding to the dialect category, wherein the dialect term set comprises a plurality of dialect terms;
a third obtaining unit, configured to obtain service prompt content to be output;
the searching unit is used for searching the dialect words corresponding to the keywords of the service prompt content in the dialect word set;
and the output unit is used for outputting the searched plurality of dialects according to the arrangement sequence of the keywords in the service prompt content.
6. The service guide apparatus of claim 5, wherein the apparatus further comprises:
the detection module is used for detecting whether a user is located in a preset service area;
the third output module is used for outputting prompt contents containing service items when a user is located in the preset service area so that the user can select the service items;
the second recognition module is used for recognizing a second voice signal after receiving the second voice signal collected by the voice collection equipment to obtain a second recognition word;
and the fourth output module is used for outputting prompt information containing a preset calibration word when the second identification word is different from any service keyword in a preset service word bank.
CN201710579589.9A 2017-07-18 2017-07-18 Service guiding method and device Expired - Fee Related CN107393530B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710579589.9A CN107393530B (en) 2017-07-18 2017-07-18 Service guiding method and device

Publications (2)

Publication Number Publication Date
CN107393530A CN107393530A (en) 2017-11-24
CN107393530B true CN107393530B (en) 2020-08-25

Family

ID=60339971

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710579589.9A Expired - Fee Related CN107393530B (en) 2017-07-18 2017-07-18 Service guiding method and device

Country Status (1)

Country Link
CN (1) CN107393530B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108777142A (en) * 2018-06-05 2018-11-09 上海木木机器人技术有限公司 A kind of interactive voice recognition methods and interactive voice robot based on airport environment
CN111489752B (en) * 2020-03-16 2024-03-26 咪咕互动娱乐有限公司 Voice output method, voice output device, electronic equipment and computer readable storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060122840A1 (en) * 2004-12-07 2006-06-08 David Anderson Tailoring communication from interactive speech enabled and multimodal services
US20110313767A1 (en) * 2010-06-18 2011-12-22 At&T Intellectual Property I, L.P. System and method for data intensive local inference
CN102607585A (en) * 2012-04-01 2012-07-25 北京乾图方园软件技术有限公司 Configuration-file-based navigation voice broadcasting method and device
US20130110511A1 (en) * 2011-10-31 2013-05-02 Telcordia Technologies, Inc. System, Method and Program for Customized Voice Communication
CN106003100A (en) * 2016-08-08 2016-10-12 深圳市前海小村机器人智能科技有限公司 Multimedia public service robot
CN106808480A (en) * 2017-03-23 2017-06-09 北京瑞华康源科技有限公司 A kind of robot guide medical system

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004037721A (en) * 2002-07-02 2004-02-05 Pioneer Electronic Corp System and program for voice response and storage medium therefor
CN102760432B (en) * 2012-07-06 2015-08-19 广东美的制冷设备有限公司 A kind of household electrical appliances Acoustic control remote controller and control method thereof
CN103578465B (en) * 2013-10-18 2016-08-17 威盛电子股份有限公司 Speech identifying method and electronic installation
CN105653596A (en) * 2015-12-22 2016-06-08 惠州Tcl移动通信有限公司 Quick startup method and device of specific function on the basis of voice frequency comparison
CN105654950B (en) * 2016-01-28 2019-07-16 百度在线网络技术(北京)有限公司 Adaptive voice feedback method and device
CN105872687A (en) * 2016-03-31 2016-08-17 乐视控股(北京)有限公司 Method and device for controlling intelligent equipment through voice
CN105931145A (en) * 2016-05-06 2016-09-07 乐视控股(北京)有限公司 Intelligent ordering method and apparatus
CN106493741A (en) * 2016-11-28 2017-03-15 广西乐美趣智能科技有限公司 A kind of hotel service Multifunctional intelligent robot

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
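《Tonal and non-tonal intonation in Yichang dialect》; Yan Li et al.; 《IEEE 2016 International Conference on Asian Language Processing (IALP)》; 20161231; full text *
《基于移动终端的智能机器人的设计与实现》 (Design and Implementation of an Intelligent Robot Based on a Mobile Terminal); 刘焯杰 et al.; 《科技经济信息化》; 20151231; full text *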

Also Published As

Publication number Publication date
CN107393530A (en) 2017-11-24

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee (granted publication date: 20200825)
CF01 Termination of patent right due to non-payment of annual fee