CN110706702A - Infinite stage multi-turn dialogue method for speech recognition - Google Patents

Infinite stage multi-turn dialogue method for speech recognition Download PDF

Info

Publication number
CN110706702A
CN110706702A CN201910978335.3A CN201910978335A CN110706702A CN 110706702 A CN110706702 A CN 110706702A CN 201910978335 A CN201910978335 A CN 201910978335A CN 110706702 A CN110706702 A CN 110706702A
Authority
CN
China
Prior art keywords
card
matched
conversation
dialogue
semantics
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910978335.3A
Other languages
Chinese (zh)
Other versions
CN110706702B (en
Inventor
杨毅
李秋标
吴恭辉
陈伟德
吴启庭
黄永煌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuhu Ombo Technology Co Ltd
Original Assignee
Wuhu Ombo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuhu Ombo Technology Co Ltd filed Critical Wuhu Ombo Technology Co Ltd
Priority to CN201910978335.3A priority Critical patent/CN110706702B/en
Publication of CN110706702A publication Critical patent/CN110706702A/en
Application granted granted Critical
Publication of CN110706702B publication Critical patent/CN110706702B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • G06F16/3329Natural language query formulation or dialogue systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/225Feedback of the input speech

Abstract

The invention discloses a voice recognition infinite-stage multi-turn dialogue method, which comprises the steps of formulating a functional card model file, wherein the card type of the functional card model file comprises a main card and an alternative card, after voice input and a recognition result are obtained, recognized semantics are matched with corresponding semantics of the cards one by one, after the recognized semantics are matched with the cards, corresponding card instructions are executed, the corresponding alternative cards are reloaded into a pool to be matched according to the corresponding next-stage dialogue ID, the main card stored into the corresponding pool to be matched is taken as a matching object in the first turn of dialogue, then each turn of dialogue takes the reloaded alternative card after the previous turn of dialogue is matched as a matching object, and the dialogue is ended when the matched card has no next-stage dialogue ID or undefined abnormality occurs. The number of the targets required to be matched in the multi-stage conversation is only related to the number of the targets in the previous stage, the increase of the number of the associated conversation stages has no influence on the matching efficiency, and the matching efficiency and the reliability of the multi-stage infinite round of conversation are guaranteed.

Description

Infinite stage multi-turn dialogue method for speech recognition
Technical Field
The invention relates to an infinite-level multi-turn dialogue recognition method.
Background
At present, the application range of the voice recognition dialogue mode control equipment is wider and wider, and the voice recognition dialogue mode control equipment also starts to adopt the mode to control and automatically answer. In multiple rounds of dialogues, one dialog is related to the requirement of the dialog itself, and the correct feedback and response corresponding to the dialog need to be judged by combining the content of the previous dialog, compared with a single dialog, which is called a secondary dialog, the more and higher the number of the dialog layers related to the front and the back are, but the most of the speech recognition multi-level dialogues in the market at present are three-level dialogues, i.e., a dialog can be made that is continuously related to the content of the previous dialog, can only contain three levels, because existing dialogs have more targets in the pool to be matched when answering or performing a match, and the involvement of an associated multi-level dialog also requires multiple matches in conjunction with previous dialogs, which greatly reduces the efficiency and reliability of the matching, and as the number of the associated dialog levels is increased, the later dialog is obtained, the matching effect is poorer, and the currently associated multi-level dialog can only realize limited round of dialog within three levels.
Disclosure of Invention
The invention aims to provide a voice recognition infinite-stage multi-turn dialogue method, which aims to solve the problem that the related dialogue stages of multi-turn dialogue are limited due to the fact that related targets and related contents are more in matching, matching efficiency is low, error rate is high in the prior art.
The voice recognition infinite-stage multi-turn conversation method comprises the steps that a functional card model file is formulated, the card type of the functional card model file comprises a main card and an alternative card, the content of the functional card model file comprises a card type ID, a card ID, corresponding semantics of the card, a card instruction and a next-stage conversation ID of the card, and the card ID of the alternative card is the next-stage conversation ID of the main card or the alternative card; after voice input and analysis are carried out to obtain semantics, the semantics are matched with the corresponding semantics of the cards one by one, corresponding card instructions are executed after the semantics are matched with the cards, the matched alternative cards are reloaded into a pool to be matched according to the card IDs contained in the corresponding next-level conversation IDs, the pool to be matched after the cards are reset only contains a main card as a matching object, each round of conversation outside the first level takes the reloaded alternative card after the previous round of conversation is matched as the matching object, and the conversation is ended when the matched cards have no card next-level conversation IDs or undefined abnormal conditions occur.
Preferably, the system formulates the function card model file according to a set semantic understanding protocol, and the function card model file is arranged in groups according to a main card and an alternative card, and the method specifically comprises the following steps:
the method comprises the following steps that firstly, initialization is carried out after a system is started, only a main card is loaded in a pool to be matched to realize card resetting, and then the pool is in an idle state to wait for voice input;
secondly, the system receives the dialogue voice and obtains semantics through the analysis of a voice recognition module;
step three, matching the semantics with the corresponding semantics of each card in the pool to be matched one by one, and executing the step five when the matching is successful, otherwise executing the step four;
step four, checking whether the abnormal condition is defined or not when the matching is unsuccessful, if so, prompting a user to repeat the dialogue voice, returning to the step two to analyze the semantics again, and if not, executing the step six;
step five, if the semantic matching is successful, executing a card instruction of the matched card, checking whether the card has a next-level dialogue ID, if so, reloading the corresponding next-level alternative card into a pool to be matched, then entering an idle state to wait for voice input, and if not, executing step six, wherein the card is a dialogue endpoint;
and step six, ending the conversation to reset the card, and then waiting for voice input in an idle state.
Preferably, the method is provided with a parameter abnormal frequency representing the frequency of occurrence of the defined abnormal condition, the initial value is 0, the abnormal frequency is +1 if the defined abnormal condition is judged in the fourth step, then the abnormal frequency is checked, if the abnormal frequency is less than 3, the user is prompted to repeat the dialogue voice and return to the second step to analyze the semantics again, and if not, the sixth step is executed.
Preferably, the method further comprises the steps of after the system executes the card instruction, if the card is not reset, starting timing through a timer, resetting the timer after the system receives the next round of conversation, judging that the system is abnormal after the timing is greater than a set first threshold, prompting a user to provide conversation voice if the abnormal times are less than 3, resetting a counter for timing, and resetting and closing the timer when the card is reset.
Preferably, the next-level session ID of the candidate card may be null, or may include one or more card IDs of the candidate cards, and when reloading the candidate cards, all the card IDs included in the next-level session ID are read out respectively, and then the candidate cards corresponding to each card ID are reloaded into the pool to be matched.
Preferably, the defined abnormal condition comprises that the semantic is normal after being analyzed, but the semantic is not matched with the card, and the dialogue voice is not received after overtime; the undefined abnormal condition comprises that the received voice cannot be analyzed to obtain semantics and the semantics comprise abnormal characters.
The invention has the following advantages:
according to the method, the function card model file is formulated to be used for matching with the semantic meaning recognized by the voice to feed back the execution command, the matched cards are divided into the main card and the alternative cards, the main card is used for the first-stage conversation (also the first-round conversation) of conversation start, and the main card and the alternative cards are divided into different pools to be matched, so that the target number of conversation matching is greatly reduced at the beginning of the conversation, and the matching efficiency and accuracy are improved. Meanwhile, for each stage of later conversation, the next stage of conversation ID related to the card is set, the possible related next stage of alternative cards are searched out for reloading, the number of targets matched with the later conversation is changed, invalid targets are prevented from being added into objects matched one by one, therefore, when all stages of later conversations are matched, the number of targets is only related to the number of card IDs contained in the next stage of conversation ID of the card corresponding to the previous stage of conversation, no matter how many stages of conversations are related to the previous stage of conversation, the matching efficiency cannot be greatly influenced due to the increase of the number of the related conversation stages, and the matching efficiency and reliability of multi-stage infinite conversation are guaranteed.
Drawings
FIG. 1 is a flow chart of a method for infinite multi-turn speech recognition according to the present invention.
Detailed Description
The following detailed description of the embodiments of the present invention will be given in order to provide those skilled in the art with a more complete, accurate and thorough understanding of the inventive concept and technical solutions of the present invention.
As shown in fig. 1, the present invention provides a voice recognition infinite-level multi-round conversation method, in which a semantic understanding protocol set by a voice recognition matching system used in the method defines a function card model file, the card types of the function card model file include a main card and an alternative card, the contents of the function card model file include a card type ID, a card ID, corresponding semantics of the card, a card instruction, and a next-level conversation ID of the card, the card IDs of the alternative cards are all the next-level conversation IDs of the main card or the alternative card, and the main card and the alternative card are stored in groups.
The infinite-stage multi-turn dialogue recognition method specifically comprises the following steps of:
the method comprises the following steps that firstly, initialization is carried out after a system is started, only a main card is loaded in a pool to be matched to realize card resetting, and then the pool is in an idle state to wait for voice input;
secondly, the system receives the dialogue voice and obtains semantics through the analysis of a voice recognition module;
step three, matching the semantics with the corresponding semantics of each card in the pool to be matched one by one, and executing the step five when the matching is successful, otherwise executing the step four;
step four, checking whether the abnormal condition is defined or not when the matching is unsuccessful, if so, prompting a user to repeat the dialogue voice, returning to the step two to analyze the semantics again, and if not, executing the step six;
step five, if the semantic matching is successful, executing a card instruction of the matched card, checking whether the card has a next-level dialogue ID, if so, reloading the corresponding next-level alternative card into a pool to be matched, then entering an idle state to wait for voice input, and if not, executing step six, wherein the card is a dialogue endpoint;
and step six, ending the conversation to reset the card, and then waiting for voice input in an idle state.
The system is initialized firstly after being started, the cards in the pool to be matched are reset, namely the cards are all reloaded into the main cards, so that only the main cards in the pool to be matched corresponding to the first wheel pair session are used as matching objects, and then if the previous session of each wheel pair session is not finished and the cards in the pool to be matched are not reset, the reloaded alternative cards after the previous wheel pair session is matched are used as the matching objects.
The matched cards are divided into main cards and alternative cards, the main cards are used for the first-level conversation (also the first round of conversation) of conversation starting, and the main cards and the alternative cards are divided into different pools to be matched, so that the target number of conversation matching is greatly reduced at the beginning of the conversation, and the matching efficiency and accuracy are improved. When all levels of conversations are matched, the number of the matched card targets is only related to the number of the card IDs contained in the next level of conversation IDs of the corresponding cards of the previous level of conversations, no matter how many levels of conversations are related to the previous level of conversations, the matching efficiency cannot be greatly influenced due to the increase of the number of the related conversation levels, and the matching efficiency and the reliability of the multi-level infinite conversation are guaranteed.
Furthermore, in order to prevent the user from being repeatedly prompted to repeatedly talk to enter the dead loop, the method is provided with a parameter abnormal frequency representing the frequency of occurrence of the defined abnormal condition, and the initial value is 0. And in the fourth step, if the abnormal condition is judged to be the defined abnormal condition, the abnormal times are plus 1, then the abnormal times are checked, if the abnormal times are less than 3, a user is prompted to repeat the dialogue voice and return to the second step to analyze the semantics again, if not, the sixth step is executed, and the card is reset after the dialogue is ended.
Therefore, when the received voice is unclear, the voice which can not be correctly recognized appears, and the semantic content exceeds the range which can be responded by the system, the method can firstly repeat the voice for 2 times and then recognize and match, can timely finish the conversation when the voice can not be matched for more than 3 times so as to prevent the effect of repeated prompting endless loop, and feeds back a user to enable the user to know the condition that the voice recognition matching has errors, thereby timely remedying.
The defined abnormal conditions comprise that the semantic is normal but can not be matched with the card after being analyzed, and the dialogue voice is not received after overtime; the undefined abnormal condition comprises that the received voice cannot be analyzed to obtain semantics and the semantics comprise abnormal characters. The normal semantics obtained through analysis and the failure of matching with the card generally belong to normal conversation, but the conversation content exceeds the range of the system executable command and cannot be realized; or the speech is unclear, so that the semantic analysis has a small amount of errors and cannot be matched. The absence of the received speech is usually that the next dialog is not performed for a long time after the first dialog, and the user may leave or not want to perform the next dialog, but may just forget to perform the next dialog. These situations require the user to be prompted to repeat or conduct the conversation and to end the conversation when multiple failures occur. When the speech is analyzed to obtain the semantics and the semantics contain abnormal characters, the input speech or the input speech does not belong to the normal conversation of the user and may be noise; on the other hand, the problem may occur that the speech recognition matching system itself is out of order, resulting in information errors, which need to be corrected, and the dialog should be ended and the problem should be fed back to the user.
The method further comprises the steps that after the system executes a card instruction, if the card is not reset, timing is started through a timer, the timer is reset after the system receives the next round of conversation, the system judges that the system is abnormal after the timing is larger than a set first threshold value, if the abnormal times are smaller than 3, a user is prompted to give out conversation voice, meanwhile, a counter is reset to conduct timing, and the system resets and closes the timer when the card is reset. Therefore, after the multi-level conversation starts, the method can remind the user who forgets to continue the conversation in time, and can also remind the user to automatically stop the conversation service after the user is not fed back 2 times when the user leaves the scene or does not want to continue the conversation, thereby reducing the energy consumption of the system and avoiding the repeated prompt from disturbing the user or others.
The next-level conversation ID of the alternative cards can be empty, and can also comprise one card ID of one to a plurality of alternative cards, when the alternative cards are reloaded, all the card IDs contained in the next-level conversation ID are respectively read, and then the alternative cards corresponding to the card IDs are reloaded into the pool to be matched. Therefore, the next stage dialogue has multiple choices, one-to-one matching can be carried out according to the problem actually proposed, and the ID of the next stage dialogue also allows the corresponding alternative card which is being matched to be contained with the ID of the previous stage, so that the freedom degree of dialogue matching can be greatly improved compared with the traditional stage-by-stage association and multiple matching, and the efficiency can be improved.
The invention is described above with reference to the accompanying drawings, it is obvious that the specific implementation of the invention is not limited by the above-mentioned manner, and it is within the scope of the invention to adopt various insubstantial modifications of the inventive concept and solution of the invention, or to apply the inventive concept and solution directly to other applications without modification.

Claims (6)

1. An infinite-level multi-turn dialogue method for voice recognition is characterized in that: a functional card model file is formulated, the card type of the functional card model file comprises a main card and an alternative card, the content of the functional card model file comprises a card type ID, a card ID, corresponding semantics of the card, a card instruction and a next-level card conversation ID, and the card ID of the alternative card is the next-level card conversation ID of the main card or the alternative card; after voice input and analysis are carried out to obtain semantics, the semantics are matched with the corresponding semantics of the cards one by one, corresponding card instructions are executed after the semantics are matched with the cards, the matched alternative cards are reloaded into a pool to be matched according to the card IDs contained in the corresponding next-level conversation IDs, the pool to be matched after the cards are reset only contains a main card as a matching object, each round of conversation outside the first level takes the reloaded alternative card after the previous round of conversation is matched as the matching object, and the conversation is ended when the matched cards have no card next-level conversation IDs or undefined abnormal conditions occur.
2. A speech recognition infinite number of dialog method according to claim 1, characterized in that: the system formulates the function card model file according to a set semantic understanding protocol, the function card model file is arranged according to the main card and the alternative card in a grouping mode, and the method specifically comprises the following steps:
the method comprises the following steps that firstly, initialization is carried out after a system is started, only a main card is loaded in a pool to be matched to realize card resetting, and then the pool is in an idle state to wait for voice input;
secondly, the system receives the dialogue voice and obtains semantics through the analysis of a voice recognition module;
step three, matching the semantics with the corresponding semantics of each card in the pool to be matched one by one, and executing the step five when the matching is successful, otherwise executing the step four;
step four, checking whether the abnormal condition is defined or not when the matching is unsuccessful, if so, prompting a user to repeat the dialogue voice, returning to the step two to analyze the semantics again, and if not, executing the step six;
step five, if the semantic matching is successful, executing a card instruction of the matched card, checking whether the card has a next-level dialogue ID, if so, reloading the corresponding next-level alternative card into a pool to be matched, then entering an idle state to wait for voice input, and if not, executing step six, wherein the card is a dialogue endpoint;
and step six, ending the conversation to reset the card, and then waiting for voice input in an idle state.
3. A speech recognition infinite number of dialog method according to claim 2, characterized in that: the method is provided with parameter abnormal times which represent the times of occurrence of defined abnormal conditions, the initial value is 0, the abnormal times is +1 if the defined abnormal conditions are judged in the fourth step, then the abnormal times is checked, if the abnormal times is less than 3, a user is prompted to repeat the dialogue voice and return to the second step to analyze the semantics again, and if not, the sixth step is executed.
4. A speech recognition infinite number of rounds of dialogue as claimed in claim 3, wherein: the method comprises the steps that after a system executes a card instruction, if the card is not reset, timing is started through a timer, the timer is reset after the system receives the next round of conversation, the system judges that the system is abnormal after the timing is larger than a set first threshold value, if the abnormal times are smaller than 3, a user is prompted to give out conversation voice, meanwhile, a counter is reset for timing, and the system resets and closes the timer when the card is reset.
5. A speech recognition infinite number of rounds of dialogue as claimed in claim 4, wherein: the next-level conversation ID of the alternative cards can be empty, and can also comprise one card ID of one to a plurality of alternative cards, when the alternative cards are reloaded, all the card IDs contained in the next-level conversation ID are respectively read, and then the alternative cards corresponding to the card IDs are reloaded into the pool to be matched.
6. A speech recognition infinite number of rounds of dialogue as claimed in claim 5, wherein: the defined abnormal conditions comprise that the semantic is normal but can not be matched with the card after being analyzed, and the dialogue voice is not received after overtime; the undefined abnormal condition comprises that the received voice cannot be analyzed to obtain semantics and the semantics comprise abnormal characters.
CN201910978335.3A 2019-10-15 2019-10-15 Infinite stage multi-turn dialogue method for speech recognition Active CN110706702B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910978335.3A CN110706702B (en) 2019-10-15 2019-10-15 Infinite stage multi-turn dialogue method for speech recognition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910978335.3A CN110706702B (en) 2019-10-15 2019-10-15 Infinite stage multi-turn dialogue method for speech recognition

Publications (2)

Publication Number Publication Date
CN110706702A true CN110706702A (en) 2020-01-17
CN110706702B CN110706702B (en) 2021-11-09

Family

ID=69198610

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910978335.3A Active CN110706702B (en) 2019-10-15 2019-10-15 Infinite stage multi-turn dialogue method for speech recognition

Country Status (1)

Country Link
CN (1) CN110706702B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022247466A1 (en) * 2021-05-26 2022-12-01 华为技术有限公司 Resource display method, terminal and server
CN115527537A (en) * 2022-11-24 2022-12-27 广州小鹏汽车科技有限公司 Voice interaction method, server and computer readable storage medium

Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008242318A (en) * 2007-03-28 2008-10-09 Toshiba Corp Apparatus, method and program detecting interaction
TW200945871A (en) * 2008-04-25 2009-11-01 Delta Electronics Inc Outbound dialogue system and dialogue operation method
CN102339604A (en) * 2010-07-16 2012-02-01 西安欧博科工贸有限公司 Speech intelligent interaction system
CN103020047A (en) * 2012-12-31 2013-04-03 威盛电子股份有限公司 Method for revising voice response and natural language dialogue system
CN106294854A (en) * 2016-08-22 2017-01-04 北京光年无限科技有限公司 A kind of man-machine interaction method for intelligent robot and device
CN107369443A (en) * 2017-06-29 2017-11-21 北京百度网讯科技有限公司 Dialogue management method and device based on artificial intelligence
US10057421B1 (en) * 2016-12-27 2018-08-21 Amazon Technologies, Inc. Utilizing virtual user records for device management
CN108959520A (en) * 2018-06-28 2018-12-07 百度在线网络技术(北京)有限公司 Searching method, device, equipment and storage medium based on artificial intelligence
CN109190107A (en) * 2018-07-17 2019-01-11 湖南优浪语音科技有限公司 Intelligent dialogue method and apparatus
CN109616108A (en) * 2018-11-29 2019-04-12 北京羽扇智信息科技有限公司 More wheel dialogue interaction processing methods, device, electronic equipment and storage medium
CN109658925A (en) * 2018-11-28 2019-04-19 上海蔚来汽车有限公司 It is a kind of that wake-up vehicle-mounted voice dialogue method and system are exempted from based on context
CN109918673A (en) * 2019-03-14 2019-06-21 湖北亿咖通科技有限公司 Semantic referee method, device, electronic equipment and computer readable storage medium

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2008242318A (en) * 2007-03-28 2008-10-09 Toshiba Corp Apparatus, method and program detecting interaction
TW200945871A (en) * 2008-04-25 2009-11-01 Delta Electronics Inc Outbound dialogue system and dialogue operation method
CN102339604A (en) * 2010-07-16 2012-02-01 西安欧博科工贸有限公司 Speech intelligent interaction system
CN103020047A (en) * 2012-12-31 2013-04-03 威盛电子股份有限公司 Method for revising voice response and natural language dialogue system
CN106294854A (en) * 2016-08-22 2017-01-04 北京光年无限科技有限公司 A kind of man-machine interaction method for intelligent robot and device
US10057421B1 (en) * 2016-12-27 2018-08-21 Amazon Technologies, Inc. Utilizing virtual user records for device management
CN107369443A (en) * 2017-06-29 2017-11-21 北京百度网讯科技有限公司 Dialogue management method and device based on artificial intelligence
CN108959520A (en) * 2018-06-28 2018-12-07 百度在线网络技术(北京)有限公司 Searching method, device, equipment and storage medium based on artificial intelligence
CN109190107A (en) * 2018-07-17 2019-01-11 湖南优浪语音科技有限公司 Intelligent dialogue method and apparatus
CN109658925A (en) * 2018-11-28 2019-04-19 上海蔚来汽车有限公司 It is a kind of that wake-up vehicle-mounted voice dialogue method and system are exempted from based on context
CN109616108A (en) * 2018-11-29 2019-04-12 北京羽扇智信息科技有限公司 More wheel dialogue interaction processing methods, device, electronic equipment and storage medium
CN109918673A (en) * 2019-03-14 2019-06-21 湖北亿咖通科技有限公司 Semantic referee method, device, electronic equipment and computer readable storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CHEONGJAE LEE ET AL.: "A SITUATION-BASED DIALOGUE MANAGEMENT USING DIALOGUE EXAMPLES", 《ICASSP 2006》 *
郭天翼等: "自然语言处理领域中的自动问答研究进展", 《武汉大学学报(理学版)》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022247466A1 (en) * 2021-05-26 2022-12-01 华为技术有限公司 Resource display method, terminal and server
CN115527537A (en) * 2022-11-24 2022-12-27 广州小鹏汽车科技有限公司 Voice interaction method, server and computer readable storage medium

Also Published As

Publication number Publication date
CN110706702B (en) 2021-11-09

Similar Documents

Publication Publication Date Title
CN110706702B (en) Infinite stage multi-turn dialogue method for speech recognition
KR102357247B1 (en) Enhanced speech endpointing
CN102282610A (en) Voice conversation device, conversation control method, and conversation control program
US7228275B1 (en) Speech recognition system having multiple speech recognizers
US10810995B2 (en) Automatic speech recognition (ASR) model training
US20060287868A1 (en) Dialog system
US8700398B2 (en) Interface for setting confidence thresholds for automatic speech recognition and call steering applications
US7346507B1 (en) Method and apparatus for training an automated speech recognition-based system
WO2002060162A2 (en) Method and system for preventing error amplification in natural language dialogues
JPS62221735A (en) Real time process control method and apparatus using non-procedure language
CN102138175A (en) Speech recognition with parallel recognition tasks
US8515734B2 (en) Integrated language model, related systems and methods
US7809566B2 (en) One-step repair of misrecognized recognition strings
CN105323392A (en) Method and apparatus for quickly entering IVR menu
US20130204626A1 (en) Method and Apparatus for Setting Selected Recognition Parameters to Minimize an Application Cost Function
CN112084317A (en) Method and apparatus for pre-training a language model
CN111897256A (en) Children programming control system
CN105007365B (en) A kind of extension number dialing method and device
WO2003088210A1 (en) Method and system for speech recognition of symbol sequences
CN116825103A (en) Voice reply method, device and system
CN111970311B (en) Session segmentation method, electronic device and computer readable medium
CN111443915B (en) Method and system for realizing CLI command interface on singlechip
CN110888971B (en) Multi-round interaction method and device for robot customer service and user
CN111857721A (en) SQL statement verification method, data acquisition method, equipment and storage device
JP2581111B2 (en) Data input control method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant