JP7311707B2 - Human-machine dialogue processing method - Google Patents
Human-machine dialogue processing method
- Publication number
- JP7311707B2 (application JP2022522284A)
- Authority
- JP
- Japan
- Prior art keywords
- voice message
- user terminal
- interaction
- mode
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 230000003993 interaction Effects 0.000 title claims description 121
- 238000003672 processing method Methods 0.000 title claims description 17
- 230000009977 dual effect Effects 0.000 claims description 50
- 238000000034 method Methods 0.000 claims description 33
- 230000008569 process Effects 0.000 claims description 19
- 230000002452 interceptive effect Effects 0.000 claims description 17
- 230000004044 response Effects 0.000 claims description 17
- 238000012545 processing Methods 0.000 claims description 15
- 230000000875 corresponding effect Effects 0.000 description 32
- 238000005516 engineering process Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 4
- 238000010586 diagram Methods 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 2
- 238000004590 computer program Methods 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000004140 cleaning Methods 0.000 description 1
- 230000008094 contradictory effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 229920001690 polydopamine Polymers 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9032—Query formulation
- G06F16/90332—Natural language query formulation or dialogue systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for program control, e.g. control units
- G06F9/06—Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
- G06F9/44—Arrangements for executing specific programs
- G06F9/4401—Bootstrapping
- G06F9/4418—Suspend and resume; Hibernate and awake
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/162—Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1822—Parsing for meaning understanding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L2015/088—Word spotting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Software Systems (AREA)
- Artificial Intelligence (AREA)
- Mathematical Physics (AREA)
- Databases & Information Systems (AREA)
- General Health & Medical Sciences (AREA)
- Data Mining & Analysis (AREA)
- Computer Security & Cryptography (AREA)
- Telephonic Communication Services (AREA)
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910975502.9A CN112732340B (zh) | 2019-10-14 | 2019-10-14 | Human-machine dialogue processing method and apparatus |
CN201910975502.9 | 2019-10-14 | ||
PCT/CN2019/120612 WO2021072914A1 (zh) | 2019-10-14 | 2019-11-25 | Human-machine dialogue processing method |
Publications (2)
Publication Number | Publication Date |
---|---|
JP2022545981A (ja) | 2022-11-01 |
JP7311707B2 (ja) | 2023-07-19 |
Family
ID=75538276
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022522284A Active JP7311707B2 (ja) | 2019-10-14 | 2019-11-25 | Human-machine dialogue processing method |
Country Status (5)
Country | Link |
---|---|
US (1) | US11830483B2 (de) |
EP (1) | EP4047489A4 (de) |
JP (1) | JP7311707B2 (de) |
CN (1) | CN112732340B (de) |
WO (1) | WO2021072914A1 (de) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
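CN113628622A (zh) * | 2021-08-24 | 2021-11-09 | 北京达佳互联信息技术有限公司 | Voice interaction method and apparatus, electronic device, and storage medium |
CN113744743B (zh) * | 2021-08-27 | 2022-11-08 | 海信冰箱有限公司 | Voice interaction method and apparatus for a washing machine |
CN114417891B (zh) * | 2022-01-22 | 2023-05-09 | 平安科技(深圳)有限公司 | Method and apparatus for determining reply sentences based on rough semantics, and electronic device |
CN117153157B (zh) * | 2023-09-19 | 2024-06-04 | 深圳市麦驰信息技术有限公司 | Multimodal full-duplex dialogue method and system with semantic recognition |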
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20090089065A1 (en) | 2007-10-01 | 2009-04-02 | Markus Buck | Adjusting or setting vehicle elements through speech control |
US20140309996A1 (en) | 2013-04-10 | 2014-10-16 | Via Technologies, Inc. | Voice control method and mobile terminal apparatus |
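CN109657091A (zh) | 2019-01-02 | 2019-04-19 | 百度在线网络技术(北京)有限公司 | State presentation method, apparatus, and device for a voice interaction device, and storage medium |
CN112002315A (zh) | 2020-07-28 | 2020-11-27 | 珠海格力电器股份有限公司 | Voice control method and apparatus, electrical appliance, storage medium, and processor |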
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101389059B (zh) * | 2007-09-11 | 2012-08-08 | 华为技术有限公司 | Method and device for implementing session mode switching |
US8681664B2 (en) * | 2008-08-11 | 2014-03-25 | Qualcomm Incorporated | Setting up a full-duplex communication session and transitioning between half-duplex and full-duplex during a communication session within a wireless communications system |
US20140244273A1 (en) * | 2013-02-27 | 2014-08-28 | Jean Laroche | Voice-controlled communication connections |
CN104679472A (zh) * | 2015-02-13 | 2015-06-03 | 百度在线网络技术(北京)有限公司 | Human-machine voice interaction method and apparatus |
US9713192B2 (en) * | 2015-03-27 | 2017-07-18 | Intel Corporation | Device and method for processing audio data |
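CN106658369B (zh) * | 2016-12-06 | 2020-02-07 | 歌尔科技有限公司 | Two-way voice communication device, communication system, and communication method |
CN109739971B (zh) | 2019-01-03 | 2021-04-23 | 浙江百应科技有限公司 | Method for implementing full-duplex intelligent voice dialogue based on a WeChat mini program |
CN112017650B (zh) * | 2019-05-31 | 2024-05-24 | 百度在线网络技术(北京)有限公司 | Voice control method and apparatus for an electronic device, computer device, and storage medium |
CN110660390B (zh) * | 2019-09-17 | 2022-05-03 | 百度在线网络技术(北京)有限公司 | Smart device wake-up method, smart device, and computer-readable storage medium |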
2019
- 2019-10-14: CN application CN201910975502.9A (CN112732340B, zh), Active
- 2019-11-25: WO application PCT/CN2019/120612 (WO2021072914A1, zh), status unknown
- 2019-11-25: JP application JP2022522284A (JP7311707B2, ja), Active
- 2019-11-25: EP application EP19948949.3A (EP4047489A4, de), Pending
- 2019-11-25: US application US17/768,666 (US11830483B2, en), Active
Also Published As
Publication number | Publication date |
---|---|
CN112732340A (zh) | 2021-04-30 |
EP4047489A4 (de) | 2022-11-23 |
JP2022545981A (ja) | 2022-11-01 |
WO2021072914A1 (zh) | 2021-04-22 |
US11830483B2 (en) | 2023-11-28 |
US20230162730A1 (en) | 2023-05-25 |
EP4047489A1 (de) | 2022-08-24 |
CN112732340B (zh) | 2022-03-15 |
Similar Documents
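Publication | Title | Publication Date |
---|---|---|
JP7311707B2 (ja) | Human-machine dialogue processing method | |
JP7353497B2 (ja) | Server-side processing method and server for proactively initiating a dialogue, and voice interaction system capable of proactively initiating a dialogue | |
CN111147357B (zh) | Use of a digital assistant in communications | |
US10055190B2 (en) | Attribute-based audio channel arbitration | |
CN108877804B (zh) | Voice service method and system, electronic device, and storage medium | |
CN111049996A (zh) | Multi-scenario speech recognition method and apparatus, and intelligent customer service system using the same | |
WO2012055315A1 (zh) | System and method for providing and managing interactive services | |
CN110246499B (zh) | Voice control method and apparatus for home devices | |
CN112542183B (zh) | Audio data processing method, apparatus, device, and storage medium | |
WO2019205985A1 (zh) | Mini-program-based sound processing system, method, and server | |
JP6934076B2 (ja) | Smart service method, apparatus, and device | |
CN110890094A (zh) | Voice control method for Internet of Things devices, and voice server | |
CN112185362A (zh) | Speech processing method and apparatus for user-personalized services | |
CN112689012A (zh) | Cross-network proxy communication method and apparatus | |
CN110442698B (zh) | Dialogue content generation method and system | |
CN111161734A (zh) | Voice interaction method and apparatus based on a specified scenario | |
CN107395493B (zh) | Method and apparatus for sharing messages based on Intent | |
CN110035308A (zh) | Data processing method, device, and storage medium | |
CN112786031B (zh) | Human-machine dialogue method and system | |
CN104954538B (zh) | Information processing method and electronic device | |
CN113709506A (zh) | Cloud-phone-based multimedia playback method, apparatus, medium, and program product | |
CN113271385A (zh) | Call forwarding method | |
CN111091303A (zh) | Skill customization method and apparatus | |
WO2018118725A1 (en) | Supplementing telephony calls with conversational bots | |
CN114466320A (zh) | Session negotiation method and apparatus, electronic device, and computer-readable medium |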
Legal Events
Code | Title | Description |
---|---|---|
A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 Effective date: 20220801 |
A871 | Explanation of circumstances concerning accelerated examination | Free format text: JAPANESE INTERMEDIATE CODE: A871 Effective date: 20220801 |
A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 Effective date: 20221122 |
A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 Effective date: 20230216 |
A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 Effective date: 20230327 |
TRDD | Decision of grant or rejection written | |
A01 | Written decision to grant a patent or to grant a registration (utility model) | Free format text: JAPANESE INTERMEDIATE CODE: A01 Effective date: 20230620 |
A61 | First payment of annual fees (during grant procedure) | Free format text: JAPANESE INTERMEDIATE CODE: A61 Effective date: 20230706 |
R150 | Certificate of patent or registration of utility model | Ref document number: 7311707 Country of ref document: JP Free format text: JAPANESE INTERMEDIATE CODE: R150 |