JP7311707B2 - Human-machine dialogue processing method - Google Patents

Human-machine dialogue processing method

Info

Publication number
JP7311707B2
Authority
JP
Japan
Prior art keywords
voice message
user terminal
interaction
mode
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2022522284A
Other languages
English (en)
Japanese (ja)
Other versions
JP2022545981A (ja)
Inventor
ヤン、キンウェイ
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AI Speech Ltd
Original Assignee
AI Speech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by AI Speech Ltd
Publication of JP2022545981A
Application granted
Publication of JP7311707B2
Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1815 Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/903 Querying
    • G06F16/9032 Query formulation
    • G06F16/90332 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F9/00 Arrangements for program control, e.g. control units
    • G06F9/06 Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
    • G06F9/44 Arrangements for executing specific programs
    • G06F9/4401 Bootstrapping
    • G06F9/4418 Suspend and resume; Hibernate and awake
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/162 Interface to dedicated audio devices, e.g. audio drivers, interface to CODECs
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L15/18 Speech classification or search using natural language modelling
    • G10L15/1822 Parsing for meaning understanding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/08 Speech classification or search
    • G10L2015/088 Word spotting
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G10L15/22 Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223 Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Acoustics & Sound (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Artificial Intelligence (AREA)
  • Mathematical Physics (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Security & Cryptography (AREA)
  • Telephonic Communication Services (AREA)
JP2022522284A 2019-10-14 2019-11-25 Human-machine dialogue processing method Active JP7311707B2 (ja)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CN201910975502.9A CN112732340B (zh) 2019-10-14 2019-10-14 Human-machine dialogue processing method and apparatus
CN201910975502.9 2019-10-14
PCT/CN2019/120612 WO2021072914A1 (zh) 2019-10-14 2019-11-25 Human-machine dialogue processing method

Publications (2)

Publication Number Publication Date
JP2022545981A JP2022545981A (ja) 2022-11-01
JP7311707B2 JP7311707B2 (ja) 2023-07-19

Family

ID=75538276

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2022522284A Active JP7311707B2 (ja) 2019-10-14 2019-11-25 Human-machine dialogue processing method

Country Status (5)

Country Link
US (1) US11830483B2 (de)
EP (1) EP4047489A4 (de)
JP (1) JP7311707B2 (de)
CN (1) CN112732340B (de)
WO (1) WO2021072914A1 (de)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
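CN113628622A (zh) * 2021-08-24 2021-11-09 北京达佳互联信息技术有限公司 Voice interaction method and apparatus, electronic device, and storage medium
CN113744743B (zh) * 2021-08-27 2022-11-08 海信冰箱有限公司 Voice interaction method and apparatus for a washing machine
CN114417891B (zh) * 2022-01-22 2023-05-09 平安科技(深圳)有限公司 Method and apparatus for determining reply sentences based on coarse semantics, and electronic device
CN117153157B (zh) * 2023-09-19 2024-06-04 深圳市麦驰信息技术有限公司 Multimodal full-duplex dialogue method and system with semantic recognition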

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090089065A1 (en) 2007-10-01 2009-04-02 Markus Buck Adjusting or setting vehicle elements through speech control
US20140309996A1 (en) 2013-04-10 2014-10-16 Via Technologies, Inc. Voice control method and mobile terminal apparatus
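CN109657091A (zh) 2019-01-02 2019-04-19 百度在线网络技术(北京)有限公司 State presentation method, apparatus, device, and storage medium for a voice interaction device
CN112002315A (zh) 2020-07-28 2020-11-27 珠海格力电器股份有限公司 Voice control method and apparatus, electrical appliance, storage medium, and processor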

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101389059B (zh) * 2007-09-11 2012-08-08 华为技术有限公司 Method and device for switching session modes
US8681664B2 (en) * 2008-08-11 2014-03-25 Qualcomm Incorporated Setting up a full-duplex communication session and transitioning between half-duplex and full-duplex during a communication session within a wireless communications system
US20140244273A1 (en) * 2013-02-27 2014-08-28 Jean Laroche Voice-controlled communication connections
CN104679472A (zh) * 2015-02-13 2015-06-03 百度在线网络技术(北京)有限公司 Human-machine voice interaction method and apparatus
US9713192B2 (en) * 2015-03-27 2017-07-18 Intel Corporation Device and method for processing audio data
CN106658369B (zh) * 2016-12-06 2020-02-07 歌尔科技有限公司 Two-way voice communication device, communication system, and communication method
CN109739971B (zh) 2019-01-03 2021-04-23 浙江百应科技有限公司 Method for implementing full-duplex intelligent voice dialogue based on a WeChat mini program
CN112017650B (zh) * 2019-05-31 2024-05-24 百度在线网络技术(北京)有限公司 Voice control method and apparatus for an electronic device, computer device, and storage medium
CN110660390B (zh) * 2019-09-17 2022-05-03 百度在线网络技术(北京)有限公司 Smart device wake-up method, smart device, and computer-readable storage medium

Also Published As

Publication number Publication date
CN112732340A (zh) 2021-04-30
EP4047489A4 (de) 2022-11-23
JP2022545981A (ja) 2022-11-01
WO2021072914A1 (zh) 2021-04-22
US11830483B2 (en) 2023-11-28
US20230162730A1 (en) 2023-05-25
EP4047489A1 (de) 2022-08-24
CN112732340B (zh) 2022-03-15

Similar Documents

Publication Publication Date Title
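JP7311707B2 (ja) Human-machine dialogue processing method
JP7353497B2 (ja) Server-side processing method and server for proactively initiating a dialogue, and voice interaction system capable of proactively initiating a dialogue
CN111147357B (zh) Use of a digital assistant in communications
US10055190B2 (en) Attribute-based audio channel arbitration
CN108877804B (zh) Voice service method and system, electronic device, and storage medium
CN111049996A (zh) Multi-scenario speech recognition method and apparatus, and intelligent customer service system using the same
WO2012055315A1 (zh) System and method for providing and managing interactive services
CN110246499B (zh) Voice control method and apparatus for home devices
CN112542183B (zh) Audio data processing method, apparatus, device, and storage medium
WO2019205985A1 (zh) Mini-program-based sound processing system, method, and server
JP6934076B2 (ja) Smart service method, apparatus, and device
CN110890094A (zh) Voice control method for Internet of Things devices and voice server
CN112185362A (zh) Voice processing method and apparatus for user-personalized services
CN112689012A (zh) Cross-network proxy communication method and apparatus
CN110442698B (zh) Dialogue content generation method and system
CN111161734A (zh) Voice interaction method and apparatus based on a specified scenario
CN107395493B (zh) Method and apparatus for sharing messages based on Intent
CN110035308A (zh) Data processing method, device, and storage medium
CN112786031B (zh) Human-machine dialogue method and system
CN104954538B (zh) Information processing method and electronic device
CN113709506A (zh) Cloud-phone-based multimedia playback method, apparatus, medium, and program product
CN113271385A (zh) Call forwarding method
CN111091303A (zh) Skill customization method and apparatus
WO2018118725A1 (en) Supplementing telephony calls with conversational bots
CN114466320A (zh) Session negotiation method and apparatus, electronic device, and computer-readable medium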

Legal Events

Date Code Title Description
A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20220801

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20220801

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20221122

A601 Written request for extension of time

Free format text: JAPANESE INTERMEDIATE CODE: A601

Effective date: 20230216

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20230327

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20230620

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20230706

R150 Certificate of patent or registration of utility model

Ref document number: 7311707

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150