JP2013101248A - 音声制御装置、音声制御方法、およびプログラム - Google Patents

音声制御装置、音声制御方法、およびプログラム Download PDF

Info

Publication number
JP2013101248A
JP2013101248A JP2011245357A JP2011245357A JP2013101248A JP 2013101248 A JP2013101248 A JP 2013101248A JP 2011245357 A JP2011245357 A JP 2011245357A JP 2011245357 A JP2011245357 A JP 2011245357A JP 2013101248 A JP2013101248 A JP 2013101248A
Authority
JP
Japan
Prior art keywords
information
voice control
output
tag information
control device
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP2011245357A
Other languages
English (en)
Japanese (ja)
Other versions
JP2013101248A5 (enExample
Inventor
Maki Mori
麻紀 森
Shunichi Kasahara
俊一 笠原
Shu Shigeta
脩 繁田
Seiji Suzuki
誠司 鈴木
Ryo Fukazawa
遼 深澤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp filed Critical Sony Corp
Priority to JP2011245357A priority Critical patent/JP2013101248A/ja
Priority to EP12848137.1A priority patent/EP2777040B1/en
Priority to US14/353,856 priority patent/US9299349B2/en
Priority to CN201280053462.9A priority patent/CN103918284B/zh
Priority to PCT/JP2012/005291 priority patent/WO2013069178A1/en
Publication of JP2013101248A publication Critical patent/JP2013101248A/ja
Publication of JP2013101248A5 publication Critical patent/JP2013101248A5/ja
Priority to US15/046,578 priority patent/US9557962B2/en
Priority to US15/376,052 priority patent/US9830128B2/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/167Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/002D [Two Dimensional] image generation
    • G06T11/60Editing figures and text; Combining figures or text
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00Manipulating 3D models or images for computer graphics
    • G06T19/006Mixed reality
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00Signal processing covered by H04R, not provided for in its groups
    • H04R2430/01Aspects of volume control, not necessarily automatic, in sound systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/07Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2400/00Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13Aspects of volume control, not necessarily automatic, in stereophonic sound systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Library & Information Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
JP2011245357A 2011-11-09 2011-11-09 音声制御装置、音声制御方法、およびプログラム Pending JP2013101248A (ja)

Priority Applications (7)

Application Number Priority Date Filing Date Title
JP2011245357A JP2013101248A (ja) 2011-11-09 2011-11-09 音声制御装置、音声制御方法、およびプログラム
EP12848137.1A EP2777040B1 (en) 2011-11-09 2012-08-23 Voice control device, voice control method and program
US14/353,856 US9299349B2 (en) 2011-11-09 2012-08-23 Voice control device, voice control method and program
CN201280053462.9A CN103918284B (zh) 2011-11-09 2012-08-23 语音控制装置、语音控制方法和程序
PCT/JP2012/005291 WO2013069178A1 (en) 2011-11-09 2012-08-23 Voice control device, voice control method and program
US15/046,578 US9557962B2 (en) 2011-11-09 2016-02-18 Voice control device, voice control method and program
US15/376,052 US9830128B2 (en) 2011-11-09 2016-12-12 Voice control device, voice control method and program

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2011245357A JP2013101248A (ja) 2011-11-09 2011-11-09 音声制御装置、音声制御方法、およびプログラム

Related Child Applications (1)

Application Number Title Priority Date Filing Date
JP2016104296A Division JP6206537B2 (ja) 2016-05-25 2016-05-25 携帯端末、情報処理装置、およびプログラム

Publications (2)

Publication Number Publication Date
JP2013101248A true JP2013101248A (ja) 2013-05-23
JP2013101248A5 JP2013101248A5 (enExample) 2014-11-06

Family

ID=48288955

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2011245357A Pending JP2013101248A (ja) 2011-11-09 2011-11-09 音声制御装置、音声制御方法、およびプログラム

Country Status (5)

Country Link
US (3) US9299349B2 (enExample)
EP (1) EP2777040B1 (enExample)
JP (1) JP2013101248A (enExample)
CN (1) CN103918284B (enExample)
WO (1) WO2013069178A1 (enExample)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20150060507A (ko) 2013-11-25 2015-06-03 아즈빌주식회사 플랜트 감시 서버 및 플랜트 감시 방법
JP2016021169A (ja) * 2014-07-15 2016-02-04 Kddi株式会社 仮想音源を提供情報位置に配置する携帯端末、音声提示プログラム及び音声提示方法
JP2016194612A (ja) * 2015-03-31 2016-11-17 株式会社ニデック 視覚認識支援装置および視覚認識支援プログラム
JP2017079457A (ja) * 2015-10-19 2017-04-27 このみ 一色 携帯情報端末、情報処理装置、及びプログラム
WO2018155026A1 (ja) 2017-02-27 2018-08-30 ソニー株式会社 情報処理装置、情報処理方法、及びプログラム
JPWO2018190099A1 (ja) * 2017-04-10 2020-02-27 ヤマハ株式会社 音声提供装置、音声提供方法及びプログラム
WO2021260848A1 (ja) * 2020-06-24 2021-12-30 日本電信電話株式会社 学習装置、学習方法及び学習プログラム
JPWO2022014308A1 (enExample) * 2020-07-15 2022-01-20
JPWO2023195048A1 (enExample) * 2022-04-04 2023-10-12

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20160015512A (ko) * 2014-07-30 2016-02-15 에스케이플래닛 주식회사 비콘 신호 기반 스탬프 서비스 제공 방법
CN107154265A (zh) * 2017-03-30 2017-09-12 联想(北京)有限公司 一种采集控制方法及电子设备
JP6907788B2 (ja) * 2017-07-28 2021-07-21 富士フイルムビジネスイノベーション株式会社 情報処理装置及びプログラム
KR102530669B1 (ko) * 2020-10-07 2023-05-09 네이버 주식회사 앱과 웹의 연동을 통해 음성 파일에 대한 메모를 작성하는 방법, 시스템, 및 컴퓨터 판독가능한 기록 매체
CN113707165B (zh) * 2021-09-07 2024-09-17 联想(北京)有限公司 音频处理方法、装置及电子设备和存储介质
EP4614276A1 (en) * 2023-11-20 2025-09-10 Samsung Electronics Co., Ltd Electronic device comprising speaker module

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63260253A (ja) * 1987-04-17 1988-10-27 Hitachi Ltd 音声応答方式
JPH0566131A (ja) * 1991-09-09 1993-03-19 Sumitomo Electric Ind Ltd 音声案内装置
JP2002023778A (ja) * 2000-06-30 2002-01-25 Canon Inc 音声合成装置、音声合成システム、音声合成方法及び記憶媒体
JP2006155627A (ja) * 2005-12-05 2006-06-15 Sony Corp シミュレーションシステム、仮想空間提供装置および方法、並びにユーザ端末装置および仮想空間画像生成方法
JP2010103756A (ja) * 2008-10-23 2010-05-06 Nissan Motor Co Ltd 音声出力装置および音声出力方法
JP2011197477A (ja) * 2010-03-19 2011-10-06 Sony Corp 情報処理装置、情報処理方法、およびプログラム

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61234355A (ja) 1985-04-11 1986-10-18 Terumo Corp 超音波測定方法およびその装置
JP2783212B2 (ja) 1995-09-08 1998-08-06 日本電気株式会社 情報提示装置
JP3309735B2 (ja) 1996-10-24 2002-07-29 三菱電機株式会社 音声マンマシンインタフェース装置
JP2003521202A (ja) * 2000-01-28 2003-07-08 レイク テクノロジー リミティド 地理的な環境で使用される空間オーディオシステム。
JP2002023787A (ja) 2000-07-06 2002-01-25 Canon Inc 音声合成装置、音声合成システム、音声合成方法及び記憶媒体
US7031924B2 (en) 2000-06-30 2006-04-18 Canon Kabushiki Kaisha Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium
JP2006059136A (ja) 2004-08-20 2006-03-02 Seiko Epson Corp ビューア装置及びそのプログラム
JP2006091390A (ja) 2004-09-24 2006-04-06 Mitsubishi Electric Corp 情報表示システム及び情報表示方法及び情報表示方法をコンピュータに実行させるためのプログラム及び情報表示端末装置
JP4861105B2 (ja) 2006-09-15 2012-01-25 株式会社エヌ・ティ・ティ・ドコモ 空間掲示板システム
JP2008217133A (ja) 2007-02-28 2008-09-18 Nec Corp 地域情報案内システム、地域情報配信システム、地域情報配信プログラム、地域情報案内方法
JP2009140402A (ja) 2007-12-10 2009-06-25 Nippon Telegr & Teleph Corp <Ntt> 情報表示装置、情報表示方法、情報表示プログラム及び情報表示プログラムを記録した記録媒体
US8170222B2 (en) 2008-04-18 2012-05-01 Sony Mobile Communications Ab Augmented reality enhanced audio
US20090315775A1 (en) * 2008-06-20 2009-12-24 Microsoft Corporation Mobile computing services based on devices with dynamic direction information
JP2010049158A (ja) 2008-08-25 2010-03-04 Ricoh Co Ltd 画像処理装置
EP2214425A1 (en) * 2009-01-28 2010-08-04 Auralia Emotive Media Systems S.L. Binaural audio guide
JP4911389B2 (ja) 2009-09-30 2012-04-04 Necビッグローブ株式会社 情報表示システム、サーバ、端末、及び方法
JP5293571B2 (ja) 2009-11-17 2013-09-18 日産自動車株式会社 情報提供装置及び方法
KR101099137B1 (ko) * 2010-01-29 2011-12-27 주식회사 팬택 이동 통신 시스템에서 증강 현실 정보를 제공하기 위한 장치 및 방법

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS63260253A (ja) * 1987-04-17 1988-10-27 Hitachi Ltd 音声応答方式
JPH0566131A (ja) * 1991-09-09 1993-03-19 Sumitomo Electric Ind Ltd 音声案内装置
JP2002023778A (ja) * 2000-06-30 2002-01-25 Canon Inc 音声合成装置、音声合成システム、音声合成方法及び記憶媒体
JP2006155627A (ja) * 2005-12-05 2006-06-15 Sony Corp シミュレーションシステム、仮想空間提供装置および方法、並びにユーザ端末装置および仮想空間画像生成方法
JP2010103756A (ja) * 2008-10-23 2010-05-06 Nissan Motor Co Ltd 音声出力装置および音声出力方法
JP2011197477A (ja) * 2010-03-19 2011-10-06 Sony Corp 情報処理装置、情報処理方法、およびプログラム

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20150060507A (ko) 2013-11-25 2015-06-03 아즈빌주식회사 플랜트 감시 서버 및 플랜트 감시 방법
KR101650629B1 (ko) * 2013-11-25 2016-08-23 아즈빌주식회사 플랜트 감시 서버 및 플랜트 감시 방법
JP2016021169A (ja) * 2014-07-15 2016-02-04 Kddi株式会社 仮想音源を提供情報位置に配置する携帯端末、音声提示プログラム及び音声提示方法
JP2016194612A (ja) * 2015-03-31 2016-11-17 株式会社ニデック 視覚認識支援装置および視覚認識支援プログラム
JP2017079457A (ja) * 2015-10-19 2017-04-27 このみ 一色 携帯情報端末、情報処理装置、及びプログラム
US11205426B2 (en) 2017-02-27 2021-12-21 Sony Corporation Information processing device, information processing method, and program
KR20190121758A (ko) 2017-02-27 2019-10-28 소니 주식회사 정보 처리 장치, 정보 처리 방법, 및 프로그램
WO2018155026A1 (ja) 2017-02-27 2018-08-30 ソニー株式会社 情報処理装置、情報処理方法、及びプログラム
JPWO2018190099A1 (ja) * 2017-04-10 2020-02-27 ヤマハ株式会社 音声提供装置、音声提供方法及びプログラム
WO2021260848A1 (ja) * 2020-06-24 2021-12-30 日本電信電話株式会社 学習装置、学習方法及び学習プログラム
JPWO2022014308A1 (enExample) * 2020-07-15 2022-01-20
WO2022014308A1 (ja) * 2020-07-15 2022-01-20 ソニーグループ株式会社 情報処理装置、情報処理方法および端末装置
JP7711708B2 (ja) 2020-07-15 2025-07-23 ソニーグループ株式会社 情報処理装置および情報処理方法
US12425790B2 (en) 2020-07-15 2025-09-23 Sony Group Corporation Information processing apparatus, information processing method, and terminal device
JPWO2023195048A1 (enExample) * 2022-04-04 2023-10-12
WO2023195048A1 (ja) * 2022-04-04 2023-10-12 マクセル株式会社 音声拡張現実オブジェクト再生装置、情報端末システム
JP7781260B2 (ja) 2022-04-04 2025-12-05 マクセル株式会社 音声拡張現実オブジェクト再生装置、情報端末システム

Also Published As

Publication number Publication date
US9830128B2 (en) 2017-11-28
US9299349B2 (en) 2016-03-29
EP2777040A4 (en) 2015-09-09
US20160210118A1 (en) 2016-07-21
EP2777040A1 (en) 2014-09-17
US20140297289A1 (en) 2014-10-02
EP2777040B1 (en) 2018-12-12
US20170123758A1 (en) 2017-05-04
WO2013069178A1 (en) 2013-05-16
CN103918284A (zh) 2014-07-09
CN103918284B (zh) 2017-02-15
US9557962B2 (en) 2017-01-31

Similar Documents

Publication Publication Date Title
JP2013101248A (ja) 音声制御装置、音声制御方法、およびプログラム
CN110868639B (zh) 视频合成方法及装置
WO2021213120A1 (zh) 投屏方法、装置和电子设备
KR101606727B1 (ko) 휴대 단말기 및 그 동작 방법
CN115428413A (zh) 一种通知处理方法、电子设备和系统
US10231067B2 (en) Hearing aid adjustment via mobile device
CN110061900B (zh) 消息显示方法、装置、终端及计算机可读存储介质
CN107636541B (zh) 计算设备上的方法、用于闹铃的系统和机器可读介质
CN110798327B (zh) 消息处理方法、设备及存储介质
JP7067484B2 (ja) 情報処理装置、情報処理方法、プログラム、および情報処理システム
CN114371824B (zh) 一种音频处理方法、系统及相关装置
CN114697732A (zh) 一种拍摄方法、系统及电子设备
CN113170279A (zh) 基于低功耗蓝牙的通信方法及相关装置
CN109068008A (zh) 铃声设置方法、装置、终端及存储介质
US8862058B2 (en) Systems and methods for reducing electromagnetic radiation emitted from a wireless headset
CN110377200A (zh) 分享数据生成方法、装置及存储介质
CN109451168A (zh) 生成铃声文件的方法、装置及存储介质
CN115291779A (zh) 一种窗口控制方法及其设备
JP2015090593A (ja) 情報処理装置、情報処理方法および情報処理システム
JP6206537B2 (ja) 携帯端末、情報処理装置、およびプログラム
WO2016158003A1 (ja) 情報処理装置、情報処理方法及びコンピュータプログラム
CN113301444A (zh) 视频处理方法、装置、电子设备及存储介质
US12019941B2 (en) Information processing apparatus and information processing method
CN114461938B (zh) 信息显示方法、装置、电子设备、存储介质及产品
CN109660450B (zh) 消息自动回复方法、装置、终端、服务器及存储介质

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20140919

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20140919

A131 Notification of reasons for refusal

Free format text: JAPANESE INTERMEDIATE CODE: A131

Effective date: 20151027

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20151125

A02 Decision of refusal

Free format text: JAPANESE INTERMEDIATE CODE: A02

Effective date: 20160412