CN103918284B - Voice control device, voice control method and program - Google Patents

Voice control device, voice control method and program

Info

Publication number
CN103918284B
Authority
CN
China
Prior art keywords
sound
tag information
information
mobile terminal
output
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201280053462.9A
Other languages
English (en)
Chinese (zh)
Other versions
CN103918284A (zh)
Inventor
森麻纪
笠原俊一
繁田脩
铃木诚司
深泽辽
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp
Publication of CN103918284A
Application granted
Publication of CN103918284B
Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16 Sound input; Sound output
    • G06F3/167 Audio in a user interface, e.g. using voice commands for navigating, audio feedback
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • H04S7/302 Electronic adaptation of stereophonic sound system to listener position or orientation
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G06T11/60 Editing figures and text; Combining figures or text
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G PHYSICS
    • G06 COMPUTING OR CALCULATING; COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T19/00 Manipulating 3D models or images for computer graphics
    • G06T19/006 Mixed reality
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2430/00 Signal processing covered by H04R, not provided for in its groups
    • H04R2430/01 Aspects of volume control, not necessarily automatic, in sound systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2460/00 Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
    • H04R2460/07 Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/13 Aspects of volume control, not necessarily automatic, in stereophonic sound systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Library & Information Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Telephone Function (AREA)
  • User Interface Of Digital Computer (AREA)
  • Stereophonic System (AREA)
  • Circuit For Audible Band Transducer (AREA)
CN201280053462.9A 2011-11-09 2012-08-23 Voice control device, voice control method and program Expired - Fee Related CN103918284B (zh)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2011245357A JP2013101248A (ja) 2011-11-09 2011-11-09 Voice control device, voice control method, and program
JP2011-245357 2011-11-09
PCT/JP2012/005291 WO2013069178A1 (en) 2011-11-09 2012-08-23 Voice control device, voice control method and program

Publications (2)

Publication Number Publication Date
CN103918284A (zh) 2014-07-09
CN103918284B (zh) 2017-02-15

Family

ID=48288955

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201280053462.9A Expired - Fee Related CN103918284B (zh) 2011-11-09 2012-08-23 Voice control device, voice control method and program

Country Status (5)

Country Link
US (3) US9299349B2 (en)
EP (1) EP2777040B1 (en)
JP (1) JP2013101248A (en)
CN (1) CN103918284B (en)
WO (1) WO2013069178A1 (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
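JP6231362B2 (ja) 2013-11-25 2017-11-15 アズビル株式会社 Plant monitoring server and plant monitoring method
JP6263098B2 (ja) * 2014-07-15 2018-01-17 Kddi株式会社 Mobile terminal that places a virtual sound source at the position of provided information, voice presentation program, and voice presentation method
KR20160015512A (ko) * 2014-07-30 2016-02-15 에스케이플래닛 주식회사 Method for providing a stamp service based on beacon signals
JP2016194612A (ja) * 2015-03-31 2016-11-17 株式会社ニデック Visual recognition support device and visual recognition support program
JP6651231B2 (ja) * 2015-10-19 2020-02-19 このみ 一色 Portable information terminal, information processing device, and program
CN110326300B (zh) * 2017-02-27 2021-12-21 索尼公司 Information processing device, information processing method, and computer-readable storage medium
CN107154265A (zh) * 2017-03-30 2017-09-12 联想(北京)有限公司 Acquisition control method and electronic device
WO2018190099A1 (ja) * 2017-04-10 2018-10-18 ヤマハ株式会社 Voice providing device, voice providing method, and program
JP6907788B2 (ja) * 2017-07-28 2021-07-21 富士フイルムビジネスイノベーション株式会社 Information processing device and program
JP7416245B2 (ja) * 2020-06-24 2024-01-17 日本電信電話株式会社 Learning device, learning method, and learning program
JP7711708B2 (ja) * 2020-07-15 2025-07-23 ソニーグループ株式会社 Information processing device and information processing method
KR102530669B1 (ko) * 2020-10-07 2023-05-09 네이버 주식회사 Method, system, and computer-readable recording medium for writing memos on a voice file through app-web linkage
CN113707165B (zh) * 2021-09-07 2024-09-17 联想(北京)有限公司 Audio processing method and apparatus, electronic device, and storage medium
CN118975274A (zh) * 2022-04-04 2024-11-15 麦克赛尔株式会社 Sound augmented reality object reproduction device and information terminal system
WO2025110423A1 (ko) * 2023-11-20 2025-05-30 삼성전자주식회사 Electronic device including a speaker module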

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2001055833A1 (en) * 2000-01-28 2001-08-02 Lake Technology Limited Spatialized audio system for use in a geographical environment
WO2009128859A1 (en) * 2008-04-18 2009-10-22 Sony Ericsson Mobile Communications Ab Augmented reality enhanced audio
CN102143429A (zh) * 2010-01-29 2011-08-03 株式会社泛泰 Server, mobile communication terminal, system, and method for providing augmented reality information

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS61234355A (ja) 1985-04-11 1986-10-18 Terumo Corp Ultrasonic measurement method and apparatus therefor
JPS63260253A (ja) * 1987-04-17 1988-10-27 Hitachi Ltd Voice response system
JPH0566131A (ja) * 1991-09-09 1993-03-19 Sumitomo Electric Ind Ltd Voice guidance device
JP2783212B2 (ja) 1995-09-08 1998-08-06 日本電気株式会社 Information presentation device
JP3309735B2 (ja) 1996-10-24 2002-07-29 三菱電機株式会社 Voice man-machine interface device
JP2002023787A (ja) 2000-07-06 2002-01-25 Canon Inc Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium
US7031924B2 (en) * 2000-06-30 2006-04-18 Canon Kabushiki Kaisha Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium
JP2002023778A (ja) * 2000-06-30 2002-01-25 Canon Inc Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium
JP2006059136A (ja) 2004-08-20 2006-03-02 Seiko Epson Corp Viewer device and program therefor
JP2006091390A (ja) 2004-09-24 2006-04-06 Mitsubishi Electric Corp Information display system, information display method, program for causing a computer to execute the information display method, and information display terminal device
JP3815509B2 (ja) * 2005-12-05 2006-08-30 ソニー株式会社 Simulation system, virtual space providing device and method, user terminal device, and virtual space image generation method
JP4861105B2 (ja) 2006-09-15 2012-01-25 株式会社エヌ・ティ・ティ・ドコモ Spatial bulletin board system
JP2008217133A (ja) 2007-02-28 2008-09-18 Nec Corp Regional information guidance system, regional information distribution system, regional information distribution program, and regional information guidance method
JP2009140402A (ja) 2007-12-10 2009-06-25 Nippon Telegr & Teleph Corp <Ntt> Information display device, information display method, information display program, and recording medium storing the information display program
US20090315775A1 (en) 2008-06-20 2009-12-24 Microsoft Corporation Mobile computing services based on devices with dynamic direction information
JP2010049158A (ja) 2008-08-25 2010-03-04 Ricoh Co Ltd Image processing device
JP2010103756A (ja) * 2008-10-23 2010-05-06 Nissan Motor Co Ltd Voice output device and voice output method
EP2214425A1 (en) 2009-01-28 2010-08-04 Auralia Emotive Media Systems S.L. Binaural audio guide
JP4911389B2 (ja) 2009-09-30 2012-04-04 Necビッグローブ株式会社 Information display system, server, terminal, and method
JP5293571B2 (ja) 2009-11-17 2013-09-18 日産自動車株式会社 Information providing device and method
JP6016322B2 (ja) * 2010-03-19 2016-10-26 ソニー株式会社 Information processing device, information processing method, and program

Also Published As

Publication number Publication date
EP2777040A4 (en) 2015-09-09
EP2777040A1 (en) 2014-09-17
US9830128B2 (en) 2017-11-28
US9557962B2 (en) 2017-01-31
WO2013069178A1 (en) 2013-05-16
US20140297289A1 (en) 2014-10-02
EP2777040B1 (en) 2018-12-12
US20160210118A1 (en) 2016-07-21
JP2013101248A (ja) 2013-05-23
US20170123758A1 (en) 2017-05-04
CN103918284A (zh) 2014-07-09
US9299349B2 (en) 2016-03-29

Similar Documents

Publication Publication Date Title
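CN103918284B (zh) Voice control device, voice control method and program
CN110868639B (zh) Video synthesis method and apparatus
US9558591B2 (en) Method of providing augmented reality and terminal supporting the same
CN113965807B (zh) Message push method and apparatus, terminal, server, and storage medium
CN110061900B (zh) Message display method and apparatus, terminal, and computer-readable storage medium
CN105450736B (zh) Method and apparatus for connecting to virtual reality
KR101864892B1 (ko) Apparatus and method for providing a user's search pattern in a portable terminal
CN115525383B (zh) Wallpaper display method and apparatus, mobile terminal, and storage medium
CN112764608B (zh) Message processing method, apparatus, device, and storage medium
KR20160015727A (ko) Method and apparatus for visualizing music information
CN110795007A (zh) Method and apparatus for obtaining screenshot information
CN114186083B (zh) Information display method and apparatus, terminal, server, and storage medium
CN109218982A (zh) Scenic spot information acquisition method and apparatus, mobile terminal, and storage medium
CN112996042A (zh) Network acceleration method, terminal device, server, and storage medium
US20190265798A1 (en) Information processing apparatus, information processing method, program, and information processing system
CN107356261A (zh) Navigation method and related products
CN110113659A (zh) Method and apparatus for generating video, electronic device, and medium
CN110798327A (zh) Message processing method, device, and storage medium
WO2017050090A1 (zh) Method and device for generating a GIF file, and computer-readable storage medium
JPWO2014103544A1 (ja) Display control device, display control method, and recording medium
JP6206537B2 (ja) Mobile terminal, information processing device, and program
CN113301444A (zh) Video processing method and apparatus, electronic device, and storage medium
CN114464171B (zh) Audio segmentation method and apparatus, electronic device, storage medium, and product
CN115686421B (zh) Image display and image processing methods, apparatus, and device
CN118093068A (zh) Method, apparatus, and device for sharing multimedia resources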

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170215
