CN103918284B - Voice control device, voice control method and program - Google Patents
Voice control device, voice control method and program
- Publication number
- CN103918284B (application number CN201280053462.9A)
- Authority
- CN
- China
- Prior art keywords
- sound
- tag information
- information
- mobile terminal
- output
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/302—Electronic adaptation of stereophonic sound system to listener position or orientation
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/686—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0488—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/60—Editing figures and text; Combining figures or text
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L13/00—Speech synthesis; Text to speech systems
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification techniques
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G06T19/006—Mixed reality
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- H04R2430/01—Aspects of volume control, not necessarily automatic, in sound systems
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2460/00—Details of hearing devices, i.e. of ear- or headphones covered by H04R1/10 or H04R5/033 but not provided for in any of their subgroups, or of hearing aids covered by H04R25/00 but not provided for in any of its subgroups
- H04R2460/07—Use of position data from wide-area or local-area positioning systems in hearing devices, e.g. program or information selection
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/11—Positioning of individual sound objects, e.g. moving airplane, within a sound field
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2400/00—Details of stereophonic systems covered by H04S but not provided for in its groups
- H04S2400/13—Aspects of volume control, not necessarily automatic, in stereophonic sound systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Multimedia (AREA)
- Human Computer Interaction (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Signal Processing (AREA)
- Computational Linguistics (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Library & Information Science (AREA)
- General Health & Medical Sciences (AREA)
- Telephone Function (AREA)
- User Interface Of Digital Computer (AREA)
- Stereophonic System (AREA)
- Circuit For Audible Band Transducer (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2011245357A JP2013101248A (ja) | 2011-11-09 | 2011-11-09 | Voice control device, voice control method, and program |
| JP2011-245357 | 2011-11-09 | | |
| PCT/JP2012/005291 WO2013069178A1 (en) | 2011-11-09 | 2012-08-23 | Voice control device, voice control method and program |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN103918284A (zh) | 2014-07-09 |
| CN103918284B (zh) | 2017-02-15 |
Family
ID=48288955
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201280053462.9A Expired - Fee Related CN103918284B (zh) | 2011-11-09 | 2012-08-23 | Voice control device, voice control method and program |
Country Status (5)
| Country | Link |
|---|---|
| US (3) | US9299349B2 (en) |
| EP (1) | EP2777040B1 (en) |
| JP (1) | JP2013101248A (en) |
| CN (1) | CN103918284B (en) |
| WO (1) | WO2013069178A1 (en) |
Families Citing this family (15)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
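| JP6231362B2 (ja) | 2013-11-25 | 2017-11-15 | アズビル株式会社 | Plant monitoring server and plant monitoring method |
| JP6263098B2 (ja) * | 2014-07-15 | 2018-01-17 | Kddi株式会社 | Mobile terminal that places a virtual sound source at the position of provided information, voice presentation program, and voice presentation method |
| KR20160015512A (ko) * | 2014-07-30 | 2016-02-15 | 에스케이플래닛 주식회사 | Method for providing a stamp service based on beacon signals |
| JP2016194612A (ja) * | 2015-03-31 | 2016-11-17 | 株式会社ニデック | Visual recognition support device and visual recognition support program |
| JP6651231B2 (ja) * | 2015-10-19 | 2020-02-19 | このみ 一色 | Portable information terminal, information processing device, and program |
| CN110326300B (zh) * | 2017-02-27 | 2021-12-21 | 索尼公司 | Information processing device, information processing method, and computer-readable storage medium |
| CN107154265A (zh) * | 2017-03-30 | 2017-09-12 | 联想(北京)有限公司 | Acquisition control method and electronic device |
| WO2018190099A1 (ja) * | 2017-04-10 | 2018-10-18 | ヤマハ株式会社 | Voice providing device, voice providing method, and program |
| JP6907788B2 (ja) * | 2017-07-28 | 2021-07-21 | 富士フイルムビジネスイノベーション株式会社 | Information processing device and program |
| JP7416245B2 (ja) * | 2020-06-24 | 2024-01-17 | 日本電信電話株式会社 | Learning device, learning method, and learning program |
| JP7711708B2 (ja) * | 2020-07-15 | 2025-07-23 | ソニーグループ株式会社 | Information processing device and information processing method |
| KR102530669B1 (ko) * | 2020-10-07 | 2023-05-09 | 네이버 주식회사 | Method, system, and computer-readable recording medium for writing a memo for a voice file through linkage between an app and the web |
| CN113707165B (zh) * | 2021-09-07 | 2024-09-17 | 联想(北京)有限公司 | Audio processing method and apparatus, electronic device, and storage medium |
| CN118975274A (zh) * | 2022-04-04 | 2024-11-15 | 麦克赛尔株式会社 | Audio augmented reality object reproduction device and information terminal system |
| WO2025110423A1 (ko) * | 2023-11-20 | 2025-05-30 | 삼성전자주식회사 | Electronic device including a speaker module |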
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| WO2001055833A1 (en) * | 2000-01-28 | 2001-08-02 | Lake Technology Limited | Spatialized audio system for use in a geographical environment |
| WO2009128859A1 (en) * | 2008-04-18 | 2009-10-22 | Sony Ericsson Mobile Communications Ab | Augmented reality enhanced audio |
| CN102143429A (zh) * | 2010-01-29 | 2011-08-03 | 株式会社泛泰 | Server, mobile communication terminal, system, and method for providing augmented reality information |
Family Cites Families (21)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPS61234355A (ja) | 1985-04-11 | 1986-10-18 | Terumo Corp | Ultrasonic measurement method and apparatus therefor |
| JPS63260253A (ja) * | 1987-04-17 | 1988-10-27 | Hitachi Ltd | Voice response system |
| JPH0566131A (ja) * | 1991-09-09 | 1993-03-19 | Sumitomo Electric Ind Ltd | Voice guidance device |
| JP2783212B2 (ja) | 1995-09-08 | 1998-08-06 | 日本電気株式会社 | Information presentation device |
| JP3309735B2 (ja) | 1996-10-24 | 2002-07-29 | 三菱電機株式会社 | Voice man-machine interface device |
| JP2002023787A (ja) | 2000-07-06 | 2002-01-25 | Canon Inc | Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium |
| US7031924B2 (en) * | 2000-06-30 | 2006-04-18 | Canon Kabushiki Kaisha | Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium |
| JP2002023778A (ja) * | 2000-06-30 | 2002-01-25 | Canon Inc | Voice synthesizing apparatus, voice synthesizing system, voice synthesizing method and storage medium |
| JP2006059136A (ja) | 2004-08-20 | 2006-03-02 | Seiko Epson Corp | Viewer device and program therefor |
| JP2006091390A (ja) | 2004-09-24 | 2006-04-06 | Mitsubishi Electric Corp | Information display system, information display method, program for causing a computer to execute the information display method, and information display terminal device |
| JP3815509B2 (ja) * | 2005-12-05 | 2006-08-30 | ソニー株式会社 | Simulation system, virtual space providing device and method, user terminal device, and virtual space image generation method |
| JP4861105B2 (ja) | 2006-09-15 | 2012-01-25 | 株式会社エヌ・ティ・ティ・ドコモ | Spatial bulletin board system |
| JP2008217133A (ja) | 2007-02-28 | 2008-09-18 | Nec Corp | Regional information guidance system, regional information distribution system, regional information distribution program, and regional information guidance method |
| JP2009140402A (ja) | 2007-12-10 | 2009-06-25 | Nippon Telegr & Teleph Corp <Ntt> | Information display device, information display method, information display program, and recording medium storing the information display program |
| US20090315775A1 (en) | 2008-06-20 | 2009-12-24 | Microsoft Corporation | Mobile computing services based on devices with dynamic direction information |
| JP2010049158A (ja) | 2008-08-25 | 2010-03-04 | Ricoh Co Ltd | Image processing device |
| JP2010103756A (ja) * | 2008-10-23 | 2010-05-06 | Nissan Motor Co Ltd | Voice output device and voice output method |
| EP2214425A1 (en) | 2009-01-28 | 2010-08-04 | Auralia Emotive Media Systems S.L. | Binaural audio guide |
| JP4911389B2 (ja) | 2009-09-30 | 2012-04-04 | Necビッグローブ株式会社 | Information display system, server, terminal, and method |
| JP5293571B2 (ja) | 2009-11-17 | 2013-09-18 | 日産自動車株式会社 | Information providing device and method |
| JP6016322B2 (ja) * | 2010-03-19 | 2016-10-26 | ソニー株式会社 | Information processing device, information processing method, and program |
-
2011
- 2011-11-09 JP JP2011245357A patent/JP2013101248A/ja active Pending
-
2012
- 2012-08-23 WO PCT/JP2012/005291 patent/WO2013069178A1/en not_active Ceased
- 2012-08-23 CN CN201280053462.9A patent/CN103918284B/zh not_active Expired - Fee Related
- 2012-08-23 EP EP12848137.1A patent/EP2777040B1/en active Active
- 2012-08-23 US US14/353,856 patent/US9299349B2/en active Active
-
2016
- 2016-02-18 US US15/046,578 patent/US9557962B2/en not_active Expired - Fee Related
- 2016-12-12 US US15/376,052 patent/US9830128B2/en active Active
Also Published As
| Publication number | Publication date |
|---|---|
| EP2777040A4 (en) | 2015-09-09 |
| EP2777040A1 (en) | 2014-09-17 |
| US9830128B2 (en) | 2017-11-28 |
| US9557962B2 (en) | 2017-01-31 |
| WO2013069178A1 (en) | 2013-05-16 |
| US20140297289A1 (en) | 2014-10-02 |
| EP2777040B1 (en) | 2018-12-12 |
| US20160210118A1 (en) | 2016-07-21 |
| JP2013101248A (ja) | 2013-05-23 |
| US20170123758A1 (en) | 2017-05-04 |
| CN103918284A (zh) | 2014-07-09 |
| US9299349B2 (en) | 2016-03-29 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
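| CN103918284B (zh) | | Voice control device, voice control method and program |
| CN110868639B (zh) | | Video synthesis method and apparatus |
| US9558591B2 (en) | | Method of providing augmented reality and terminal supporting the same |
| CN113965807B (zh) | | Message push method, apparatus, terminal, server, and storage medium |
| CN110061900B (zh) | | Message display method, apparatus, terminal, and computer-readable storage medium |
| CN105450736B (zh) | | Method and apparatus for connecting to virtual reality |
| KR101864892B1 (ko) | | Apparatus and method for providing a user's search pattern in a portable terminal |
| CN115525383B (zh) | | Wallpaper display method, apparatus, mobile terminal, and storage medium |
| CN112764608B (zh) | | Message processing method, apparatus, device, and storage medium |
| KR20160015727A (ko) | | Method and apparatus for visualizing music information |
| CN110795007A (zh) | | Method and apparatus for acquiring screenshot information |
| CN114186083B (zh) | | Information display method, apparatus, terminal, server, and storage medium |
| CN109218982A (zh) | | Scenic spot information acquisition method, apparatus, mobile terminal, and storage medium |
| CN112996042A (zh) | | Network acceleration method, terminal device, server, and storage medium |
| US20190265798A1 (en) | | Information processing apparatus, information processing method, program, and information processing system |
| CN107356261A (zh) | | Navigation method and related products |
| CN110113659A (zh) | | Method, apparatus, electronic device, and medium for generating video |
| CN110798327A (zh) | | Message processing method, device, and storage medium |
| WO2017050090A1 (zh) | | Method and device for generating a GIF file, and computer-readable storage medium |
| JPWO2014103544A1 (ja) | | Display control device, display control method, and recording medium |
| JP6206537B2 (ja) | | Mobile terminal, information processing device, and program |
| CN113301444A (zh) | | Video processing method, apparatus, electronic device, and storage medium |
| CN114464171B (zh) | | Audio segmentation method, apparatus, electronic device, storage medium, and product |
| CN115686421B (zh) | | Image display and image processing methods, apparatus, and device |
| CN118093068A (zh) | | Method, apparatus, and device for sharing multimedia resources |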
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | C06 | Publication | |
| | PB01 | Publication | |
| | C10 | Entry into substantive examination | |
| | SE01 | Entry into force of request for substantive examination | |
| | C14 | Grant of patent or utility model | |
| | GR01 | Patent grant | |
| | CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date: 20170215 |