JP2023531849A - Augmented reality device performing audio recognition and control method therefor
- Publication number
- JP2023531849A (application JP2022554571A)
- Authority
- JP
- Japan
- Prior art keywords
- audio signal
- display unit
- see
- augmented reality
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T19/00—Manipulating 3D models or images for computer graphics
- G06T19/006—Mixed reality
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/167—Audio in a user interface, e.g. using voice commands for navigating, audio feedback
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/40—Visual indication of stereophonic sound image
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T2215/00—Indexing scheme for image rendering
- G06T2215/16—Using real world measurements to influence rendering
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04R—LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
- H04R2430/00—Signal processing covered by H04R, not provided for in its groups
- H04R2430/20—Processing of the output signals of the acoustic transducers of an array for obtaining a desired directivity characteristic
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/11—Application of ambisonics in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Theoretical Computer Science (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Human Computer Interaction (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Computer Graphics (AREA)
- Computer Hardware Design (AREA)
- Software Systems (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
- Processing Or Creating Images (AREA)
Claims (7)
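- Claim 1: An augmented reality device comprising: a see-through display unit that is formed so that the user's eyes can see through it and that outputs image information of a virtual object; an audio input unit that receives an audio signal generated within a preset distance from the display unit; and a control unit that identifies event information corresponding to the audio signal and controls the operation of the see-through display unit so that image information of a virtual object corresponding to the identified event information is output.
- Claim 2: The augmented reality device according to claim 1, wherein the image information includes at least one of text, an image, and position information related to the audio signal.
- Claim 3: The augmented reality device according to claim 1, wherein the control unit detects the position of the point at which the audio signal was generated and controls the operation of the see-through display unit so that the image information includes information related to the detected position.
- Claim 4: The augmented reality device according to claim 3, wherein the control unit detects the direction from which the audio signal was input, based on the direction in which a part of the see-through display unit is oriented, and controls the operation of the see-through display unit so that the image information includes information related to the detected direction.
- Claim 5: The augmented reality device according to claim 1, wherein, when the position of the point at which the audio signal was generated is outside the visible region seen by the user through the see-through display unit, the control unit controls the operation of the see-through display unit so that image information related to the position is output.
- Claim 6: The augmented reality device according to claim 5, wherein, when the position of the point at which the audio signal was generated is within the visible region, the control unit controls the operation of the see-through display unit so that the image information of the virtual object overlaps the part of the screen of the see-through display unit onto which the position is projected.
- Claim 7: The augmented reality device according to claim 1, wherein the input unit includes a camera that is provided on a part of the see-through display unit and captures images in one direction, and the control unit identifies at least one object included in the image information captured by the camera and matches the identified at least one object with the event information of the audio signal.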
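The display decision described in claims 5 and 6 can be illustrated with a minimal sketch. It is not the applicant's implementation: the names `AudioEvent` and `place_virtual_object`, the 40-degree horizontal field of view, the 1280-pixel screen width, and the pinhole projection are all assumptions made for illustration, and the event label stands in for the output of whatever audio recognizer the device uses.

```python
import math
from dataclasses import dataclass


@dataclass
class AudioEvent:
    """Hypothetical result of audio recognition (not defined in the patent)."""
    label: str          # identified event information, e.g. "dog barking"
    azimuth_deg: float  # source direction relative to where the display faces;
                        # 0 = straight ahead, positive = to the user's right


def normalize_angle(deg: float) -> float:
    """Wrap an angle in degrees into the range [-180, 180)."""
    return (deg + 180.0) % 360.0 - 180.0


def place_virtual_object(event: AudioEvent,
                         horizontal_fov_deg: float = 40.0,
                         screen_width_px: int = 1280) -> dict:
    """Decide how to show the event on a see-through display.

    Inside the visible region: overlay the virtual object where the source
    position projects onto the screen (cf. claim 6).
    Outside the visible region: show position-related information instead,
    here a left/right edge indicator (cf. claim 5).
    """
    azimuth = normalize_angle(event.azimuth_deg)
    half_fov = horizontal_fov_deg / 2.0

    if abs(azimuth) <= half_fov:
        # Assumed pinhole model: the focal length in pixels is chosen so that
        # the edges of the field of view map to the edges of the screen.
        focal_px = (screen_width_px / 2.0) / math.tan(math.radians(half_fov))
        x = screen_width_px / 2.0 + focal_px * math.tan(math.radians(azimuth))
        return {"mode": "overlay", "x_px": round(x), "text": event.label}

    side = "right" if azimuth > 0 else "left"
    return {"mode": "edge_indicator", "side": side,
            "text": f"{event.label} ({side})"}


if __name__ == "__main__":
    print(place_virtual_object(AudioEvent("dog barking", 0.0)))
    print(place_virtual_object(AudioEvent("car horn", 90.0)))
```

Only the horizontal direction is handled here. A real device would also need elevation and distance estimates, plus the microphone-array processing that produces the direction estimate in the first place.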
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
KR1020200034136A (KR102334091B1, ko) | 2020-03-20 | 2020-03-20 | Augmented reality device performing audio recognition and control method therefor |
KR10-2020-0034136 | 2020-03-20 | | |
PCT/KR2021/002497 (WO2021187771A1, ko) | 2020-03-20 | 2021-02-26 | Augmented reality device performing audio recognition and control method therefor |
Publications (1)
Publication Number | Publication Date |
---|---|
JP2023531849A (ja) | 2023-07-26 |
Family
ID=77771708
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022554571A (pending; published as JP2023531849A, ja) | 2020-03-20 | 2021-02-26 | Augmented reality device performing audio recognition and control method therefor |
Country Status (6)
Country | Link |
---|---|
US (1) | US20230145966A1 (ja) |
EP (1) | EP4124073A4 (ja) |
JP (1) | JP2023531849A (ja) |
KR (1) | KR102334091B1 (ja) |
CN (1) | CN115336291A (ja) |
WO (1) | WO2021187771A1 (ja) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7476128B2 (ja) | 2021-03-11 | 2024-04-30 | Hitachi, Ltd. | Display system and display device |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6829018B2 (en) * | 2001-09-17 | 2004-12-07 | Koninklijke Philips Electronics N.V. | Three-dimensional sound creation assisted by visual information |
US8183997B1 (en) * | 2011-11-14 | 2012-05-22 | Google Inc. | Displaying sound indications on a wearable computing system |
KR20130097855A (ko) * | 2012-02-27 | 2013-09-04 | Electronics and Telecommunications Research Institute (ETRI) | Augmented audio service system and method |
US9129430B2 (en) * | 2013-06-25 | 2015-09-08 | Microsoft Technology Licensing, Llc | Indicating out-of-view augmented reality images |
US20170277257A1 (en) * | 2016-03-23 | 2017-09-28 | Jeffrey Ota | Gaze-based sound selection |
US9906885B2 (en) * | 2016-07-15 | 2018-02-27 | Qualcomm Incorporated | Methods and systems for inserting virtual sounds into an environment |
CN117198277A (zh) * | 2016-08-12 | 2023-12-08 | Magic Leap, Inc. | Word flow annotation |
US10531220B2 (en) * | 2016-12-05 | 2020-01-07 | Magic Leap, Inc. | Distributed audio capturing techniques for virtual reality (VR), augmented reality (AR), and mixed reality (MR) systems |
Date | Country | Application Number | Publication | Status |
---|---|---|---|---|
2020-03-20 | KR | KR1020200034136A | KR102334091B1 (ko) | Active, IP Right Grant |
2021-02-26 | WO | PCT/KR2021/002497 | WO2021187771A1 (ko) | Active, Application Filing |
2021-02-26 | JP | JP2022554571A | JP2023531849A (ja) | Active, Pending |
2021-02-26 | EP | EP21770949.2A | EP4124073A4 (en) | Active, Pending |
2021-02-26 | US | US17/911,637 | US20230145966A1 (en) | Active, Pending |
2021-02-26 | CN | CN202180020138.6A | CN115336291A (zh) | Active, Pending |
Also Published As
Publication number | Publication date |
---|---|
KR102334091B1 (ko) | 2021-12-02 |
EP4124073A1 (en) | 2023-01-25 |
EP4124073A4 (en) | 2024-04-10 |
KR20210117654A (ko) | 2021-09-29 |
WO2021187771A1 (ko) | 2021-09-23 |
CN115336291A (zh) | 2022-11-11 |
US20230145966A1 (en) | 2023-05-11 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110647237B (zh) | | Gesture-based content sharing in an artificial reality environment |
US10395116B2 (en) | | Dynamically created and updated indoor positioning map |
CN110018736B (zh) | | Object augmentation via a near-eye display interface in artificial reality |
CN111630477A (zh) | | Device for providing augmented reality service and method of operating the same |
KR20150135847A (ko) | | Glass-type terminal and control method thereof |
US20200202161A1 (en) | | Information processing apparatus, information processing method, and program |
KR20160001178A (ko) | | Glass-type terminal and control method thereof |
US11869156B2 (en) | | Augmented reality eyewear with speech bubbles and translation |
US11887261B2 (en) | | Simulation object identity recognition method, related apparatus, and system |
KR102110208B1 (ko) | | Glasses-type terminal and control method thereof |
CN104281266A (zh) | | Head-mounted display device |
US20210217247A1 (en) | | Body pose message system |
WO2020012955A1 (ja) | | Information processing device, information processing method, and program |
US20230362573A1 (en) | | Audio enhanced augmented reality |
JP2016033611A (ja) | | Information providing system, display device, and method for controlling display device |
WO2020129029A2 (en) | | A system for generating an extended reality environment |
US11605396B2 (en) | | Image processing system and method |
CN111415421B (zh) | | Virtual object control method and apparatus, storage medium, and augmented reality device |
JP2023531849A (ja) | | Augmented reality device performing audio recognition and control method therefor |
KR20190018906A (ko) | | Mobile terminal for rendering virtual human organs based on augmented reality or virtual reality, and system using the same |
KR20160001229A (ko) | | Mobile terminal and control method thereof |
KR20210150881A (ko) | | Electronic device and operating method thereof |
US20230168522A1 (en) | | Eyewear with direction of sound arrival detection |
KR20160027813A (ko) | | Glass-type terminal |
KR20170111010A (ko) | | Video call system and method using virtual images, and video call relay server for performing the same |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
2022-09-07 | A621 | Written request for application examination | Free format text: JAPANESE INTERMEDIATE CODE: A621 |
2023-05-23 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2023-09-12 | A131 | Notification of reasons for refusal | Free format text: JAPANESE INTERMEDIATE CODE: A131 |
2023-11-20 | A711 | Notification of change in applicant | Free format text: JAPANESE INTERMEDIATE CODE: A711 |
2023-12-12 | A601 | Written request for extension of time | Free format text: JAPANESE INTERMEDIATE CODE: A601 |
2024-02-13 | A521 | Request for written amendment filed | Free format text: JAPANESE INTERMEDIATE CODE: A523 |
2024-05-14 | A02 | Decision of refusal | Free format text: JAPANESE INTERMEDIATE CODE: A02 |