CN113039816B - 信息处理装置、信息处理方法和信息处理程序 - Google Patents

信息处理装置、信息处理方法和信息处理程序 Download PDF

Info

Publication number
CN113039816B
CN113039816B CN201980065687.8A CN201980065687A CN113039816B CN 113039816 B CN113039816 B CN 113039816B CN 201980065687 A CN201980065687 A CN 201980065687A CN 113039816 B CN113039816 B CN 113039816B
Authority
CN
China
Prior art keywords
ear
image
information processing
hrtf
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201980065687.8A
Other languages
English (en)
Chinese (zh)
Other versions
CN113039816A (zh
Inventor
福田和巳
曲谷地哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Group Corp
Original Assignee
Sony Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Group Corp filed Critical Sony Group Corp
Priority to CN202310573805.4A priority Critical patent/CN116801179A/zh
Publication of CN113039816A publication Critical patent/CN113039816A/zh
Application granted granted Critical
Publication of CN113039816B publication Critical patent/CN113039816B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/302Electronic adaptation of stereophonic sound system to listener position or orientation
    • H04S7/303Tracking of listener position or orientation
    • H04S7/304For headphones
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/44Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/764Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/7715Feature extraction, e.g. by transforming the feature space, e.g. multi-dimensional scaling [MDS]; Mappings, e.g. subspace methods
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/82Arrangements for image or video recognition or understanding using pattern recognition or machine learning using neural networks
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/60Type of objects
    • G06V20/64Three-dimensional [3D] objects
    • G06V20/647Three-dimensional [3D] objects by matching two-dimensional images to three-dimensional objects
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/027Spatial or constructional arrangements of microphones, e.g. in dummy heads
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; ELECTRIC HEARING AIDS; PUBLIC ADDRESS SYSTEMS
    • H04R5/00Stereophonic arrangements
    • H04R5/033Headphones for stereophonic communication
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S1/00Two-channel systems
    • H04S1/007Two-channel systems in which the audio signals are in digital form
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S7/00Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30Control circuits for electronic adaptation of the sound field
    • H04S7/301Automatic calibration of stereophonic sound system, e.g. with test microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/01Enhancing the perception of the sound image or of the spatial distribution using head related transfer functions [HRTF's] or equivalents thereof, e.g. interaural time difference [ITD] or interaural level difference [ILD]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S2420/00Techniques used stereophonic systems covered by H04S but not provided for in its groups
    • H04S2420/13Application of wave-field synthesis in stereophonic audio systems
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04SSTEREOPHONIC SYSTEMS 
    • H04S3/00Systems employing more than two channels, e.g. quadraphonic
    • H04S3/002Non-adaptive circuits, e.g. manually adjustable or static, for enhancing the sound image or the spatial distribution
    • H04S3/004For headphones

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Multimedia (AREA)
  • General Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Software Systems (AREA)
  • Human Computer Interaction (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Image Analysis (AREA)
  • Stereophonic System (AREA)
  • Processing Or Creating Images (AREA)
CN201980065687.8A 2018-10-10 2019-10-03 信息处理装置、信息处理方法和信息处理程序 Active CN113039816B (zh)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310573805.4A CN116801179A (zh) 2018-10-10 2019-10-03 信息处理装置、信息处理方法和计算机可访问介质

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
JP2018191513 2018-10-10
JP2018-191513 2018-10-10
PCT/JP2019/039103 WO2020075622A1 (ja) 2018-10-10 2019-10-03 情報処理装置、情報処理方法及び情報処理プログラム

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202310573805.4A Division CN116801179A (zh) 2018-10-10 2019-10-03 信息处理装置、信息处理方法和计算机可访问介质

Publications (2)

Publication Number Publication Date
CN113039816A CN113039816A (zh) 2021-06-25
CN113039816B true CN113039816B (zh) 2023-06-06

Family

ID=70165249

Family Applications (2)

Application Number Title Priority Date Filing Date
CN201980065687.8A Active CN113039816B (zh) 2018-10-10 2019-10-03 信息处理装置、信息处理方法和信息处理程序
CN202310573805.4A Pending CN116801179A (zh) 2018-10-10 2019-10-03 信息处理装置、信息处理方法和计算机可访问介质

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN202310573805.4A Pending CN116801179A (zh) 2018-10-10 2019-10-03 信息处理装置、信息处理方法和计算机可访问介质

Country Status (6)

Country Link
US (2) US11595772B2 (https=)
EP (1) EP3866492B1 (https=)
JP (2) JPWO2020075622A1 (https=)
KR (1) KR20210068409A (https=)
CN (2) CN113039816B (https=)
WO (1) WO2020075622A1 (https=)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7442494B2 (ja) 2018-07-25 2024-03-04 ドルビー ラボラトリーズ ライセンシング コーポレイション 光学式捕捉によるパーソナライズされたhrtf
KR102759677B1 (ko) 2018-10-03 2025-02-03 소니그룹주식회사 정보 처리 장치, 정보 처리 방법 및 프로그램
KR20210068409A (ko) 2018-10-10 2021-06-09 소니그룹주식회사 정보 처리 장치, 정보 처리 방법 및 정보 처리 프로그램
FR3105549B1 (fr) * 2019-12-24 2022-01-07 Parrot Faurecia Automotive Sas Procédé et système audio d’appui-tête de siège
WO2022014308A1 (ja) 2020-07-15 2022-01-20 ソニーグループ株式会社 情報処理装置、情報処理方法および端末装置
US12586235B2 (en) * 2020-08-14 2026-03-24 Ceva Technologies, Inc. Systems and methods for head related transfer function personalization
EP4272464A1 (en) * 2020-12-31 2023-11-08 Harman International Industries, Incorporated Method for determining a personalized head-related transfer function
US12125305B2 (en) * 2021-10-26 2024-10-22 Avaya Management L.P. Usage and health-triggered machine response
US12483845B2 (en) * 2023-02-24 2025-11-25 Dell Products, L.P. Systems and methods for headset identification
WO2025065317A1 (zh) * 2023-09-27 2025-04-03 京东方科技集团股份有限公司 音频处理装置、方法、扩展现实装置、设备和存储介质
WO2025100801A1 (ko) * 2023-11-07 2025-05-15 삼성전자 주식회사 오디오 신호를 생성 또는 재생하는 전자 장치 및 그의 동작 방법

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1083190A (ja) * 1996-09-06 1998-03-31 Taimu Wear:Kk 過渡応答信号生成と設定方法及びその装置
JP2004314915A (ja) * 2003-04-21 2004-11-11 Alpine Electronics Inc 聴取点位置測定装置
US6996244B1 (en) * 1998-08-06 2006-02-07 Vulcan Patents Llc Estimation of head-related transfer functions for spatial sound representative
CN1901761A (zh) * 2005-07-20 2007-01-24 三星电子株式会社 宽单声道声音再现方法和设备
CN103139677A (zh) * 2011-11-22 2013-06-05 鹦鹉股份有限公司 用于收听音频音乐源和/或免提电话功能的具有非适应型的主动噪声控制的音频耳机
EP3351172A1 (en) * 2015-09-14 2018-07-25 Yamaha Corporation Ear shape analysis method, ear shape analysis device, and method for generating ear shape model
US10038966B1 (en) * 2016-10-20 2018-07-31 Oculus Vr, Llc Head-related transfer function (HRTF) personalization based on captured images of user

Family Cites Families (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6218091B1 (en) * 2000-04-07 2001-04-17 Eastman Kodak Company Rapid processing of high contrast aerial color negative film
EP2611216B1 (en) 2011-12-30 2015-12-16 GN Resound A/S Systems and methods for determining head related transfer functions
US9030545B2 (en) * 2011-12-30 2015-05-12 GNR Resound A/S Systems and methods for determining head related transfer functions
US9544706B1 (en) 2015-03-23 2017-01-10 Amazon Technologies, Inc. Customized head-related transfer functions
US10805757B2 (en) 2015-12-31 2020-10-13 Creative Technology Ltd Method for generating a customized/personalized head related transfer function
SG10201510822YA (en) 2015-12-31 2017-07-28 Creative Tech Ltd A method for generating a customized/personalized head related transfer function
JP2017216660A (ja) * 2016-06-02 2017-12-07 ソニー株式会社 情報処理装置、情報処理方法、およびプログラム
EP3539305A4 (en) 2016-11-13 2020-04-22 Embodyvr, Inc. SYSTEM AND METHOD FOR TAKING PICTURES OF THE EARSEL AND CHARACTERIZING THE HUMAN LOCAL ANATOMY USING PICTURES OF THE EARSEL
KR102759677B1 (ko) 2018-10-03 2025-02-03 소니그룹주식회사 정보 처리 장치, 정보 처리 방법 및 프로그램
KR20210068409A (ko) 2018-10-10 2021-06-09 소니그룹주식회사 정보 처리 장치, 정보 처리 방법 및 정보 처리 프로그램

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH1083190A (ja) * 1996-09-06 1998-03-31 Taimu Wear:Kk 過渡応答信号生成と設定方法及びその装置
US6996244B1 (en) * 1998-08-06 2006-02-07 Vulcan Patents Llc Estimation of head-related transfer functions for spatial sound representative
JP2004314915A (ja) * 2003-04-21 2004-11-11 Alpine Electronics Inc 聴取点位置測定装置
CN1901761A (zh) * 2005-07-20 2007-01-24 三星电子株式会社 宽单声道声音再现方法和设备
CN103139677A (zh) * 2011-11-22 2013-06-05 鹦鹉股份有限公司 用于收听音频音乐源和/或免提电话功能的具有非适应型的主动噪声控制的音频耳机
EP3351172A1 (en) * 2015-09-14 2018-07-25 Yamaha Corporation Ear shape analysis method, ear shape analysis device, and method for generating ear shape model
US10038966B1 (en) * 2016-10-20 2018-07-31 Oculus Vr, Llc Head-related transfer function (HRTF) personalization based on captured images of user

Also Published As

Publication number Publication date
EP3866492A1 (en) 2021-08-18
US20210385600A1 (en) 2021-12-09
KR20210068409A (ko) 2021-06-09
JP2024112961A (ja) 2024-08-21
JPWO2020075622A1 (ja) 2021-09-16
US11595772B2 (en) 2023-02-28
WO2020075622A1 (ja) 2020-04-16
EP3866492A4 (en) 2021-12-08
CN113039816A (zh) 2021-06-25
CN116801179A (zh) 2023-09-22
EP3866492B1 (en) 2025-09-03
US20230283979A1 (en) 2023-09-07
JP7715249B2 (ja) 2025-07-30
US12273704B2 (en) 2025-04-08

Similar Documents

Publication Publication Date Title
CN113039816B (zh) 信息处理装置、信息处理方法和信息处理程序
US12505644B2 (en) Method for generating customized/personalized head related transfer function
US11601775B2 (en) Method for generating a customized/personalized head related transfer function
US10313818B2 (en) HRTF personalization based on anthropometric features
JP4718559B2 (ja) モデル化によってhrtfを個別化するための方法および装置
CN116528141A (zh) 经由光学捕获的个性化hrtfs
US11315277B1 (en) Device to determine user-specific HRTF based on combined geometric data
JP2024501616A (ja) パーソナライズされた頭部伝達関数を決定する方法
CN120611122B (zh) Hrtf生成方法、设备及计算机可读存储介质
CN120611123B (zh) Hrtf生成方法、设备及计算机可读存储介质
Pirard et al. Photogrammetry-Reconstructed 3D Head Meshes for Accessible Individual Head-Related Transfer Functions
CN119545285A (zh) 一种基于自动编码器和球谐展开的hrtf重建装置及方法
HK1255064B (zh) 一种用於生成定制/个性化头部相关传递函数的方法

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant