IN2014MN01958A - Google Patents

Publication number
IN2014MN01958A
Authority
IN
India
Prior art keywords
scene
captured
keypoint
video
objects
Prior art date
Application number
Inventor
Erik Visser
Haiyin Wang
Hasib A Siddiqui
Lae Hoon Kim
Original Assignee
Qualcomm Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Qualcomm Inc
Publication of IN2014MN01958A

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S7/00 Indicating arrangements; Control arrangements, e.g. balance control
    • H04S7/30 Control circuits for electronic adaptation of the sound field
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00 Pattern recognition
    • G06F18/20 Analysing
    • G06F18/25 Fusion techniques
    • G06F18/254 Fusion techniques of classification results, e.g. of results related to same input data
    • G06F18/256 Fusion techniques of classification results, e.g. of results related to same input data of results relating to different input data, e.g. multimodal recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/20 Image preprocessing
    • G06V10/25 Determination of region of interest [ROI] or a volume of interest [VOI]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/44 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components
    • G06V10/443 Local feature extraction by analysis of parts of the pattern, e.g. by detecting edges, contours, loops, corners, strokes or intersections; Connectivity analysis, e.g. of connected components by matching or filtering
    • G06V10/449 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters
    • G06V10/451 Biologically inspired filters, e.g. difference of Gaussians [DoG] or Gabor filters with interaction between the filter responses, e.g. cortical complex cells
    • G06V10/454 Integrating the filters into a hierarchical structure, e.g. convolutional neural networks [CNN]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/40 Extraction of image or video features
    • G06V10/46 Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • G06V10/462 Salient features, e.g. scale invariant feature transforms [SIFT]
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00 Arrangements for image or video recognition or understanding
    • G06V10/70 Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77 Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/80 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level
    • G06V10/809 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of classification results, e.g. where the classifiers operate on the same input data
    • G06V10/811 Fusion, i.e. combining data from various sources at the sensor level, preprocessing level, feature extraction level or classification level of classification results, e.g. where the classifiers operate on the same input data the classifiers operating on different input data, e.g. multi-modal recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04R LOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R3/00 Circuits for transducers, loudspeakers or microphones
    • H04R3/005 Circuits for transducers, loudspeakers or microphones for combining the signals of two or more microphones
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/11 Positioning of individual sound objects, e.g. moving airplane, within a sound field
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04S STEREOPHONIC SYSTEMS
    • H04S2400/00 Details of stereophonic systems covered by H04S but not provided for in its groups
    • H04S2400/15 Aspects of sound capture and related signal processing for recording or reproduction

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Evolutionary Computation (AREA)
  • Artificial Intelligence (AREA)
  • Signal Processing (AREA)
  • Acoustics & Sound (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Otolaryngology (AREA)
  • Computing Systems (AREA)
  • Databases & Information Systems (AREA)
  • Medical Informatics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Software Systems (AREA)
  • Biodiversity & Conservation Biology (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Image Analysis (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Studio Devices (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

Methods, systems and articles of manufacture for recognizing and locating one or more objects in a scene are disclosed. An image and/or video of the scene is captured. Using audio recorded at the scene, an object search of the captured scene is narrowed down. For example, the direction of arrival (DOA) of a sound can be determined and used to limit the search area in a captured image/video. In another example, keypoint signatures may be selected based on types of sounds identified in the recorded audio. A keypoint signature corresponds to a particular object that the system is configured to recognize. Objects in the scene may then be recognized using a scale invariant feature transform (SIFT) analysis comparing keypoints identified in the captured scene to the selected keypoint signatures.
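
The two audio-based narrowing steps in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the patent: the pinhole camera model, the angle convention (0 degrees on the optical axis, positive to the right), the 60 degree field of view, the 10 degree margin, and the names doa_to_column_range, select_signatures and SIGNATURES_BY_SOUND are all assumptions made for illustration.

```python
import math

# Hypothetical lookup table (not from the patent): sound type -> names of
# keypoint signatures for objects likely to produce that sound.
SIGNATURES_BY_SOUND = {
    "speech": ["face"],
    "dog_bark": ["dog"],
    "piano": ["piano_keyboard"],
}


def doa_to_column_range(doa_deg, image_width, hfov_deg=60.0, margin_deg=10.0):
    """Map a DOA azimuth to a range of pixel columns worth searching.

    Assumes a pinhole camera whose optical axis is at 0 degrees, with
    positive angles to the right. Returns (left, right) column bounds,
    or None when the padded DOA window lies outside the field of view.
    """
    half_fov = hfov_deg / 2.0
    lo = max(-half_fov, doa_deg - margin_deg)
    hi = min(half_fov, doa_deg + margin_deg)
    if lo > hi:
        return None  # sound source is outside the camera's view
    cx = image_width / 2.0
    # Focal length in pixels, derived from the horizontal field of view.
    focal = cx / math.tan(math.radians(half_fov))
    left = int(max(0, math.floor(cx + focal * math.tan(math.radians(lo)))))
    right = int(min(image_width,
                    math.ceil(cx + focal * math.tan(math.radians(hi)))))
    return left, right


def select_signatures(sound_types, table=SIGNATURES_BY_SOUND):
    """Collect signatures for the identified sound types, without duplicates."""
    selected = []
    for sound in sound_types:
        for signature in table.get(sound, []):
            if signature not in selected:
                selected.append(signature)
    return selected


if __name__ == "__main__":
    # Restrict the keypoint search to columns near a source 15 degrees right.
    print(doa_to_column_range(15.0, 640))
    # Compare scene keypoints only against signatures tied to heard sounds.
    print(select_signatures(["speech", "piano"]))
```

In such a sketch, keypoint detection and matching would then run only on the returned column range and only against the selected signatures, which is where the reduction in search effort would come from.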
IN1958MUN2014 2012-04-13 2013-03-07 IN2014MN01958A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201261623910P 2012-04-13 2012-04-13
US13/664,295 US9495591B2 (en) 2012-04-13 2012-10-30 Object recognition using multi-modal matching scheme
PCT/US2013/029558 WO2013154701A1 (en) 2012-04-13 2013-03-07 Object recognition using multi-modal matching scheme

Publications (1)

Publication Number Publication Date
IN2014MN01958A (en) 2015-07-10

Family

ID=49325131

Family Applications (1)

Application Number Title Priority Date Filing Date
IN1958MUN2014 IN2014MN01958A (en) 2012-04-13 2013-03-07

Country Status (7)

Country Link
US (1) US9495591B2 (en)
EP (1) EP2836964A1 (en)
JP (2) JP2015514239A (en)
KR (1) KR20140145195A (en)
CN (1) CN104246796B (en)
IN (1) IN2014MN01958A (en)
WO (1) WO2013154701A1 (en)

Families Citing this family (130)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8175617B2 (en) 2009-10-28 2012-05-08 Digimarc Corporation Sensor-based mobile search, related methods and systems
US8810598B2 (en) 2011-04-08 2014-08-19 Nant Holdings Ip, Llc Interference based augmented reality hosting platforms
US9489567B2 (en) * 2011-04-11 2016-11-08 Intel Corporation Tracking and recognition of faces using selected region classification
WO2013078345A1 (en) 2011-11-21 2013-05-30 Nant Holdings Ip, Llc Subscription bill service, systems and methods
US9099096B2 (en) * 2012-05-04 2015-08-04 Sony Computer Entertainment Inc. Source separation by independent component analysis with moving constraint
US8886526B2 (en) * 2012-05-04 2014-11-11 Sony Computer Entertainment Inc. Source separation using independent component analysis with mixed multi-variate probability density function
US8880395B2 (en) * 2012-05-04 2014-11-04 Sony Computer Entertainment Inc. Source separation by independent component analysis in conjunction with source direction information
US9955277B1 (en) * 2012-09-26 2018-04-24 Foundation For Research And Technology-Hellas (F.O.R.T.H.) Institute Of Computer Science (I.C.S.) Spatial sound characterization apparatuses, methods and systems
US9554203B1 (en) * 2012-09-26 2017-01-24 Foundation for Research and Technolgy—Hellas (FORTH) Institute of Computer Science (ICS) Sound source characterization apparatuses, methods and systems
US10136239B1 (en) 2012-09-26 2018-11-20 Foundation For Research And Technology—Hellas (F.O.R.T.H.) Capturing and reproducing spatial sound apparatuses, methods, and systems
US9549253B2 (en) 2012-09-26 2017-01-17 Foundation for Research and Technology—Hellas (FORTH) Institute of Computer Science (ICS) Sound source localization and isolation apparatuses, methods and systems
US10149048B1 (en) 2012-09-26 2018-12-04 Foundation for Research and Technology—Hellas (F.O.R.T.H.) Institute of Computer Science (I.C.S.) Direction of arrival estimation and sound source enhancement in the presence of a reflective surface apparatuses, methods, and systems
US20160210957A1 (en) 2015-01-16 2016-07-21 Foundation For Research And Technology - Hellas (Forth) Foreground Signal Suppression Apparatuses, Methods, and Systems
US10175335B1 (en) 2012-09-26 2019-01-08 Foundation For Research And Technology-Hellas (Forth) Direction of arrival (DOA) estimation apparatuses, methods, and systems
WO2014069112A1 (en) 2012-11-02 2014-05-08 ソニー株式会社 Signal processing device and signal processing method
WO2014069111A1 (en) 2012-11-02 2014-05-08 ソニー株式会社 Signal processing device, signal processing method, measurement method, and measurement device
CN103916723B (en) * 2013-01-08 2018-08-10 联想(北京)有限公司 A kind of sound collection method and a kind of electronic equipment
KR101832835B1 (en) * 2013-07-11 2018-02-28 삼성전자주식회사 Imaging processing module, ultrasound imaging apparatus, method for beam forming and method for controlling a ultrasound imaging apparatus
US9729994B1 (en) * 2013-08-09 2017-08-08 University Of South Florida System and method for listener controlled beamforming
US20150085615A1 (en) * 2013-09-25 2015-03-26 Lenovo (Singapore) Pte, Ltd. Motion modified steering vector
US9582516B2 (en) 2013-10-17 2017-02-28 Nant Holdings Ip, Llc Wide area augmented reality location-based services
EP2884491A1 (en) * 2013-12-11 2015-06-17 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Extraction of reverberant sound using microphone arrays
US9311639B2 (en) 2014-02-11 2016-04-12 Digimarc Corporation Methods, apparatus and arrangements for device to device communication
US9338575B2 (en) * 2014-02-19 2016-05-10 Echostar Technologies L.L.C. Image steered microphone array
CN103905810B (en) * 2014-03-17 2017-12-12 北京智谷睿拓技术服务有限公司 Multi-media processing method and multimedia processing apparatus
JP6320806B2 (en) * 2014-03-17 2018-05-09 国立大学法人豊橋技術科学大学 3D model search method and 3D model search system
KR20150118855A (en) * 2014-04-15 2015-10-23 삼성전자주식회사 Electronic apparatus and recording method thereof
US9990433B2 (en) 2014-05-23 2018-06-05 Samsung Electronics Co., Ltd. Method for searching and device thereof
US11314826B2 (en) 2014-05-23 2022-04-26 Samsung Electronics Co., Ltd. Method for searching and device thereof
CN105224941B (en) * 2014-06-18 2018-11-20 台达电子工业股份有限公司 Process identification and localization method
US10679407B2 (en) 2014-06-27 2020-06-09 The University Of North Carolina At Chapel Hill Methods, systems, and computer readable media for modeling interactive diffuse reflections and higher-order diffraction in virtual environment scenes
JP6118838B2 (en) * 2014-08-21 2017-04-19 本田技研工業株式会社 Information processing apparatus, information processing system, information processing method, and information processing program
US11308928B2 (en) * 2014-09-25 2022-04-19 Sunhouse Technologies, Inc. Systems and methods for capturing and interpreting audio
US9536509B2 (en) * 2014-09-25 2017-01-03 Sunhouse Technologies, Inc. Systems and methods for capturing and interpreting audio
US10061009B1 (en) 2014-09-30 2018-08-28 Apple Inc. Robust confidence measure for beamformed acoustic beacon for device tracking and localization
GB2533373B (en) * 2014-12-18 2018-07-04 Canon Kk Video-based sound source separation
US10037712B2 (en) * 2015-01-30 2018-07-31 Toyota Motor Engineering & Manufacturing North America, Inc. Vision-assist devices and methods of detecting a classification of an object
US10217379B2 (en) 2015-01-30 2019-02-26 Toyota Motor Engineering & Manufacturing North America, Inc. Modifying vision-assist device parameters based on an environment classification
US9791264B2 (en) * 2015-02-04 2017-10-17 Sony Corporation Method of fast and robust camera location ordering
US9736580B2 (en) * 2015-03-19 2017-08-15 Intel Corporation Acoustic camera based audio visual scene analysis
CN107980221B (en) * 2015-04-01 2021-10-29 猫头鹰实验室股份有限公司 Compositing and scaling angularly separated sub-scenes
US9769587B2 (en) * 2015-04-17 2017-09-19 Qualcomm Incorporated Calibration of acoustic echo cancelation for multi-channel sound in dynamic acoustic environments
US9892518B2 (en) * 2015-06-09 2018-02-13 The Trustees Of Columbia University In The City Of New York Systems and methods for detecting motion using local phase information
US10068445B2 (en) 2015-06-24 2018-09-04 Google Llc Systems and methods of home-specific sound event detection
US9754182B2 (en) * 2015-09-02 2017-09-05 Apple Inc. Detecting keypoints in image data
US10169684B1 (en) 2015-10-01 2019-01-01 Intellivision Technologies Corp. Methods and systems for recognizing objects based on one or more stored training images
CN107925818B (en) * 2015-10-15 2020-10-16 华为技术有限公司 Sound processing node for a sound processing node arrangement
CN105574525B (en) * 2015-12-18 2019-04-26 天津中科虹星科技有限公司 A kind of complex scene multi-modal biological characteristic image acquiring method and its device
WO2017108097A1 (en) * 2015-12-22 2017-06-29 Huawei Technologies Duesseldorf Gmbh Localization algorithm for sound sources with known statistics
TW201727537A (en) * 2016-01-22 2017-08-01 鴻海精密工業股份有限公司 Face recognition system and face recognition method
WO2017139473A1 (en) 2016-02-09 2017-08-17 Dolby Laboratories Licensing Corporation System and method for spatial processing of soundfield signals
EP3209034A1 (en) * 2016-02-19 2017-08-23 Nokia Technologies Oy Controlling audio rendering
US11234072B2 (en) 2016-02-18 2022-01-25 Dolby Laboratories Licensing Corporation Processing of microphone signals for spatial playback
GB2549073B (en) * 2016-03-24 2020-02-26 Imagination Tech Ltd Generating sparse sample histograms
WO2017208820A1 (en) 2016-05-30 2017-12-07 ソニー株式会社 Video sound processing device, video sound processing method, and program
CN109478400B (en) 2016-07-22 2023-07-07 杜比实验室特许公司 Network-based processing and distribution of multimedia content for live musical performances
CN105979442B (en) * 2016-07-22 2019-12-03 北京地平线机器人技术研发有限公司 Noise suppressing method, device and movable equipment
US10522169B2 (en) * 2016-09-23 2019-12-31 Trustees Of The California State University Classification of teaching based upon sound amplitude
US9942513B1 (en) * 2016-10-31 2018-04-10 Cisco Technology, Inc. Automated configuration of behavior of a telepresence system based on spatial detection of telepresence components
US10528850B2 (en) * 2016-11-02 2020-01-07 Ford Global Technologies, Llc Object classification adjustment based on vehicle communication
US10455601B2 (en) * 2016-11-17 2019-10-22 Telefonaktiebolaget Lm Ericsson (Publ) Co-scheduling of wireless devices
JP6942472B2 (en) * 2017-01-13 2021-09-29 キヤノン株式会社 Video recognition device, video recognition method and program
US10896668B2 (en) 2017-01-31 2021-01-19 Sony Corporation Signal processing apparatus, signal processing method, and computer program
US10248744B2 (en) 2017-02-16 2019-04-02 The University Of North Carolina At Chapel Hill Methods, systems, and computer readable media for acoustic classification and optimization for multi-modal rendering of real-world scenes
JP7121470B2 (en) * 2017-05-12 2022-08-18 キヤノン株式会社 Image processing system, control method, and program
US20180366139A1 (en) * 2017-06-14 2018-12-20 Upton Beall Bowden Employing vehicular sensor information for retrieval of data
CN107621625B (en) * 2017-06-23 2020-07-17 桂林电子科技大学 Sound source positioning method based on double micro microphones
CN111417961B (en) * 2017-07-14 2024-01-12 纪念斯隆-凯特林癌症中心 Weak-supervision image classifier
CN107526568A (en) * 2017-08-18 2017-12-29 广东欧珀移动通信有限公司 volume adjusting method, device, terminal device and storage medium
US11209306B2 (en) 2017-11-02 2021-12-28 Fluke Corporation Portable acoustic imaging tool with scanning and analysis capability
US11099075B2 (en) 2017-11-02 2021-08-24 Fluke Corporation Focus and/or parallax adjustment in acoustic imaging using distance information
CN109754814B (en) * 2017-11-08 2023-07-28 阿里巴巴集团控股有限公司 Sound processing method and interaction equipment
US11030997B2 (en) * 2017-11-22 2021-06-08 Baidu Usa Llc Slim embedding layers for recurrent neural language models
CN109977731B (en) * 2017-12-27 2021-10-29 深圳市优必选科技有限公司 Scene identification method, scene identification equipment and terminal equipment
US10616682B2 (en) * 2018-01-12 2020-04-07 Sorama Calibration of microphone arrays with an uncalibrated source
US10522167B1 (en) * 2018-02-13 2019-12-31 Amazon Techonlogies, Inc. Multichannel noise cancellation using deep neural network masking
WO2019183277A1 (en) * 2018-03-20 2019-09-26 Nant Holdings Ip, Llc Volumetric descriptors
CN108564116A (en) * 2018-04-02 2018-09-21 深圳市安软慧视科技有限公司 A kind of ingredient intelligent analysis method of camera scene image
US10523864B2 (en) * 2018-04-10 2019-12-31 Facebook, Inc. Automated cinematic decisions based on descriptive models
US11212637B2 (en) 2018-04-12 2021-12-28 Qualcomm Incorproated Complementary virtual audio generation
GB2573173B (en) * 2018-04-27 2021-04-28 Cirrus Logic Int Semiconductor Ltd Processing audio signals
EP3827227A1 (en) 2018-07-24 2021-06-02 Fluke Corporation Systems and methods for projecting and displaying acoustic data
EP3829161B1 (en) * 2018-07-24 2023-08-30 Sony Group Corporation Information processing device and method, and program
CN109284673B (en) * 2018-08-07 2022-02-22 北京市商汤科技开发有限公司 Object tracking method and device, electronic equipment and storage medium
US10769474B2 (en) 2018-08-10 2020-09-08 Apple Inc. Keypoint detection circuit for processing image pyramid in recursive manner
CN112956203A (en) * 2018-08-29 2021-06-11 英特尔公司 Apparatus and method for feature point tracking using inter prediction
KR20230113831A (en) * 2018-09-03 2023-08-01 스냅 인코포레이티드 Acoustic zooming
JP7119809B2 (en) * 2018-09-13 2022-08-17 富士通株式会社 Information display control program, information display control method and information display control device
US11605231B2 (en) * 2018-09-17 2023-03-14 Syracuse University Low power and privacy preserving sensor platform for occupancy detection
CN111050269B (en) * 2018-10-15 2021-11-19 华为技术有限公司 Audio processing method and electronic equipment
EP4408022A3 (en) 2018-10-24 2024-10-16 Gracenote, Inc. Methods and apparatus to adjust audio playback settings based on analysis of audio characteristics
EP3672282B1 (en) * 2018-12-21 2022-04-06 Sivantos Pte. Ltd. Method for beamforming in a binaural hearing aid
CN109697734B (en) * 2018-12-25 2021-03-09 浙江商汤科技开发有限公司 Pose estimation method and device, electronic equipment and storage medium
CN109817193B (en) * 2019-02-21 2022-11-22 深圳市魔耳乐器有限公司 Timbre fitting system based on time-varying multi-segment frequency spectrum
US11343545B2 (en) 2019-03-27 2022-05-24 International Business Machines Corporation Computer-implemented event detection using sonification
CN112233647A (en) * 2019-06-26 2021-01-15 索尼公司 Information processing apparatus and method, and computer-readable storage medium
CN110531351B (en) * 2019-08-16 2023-09-26 山东工商学院 GPR image hyperbolic wave top detection method based on Fast algorithm
US11440194B2 (en) * 2019-09-13 2022-09-13 Honda Motor Co., Ltd. Physical human-robot interaction (pHRI)
US10735887B1 (en) * 2019-09-19 2020-08-04 Wave Sciences, LLC Spatial audio array processing system and method
CN112862663B (en) * 2019-11-12 2023-06-16 芜湖每刻深思智能科技有限公司 Near sensor end computing system
US11610599B2 (en) * 2019-12-06 2023-03-21 Meta Platforms Technologies, Llc Systems and methods for visually guided audio separation
JP7250281B2 (en) * 2019-12-12 2023-04-03 本田技研工業株式会社 Three-dimensional structure restoration device, three-dimensional structure restoration method, and program
CN111191547A (en) * 2019-12-23 2020-05-22 中电健康云科技有限公司 Medical waste online screening method based on hyperspectral deconvolution and unmixing
US11295543B2 (en) * 2020-03-31 2022-04-05 International Business Machines Corporation Object detection in an image
US10929506B1 (en) * 2020-06-02 2021-02-23 Scientific Innovations, Inc. Computerized estimation of minimum number of sonic sources using antichain length
CN111652165B (en) * 2020-06-08 2022-05-17 北京世纪好未来教育科技有限公司 Mouth shape evaluating method, mouth shape evaluating equipment and computer storage medium
US11368456B2 (en) 2020-09-11 2022-06-21 Bank Of America Corporation User security profile for multi-media identity verification
US11356266B2 (en) 2020-09-11 2022-06-07 Bank Of America Corporation User authentication using diverse media inputs and hash-based ledgers
KR20220048090A (en) 2020-10-12 2022-04-19 삼성전자주식회사 Method of testing image sensor using frequency domain and test system performing the same
CN112386282B (en) * 2020-11-13 2022-08-26 声泰特(成都)科技有限公司 Ultrasonic automatic volume scanning imaging method and system
CN112465868B (en) * 2020-11-30 2024-01-12 浙江华锐捷技术有限公司 Target detection tracking method and device, storage medium and electronic device
CN112860198B (en) * 2021-01-05 2024-02-09 中科创达软件股份有限公司 Video conference picture switching method and device, computer equipment and storage medium
JP6967735B1 (en) * 2021-01-13 2021-11-17 パナソニックIpマネジメント株式会社 Signal processing equipment and signal processing system
CN113035162B (en) * 2021-03-22 2024-07-09 平安科技(深圳)有限公司 Ethnic music generation method, device, equipment and storage medium
US20240249743A1 (en) * 2021-05-25 2024-07-25 Google Llc Enhancing Audio Content of a Captured Sense
KR102437760B1 (en) * 2021-05-27 2022-08-29 이충열 Method for processing sounds by computing apparatus, method for processing images and sounds thereby, and systems using the same
CN113177536B (en) * 2021-06-28 2021-09-10 四川九通智路科技有限公司 Vehicle collision detection method and device based on deep residual shrinkage network
CN113189539B (en) * 2021-06-30 2021-09-28 成都华日通讯技术股份有限公司 Airspace filtering method based on direction-finding equipment
US11408971B1 (en) 2021-07-02 2022-08-09 Scientific Innovations, Inc. Computerized estimation of minimum number of sonic sources using maximum matching of a bipartite graph
CN113887360B (en) * 2021-09-23 2024-05-31 同济大学 Method for extracting dispersion waves based on iterative expansion dispersion modal decomposition
CN114241534B (en) * 2021-12-01 2022-10-18 佛山市红狐物联网科技有限公司 Rapid matching method and system for full-palm venation data
CN114280533B (en) * 2021-12-23 2022-10-21 哈尔滨工程大学 Sparse Bayesian DOA estimation method based on l0 norm constraint
CN114422910A (en) * 2022-01-17 2022-04-29 江苏水声技术有限公司 Scene analysis acoustic imaging device and method based on spherical microphone array
WO2022241328A1 (en) * 2022-05-20 2022-11-17 Innopeak Technology, Inc. Hand gesture detection methods and systems with hand shape calibration
US11830239B1 (en) * 2022-07-13 2023-11-28 Robert Bosch Gmbh Systems and methods for automatic extraction and alignment of labels derived from camera feed for moving sound sources recorded with a microphone array
US12020156B2 (en) 2022-07-13 2024-06-25 Robert Bosch Gmbh Systems and methods for automatic alignment between audio recordings and labels extracted from a multitude of asynchronous sensors in urban settings
CN115601576B (en) * 2022-12-12 2023-04-07 云南览易网络科技有限责任公司 Image feature matching method, device, equipment and storage medium
WO2024125793A1 (en) * 2022-12-15 2024-06-20 Telefonaktiebolaget Lm Ericsson (Publ) Focusing a camera capturing video data using directional data of audio
CN115880293B (en) * 2023-02-22 2023-05-05 中山大学孙逸仙纪念医院 Pathological image identification method, device and medium for bladder cancer lymph node metastasis
CN116796021B (en) * 2023-08-28 2023-12-05 上海任意门科技有限公司 Image retrieval method, system, electronic device and medium

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001067098A (en) * 1999-08-25 2001-03-16 Sanyo Electric Co Ltd Person detecting method and device equipped with person detecting function
US6738745B1 (en) * 2000-04-07 2004-05-18 International Business Machines Corporation Methods and apparatus for identifying a non-target language in a speech recognition system
US7130446B2 (en) * 2001-12-03 2006-10-31 Microsoft Corporation Automatic detection and tracking of multiple individuals using multiple cues
US7133535B2 (en) 2002-12-21 2006-11-07 Microsoft Corp. System and method for real time lip synchronization
JP2005117621A (en) * 2003-09-16 2005-04-28 Honda Motor Co Ltd Image distribution system
EP1643769B1 (en) 2004-09-30 2009-12-23 Samsung Electronics Co., Ltd. Apparatus and method performing audio-video sensor fusion for object localization, tracking and separation
JP2009296143A (en) * 2008-06-03 2009-12-17 Canon Inc Imaging device
US8391615B2 (en) * 2008-12-02 2013-03-05 Intel Corporation Image recognition algorithm, method of identifying a target image using same, and method of selecting data for transmission to a portable electronic device
US8964994B2 (en) 2008-12-15 2015-02-24 Orange Encoding of multichannel digital audio signals
US8548193B2 (en) * 2009-09-03 2013-10-01 Palo Alto Research Center Incorporated Method and apparatus for navigating an electronic magnifier over a target document
US9031243B2 (en) * 2009-09-28 2015-05-12 iZotope, Inc. Automatic labeling and control of audio algorithms by audio recognition
US8135221B2 (en) * 2009-10-07 2012-03-13 Eastman Kodak Company Video concept classification using audio-visual atoms
CN101742114A (en) * 2009-12-31 2010-06-16 上海量科电子科技有限公司 Method and device for determining shooting operation through gesture identification
JP4968346B2 (en) * 2010-01-20 2012-07-04 カシオ計算機株式会社 Imaging apparatus, image detection apparatus, and program
US8602887B2 (en) * 2010-06-03 2013-12-10 Microsoft Corporation Synthesis of information from multiple audiovisual sources
US8805007B2 (en) * 2011-10-13 2014-08-12 Disney Enterprises, Inc. Integrated background and foreground tracking

Also Published As

Publication number Publication date
KR20140145195A (en) 2014-12-22
US9495591B2 (en) 2016-11-15
CN104246796A (en) 2014-12-24
WO2013154701A1 (en) 2013-10-17
US20130272548A1 (en) 2013-10-17
JP2015514239A (en) 2015-05-18
CN104246796B (en) 2018-04-17
EP2836964A1 (en) 2015-02-18
JP2018077479A (en) 2018-05-17

Similar Documents

Publication Publication Date Title
IN2014MN01958A (en)
RU2015101724A (en) INFORMATION PROCESSING SYSTEM AND INFORMATION MEDIA
WO2016174524A3 (en) Data processing systems
JP2015514239A5 (en)
EP2887697A3 (en) Method of audio signal processing and hearing aid system for implementing the same
BR112016007145A2 (en) mobile video search
WO2018080650A3 (en) Video-based data collection, image capture and analysis configuration
EP2860706A3 (en) Anti-spoofing
WO2012177845A3 (en) Systems and methods for tracking and authenticating goods
EP3531714A3 (en) Facilitating calibration of an audio playback device
EP2782046A3 (en) Information processing device, sensor device, information processing system, and storage medium
MX364461B (en) Method and apparatus for implementing recording of object audio, and electronic device.
EP2372607A3 (en) Scene matching reference data generation system and position measurement system
GB2562664A (en) Methods for detecting a sleep disorder and sleep disorder detection devices
EP2372605A3 (en) Image processing system and position measurement system
WO2015017796A3 (en) Learning systems and methods
WO2015134544A3 (en) Converter device and system including converter device
IN2014CN03322A (en)
WO2013188807A3 (en) Methods and systems for signal processing
MY182303A (en) High-band signal generation
EP3177040A3 (en) Information processing apparatus, information processing method, and program
WO2016126768A3 (en) Conference word cloud
PH12019501920A1 (en) Image processing method and apparatus
EP4394768A3 (en) Vehicle-based media system with audio ad and visual content synchronization feature
AU2017302245A1 (en) Optical character recognition utilizing hashed templates