FR3085785B1 - Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation - Google Patents

Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation Download PDF

Info

Publication number
FR3085785B1
FR3085785B1 FR1858041A FR1858041A FR3085785B1 FR 3085785 B1 FR3085785 B1 FR 3085785B1 FR 1858041 A FR1858041 A FR 1858041A FR 1858041 A FR1858041 A FR 1858041A FR 3085785 B1 FR3085785 B1 FR 3085785B1
Authority
FR
France
Prior art keywords
audio signal
normalization
generating
characteristic
frequency component
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
FR1858041A
Other languages
English (en)
Other versions
FR3085785A1 (fr
Inventor
Robert Coover
Zafar Rafii
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gracenote Inc
Original Assignee
Gracenote Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gracenote Inc filed Critical Gracenote Inc
Priority to FR1858041A priority Critical patent/FR3085785B1/fr
Priority to US16/453,654 priority patent/US20200082835A1/en
Priority to EP19857365.1A priority patent/EP3847642B1/fr
Priority to CN201980072112.9A priority patent/CN113614828A/zh
Priority to JP2021512712A priority patent/JP7346552B2/ja
Priority to CA3111800A priority patent/CA3111800A1/fr
Priority to AU2019335404A priority patent/AU2019335404B2/en
Priority to PCT/US2019/049953 priority patent/WO2020051451A1/fr
Priority to KR1020217010094A priority patent/KR20210082439A/ko
Publication of FR3085785A1 publication Critical patent/FR3085785A1/fr
Application granted granted Critical
Publication of FR3085785B1 publication Critical patent/FR3085785B1/fr
Priority to AU2022275486A priority patent/AU2022275486A1/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/018Audio watermarking, i.e. embedding inaudible data in the audio signal
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/022Blocking, i.e. grouping of samples in time; Choice of analysis windows; Overlap factoring
    • G10L19/025Detection of transients or attacks for time/frequency resolution switching
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • G10L25/54Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination for retrieval

Abstract

Des procédés, des appareils, des systèmes et des articles de fabrication sont divulgués pour générer des empreintes numériques audio par voie de normalisation. Un procédé exemplaire pour la génération d'empreintes numériques de données audio inclut la réception d'un signal audio dans des composants de fréquence incluant un premier composant de fréquence de signal audio à l'intérieur d'un premier bac de fréquences et un deuxième composant de fréquence de signal audio à l'intérieur d'un deuxième bac de fréquences, la détermination d'une première caractéristique du premier composant de fréquence de signal audio et d'une deuxième caractéristique du deuxième composant de fréquence de signal audio et la normalisation du signal audio pour générer ainsi des valeurs d'énergie normalisées, la normalisation du signal audio incluant (1) la normalisation du premier composant de fréquence de signal audio en ayant recours à la première caractéristique et (2) la normalisation du deuxième composant de fréquence de signal audio en ayant recours à la deuxième caractéristique. L'exemple inclut par ailleurs la sélection d'une des valeurs d'énergie normalisées et la génération d'une empreinte numérique du signal audio en utilisant la valeur sélectionnée parmi les valeurs d'énergie sélectionnée.
FR1858041A 2018-09-07 2018-09-07 Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation Active FR3085785B1 (fr)

Priority Applications (10)

Application Number Priority Date Filing Date Title
FR1858041A FR3085785B1 (fr) 2018-09-07 2018-09-07 Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation
US16/453,654 US20200082835A1 (en) 2018-09-07 2019-06-26 Methods and apparatus to fingerprint an audio signal via normalization
CN201980072112.9A CN113614828A (zh) 2018-09-07 2019-09-06 经由归一化对音频信号进行指纹识别的方法和装置
JP2021512712A JP7346552B2 (ja) 2018-09-07 2019-09-06 正規化を介して音響信号をフィンガープリンティングするための方法、記憶媒体及び装置
CA3111800A CA3111800A1 (fr) 2018-09-07 2019-09-06 Procedes et appareil servant a etablir une empreinte digitale pour un signal audio par normalisation
AU2019335404A AU2019335404B2 (en) 2018-09-07 2019-09-06 Methods and apparatus to fingerprint an audio signal via normalization
EP19857365.1A EP3847642B1 (fr) 2018-09-07 2019-09-06 Procédés et appareil servant à établir une empreinte digitale pour un signal audio par normalisation
PCT/US2019/049953 WO2020051451A1 (fr) 2018-09-07 2019-09-06 Procédés et appareil servant à établir une empreinte digitale pour un signal audio par normalisation
KR1020217010094A KR20210082439A (ko) 2018-09-07 2019-09-06 정규화를 통해 오디오 신호를 핑거프린팅하는 방법 및 장치
AU2022275486A AU2022275486A1 (en) 2018-09-07 2022-11-24 Methods and apparatus to fingerprint an audio signal via normalization

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
FR1858041A FR3085785B1 (fr) 2018-09-07 2018-09-07 Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation

Publications (2)

Publication Number Publication Date
FR3085785A1 FR3085785A1 (fr) 2020-03-13
FR3085785B1 true FR3085785B1 (fr) 2021-05-14

Family

ID=65861336

Family Applications (1)

Application Number Title Priority Date Filing Date
FR1858041A Active FR3085785B1 (fr) 2018-09-07 2018-09-07 Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation

Country Status (9)

Country Link
US (1) US20200082835A1 (fr)
EP (1) EP3847642B1 (fr)
JP (1) JP7346552B2 (fr)
KR (1) KR20210082439A (fr)
CN (1) CN113614828A (fr)
AU (2) AU2019335404B2 (fr)
CA (1) CA3111800A1 (fr)
FR (1) FR3085785B1 (fr)
WO (1) WO2020051451A1 (fr)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11727953B2 (en) * 2020-12-31 2023-08-15 Gracenote, Inc. Audio content recognition method and system
US11798577B2 (en) 2021-03-04 2023-10-24 Gracenote, Inc. Methods and apparatus to fingerprint an audio signal
US11804231B2 (en) * 2021-07-02 2023-10-31 Capital One Services, Llc Information exchange on mobile devices using audio

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2002346116A1 (en) * 2001-07-20 2003-03-03 Gracenote, Inc. Automatic identification of sound recordings
EP1567965A1 (fr) * 2002-11-12 2005-08-31 Koninklijke Philips Electronics N.V. Extraction d'empreintes spectrales de contenus multimedia
DE102004036154B3 (de) * 2004-07-26 2005-12-22 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Vorrichtung und Verfahren zur robusten Klassifizierung von Audiosignalen sowie Verfahren zu Einrichtung und Betrieb einer Audiosignal-Datenbank sowie Computer-Programm
EP1752969A4 (fr) * 2005-02-08 2007-07-11 Nippon Telegraph & Telephone Dispositif de séparation de signal, méthode de séparation de signal, programme de séparation de signal et support d`enregistrement
WO2009110738A2 (fr) * 2008-03-03 2009-09-11 엘지전자(주) Procédé et appareil pour traiter un signal audio
EP2327213B1 (fr) * 2008-08-21 2014-10-08 Dolby Laboratories Licensing Corporation Calcul d'erreurs de synchronisation audio video base sur des caracteristiques audio-visuelles
CA2716266C (fr) * 2009-10-01 2016-08-16 Crim (Centre De Recherche Informatique De Montreal) Detection de polycopie magnetique a base de contenu
JP5728888B2 (ja) * 2010-10-29 2015-06-03 ソニー株式会社 信号処理装置および方法、並びにプログラム
EP2751804A1 (fr) * 2011-08-29 2014-07-09 Telefónica, S.A. Procédé de génération d'empreintes digitales audio
US9098576B1 (en) * 2011-10-17 2015-08-04 Google Inc. Ensemble interest point detection for audio matching
KR101286862B1 (ko) * 2011-11-18 2013-07-17 (주)이스트소프트 블록별 가중치 부여를 이용한 오디오 핑거프린트 검색방법
US9202472B1 (en) * 2012-03-29 2015-12-01 Google Inc. Magnitude ratio descriptors for pitch-resistant audio matching
US9390719B1 (en) * 2012-10-09 2016-07-12 Google Inc. Interest points density control for audio matching
US9183849B2 (en) * 2012-12-21 2015-11-10 The Nielsen Company (Us), Llc Audio matching with semantic audio recognition and report generation
CN104125509B (zh) * 2013-04-28 2015-09-30 腾讯科技(深圳)有限公司 节目识别方法、装置及服务器
CN104093079B (zh) * 2014-05-29 2015-10-07 腾讯科技(深圳)有限公司 基于多媒体节目的交互方法、终端、服务器和系统
CN104050259A (zh) * 2014-06-16 2014-09-17 上海大学 一种基于som算法的音频指纹提取方法
US9837101B2 (en) * 2014-11-25 2017-12-05 Facebook, Inc. Indexing based on time-variant transforms of an audio signal's spectrogram

Also Published As

Publication number Publication date
JP7346552B2 (ja) 2023-09-19
WO2020051451A1 (fr) 2020-03-12
EP3847642B1 (fr) 2024-04-10
CN113614828A (zh) 2021-11-05
AU2019335404B2 (en) 2022-08-25
EP3847642A4 (fr) 2022-07-06
AU2022275486A1 (en) 2023-01-05
EP3847642A1 (fr) 2021-07-14
FR3085785A1 (fr) 2020-03-13
KR20210082439A (ko) 2021-07-05
JP2021536596A (ja) 2021-12-27
US20200082835A1 (en) 2020-03-12
AU2019335404A1 (en) 2021-04-22
CA3111800A1 (fr) 2020-03-12

Similar Documents

Publication Publication Date Title
FR3085785B1 (fr) Procedes et appareil pour generer une empreinte numerique d'un signal audio par voie de normalisation
Chang et al. Music Genre Classification via Compressive Sampling.
US10540993B2 (en) Audio fingerprinting based on audio energy characteristics
CN103718242B (zh) 采用谱运动变换的用于处理声音信号的系统和方法
KR20180034216A (ko) 다른 신호의 스펙트럼을 검사하기 위한 신호 제거
Korshunov et al. Cross-database evaluation of audio-based spoofing detection systems
Ellis et al. Echoprint: An open music identification service
Kamble et al. Novel Variable Length Energy Separation Algorithm Using Instantaneous Amplitude Features for Replay Detection.
Nguyen et al. Acoustic scene classification with mismatched recording devices using mixture of experts layer
Tsipas et al. Semi-supervised audio-driven TV-news speaker diarization using deep neural embeddings
Kawa et al. Attack agnostic dataset: Towards generalization and stabilization of audio deepfake detection
Thambi et al. Random forest algorithm for improving the performance of speech/non-speech detection
Ghasemzadeh Multi-layer architecture for efficient steganalysis of UnderMp3Cover in multi-encoder scenario
Mendes et al. Universal patterns in sound amplitudes of songs and music genres
Pandey et al. Cell-phone identification from audio recordings using PSD of speech-free regions
Banchhor et al. Musical instrument recognition using spectrogram and autocorrelation
KR20200099093A (ko) 비선형 잡음 감소 시스템
Blaszke et al. Determination of low-level audio descriptors of a musical instrument sound using neural network
Choi et al. Light-weight Frequency Information Aware Neural Network Architecture for Voice Spoofing Detection
CN109150320B (zh) 一种声波信号编码、解码方法及装置
Hrabina et al. Implementation of developed gunshot detection algorithm on TMS320C6713 processor
CN112581975A (zh) 基于信号混叠和双声道相关性的超声波语音指令防御方法
RU2436173C1 (ru) Способ обнаружения пауз в речевых сигналах и устройство его реализующее
Khonglah et al. Low frequency region of vocal tract information for speech/music classification
Alluri et al. Replay spoofing countermeasures using high spectro-temporal resolution features

Legal Events

Date Code Title Description
PLFP Fee payment

Year of fee payment: 2

PLSC Publication of the preliminary search report

Effective date: 20200313

PLFP Fee payment

Year of fee payment: 3

PLFP Fee payment

Year of fee payment: 4

PLFP Fee payment

Year of fee payment: 5

PLFP Fee payment

Year of fee payment: 6