GB2523973A - Audio analysis system and method using audio segment characterisation - Google Patents

Audio analysis system and method using audio segment characterisation

Info

Publication number
GB2523973A
Authority
GB
United Kingdom
Prior art keywords
feature data
audio signal
input audio
method
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
GB1512636.0A
Other versions
GB201512636D0 (en)
GB2523973B (en)
Inventor
Michela Magas
Cyril Laurier
Original Assignee
Michela Magas
Cyril Laurier
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2012-12-19
Filing date
2013-12-19
Publication date
2015-09-09
Priority to GBGB1222951.4A (GB201222951D0)
Priority to GB201312399A (GB201312399D0)
Application filed by Michela Magas, Cyril Laurier
Priority to PCT/GB2013/053362 (WO2014096832A1)
Publication of GB201512636D0
Publication of GB2523973A
Application granted
Publication of GB2523973B
Application status is Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H1/00 Details of electrophonic musical instruments
    • G10H1/0008 Associated control or indicating means
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2210/00 Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/041 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal based on mfcc [mel-frequency spectral coefficients]
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2210/00 Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
    • G10H2210/031 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
    • G10H2210/061 Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of musical phrases, isolation of musically relevant segments, e.g. musical thumbnail generation, or for temporal structure analysis of a musical piece, e.g. determination of the movement sequence of a musical work
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2240/00 Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/075 Musical metadata derived from musical analysis or for use in electrophonic musical instruments
    • G10H2240/081 Genre classification, i.e. descriptive metadata for classification or selection of musical pieces according to style
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2240/00 Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/075 Musical metadata derived from musical analysis or for use in electrophonic musical instruments
    • G10H2240/085 Mood, i.e. generation, detection or selection of a particular emotional content or atmosphere in a musical piece
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2240/00 Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
    • G10H2240/121 Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
    • G10H2240/131 Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
    • G10H2240/141 Library retrieval matching, i.e. any of the steps of matching an inputted segment or phrase with musical database contents, e.g. query by humming, singing or playing; the steps may include, e.g. musical analysis of the input, musical feature extraction, query formulation, or details of the retrieval process
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10H ELECTROPHONIC MUSICAL INSTRUMENTS
    • G10H2250/00 Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
    • G10H2250/131 Mathematical functions for musical analysis, processing, synthesis or composition
    • G10H2250/215 Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
    • G10H2250/235 Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]

Abstract

A method of matching an input audio signal to one or more audio segments within a plurality of audio segments, the method comprising: receiving the input audio signal; processing the input audio signal to determine structural parameter feature data related to the received input audio signal; analysing the determined structural parameter feature data to extract semantic feature data; comparing the feature data of the input audio signal to pre-processed feature data relating to the plurality of audio segments in order to match one or more audio segments within a similarity threshold of the input audio signal; and outputting a search result on the basis of the matched one or more audio segments, wherein the semantic feature data is extracted from the structural parameter feature data using a supervised learning technique.
GB1512636.0A 2012-12-19 2013-12-19 Audio analysis system and method using audio segment characterisation Active GB2523973B (en)
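
The steps named in the abstract (structural feature extraction, supervised extraction of semantic features, thresholded comparison against pre-processed segments) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the two structural features (RMS energy and zero-crossing rate), the nearest-centroid classifier standing in for the supervised learning technique, the labels "calm" and "energetic", the rule that a match must share the semantic label, and all function and segment names.

```python
import math
from dataclasses import dataclass


def tone(freq, amp, n=800, rate=8000):
    """Synthetic test signal (hypothetical stand-in for real audio)."""
    return [amp * math.sin(2 * math.pi * freq * i / rate) for i in range(n)]


def structural_features(samples):
    """Assumed structural parameters: (RMS energy, zero-crossing rate)."""
    n = len(samples)
    rms = math.sqrt(sum(s * s for s in samples) / n)
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return (rms, crossings / (n - 1))


def train_centroids(labelled):
    """Supervised step (assumed): average the feature vectors of each label."""
    sums = {}
    for features, label in labelled:
        total, count = sums.get(label, ((0.0,) * len(features), 0))
        sums[label] = (tuple(t + f for t, f in zip(total, features)), count + 1)
    return {label: tuple(t / c for t in total) for label, (total, c) in sums.items()}


def semantic_label(features, centroids):
    """Assign the label whose centroid is nearest to the structural features."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


@dataclass
class Segment:
    name: str
    features: tuple  # pre-processed structural features
    label: str       # pre-processed semantic label


def preprocess(name, samples, centroids):
    feats = structural_features(samples)
    return Segment(name, feats, semantic_label(feats, centroids))


def match(samples, database, centroids, threshold):
    """Return names of segments within the similarity threshold, nearest first."""
    feats = structural_features(samples)
    label = semantic_label(feats, centroids)
    hits = [
        (math.dist(feats, seg.features), seg.name)
        for seg in database
        if seg.label == label and math.dist(feats, seg.features) <= threshold
    ]
    return [name for _, name in sorted(hits)]


# Labelled training examples for the supervised step (made-up data).
centroids = train_centroids([
    (structural_features(tone(100, 0.2)), "calm"),
    (structural_features(tone(150, 0.3)), "calm"),
    (structural_features(tone(1900, 0.9)), "energetic"),
    (structural_features(tone(1500, 0.8)), "energetic"),
])

# Pre-processed plurality of audio segments (made-up data).
database = [
    preprocess("ambient_pad", tone(120, 0.25), centroids),
    preprocess("drum_loop", tone(1800, 0.85), centroids),
    preprocess("soft_piano", tone(200, 0.5), centroids),
]

result = match(tone(110, 0.22), database, centroids, threshold=0.1)
print(result)
```

In this sketch the semantic label acts as a coarse filter and the structural distance as the fine similarity measure; a real system would likely use richer descriptors and a combined distance, but the patent record above does not specify either.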

Priority Applications (3)

Application Number Publication Number Priority Date Filing Date Title
GBGB1222951.4A GB201222951D0 (en) 2012-12-19 2012-12-19 Audio analysis system and method
GB201312399A GB201312399D0 (en) 2013-07-10 2013-07-10 Audio analysis system and method
PCT/GB2013/053362 WO2014096832A1 (en) 2012-12-19 2013-12-19 Audio analysis system and method using audio segment characterisation

Publications (3)

Publication Number Publication Date
GB201512636D0 (en) 2015-08-26
GB2523973A (en) 2015-09-09
GB2523973B (en) 2017-08-02

Family

ID=49998568

Family Applications (1)

Application Number Status Publication Number Priority Date Filing Date Title
GB1512636.0A Active GB2523973B (en) 2012-12-19 2013-12-19 Audio analysis system and method using audio segment characterisation

Country Status (2)

Country Link
GB (1) GB2523973B (en)
WO (1) WO2014096832A1 (en)

Families Citing this family (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106663110A (en) * 2014-06-29 2017-05-10 Google Inc. Derivation of probabilistic score for audio sequence alignment
US10372757B2 (en) 2015-05-19 2019-08-06 Spotify Ab Search media content based upon tempo
US10055413B2 (en) 2015-05-19 2018-08-21 Spotify Ab Identifying media content
EP3340238A4 (en) * 2015-05-25 2019-06-05 Guangzhou Kugou Computer Technology Co., Ltd. Audio processing method and apparatus, and terminal
US20180129659A1 (en) * 2016-06-09 2018-05-10 Spotify Ab Identifying media content
US10194022B2 (en) 2016-07-05 2019-01-29 Dialogtech Inc. System and method for automatically detecting undesired calls

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060075886A1 (en) * 2004-10-08 2006-04-13 Markus Cremer Apparatus and method for generating an encoded rhythmic pattern
US20070240557A1 (en) * 2006-04-12 2007-10-18 Whitman Brian A Understanding Music
US20080300702A1 (en) * 2007-05-29 2008-12-04 Universitat Pompeu Fabra Music similarity systems and methods using descriptors

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100749045B1 (en) * 2006-01-26 2007-08-13 Samsung Electronics Co., Ltd. Method and apparatus for searching similar music using summary of music content
TW201022968A (en) * 2008-12-10 2010-06-16 Univ Nat Taiwan A multimedia searching system, a method of building the system and associate searching method thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
CASEY M A ET AL: "Content-Based Music Information Retrieval: Current Directions and Future Challenges", PROCEEDINGS OF THE IEEE, IEEE. NEW YORK, US, vol. 96, no. 4, 2 April 2008 (2008-04-02), pages 668-696, XP011206028, ISSN: 0018-9219 *
Michela Magas: "Michela Magas on music search and discovery - YouTube", BIG Awards at Ravensbourne College, Greenwich, London, 6 March 2012 (2012-03-06), XP055107138, Retrieved from the Internet: URL:http://www.youtube.com/watch?v=aCcJ2r0DPyI [retrieved on 2014-03-12] *

Also Published As

Publication number Publication date
GB201512636D0 (en) 2015-08-26
WO2014096832A1 (en) 2014-06-26
GB2523973B (en) 2017-08-02

Similar Documents

Publication Publication Date Title
WO2013006329A3 (en) Automated facial detection and eye tracking techniques implemented in commercial and consumer environments
WO2014062688A3 (en) Multi-mode audio recognition and data encoding/decoding
WO2011143633A3 (en) Systems and methods for object recognition using a large database
MX2015017625A (en) Adaptive event recognition.
IN2014MN01753A (en) Engagement dependent gesture recognition
WO2012138917A3 (en) Gesture-activated input using audio recognition
WO2012005970A3 (en) Intervalgram representation of audio for melody recognition
WO2014031618A3 (en) Data relationships storage platform
WO2013009578A3 (en) Systems and methods for speech command processing
EP2355093A3 (en) Multi-dimensional disambiguation of voice commands
MX2015003995A (en) Geometrical presentation of fracture planes.
WO2014004536A3 (en) Voice-based image tagging and searching
BR112015020150A2 (en) apparatus for generating a speech signal, and method for generating a speech signal
GB2476869B (en) System and process for detecting, tracking and counting human objects of interest
MX2015009491A (en) User authentication method and apparatus based on audio and video data.
WO2012167056A3 (en) System and method for non-signature based detection of malicious processes
NZ629522A (en) System and method for fingerprinting datasets
IN2014CH00781A (en) System for speech keyword detection and associated method
WO2012070840A3 (en) Apparatus and method for consensus search
WO2014140816A3 (en) Apparatus and method for performing actions based on captured image data
MX359781B (en) Private information hiding method and device.
WO2014169269A4 (en) Virtual teller systems and methods
MX2011007596A (en) Information processing device, information processing method, and information processing program.
WO2013134106A3 (en) Device for extracting information from a dialog
RU2014112242A (en) Method of analysis of tonality of text data