GB2523973A - Audio analysis system and method using audio segment characterisation - Google Patents
Audio analysis system and method using audio segment characterisation Download PDFInfo
- Publication number
- GB2523973A GB2523973A GB1512636.0A GB201512636A GB2523973A GB 2523973 A GB2523973 A GB 2523973A GB 201512636 A GB201512636 A GB 201512636A GB 2523973 A GB2523973 A GB 2523973A
- Authority
- GB
- United Kingdom
- Prior art keywords
- feature data
- audio signal
- input audio
- audio
- segments
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H1/00—Details of electrophonic musical instruments
- G10H1/0008—Associated control or indicating means
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G10H2210/041—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal based on mfcc [mel -frequency spectral coefficients]
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2210/00—Aspects or methods of musical processing having intrinsic musical character, i.e. involving musical theory or musical parameters or relying on musical knowledge, as applied in electrophonic musical tools or instruments
- G10H2210/031—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal
- G10H2210/061—Musical analysis, i.e. isolation, extraction or identification of musical elements or musical parameters from a raw acoustic signal or from an encoded audio signal for extraction of musical phrases, isolation of musically relevant segments, e.g. musical thumbnail generation, or for temporal structure analysis of a musical piece, e.g. determination of the movement sequence of a musical work
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/075—Musical metadata derived from musical analysis or for use in electrophonic musical instruments
- G10H2240/081—Genre classification, i.e. descriptive metadata for classification or selection of musical pieces according to style
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/075—Musical metadata derived from musical analysis or for use in electrophonic musical instruments
- G10H2240/085—Mood, i.e. generation, detection or selection of a particular emotional content or atmosphere in a musical piece
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2240/00—Data organisation or data communication aspects, specifically adapted for electrophonic musical tools or instruments
- G10H2240/121—Musical libraries, i.e. musical databases indexed by musical parameters, wavetables, indexing schemes using musical parameters, musical rule bases or knowledge bases, e.g. for automatic composing methods
- G10H2240/131—Library retrieval, i.e. searching a database or selecting a specific musical piece, segment, pattern, rule or parameter set
- G10H2240/141—Library retrieval matching, i.e. any of the steps of matching an inputted segment or phrase with musical database contents, e.g. query by humming, singing or playing; the steps may include, e.g. musical analysis of the input, musical feature extraction, query formulation, or details of the retrieval process
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10H—ELECTROPHONIC MUSICAL INSTRUMENTS; INSTRUMENTS IN WHICH THE TONES ARE GENERATED BY ELECTROMECHANICAL MEANS OR ELECTRONIC GENERATORS, OR IN WHICH THE TONES ARE SYNTHESISED FROM A DATA STORE
- G10H2250/00—Aspects of algorithms or signal processing methods without intrinsic musical character, yet specifically adapted for or used in electrophonic musical processing
- G10H2250/131—Mathematical functions for musical analysis, processing, synthesis or composition
- G10H2250/215—Transforms, i.e. mathematical transforms into domains appropriate for musical signal processing, coding or compression
- G10H2250/235—Fourier transform; Discrete Fourier Transform [DFT]; Fast Fourier Transform [FFT]
Landscapes
- Physics & Mathematics (AREA)
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
A method of matching an input audio signal to one or more audio segments within a plurality of audio segments, the method comprising: receiving the input audio signal; processing the input audio signal to determine structural parameter feature data related to the received input audio signal; analysing the determined structural parameter feature data to extract semantic feature data; comparing the feature data of the input audio signal to pre-processed feature data relating to the plurality of audio segments in order to match one or more audio segments within a similarity threshold of the input audio signal; outputting a search result on the basis of the matched one or more audio segments wherein semantic feature data is extracted from the structural parameter data using a supervised learning technique.
Applications Claiming Priority (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
GBGB1222951.4A GB201222951D0 (en) | 2012-12-19 | 2012-12-19 | Audio analysis system and method |
GB201312399A GB201312399D0 (en) | 2013-07-10 | 2013-07-10 | Audio analysis system and method |
PCT/GB2013/053362 WO2014096832A1 (en) | 2012-12-19 | 2013-12-19 | Audio analysis system and method using audio segment characterisation |
Publications (3)
Publication Number | Publication Date |
---|---|
GB201512636D0 GB201512636D0 (en) | 2015-08-26 |
GB2523973A true GB2523973A (en) | 2015-09-09 |
GB2523973B GB2523973B (en) | 2017-08-02 |
Family
ID=49998568
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
GB1512636.0A Expired - Fee Related GB2523973B (en) | 2012-12-19 | 2013-12-19 | Audio analysis system and method using audio segment characterisation |
Country Status (2)
Country | Link |
---|---|
GB (1) | GB2523973B (en) |
WO (1) | WO2014096832A1 (en) |
Families Citing this family (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106663110B (en) * | 2014-06-29 | 2020-09-15 | 谷歌有限责任公司 | Derivation of probability scores for audio sequence alignment |
US10055413B2 (en) | 2015-05-19 | 2018-08-21 | Spotify Ab | Identifying media content |
US10372757B2 (en) | 2015-05-19 | 2019-08-06 | Spotify Ab | Search media content based upon tempo |
JP6586514B2 (en) * | 2015-05-25 | 2019-10-02 | ▲広▼州酷狗▲計▼算机科技有限公司 | Audio processing method, apparatus and terminal |
WO2017214411A1 (en) | 2016-06-09 | 2017-12-14 | Tristan Jehan | Search media content based upon tempo |
WO2017214408A1 (en) * | 2016-06-09 | 2017-12-14 | Tristan Jehan | Identifying media content |
US10194022B2 (en) * | 2016-07-05 | 2019-01-29 | Dialogtech Inc. | System and method for automatically detecting undesired calls |
US20230073174A1 (en) * | 2021-07-02 | 2023-03-09 | Brainfm, Inc. | Neurostimulation Systems and Methods |
CN114205677B (en) * | 2021-11-30 | 2022-10-14 | 浙江大学 | Short video automatic editing method based on prototype video |
CN114928755B (en) * | 2022-05-10 | 2023-10-20 | 咪咕文化科技有限公司 | Video production method, electronic equipment and computer readable storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060075886A1 (en) * | 2004-10-08 | 2006-04-13 | Markus Cremer | Apparatus and method for generating an encoded rhythmic pattern |
US20070240557A1 (en) * | 2006-04-12 | 2007-10-18 | Whitman Brian A | Understanding Music |
US20080300702A1 (en) * | 2007-05-29 | 2008-12-04 | Universitat Pompeu Fabra | Music similarity systems and methods using descriptors |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
KR100749045B1 (en) * | 2006-01-26 | 2007-08-13 | 삼성전자주식회사 | Method and apparatus for searching similar music using summary of music content |
TW201022968A (en) * | 2008-12-10 | 2010-06-16 | Univ Nat Taiwan | A multimedia searching system, a method of building the system and associate searching method thereof |
-
2013
- 2013-12-19 GB GB1512636.0A patent/GB2523973B/en not_active Expired - Fee Related
- 2013-12-19 WO PCT/GB2013/053362 patent/WO2014096832A1/en active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060075886A1 (en) * | 2004-10-08 | 2006-04-13 | Markus Cremer | Apparatus and method for generating an encoded rhythmic pattern |
US20070240557A1 (en) * | 2006-04-12 | 2007-10-18 | Whitman Brian A | Understanding Music |
US20080300702A1 (en) * | 2007-05-29 | 2008-12-04 | Universitat Pompeu Fabra | Music similarity systems and methods using descriptors |
Non-Patent Citations (2)
Title |
---|
CASEY M A ET AL: "Content-Based Music Information Retrieval: Current Directions and Future Challenges", PROCEEDINGS OF THE IEEE, IEEE. NEW YORK, US, vol. 96, no. 4, 2 April 2008 (2008-04-02), pages 668-696, XP011206028, ISSN: 0018-9219 * |
Michela Magas: "Michela Magas on music search and discovery -YouTube", BIG Awards' at Ravensbourne College, Greenwich, London, 6 March 2012 (2012-03-06), XP055107138, Internet Retrieved from the Internet: URL:http://www.youtube.com/watch?v=aCcJ2r0DPyI [retrieved on 2014-03-12] * |
Also Published As
Publication number | Publication date |
---|---|
GB2523973B (en) | 2017-08-02 |
WO2014096832A1 (en) | 2014-06-26 |
GB201512636D0 (en) | 2015-08-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB2523973A (en) | Audio analysis system and method using audio segment characterisation | |
MX2016004667A (en) | Template construction method and apparatus, and information recognition method and apparatus. | |
MX367096B (en) | Discriminating ambiguous expressions to enhance user experience. | |
EP2863309A3 (en) | Contextual graph matching based anomaly detection | |
WO2013134641A3 (en) | Recognizing speech in multiple languages | |
GB2549875A (en) | Automated content classification/filtering | |
IN2014MU00919A (en) | ||
WO2017007035A8 (en) | Method for distinguishing one or more components of signal | |
IN2014DN10400A (en) | ||
MX340429B (en) | System and method for address matching. | |
WO2015138497A3 (en) | Systems and methods for rapid data analysis | |
MX2021015008A (en) | Cancer detection systems and methods. | |
WO2013185109A3 (en) | Recognizing textual identifiers within words | |
IN2014DN08472A (en) | ||
WO2014102548A3 (en) | Search system and corresponding method | |
EE201500014A (en) | Method and device for impedance analysis with binary excitation | |
GB201212783D0 (en) | A speech processing system | |
WO2013118143A8 (en) | Social media data analysis system and method | |
WO2014022172A3 (en) | Information classification based on product recognition | |
WO2014207644A3 (en) | Method and system for grading a computer program | |
TW201612549A (en) | Apparatus, system and method for space status detection based on an acoustic signal | |
MX2011012642A (en) | Method and apparatus for performing a cross-correlation. | |
GB2550777A (en) | Classification and storage of documents | |
MX2014010599A (en) | Methods and computing systems for processing data. | |
MX2015012826A (en) | Contextual socially aware local search. |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PCNP | Patent ceased through non-payment of renewal fee |
Effective date: 20201219 |