JP2008015002A5 - - Google Patents
Download PDFInfo
- Publication number
- JP2008015002A5 JP2008015002A5 JP2006183131A JP2006183131A JP2008015002A5 JP 2008015002 A5 JP2008015002 A5 JP 2008015002A5 JP 2006183131 A JP2006183131 A JP 2006183131A JP 2006183131 A JP2006183131 A JP 2006183131A JP 2008015002 A5 JP2008015002 A5 JP 2008015002A5
- Authority
- JP
- Japan
- Prior art keywords
- acoustic signal
- frequency
- feature
- extracted
- time
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Claims (11)
前記時間周波数スペクトログラムの時間間隔ごとに、前記時間周波数スペクトログラムの所定の基準特徴部を抽出し、
抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの基準周波数とし、
対数軸上で予め定められた間隔の複数の周波数を、当該基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記時間間隔ごとの音響信号の特徴量を抽出する
ことを特徴とする音響信号特徴抽出方法。 An acoustic signal feature extraction method in an acoustic signal feature extraction device that extracts a feature amount of the acoustic signal based on a time-frequency spectrogram of an input acoustic signal,
For each time interval of the time frequency spectrogram, extract a predetermined reference feature of the time frequency spectrogram,
The extracted frequency of the predetermined reference feature is set as a reference frequency for each time interval,
Extracting a plurality of frequencies at predetermined intervals on the logarithmic axis with reference to the reference frequency, and extracting feature values of the acoustic signal for each time interval based on the extracted power spectrum value of each frequency. An acoustic signal feature extraction method.
であることを特徴とする請求項1に記載の音響信号特徴抽出方法。 The acoustic signal feature extraction according to claim 1, wherein the predetermined reference feature portion is a portion in which a power spectrum value becomes a maximum value among several time-frequency spectrograms continuous on a time axis. Method.
であることを特徴とする請求項1に記載の音響信号特徴抽出方法。 The acoustic signal feature extraction method according to claim 1, wherein the predetermined reference feature portion is a portion serving as a center of gravity of power spectrum values of several time-frequency spectrograms continuous on a time axis.
前記参照音響信号及び前記検索対象音響信号それぞれの時間周波数スペクトログラムから、前記時間周波数スペクトログラムの時間間隔ごとの所定の基準特徴部を抽出し、
前記参照音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記参照音響信号の基準周波数とし、
前記検索対象音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記検索対象音響信号の基準周波数とし、
対数軸上で予め定められた間隔の複数の周波数を、前記参照音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記参照音響信号の前記時間間隔ごとの音響信号の特徴量を抽出し、抽出した特徴量に基づいて前記参照音響信号の信号全体の時間長を一区間とする参照特徴量を算出し、
対数軸上で予め定められた間隔の複数の周波数を、前記検索対象音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記検索対象音響信号の前記時間間隔ごとの特徴量を抽出し、
抽出した前記検索対象音響信号の前記特徴量に基づいて前記区間ごとの区間特徴量を算出し、
算出した区間特徴量と、前記参照特徴量とに基づいて類似度を算出し、
算出した類似度に基づいて、前記参照音響信号の音に類似する前記検索対象音響信号の区間を検索し、検索により検出した前記参照音響信号の音に類似する前記検索対象音響信号の区間を出力する
ことを特徴とする音響信号検索方法。 An acoustic signal search method in an acoustic signal search device that searches an input search target acoustic signal for a section including a sound similar to the reference acoustic signal based on a reference acoustic signal,
From the time frequency spectrogram of each of the reference acoustic signal and the search target acoustic signal, a predetermined reference feature for each time interval of the time frequency spectrogram is extracted,
The frequency of the predetermined reference feature extracted from the time frequency spectrogram of the reference sound signal is set as the reference frequency of the reference sound signal for each time interval ,
The frequency of the predetermined reference feature extracted from time-frequency spectrogram of the search target sound signal and the reference frequency of the search target acoustic signal of each of the time intervals,
A plurality of frequencies at predetermined intervals on the logarithmic axis are extracted based on the reference frequency of the reference acoustic signal, and the sound for each time interval of the reference acoustic signal is extracted based on the extracted power spectrum value of each frequency. Extracting a feature quantity of the signal, calculating a reference feature quantity based on the extracted feature quantity, with a time length of the entire signal of the reference acoustic signal as one section,
A plurality of frequencies at predetermined intervals on the logarithmic axis are extracted with reference to the reference frequency of the search target acoustic signal, and for each time interval of the search target acoustic signal based on the extracted power spectrum value of each frequency Features of the
A section feature amount for each section is calculated based on the extracted feature amount of the search target acoustic signal,
A similarity is calculated based on the calculated section feature and the reference feature,
Based on the calculated similarity, the search target sound signal section similar to the sound of the reference sound signal is searched, and the search target sound signal section similar to the sound of the reference sound signal detected by the search is output. An acoustic signal search method characterized by:
前記時間周波数スペクトログラムの時間間隔ごとに、前記時間周波数スペクトログラムの所定の基準特徴部を抽出し、抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの基準周波数とする基準周波数検出手段と、
対数軸上で予め定められた間隔の複数の周波数を、当該基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記時間間隔ごとの音響信号の特徴量を抽出する特徴量抽出手段と、
を備えたことを特徴とする音響信号特徴抽出装置。 An acoustic signal feature extraction device that extracts a feature amount of the acoustic signal based on a time-frequency spectrogram of the input acoustic signal,
For each time interval of the time frequency spectrogram, a predetermined reference feature portion of the time frequency spectrogram is extracted, and a reference frequency detection unit that uses the extracted frequency of the predetermined reference feature portion as a reference frequency for each time interval;
A feature amount that extracts a plurality of frequencies at predetermined intervals on the logarithmic axis with reference to the reference frequency, and extracts a feature amount of the acoustic signal for each time interval based on the extracted power spectrum value of each frequency. Extraction means;
An acoustic signal feature extraction apparatus comprising:
前記参照音響信号及び前記検索対象音響信号それぞれの時間周波数スペクトログラムから、前記時間周波数スペクトログラムの時間間隔ごとの所定の基準特徴部を抽出し、前記参照音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記参照音響信号の基準周波数とし、前記検索対象音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記検索対象音響信号の基準周波数とする基準周波数検出手段と、
対数軸上で予め定められた間隔の複数の周波数を、前記参照音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記参照音響信号の前記時間間隔ごとの特徴量を抽出し、抽出した特徴量に基づいて前記参照音響信号の信号全体の時間長を一区間とする参照特徴量を算出し、対数軸上で予め定められた間隔の複数の周波数を、前記検索対象音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記検索対象音響信号の前記時間間隔ごとの特徴量を抽出する特徴抽出手段と、
前記特徴抽出手段が抽出した前記検索対象音響信号の前記特徴量に基づいて前記区間ごとの区間特徴量を算出し、算出した区間特徴量と、前記参照特徴量とに基づいて類似度を算出し、算出した類似度に基づいて、前記参照音響信号の音に類似する前記検索対象音響信号の区間を検索し、検索により検出した前記参照音響信号の音に類似する前記検索対象音響信号の区間を出力する類似度計算手段と、
を備えたことを特徴とする音響信号検索装置。 Based on a reference acoustic signal, an acoustic signal search device that searches an input search target acoustic signal for a section including a sound similar to the reference acoustic signal,
Extracting a predetermined standard feature for each time interval of the time frequency spectrogram from the time frequency spectrogram of each of the reference acoustic signal and the search target acoustic signal, and extracting the predetermined standard extracted from the time frequency spectrogram of the reference acoustic signal The frequency of the feature portion is set as a reference frequency of the reference acoustic signal for each time interval, and the frequency of the predetermined reference feature portion extracted from the time frequency spectrogram of the search target acoustic signal is set to the search target acoustic signal for each time interval. a reference frequency detecting means for the issue of the reference frequency,
A plurality of frequencies at predetermined intervals on the logarithmic axis are extracted based on the reference frequency of the reference acoustic signal, and the characteristics of the reference acoustic signal for each time interval based on the extracted power spectrum value of each frequency Extracting a quantity, calculating a reference feature quantity having a time length of the entire signal of the reference acoustic signal as one section based on the extracted feature quantity, and calculating a plurality of frequencies at predetermined intervals on a logarithmic axis, A feature extraction unit that extracts a reference frequency of a search target acoustic signal as a reference , and extracts a feature amount for each time interval of the search target acoustic signal based on a power spectrum value of each extracted frequency;
A section feature amount for each section is calculated based on the feature amount of the search target acoustic signal extracted by the feature extraction unit, and a similarity is calculated based on the calculated section feature amount and the reference feature amount. The search target acoustic signal section similar to the sound of the reference acoustic signal is searched based on the calculated similarity, and the search target acoustic signal section similar to the sound of the reference acoustic signal detected by the search is searched. A similarity calculation means to output;
An acoustic signal retrieval device comprising:
前記時間周波数スペクトログラムの時間間隔ごとに、前記時間周波数スペクトログラムの所定の基準特徴部を抽出するステップと、
抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの基準周波数とするステップと、
対数軸上で予め定められた間隔の複数の周波数を、当該基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記時間間隔ごとの音響信号の特徴量を抽出するステップと、
を実行させるための音響信号特徴抽出プログラム。 Based on the time-frequency spectrogram of the input acoustic signal, the computer of the acoustic signal feature extraction device that extracts the feature quantity of the acoustic signal,
Extracting predetermined reference features of the time-frequency spectrogram for each time interval of the time-frequency spectrogram;
Setting the extracted frequency of the predetermined reference feature as a reference frequency for each time interval;
Extracting a plurality of frequencies at predetermined intervals on the logarithmic axis with reference to the reference frequency, and extracting a feature quantity of the acoustic signal for each time interval based on the extracted power spectrum value of each frequency; ,
Acoustic signal feature extraction program for executing
参照音響信号及び検索対象音響信号の時間周波数スペクトログラムを入力するステップと、
前記参照音響信号及び前記検索対象音響信号それぞれの時間周波数スペクトログラムから、前記時間周波数スペクトログラムの時間間隔ごとの所定の基準特徴部を抽出するステップと、
前記参照音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記参照音響信号の基準周波数とし、前記検索対象音響信号の時間周波数スペクトログラムから抽出した前記所定の基準特徴部の周波数を前記時間間隔ごとの前記検索対象音響信号の基準周波数とするステップと、
対数軸上で予め定められた間隔の複数の周波数を、前記参照音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記参照音響信号の前記時間間隔ごとの特徴量を抽出し、抽出した特徴量に基づいて前記参照音響信号の信号全体の時間長を一区間とする参照特徴量を算出するステップと、
対数軸上で予め定められた間隔の複数の周波数を、前記検索対象音響信号の基準周波数を基準として抽出し、抽出した各周波数のパワースペクトル値に基づいて前記検索対象音響信号の前記時間間隔ごとの特徴量を抽出するステップと、
抽出した前記検索対象音響信号の前記特徴量に基づいて前記区間ごとの区間特徴量を算出するステップと、
前記類似度計算手段が、算出した区間特徴量と、前記参照特徴量とに基づいて類似度を算出するステップと、
算出した類似度に基づいて、前記参照音響信号の音に類似する前記検索対象音響信号の区間を検索し、検索により検出した前記参照音響信号の音に類似する前記検索対象音響信号の区間を出力するステップと、
を実行させるための音響信号検索プログラム。 Based on the reference acoustic signal, the computer of the acoustic signal retrieval device that retrieves the section including the sound similar to the reference acoustic signal from the input retrieval target acoustic signal,
Inputting a time frequency spectrogram of a reference acoustic signal and a search target acoustic signal;
Extracting a predetermined reference feature for each time interval of the time-frequency spectrogram from the time-frequency spectrogram of each of the reference sound signal and the search target sound signal;
The predetermined standard extracted from the time-frequency spectrogram of the search target acoustic signal, with the frequency of the predetermined reference feature extracted from the time-frequency spectrogram of the reference acoustic signal as the reference frequency of the reference acoustic signal for each time interval the method comprising the frequency of the feature and the reference frequency of the search target acoustic signal of each of the time intervals,
A plurality of frequencies at predetermined intervals on the logarithmic axis are extracted based on the reference frequency of the reference acoustic signal, and the characteristics of the reference acoustic signal for each time interval based on the extracted power spectrum value of each frequency Extracting a quantity, and calculating a reference feature quantity having a time length of the entire signal of the reference acoustic signal as one section based on the extracted feature quantity;
A plurality of frequencies at predetermined intervals on the logarithmic axis are extracted with reference to the reference frequency of the search target acoustic signal, and for each time interval of the search target acoustic signal based on the extracted power spectrum value of each frequency Extracting a feature quantity of
Calculating a section feature value for each section based on the extracted feature value of the extracted search target acoustic signal;
The similarity calculating means calculating a similarity based on the calculated section feature and the reference feature;
Based on the calculated similarity, the search target sound signal section similar to the sound of the reference sound signal is searched, and the search target sound signal section similar to the sound of the reference sound signal detected by the search is output. And steps to
Acoustic signal search program for executing
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006183131A JP4597919B2 (en) | 2006-07-03 | 2006-07-03 | Acoustic signal feature extraction method, extraction device, extraction program, recording medium recording the program, acoustic signal search method, search device, search program using the features, and recording medium recording the program |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2006183131A JP4597919B2 (en) | 2006-07-03 | 2006-07-03 | Acoustic signal feature extraction method, extraction device, extraction program, recording medium recording the program, acoustic signal search method, search device, search program using the features, and recording medium recording the program |
Publications (3)
Publication Number | Publication Date |
---|---|
JP2008015002A JP2008015002A (en) | 2008-01-24 |
JP2008015002A5 true JP2008015002A5 (en) | 2010-10-14 |
JP4597919B2 JP4597919B2 (en) | 2010-12-15 |
Family
ID=39072112
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP2006183131A Expired - Fee Related JP4597919B2 (en) | 2006-07-03 | 2006-07-03 | Acoustic signal feature extraction method, extraction device, extraction program, recording medium recording the program, acoustic signal search method, search device, search program using the features, and recording medium recording the program |
Country Status (1)
Country | Link |
---|---|
JP (1) | JP4597919B2 (en) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP5204904B2 (en) * | 2009-01-30 | 2013-06-05 | テレフオンアクチーボラゲット エル エム エリクソン(パブル) | Audio signal quality prediction |
US8930185B2 (en) | 2009-08-28 | 2015-01-06 | International Business Machines Corporation | Speech feature extraction apparatus, speech feature extraction method, and speech feature extraction program |
JP5561041B2 (en) * | 2010-09-06 | 2014-07-30 | 大日本印刷株式会社 | Relevant information retrieval device for acoustic data |
JP5462827B2 (en) * | 2011-03-28 | 2014-04-02 | 日本電信電話株式会社 | Specific acoustic signal containing section detecting device, method, and program |
JP2013117688A (en) * | 2011-12-05 | 2013-06-13 | Sony Corp | Sound processing device, sound processing method, program, recording medium, server device, sound replay device, and sound processing system |
CN108962231B (en) * | 2018-07-04 | 2021-05-28 | 武汉斗鱼网络科技有限公司 | Voice classification method, device, server and storage medium |
CN115696699A (en) * | 2022-09-28 | 2023-02-03 | 重庆长安汽车股份有限公司 | Atmosphere lamp rhythm processing method, device, equipment and medium |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1307992B1 (en) * | 2000-06-20 | 2007-08-15 | University of New Hampshire | Compression and decompression of audio files using a chaotic system |
DE10109648C2 (en) * | 2001-02-28 | 2003-01-30 | Fraunhofer Ges Forschung | Method and device for characterizing a signal and method and device for generating an indexed signal |
KR20040024870A (en) * | 2001-07-20 | 2004-03-22 | 그레이스노트 아이엔씨 | Automatic identification of sound recordings |
JP2004334160A (en) * | 2002-09-24 | 2004-11-25 | Matsushita Electric Ind Co Ltd | Characteristic amount extraction device |
-
2006
- 2006-07-03 JP JP2006183131A patent/JP4597919B2/en not_active Expired - Fee Related
Similar Documents
Publication | Publication Date | Title |
---|---|---|
JP2008015002A5 (en) | ||
JP2013222113A5 (en) | ||
WO2013080210A8 (en) | Method for extracting representative segments from music | |
US20130253924A1 (en) | Speech Conversation Support Apparatus, Method, and Program | |
JP2016508264A5 (en) | ||
CN107967922A (en) | A kind of music copyright recognition methods of feature based | |
SG11201810380VA (en) | Method, device, and apparatus for detecting disease probability, and computer-readable storage medium | |
CN104992712B (en) | It can identify music automatically at the method for spectrum | |
JP2015197436A5 (en) | Method and system for detecting events in a signal subject to periodic stationary background noise | |
JP2015508205A5 (en) | ||
US9997168B2 (en) | Method and apparatus for signal extraction of audio signal | |
JP2013231721A5 (en) | ||
BRPI0911440A2 (en) | method and device for recognizing a state of a noise generating machine to be investigated | |
CN101399035A (en) | Method and equipment for extracting beat from audio file | |
CN106528706B (en) | Music retrieval method and device | |
JP2012243033A5 (en) | ||
JP2014135543A5 (en) | Voice memo storage method related to schedule | |
Chen et al. | Estimating the voice source in noise. | |
Suman et al. | Algorithm for gunshot detection using mel-frequency cepstrum coefficients (MFCC) | |
JP5462827B2 (en) | Specific acoustic signal containing section detecting device, method, and program | |
CN102455324A (en) | DCT based method for extracting acoustical signal characteristics of grain and oil, and system thereof | |
CN105222941A (en) | A kind of high time resolution detection method and device thereof kicking down cigarette stress in cigarette ash process | |
JP2008288898A5 (en) | ||
JP5800974B1 (en) | Synonym determination device | |
CN110931046A (en) | Audio high-level semantic feature extraction method and system for overlapped sound event detection |