CN107506409B - Method for processing multi-audio data - Google Patents

Method for processing multi-audio data Download PDF

Info

Publication number
CN107506409B
CN107506409B CN201710673700.0A CN201710673700A CN107506409B CN 107506409 B CN107506409 B CN 107506409B CN 201710673700 A CN201710673700 A CN 201710673700A CN 107506409 B CN107506409 B CN 107506409B
Authority
CN
China
Prior art keywords
sound source
audio
data
frequency
sound
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710673700.0A
Other languages
Chinese (zh)
Other versions
CN107506409A (en
Inventor
王红娟
董毅
付宪瑞
王玉奎
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Financial Information Technology Co Ltd
Original Assignee
Inspur Financial Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Financial Information Technology Co Ltd filed Critical Inspur Financial Information Technology Co Ltd
Priority to CN201710673700.0A priority Critical patent/CN107506409B/en
Publication of CN107506409A publication Critical patent/CN107506409A/en
Application granted granted Critical
Publication of CN107506409B publication Critical patent/CN107506409B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/61Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/10Digital recording or reproducing
    • G11B20/10527Audio or video recording; Data buffering arrangements
    • G11B2020/10537Audio or video recording
    • G11B2020/10546Audio or video recording specifically adapted for audio data
    • G11B2020/10555Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • Software Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Stereophonic System (AREA)
  • Signal Processing For Digital Recording And Reproducing (AREA)

Abstract

The invention discloses a method for processing multi-audio data, which stores a multi-audio data in a single audio file and comprises the following steps: s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment; s2: setting sound source information and frequency related data for the acquired audio data; s3: setting the number of sound sources and the basic information of different sound sources in an audio file header; s4: the number of occurrences, the frequency, and the position and the duration of each occurrence of the sound source are set for each sound source. S5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream; s6: and writing the related sound source data into the frequency set by the encoder of the audio file. The multi-tone source file stored by the method is smaller in size than a traditional multi-file independent storage mode, and audio data indexes are easier to establish, so that the method has an important role in the related fields of audio recording, management and the like.

Description

Method for processing multi-audio data
Technical Field
The invention relates to a method for processing multi-audio data, and belongs to the technical field of electronic equipment.
Background
Audio data as a waveform data has two main important parameters in its conventional acquisition process: audio and volume. The audio frequency is often used as a main parameter for identifying the characteristics of the sound source, and the volume is an important characteristic for representing the sound intensity.
In the process of sound collection and coding, sound wave data of different sound sources are overlapped in an interactive mode on collection equipment, single audio files mixing various sound sources are finally generated, in the subsequent processing process, related processing algorithms often need to filter clutter data according to specific frequency characteristics and then can search data to be searched, and in the process, the difficulty and the accuracy of identification are often low. This is because, in the process of audio acquisition and storage, there is often an overlap of multiple sound sources, and the data of different sound sources are different in frequency, so that many times will overlap, and it is very difficult to filter out specific information in these overlapping frequencies. Especially, when the volume of other sound sources is higher than the volume of the target to be searched, the volume of the target to be searched is usually covered by the background volume and cannot be detected.
Therefore, in the process of searching the batch of audio files, a large amount of labor cost and time cost are required to be invested to find the target to be searched, and an effective automatic quick searching mode is difficult to find for replacing the target.
Disclosure of Invention
The present invention is directed to solve the above problems in the prior art, and to provide a method for processing multiple audio data.
The technical solution to the above object of the present invention is achieved by: a method for processing multi-audio data, comprising: one kind of multi-audio data is stored in a single audio file.
Preferably, the method comprises the steps of:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: the method comprises the steps of writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence to form a sound source 1| sound source 2| sound source 3.
Preferably, in step S3, the audio of different frequencies is regarded as different audio sources regardless of whether the capturing devices are the same.
Preferably, in step S4, the number of sound sources is the time period in which the audio of the frequency effectively appears, and the background data except for the frequency is ignored.
Preferably, in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index.
Preferably, in step S6, when a plurality of sets of sound source data exist in a certain frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
Preferably, in step S6, when a plurality of sets of sound source data exist in a certain frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
Preferably, a method for processing multi-audio data, which can store a multi-audio data in a single audio file; the method comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence of a sound source 1| sound source 2| sound source 3; in step S3, the audio of different frequencies is regarded as different sound sources regardless of whether the capturing devices are the same; in step S4, the sound source frequency is the time period in which the audio of the frequency effectively appears, and the background data except the frequency is ignored; in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index; when a plurality of groups of sound source data exist in a certain frequency band, writing the sound source data into the frequency band according to the sound source sequence, wherein the sound source data comprises a sound source 1 and a sound source 2 and a sound source 3; when a plurality of sets of sound source data exist in a certain frequency band, the sound source data are written in sequence according to the sound source sequence, and the sound source 1| sound source 2| sound source 3 is generated.
The technical scheme of the invention has the advantages that: the method can not only integrate audio data of multiple sound sources at will, store audio data of multiple sound sources and multiple frequencies in the same audio file at the same time, but also freely switch the sound sources in the playback process, freely switch the sound source data or mix and throw away any sound source, realize real-time sound mixing operation, and set a sound pattern characteristic retrieval index aiming at different sound source data, so that the retrieval process of the audio data is more efficient. The multi-tone source file stored by the method is smaller in size than a traditional multi-file independent storage mode, audio data indexes are easier to establish, the method plays an important role in related fields such as audio recording and management, and is suitable for industrial popularization and use.
Drawings
Fig. 1 is a schematic view of the spatial dimensions of different audio source data in the present invention.
Fig. 2 is a representation of the data stream of two sets of audio data of different frequencies in a multi-dimensional data space according to the present invention.
Detailed Description
Objects, advantages and features of the present invention will be illustrated and explained by the following non-limiting description of preferred embodiments. The embodiments are merely exemplary for applying the technical solutions of the present invention, and any technical solution formed by replacing or converting the equivalent thereof falls within the scope of the present invention claimed.
The invention discloses a processing method of multi-audio data, which stores a multi-audio data in a single audio file.
Specifically, the method for processing the multi-audio data comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information, frequency and other related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: the number of occurrences, the frequency, and the position and the duration of each occurrence of the sound source, etc. are set for each sound source. The audio of different frequencies is regarded as different sound sources no matter whether the acquisition equipment is the same or not. The sound source frequency is the time period when the audio of the frequency effectively appears, the background data except the frequency is ignored, and the sound source frequency is the unique identifier of the sound source so as to create an index; specifically, in this embodiment, the number, position, and duration are obtained in a specific acquisition process, the frequency range is a normal frequency range, the frequency is used to distinguish human voice, and the frequency range of human voice is 300Hz — 3400 Hz.
S5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: the method comprises the steps of writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence to form a sound source 1| sound source 2| sound source 3.
In this embodiment, the selection of the multiple sets of sound source data is not limited, and the selection is performed according to the actual needs in the working process. A certain frequency band refers to the voice of the same person, and the frequency is the same, unless high or low sound is intentionally emitted, the frequency of the voice of the same person is generally maintained in a frequency band, of course, the frequency here is not only the frequency of the audio itself, but also includes the concept of speech speed, i.e. how fast the voice is speaking.
Different from a common audio data storage mode, the sound source with the highest volume is always collected as valid data at the same time node, and the sound source with the lower volume is always submerged or only mixed in gaps with different frequencies to become background sound. The sound source data storage mode provided by the multi-audio data processing method can use a multi-dimensional means to freely match audio data of multiple sound sources and multiple frequencies, different sound sources or the same sound source audio can be separated in different dimensions, and the audio data can be mixed in real time during playback, so that the method has great flexibility. Fig. 1 is a schematic spatial diagram of different audio source data in dimensions, where x is an audio data stream and can also be understood as time, y is audio, z is different audio, and the audio data streams and the audio volumes are separately acquired according to the audio and separated in the y dimension. The audio frequencies of different frequencies are considered as different sound sources, and the different sound sources or the same sound source audio can be separated in different dimensions.
Two groups of audio data with different sound sources and different frequencies exist in the form of independent data streams in a multidimensional data space, and when the audio data are coded into an independent audio file, the storage mode is as shown in fig. 2: the frequencies and the sound sources of the audio 1 and the audio 2 are different, and specifically, the frequency, the starting position and the length of each time and the frequency of the audio 1 are different from those of the audio 2.
Unlike the existing audio data storage method, which is a combination of audio format header + audio data, storing multi-source audio in this way makes it difficult to separate the mixed audio data. The multidimensional audio coding and storing mode proposed by the processing method of the multi-audio data is based on the sound source and the frequency, and the audio information of each sound source and frequency is dispersedly stored in the same audio data stream, so that the method has extremely high efficiency when the audio data index is created.
By the method, the audio data with multiple sound sources and multiple frequencies can be stored in the same audio file at the same time, the sound sources can be freely switched in the playback process, real-time sound mixing operation is realized, and the voiceprint characteristic retrieval index can be set for different sound source data, so that the retrieval process of the audio data is more efficient.
The invention has a plurality of embodiments, and all technical solutions formed by adopting equivalent transformation or equivalent transformation are within the protection scope of the invention.

Claims (1)

1. A method for processing multi-audio data, comprising: storing a plurality of audio data in a single audio file; the method comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence of a sound source 1| sound source 2| sound source 3;
in step S3, the audio of different frequencies is regarded as different sound sources regardless of whether the capturing devices are the same;
in step S4, the sound source frequency is the time period in which the audio of the frequency effectively appears, and the background data except the frequency is ignored; in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index; in step S6, when a plurality of sets of sound source data exist in one frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
CN201710673700.0A 2017-08-09 2017-08-09 Method for processing multi-audio data Active CN107506409B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710673700.0A CN107506409B (en) 2017-08-09 2017-08-09 Method for processing multi-audio data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710673700.0A CN107506409B (en) 2017-08-09 2017-08-09 Method for processing multi-audio data

Publications (2)

Publication Number Publication Date
CN107506409A CN107506409A (en) 2017-12-22
CN107506409B true CN107506409B (en) 2021-01-08

Family

ID=60689589

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710673700.0A Active CN107506409B (en) 2017-08-09 2017-08-09 Method for processing multi-audio data

Country Status (1)

Country Link
CN (1) CN107506409B (en)

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3855862B2 (en) * 2002-04-01 2006-12-13 ソニー株式会社 Editing method and apparatus
US7869616B2 (en) * 2005-01-12 2011-01-11 Logitech International, S.A. Active crossover and wireless interface for use with multi-driver in-ear monitors
CN101001485A (en) * 2006-10-23 2007-07-18 中国传媒大学 Finite sound source multi-channel sound field system and sound field analogy method
CN102867514B (en) * 2011-07-07 2016-04-13 腾讯科技(北京)有限公司 A kind of sound mixing method and device sound mixing
CN105981411B (en) * 2013-11-27 2018-11-30 Dts(英属维尔京群岛)有限公司 The matrix mixing based on multi-component system for the multichannel audio that high sound channel counts
CN104064191B (en) * 2014-06-10 2017-12-15 北京音之邦文化科技有限公司 Sound mixing method and device
CN106486128B (en) * 2016-09-27 2021-10-22 腾讯科技(深圳)有限公司 Method and device for processing double-sound-source audio data

Also Published As

Publication number Publication date
CN107506409A (en) 2017-12-22

Similar Documents

Publication Publication Date Title
CN109672922B (en) Game video editing method and device
CN109982128B (en) Video bullet screen generation method and device, storage medium and electronic device
CN105677716B (en) A kind of computer data acquiring processing analysis system
CN105868397A (en) Method and device for determining song
CN101656094A (en) Data storage method and storage device
DE60120417D1 (en) METHOD FOR SEARCHING IN AN AUDIO DATABASE
CN108920611B (en) Article generation method, device, equipment and storage medium
CN102799605A (en) Method and system for monitoring advertisement broadcast
WO2003098479A3 (en) Managing search expressions in a database system
CN104599692A (en) Recording method and device and recording content searching method and device
WO2008043082A3 (en) Time series search engine
SG140510A1 (en) System and method for database indexing, searching and data retrieval
CN104933175B (en) Performance data correlation analysis method and performance monitoring system
CN110650374A (en) Clipping method, electronic device, and computer-readable storage medium
CN103853836A (en) Music retrieval method and system based on music fingerprint characteristic
KR20120090101A (en) Digital video fast matching system using key-frame index method
CN105069153A (en) Patent analysis system
CN107506409B (en) Method for processing multi-audio data
CN109284763A (en) A kind of method and server generating participle training data
DE102019123005A1 (en) SYSTEM AND METHOD FOR DISPLAYING THE OBJECT MOTION SCHEME
CN101770474A (en) History searching record-based searching method and device
CN103139272A (en) Method of obtaining online time within selected time period and device using the same
CN104978380A (en) Audio frequency processing method and device
CN104125334A (en) Information processing method and electronic equipment
CN104778202B (en) The analysis method and system of event evolutionary process based on keyword

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant