CN107506409B - Method for processing multi-audio data - Google Patents
Method for processing multi-audio data Download PDFInfo
- Publication number
- CN107506409B CN107506409B CN201710673700.0A CN201710673700A CN107506409B CN 107506409 B CN107506409 B CN 107506409B CN 201710673700 A CN201710673700 A CN 201710673700A CN 107506409 B CN107506409 B CN 107506409B
- Authority
- CN
- China
- Prior art keywords
- sound source
- audio
- data
- frequency
- sound
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 238000013500 data storage Methods 0.000 description 3
- 238000003672 processing method Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000007726 management method Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/61—Indexing; Data structures therefor; Storage structures
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
-
- G—PHYSICS
- G11—INFORMATION STORAGE
- G11B—INFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
- G11B20/00—Signal processing not specific to the method of recording or reproducing; Circuits therefor
- G11B20/10—Digital recording or reproducing
- G11B20/10527—Audio or video recording; Data buffering arrangements
- G11B2020/10537—Audio or video recording
- G11B2020/10546—Audio or video recording specifically adapted for audio data
- G11B2020/10555—Audio or video recording specifically adapted for audio data wherein the frequency, the amplitude, or other characteristics of the audio signal is taken into account
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- Software Systems (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Stereophonic System (AREA)
- Signal Processing For Digital Recording And Reproducing (AREA)
Abstract
The invention discloses a method for processing multi-audio data, which stores a multi-audio data in a single audio file and comprises the following steps: s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment; s2: setting sound source information and frequency related data for the acquired audio data; s3: setting the number of sound sources and the basic information of different sound sources in an audio file header; s4: the number of occurrences, the frequency, and the position and the duration of each occurrence of the sound source are set for each sound source. S5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream; s6: and writing the related sound source data into the frequency set by the encoder of the audio file. The multi-tone source file stored by the method is smaller in size than a traditional multi-file independent storage mode, and audio data indexes are easier to establish, so that the method has an important role in the related fields of audio recording, management and the like.
Description
Technical Field
The invention relates to a method for processing multi-audio data, and belongs to the technical field of electronic equipment.
Background
Audio data as a waveform data has two main important parameters in its conventional acquisition process: audio and volume. The audio frequency is often used as a main parameter for identifying the characteristics of the sound source, and the volume is an important characteristic for representing the sound intensity.
In the process of sound collection and coding, sound wave data of different sound sources are overlapped in an interactive mode on collection equipment, single audio files mixing various sound sources are finally generated, in the subsequent processing process, related processing algorithms often need to filter clutter data according to specific frequency characteristics and then can search data to be searched, and in the process, the difficulty and the accuracy of identification are often low. This is because, in the process of audio acquisition and storage, there is often an overlap of multiple sound sources, and the data of different sound sources are different in frequency, so that many times will overlap, and it is very difficult to filter out specific information in these overlapping frequencies. Especially, when the volume of other sound sources is higher than the volume of the target to be searched, the volume of the target to be searched is usually covered by the background volume and cannot be detected.
Therefore, in the process of searching the batch of audio files, a large amount of labor cost and time cost are required to be invested to find the target to be searched, and an effective automatic quick searching mode is difficult to find for replacing the target.
Disclosure of Invention
The present invention is directed to solve the above problems in the prior art, and to provide a method for processing multiple audio data.
The technical solution to the above object of the present invention is achieved by: a method for processing multi-audio data, comprising: one kind of multi-audio data is stored in a single audio file.
Preferably, the method comprises the steps of:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: the method comprises the steps of writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence to form a sound source 1| sound source 2| sound source 3.
Preferably, in step S3, the audio of different frequencies is regarded as different audio sources regardless of whether the capturing devices are the same.
Preferably, in step S4, the number of sound sources is the time period in which the audio of the frequency effectively appears, and the background data except for the frequency is ignored.
Preferably, in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index.
Preferably, in step S6, when a plurality of sets of sound source data exist in a certain frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
Preferably, in step S6, when a plurality of sets of sound source data exist in a certain frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
Preferably, a method for processing multi-audio data, which can store a multi-audio data in a single audio file; the method comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence of a sound source 1| sound source 2| sound source 3; in step S3, the audio of different frequencies is regarded as different sound sources regardless of whether the capturing devices are the same; in step S4, the sound source frequency is the time period in which the audio of the frequency effectively appears, and the background data except the frequency is ignored; in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index; when a plurality of groups of sound source data exist in a certain frequency band, writing the sound source data into the frequency band according to the sound source sequence, wherein the sound source data comprises a sound source 1 and a sound source 2 and a sound source 3; when a plurality of sets of sound source data exist in a certain frequency band, the sound source data are written in sequence according to the sound source sequence, and the sound source 1| sound source 2| sound source 3 is generated.
The technical scheme of the invention has the advantages that: the method can not only integrate audio data of multiple sound sources at will, store audio data of multiple sound sources and multiple frequencies in the same audio file at the same time, but also freely switch the sound sources in the playback process, freely switch the sound source data or mix and throw away any sound source, realize real-time sound mixing operation, and set a sound pattern characteristic retrieval index aiming at different sound source data, so that the retrieval process of the audio data is more efficient. The multi-tone source file stored by the method is smaller in size than a traditional multi-file independent storage mode, audio data indexes are easier to establish, the method plays an important role in related fields such as audio recording and management, and is suitable for industrial popularization and use.
Drawings
Fig. 1 is a schematic view of the spatial dimensions of different audio source data in the present invention.
Fig. 2 is a representation of the data stream of two sets of audio data of different frequencies in a multi-dimensional data space according to the present invention.
Detailed Description
Objects, advantages and features of the present invention will be illustrated and explained by the following non-limiting description of preferred embodiments. The embodiments are merely exemplary for applying the technical solutions of the present invention, and any technical solution formed by replacing or converting the equivalent thereof falls within the scope of the present invention claimed.
The invention discloses a processing method of multi-audio data, which stores a multi-audio data in a single audio file.
Specifically, the method for processing the multi-audio data comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information, frequency and other related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: the number of occurrences, the frequency, and the position and the duration of each occurrence of the sound source, etc. are set for each sound source. The audio of different frequencies is regarded as different sound sources no matter whether the acquisition equipment is the same or not. The sound source frequency is the time period when the audio of the frequency effectively appears, the background data except the frequency is ignored, and the sound source frequency is the unique identifier of the sound source so as to create an index; specifically, in this embodiment, the number, position, and duration are obtained in a specific acquisition process, the frequency range is a normal frequency range, the frequency is used to distinguish human voice, and the frequency range of human voice is 300Hz — 3400 Hz.
S5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: the method comprises the steps of writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence to form a sound source 1| sound source 2| sound source 3.
In this embodiment, the selection of the multiple sets of sound source data is not limited, and the selection is performed according to the actual needs in the working process. A certain frequency band refers to the voice of the same person, and the frequency is the same, unless high or low sound is intentionally emitted, the frequency of the voice of the same person is generally maintained in a frequency band, of course, the frequency here is not only the frequency of the audio itself, but also includes the concept of speech speed, i.e. how fast the voice is speaking.
Different from a common audio data storage mode, the sound source with the highest volume is always collected as valid data at the same time node, and the sound source with the lower volume is always submerged or only mixed in gaps with different frequencies to become background sound. The sound source data storage mode provided by the multi-audio data processing method can use a multi-dimensional means to freely match audio data of multiple sound sources and multiple frequencies, different sound sources or the same sound source audio can be separated in different dimensions, and the audio data can be mixed in real time during playback, so that the method has great flexibility. Fig. 1 is a schematic spatial diagram of different audio source data in dimensions, where x is an audio data stream and can also be understood as time, y is audio, z is different audio, and the audio data streams and the audio volumes are separately acquired according to the audio and separated in the y dimension. The audio frequencies of different frequencies are considered as different sound sources, and the different sound sources or the same sound source audio can be separated in different dimensions.
Two groups of audio data with different sound sources and different frequencies exist in the form of independent data streams in a multidimensional data space, and when the audio data are coded into an independent audio file, the storage mode is as shown in fig. 2: the frequencies and the sound sources of the audio 1 and the audio 2 are different, and specifically, the frequency, the starting position and the length of each time and the frequency of the audio 1 are different from those of the audio 2.
Unlike the existing audio data storage method, which is a combination of audio format header + audio data, storing multi-source audio in this way makes it difficult to separate the mixed audio data. The multidimensional audio coding and storing mode proposed by the processing method of the multi-audio data is based on the sound source and the frequency, and the audio information of each sound source and frequency is dispersedly stored in the same audio data stream, so that the method has extremely high efficiency when the audio data index is created.
By the method, the audio data with multiple sound sources and multiple frequencies can be stored in the same audio file at the same time, the sound sources can be freely switched in the playback process, real-time sound mixing operation is realized, and the voiceprint characteristic retrieval index can be set for different sound source data, so that the retrieval process of the audio data is more efficient.
The invention has a plurality of embodiments, and all technical solutions formed by adopting equivalent transformation or equivalent transformation are within the protection scope of the invention.
Claims (1)
1. A method for processing multi-audio data, comprising: storing a plurality of audio data in a single audio file; the method comprises the following steps:
s1: selecting sound source equipment, and setting independent acquisition channels for different sound source equipment;
s2: setting sound source information and frequency related data for the acquired audio data;
s3: setting the number of sound sources and the basic information of different sound sources in an audio file header;
s4: setting the occurrence times, frequency, and the position and duration of each occurrence of the sound source for each sound source;
s5: storing the sound source file header information, starting coding and writing the sound source file header information into an audio data stream;
s6: writing related sound source data on the set frequency of an encoder of an audio file, and when a plurality of groups of sound source data exist in a certain frequency band, sequentially writing the sound source data in a sound source sequence of a sound source 1| sound source 2| sound source 3;
in step S3, the audio of different frequencies is regarded as different sound sources regardless of whether the capturing devices are the same;
in step S4, the sound source frequency is the time period in which the audio of the frequency effectively appears, and the background data except the frequency is ignored; in step S4, the sound source frequency is a unique identifier of the sound source, so as to create an index; in step S6, when a plurality of sets of sound source data exist in one frequency band, the sound source 1| sound source 2| sound source 3 are written in order of sound source.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710673700.0A CN107506409B (en) | 2017-08-09 | 2017-08-09 | Method for processing multi-audio data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710673700.0A CN107506409B (en) | 2017-08-09 | 2017-08-09 | Method for processing multi-audio data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN107506409A CN107506409A (en) | 2017-12-22 |
CN107506409B true CN107506409B (en) | 2021-01-08 |
Family
ID=60689589
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710673700.0A Active CN107506409B (en) | 2017-08-09 | 2017-08-09 | Method for processing multi-audio data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107506409B (en) |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3855862B2 (en) * | 2002-04-01 | 2006-12-13 | ソニー株式会社 | Editing method and apparatus |
US7869616B2 (en) * | 2005-01-12 | 2011-01-11 | Logitech International, S.A. | Active crossover and wireless interface for use with multi-driver in-ear monitors |
CN101001485A (en) * | 2006-10-23 | 2007-07-18 | 中国传媒大学 | Finite sound source multi-channel sound field system and sound field analogy method |
CN102867514B (en) * | 2011-07-07 | 2016-04-13 | 腾讯科技(北京)有限公司 | A kind of sound mixing method and device sound mixing |
CN105981411B (en) * | 2013-11-27 | 2018-11-30 | Dts(英属维尔京群岛)有限公司 | The matrix mixing based on multi-component system for the multichannel audio that high sound channel counts |
CN104064191B (en) * | 2014-06-10 | 2017-12-15 | 北京音之邦文化科技有限公司 | Sound mixing method and device |
CN106486128B (en) * | 2016-09-27 | 2021-10-22 | 腾讯科技(深圳)有限公司 | Method and device for processing double-sound-source audio data |
-
2017
- 2017-08-09 CN CN201710673700.0A patent/CN107506409B/en active Active
Also Published As
Publication number | Publication date |
---|---|
CN107506409A (en) | 2017-12-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109672922B (en) | Game video editing method and device | |
CN109982128B (en) | Video bullet screen generation method and device, storage medium and electronic device | |
CN105677716B (en) | A kind of computer data acquiring processing analysis system | |
CN105868397A (en) | Method and device for determining song | |
CN101656094A (en) | Data storage method and storage device | |
DE60120417D1 (en) | METHOD FOR SEARCHING IN AN AUDIO DATABASE | |
CN108920611B (en) | Article generation method, device, equipment and storage medium | |
CN102799605A (en) | Method and system for monitoring advertisement broadcast | |
WO2003098479A3 (en) | Managing search expressions in a database system | |
CN104599692A (en) | Recording method and device and recording content searching method and device | |
WO2008043082A3 (en) | Time series search engine | |
SG140510A1 (en) | System and method for database indexing, searching and data retrieval | |
CN104933175B (en) | Performance data correlation analysis method and performance monitoring system | |
CN110650374A (en) | Clipping method, electronic device, and computer-readable storage medium | |
CN103853836A (en) | Music retrieval method and system based on music fingerprint characteristic | |
KR20120090101A (en) | Digital video fast matching system using key-frame index method | |
CN105069153A (en) | Patent analysis system | |
CN107506409B (en) | Method for processing multi-audio data | |
CN109284763A (en) | A kind of method and server generating participle training data | |
DE102019123005A1 (en) | SYSTEM AND METHOD FOR DISPLAYING THE OBJECT MOTION SCHEME | |
CN101770474A (en) | History searching record-based searching method and device | |
CN103139272A (en) | Method of obtaining online time within selected time period and device using the same | |
CN104978380A (en) | Audio frequency processing method and device | |
CN104125334A (en) | Information processing method and electronic equipment | |
CN104778202B (en) | The analysis method and system of event evolutionary process based on keyword |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |