CN102142924B - Versatile audio code (VAC) transmission method and device - Google Patents
Versatile audio code (VAC) transmission method and device
- Publication number
- CN102142924B
- Authority
- CN
- China
- Prior art keywords
- coding
- vac
- information
- control
- sign
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention discloses a versatile audio code (VAC) transmission method and device. The method comprises: a coding application terminal formulates a code control strategy according to input setting information; after receiving audio data, the coding application terminal generates a code control identifier according to the code control strategy, and codes the received audio data into VAC data; and the coding application terminal multiplexes the code control identifier and the VAC data into a VAC frame, forms a VAC transport stream from multiple VAC frames and sends the VAC transport stream to a decoding application terminal. According to the invention, the audio frame coding, transmission and decoding can be realized based on different technical schemes according to the information such as user setting, application scene, client feedback and the like.
Description
Technical field
The present invention relates to the field of communications, and in particular to a versatile audio coding transmission method and device.
Background art
In the field of low-complexity, scalable, versatile speech and audio coding, G.718 supports only embedded scalable coding and transmission for narrowband and wideband. G.719 supports low-complexity full-band coding and transmission but does not support a scalable mode. G.722.1 supports only low-complexity coding and transmission for wideband and super-wideband. The China Communications Standards Association (CCSA) has therefore proposed a versatile speech and audio codec scheme, the Versatile Audio Codec (VAC).
In the existing VAC technical scheme, coding can only be driven by signal classification: the input audio signal is classified as speech, music, noise, or silence; coding control parameters such as bandwidth are then determined from the signal type; and the audio data is encoded according to those parameters. This offers only a single way of controlling the coding, so flexibility is poor.
Summary of the invention
The technical problem to be solved by the present invention is to provide a versatile audio coding transmission method and device that overcome the single coding control mode of the prior art.
To solve the above technical problem, the present invention provides a versatile audio coding transmission method, comprising:
An encoding application end formulates a coding control strategy according to input setting information;
After receiving audio data, the encoding application end generates a coding control identifier according to the coding control strategy and encodes the received audio data into VAC data;
The encoding application end multiplexes the coding control identifier and the VAC data into a VAC frame, forms a VAC transport stream from a plurality of VAC frames, and sends the VAC transport stream to a decoding application end.
Further, the above method may also have the following features:
The setting information comprises configuration information and application control information;
The step in which the encoding application end formulates a coding control strategy according to the input setting information specifically means that the encoding application end selects a coding control mode according to the configuration information;
Wherein the coding control mode is: coding according to application control information and/or audio signal classification.
Further, the above method may also have the following features:
The application control information comprises one or more of the following:
(1) user-customized information;
(2) service-customized information;
(3) feedback information from the decoding application end.
Further, the above method may also have the following features:
If the application control information is user-customized information and the coding control mode is coding according to application control information, the encoding application end generates the coding control identifier directly from the user-customized information;
Wherein the user-customized information comprises an encoder identifier and coding control parameters.
Further, the above method may also have the following features:
If the application control information is service-customized information and the coding control mode is coding according to application control information, the step in which the encoding application end generates the coding control identifier comprises:
Determining the coding mode according to the service type, and determining the coding bit rate according to the service requirements;
Selecting coding control parameters according to the coding mode and the bit rate, and then selecting an encoder;
Generating the coding control identifier according to the encoder identifier and the coding control parameters.
Further, the above method may also have the following features:
If the application control information is feedback information from the decoding application end and the coding control mode is coding according to application control information, the encoding application end selects an encoder and coding control parameters that satisfy the feedback information and generates the coding control identifier.
Further, the above method may also have the following features:
The coding control parameters comprise: coding mode, sample rate, channel, and frame type.
Further, the above method may also have the following features:
The coding control identifier comprises a VAC parameter identifier, and the VAC parameter identifier comprises an encoder identifier and a coding parameter;
The coding parameter indicates the coding mode, sample rate, channel, and frame type; when the coding parameter indicates "custom", the coding control identifier further comprises VAC configuration information;
When the coding mode is embedded scalable, the coding control identifier further comprises VAC layering information.
To solve the above technical problem, the present invention further provides a versatile audio coding transmission device, comprising a coding control module, an encoding module, and a VAC transport stream multiplexing module,
The coding control module is configured to formulate a coding control strategy according to input setting information and, after audio data is received, to generate a coding control identifier according to the coding control strategy and send the coding control identifier to the encoding module and the VAC transport stream multiplexing module;
The encoding module is configured to encode the received audio data into VAC data according to the coding control identifier;
The VAC transport stream multiplexing module is configured to multiplex the coding control identifier and the VAC data into a VAC frame, form a VAC transport stream from a plurality of VAC frames, and send the VAC transport stream to a decoding application end.
Further, the above device may also have the following features:
The setting information comprises configuration information and application control information;
The coding control module is further configured to select a coding control mode according to the configuration information;
Wherein the coding control mode is: coding according to application control information and/or audio signal classification.
The present invention allows speech and audio frames to be encoded, transmitted, and decoded using different technical schemes according to information such as user settings, application scenarios, and client feedback. In addition, the present invention supports multiple codecs and can select an encoder according to the setting information, so it can support scalable audio coding and decoding from wideband to full band.
Brief description of the drawings
Fig. 1 is a schematic diagram of the transmitting end and the receiving end according to an embodiment of the present invention;
Fig. 2 is a schematic diagram of the inputs and outputs of the coding control module according to an embodiment of the present invention;
Fig. 3 is a flow chart of the versatile audio coding transmission method according to an embodiment of the present invention;
Fig. 4 is a schematic diagram of the composition of a VAC frame according to an embodiment of the present invention;
Fig. 5 is a schematic diagram of the composition of a VAC transport stream according to an embodiment of the present invention;
Fig. 6 is a flow chart of the versatile speech and audio decoding method according to an embodiment of the present invention.
Detailed description of the embodiments
The present invention is described in detail below with reference to the drawings and specific embodiments.
The present invention is described with reference to the summary of the 6th meeting of the Source Coding Working Group, the Versatile Speech and Audio Codec (VAC) Standard ToR, and the encoder technical schemes. Fig. 1 is a schematic diagram of the transmitting end and the receiving end according to an embodiment of the present invention.
The transmitting end is the encoding application end (that is, the versatile audio coding transmission device) and comprises a coding control module, an encoding module, and a VAC transport stream multiplexing module,
The coding control module is configured to formulate a coding control strategy according to input setting information and, after audio data is received, to generate a profile (the coding control identifier) according to the coding control strategy and send the profile to the encoding module and the VAC transport stream multiplexing module;
The encoding module is configured to encode the received audio data into VAC data according to the profile;
The VAC transport stream multiplexing module is configured to multiplex the profile and the VAC data into a VAC frame, form a VAC transport stream from a plurality of VAC frames, and send the VAC transport stream to the decoding application end.
Referring to Fig. 2, the setting information input to the coding control module comprises configuration information and application control information. The coding control module can select a coding control mode according to the configuration information. The coding control mode may be: coding according to application control information, coding according to audio signal classification, or coding according to both application control information and audio signal classification.
The encoding module comprises one or more encoders and can select an encoder and encode according to the profile output by the coding control module.
The receiving end is the decoding application end and comprises a VAC transport stream demultiplexing module and a decoding module;
The VAC transport stream demultiplexing module is configured to receive the VAC transport stream, extract the VAC frames, and parse each VAC frame to obtain the profile and the VAC data;
The decoding module is configured to decode the VAC data into audio data.
The decoding module may comprise one or more decoders and can select a decoder according to the profile to perform decoding.
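The decoder selection described above can be illustrated with a small dispatch table. This is only a sketch under stated assumptions: the `Profile` fields, the encoder identifier values, and the two stand-in "decoders" are hypothetical and do not come from the patent, which only says that a decoder is chosen according to the profile.

```python
from typing import Callable, Dict, NamedTuple


class Profile(NamedTuple):
    """Hypothetical, simplified coding control identifier."""
    encoder_id: int    # identifies the codec that produced the VAC data
    coding_param: int  # index of a parameter set (values are illustrative)


Decoder = Callable[[bytes, Profile], bytes]

# Registry mapping an encoder identifier to a decoder callable.
# Both entries are placeholders standing in for real codecs.
DECODERS: Dict[int, Decoder] = {
    1: lambda data, profile: data[::-1],    # placeholder for codec A
    2: lambda data, profile: data.upper(),  # placeholder for codec B
}


def decode_frame(profile: Profile, vac_data: bytes) -> bytes:
    """Select the decoder named by the profile and decode the payload."""
    try:
        decoder = DECODERS[profile.encoder_id]
    except KeyError:
        raise ValueError(f"no decoder for encoder id {profile.encoder_id}")
    return decoder(vac_data, profile)
```

A registry keyed by encoder identifier keeps the demultiplexing module independent of the individual codecs, which matches the statement that the decoding module may contain one or more decoders.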
The encoding and decoding processes carried out by the encoding application end and the decoding application end are described in further detail below.
As shown in Fig. 3, the encoding application end performs the following steps:
The setting information comprises configuration information and application control information;
The encoding application end can select a coding control mode according to the configuration information; the coding control mode is: coding according to application control information and/or audio signal classification;
Coding according to both application control information and audio signal classification is further divided into two cases: application control has the higher priority, or signal classification has the higher priority. The configuration information is a coding control method identifier, as shown in the following table:
Table 1 Coding control method identifier
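The four coding control modes named in the text and in claim 1 can be modeled as an enumeration. This is a minimal sketch: the numeric values below are placeholders chosen for illustration, because the identifier values of Table 1 are not reproduced in this text, and the helper function names are hypothetical.

```python
from enum import Enum


class ControlMode(Enum):
    """Coding control modes; the numeric values are placeholders, not Table 1."""
    APPLICATION_ONLY = 0        # only application control information is used
    SIGNAL_ONLY = 1             # only audio signal classification is used
    BOTH_APPLICATION_FIRST = 2  # both used, application control has priority
    BOTH_SIGNAL_FIRST = 3       # both used, signal classification has priority


def uses_application_control(mode: ControlMode) -> bool:
    """True when application control information contributes to the profile."""
    return mode is not ControlMode.SIGNAL_ONLY


def uses_signal_classification(mode: ControlMode) -> bool:
    """True when the signal classifier contributes to the profile."""
    return mode is not ControlMode.APPLICATION_ONLY
```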
The application control information may be one or more of the following:
(1) user-customized information;
(2) service-customized information;
(3) feedback information from the decoding application end.
The profile comprises a VAC parameter identifier and may also comprise VAC configuration information and VAC layering information.
The VAC parameter identifier comprises an encoder identifier and a coding parameter;
The coding parameter indicates coding control parameters such as the coding mode, sample rate, channel, and frame type;
The coding parameter may be defined as follows:
When the coding parameter indicates "custom" (value 0), the profile also comprises VAC configuration information;
The VAC configuration information may include, but is not limited to, parameters such as the coding mode, sample rate, number of channels, bandwidth, and frame type:
The coding mode includes embedded scalable and low-complexity multi-rate;
The sample rate includes, but is not limited to, 16 kHz, 32 kHz, and 48 kHz;
The number of channels includes, but is not limited to, mono and stereo;
The bandwidth includes, but is not limited to, wideband [50 Hz, 7000 Hz], super-wideband [50 Hz, 14000 Hz], and full band [20 Hz, 20000 Hz];
The frame type definition includes, but is not limited to, the following:
When the coding mode is embedded scalable, the profile also comprises VAC layering information, which may include parameters such as the layer start, the layer count, and the layer value identifier:
The layer start includes, but is not limited to, 1 to 13;
The layer count includes, but is not limited to, 1 to 13;
The layer value identifier distinguishes default layering from custom layer values:
Default layering uses the default values of L1 to L13;
Layer value 1 through layer value n give the user-defined lengths of layers 1 through n.
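The profile layout described above can be sketched as a few data classes. All class and field names here are hypothetical; only the rules being checked come from the text: a coding parameter of 0 means "custom" and requires VAC configuration information, and the embedded scalable coding mode requires VAC layering information. Because the table of predefined coding parameters is not reproduced in this text, the second check is applied only when the mode is given explicitly in the configuration.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

CUSTOM = 0  # coding parameter value that the text associates with "custom"
EMBEDDED_SCALABLE = "embedded_scalable"
LOW_COMPLEXITY_MULTI_RATE = "low_complexity_multi_rate"


@dataclass(frozen=True)
class VacConfig:
    coding_mode: str
    sample_rate_hz: int            # e.g. 16000, 32000, 48000
    channels: int                  # 1 = mono, 2 = stereo
    bandwidth_hz: Tuple[int, int]  # e.g. (50, 7000) for wideband
    frame_type: str


@dataclass(frozen=True)
class VacLayering:
    start_layer: int
    layer_count: int
    custom_lengths: Optional[Tuple[int, ...]] = None  # None = default layering


@dataclass(frozen=True)
class Profile:
    encoder_id: int
    coding_param: int
    config: Optional[VacConfig] = None
    layering: Optional[VacLayering] = None

    def __post_init__(self) -> None:
        # A custom coding parameter must be accompanied by configuration info.
        if self.coding_param == CUSTOM and self.config is None:
            raise ValueError("custom coding parameter requires VAC configuration")
        # Embedded scalable mode must be accompanied by layering info.
        if (self.config is not None
                and self.config.coding_mode == EMBEDDED_SCALABLE
                and self.layering is None):
            raise ValueError("embedded scalable mode requires VAC layering")
```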
(1) When the coding control mode is coding according to application control information, the encoder and coding control parameters are selected according to the application control information, the profile is then generated, and the received audio data is encoded into VAC data according to the encoder and coding control parameters;
The coding control parameters include the coding mode, sample rate, channel, frame type, and similar parameters.
(a) When the application control information is user-customized information, the user directly inputs parameters such as the encoder identifier and the coding control parameters, and the encoding application end can generate the profile directly from the user-customized information;
(b) When the application control information is service-customized information, the step in which the encoding application end generates the profile comprises:
(b.1) Determining the coding mode according to the service type, and determining the coding bit rate according to the service requirements;
The correspondence between service type and coding mode may follow:
(b.2) Selecting coding control parameters according to the coding mode and the bit rate, and then selecting an encoder;
If only one encoder implements these parameters, that encoder is chosen; if several encoders implement them, the encoder with the best performance indicators is chosen according to parameters such as algorithmic delay, algorithmic complexity, and coding quality;
(b.3) Generating the profile according to the encoder identifier and the coding control parameters.
(c) When the application control information is feedback information from the decoding application end, the encoding application end selects an encoder and coding control parameters that satisfy the feedback information and generates the profile, which may specifically comprise:
(c.1) Provided that a channel exists from the decoding application end to the encoding application end, the decoding application end sends parameters such as the coding complexity, delay, and bit rate to the encoding application end according to its own decoding capability;
(c.2) When the encoding application end receives the feedback information from the decoding application end, it selects and generates the profile with the best performance indicators according to the relevant parameters such as complexity, delay, and bit rate.
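Cases (b) and (c) both reduce to filtering the available encoders and keeping the best one, which can be sketched as follows. Everything here is an assumption made for illustration: the `EncoderInfo` fields and their units, the function names, and in particular the ranking order (quality first, then lower delay, then lower complexity). The text only says that the encoder with the best performance indicators is chosen, without defining how the indicators are combined.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class EncoderInfo:
    encoder_id: int
    coding_mode: str
    bitrate_kbps: int
    delay_ms: float    # algorithmic delay
    complexity: float  # hypothetical unit, lower is cheaper
    quality: float     # hypothetical score, higher is better


def pick_best(candidates: List[EncoderInfo]) -> Optional[EncoderInfo]:
    """Return the best candidate, or None when nothing qualifies."""
    if not candidates:
        return None
    # Assumed ranking: highest quality, then lowest delay, then lowest complexity.
    return max(candidates, key=lambda e: (e.quality, -e.delay_ms, -e.complexity))


def select_for_service(encoders: List[EncoderInfo], coding_mode: str,
                       bitrate_kbps: int) -> Optional[EncoderInfo]:
    """Case (b): the service fixes the coding mode and the bit rate."""
    return pick_best([e for e in encoders
                      if e.coding_mode == coding_mode
                      and e.bitrate_kbps == bitrate_kbps])


def select_for_feedback(encoders: List[EncoderInfo], max_complexity: float,
                        max_delay_ms: float,
                        max_bitrate_kbps: int) -> Optional[EncoderInfo]:
    """Case (c): the decoder feedback is treated as upper limits."""
    return pick_best([e for e in encoders
                      if e.complexity <= max_complexity
                      and e.delay_ms <= max_delay_ms
                      and e.bitrate_kbps <= max_bitrate_kbps])
```

The selected `encoder_id`, together with the chosen coding control parameters, would then be written into the profile.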
(2) When the coding control mode is coding according to audio signal classification, the following steps are performed:
(2.1) The audio signal is input and its type is extracted by the signal classification method: speech, music, noise, silence, and so on;
(2.2) According to the signal type, the wideband range is chosen for speech, the super-wideband and full-band ranges are chosen for music and noise, and for silence the frame type is set to the custom silence frame of the VAC parameter identifier;
(2.3) The encoder and the coding control parameters are determined according to the configured coding bit rate and the bandwidth obtained above;
When both the embedded scalable mode and the low-complexity multi-rate mode are available, it is recommended to select the low-complexity multi-rate mode, which has lower complexity and lower delay;
When several encoders support these parameters, it is recommended to choose the VAC encoder with the best performance according to the performance indicators;
(2.4) The profile is generated according to the chosen encoder identifier and coding control parameters.
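Step (2.2) is essentially a lookup from signal type to bandwidth and frame type, which can be sketched as below. The bandwidth ranges are those listed in the VAC configuration information. The function names, the `prefer_full_band` flag, and the "normal" frame type label are hypothetical: the text does not say how to choose between super-wideband and full band for music and noise, and the frame type definitions are not reproduced here.

```python
from typing import Optional, Tuple

WIDEBAND = (50, 7000)         # Hz
SUPER_WIDEBAND = (50, 14000)  # Hz
FULL_BAND = (20, 20000)       # Hz


def bandwidth_for_signal(signal_type: str,
                         prefer_full_band: bool = False
                         ) -> Optional[Tuple[int, int]]:
    """Map a classified signal type to a bandwidth range in Hz."""
    if signal_type == "speech":
        return WIDEBAND
    if signal_type in ("music", "noise"):
        # Assumption: a caller-supplied flag picks between the two ranges.
        return FULL_BAND if prefer_full_band else SUPER_WIDEBAND
    if signal_type == "silence":
        return None  # silence is signalled through the frame type instead
    raise ValueError(f"unknown signal type: {signal_type}")


def frame_type_for_signal(signal_type: str) -> str:
    """Silence selects the silence frame type; "normal" is a placeholder."""
    return "silence" if signal_type == "silence" else "normal"
```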
(3) When the coding control mode is coding according to both application control information and audio signal classification, the following steps may be adopted:
(3.1) Generate profile1 in the manner of coding according to application control information, and generate profile2 in the manner of coding according to audio signal classification;
The steps of coding according to application control information and of coding according to audio signal classification have been set out above and are not repeated here;
(3.2) When the signal classification method detects a silence frame, the profile is custom configuration information whose frame type is the silence frame. When the application control method has the higher priority, the profile is mainly generated from profile1, and the bandwidth and sample rate in the profile are taken from profile2. When the signal classification method has the higher priority, the profile is mainly generated from profile2, and the coding mode in the profile is taken from profile1.
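The merge rule of step (3.2) can be sketched with a flattened, hypothetical `Profile`. The field names are illustrative. One point is an assumption rather than something the text states: for a detected silence frame, the sketch keeps the higher-priority profile as the base and only overrides its frame type.

```python
from dataclasses import dataclass, replace
from typing import Tuple


@dataclass(frozen=True)
class Profile:
    encoder_id: int
    coding_mode: str
    sample_rate_hz: int
    bandwidth_hz: Tuple[int, int]
    frame_type: str = "normal"  # placeholder label for a non-silence frame


def merge_profiles(profile1: Profile, profile2: Profile,
                   application_first: bool,
                   silence_detected: bool) -> Profile:
    """Combine profile1 (application control) and profile2 (signal classification)."""
    if silence_detected:
        # Assumption: start from the higher-priority profile, mark it as silence.
        base = profile1 if application_first else profile2
        return replace(base, frame_type="silence")
    if application_first:
        # Mainly profile1; bandwidth and sample rate come from profile2.
        return replace(profile1,
                       bandwidth_hz=profile2.bandwidth_hz,
                       sample_rate_hz=profile2.sample_rate_hz)
    # Mainly profile2; the coding mode comes from profile1.
    return replace(profile2, coding_mode=profile1.coding_mode)
```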
Fig. 4 shows the composition of a VAC frame. A VAC frame comprises a VAC frame header and VAC coded data. The VAC frame header is the profile, which comprises the VAC parameter identifier and may also comprise VAC configuration information and VAC layering information. The VAC coded data is the VAC data obtained by encoding.
As shown in Fig. 5, a VAC transport stream consists of a plurality of VAC frames and VAC synchronization words.
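The frame and stream structure can be illustrated with a toy byte layout. The layout below is entirely hypothetical: the patent text does not give the sync word value, does not say whether a sync word precedes every frame, and does not define length fields. The sketch assumes one two-byte sync word per frame and adds a one-byte profile length and a two-byte payload length so that the stream can be parsed back.

```python
import struct
from typing import List, Tuple

SYNC_WORD = b"\xA5\x5A"  # hypothetical value, not taken from the patent


def mux_frame(profile: bytes, vac_data: bytes) -> bytes:
    """Build one frame: length fields, then the profile (header), then data."""
    # ">BH" = big-endian, 1-byte profile length, 2-byte payload length.
    return struct.pack(">BH", len(profile), len(vac_data)) + profile + vac_data


def mux_stream(frames: List[bytes]) -> bytes:
    """Prefix every frame with the sync word and concatenate them."""
    return b"".join(SYNC_WORD + frame for frame in frames)


def demux_stream(stream: bytes) -> List[Tuple[bytes, bytes]]:
    """Split a stream back into (profile, vac_data) pairs."""
    frames = []
    pos = 0
    while pos < len(stream):
        if stream[pos:pos + 2] != SYNC_WORD:
            raise ValueError("lost synchronization")
        pos += 2
        profile_len, data_len = struct.unpack_from(">BH", stream, pos)
        pos += 3
        profile = stream[pos:pos + profile_len]
        pos += profile_len
        vac_data = stream[pos:pos + data_len]
        pos += data_len
        frames.append((profile, vac_data))
    return frames
```

Carrying the profile in every frame header is what lets the decoding application end pick a decoder frame by frame, even if the encoding application end changes its strategy mid-stream.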
During encoding, the encoding application end can update the coding control strategy in real time according to the input setting information.
As shown in Fig. 6, the decoding application end performs the following steps:
If the coding strategy needs to be adjusted and a signaling transmission channel exists from the decoding application end to the encoding application end, the decoding application end sends feedback information to the encoding application end.
In summary, the present invention combines the application control method with the signal classification method, so that speech and audio frames can be flexibly encoded, transmitted, and decoded using different technical schemes according to information such as user settings, application scenarios, client feedback, and the audio data itself. In addition, the present invention can include multiple codecs and can support scalable audio coding and decoding from wideband to full band.
Of course, the present invention may also have various other embodiments. Those of ordinary skill in the art can make corresponding changes and modifications according to the present invention without departing from its spirit and essence, and all such changes and modifications shall fall within the protection scope of the appended claims.
Claims (8)
1. A versatile audio coding transmission method, comprising:
An encoding application end formulates a coding control strategy according to input setting information;
After receiving audio data, the encoding application end generates a coding control identifier according to the coding control strategy and encodes the received audio data into Versatile Audio Codec (VAC) data;
The encoding application end multiplexes the coding control identifier and the VAC data into a VAC frame, forms a VAC transport stream from a plurality of VAC frames, and sends the VAC transport stream to a decoding application end;
Wherein the setting information comprises configuration information and application control information;
The step in which the encoding application end formulates a coding control strategy according to the input setting information specifically means that the encoding application end selects a coding control mode according to the configuration information;
Wherein the coding control mode is: coding according to application control information and/or audio signal classification, comprising:
Only application control information is selected; only signal classification information is selected; both are selected and application control information has the higher priority; both are selected and signal classification has the higher priority.
2. The method of claim 1, characterized in that
The application control information comprises one or more of the following:
(1) user-customized information;
(2) service-customized information;
(3) feedback information from the decoding application end.
3. The method of claim 2, characterized in that
If the application control information is user-customized information and the coding control mode is coding according to application control information, the encoding application end generates the coding control identifier directly from the user-customized information;
Wherein the user-customized information comprises an encoder identifier and coding control parameters.
4. The method of claim 2, characterized in that
If the application control information is service-customized information and the coding control mode is coding according to application control information, the step in which the encoding application end generates the coding control identifier comprises:
Determining the coding mode according to the service type, and determining the coding bit rate according to the service requirements;
Selecting coding control parameters according to the coding mode and the bit rate, and then selecting an encoder;
Generating the coding control identifier according to the encoder identifier and the coding control parameters.
5. The method of claim 2, characterized in that
If the application control information is feedback information from the decoding application end and the coding control mode is coding according to application control information, the encoding application end selects an encoder and coding control parameters that satisfy the feedback information and generates the coding control identifier.
6. The method of any one of claims 3 to 5, characterized in that
The coding control parameters comprise: coding mode, sample rate, channel, and frame type.
7. The method of any one of claims 1 to 5, characterized in that
The coding control identifier comprises a VAC parameter identifier, and the VAC parameter identifier comprises an encoder identifier and a coding parameter;
The coding parameter indicates the coding mode, sample rate, channel, and frame type; when the coding parameter indicates "custom", the coding control identifier further comprises VAC configuration information;
When the coding mode is embedded scalable, the coding control identifier further comprises VAC layering information.
8. A versatile audio coding transmission device, characterized by comprising a coding control module, an encoding module, and a VAC transport stream multiplexing module,
The coding control module is configured to formulate a coding control strategy according to input setting information and, after audio data is received, to generate a coding control identifier according to the coding control strategy and send the coding control identifier to the encoding module and the VAC transport stream multiplexing module;
The encoding module is configured to encode the received audio data into VAC data according to the coding control identifier;
The VAC transport stream multiplexing module is configured to multiplex the coding control identifier and the VAC data into a VAC frame, form a VAC transport stream from a plurality of VAC frames, and send the VAC transport stream to a decoding application end;
Wherein
The setting information comprises configuration information and application control information;
The coding control module is further configured to select a coding control mode according to the configuration information;
Wherein the coding control mode is: coding according to application control information and/or audio signal classification, comprising:
Only application control information is selected; only signal classification information is selected; both are selected and application control information has the higher priority; both are selected and signal classification has the higher priority.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010111267.XA CN102142924B (en) | 2010-02-03 | 2010-02-03 | Versatile audio code (VAC) transmission method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201010111267.XA CN102142924B (en) | 2010-02-03 | 2010-02-03 | Versatile audio code (VAC) transmission method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102142924A CN102142924A (en) | 2011-08-03 |
CN102142924B true CN102142924B (en) | 2014-04-09 |
Family
ID=44410179
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201010111267.XA Active CN102142924B (en) | 2010-02-03 | 2010-02-03 | Versatile audio code (VAC) transmission method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102142924B (en) |
Families Citing this family (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103812824A (en) * | 2012-11-07 | 2014-05-21 | 中兴通讯股份有限公司 | Audio frequency multi-code transmission method and corresponding device |
EP2782280A1 (en) * | 2013-03-20 | 2014-09-24 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Two-stage signaling for transmission of a datastream |
CN104506870B (en) * | 2014-11-28 | 2018-02-09 | 北京奇艺世纪科技有限公司 | A kind of video coding processing method and device suitable for more code streams |
CN109273017B (en) * | 2018-08-14 | 2022-06-21 | Oppo广东移动通信有限公司 | Encoding control method and device and electronic equipment |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101223575A (en) * | 2005-07-14 | 2008-07-16 | 皇家飞利浦电子股份有限公司 | Audio encoding and decoding |
CN101288116A (en) * | 2005-10-13 | 2008-10-15 | Lg电子株式会社 | Method and apparatus for signal processing |
CN101393741A (en) * | 2007-09-19 | 2009-03-25 | 中兴通讯股份有限公司 | Audio signal classification apparatus and method used in wideband audio encoder and decoder |
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2003091870A1 (en) * | 2002-04-26 | 2003-11-06 | Electronics And Telecommunications Research Institute | Apparatus and method for adapting audio signal |
CN101371297A (en) * | 2006-01-18 | 2009-02-18 | Lg电子株式会社 | Apparatus and method for encoding and decoding signal |
WO2008045846A1 (en) * | 2006-10-10 | 2008-04-17 | Qualcomm Incorporated | Method and apparatus for encoding and decoding audio signals |
CN101521010B (en) * | 2008-02-29 | 2011-10-05 | 华为技术有限公司 | Coding and decoding method for voice frequency signals and coding and decoding device |
Also Published As
Publication number | Publication date |
---|---|
CN102142924A (en) | 2011-08-03 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102124655B (en) | Method for encoding a symbol, method for decoding a symbol, method for transmitting a symbol from a transmitter to a receiver, encoder, decoder and system for transmitting a symbol from a transmitter to a receiver | |
EP1946517B1 (en) | Audio data packet format and decoding method thereof and method for correcting mobile communication terminal codec setup error and mobile communication terminal performing same | |
CN103220082B (en) | Device and method for generating a signal for transmission or a decoded signal | |
US20080004883A1 (en) | Scalable audio coding | |
CN1153365C (en) | Transfer system adopting different coding principle | |
CN1322405A (en) | Device and method for entropy encoding of information words and device and method for decoding entropy-encoded information words | |
SG170078A1 (en) | Encoding device, decoding device, and method thereof | |
CN102142924B (en) | Versatile audio code (VAC) transmission method and device | |
WO2008065487A1 (en) | Method, apparatus and computer program product for stereo coding | |
CN103077723A (en) | Audio transmission system | |
CN101141644B (en) | Encoding integration system and method and decoding integration system and method | |
CN101981872A (en) | Systems, methods and apparatus for transmitting data over a voice channel of a wireless telephone network | |
CN101453653B (en) | Method for spreading digital audio and video parameter set | |
CN1192656C (en) | Transceiver for selecting source coder and processes carried out in such transceiver | |
US8510121B2 (en) | Multiple description audio coding and decoding method, apparatus, and system | |
JP5068429B2 (en) | Audio data conversion method and apparatus | |
CN113539281A (en) | Audio signal encoding method and apparatus | |
CN101322375A (en) | Audio data packet format and decoding method thereof and method for correcting mobile communication terminal codec setup error and mobile communication terminal performance same | |
CN103646647B (en) | In mixed audio demoder, the spectrum parameter of frame error concealment replaces method and system | |
CN101478616A (en) | Instant voice communication method | |
CN101160725A (en) | Lossless encoding of information with guaranteed maximum bitrate | |
KR101166650B1 (en) | Method and means for decoding background noise information | |
CN1826635A (en) | Audio file format conversion | |
KR101383915B1 (en) | A digital audio receiver having united speech and audio decoder | |
KR20230035373A (en) | Audio encoding method, audio decoding method, related device, and computer readable storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |