CN113380220A - Speech synthesis coding method and device - Google Patents

Speech synthesis coding method and device Download PDF

Info

Publication number
CN113380220A
CN113380220A CN202110647984.2A CN202110647984A CN113380220A CN 113380220 A CN113380220 A CN 113380220A CN 202110647984 A CN202110647984 A CN 202110647984A CN 113380220 A CN113380220 A CN 113380220A
Authority
CN
China
Prior art keywords
buffer
tblock
playing
continuous
stream data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110647984.2A
Other languages
Chinese (zh)
Inventor
皮碧虹
杨德文
龙丁奋
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Tongxingzhe Technology Co ltd
Original Assignee
Shenzhen Tongxingzhe Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Tongxingzhe Technology Co ltd filed Critical Shenzhen Tongxingzhe Technology Co ltd
Priority to CN202110647984.2A priority Critical patent/CN113380220A/en
Publication of CN113380220A publication Critical patent/CN113380220A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/02Methods for producing synthetic speech; Speech synthesisers
    • G10L13/04Details of speech synthesis systems, e.g. synthesiser structure or memory management

Abstract

According to the voice synthesis coding method and device provided by one or more embodiments of the present specification, after text data is synthesized into pcm stream data, a start buffer threshold Tstart required for starting playing is dynamically calculated according to a current system load condition, and if a buffer duration of a buffer is greater than the start buffer threshold Tstart, the pcm stream data in the buffer is read for playing; dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition; in the playing process, whether text data is continuously synthesized into pcm stream data or synthesized temporarily is judged according to the relation between the buffer duration of the buffer area and the continuous buffer threshold Tblock, so that the stability and smoothness of playing are guaranteed, and the smooth occupation of a cpu and a memory is realized.

Description

Speech synthesis coding method and device
Technical Field
The present invention relates to the field of speech synthesis methods, and in particular, to a speech synthesis encoding method and apparatus.
Background
The current encoding and playing schemes for speech synthesis (text to speech) are:
1. one-time synthesis: inputting the text into a speech synthesis engine, acquiring encoded pcm data at one time, and transmitting the pcm data to a player for playing at one time; the method needs to occupy a large amount of memory storage pcm, the synthesis waiting time is long, and the playing is started after the data are completely synthesized.
2. Sleep in streaming synthesis: synthesizing pcm data, stopping processing for a player, sleeping for a certain time in the synthesis process, and continuously synthesizing and playing; the size of a data block synthesized in a single mode is fixed, cpu fluctuation exists, the cpu may be occupied too high when the sleep time is too short, and the player may be interrupted or noise may occur when the sleep time is too long.
Disclosure of Invention
In view of the above, one or more embodiments of the present disclosure are directed to a speech synthesis encoding method and apparatus, which can effectively solve the technical problems in the prior art.
In view of the above, one or more embodiments of the present specification provide a speech synthesis encoding method, including:
starting to synthesize the text data into pcm stream data, and storing the pcm stream data in a buffer area;
dynamically calculating an initial buffering threshold value Tstart required for starting playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the starting buffer threshold Tstart, reading the pcm stream data of the buffer area for playing;
dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, the text data is paused to be synthesized into pcm stream data, and the step of calculating the continuous buffer threshold Tblock is returned after the preset time is waited; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
As an optional implementation manner, the dynamically calculating a start buffer threshold Tstart required for starting playing according to the current system load condition includes:
if T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein T1 is the estimation of synthesis duration;
t2 is the playing time length;
tmin is the minimum buffer duration.
As an alternative embodiment, T1 ═ L × U/C, T2 ═ L × T;
wherein C is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; t is the estimation of the duration of a single character; l is the length of the whole sentence.
As an optional implementation manner, the dynamically calculating a sustained buffer threshold Tblock required for sustained play according to the current system load condition includes:
if T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time length estimation, T3 is the residual synthesis time length estimation, Tbuf is the residual playing time length of the current buffer, and x is the buffer unit.
As an alternative embodiment, T3 ═ R × U/C, T4 ═ R × T + Tbuf, Tmin ═ F × Tplayer;
wherein C is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; r is the residual word length, F is the minimum playing buffer coefficient, Tplayer is the minimum buffer time length of the player, and T is the single word time length estimation.
As an alternative embodiment, the buffer unit x is 1%, and the minimum play buffer factor F is 2.
As an optional embodiment, the method further comprises the step of pausing the playing of the pcm stream data.
Corresponding to the speech synthesis encoding method, an embodiment of the present invention further provides a speech synthesis encoding apparatus, including:
the buffer module is used for starting to synthesize the text data into pcm stream data and storing the pcm stream data in a buffer area;
the first calculation module is used for dynamically calculating a starting buffer threshold value Tstart required for starting playing according to the current system load condition;
the playing module is used for reading the pcm streaming data in the buffer area to play when the buffer duration of the buffer area is greater than the starting buffer threshold Tstart;
the second calculation module is used for dynamically calculating a continuous buffering threshold Tblock required by continuous playing according to the current system load condition;
the judging module is used for pausing the synthesis of the text data into pcm streaming data if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for the preset time; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
As an alternative implementation, the first computing module is configured to
If T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein T1 is the estimation of synthesis duration;
t2 is the playing time length;
tmin is the minimum buffer duration.
As an alternative implementation, the second computing module is configured to
If T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time length estimation, T3 is the residual synthesis time length estimation, Tbuf is the residual playing time length of the current buffer, and x is the buffer unit.
As can be seen from the above, in the speech synthesis encoding method and apparatus provided in one or more embodiments of the present disclosure, after text data is synthesized into pcm stream data, a start buffer threshold Tstart required for starting playing is dynamically calculated according to a current system load condition, and if a buffer duration of a buffer is greater than the start buffer threshold Tstart, the pcm stream data in the buffer is read for playing; dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition; in the playing process, whether text data is continuously synthesized into pcm stream data or synthesized temporarily is judged according to the relation between the buffer duration of the buffer area and the continuous buffer threshold Tblock, so that the stability and smoothness of playing are guaranteed, and the smooth occupation of a cpu and a memory is realized.
Drawings
In order to more clearly illustrate one or more embodiments or prior art solutions of the present specification, the drawings that are needed in the description of the embodiments or prior art will be briefly described below, and it is obvious that the drawings in the following description are only one or more embodiments of the present specification, and that other drawings may be obtained by those skilled in the art without inventive effort from these drawings.
FIG. 1 is a logic diagram of a speech synthesis encoding method according to an embodiment of the present invention;
FIG. 2 is a diagram of a speech synthesis encoding apparatus according to an embodiment of the present invention.
Detailed Description
For the purpose of promoting a better understanding of the objects, aspects and advantages of the present disclosure, reference is made to the following detailed description taken in conjunction with the accompanying drawings.
To achieve the above object, an embodiment of the present invention provides a speech synthesis encoding method, including:
starting to synthesize the text data into pcm stream data, and storing the pcm stream data in a buffer area;
dynamically calculating an initial buffering threshold value Tstart required for starting playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the starting buffer threshold Tstart, reading the pcm stream data of the buffer area for playing;
dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, the text data is paused to be synthesized into pcm stream data, and the step of calculating the continuous buffer threshold Tblock is returned after the preset time is waited; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
In the embodiment of the invention, after text data is synthesized into pcm stream data, a starting buffer threshold value Tstart required for starting playing is dynamically calculated according to the current system load condition, and if the buffer duration of a buffer area is greater than the starting buffer threshold value Tstart, the pcm stream data in the buffer area is read for playing; dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition; in the playing process, whether text data is continuously synthesized into pcm stream data or synthesized temporarily is judged according to the relation between the buffer duration of the buffer area and the continuous buffer threshold Tblock, so that the stability and smoothness of playing are guaranteed, and the smooth occupation of a cpu and a memory is realized.
As shown in fig. 1, an embodiment of the present invention provides a speech synthesis encoding method, including:
and S100, starting to synthesize the text data into pcm stream data, and storing the pcm stream data in a buffer area.
S200, dynamically calculating a starting buffer threshold Tstart required by starting playing according to the current system load condition.
Optionally, the dynamically calculating a start buffering threshold Tstart required for starting playing according to the current system load condition includes:
if T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein, T1 is the estimated synthesis duration, T1 ═ L × U/C; t2 is the playing duration, T2 ═ L × T; tmin is the minimum buffer duration; c is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; t is the estimation of the duration of a single character; l is the length of the whole sentence.
And S300, reading the pcm streaming data in the buffer area for playing if the buffer duration of the buffer area is greater than the starting buffer threshold value Tstart.
S400, dynamically calculating a continuous buffering threshold value Tblock required by continuous playing according to the current system load condition.
Optionally, the dynamically calculating a sustained buffer threshold Tblock required for sustained play according to the current system load condition includes:
if T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time estimation, T4 ═ R × T + Tbuf, T3 is the residual synthesized time estimation, T3 ═ R × U/C, Tbuf is the residual playing time of the current buffer, x is the buffer unit, the value is usually 1%, the minimum synthesized buffer time Tmin, Tmin ═ F × Tplayer; c is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; r is the remaining word length, F is the minimum play buffer coefficient, usually the value F is 2, Tplayer is the player minimum buffer duration, and T is the single word duration estimate.
S500, if the buffer duration of the buffer area is greater than the continuous buffer threshold value Tblock, the text data is paused to be synthesized into pcm stream data, and the step of calculating the continuous buffer threshold value Tblock is returned after the preset time is waited; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
As an optional embodiment, the method further comprises the step of pausing the playing of the pcm stream data.
Corresponding to the speech synthesis encoding method, as shown in fig. 2, an embodiment of the present invention further provides a speech synthesis encoding apparatus, including:
the buffer module 10 is configured to start synthesizing text data into pcm stream data, and store the pcm stream data in a buffer area;
the first calculating module 20 is configured to dynamically calculate a starting buffer threshold Tstart required for starting playing according to a current system load condition;
the playing module 30 is configured to read the pcm stream data in the buffer for playing when the buffer duration of the buffer is greater than the starting buffer threshold Tstart;
the second calculating module 40 is configured to dynamically calculate a continuous buffer threshold Tblock required for continuous playing according to a current system load condition;
the judging module 50 is configured to suspend synthesizing the text data into pcm stream data if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, and return to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
Optionally, the first computing module 20 is configured to
If T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein T1 is the estimation of synthesis duration;
t2 is the playing time length;
tmin is the minimum buffer duration.
Optionally, the second computing module 40 is configured to
If T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time length estimation, T3 is the residual synthesis time length estimation, Tbuf is the residual playing time length of the current buffer, and x is the buffer unit.
In the embodiment of the invention, after text data is synthesized into pcm stream data, a starting buffer threshold value Tstart required for starting playing is dynamically calculated according to the current system load condition, and if the buffer duration of a buffer area is greater than the starting buffer threshold value Tstart, the pcm stream data in the buffer area is read for playing; dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition; in the playing process, whether text data is continuously synthesized into pcm stream data or synthesized temporarily is judged according to the relation between the buffer duration of the buffer area and the continuous buffer threshold Tblock, so that the stability and smoothness of playing are guaranteed, and the smooth occupation of a cpu and a memory is realized.
The foregoing description has been directed to specific embodiments of this disclosure. Other embodiments are within the scope of the following claims. In some cases, the actions or steps recited in the claims may be performed in a different order than in the embodiments and still achieve desirable results. In addition, the processes depicted in the accompanying figures do not necessarily require the particular order shown, or sequential order, to achieve desirable results. In some embodiments, multitasking and parallel processing may also be possible or may be advantageous.
It is intended that the one or more embodiments of the present specification embrace all such alternatives, modifications and variations as fall within the broad scope of the appended claims. Therefore, any omissions, modifications, substitutions, improvements, and the like that may be made without departing from the spirit and principles of one or more embodiments of the present disclosure are intended to be included within the scope of the present disclosure.

Claims (10)

1. A speech synthesis encoding method, comprising:
starting to synthesize the text data into pcm stream data, and storing the pcm stream data in a buffer area;
dynamically calculating an initial buffering threshold value Tstart required for starting playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the starting buffer threshold Tstart, reading the pcm stream data of the buffer area for playing;
dynamically calculating a continuous buffer threshold Tblock required by continuous playing according to the current system load condition;
if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, the text data is paused to be synthesized into pcm stream data, and the step of calculating the continuous buffer threshold Tblock is returned after the preset time is waited; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
2. The speech synthesis encoding method according to claim 1, wherein the dynamically calculating a start buffer threshold Tstart required for starting playing according to the current system load condition comprises:
if T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein T1 is the estimation of synthesis duration;
t2 is the playing time length;
tmin is the minimum buffer duration.
3. The speech synthesis coding method according to claim 2, wherein T1 ═ L × U/C, T2 ═ L × T;
wherein C is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; t is the estimation of the duration of a single character; l is the length of the whole sentence.
4. The speech synthesis encoding method according to claim 1, wherein the dynamically calculating the Tblock required for continuous playback according to the current system load condition comprises:
if T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time length estimation, T3 is the residual synthesis time length estimation, Tbuf is the residual playing time length of the current buffer, and x is the buffer unit.
5. The speech synthesis coding method according to claim 4, wherein T3, T4, Tmin, Tlayer;
wherein C is the maximum idle calculation power of the single-core cpu; u is the calculation power consumption of single character synthesis; r is the residual word length, F is the minimum playing buffer coefficient, Tplayer is the minimum buffer time length of the player, and T is the single word time length estimation.
6. The speech synthesis coding method according to claim 5, wherein the buffer unit x is 1%, and the minimum play buffer coefficient F is 2.
7. The speech synthesis encoding method according to claim 1, further comprising the step of pausing the playing of the pcm stream data.
8. A speech synthesis encoding apparatus, comprising:
the buffer module is used for starting to synthesize the text data into pcm stream data and storing the pcm stream data in a buffer area;
the first calculation module is used for dynamically calculating a starting buffer threshold value Tstart required for starting playing according to the current system load condition;
the playing module is used for reading the pcm streaming data in the buffer area to play when the buffer duration of the buffer area is greater than the starting buffer threshold Tstart;
the second calculation module is used for dynamically calculating a continuous buffering threshold Tblock required by continuous playing according to the current system load condition;
the judging module is used for pausing the synthesis of the text data into pcm streaming data if the buffer duration of the buffer area is greater than the continuous buffer threshold Tblock, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for the preset time; otherwise, continuously synthesizing the text data into pcm stream data, and returning to the step of calculating the continuous buffer threshold Tblock after waiting for a preset time until all the text data are synthesized into pcm stream data.
9. The speech synthesis encoding apparatus of claim 8, wherein the first computing module is configured to compute the first speech signal according to the first speech signal
If T2-T1< Tmin, Tstart is Tmin, otherwise Tstart is T2-T1;
wherein T1 is the estimation of synthesis duration;
t2 is the playing time length;
tmin is the minimum buffer duration.
10. The speech synthesis encoding apparatus of claim 8, wherein the second computing module is configured to compute the second speech signal using a second algorithm
If T4 is less than or equal to T3, Tblock is T3, otherwise, Tblock is x (T2-T1) + Tbuf;
if Tblock is less than Tmin, Tblock is Tmin;
wherein, T4 is the residual playing time length estimation, T3 is the residual synthesis time length estimation, Tbuf is the residual playing time length of the current buffer, and x is the buffer unit.
CN202110647984.2A 2021-06-10 2021-06-10 Speech synthesis coding method and device Pending CN113380220A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110647984.2A CN113380220A (en) 2021-06-10 2021-06-10 Speech synthesis coding method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110647984.2A CN113380220A (en) 2021-06-10 2021-06-10 Speech synthesis coding method and device

Publications (1)

Publication Number Publication Date
CN113380220A true CN113380220A (en) 2021-09-10

Family

ID=77573540

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110647984.2A Pending CN113380220A (en) 2021-06-10 2021-06-10 Speech synthesis coding method and device

Country Status (1)

Country Link
CN (1) CN113380220A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060182086A1 (en) * 2005-02-14 2006-08-17 Dowdal John T Memory optimization packet loss concealment in a voice over packet network
CN101582832A (en) * 2008-05-17 2009-11-18 红杉树(杭州)信息技术有限公司 Method for dynamically processing VoIP jitter buffer area
CN103475934A (en) * 2013-09-13 2013-12-25 北京世纪鼎点软件有限公司 Video coding stream control method facing network live broadcast
CN107959659A (en) * 2016-10-17 2018-04-24 杭州海康威视数字技术股份有限公司 A kind of flow medium play control method, device and electronic equipment
CN109819312A (en) * 2019-03-19 2019-05-28 四川长虹电器股份有限公司 Player system and its control method based on dynamic buffer
CN110351445A (en) * 2019-06-19 2019-10-18 成都康胜思科技有限公司 A kind of high concurrent VOIP recording service system based on intelligent sound identification
CN111105779A (en) * 2020-01-02 2020-05-05 标贝(北京)科技有限公司 Text playing method and device for mobile client
CN111179973A (en) * 2020-01-06 2020-05-19 苏州思必驰信息科技有限公司 Speech synthesis quality evaluation method and system

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060182086A1 (en) * 2005-02-14 2006-08-17 Dowdal John T Memory optimization packet loss concealment in a voice over packet network
CN101582832A (en) * 2008-05-17 2009-11-18 红杉树(杭州)信息技术有限公司 Method for dynamically processing VoIP jitter buffer area
CN103475934A (en) * 2013-09-13 2013-12-25 北京世纪鼎点软件有限公司 Video coding stream control method facing network live broadcast
CN107959659A (en) * 2016-10-17 2018-04-24 杭州海康威视数字技术股份有限公司 A kind of flow medium play control method, device and electronic equipment
CN109819312A (en) * 2019-03-19 2019-05-28 四川长虹电器股份有限公司 Player system and its control method based on dynamic buffer
CN110351445A (en) * 2019-06-19 2019-10-18 成都康胜思科技有限公司 A kind of high concurrent VOIP recording service system based on intelligent sound identification
CN111105779A (en) * 2020-01-02 2020-05-05 标贝(北京)科技有限公司 Text playing method and device for mobile client
CN111179973A (en) * 2020-01-06 2020-05-19 苏州思必驰信息科技有限公司 Speech synthesis quality evaluation method and system

Similar Documents

Publication Publication Date Title
US8775831B2 (en) Using historic load profiles to dynamically adjust operating frequency and available power to a handheld multimedia device processor core
US20110305343A1 (en) Method and system decoding audio data with selective power control
JP4481304B2 (en) Storage device for storing different data formats
US20020013633A1 (en) Audio processor and audio data processing method
JP5103907B2 (en) Speech recognition system, speech recognition method, and speech recognition program
US20090024864A1 (en) Audio processor having dynamic automatic control function of operating frequency
EP1239462B1 (en) Distributed speech recognition system and method
US7356370B2 (en) Data processing circuit, data processing apparatus, data processing method, data processing control method, recording medium on which data processing program is stored and recording medium on which data processing control program is stored
CN113380220A (en) Speech synthesis coding method and device
US8509930B2 (en) Data processing method for audio data playback, audio data playback method and audio data playback device
JPH10108199A (en) Image coder
CN102860010A (en) Video encoding control method and apparatus
JP2001053869A (en) Voice storing device and voice encoding device
JP2006153907A (en) Audio data encoding device and audio data decoding device
JPWO2008142874A1 (en) Speech encoding and playback device
KR101611464B1 (en) Data pattern analysis method and analysis system
JP2006251837A (en) Memory controller
TW200731137A (en) Memory card with function of video/audio data processing
JP4289138B2 (en) Data processing apparatus and method and encoding apparatus
KR101032469B1 (en) Apparatus and Method for playing multimedia data
JP4591594B2 (en) Audio signal reproducing apparatus and method, and program
KR101265019B1 (en) instruction execution circuit
US20130103392A1 (en) Apparatus and method of reproducing audio data using low power
JP2008269745A (en) Reproducing device, program, and reproducing method
KR100903958B1 (en) Method and device for decoding digital audio data, and record medium for performing method of decoding digital audio data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination