CN103531224B

CN103531224B - Be applied to the simple and easy speech playing method of embedded system platform

Info

Publication number: CN103531224B
Application number: CN201310460892.9A
Authority: CN
Inventors: 周宇
Original assignee: Elefirst Science & Tech Co Ltd
Current assignee: Elefirst Science & Tech Co Ltd
Priority date: 2013-09-30
Filing date: 2013-09-30
Publication date: 2016-03-09
Anticipated expiration: 2033-09-30
Also published as: CN103531224A

Abstract

The present invention relates to the simple and easy speech playing method being applied to embedded system platform, comprising: target text is converted to WAV formatted file; The sampled data of WAV formatted file is sent to D/A converter by processor; D/A converter is sent to note amplifier after sampled data is carried out digital-to-analog conversion and is processed, and is play by loudspeaker.After adopting the inventive method, can effectively reduce processor occupancy, reduce the requirement to processor, save the hardware cost of embedded system platform language play back system.

Description

Be applied to the simple and easy speech playing method of embedded system platform

Technical field

The present invention relates to a kind of speech playing method, especially a kind of simple and easy speech playing method being applied to embedded system platform.

Background technology

In recent years, along with electronic technology fast development, embedded system platform is applied in field more and more widely, and also from strength to strength, complexity is also more and more higher for function, and embedded system platform speech play technology also experienced by the development in multiple stage.

According to the knowledge of the applicant, common at present embedded system platform speech playing method has three kinds usually:

(1) by voice recording and playback ic (as ISD4003 series), convert required speech data to numerical information in advance, and deposit to chip internal; During broadcasting, digital signal is restored into voice.The advantage of this method is: use high sampling rate can obtain high-quality, natural voice during recording; But its shortcoming is: the sound pick-up outfit and the playback environ-ment that need specialty during a. recording, otherwise intolerable neighbourhood noise can be introduced; B. high sampling rate needs more storage spaces, and chip price corresponding is with it expensive.

(2) adopt the chipspeech (as the XFS5051CE flown is interrogated in University of Science and Technology) of specialty, based on the speech synthesis technique of advanced person, receive text (comprising Chinese) to be synthesized by UART interface, then text is directly synthesized voice output file.The advantage of this method is: be simple and easy to use, optional multiple pronunciation style (comprising dialect), can, based on special linguistic context optimization, make voice warmer, natural; But its shortcoming is: chip kind is few, expensive.

(3) software decode and play-back technology (as common GPS Voice Navigation, Mp 3 player etc.), deposit to device interior memory block by the audio file recorded in advance, by software decode when needing, and coordinates hardware to export voice.The advantage of this method is: low price, and can play the audio file of any form in theory; But its shortcoming is: software decode needs to take a large amount of processor (CPU or MCU) resources, very high to the requirement of processor.

With regard to current actual service condition, most of occasion is not high to the requirement of voice quality, and this just needs a kind of, process easier, versatility high and with low cost embedded system platform speech playing method lower to hardware requirement badly.

Summary of the invention

Technical matters to be solved by this invention is: for prior art Problems existing, proposes a kind of simple and easy speech playing method being applied to embedded system platform, can reduce processor occupancy, reduce the requirement to processor.

The technical scheme that the present invention solves its technical matters is as follows:

A kind of simple and easy speech playing method being applied to embedded system platform, it is characterized in that, be applied to the embedded system platform language play back system containing processor, FLASH memory, RAM storer, D/A converter, note amplifier and loudspeaker, described processor is connected with FLASH memory, RAM memory data respectively, described processor and D/A converter data cube computation, described D/A converter is connected with speaker data through note amplifier;

Described speech playing method comprises the following steps:

The first step, target text is converted to 8KHz sampling rate, 16Bit, the uncompressed pcm encoder formatted file of monaural standard, the WAV formatted file namely containing file header and sampled data, the reference position containing sampled data in described file header and length; Then this WAV formatted file is put into FLASH memory, pending device calls;

Second step, described processor find corresponding WAV formatted file according to instruction in FLASH memory, and call in RAM storer by the file header of this WAV formatted file; Described processor knows sampled data reference position and length by the file header reading this WAV formatted file, and the sampled data of this WAV formatted file is directly sent to D/A converter by DMA transmission mode by processor from FLASH memory;

3rd step, described D/A converter are sent to note amplifier after sampled data described in second step is carried out digital-to-analog conversion and are processed, and are play by loudspeaker.

The major technique design of technique scheme is as follows:

Applicant thinks, to reduce processor occupancy, adopting special voice recording and playback ic or chipspeech then can cause undoubtedly with high costs, can only look for another way in software.Applicant finds after in depth practical studies, without the need to the feature of decoding when utilizing WAV formatted file to play, should be able to effectively reduce processor occupancy, simultaneously 8KHz sampling rate, 16Bit, monaural WAV formatted file not only can meet the needs of most of occasion completely in voice quality, and its file size is also in tolerance interval; In addition DMA transmission technology is adopted can to reduce the occupancy of processor further.Based on this important discovery, applicant has finally drawn the technical scheme of above-mentioned optimization organic assembling after further practical studies, can effectively reduce processor occupancy, reduces the hardware cost of embedded system platform.

Preferably, also comprise voice and merge broadcast multiple segments voice separately process:

In a first step, if there is the WAV formatted file that multiple playout length is less than predetermined value, then with one of them WAV formatted file for reference file, the end that sampled data in all the other WAV formatted files is added on this reference file is successively formed new file, the title of each sampled data of adding, starting location offset are formed data block together with lengths sets simultaneously, then this data block is added into the end of new file, is formed and merge WAV formatted file; Containing merged file identifier in the file header of described merging WAV formatted file; This merging WAV formatted file is put into FLASH memory, and pending device calls;

In second step, by reading the merged file identifier merging WAV formatted file file header, processor knows that this file is for merging WAV formatted file, then the data block of this merging WAV formatted file end to be read in RAM storer and is found starting location offset and the length of destination sample data by title by processor, and then destination sample data are sent to D/A converter with DMA transmission mode by processor.

Audio file volume can be reduced further like this, save storage space.

Preferably, in the first step, first target text is converted to MP3 format audio file, then this MP3 format audio file is converted to WAV formatted file.Completely so existing switching software can be utilized can to obtain the WAV formatted file corresponding with target text, not need to develop new switching software again.

Preferably, in second step, processor is before sending to D/A converter by sampled data, first judge whether sampled data length is greater than DMA and transmits maximum single transmission length, if then sampled data is sent to D/A converter with interrupt mode by processor in batches, then sampled data is directly sent to D/A converter by processor if not.

More preferably, in second step, processor has the I2S module containing register, and processor first by sampled data reference position and length write I2S module register, and then starts DMA transmission; If sampled data length is greater than DMA and transmits maximum single and send length, then processor transmits in batches with interrupt mode, and during each transmission, processor is first by this reference position of sampled data and length write I2S module register, then starts this DMA and transmit.

Can farthest make good use of DMA transmission technology like this, reduce the workload of processor as far as possible.

Preferably, described processor is MCU.

After adopting the inventive method, can effectively reduce processor occupancy, reduce the requirement to processor, save the hardware cost of embedded system platform language play back system.

Accompanying drawing explanation

Fig. 1 is the hardware composition schematic diagram of the embodiment of the present invention.

Fig. 2 is the main process schematic diagram of Fig. 1 embodiment.

Embodiment

With reference to the accompanying drawings and the present invention is described in further detail in conjunction with the embodiments.But the invention is not restricted to given example.

Embodiment

The present embodiment is applied to the simple and easy speech playing method of embedded system platform, be applied to the embedded system platform language play back system (as shown in Figure 1) containing processor (as MCU), FLASH memory, RAM storer, D/A converter, note amplifier and loudspeaker, processor is connected with FLASH memory, RAM memory data respectively, processor and D/A converter data cube computation, D/A converter is connected with speaker data through note amplifier.

As shown in Figure 2, speech playing method comprises the following steps:

The first step, first target text is converted to MP3 format audio file, this MP3 format audio file is converted to 8KHz sampling rate, 16Bit, the uncompressed pcm encoder formatted file of monaural standard again, namely the WAV formatted file of file header and sampled data is contained, the reference position containing sampled data in file header and length; Then this WAV formatted file is put into FLASH memory, pending device calls;

Second step, processor find corresponding WAV formatted file according to instruction in FLASH memory, and call in RAM storer by the file header of this WAV formatted file; Processor knows sampled data reference position and length by the file header reading this WAV formatted file, and the sampled data of this WAV formatted file is directly sent to D/A converter by DMA transmission mode by processor from FLASH memory;

Specifically, processor is before sending to D/A converter by sampled data, first judge whether sampled data length is greater than DMA and transmits maximum single transmission length, if then sampled data is sent to D/A converter with interrupt mode by processor in batches, then sampled data is directly sent to D/A converter by processor if not.

Situation is more specifically: processor has the I2S module containing register, and processor first by sampled data reference position and length write I2S module register, and then starts DMA transmission; If sampled data length is greater than DMA and transmits maximum single and send length, then processor transmits in batches with interrupt mode, and during each transmission, processor is first by this reference position of sampled data and length write I2S module register, then starts this DMA and transmit.

Said method also comprises voice and merges broadcast multiple segments voice separately process:

In addition to the implementation, the present invention can also have other embodiments.All employings are equal to the technical scheme of replacement or equivalent transformation formation, all drop on the protection domain of application claims.

Claims

1. one kind is applied to the simple and easy speech playing method of embedded system platform, it is characterized in that, be applied to the embedded system platform language play back system containing processor, FLASH memory, RAM storer, D/A converter, note amplifier and loudspeaker, described processor is connected with FLASH memory, RAM memory data respectively, described processor and D/A converter data cube computation, described D/A converter is connected with speaker data through note amplifier;

Described speech playing method comprises the following steps:

3rd step, described D/A converter are sent to note amplifier after sampled data described in second step is carried out digital-to-analog conversion and are processed, and play eventually through loudspeaker;

Also comprise voice and merge broadcast multiple segments voice separately process:

2. be applied to the simple and easy speech playing method of embedded system platform according to claim 1, it is characterized in that, in the first step, first target text is converted to MP3 format audio file, then this MP3 format audio file is converted to WAV formatted file.

3. be applied to the simple and easy speech playing method of embedded system platform according to claim 2, it is characterized in that, in second step, processor is before sending to D/A converter by sampled data, first judge whether sampled data length is greater than DMA and transmits maximum single transmission length, if then sampled data is sent to D/A converter with interrupt mode by processor in batches, then sampled data is directly sent to D/A converter by processor if not.

4. be applied to the simple and easy speech playing method of embedded system platform according to claim 3, it is characterized in that, in second step, processor has the I2S module containing register, processor first by sampled data reference position and length write I2S module register, and then starts DMA transmission; If sampled data length is greater than DMA and transmits maximum single and send length, then processor transmits in batches with interrupt mode, and during each transmission, processor is first by this reference position of sampled data and length write I2S module register, then starts this DMA and transmit.

5. be applied to the simple and easy speech playing method of embedded system platform according to claim 4, it is characterized in that, described processor is MCU.