CN110035043A

CN110035043A - A kind of story play system and method based on speech recognition

Info

Publication number: CN110035043A
Application number: CN201810104033.9A
Authority: CN
Inventors: 朱建强
Original assignee: Shanghai Hua Zhen Electronic Technology Co Ltd
Current assignee: Shanghai Hua Zhen Electronic Technology Co Ltd
Priority date: 2018-02-02
Filing date: 2018-02-02
Publication date: 2019-07-19

Abstract

The story play system based on speech recognition that the invention discloses a kind of, including cloud server and story player；Cloud server receives the recording that Story machine is sent and does speech recognition calculating, identify the story name in recording, inquire the audio files in stream media of this corresponding story of story name, audio files in stream media comprising story content is sent to story player, if not can recognize that story name, it will be prompted to duplicate streaming media audio file and be sent to story player；Recording is sent cloud server by story player, and receives the audio files in stream media returned on cloud server, plays out.The invention also discloses a kind of story playback method based on speech recognition.

Description

A kind of story play system and method based on speech recognition

Technical field

The invention belongs to technical field of voice recognition more particularly to a kind of story play systems and side based on speech recognition Method.

Background technique

Story machine traditional at present plays story, is all the audio files storage by story on Story machine, then passes through Key or IR remote controller play to control, due to the limitation of the storage size of Story machine, the story audio file number of broadcasting Measure it is limited, and control play mode it is also very single, to listen some story, can only go to select by key.

Summary of the invention

Based on this, the present invention provides a kind of story play system and method based on speech recognition, fully effective can solve Certainly above-mentioned technical problem.

The technical scheme is that a kind of story play system based on speech recognition, including cloud server and event Thing player；Cloud server receives the recording that Story machine is sent and does speech recognition calculating, identifies the story name in recording, Audio files in stream media comprising story content is sent to by the audio files in stream media for inquiring this corresponding story of story name Story player will be prompted to duplicate streaming media audio file and be sent to story player if not can recognize that story name；Therefore Recording is sent cloud server by thing player, and receives the audio files in stream media returned on cloud server, is broadcast It puts.

In a preferred embodiment, the cloud server includes:

Module is updated storage, for updating and storing the audio files in stream media of story；

Speech recognition engine module for receiving recording data, the corresponding story title of identification recording data, and provides the story The identification score value of title；

It identifies score value judgment module, identifies whether score value is greater than identification point threshold for judging, if so, output result is event Thing title, if it is not, then exporting result is the audio files in stream media comprising " the story name described in you fail to identify, please repeat ", And it is sent to story player；

Condition query module, for inquiring story audio files in stream media corresponding to the story title identified, and by the sound Frequency files in stream media is sent to story player.

In a preferred embodiment, the story player includes:

Recording module, for recording received voice, the voice of recording includes the content voice signal B and Story machine that user speaks The content voice signal A of broadcasting；

Front audio processing module inhibits to filter out voice signal A, output voice letter by echo for handling the voice recorded Number B；

The voice signal B of output is sent to cloud server for realizing the communication with cloud server by Wifi module, and Receive the audio files in stream media of cloud server passback；

Streaming media playing module for decoding received audio files in stream media, and plays out.

In order to solve the technical problem, the present invention also provides a kind of story playback method based on speech recognition, including following mistake Journey:

S100, it receives and records and do speech recognition calculating, identify the story name in recording, inquire the corresponding event of this story name The audio files in stream media of thing, output result is the audio files in stream media comprising story content, if identifying story title, Then exporting result is the audio files in stream media comprising " the story name described in you fail to identify, please repeat "；

S200, the audio files in stream media of step S100 output is decoded and is played.

In a preferred embodiment, step S100 specifically includes following process:

S101, update and the audio files in stream media for storing story；

S102, recording data, the corresponding story title of identification recording data are received, and provides the identification score value of the story title；

S103, judge to identify whether score value is greater than identification point threshold, if so, output result is story title, enter step S104, if it is not, then exporting result is the streaming media audio file comprising " the story name described in you fail to identify, please repeat ", into Enter step S203；

The audio files in stream media of story corresponding to the story title that S104, inquiry identify, enters step S203.

In a preferred embodiment, the step S200 specifically includes following process:

S201, recorded speech, wherein the voice of recording include the content voice signal B that user speaks and Story machine play it is interior Hold voice signal A；

The voice that S202, processing are recorded is inhibited to filter out voice signal A by echo, exports voice signal B, enter step S102；

S203, the audio files in stream media of passback is decoded and is played.

The beneficial effects of the present invention are: the present invention is connected to internet in a manner of wifi, realize logical with cloud server Believe, speech recognition is done on cloud server and calculates and stores story audio file, user says the name of story, passes through server On speech recognition, identify story name, storage story audio files in stream media beyond the clouds played on Story machine, by Then it is stored on server beyond the clouds, compared with traditional approach, can store more stories, carried out by English identification method It plays, designs intelligent humanized, enrich broadcast mode.

Detailed description of the invention

Fig. 1 is the functional block diagram of the story play system described in the embodiment of the present invention based on speech recognition；

Fig. 2 is the flow chart of the story playback method described in the embodiment of the present invention based on speech recognition；

Fig. 3 is the schematic diagram of echo process of inhibition described in the embodiment of the present invention.

Description of symbols:

100- cloud server, 200- story player, 101- update storage module, 102- speech recognition engine module, 103- Identify score value judgment module, 104- condition query module, 201- recording module, 202- front audio processing module, 203-Wifi Module, 204- streaming media playing module.

Specific embodiment

The present invention is described in detail below.

Embodiment

As shown in Figure 1, a kind of story play system based on speech recognition, including cloud server 100 and story play Machine 200；Cloud server 100 receives the recording that Story machine is sent and does speech recognition calculating, identifies the story name in recording, Audio files in stream media comprising story content is sent to by the audio files in stream media for inquiring this corresponding story of story name Story player 200 will be prompted to duplicate streaming media audio file and be sent to story player if not can recognize that story name 200；Recording is sent cloud server 100 by story player 200, and receives the audio stream returned on cloud server 100 Media file plays out.

In above system, the audio file of the story of magnanimity is stored on cloud server 100, audio file can basis Story name indexes inquiry, and story audio file is put can regularly update on the server, and newest story is added.Sound Frequency file supports streaming media and broadcasting.Speech recognition engine is run on the cloud server 100, this engine is large vocabulary Speech recognition engine can support the speech recognition content recognition of magnanimity, this engine supports multithreading, support that multiple Story machines are logical It crosses internet while sending recording data, while doing the calculating of speech recognition, the story name in the recording identified and and this event The identification score value of thing name.By identifying the judgement of score value threshold values, if the score value of identification is higher than identification score value threshold values, event is exported Thing name finds the audio file of story, the files in stream media of this audio is then issued story and is broadcast using story name as index Machine 200 is put, does on story player 200 and is played in downloading；If the score value of this identification is judged to lower than identification score value threshold values It cannot identify, return result to 200 machine of story player, tell user this time failing identification just by Story machine playing alert tones Really.

In another embodiment, the cloud server 100 includes:

Module 101 is updated storage, for updating and storing the audio files in stream media of story；

Speech recognition engine module 102 for receiving recording data, the corresponding story title of identification recording data, and provides this The identification score value of story title；

It identifies score value judgment module 103, identifies whether score value is greater than identification point threshold for judging, if so, output result For story title, if it is not, then exporting result is the audio Streaming Media text comprising " the story name described in you fail to identify, please repeat " Part, and it is sent to story player 200；

Condition query module 104, for inquiring story audio files in stream media corresponding to the story title identified, and should Audio files in stream media is sent to story player 200.

In another embodiment, the story player 200 includes:

Recording module 201, for recording received voice, the voice of recording includes the content voice signal B that user speaks and event The content voice signal A that thing player 200 plays；

Front audio processing module 202 is inhibited to filter out voice signal A by echo, exports voice for handling the voice recorded Signal B；Specifically, microphone also can record the sound that loudspeaker play to enter in recording, phonetic recognization rate can be greatly reduced, In order to also can precisely identify when playing, uses echo and inhibit function, this function is indicated with such as Fig. 3: story player 200 play voice signals, played back by loudspeaker, then by microphone resurvey and user's one's voice in speech It mixes, carries out " subtraction " (echo inhibition) with reference signal (being connected in front audio processing by power amplifier chips lead) Operation inhibits reference signal.By front audio, treated that sound is left with user's one's voice in speech in this way, ensure that therefore For affairs that should be kept secret when loudspeaker play, speech recognition equally has high discrimination.

The voice signal B of output is sent to cloud for realizing the communication with cloud server 100 by Wifi module 203 Server 100, and receive the audio files in stream media of the passback of cloud server 100；

Streaming media playing module 204 for decoding received audio files in stream media, and plays out.

As shown in Fig. 2, in order to solve the technical problem, the present invention also provides a kind of story playback method based on speech recognition, It comprises the following processes:

In another embodiment, step S100 specifically includes following process:

S101, update and the audio files in stream media for storing story；

In another embodiment, the step S200 specifically includes following process:

S203, the audio files in stream media of passback is decoded and is played.

In above-described embodiment, when starting to play, for example user says " I wants to listen the story of small red cap ", story player 200 Recording only includes that is said or talked about by user due to not playing story also at this time, in recording, recording is sent to cloud server 100, Identification is done on server to calculate, and is identified story name " small red cap ", is inquired into story audio list, by the audio of " small red cap " Files in stream media is sent to story player 200, and after story player 200 receives, decoding plays the audio file of small red cap, if User's sound of speaking is too small or from microphone it is too far etc. due to cause not can recognize that story name " small red cap ", then story The broadcasting content of player 200 is " the story name described in you fail to identify, please repeat, please repeat ".

During the story of " small red cap " is playing, user says " playing Snow White ", story player 200 The sound of loudspeaker has been done echo inhibition, there was only user's one's voice in speech in recording, recording is issued by front audio processing module The audio files in stream media of " Snow White " is issued story player 200, story after server identification by cloud server 100 Player 200 stops the broadcasting of " small red cap ", changes the audio files in stream media for broadcasting " Snow White ".

A specific embodiment of the invention above described embodiment only expresses, the description thereof is more specific and detailed, but simultaneously Limitations on the scope of the patent of the present invention therefore cannot be interpreted as.It should be pointed out that for those of ordinary skill in the art For, without departing from the inventive concept of the premise, various modifications and improvements can be made, these belong to guarantor of the invention Protect range.

Claims

1. a kind of story play system based on speech recognition, it is characterised in that: including cloud server and story player；Cloud End server receives the recording that Story machine is sent and does speech recognition calculating, identifies the story name in recording, inquires this event Audio files in stream media comprising story content is sent to story and played by the audio files in stream media of the corresponding story of thing name Machine will be prompted to duplicate streaming media audio file and be sent to story player if not can recognize that story name；Story player Cloud server is sent by recording, and receives the audio files in stream media returned on cloud server, is played out.

2. the story play system according to claim 1 based on speech recognition, which is characterized in that the cloud service Device includes:

3. the story play system according to claim 1 based on speech recognition, which is characterized in that the story plays Machine includes:

4. a kind of story playback method based on speech recognition, which is characterized in that comprise the following processes:

5. the story playback method according to claim 4 based on speech recognition, which is characterized in that step S100 is specifically wrapped Include following process:

S101, update and the audio files in stream media for storing story；

6. the story playback method according to claim, described in 5 based on speech recognition, which is characterized in that the step S200 specifically includes following process:

S203, the audio files in stream media of passback is decoded and is played.