CN107480159A

CN107480159A - Method and device for entering speech data

Info

Publication number: CN107480159A
Application number: CN201611100679.7A
Authority: CN
Inventors: 王金龙; 丁小响
Original assignee: Guangdong Genius Technology Co Ltd
Current assignee: Guangdong Genius Technology Co Ltd
Priority date: 2016-12-02
Filing date: 2016-12-02
Publication date: 2017-12-15

Abstract

The present invention is applicable to the field of computer technology and provides a method and device for entering speech data. The method includes: forming a one-to-one correspondence between text data and the speech data corresponding to the text data, and establishing a text database and a speech database respectively; obtaining the text data of the speech data to be entered, and matching it against the text data in the text database; and, according to the matched text data, extracting the corresponding speech data from the speech database. By placing text data and speech data in one-to-one correspondence and matching on the text data, the invention extracts identical speech data that already exists in the speech database. This avoids repeated recording, allows speech data to be reused, and reduces the recording demand, thereby reducing cost.

Description

Method and device for entering speech data

Technical field

The invention belongs to the field of computer technology, and more particularly relates to a method and device for entering speech data.

Background art

At present, the data sources used by point readers and tutoring machines are produced from paper resource materials such as textbooks and supplementary teaching books. Producing them involves obtaining the speech data that corresponds to the text data in the resource material, so that the corresponding speech data can be played after a user selects text data in the resource material. Resource materials frequently undergo version updates, and often only part of the content changes, yet the text data in the resource material usually has to be recorded again to obtain the corresponding speech data. As a result, existing speech data is underused, and the repeated recording increases time cost and reduces efficiency.

Summary of the invention

The object of the present invention is to provide a method and device for entering speech data, intended to solve the prior-art problem that a version update of a resource material requires re-recording, so that existing speech data cannot be reused.

In one aspect, the invention provides a method for entering speech data, the method comprising the following steps:

Forming a one-to-one correspondence between text data and the speech data corresponding to the text data, and establishing a text database and a speech database respectively;

Obtaining the text data of the speech data to be entered, and matching the text data of the speech data to be entered against the text data in the text database;

According to the matched text data, extracting the corresponding speech data from the speech database.

In another aspect, the invention provides a device for entering speech data, the device including:

A database establishing unit, configured to form a one-to-one correspondence between text data and the speech data corresponding to the text data, and to establish a text database and a speech database respectively;

A matching unit, configured to obtain the text data of the speech data to be entered, and to match the text data of the speech data to be entered against the text data in the text database; and

An extraction unit, configured to extract the corresponding speech data from the speech database according to the matched text data.

In the embodiments of the present invention, text data and speech data are placed in one-to-one correspondence, and identical speech data that already exists in the speech database is extracted by matching on the text data. This avoids repeated recording, so that speech data can be reused and the recording demand is reduced, thereby reducing cost.

Brief description of the drawings

Fig. 1 is an implementation flowchart of the method for entering speech data provided by Embodiment one of the present invention;

Fig. 2 is an implementation flowchart of the method for entering speech data provided by Embodiment two of the present invention;

Fig. 3 is a schematic structural diagram of the device for entering speech data provided by Embodiment three of the present invention; and

Fig. 4 is a schematic structural diagram of the device for entering speech data provided by Embodiment four of the present invention.

Detailed description of the embodiments

To make the purpose, technical solution and advantages of the present invention clearer, the invention is described in further detail below with reference to the drawings and embodiments. It should be understood that the specific embodiments described here serve only to explain the present invention and are not intended to limit it.

The specific implementation of the present invention is described in detail below with reference to specific embodiments:

Embodiment one：

Fig. 1 shows the implementation flow of the method for entering speech data provided by Embodiment one of the present invention. For ease of description, only the parts related to this embodiment are shown, as detailed below:

In step S101, a one-to-one correspondence is formed between text data and the speech data corresponding to the text data, and a text database and a speech database are established respectively.

In this embodiment, a tutoring machine or point reader needs to import paper resource materials such as textbooks and tutorial books. Because resource materials are often updated and revised, the updated resource material must be imported as well. The text data of previously produced textbooks or supplementary teaching books is consolidated together with the speech data corresponding to that text data, a one-to-one correspondence is established between the text data and the speech data, and a text database and a speech database are formed. The databases are then continuously maintained and updated whenever a newly revised version appears.

Further, the text data and the speech data corresponding to the text data are marked with the same identification code.

Specifically, the one-to-one correspondence between text data and speech data can be realized by marking both with an identification code: an item of text data and the speech data corresponding to that item are marked with the same identification code, and the identification code is unique. After the text data and speech data have been marked, they are stored in the text database and the speech database respectively.
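
The marking in step S101 can be pictured with a minimal sketch. It assumes in-memory dictionaries as the two databases and a random UUID as the unique identification code; the patent does not specify a storage format or a code format, so `build_databases`, the sample texts and the placeholder audio bytes are all hypothetical.

```python
import uuid


def build_databases(pairs):
    """Store each (text, speech) pair under one shared identification code.

    Assumption: plain dicts stand in for the text database and the speech
    database, and uuid4 stands in for the unique identification code.
    """
    text_db = {}
    speech_db = {}
    for text, speech in pairs:
        code = uuid.uuid4().hex  # the same code marks both records
        text_db[code] = text
        speech_db[code] = speech
    return text_db, speech_db


# Hypothetical sample data: the byte strings are placeholders for recordings.
pairs = [("Good morning", b"<audio-1>"), ("Open your books", b"<audio-2>")]
text_db, speech_db = build_databases(pairs)

# Every code present in the text database is also present in the speech database.
for code, text in text_db.items():
    print(code, text, speech_db[code])
```

Because both records carry the same code, either database can be looked up from the other without storing audio next to the text.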

In step S102, the text data of the speech data to be entered is obtained, and the text data of the speech data to be entered is matched against the text data in the text database.

In this embodiment, when an updated resource material needs to be imported, the text data of the resource material is obtained first. The text data of the speech data to be entered is then matched against the text data in the text database to find identical text data. This avoids recording the same text data again, so that the speech data is used efficiently and the original speech data is not discarded merely because the resource material has been updated.

Further, the character string of the text data of the speech data to be entered is obtained;

The character string of the text data of the speech data to be entered is matched character by character in the text database.

Specifically, the character string of the text data of the speech data to be entered is obtained and matched character by character, so that text data identical to the text data of the speech data to be entered is found in the text database. This effectively ensures the accuracy of the matching while improving matching efficiency.
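
As an illustration of the character-by-character matching in step S102, the following sketch compares the character string to be entered with each stored string one character at a time. The function names, the dictionary layout and the sample codes such as `T001` are assumptions made for the example, not details given in the patent.

```python
def matches_char_by_char(candidate, stored):
    """Return True only if both strings agree at every character position."""
    if len(candidate) != len(stored):
        return False
    for a, b in zip(candidate, stored):
        if a != b:
            return False  # stop at the first differing character
    return True


def find_matching_code(candidate, text_db):
    """Return the identification code of identical stored text, or None."""
    for code, stored in text_db.items():
        if matches_char_by_char(candidate, stored):
            return code
    return None


# Hypothetical text database keyed by identification code.
text_db = {"T001": "Good morning", "T002": "Open your books"}

print(find_matching_code("Open your books", text_db))
print(find_matching_code("Open your book", text_db))
```

The linear scan is kept only to mirror the description; an exact-match lookup table keyed by the text itself would give the same result with less work.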

In step S103, according to the matched text data, the corresponding speech data is extracted from the speech database.

In this embodiment, the text data of the speech data to be entered is matched against the text data in the text database, and once identical text data has been found, the corresponding speech data is extracted from the speech database. This avoids recording identical text data again, shortens the time needed to import a new resource material, and makes efficient use of existing speech data.

Further, the identification code of the matched text data is obtained;

The speech data marked with the same identification code is extracted from the speech database.

Specifically, after identical text data has been matched, the unique identification code of the matched text data is read from the text database, and the speech data marked with that identification code is extracted from the speech database.
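
The two-stage lookup of step S103 (read the identification code of the matched text, then extract the speech data marked with that code) might look like the sketch below. The dictionaries, the codes and the placeholder audio bytes are hypothetical; only the order of the two stages comes from the description.

```python
# Hypothetical databases sharing identification codes.
text_db = {"T001": "Good morning", "T002": "Open your books"}
speech_db = {"T001": b"<audio-1>", "T002": b"<audio-2>"}


def code_of_matched_text(entry_text, text_db):
    """Stage 1: read the identification code of identical stored text."""
    for code, stored in text_db.items():
        if stored == entry_text:
            return code
    return None


def extract_speech(code, speech_db):
    """Stage 2: extract the speech data marked with the same code."""
    return speech_db[code]


code = code_of_matched_text("Good morning", text_db)
if code is not None:
    reused = extract_speech(code, speech_db)
    print(code, reused)
```

Keeping the stages separate means the speech database is touched only after a text match has been confirmed.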

Embodiment two：

Fig. 2 shows the implementation flow of the method for entering speech data provided by Embodiment two of the present invention. For ease of description, only the parts related to this embodiment are shown, as detailed below:

In step S201, a one-to-one correspondence is formed between text data and the speech data corresponding to the text data, and a text database and a speech database are established respectively.

In step S202, the text data of the speech data to be entered is obtained, and the text data of the speech data to be entered is matched against the text data in the text database.

In step S203, according to the matched text data, the corresponding speech data is extracted from the speech database.

In this embodiment, for the implementation of steps S201 to S203, refer to the description of steps S101 to S103 in Embodiment one above, which is not repeated here.

In step S204, if no identical text data is matched in the text database, the speech data corresponding to the text data is obtained by recording.

In this embodiment, when no identical text data is matched in the text database, the corresponding speech data must be obtained by recording. Specifically, the text data can be split into individual characters, the voice corresponding to each character can be obtained in turn, and these voices can be combined in order to form the speech data corresponding to the text data.
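
The split-and-combine fallback of step S204 can be sketched as follows. The sketch assumes that a per-character voice clip is already available in a hypothetical mapping `char_voice_db`, and it uses byte concatenation as a stand-in for joining audio; a real system would record the clips and join them with an audio library, neither of which the patent specifies.

```python
def speech_for_unmatched_text(text, char_voice_db):
    """Split the text into characters, fetch each voice in turn, and combine.

    Assumption: char_voice_db maps one character to one voice clip, and
    concatenating bytes stands in for concatenating audio.
    """
    characters = list(text)  # split the text data into characters
    clips = [char_voice_db[ch] for ch in characters]  # KeyError if a clip is missing
    return b"".join(clips)  # combine the voices in their original order


# Hypothetical per-character clips; the byte strings are placeholders.
char_voice_db = {"你": b"<ni>", "好": b"<hao>"}
combined = speech_for_unmatched_text("你好", char_voice_db)
print(combined)
```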

A person of ordinary skill in the art will understand that all or some of the steps of the methods in the above embodiments can be completed by a program instructing the relevant hardware. The program can be stored in a computer-readable storage medium, such as ROM/RAM, a magnetic disk or an optical disc.

Embodiment three：

Fig. 3 shows a schematic structural diagram of the device for entering speech data provided by Embodiment three of the present invention. For ease of description, only the parts related to this embodiment are shown. In this embodiment, the device for entering speech data includes a database establishing unit 31, a matching unit 32 and an extraction unit 33, wherein:

The database establishing unit 31 is configured to form a one-to-one correspondence between text data and the speech data corresponding to the text data, and to establish a text database and a speech database respectively.

Further, the database establishing unit 31 includes:

An identification code marking unit 311, configured to mark the text data and the speech data corresponding to the text data with the same identification code.

The matching unit 32 is configured to obtain the text data of the speech data to be entered, and to match the text data of the speech data to be entered against the text data in the text database.

Further, the matching unit 32 includes a character string acquiring unit 321 and a matching subunit 322, wherein:

The character string acquiring unit 321 is configured to obtain the character string of the text data of the speech data to be entered; and

The matching subunit 322 is configured to match the character string of the text data of the speech data to be entered character by character in the text database.

The extraction unit 33 is configured to extract the corresponding speech data from the speech database according to the matched text data.

Further, the extraction unit 33 includes an identification code acquiring unit 331 and an extraction subunit 332, wherein:

The identification code acquiring unit 331 is configured to obtain the identification code of the matched text data; and

The extraction subunit 332 is configured to extract, from the speech database, the speech data marked with the same identification code.

Embodiment four：

Fig. 4 shows a schematic structural diagram of the device for entering speech data provided by Embodiment four of the present invention. For ease of description, only the parts related to this embodiment are shown. In this embodiment, the device for entering speech data includes a database establishing unit 41, a matching unit 42, an extraction unit 43 and a recording unit 44, wherein:

The database establishing unit 41 is configured to form a one-to-one correspondence between text data and the speech data corresponding to the text data, and to establish a text database and a speech database respectively;

The matching unit 42 is configured to obtain the text data of the speech data to be entered, and to match the text data of the speech data to be entered against the text data in the text database;

The extraction unit 43 is configured to extract the corresponding speech data from the speech database according to the matched text data; and

The recording unit 44 is configured to obtain the speech data corresponding to the text data by recording if no identical text data is matched in the text database.

In this embodiment, each unit of the device for entering speech data can be implemented by a corresponding hardware or software unit. Each unit can be an independent software or hardware unit, or the units can be integrated into a single software or hardware unit; this is not intended to limit the present invention. For the specific implementation of each unit of the device, refer to the description of Embodiment one above, which is not repeated here.
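
As one possible software realization of the four units, the sketch below groups them into a single class. It is an illustration under stated assumptions rather than the patented implementation: the class name `SpeechEntryDevice`, the sequential identification codes, the injected `record_fn` callback, and the choice to store a fresh recording back into both databases are all hypothetical.

```python
class SpeechEntryDevice:
    """Hypothetical software grouping of the four units described above."""

    def __init__(self, record_fn):
        self.text_db = {}
        self.speech_db = {}
        self._record = record_fn  # recording unit, injected as a callback
        self._next = 1

    def establish(self, text, speech):
        """Database establishing unit: mark both records with one code."""
        code = f"ID{self._next:06d}"
        self._next += 1
        self.text_db[code] = text
        self.speech_db[code] = speech
        return code

    def match(self, entry_text):
        """Matching unit: return the code of identical stored text, or None."""
        for code, stored in self.text_db.items():
            if stored == entry_text:
                return code
        return None

    def extract(self, code):
        """Extraction unit: fetch the speech data marked with the code."""
        return self.speech_db[code]

    def enter(self, entry_text):
        """Return (speech, reused); record only when nothing matches."""
        code = self.match(entry_text)
        if code is not None:
            return self.extract(code), True
        speech = self._record(entry_text)
        # Assumption: the new recording is stored so later versions can reuse it.
        self.establish(entry_text, speech)
        return speech, False


# Usage with a fake recorder that returns placeholder bytes.
device = SpeechEntryDevice(lambda text: b"rec:" + text.encode("utf-8"))
device.establish("Lesson 1", b"<audio-1>")
first = device.enter("Lesson 1")    # matched, existing speech reused
second = device.enter("Lesson 2")   # not matched, recorded
third = device.enter("Lesson 2")    # matched on the second attempt
```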

The above are merely preferred embodiments of the present invention and are not intended to limit it. Any modification, equivalent replacement or improvement made within the spirit and principles of the present invention shall fall within its scope of protection.

Claims

1. A method for entering speech data, characterized in that the method comprises the following steps:

Forming a one-to-one correspondence between text data and the speech data corresponding to the text data, and establishing a text database and a speech database respectively;

Obtaining the text data of the speech data to be entered, and matching the text data of the speech data to be entered against the text data in the text database;

According to the matched text data, extracting the corresponding speech data from the speech database.

2. The method according to claim 1, characterized in that the step of forming a one-to-one correspondence between text data and the speech data corresponding to the text data, and establishing a text database and a speech database respectively, includes:

Marking the text data and the speech data corresponding to the text data with the same identification code.

3. The method according to claim 1, characterized in that the step of extracting the corresponding speech data from the speech database according to the matched text data includes:

Obtaining the identification code of the matched text data;

Extracting, from the speech database, the speech data marked with the same identification code.

4. The method according to claim 1, characterized in that the step of obtaining the text data of the speech data to be entered, and matching the text data of the speech data to be entered against the text data in the text database, includes:

Obtaining the character string of the text data of the speech data to be entered;

Matching the character string of the text data of the speech data to be entered character by character in the text database.

5. The method according to claim 1, characterized in that the method further includes:

If no identical text data is matched in the text database, obtaining the speech data corresponding to the text data by recording.

6. A device for entering speech data, characterized in that the device includes:

A database establishing unit, configured to form a one-to-one correspondence between text data and the speech data corresponding to the text data, and to establish a text database and a speech database respectively;

A matching unit, configured to obtain the text data of the speech data to be entered, and to match the text data of the speech data to be entered against the text data in the text database; and

An extraction unit, configured to extract the corresponding speech data from the speech database according to the matched text data.

7. The device according to claim 6, characterized in that the database establishing unit includes:

An identification code marking unit, configured to mark the text data and the speech data corresponding to the text data with the same identification code.

8. The device according to claim 6, characterized in that the extraction unit includes:

An identification code acquiring unit, configured to obtain the identification code of the matched text data; and

An extraction subunit, configured to extract, from the speech database, the speech data marked with the same identification code.

9. The device according to claim 6, characterized in that the matching unit includes:

A character string acquiring unit, configured to obtain the character string of the text data of the speech data to be entered; and

A matching subunit, configured to match the character string of the text data of the speech data to be entered character by character in the text database.

10. The device according to claim 6, characterized in that the device further includes:

A recording unit, configured to obtain the speech data corresponding to the text data by recording if no identical text data is matched in the text database.