CN103065625A

CN103065625A - Method and device for adding digital voice tag

Info

Publication number: CN103065625A
Application number: CN2012105719712A
Authority: CN
Inventors: 曾元清
Original assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Current assignee: Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date: 2012-12-25
Filing date: 2012-12-25
Publication date: 2013-04-24

Abstract

The invention relates to the technical field of electronics, and discloses a method and a device for adding a digital voice tag. The method for adding the digital voice tag comprises the following steps of confirming any a time point in a digital voice file, intercepting digital voice data in a certain time range before the time point or after the time point, recognizing the digital voice data, acquiring literal content of the digital voice data, adding corresponding voice tags according to the literal content, and providing the voice tags for users. The method and the device for adding the digital voice tag can enable the users to rapidly know about content of voice files, and improves accuracy of the voice tags, convenience and practicability of usage of the users.

Description

A kind of adding method of digital speech label and device

Technical field

The present invention relates to electronic technology field, particularly a kind of adding method of digital speech label and device.

Background technology

In the application of existing digital voice file, people need to understand the content of a certain voice document at short notice.The user uses the method for adding identification label that voice document is identified mostly at present, in order to directly use identification label in the future, identifies the content of these voice.

The user can arrange identification label to the sometime point in the voice document, and this identification label mainly is word content, is inputted by hand according to the content of voice by the user.The user can arrange to a voice document identification label of any number, and the accuracy of identification content is directly proportional with the number of labels that the user arranges.If the number of labels that arranges is not enough, the accuracy of then identifying content will reduce greatly.

In addition, the operation that the adding method of this identification label needs the user to carry out multistep could realize, formality is loaded down with trivial details, length consuming time, and when using voice label, can only be to obtain the identification content at the time point that label is set, can not obtain voice content at time point arbitrarily, less pertinence is strong, locates not accurate enough.

Summary of the invention

The embodiment of the invention the first purpose is to provide a kind of adding method of digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.

The embodiment of the invention the second purpose is to provide a kind of device that adds the digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.

First aspect the invention provides a kind of adding method of digital speech label, comprising:

Determine the arbitrary time point in the digital voice file;

Intercept before or after the described time point the sometime digital voice data of section;

Identify described digital voice data, obtain the word content of described digital voice data;

According to described word content, add corresponding voice label, provide described voice label to the user.

In conjunction with first aspect, under the first implementation, the arbitrary time point in described definite digital voice file specifically comprises:

Determined arbitrary time point of described digital voice file by the user.

In conjunction with first aspect, under the second implementation, before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:

Determine the time span of described time period.

Second aspect, the present embodiment provide a kind of device that adds the digital speech label, comprising:

The time point determining unit is for arbitrary time point of determining digital voice file;

The speech data interception unit is used for intercepting before or after the described time point the sometime digital voice data of section;

Recognition unit is used for identifying described digital voice data, obtains the word content of described digital voice data;

The label adding device is used for according to described word content, adds corresponding voice label;

Display unit is used for providing described voice label to the user.

In conjunction with second aspect, under the first implementation, described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.

In conjunction with first aspect, under the second implementation, described device also comprises:

The duration determining unit is used for determining the described time span that needs the intercepting time period.

Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.

Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.

To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.

In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, the below will do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art, apparently, accompanying drawing in the following describes only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain according to these accompanying drawings other accompanying drawing.

The schematic flow sheet of the adding method of a kind of digital speech label that Fig. 1 provides for the embodiment of the invention 1;

A kind of structural representation that adds the device of digital speech label that Fig. 2 provides for the embodiment of the invention 2.

Embodiment

Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.

Embodiment 1

Referring to Fig. 1, the present embodiment provides a kind of adding method of digital speech label, is applicable to the content that the user understands voice document fast.Its key step comprises:

Step 101: determine the arbitrary time point in the digital voice file.

In the present embodiment, can but be not limited to determine time point in the digital voice file by the user.The user can according to actual needs, arrange voice label to the arbitrary time point in the voice document.The user can but be not limited to when digital voice file is play, trigger arbitrary time point in the broadcast bar of this voice document and input and determine order, determine to add voice label at this time point.

Step 102: intercept before or after this time point the sometime digital voice data of section.

In the present embodiment, according to fixed time point, intercept before or after this time point the sometime digital voice data of section.The time span of this time period is set up on their own by the user, can according to actual needs, regulate the time span of this time period.

Step 103: identify this digital voice data, obtain the word content of this digital voice data.

In the present embodiment, can but be not limited to adopt speech recognition software that the speech data of intercepting is identified, thereby obtain word content corresponding to this speech data.

In the present embodiment, the user can arrange a plurality of time points, thereby determines a plurality of voice labels, does not need artificially the content of voice label to be edited, and also can accurately obtain near the voice content of this time point.

Step 104: according to word content, add corresponding voice label.

In the present embodiment, voice label can but be not limited to comprise: the word content in this speech data.The user can consult this voice label, to consult the voice content of these time point front and back.

Step 105: provide this voice label to the user.

In the present embodiment, the user can add and the voice label that obtains random time point immediately, can understand accurately and rapidly the content of voice document.

Embodiment 2

Referring to Fig. 2, the present embodiment provides a kind of device that adds the digital speech label, comprising: time point determining unit 201, speech data interception unit 202, recognition unit 203, label adding device 204, display unit 205, duration determining unit 206.

Main syndeton and the principle of work of its each parts are as follows:

Time point determining unit 201 is electrically connected with speech data interception unit 202, is used for determining arbitrary time point of digital voice file.

More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 101 in the example 1.

Speech data interception unit 202 is electrically connected with recognition unit 203, is used for identifying described digital voice data, obtains the word content of described digital voice data.

More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 102 in the example 1.

Recognition unit 203 is electrically connected with label adding device 204, is used for according to described word content, adds corresponding voice label.

More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 103 in the example 1.

Label adding device 204 with main control chip 203 connections, is used for the audio-video frequency content of main control chip 203 demodulation is externally exported.

More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 104 in the example 1.

Mixed-media network modules mixed-media 205 is electrically connected with display unit 205, is used for interconnection network, downloads or upload required data message.

More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 105 in the example 1.

Duration determining unit 206 is electrically connected with speech data interception unit 202, is used for determining the time span of this time period.

Device embodiment described above only is schematic, wherein said unit as the separating component explanation can or can not be physically to separate also, the parts that show as the unit can be or can not be physical locations also, namely can be positioned at a place, perhaps also can be distributed on a plurality of network element.Can select according to the actual needs wherein some or all of module to realize the purpose of the present embodiment scheme.Those of ordinary skills namely can understand and implement in the situation that do not pay performing creative labour.

Above-described embodiment does not consist of the restriction to this technical scheme protection domain.Any at above-mentioned embodiment spirit and principle within do modification, be equal to and replace and improvement etc., all should be included within the protection domain of this technical scheme.

Claims

1. the adding method of a digital speech label is characterized in that, comprising:

Determine the arbitrary time point in the digital voice file;

2. method according to claim 1 is characterized in that,

Arbitrary time point in described definite digital voice file specifically comprises:

Determined arbitrary time point of described digital voice file by the user.

3. method according to claim 1 is characterized in that,

Before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:

Determine the time span of described time period.

4. a device that adds the digital speech label is characterized in that, comprising:

Display unit is used for providing described voice label to the user.

5. device according to claim 4 is characterized in that,

Described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.

6. described 4 described devices according to claim 5 is characterized in that, described device also comprises: the duration determining unit is used for determining the described time span that needs the intercepting time period.