CN103065625A - Method and device for adding digital voice tag - Google Patents
Method and device for adding digital voice tag Download PDFInfo
- Publication number
- CN103065625A CN103065625A CN2012105719712A CN201210571971A CN103065625A CN 103065625 A CN103065625 A CN 103065625A CN 2012105719712 A CN2012105719712 A CN 2012105719712A CN 201210571971 A CN201210571971 A CN 201210571971A CN 103065625 A CN103065625 A CN 103065625A
- Authority
- CN
- China
- Prior art keywords
- time point
- label
- digital voice
- voice
- user
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
The invention relates to the technical field of electronics, and discloses a method and a device for adding a digital voice tag. The method for adding the digital voice tag comprises the following steps of confirming any a time point in a digital voice file, intercepting digital voice data in a certain time range before the time point or after the time point, recognizing the digital voice data, acquiring literal content of the digital voice data, adding corresponding voice tags according to the literal content, and providing the voice tags for users. The method and the device for adding the digital voice tag can enable the users to rapidly know about content of voice files, and improves accuracy of the voice tags, convenience and practicability of usage of the users.
Description
Technical field
The present invention relates to electronic technology field, particularly a kind of adding method of digital speech label and device.
Background technology
In the application of existing digital voice file, people need to understand the content of a certain voice document at short notice.The user uses the method for adding identification label that voice document is identified mostly at present, in order to directly use identification label in the future, identifies the content of these voice.
The user can arrange identification label to the sometime point in the voice document, and this identification label mainly is word content, is inputted by hand according to the content of voice by the user.The user can arrange to a voice document identification label of any number, and the accuracy of identification content is directly proportional with the number of labels that the user arranges.If the number of labels that arranges is not enough, the accuracy of then identifying content will reduce greatly.
In addition, the operation that the adding method of this identification label needs the user to carry out multistep could realize, formality is loaded down with trivial details, length consuming time, and when using voice label, can only be to obtain the identification content at the time point that label is set, can not obtain voice content at time point arbitrarily, less pertinence is strong, locates not accurate enough.
Summary of the invention
The embodiment of the invention the first purpose is to provide a kind of adding method of digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.
The embodiment of the invention the second purpose is to provide a kind of device that adds the digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.
First aspect the invention provides a kind of adding method of digital speech label, comprising:
Determine the arbitrary time point in the digital voice file;
Intercept before or after the described time point the sometime digital voice data of section;
Identify described digital voice data, obtain the word content of described digital voice data;
According to described word content, add corresponding voice label, provide described voice label to the user.
In conjunction with first aspect, under the first implementation, the arbitrary time point in described definite digital voice file specifically comprises:
Determined arbitrary time point of described digital voice file by the user.
In conjunction with first aspect, under the second implementation, before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:
Determine the time span of described time period.
Second aspect, the present embodiment provide a kind of device that adds the digital speech label, comprising:
The time point determining unit is for arbitrary time point of determining digital voice file;
The speech data interception unit is used for intercepting before or after the described time point the sometime digital voice data of section;
Recognition unit is used for identifying described digital voice data, obtains the word content of described digital voice data;
The label adding device is used for according to described word content, adds corresponding voice label;
Display unit is used for providing described voice label to the user.
In conjunction with second aspect, under the first implementation, described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.
In conjunction with first aspect, under the second implementation, described device also comprises:
The duration determining unit is used for determining the described time span that needs the intercepting time period.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, the below will do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art, apparently, accompanying drawing in the following describes only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain according to these accompanying drawings other accompanying drawing.
The schematic flow sheet of the adding method of a kind of digital speech label that Fig. 1 provides for the embodiment of the invention 1;
A kind of structural representation that adds the device of digital speech label that Fig. 2 provides for the embodiment of the invention 2.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.
Embodiment 1
Referring to Fig. 1, the present embodiment provides a kind of adding method of digital speech label, is applicable to the content that the user understands voice document fast.Its key step comprises:
Step 101: determine the arbitrary time point in the digital voice file.
In the present embodiment, can but be not limited to determine time point in the digital voice file by the user.The user can according to actual needs, arrange voice label to the arbitrary time point in the voice document.The user can but be not limited to when digital voice file is play, trigger arbitrary time point in the broadcast bar of this voice document and input and determine order, determine to add voice label at this time point.
Step 102: intercept before or after this time point the sometime digital voice data of section.
In the present embodiment, according to fixed time point, intercept before or after this time point the sometime digital voice data of section.The time span of this time period is set up on their own by the user, can according to actual needs, regulate the time span of this time period.
Step 103: identify this digital voice data, obtain the word content of this digital voice data.
In the present embodiment, can but be not limited to adopt speech recognition software that the speech data of intercepting is identified, thereby obtain word content corresponding to this speech data.
In the present embodiment, the user can arrange a plurality of time points, thereby determines a plurality of voice labels, does not need artificially the content of voice label to be edited, and also can accurately obtain near the voice content of this time point.
Step 104: according to word content, add corresponding voice label.
In the present embodiment, voice label can but be not limited to comprise: the word content in this speech data.The user can consult this voice label, to consult the voice content of these time point front and back.
Step 105: provide this voice label to the user.
In the present embodiment, the user can add and the voice label that obtains random time point immediately, can understand accurately and rapidly the content of voice document.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
Embodiment 2
Referring to Fig. 2, the present embodiment provides a kind of device that adds the digital speech label, comprising: time point determining unit 201, speech data interception unit 202, recognition unit 203, label adding device 204, display unit 205, duration determining unit 206.
Main syndeton and the principle of work of its each parts are as follows:
Time point determining unit 201 is electrically connected with speech data interception unit 202, is used for determining arbitrary time point of digital voice file.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 101 in the example 1.
Speech data interception unit 202 is electrically connected with recognition unit 203, is used for identifying described digital voice data, obtains the word content of described digital voice data.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 102 in the example 1.
Recognition unit 203 is electrically connected with label adding device 204, is used for according to described word content, adds corresponding voice label.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 103 in the example 1.
Label adding device 204 with main control chip 203 connections, is used for the audio-video frequency content of main control chip 203 demodulation is externally exported.
More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 104 in the example 1.
Mixed-media network modules mixed-media 205 is electrically connected with display unit 205, is used for interconnection network, downloads or upload required data message.
More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 105 in the example 1.
Duration determining unit 206 is electrically connected with speech data interception unit 202, is used for determining the time span of this time period.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
Device embodiment described above only is schematic, wherein said unit as the separating component explanation can or can not be physically to separate also, the parts that show as the unit can be or can not be physical locations also, namely can be positioned at a place, perhaps also can be distributed on a plurality of network element.Can select according to the actual needs wherein some or all of module to realize the purpose of the present embodiment scheme.Those of ordinary skills namely can understand and implement in the situation that do not pay performing creative labour.
Above-described embodiment does not consist of the restriction to this technical scheme protection domain.Any at above-mentioned embodiment spirit and principle within do modification, be equal to and replace and improvement etc., all should be included within the protection domain of this technical scheme.
Claims (6)
1. the adding method of a digital speech label is characterized in that, comprising:
Determine the arbitrary time point in the digital voice file;
Intercept before or after the described time point the sometime digital voice data of section;
Identify described digital voice data, obtain the word content of described digital voice data;
According to described word content, add corresponding voice label, provide described voice label to the user.
2. method according to claim 1 is characterized in that,
Arbitrary time point in described definite digital voice file specifically comprises:
Determined arbitrary time point of described digital voice file by the user.
3. method according to claim 1 is characterized in that,
Before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:
Determine the time span of described time period.
4. a device that adds the digital speech label is characterized in that, comprising:
The time point determining unit is for arbitrary time point of determining digital voice file;
The speech data interception unit is used for intercepting before or after the described time point the sometime digital voice data of section;
Recognition unit is used for identifying described digital voice data, obtains the word content of described digital voice data;
The label adding device is used for according to described word content, adds corresponding voice label;
Display unit is used for providing described voice label to the user.
5. device according to claim 4 is characterized in that,
Described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.
6. described 4 described devices according to claim 5 is characterized in that, described device also comprises: the duration determining unit is used for determining the described time span that needs the intercepting time period.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012105719712A CN103065625A (en) | 2012-12-25 | 2012-12-25 | Method and device for adding digital voice tag |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN2012105719712A CN103065625A (en) | 2012-12-25 | 2012-12-25 | Method and device for adding digital voice tag |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103065625A true CN103065625A (en) | 2013-04-24 |
Family
ID=48108225
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN2012105719712A Pending CN103065625A (en) | 2012-12-25 | 2012-12-25 | Method and device for adding digital voice tag |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103065625A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104157286A (en) * | 2014-07-31 | 2014-11-19 | 深圳市金立通信设备有限公司 | Idiomatic phrase acquisition method and device |
CN104378684A (en) * | 2014-11-07 | 2015-02-25 | 重庆晋才富熙科技有限公司 | Device for conducting rapid video marking |
CN104581351A (en) * | 2015-01-28 | 2015-04-29 | 上海与德通讯技术有限公司 | Audio/video recording method, audio/video playing method and electronic device |
CN104679724A (en) * | 2013-12-03 | 2015-06-03 | 腾讯科技(深圳)有限公司 | Page noting method and device |
CN104933048A (en) * | 2014-03-17 | 2015-09-23 | 联想(北京)有限公司 | Voice message processing method and device, and electronic device |
CN105100920A (en) * | 2015-08-31 | 2015-11-25 | 北京奇艺世纪科技有限公司 | Video preview method and device |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
CN110010131A (en) * | 2019-04-04 | 2019-07-12 | 深圳市语芯维电子有限公司 | A kind of method and apparatus of speech signal analysis |
CN110992957A (en) * | 2019-11-15 | 2020-04-10 | 东华大学 | Voice data processing method based on privacy protection |
CN111711849A (en) * | 2020-06-30 | 2020-09-25 | 浙江同花顺智能科技有限公司 | Method, device and storage medium for displaying multimedia data |
CN113284509A (en) * | 2021-05-06 | 2021-08-20 | 北京百度网讯科技有限公司 | Method and device for acquiring accuracy of voice annotation and electronic equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101199018A (en) * | 2005-06-13 | 2008-06-11 | 松下电器产业株式会社 | Content tag attachment support device and content tag attachment support method |
CN101833981A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Manually-triggered court trial audio file real-time indexing system |
CN101833982A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Special sound-triggered court trial audio file real-time indexing method |
CN101833980A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Voice recognition-based court hearing audio file real-time indexing system |
CN102422284A (en) * | 2009-03-10 | 2012-04-18 | 因特拉松尼克斯有限公司 | Bookmarking system |
CN102664007A (en) * | 2012-03-27 | 2012-09-12 | 上海量明科技发展有限公司 | Method, client and system for generating character identification content |
-
2012
- 2012-12-25 CN CN2012105719712A patent/CN103065625A/en active Pending
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101199018A (en) * | 2005-06-13 | 2008-06-11 | 松下电器产业株式会社 | Content tag attachment support device and content tag attachment support method |
CN102422284A (en) * | 2009-03-10 | 2012-04-18 | 因特拉松尼克斯有限公司 | Bookmarking system |
CN101833981A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Manually-triggered court trial audio file real-time indexing system |
CN101833982A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Special sound-triggered court trial audio file real-time indexing method |
CN101833980A (en) * | 2009-03-12 | 2010-09-15 | 新奥特硅谷视频技术有限责任公司 | Voice recognition-based court hearing audio file real-time indexing system |
CN102664007A (en) * | 2012-03-27 | 2012-09-12 | 上海量明科技发展有限公司 | Method, client and system for generating character identification content |
Cited By (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104679724A (en) * | 2013-12-03 | 2015-06-03 | 腾讯科技(深圳)有限公司 | Page noting method and device |
CN104933048B (en) * | 2014-03-17 | 2018-08-31 | 联想(北京)有限公司 | A kind of voice information processing method, device and electronic equipment |
CN104933048A (en) * | 2014-03-17 | 2015-09-23 | 联想(北京)有限公司 | Voice message processing method and device, and electronic device |
CN104157286A (en) * | 2014-07-31 | 2014-11-19 | 深圳市金立通信设备有限公司 | Idiomatic phrase acquisition method and device |
CN104157286B (en) * | 2014-07-31 | 2017-12-29 | 深圳市金立通信设备有限公司 | A kind of phrasal acquisition methods and device |
CN104378684A (en) * | 2014-11-07 | 2015-02-25 | 重庆晋才富熙科技有限公司 | Device for conducting rapid video marking |
CN104581351A (en) * | 2015-01-28 | 2015-04-29 | 上海与德通讯技术有限公司 | Audio/video recording method, audio/video playing method and electronic device |
CN105100920B (en) * | 2015-08-31 | 2019-07-23 | 北京奇艺世纪科技有限公司 | A kind of method and apparatus of video preview |
CN105100920A (en) * | 2015-08-31 | 2015-11-25 | 北京奇艺世纪科技有限公司 | Video preview method and device |
CN105913838A (en) * | 2016-05-19 | 2016-08-31 | 努比亚技术有限公司 | Device and method of audio management |
CN110010131A (en) * | 2019-04-04 | 2019-07-12 | 深圳市语芯维电子有限公司 | A kind of method and apparatus of speech signal analysis |
CN110992957A (en) * | 2019-11-15 | 2020-04-10 | 东华大学 | Voice data processing method based on privacy protection |
CN110992957B (en) * | 2019-11-15 | 2023-09-08 | 东华大学 | Voice data processing method based on privacy protection |
CN111711849A (en) * | 2020-06-30 | 2020-09-25 | 浙江同花顺智能科技有限公司 | Method, device and storage medium for displaying multimedia data |
CN113284509A (en) * | 2021-05-06 | 2021-08-20 | 北京百度网讯科技有限公司 | Method and device for acquiring accuracy of voice annotation and electronic equipment |
CN113284509B (en) * | 2021-05-06 | 2024-01-16 | 北京百度网讯科技有限公司 | Method and device for obtaining accuracy of voice annotation and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103065625A (en) | Method and device for adding digital voice tag | |
US10705803B2 (en) | Method and system for realizing data tracking by means of software development kit | |
US8661502B2 (en) | Determining a sensitivity label of document information in real time | |
CN113987074A (en) | Distributed service full-link monitoring method and device, electronic equipment and storage medium | |
CN105630512A (en) | Method and system for implementing mobile device data tracking through software development toolkit | |
CN107491382B (en) | Log output method and device | |
CN105426759A (en) | URL legality determining method and apparatus | |
CN106354519A (en) | Method and device for generating label for user portrait | |
CN101957756A (en) | System and method for rapidly generating intelligent mobile terminal program | |
CN101710282A (en) | Method and device for realizing system support for multi-language resource | |
CN110798445A (en) | Public gateway interface testing method and device, computer equipment and storage medium | |
CN106911554B (en) | Historical information display method and device | |
CN104008042A (en) | UI (user interface) automated testing method, system and device | |
CN112860662B (en) | Automatic production data blood relationship establishment method, device, computer equipment and storage medium | |
CN109697281A (en) | The online method, apparatus and electronic equipment for merging document | |
CN101894021A (en) | Method and system for realizing interface of embedded system | |
CN109862399A (en) | It shows the method for rich media information, handle method, computer installation and the computer readable storage medium of rich media information | |
CN105207830A (en) | Detection method and apparatus for terminal information, and terminal | |
CN109669678A (en) | Template engine integration method, device, electronic equipment and storage medium | |
CN103312702A (en) | Service push method and device | |
CN109522507B (en) | Method for uniformly managing webpage components | |
CN101442539A (en) | Method and apparatus for implementing field filtration | |
CN103984779A (en) | Data updating method and device | |
CN110659540A (en) | Traffic light detection method and device | |
CN108268545B (en) | Method and device for establishing hierarchical user label library |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C12 | Rejection of a patent application after its publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20130424 |