CN103065625A - Method and device for adding digital voice tag - Google Patents

Method and device for adding digital voice tag Download PDF

Info

Publication number
CN103065625A
CN103065625A CN2012105719712A CN201210571971A CN103065625A CN 103065625 A CN103065625 A CN 103065625A CN 2012105719712 A CN2012105719712 A CN 2012105719712A CN 201210571971 A CN201210571971 A CN 201210571971A CN 103065625 A CN103065625 A CN 103065625A
Authority
CN
China
Prior art keywords
time point
label
digital voice
voice
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2012105719712A
Other languages
Chinese (zh)
Inventor
曾元清
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN2012105719712A priority Critical patent/CN103065625A/en
Publication of CN103065625A publication Critical patent/CN103065625A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention relates to the technical field of electronics, and discloses a method and a device for adding a digital voice tag. The method for adding the digital voice tag comprises the following steps of confirming any a time point in a digital voice file, intercepting digital voice data in a certain time range before the time point or after the time point, recognizing the digital voice data, acquiring literal content of the digital voice data, adding corresponding voice tags according to the literal content, and providing the voice tags for users. The method and the device for adding the digital voice tag can enable the users to rapidly know about content of voice files, and improves accuracy of the voice tags, convenience and practicability of usage of the users.

Description

A kind of adding method of digital speech label and device
Technical field
The present invention relates to electronic technology field, particularly a kind of adding method of digital speech label and device.
Background technology
In the application of existing digital voice file, people need to understand the content of a certain voice document at short notice.The user uses the method for adding identification label that voice document is identified mostly at present, in order to directly use identification label in the future, identifies the content of these voice.
The user can arrange identification label to the sometime point in the voice document, and this identification label mainly is word content, is inputted by hand according to the content of voice by the user.The user can arrange to a voice document identification label of any number, and the accuracy of identification content is directly proportional with the number of labels that the user arranges.If the number of labels that arranges is not enough, the accuracy of then identifying content will reduce greatly.
In addition, the operation that the adding method of this identification label needs the user to carry out multistep could realize, formality is loaded down with trivial details, length consuming time, and when using voice label, can only be to obtain the identification content at the time point that label is set, can not obtain voice content at time point arbitrarily, less pertinence is strong, locates not accurate enough.
Summary of the invention
The embodiment of the invention the first purpose is to provide a kind of adding method of digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.
The embodiment of the invention the second purpose is to provide a kind of device that adds the digital speech label, uses this technical scheme and can make the user can understand fast the content of voice document, improves the accuracy of voice label, and user convenience and the practicality used.
First aspect the invention provides a kind of adding method of digital speech label, comprising:
Determine the arbitrary time point in the digital voice file;
Intercept before or after the described time point the sometime digital voice data of section;
Identify described digital voice data, obtain the word content of described digital voice data;
According to described word content, add corresponding voice label, provide described voice label to the user.
In conjunction with first aspect, under the first implementation, the arbitrary time point in described definite digital voice file specifically comprises:
Determined arbitrary time point of described digital voice file by the user.
In conjunction with first aspect, under the second implementation, before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:
Determine the time span of described time period.
Second aspect, the present embodiment provide a kind of device that adds the digital speech label, comprising:
The time point determining unit is for arbitrary time point of determining digital voice file;
The speech data interception unit is used for intercepting before or after the described time point the sometime digital voice data of section;
Recognition unit is used for identifying described digital voice data, obtains the word content of described digital voice data;
The label adding device is used for according to described word content, adds corresponding voice label;
Display unit is used for providing described voice label to the user.
In conjunction with second aspect, under the first implementation, described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.
In conjunction with first aspect, under the second implementation, described device also comprises:
The duration determining unit is used for determining the described time span that needs the intercepting time period.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
In order to be illustrated more clearly in the embodiment of the invention or technical scheme of the prior art, the below will do to introduce simply to the accompanying drawing of required use in embodiment or the description of the Prior Art, apparently, accompanying drawing in the following describes only is some embodiments of the present invention, for those of ordinary skills, under the prerequisite of not paying creative work, can also obtain according to these accompanying drawings other accompanying drawing.
The schematic flow sheet of the adding method of a kind of digital speech label that Fig. 1 provides for the embodiment of the invention 1;
A kind of structural representation that adds the device of digital speech label that Fig. 2 provides for the embodiment of the invention 2.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the invention, the technical scheme in the embodiment of the invention is clearly and completely described, obviously, described embodiment only is the present invention's part embodiment, rather than whole embodiment.Based on the embodiment among the present invention, those of ordinary skills belong to the scope of protection of the invention not making the every other embodiment that obtains under the creative work prerequisite.
Embodiment 1
Referring to Fig. 1, the present embodiment provides a kind of adding method of digital speech label, is applicable to the content that the user understands voice document fast.Its key step comprises:
Step 101: determine the arbitrary time point in the digital voice file.
In the present embodiment, can but be not limited to determine time point in the digital voice file by the user.The user can according to actual needs, arrange voice label to the arbitrary time point in the voice document.The user can but be not limited to when digital voice file is play, trigger arbitrary time point in the broadcast bar of this voice document and input and determine order, determine to add voice label at this time point.
Step 102: intercept before or after this time point the sometime digital voice data of section.
In the present embodiment, according to fixed time point, intercept before or after this time point the sometime digital voice data of section.The time span of this time period is set up on their own by the user, can according to actual needs, regulate the time span of this time period.
Step 103: identify this digital voice data, obtain the word content of this digital voice data.
In the present embodiment, can but be not limited to adopt speech recognition software that the speech data of intercepting is identified, thereby obtain word content corresponding to this speech data.
In the present embodiment, the user can arrange a plurality of time points, thereby determines a plurality of voice labels, does not need artificially the content of voice label to be edited, and also can accurately obtain near the voice content of this time point.
Step 104: according to word content, add corresponding voice label.
In the present embodiment, voice label can but be not limited to comprise: the word content in this speech data.The user can consult this voice label, to consult the voice content of these time point front and back.
Step 105: provide this voice label to the user.
In the present embodiment, the user can add and the voice label that obtains random time point immediately, can understand accurately and rapidly the content of voice document.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
Embodiment 2
Referring to Fig. 2, the present embodiment provides a kind of device that adds the digital speech label, comprising: time point determining unit 201, speech data interception unit 202, recognition unit 203, label adding device 204, display unit 205, duration determining unit 206.
Main syndeton and the principle of work of its each parts are as follows:
Time point determining unit 201 is electrically connected with speech data interception unit 202, is used for determining arbitrary time point of digital voice file.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 101 in the example 1.
Speech data interception unit 202 is electrically connected with recognition unit 203, is used for identifying described digital voice data, obtains the word content of described digital voice data.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 102 in the example 1.
Recognition unit 203 is electrically connected with label adding device 204, is used for according to described word content, adds corresponding voice label.
More detailed contents of this unit and principle can but be not limited to corresponding record referring to step 103 in the example 1.
Label adding device 204 with main control chip 203 connections, is used for the audio-video frequency content of main control chip 203 demodulation is externally exported.
More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 104 in the example 1.
Mixed-media network modules mixed-media 205 is electrically connected with display unit 205, is used for interconnection network, downloads or upload required data message.
More detailed contents of these parts and principle can but be not limited to corresponding record referring to step 105 in the example 1.
Duration determining unit 206 is electrically connected with speech data interception unit 202, is used for determining the time span of this time period.
Therefore, use the present embodiment technical scheme, for each digital voice file, after the user determines a time point in the voice document, can intercept near one section speech data that duration is set by the user of this time point in the backstage, identify this speech data, obtain corresponding word content, provide this literal content to the user.In prior art, when adding voice label, need manually voice content to be inputted, and the employing the technical program, the word content of voice label is drawn by the system identification voice content, therefore the workload in the time of reducing artificial interpolation label improves user's ease of use.
Further, for adding tagged voice document, in the prior art, owing to only having the voice label of limited number, the user can only consult this voice label, thereby obtains the content of voice document, and accuracy is not high.And the employing the technical program, the user can at arbitrary time point of voice document, check to have strengthened practicality and convenience by the voice content of this time point near the time period immediately.In addition, the word content of this voice label is identified by system and is drawn, and improves the accuracy of label substance.
To sum up, adopt the present embodiment technical scheme, can make the user can understand fast the content of voice document, improve the accuracy of voice label, and user convenience and the practicality used.
Device embodiment described above only is schematic, wherein said unit as the separating component explanation can or can not be physically to separate also, the parts that show as the unit can be or can not be physical locations also, namely can be positioned at a place, perhaps also can be distributed on a plurality of network element.Can select according to the actual needs wherein some or all of module to realize the purpose of the present embodiment scheme.Those of ordinary skills namely can understand and implement in the situation that do not pay performing creative labour.
Above-described embodiment does not consist of the restriction to this technical scheme protection domain.Any at above-mentioned embodiment spirit and principle within do modification, be equal to and replace and improvement etc., all should be included within the protection domain of this technical scheme.

Claims (6)

1. the adding method of a digital speech label is characterized in that, comprising:
Determine the arbitrary time point in the digital voice file;
Intercept before or after the described time point the sometime digital voice data of section;
Identify described digital voice data, obtain the word content of described digital voice data;
According to described word content, add corresponding voice label, provide described voice label to the user.
2. method according to claim 1 is characterized in that,
Arbitrary time point in described definite digital voice file specifically comprises:
Determined arbitrary time point of described digital voice file by the user.
3. method according to claim 1 is characterized in that,
Before or after the described time point of described intercepting sometime the section digital voice data before, also comprise:
Determine the time span of described time period.
4. a device that adds the digital speech label is characterized in that, comprising:
The time point determining unit is for arbitrary time point of determining digital voice file;
The speech data interception unit is used for intercepting before or after the described time point the sometime digital voice data of section;
Recognition unit is used for identifying described digital voice data, obtains the word content of described digital voice data;
The label adding device is used for according to described word content, adds corresponding voice label;
Display unit is used for providing described voice label to the user.
5. device according to claim 4 is characterized in that,
Described time point determining unit, the concrete arbitrary time point that is used for being determined by the user described digital voice file.
6. described 4 described devices according to claim 5 is characterized in that, described device also comprises: the duration determining unit is used for determining the described time span that needs the intercepting time period.
CN2012105719712A 2012-12-25 2012-12-25 Method and device for adding digital voice tag Pending CN103065625A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012105719712A CN103065625A (en) 2012-12-25 2012-12-25 Method and device for adding digital voice tag

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012105719712A CN103065625A (en) 2012-12-25 2012-12-25 Method and device for adding digital voice tag

Publications (1)

Publication Number Publication Date
CN103065625A true CN103065625A (en) 2013-04-24

Family

ID=48108225

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012105719712A Pending CN103065625A (en) 2012-12-25 2012-12-25 Method and device for adding digital voice tag

Country Status (1)

Country Link
CN (1) CN103065625A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104157286A (en) * 2014-07-31 2014-11-19 深圳市金立通信设备有限公司 Idiomatic phrase acquisition method and device
CN104378684A (en) * 2014-11-07 2015-02-25 重庆晋才富熙科技有限公司 Device for conducting rapid video marking
CN104581351A (en) * 2015-01-28 2015-04-29 上海与德通讯技术有限公司 Audio/video recording method, audio/video playing method and electronic device
CN104679724A (en) * 2013-12-03 2015-06-03 腾讯科技(深圳)有限公司 Page noting method and device
CN104933048A (en) * 2014-03-17 2015-09-23 联想(北京)有限公司 Voice message processing method and device, and electronic device
CN105100920A (en) * 2015-08-31 2015-11-25 北京奇艺世纪科技有限公司 Video preview method and device
CN105913838A (en) * 2016-05-19 2016-08-31 努比亚技术有限公司 Device and method of audio management
CN110010131A (en) * 2019-04-04 2019-07-12 深圳市语芯维电子有限公司 A kind of method and apparatus of speech signal analysis
CN110992957A (en) * 2019-11-15 2020-04-10 东华大学 Voice data processing method based on privacy protection
CN111711849A (en) * 2020-06-30 2020-09-25 浙江同花顺智能科技有限公司 Method, device and storage medium for displaying multimedia data
CN113284509A (en) * 2021-05-06 2021-08-20 北京百度网讯科技有限公司 Method and device for acquiring accuracy of voice annotation and electronic equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101199018A (en) * 2005-06-13 2008-06-11 松下电器产业株式会社 Content tag attachment support device and content tag attachment support method
CN101833981A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Manually-triggered court trial audio file real-time indexing system
CN101833982A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Special sound-triggered court trial audio file real-time indexing method
CN101833980A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Voice recognition-based court hearing audio file real-time indexing system
CN102422284A (en) * 2009-03-10 2012-04-18 因特拉松尼克斯有限公司 Bookmarking system
CN102664007A (en) * 2012-03-27 2012-09-12 上海量明科技发展有限公司 Method, client and system for generating character identification content

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101199018A (en) * 2005-06-13 2008-06-11 松下电器产业株式会社 Content tag attachment support device and content tag attachment support method
CN102422284A (en) * 2009-03-10 2012-04-18 因特拉松尼克斯有限公司 Bookmarking system
CN101833981A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Manually-triggered court trial audio file real-time indexing system
CN101833982A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Special sound-triggered court trial audio file real-time indexing method
CN101833980A (en) * 2009-03-12 2010-09-15 新奥特硅谷视频技术有限责任公司 Voice recognition-based court hearing audio file real-time indexing system
CN102664007A (en) * 2012-03-27 2012-09-12 上海量明科技发展有限公司 Method, client and system for generating character identification content

Cited By (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104679724A (en) * 2013-12-03 2015-06-03 腾讯科技(深圳)有限公司 Page noting method and device
CN104933048B (en) * 2014-03-17 2018-08-31 联想(北京)有限公司 A kind of voice information processing method, device and electronic equipment
CN104933048A (en) * 2014-03-17 2015-09-23 联想(北京)有限公司 Voice message processing method and device, and electronic device
CN104157286A (en) * 2014-07-31 2014-11-19 深圳市金立通信设备有限公司 Idiomatic phrase acquisition method and device
CN104157286B (en) * 2014-07-31 2017-12-29 深圳市金立通信设备有限公司 A kind of phrasal acquisition methods and device
CN104378684A (en) * 2014-11-07 2015-02-25 重庆晋才富熙科技有限公司 Device for conducting rapid video marking
CN104581351A (en) * 2015-01-28 2015-04-29 上海与德通讯技术有限公司 Audio/video recording method, audio/video playing method and electronic device
CN105100920B (en) * 2015-08-31 2019-07-23 北京奇艺世纪科技有限公司 A kind of method and apparatus of video preview
CN105100920A (en) * 2015-08-31 2015-11-25 北京奇艺世纪科技有限公司 Video preview method and device
CN105913838A (en) * 2016-05-19 2016-08-31 努比亚技术有限公司 Device and method of audio management
CN110010131A (en) * 2019-04-04 2019-07-12 深圳市语芯维电子有限公司 A kind of method and apparatus of speech signal analysis
CN110992957A (en) * 2019-11-15 2020-04-10 东华大学 Voice data processing method based on privacy protection
CN110992957B (en) * 2019-11-15 2023-09-08 东华大学 Voice data processing method based on privacy protection
CN111711849A (en) * 2020-06-30 2020-09-25 浙江同花顺智能科技有限公司 Method, device and storage medium for displaying multimedia data
CN113284509A (en) * 2021-05-06 2021-08-20 北京百度网讯科技有限公司 Method and device for acquiring accuracy of voice annotation and electronic equipment
CN113284509B (en) * 2021-05-06 2024-01-16 北京百度网讯科技有限公司 Method and device for obtaining accuracy of voice annotation and electronic equipment

Similar Documents

Publication Publication Date Title
CN103065625A (en) Method and device for adding digital voice tag
US10705803B2 (en) Method and system for realizing data tracking by means of software development kit
US8661502B2 (en) Determining a sensitivity label of document information in real time
CN113987074A (en) Distributed service full-link monitoring method and device, electronic equipment and storage medium
CN105630512A (en) Method and system for implementing mobile device data tracking through software development toolkit
CN107491382B (en) Log output method and device
CN105426759A (en) URL legality determining method and apparatus
CN106354519A (en) Method and device for generating label for user portrait
CN101957756A (en) System and method for rapidly generating intelligent mobile terminal program
CN101710282A (en) Method and device for realizing system support for multi-language resource
CN110798445A (en) Public gateway interface testing method and device, computer equipment and storage medium
CN106911554B (en) Historical information display method and device
CN104008042A (en) UI (user interface) automated testing method, system and device
CN112860662B (en) Automatic production data blood relationship establishment method, device, computer equipment and storage medium
CN109697281A (en) The online method, apparatus and electronic equipment for merging document
CN101894021A (en) Method and system for realizing interface of embedded system
CN109862399A (en) It shows the method for rich media information, handle method, computer installation and the computer readable storage medium of rich media information
CN105207830A (en) Detection method and apparatus for terminal information, and terminal
CN109669678A (en) Template engine integration method, device, electronic equipment and storage medium
CN103312702A (en) Service push method and device
CN109522507B (en) Method for uniformly managing webpage components
CN101442539A (en) Method and apparatus for implementing field filtration
CN103984779A (en) Data updating method and device
CN110659540A (en) Traffic light detection method and device
CN108268545B (en) Method and device for establishing hierarchical user label library

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20130424