CN104090955A - Automatic audio/video label labeling method and system - Google Patents

Automatic audio/video label labeling method and system Download PDF

Info

Publication number
CN104090955A
CN104090955A CN201410320555.4A CN201410320555A CN104090955A CN 104090955 A CN104090955 A CN 104090955A CN 201410320555 A CN201410320555 A CN 201410320555A CN 104090955 A CN104090955 A CN 104090955A
Authority
CN
China
Prior art keywords
keyword
video
audio
subject
label
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410320555.4A
Other languages
Chinese (zh)
Inventor
徐玉林
王政
钟锟
胡国亮
梁昭
张建华
王丽红
郭强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
iFlytek Co Ltd
Original Assignee
iFlytek Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by iFlytek Co Ltd filed Critical iFlytek Co Ltd
Priority to CN201410320555.4A priority Critical patent/CN104090955A/en
Publication of CN104090955A publication Critical patent/CN104090955A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/31Indexing; Data structures therefor; Storage structures
    • G06F16/313Selection or weighting of terms for indexing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • Software Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to the technical field of information labeling, and discloses an automatic audio/video label labeling method and system. The method includes the steps that knowledge points and vocabularies of all subjects are captured in advance, and a corresponding subject mapping knowledge domain is built; the subject vocabularies serve as a hot word source, and voice frequencies extracted from an audio/video to be labeled are converted into a text; key words in the text are extracted, and the subject to which the audio/video belongs and the knowledge points are determined according to the incidence relation of the keywords and the mapping knowledge domain; a label corresponding to the audio/video is built, and the label comprises the keywords and the subject to which the audio/video belongs and the knowledge points. By means of the automatic audio/video label labeling method and system, the audio/video source contents can be fully excavated, automatic labeling of labels is achieved, the amount of manual intervention is reduced, and meanwhile, a good basis is provided for the later resource push and other services.

Description

A kind of audio frequency and video label automatic marking method and system
Technical field
The present invention relates to information labeling technical field, be specifically related to a kind of audio frequency and video label automatic marking method and system.
Background technology
Flourish along with internet and education cloud, education and instruction class resource is tinkling of jades meets the eye on every side, uneven.For teacher, student, can be by a small amount of metadata, as title etc. judges whether resource is that self is required, this mode depends on metadata, and the wrongly written or mispronounced characters in title may all can affect user's judgement; Also may need completely to browse whole audio frequency and video and could determine whether the content of this resource is required resource, and complete browse whole audio frequency and video can be more consuming time.Visible, traditional this audio frequency and video obtain manner cannot meet current from magnanimity Internet resources quick obtaining meet the demand of oneself requirement resource.
To education and instruction class resource, especially audio frequency and video are carried out label automatic marking, on the one hand, the label of mark can excavate the actual content of current resource better, makes up the shortcoming of metadata deficiency, and user no longer needs completely to browse whole audio frequency and video and just can capture flesh and blood, on the other hand, the label of mark has great facilitation to resource supplying field, and therefore, the automatic marking of label is significant to the change of current modern education teaching pattern.
Summary of the invention
The embodiment of the present invention provides a kind of audio frequency and video label automatic marking method and system, can allow user in the situation that not browsing whole audio frequency and video, accurately holds the content of this audio and video resources; Reduce artificial participation amount; For the follow-up services such as resource recommendation provide foundation more accurately.
For this reason, the invention provides following technical scheme:
An audio frequency and video label automatic marking method, comprising:
Capture in advance each subject knowledge point and subject vocabulary, build corresponding subject knowledge collection of illustrative plates;
Using described subject vocabulary as hot word resource, the audio conversion that audio or video to be marked is extracted is write as text;
Extract the keyword in described text, and determine subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates;
Set up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.
Preferably, described keyword has one or more.
Preferably, the keyword in the described text of described extraction comprises:
Described text is carried out to participle, obtain each sub-word;
Calculate the TF-IDF value of each sub-word;
Using described TF-IDF value higher than the sub-word of setting threshold as keyword, or according to described TF-IDF value order from high to low choose set number above sub-word as keyword.
Preferably, the incidence relation of described keyword and knowledge collection of illustrative plates comprises: position and number of times that described keyword occurs in described knowledge collection of illustrative plates.
Preferably, described method also comprises:
The selection number of times of counting user to label, adds, deletes or replace label according to described selection number of times.
An audio frequency and video label automatic marking system, comprising:
Handling module, for capturing in advance each subject knowledge point and subject vocabulary;
Map construction module, for building the subject knowledge collection of illustrative plates of corresponding described subject knowledge point and subject vocabulary;
Transcription module, is write as text for the audio conversion that audio or video to be marked is extracted, and during transcription, usings described subject vocabulary as hot word resource;
Keyword extracting module, for extracting the keyword of described text;
Information determination module, for determining subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates;
Label is set up module, and for setting up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.
Preferably, described keyword has one or more.
Preferably, described keyword extracting module comprises:
Participle unit, for described text is carried out to participle, obtains each sub-word;
Computing unit, for calculating the TF-IDF value of described each sub-word;
Extraction unit, for described TF-IDF value is extracted as to keyword higher than the sub-word of setting threshold, or chooses according to described TF-IDF value order from high to low the sub-word of setting number above and is extracted as keyword.
Preferably, the incidence relation of described keyword and knowledge collection of illustrative plates comprises: position and number of times that described keyword occurs in described knowledge collection of illustrative plates.
Preferably, described system also comprises:
Optimize module, the selection number of times for counting user to label, adds, deletes or replace label according to described selection number of times.
Audio frequency and video label automatic marking method and system that the embodiment of the present invention provides, utilize speech transcription technology and abundant internet data, audio and video resources is carried out to speech transcription, keyword extraction, and determine subject and the knowledge point under audio or video according to keyword and knowledge collection of illustrative plates, realize label automatic marking, reduced artificial participation amount, can provide good foundation for the follow-up services such as resource supplying, be more conducive to teacher, student and find in time high-quality teaching resource simultaneously.
Accompanying drawing explanation
In order to be illustrated more clearly in the embodiment of the present application or technical scheme of the prior art, to the accompanying drawing of required use in embodiment be briefly described below, apparently, the accompanying drawing the following describes is only some embodiment that record in the present invention, for those of ordinary skills, can also obtain according to these accompanying drawings other accompanying drawing.
Fig. 1 is the process flow diagram of embodiment of the present invention audio frequency and video label automatic marking method;
Fig. 2 is a kind of simple examples of the subject knowledge collection of illustrative plates that builds in the embodiment of the present invention;
Fig. 3 is a kind of structural representation of embodiment of the present invention audio frequency and video label automatic marking system.
Embodiment
In order to make those skilled in the art person understand better the scheme of the embodiment of the present invention, below in conjunction with drawings and embodiments, the embodiment of the present invention is described in further detail.
Flourish along with internet and education cloud, various audio and video resources are tinkling of jades meets the eye on every side, uneven.In prior art, can be by a small amount of metadata, as title etc. judges whether resource is that self is required, this mode depends on metadata, and the wrongly written or mispronounced characters in title may all can affect user's judgement; Also may need completely to browse whole audio frequency and video and could determine whether the content of this resource is required resource, and complete browse whole audio frequency and video can be more consuming time.For this reason, the embodiment of the present invention provides a kind of audio frequency and video label automatic marking method and system, makes user no longer need completely to browse whole audio frequency and video and just can capture flesh and blood.First, by instruments such as web crawlers, capture each subject knowledge point and subject vocabulary, build corresponding subject knowledge collection of illustrative plates; Then, using described subject vocabulary as hot word resource, the audio conversion that audio or video to be marked is extracted is write as text; Secondly, extract the keyword in transcription text, and subject and knowledge point under determining corresponding audio or video according to this keyword with the incidence relation of constructed knowledge collection of illustrative plates; Finally, set up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.
As shown in Figure 1, be the process flow diagram of embodiment of the present invention audio frequency and video label automatic marking method, comprise the following steps:
Step 101, captures each subject knowledge point and subject vocabulary in advance, builds corresponding subject knowledge collection of illustrative plates.
Knowledge collection of illustrative plates, also can be described as Knowledge Map, to take scientific knowledge as quantitative study object, carry out a series of various figure of explicit knowledge's development process and structural relation, it can describe knowledge resource and the carrier thereof that the mankind have in time with visualization technique, draws, excavates, analyzes and demonstration scientific knowledge and connecting each other between them.On internet, ubiquitous hypertext link is exactly a kind of simple form of knowledge collection of illustrative plates.In embodiments of the present invention, described subject knowledge collection of illustrative plates comprises each knowledge point and the mutual relationship thereof specific to certain subject.The effect of subject knowledge collection of illustrative plates is to show the correlativity of each vocabulary in this subject, and the propelling movement of the prediction of subject and resource is had to vital effect.
In practical application, can first by instruments such as web crawlers, capture subject knowledge point and subject vocabulary, such as, this knowledge point of buoyancy in physics subject, then, usings subject knowledge point as starting point, utilizes vertical search engine to obtain the word lists of Knowledge Relation.Vertical search engine is the professional search engine for some industries, segmentation and the extension of search engine, be that the special information of certain class in web page library is once integrated, directed minute field extracts after the data that need are processed and with certain form, returns to user again.To each vocabulary, obtain its encyclopaedia content and encyclopaedia entry label, to judge whether this vocabulary is this subject vocabulary, constantly vocabulary is carried out to degree of depth traversal, forms the knowledge collection of illustrative plates of corresponding subject.As shown in Figure 2, be a kind of simple examples of the subject knowledge collection of illustrative plates that builds in the embodiment of the present invention.
Step 102, using described subject vocabulary as hot word resource, the audio conversion that audio or video to be marked is extracted is write as text.
Particularly, when the audio conversion that audio or video to be marked is extracted is write as text, can use existing speech transcription technology to carry out audio frequency and video transcription.But the complicacy due to Chinese, traditional speech transcription technology transcription accuracy rate is generally lower, can not meet the demand of practical application, still needs significantly to improve just and can be applied, especially for the education and instruction class audio and video resources that comprises a large amount of specialized vocabularies, the accuracy rate of speech transcription may be lower.
For this reason, in embodiments of the present invention, using the specialized vocabulary that grabs as hot word resource, carry out speech transcription, tone decoding can be selected traditional acoustic model and language model, is not needing to revise under the prerequisite of "current" model, can make speech transcription accuracy rate be greatly enhanced.
Step 103, extracts the keyword in described text, and determines subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates.
Particularly, during keyword in extracting described text, first to carry out participle to the text, obtain each sub-word, then calculate TF-IDF (the Term Frequency – Inverse Document Frequency of each sub-word, word frequency-reverse document frequency) value, finally judges that according to the TF-IDF value of each sub-word (frequency occurring in numerous documents by adding up frequency that each sub-word occurs in current document and this word) can this word as the keyword of the text.
In the present embodiment, while determining the keyword of text according to the TF-IDF value of each sub-word, can there is following multiple definite method, such as:
(1) setting threshold method: first the method sets TF-IDF threshold value (as 0.202), is then defined as keyword by TF-IDF value in text higher than the sub-word of setting threshold; For different texts, under same setting threshold, the keyword number of extracting may be different.
(2) set number method: first the method sets keyword number (as 5) to be extracted, the sub-word of then choosing setting number according to the TF-IDF value of each sub-word in text order is from high to low as keyword.
The keyword accuracy of using TF-IDF technology to extract is very little to the dependence of the accuracy of speech transcription, even if speech transcription accuracy rate lower than 50%, is used TF-IDF technology still can extract key word information accurately.
It should be noted that, can extract one or more sub-word in text as the keyword of text, keyword number (generally desirable 3~5) can set according to user's request.
After having determined the keyword of text, according to the incidence relation of keyword and subject knowledge collection of illustrative plates, determine subject and the knowledge point under described audio or video.Such as, if the keyword the extracting number of times that place, this knowledge point of equation solution occurs in Mathematics Discipline knowledge collection of illustrative plates is the highest, can determine the equation solution that subject under the corresponding audio or video of this keyword and knowledge point are Mathematics Discipline.
Step 104, sets up the label of corresponding described audio or video, and described label comprises: subject, knowledge point under described keyword and described audio or video.
Extracting the keyword of audio or video, determining behind the corresponding subject of these audio frequency and video and knowledge point, automatic marking can carried out to as the label of these audio frequency and video in described keyword, affiliated subject, knowledge point.Such as: extracting keyword number is 5, and the label of the audio frequency and video that this keyword is corresponding comprises: 5 keywords, affiliated subject, knowledge point, amount to 7 labels.
The audio frequency and video label automatic marking method that the embodiment of the present invention provides, utilize speech transcription technology and abundant internet data, audio and video resources is carried out speech transcription, keyword extraction, according to knowledge collection of illustrative plates, determined subject and knowledge point, can fully excavate audio and video resources content, find in time the audio and video resources of high-quality, also can provide good foundation for the follow-up services such as resource supplying.
In order further to optimize the label of automatic marking, the flesh and blood that reflects better audio frequency and video, in another embodiment of audio frequency and video label automatic marking method of the present invention, also comprise: the selection number of times of counting user to label, adds, deletes or replace label according to described selection number of times.Such as: the selection number of times of counting user to label, retain user and select number of times higher than the label of set point number threshold value, delete or replace user and select number of times lower than the label of set point number threshold value.And can be according to the deleting or add of label, hot word resource and knowledge collection of illustrative plates are carried out perfect, and then again set up more excellent audio frequency and video label.
For example: for one piece of text of transcription, the possibility of result of system mark is " gravity, universal gravitation, quality, newton, experiment, physics, Newton's law " these labels, user to the feedback procedure of label in, find that more than 90% user does not support this label of word " experiment ", and support other words, when optimizing knowledge collection of illustrative plates and hot word resource, first reduce " experiment " weights in corpus, next searches for knowledge collection of illustrative plates, associated all the other words, by " universal gravitation, quality, newton, physics, Newton's law " and " gravity " associate, and record the degree of association, along with increasing of user feedback, knowledge collection of illustrative plates can be more and more abundanter, more and more accurate.
Correspondingly, the embodiment of the present invention also provides a kind of audio frequency and video label automatic marking system, as shown in Figure 3, is a kind of structural representation of this system.
In this embodiment, described system comprises: handling module 201, and map construction module 202, transcription module 203, keyword extracting module 204, information determination module 205 and label are set up module 206.Wherein:
Handling module 201, for capturing in advance each subject knowledge point and subject vocabulary.
In practical application, handling module 201 can first capture subject knowledge point and subject vocabulary by instruments such as web crawlers, usings subject knowledge point as starting point, obtains the word lists of Knowledge Relation according to Baidu's vertical search.
Map construction module 202, for building the subject knowledge collection of illustrative plates of corresponding described subject knowledge point and subject vocabulary.
Concrete, each vocabulary that 202 pairs of map construction modules grab, obtain its Baidupedia content and Baidupedia entry label, to judge whether this vocabulary is this subject vocabulary, constantly vocabulary is carried out to degree of depth traversal, obtain the incidence relation of each subject knowledge point, each subject vocabulary, and build knowledge collection of illustrative plates according to subject knowledge point, subject vocabulary and incidence relation thereof.
Transcription module 203, is write as text for the audio conversion that audio or video to be marked is extracted, and during transcription, usings described subject vocabulary as hot word resource.
Particularly, when transcription module 203 is write as text at the audio conversion that audio or video to be marked is extracted, can use existing speech transcription technology to carry out audio frequency and video transcription.But the complicacy due to Chinese, traditional speech transcription technology transcription accuracy rate is generally lower, can not meet the demand of practical application, still needs significantly to improve just and can be applied, especially for the education and instruction class audio and video resources that comprises a large amount of specialized vocabularies, the accuracy rate of speech transcription may be lower.
For this reason, in embodiments of the present invention, transcription module 203 is usingd the specialized vocabulary that grabs as hot word resource, carry out speech transcription, tone decoding is selected traditional acoustic model or language model, do not needing to revise under the prerequisite of "current" model, can make speech transcription accuracy rate be greatly enhanced.
Keyword extracting module 204, for extracting the keyword of described text.
Particularly, keyword extracting module 204 comprises: participle unit, computing unit and extraction unit.During keyword in extracting described text, described participle unit carries out participle to described text, obtains each sub-word; Described computing unit calculates the TF-IDF value of each sub-word, described extraction unit extracts the keyword of corresponding text according to the TF-IDF value of each sub-word, such as, described TF-IDF value can be extracted as to keyword higher than the sub-word of setting threshold, or according to described TF-IDF value order from high to low, choose the sub-word of setting number above and be extracted as keyword.For different texts, under same setting threshold, the keyword number of extracting may be different.
The keyword accuracy that keyword extracting module 204 is used TF-IDF technology to extract is very little to the dependence of the accuracy of speech transcription, even if speech transcription accuracy rate, lower than 50%, still can be extracted key word information accurately.
It should be noted that, keyword extracting module 204 can be extracted one or more sub-word in text as the keyword of text, and keyword number (generally can get 3~5) can be set according to user's request.
Information determination module 205, for determining subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates.
Particularly, after having determined the keyword of text, information determination module 205 is determined subject and the knowledge point under described audio or video according to the incidence relation of keyword and subject knowledge collection of illustrative plates.Such as: if the keyword that keyword extracting module 204 the is extracted number of times that place, this knowledge point of equation solution occurs in Mathematics Discipline knowledge collection of illustrative plates is the highest, can determine the equation solution that subject under the corresponding audio or video of this keyword and knowledge point are Mathematics Discipline.
Label is set up module 206, and for setting up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.Such as: the keyword number obtaining is 5, the label of the audio frequency and video that this keyword is corresponding comprises: 5 keywords, affiliated subject, knowledge point, label is set up module 206 need to set up 7 labels (5 keywords, affiliated subject, knowledge point) for these audio frequency and video.
The audio frequency and video label automatic marking system that the embodiment of the present invention provides, utilize advanced at present speech transcription technology and abundant internet data, audio and video resources is carried out speech transcription, keyword extraction, according to knowledge collection of illustrative plates, determined subject and knowledge point, can fully excavate audio and video resources content, find in time the audio and video resources of high-quality, also can provide good foundation for the follow-up services such as resource supplying.
In order further to optimize the label of automatic marking, the flesh and blood that reflects better audio frequency and video, in another embodiment of audio frequency and video label automatic marking system of the present invention, also comprise: optimize module (not shown), for the selection number of times of counting user to label, according to described selection number of times, add, delete or replace label.Such as: the selection number of times of counting user to label, retaining user selects number of times higher than the label of setting threshold, deleting or replace user selects number of times lower than the label of setting threshold, according to deleting or adding label, hot word resource and knowledge collection of illustrative plates are carried out perfect, and then again set up more excellent audio frequency and video label.
Between each embodiment in this instructions identical similar part mutually referring to, for system embodiment, because it is substantially similar in appearance to embodiment of the method, so describe fairly simplely, relevant part is referring to the part explanation of embodiment of the method.System embodiment described above is only schematic, the wherein said module as separating component explanation can or can not be also physically to separate, the parts that show as module can be or can not be also physical locations, can be positioned at a place, or also can be distributed in a plurality of network element.Can select according to the actual needs some or all of module wherein to realize the object of the present embodiment scheme.Those of ordinary skills, in the situation that not paying creative work, are appreciated that and implement.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module moved on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize the some or all functions according to the some or all parts in the embodiment of the present invention.The present invention for example can also be embodied as, for carrying out part or all equipment or device program (, computer program and computer program) of method as described herein.
Above the embodiment of the present invention is described in detail, has applied embodiment herein the present invention is set forth, the explanation of above embodiment is just for helping to understand method and apparatus of the present invention; , for one of ordinary skill in the art, according to thought of the present invention, all will change in specific embodiments and applications, in sum, this description should not be construed as limitation of the present invention meanwhile.

Claims (10)

1. an audio frequency and video label automatic marking method, is characterized in that, comprising:
Capture in advance each subject knowledge point and subject vocabulary, build corresponding subject knowledge collection of illustrative plates;
Using described subject vocabulary as hot word resource, the audio conversion that audio or video to be marked is extracted is write as text;
Extract the keyword in described text, and determine subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates;
Set up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.
2. method according to claim 1, is characterized in that, described keyword has one or more.
3. method according to claim 1, is characterized in that, the keyword in the described text of described extraction comprises:
Described text is carried out to participle, obtain each sub-word;
Calculate the TF-IDF value of each sub-word;
Using described TF-IDF value higher than the sub-word of setting threshold as keyword, or according to described TF-IDF value order from high to low choose set number above sub-word as keyword.
4. method according to claim 1, is characterized in that, the incidence relation of described keyword and knowledge collection of illustrative plates comprises: position and number of times that described keyword occurs in described knowledge collection of illustrative plates.
5. according to the method described in claim 1 to 4 any one, it is characterized in that, described method also comprises:
The selection number of times of counting user to label, adds, deletes or replace label according to described selection number of times.
6. an audio frequency and video label automatic marking system, is characterized in that, comprising:
Handling module, for capturing in advance each subject knowledge point and subject vocabulary;
Map construction module, for building the subject knowledge collection of illustrative plates of corresponding described subject knowledge point and subject vocabulary;
Transcription module, is write as text for the audio conversion that audio or video to be marked is extracted, and during transcription, usings described subject vocabulary as hot word resource;
Keyword extracting module, for extracting the keyword of described text;
Information determination module, for determining subject and the knowledge point under described audio or video according to the incidence relation of described keyword and described knowledge collection of illustrative plates;
Label is set up module, and for setting up the label of corresponding described audio or video, described label comprises: subject, knowledge point under described keyword and described audio or video.
7. system according to claim 6, is characterized in that, described keyword has one or more.
8. system according to claim 6, is characterized in that, described keyword extracting module comprises:
Participle unit, for described text is carried out to participle, obtains each sub-word;
Computing unit, for calculating the TF-IDF value of described each sub-word;
Extraction unit, for described TF-IDF value is extracted as to keyword higher than the sub-word of setting threshold, or chooses according to described TF-IDF value order from high to low the sub-word of setting number above and is extracted as keyword.
9. system according to claim 6, is characterized in that, the incidence relation of described keyword and knowledge collection of illustrative plates comprises: position and number of times that described keyword occurs in described knowledge collection of illustrative plates.
10. according to the system described in claim 6 to 9 any one, it is characterized in that, described system also comprises:
Optimize module, the selection number of times for counting user to label, adds, deletes or replace label according to described selection number of times.
CN201410320555.4A 2014-07-07 2014-07-07 Automatic audio/video label labeling method and system Pending CN104090955A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410320555.4A CN104090955A (en) 2014-07-07 2014-07-07 Automatic audio/video label labeling method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410320555.4A CN104090955A (en) 2014-07-07 2014-07-07 Automatic audio/video label labeling method and system

Publications (1)

Publication Number Publication Date
CN104090955A true CN104090955A (en) 2014-10-08

Family

ID=51638671

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410320555.4A Pending CN104090955A (en) 2014-07-07 2014-07-07 Automatic audio/video label labeling method and system

Country Status (1)

Country Link
CN (1) CN104090955A (en)

Cited By (47)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104469543A (en) * 2014-11-07 2015-03-25 重庆晋才富熙科技有限公司 Video marking device based on voice technology
CN104462506A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for establishing knowledge graph based on user annotation information
CN104933120A (en) * 2015-06-04 2015-09-23 无锡天脉聚源传媒科技有限公司 Keyword setting method and device for video album
CN104978401A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN104978402A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN105005610A (en) * 2015-07-08 2015-10-28 无锡天脉聚源传媒科技有限公司 Album classification method and apparatus
CN105138670A (en) * 2015-09-06 2015-12-09 天翼爱音乐文化科技有限公司 Audio file label generation method and system
CN105608067A (en) * 2014-11-07 2016-05-25 华东师范大学 Automatic knowledge extraction method and apparatus for network teaching system
CN106126503A (en) * 2016-07-12 2016-11-16 海信集团有限公司 Business scope localization method and terminal
CN106326211A (en) * 2016-08-17 2017-01-11 海信集团有限公司 Determination method and device for distance between keywords in interactive statement
CN106354762A (en) * 2016-08-17 2017-01-25 海信集团有限公司 Business positioning method and device for interaction statements
CN106713083A (en) * 2016-11-24 2017-05-24 海信集团有限公司 Intelligent home appliance control method and device based on knowledge map, and system
CN107343223A (en) * 2017-07-07 2017-11-10 北京慕华信息科技有限公司 The recognition methods of video segment and device
CN107424100A (en) * 2017-07-21 2017-12-01 深圳市鹰硕技术有限公司 Information providing method and system
CN107609478A (en) * 2017-08-09 2018-01-19 广州思涵信息科技有限公司 A kind of real-time analysis of the students system and method for matching classroom knowledge content
CN108073988A (en) * 2017-06-21 2018-05-25 北京华宇元典信息服务有限公司 A kind of law cognitive approach, device and medium based on intensified learning
CN109168026A (en) * 2018-10-25 2019-01-08 北京字节跳动网络技术有限公司 Instant video display methods, device, terminal device and storage medium
CN109213777A (en) * 2017-06-29 2019-01-15 杭州九阳小家电有限公司 A kind of voice-based recipe processing method and system
CN109376269A (en) * 2018-12-05 2019-02-22 西安交通大学 One kind being based on associated across the curriculum video subgraph recommended method of map
CN109783693A (en) * 2019-01-18 2019-05-21 广东小天才科技有限公司 A kind of determination method and system of video semanteme and knowledge point
CN109977233A (en) * 2019-03-15 2019-07-05 北京金山数字娱乐科技有限公司 A kind of idiom knowledge map construction method and device
CN110110091A (en) * 2018-01-25 2019-08-09 北大方正集团有限公司 Methods of exhibiting, system, computer equipment and the storage medium of Knowledge Element map
CN110309363A (en) * 2018-03-02 2019-10-08 广州润沁教育科技有限公司 A kind of instructional video segment method of commerce of knowledge based point
CN110519654A (en) * 2019-09-11 2019-11-29 广州荔支网络技术有限公司 A kind of label determines method and device
CN110543574A (en) * 2019-08-30 2019-12-06 北京百度网讯科技有限公司 knowledge graph construction method, device, equipment and medium
CN110647613A (en) * 2018-06-26 2020-01-03 上海谦问万答吧云计算科技有限公司 Courseware construction method, courseware construction device, courseware construction server and storage medium
CN110992984A (en) * 2019-12-02 2020-04-10 新华智云科技有限公司 Audio processing method and device and storage medium
CN111078921A (en) * 2019-03-18 2020-04-28 广东小天才科技有限公司 Subject identification method and electronic equipment
CN111225235A (en) * 2020-01-16 2020-06-02 北京合众美华教育投资有限公司 Method for playing network teaching video
CN111261167A (en) * 2020-01-16 2020-06-09 广州荔支网络技术有限公司 Automatic tag generation method for audio hot content
CN111510765A (en) * 2020-04-30 2020-08-07 浙江蓝鸽科技有限公司 Audio label intelligent labeling method and device based on teaching video
CN111831854A (en) * 2020-06-03 2020-10-27 北京百度网讯科技有限公司 Video tag generation method and device, electronic equipment and storage medium
CN111930792A (en) * 2020-06-23 2020-11-13 北京大米科技有限公司 Data resource labeling method and device, storage medium and electronic equipment
CN112052304A (en) * 2020-08-18 2020-12-08 中国建设银行股份有限公司 Course label determining method and device and electronic equipment
WO2020244073A1 (en) * 2019-06-06 2020-12-10 平安科技(深圳)有限公司 Speech-based user classification method and device, computer apparatus, and storage medium
CN112132147A (en) * 2020-08-14 2020-12-25 浙江大学 Learning method based on quality node model
CN112418088A (en) * 2020-11-23 2021-02-26 华中师范大学 Video learning resource extraction and knowledge annotation method and system based on crowd-sourcing
CN112487248A (en) * 2020-12-01 2021-03-12 深圳市易平方网络科技有限公司 Video file label generation method and device, intelligent terminal and storage medium
CN112543371A (en) * 2020-11-27 2021-03-23 中国建设银行股份有限公司 Video data processing method, device, terminal and storage medium
CN113129924A (en) * 2021-03-30 2021-07-16 北京泽桥传媒科技股份有限公司 Automatic audio and video content tag extraction method based on computer vision
CN113254814A (en) * 2021-05-12 2021-08-13 平安国际智慧城市科技股份有限公司 Network course video labeling method and device, electronic equipment and medium
CN110245259B (en) * 2019-05-21 2021-09-21 北京百度网讯科技有限公司 Video labeling method and device based on knowledge graph and computer readable medium
CN113537215A (en) * 2021-07-19 2021-10-22 山东福来克思智能科技有限公司 Method and device for labeling video label
CN114218437A (en) * 2021-12-20 2022-03-22 天翼爱音乐文化科技有限公司 Adaptive picture clipping and fusing method, system, computer device and medium
CN116860979A (en) * 2023-09-04 2023-10-10 上海柯林布瑞信息技术有限公司 Medical text labeling method and device based on label knowledge base
CN116993549A (en) * 2023-09-18 2023-11-03 西北师范大学 Review resource recommendation method for online learning system
CN117672027A (en) * 2024-02-01 2024-03-08 青岛培诺教育科技股份有限公司 VR teaching method, device, equipment and medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101655879A (en) * 2008-08-24 2010-02-24 吴凯凯 Voice record for experiment and used system and method
CN102074235A (en) * 2010-12-20 2011-05-25 上海华勤通讯技术有限公司 Method of video speech recognition and search
CN102207945A (en) * 2010-05-11 2011-10-05 天津海量信息技术有限公司 Knowledge network-based text indexing system and method
US20130212113A1 (en) * 2006-09-22 2013-08-15 Limelight Networks, Inc. Methods and systems for generating automated tags for video files
CN103377185A (en) * 2012-04-24 2013-10-30 腾讯科技(深圳)有限公司 Method and device for adding tags to short texts automatically

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130212113A1 (en) * 2006-09-22 2013-08-15 Limelight Networks, Inc. Methods and systems for generating automated tags for video files
CN101655879A (en) * 2008-08-24 2010-02-24 吴凯凯 Voice record for experiment and used system and method
CN102207945A (en) * 2010-05-11 2011-10-05 天津海量信息技术有限公司 Knowledge network-based text indexing system and method
CN102074235A (en) * 2010-12-20 2011-05-25 上海华勤通讯技术有限公司 Method of video speech recognition and search
CN103377185A (en) * 2012-04-24 2013-10-30 腾讯科技(深圳)有限公司 Method and device for adding tags to short texts automatically

Cited By (66)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105608067B (en) * 2014-11-07 2020-05-01 华东师范大学 Automatic knowledge extraction method and device for network teaching system
CN105608067A (en) * 2014-11-07 2016-05-25 华东师范大学 Automatic knowledge extraction method and apparatus for network teaching system
CN104469543A (en) * 2014-11-07 2015-03-25 重庆晋才富熙科技有限公司 Video marking device based on voice technology
CN104462506A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for establishing knowledge graph based on user annotation information
CN104978401B (en) * 2015-06-04 2019-07-02 无锡天脉聚源传媒科技有限公司 A kind of the keyword setting method and device of video album
CN104933120A (en) * 2015-06-04 2015-09-23 无锡天脉聚源传媒科技有限公司 Keyword setting method and device for video album
CN104978401A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN104978402A (en) * 2015-06-04 2015-10-14 无锡天脉聚源传媒科技有限公司 Keyword setting method and apparatus of video album
CN105005610A (en) * 2015-07-08 2015-10-28 无锡天脉聚源传媒科技有限公司 Album classification method and apparatus
CN105005610B (en) * 2015-07-08 2019-09-13 无锡天脉聚源传媒科技有限公司 A kind of album classification method and device
CN105138670B (en) * 2015-09-06 2018-12-14 天翼爱音乐文化科技有限公司 Audio file label generating method and system
CN105138670A (en) * 2015-09-06 2015-12-09 天翼爱音乐文化科技有限公司 Audio file label generation method and system
CN106126503A (en) * 2016-07-12 2016-11-16 海信集团有限公司 Business scope localization method and terminal
CN106126503B (en) * 2016-07-12 2020-02-11 海信集团有限公司 Service field positioning method and terminal
CN106326211A (en) * 2016-08-17 2017-01-11 海信集团有限公司 Determination method and device for distance between keywords in interactive statement
CN106354762A (en) * 2016-08-17 2017-01-25 海信集团有限公司 Business positioning method and device for interaction statements
CN106354762B (en) * 2016-08-17 2020-03-20 海信集团有限公司 Service positioning method and device for interactive statements
CN106713083A (en) * 2016-11-24 2017-05-24 海信集团有限公司 Intelligent home appliance control method and device based on knowledge map, and system
CN108073988A (en) * 2017-06-21 2018-05-25 北京华宇元典信息服务有限公司 A kind of law cognitive approach, device and medium based on intensified learning
CN108073988B (en) * 2017-06-21 2021-09-03 北京华宇元典信息服务有限公司 Law cognition method, device and medium based on reinforcement learning
CN109213777A (en) * 2017-06-29 2019-01-15 杭州九阳小家电有限公司 A kind of voice-based recipe processing method and system
CN107343223A (en) * 2017-07-07 2017-11-10 北京慕华信息科技有限公司 The recognition methods of video segment and device
CN107343223B (en) * 2017-07-07 2019-10-11 北京慕华信息科技有限公司 The recognition methods of video clip and device
CN107424100B (en) * 2017-07-21 2021-01-01 深圳市鹰硕技术有限公司 Information providing method and system
CN107424100A (en) * 2017-07-21 2017-12-01 深圳市鹰硕技术有限公司 Information providing method and system
CN107609478A (en) * 2017-08-09 2018-01-19 广州思涵信息科技有限公司 A kind of real-time analysis of the students system and method for matching classroom knowledge content
CN110110091A (en) * 2018-01-25 2019-08-09 北大方正集团有限公司 Methods of exhibiting, system, computer equipment and the storage medium of Knowledge Element map
CN110309363A (en) * 2018-03-02 2019-10-08 广州润沁教育科技有限公司 A kind of instructional video segment method of commerce of knowledge based point
CN110647613A (en) * 2018-06-26 2020-01-03 上海谦问万答吧云计算科技有限公司 Courseware construction method, courseware construction device, courseware construction server and storage medium
CN109168026A (en) * 2018-10-25 2019-01-08 北京字节跳动网络技术有限公司 Instant video display methods, device, terminal device and storage medium
CN109376269B (en) * 2018-12-05 2021-01-19 西安交通大学 Cross-course video subgraph recommendation method based on map association
CN109376269A (en) * 2018-12-05 2019-02-22 西安交通大学 One kind being based on associated across the curriculum video subgraph recommended method of map
CN109783693B (en) * 2019-01-18 2021-05-18 广东小天才科技有限公司 Method and system for determining video semantics and knowledge points
CN109783693A (en) * 2019-01-18 2019-05-21 广东小天才科技有限公司 A kind of determination method and system of video semanteme and knowledge point
CN109977233A (en) * 2019-03-15 2019-07-05 北京金山数字娱乐科技有限公司 A kind of idiom knowledge map construction method and device
CN111078921A (en) * 2019-03-18 2020-04-28 广东小天才科技有限公司 Subject identification method and electronic equipment
CN110245259B (en) * 2019-05-21 2021-09-21 北京百度网讯科技有限公司 Video labeling method and device based on knowledge graph and computer readable medium
WO2020244073A1 (en) * 2019-06-06 2020-12-10 平安科技(深圳)有限公司 Speech-based user classification method and device, computer apparatus, and storage medium
CN110543574B (en) * 2019-08-30 2022-05-17 北京百度网讯科技有限公司 Knowledge graph construction method, device, equipment and medium
CN110543574A (en) * 2019-08-30 2019-12-06 北京百度网讯科技有限公司 knowledge graph construction method, device, equipment and medium
CN110519654A (en) * 2019-09-11 2019-11-29 广州荔支网络技术有限公司 A kind of label determines method and device
CN110519654B (en) * 2019-09-11 2021-07-27 广州荔支网络技术有限公司 Label determining method and device, electronic equipment and storage medium
CN110992984A (en) * 2019-12-02 2020-04-10 新华智云科技有限公司 Audio processing method and device and storage medium
CN110992984B (en) * 2019-12-02 2022-12-06 新华智云科技有限公司 Audio processing method and device and storage medium
CN111225235A (en) * 2020-01-16 2020-06-02 北京合众美华教育投资有限公司 Method for playing network teaching video
CN111261167A (en) * 2020-01-16 2020-06-09 广州荔支网络技术有限公司 Automatic tag generation method for audio hot content
CN111510765A (en) * 2020-04-30 2020-08-07 浙江蓝鸽科技有限公司 Audio label intelligent labeling method and device based on teaching video
CN111831854A (en) * 2020-06-03 2020-10-27 北京百度网讯科技有限公司 Video tag generation method and device, electronic equipment and storage medium
CN111930792B (en) * 2020-06-23 2024-04-12 北京大米科技有限公司 Labeling method and device for data resources, storage medium and electronic equipment
CN111930792A (en) * 2020-06-23 2020-11-13 北京大米科技有限公司 Data resource labeling method and device, storage medium and electronic equipment
CN112132147A (en) * 2020-08-14 2020-12-25 浙江大学 Learning method based on quality node model
CN112132147B (en) * 2020-08-14 2022-04-19 浙江大学 Learning method based on quality node model
CN112052304A (en) * 2020-08-18 2020-12-08 中国建设银行股份有限公司 Course label determining method and device and electronic equipment
CN112418088A (en) * 2020-11-23 2021-02-26 华中师范大学 Video learning resource extraction and knowledge annotation method and system based on crowd-sourcing
CN112418088B (en) * 2020-11-23 2022-04-29 华中师范大学 Video learning resource extraction and knowledge annotation method and system based on crowd-sourcing
CN112543371A (en) * 2020-11-27 2021-03-23 中国建设银行股份有限公司 Video data processing method, device, terminal and storage medium
CN112487248A (en) * 2020-12-01 2021-03-12 深圳市易平方网络科技有限公司 Video file label generation method and device, intelligent terminal and storage medium
CN113129924A (en) * 2021-03-30 2021-07-16 北京泽桥传媒科技股份有限公司 Automatic audio and video content tag extraction method based on computer vision
CN113254814A (en) * 2021-05-12 2021-08-13 平安国际智慧城市科技股份有限公司 Network course video labeling method and device, electronic equipment and medium
CN113537215A (en) * 2021-07-19 2021-10-22 山东福来克思智能科技有限公司 Method and device for labeling video label
CN114218437A (en) * 2021-12-20 2022-03-22 天翼爱音乐文化科技有限公司 Adaptive picture clipping and fusing method, system, computer device and medium
CN116860979B (en) * 2023-09-04 2023-12-08 上海柯林布瑞信息技术有限公司 Medical text labeling method and device based on label knowledge base
CN116860979A (en) * 2023-09-04 2023-10-10 上海柯林布瑞信息技术有限公司 Medical text labeling method and device based on label knowledge base
CN116993549A (en) * 2023-09-18 2023-11-03 西北师范大学 Review resource recommendation method for online learning system
CN117672027A (en) * 2024-02-01 2024-03-08 青岛培诺教育科技股份有限公司 VR teaching method, device, equipment and medium
CN117672027B (en) * 2024-02-01 2024-04-30 青岛培诺教育科技股份有限公司 VR teaching method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN104090955A (en) Automatic audio/video label labeling method and system
US20210004405A1 (en) Enhancing tangible content on physical activity surface
CN107633005B (en) Knowledge graph construction and comparison system and method based on classroom teaching content
CN107545791B (en) System and method for automatically generating classroom teaching knowledge map by courseware
US20170193393A1 (en) Automated Knowledge Graph Creation
US20180366013A1 (en) System and method for providing an interactive visual learning environment for creation, presentation, sharing, organizing and analysis of knowledge on subject matter
JP6361351B2 (en) Method, program and computing system for ranking spoken words
EP3842963A1 (en) Method and device for acquiring poi state information, apparatus, and computer storage medium
US11762926B2 (en) Recommending web API's and associated endpoints
US20160133148A1 (en) Intelligent content analysis and creation
CN113065003B (en) Knowledge graph generation method based on multiple indexes
CN110287405A (en) The method, apparatus and storage medium of sentiment analysis
CN109710931A (en) A kind of class teaching content evaluation system and its method based on Intellectual Analysis Technology
CN109033060A (en) A kind of information alignment schemes, device, equipment and readable storage medium storing program for executing
US11532333B1 (en) Smart summarization, indexing, and post-processing for recorded document presentation
CN109086255A (en) A kind of bibliography automatic marking method and system based on deep learning
CN113342968A (en) Text abstract extraction method and device
CN116821377A (en) Primary school Chinese automatic evaluation system based on knowledge graph and large model
CN111199151A (en) Data processing method and data processing device
CN105608067B (en) Automatic knowledge extraction method and device for network teaching system
CN111930976B (en) Presentation generation method, device, equipment and storage medium
CN109657052A (en) A kind of abstract of a thesis contains the abstracting method and device of fine granularity Knowledge Element
CN105718441B (en) The method and apparatus of the similar UI components of function between a kind of lookup different platform
WO2020211397A1 (en) Courseware page display method and apparatus, page set construction method and apparatus, device, and medium
US11929100B2 (en) Video generation method, apparatus, electronic device, storage medium and program product

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20141008