CN102752540B - Automatic cataloging method based on face recognition technology - Google Patents
- Publication number
- CN102752540B (application CN201110453762.3A; also published as CN102752540A)
- Authority
- CN
- China
- Prior art keywords
- face
- key frame
- face material
- information
- picture
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Landscapes
- Processing Or Creating Images (AREA)
Abstract
The invention discloses an automatic cataloging method based on face recognition technology, which specifically includes: receiving a face material library; receiving a multimedia file; obtaining key frame records and the corresponding key frame data pictures from the video file; obtaining key frame face pictures from the key frame data pictures; querying the face picture information of the face material library with the key frame face pictures to obtain the matching face material text information; performing speech recognition on the audio file according to the key frame records to obtain key frame cataloging text; and merging the face material text information into the key frame cataloging text according to the key frame records to obtain a catalog file. The invention solves the problem that catalog files cannot be generated and edited from video files, improves the precision and flexibility of catalog file generation and processing, saves system cost, reduces the error rate, and has wider applicability.
Description
Technical field
The present invention relates to the field of material data editing and processing in broadcast systems, with emphasis on radio and television systems and on applications in the digital audio-video industry, and in particular to an automatic cataloging method based on face recognition technology.
Background art
With the development and spread of television production technology, the multimedia material collected during program production is generally pre-processed, and the speech in it is recognized to obtain the corresponding catalog information. This matters all the more as sports, news, interview, and variety programs account for a growing share of output. Cataloging programs manually is time-consuming and laborious. At the same time, such programs build their pictures around key persons, for example sports stars, state leaders, hosts, and male and female anchors, and this set of people is relatively fixed. Having a computer automatically analyze the inherent biometric information of faces and use it as the primary catalog information of the video would save a large amount of manual cataloging work. In the prior art, this personal information cannot be obtained directly from the audio file and has to be obtained by other means. The prior art generally relies on manual identification of the video content: an operator inserts name information into the catalog file according to the broadcast picture. When a large amount of manual identification is needed, generating and processing catalog information from pictures of people requires substantial manpower and material resources, and, because people are involved, human factors also affect the production quality and efficiency of the cataloged material.
In the course of making the present invention, the inventor found the following defect in the prior art: when person information is to be added while editing a catalog file, the person information has to be identified manually from the different pictures of people, and the corresponding catalog file is then edited. The production quality of the catalog file and the efficiency of the work therefore both depend on manual operation, which is time-consuming and laborious, consumes a large amount of system resources, and does not yield a good catalog file.
Summary of the invention
To address the defects in the prior art, the present invention solves the problem that catalog files cannot be generated and edited from video files.
To solve the above technical problem, the invention provides an automatic cataloging method based on face recognition technology, which specifically includes:
Receiving a face material library, the face material library specifically including: face picture information and face material text information;
Receiving a multimedia file, the multimedia file including: a video file and an audio file;
Obtaining key frame records and the corresponding key frame data pictures from the video file;
Obtaining key frame face pictures from the key frame data pictures;
Querying the face picture information of the face material library with the key frame face pictures to obtain the matching face material text information;
Performing speech recognition on the audio file according to the key frame records to obtain key frame cataloging text;
Merging the face material text information into the key frame cataloging text according to the key frame records to obtain a catalog file.
Wherein, before the step of receiving the face material library, the method further includes: establishing the face material library.
Wherein, the step of establishing the face material library specifically includes: receiving face materials, each face material being identified by a face material keyword, a single face material including: multi-angle material, emotion-expression material, and speaking-expression material; and establishing the face material library from the face material keywords and the corresponding face materials.
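The library-building step described above can be sketched as a keyword-indexed collection. This is a minimal illustration only, not the patented implementation: the class `FaceMaterial`, the function `build_face_library`, the kind labels, and the file names are all hypothetical names introduced here.

```python
from collections import defaultdict
from dataclasses import dataclass

# The three kinds of material named in the text; the labels are assumptions.
KINDS = {"multi_angle", "emotion_expression", "speaking_expression"}


@dataclass(frozen=True)
class FaceMaterial:
    keyword: str     # face material keyword, e.g. the person's name
    kind: str        # one of KINDS
    image_path: str  # hypothetical location of the stored picture


def build_face_library(materials):
    """Group the received face materials under their identifying keyword."""
    library = defaultdict(list)
    for material in materials:
        if material.kind not in KINDS:
            raise ValueError(f"unknown material kind: {material.kind}")
        library[material.keyword].append(material)
    return dict(library)


library = build_face_library([
    FaceMaterial("Host A", "multi_angle", "host_a_left.png"),
    FaceMaterial("Host A", "speaking_expression", "host_a_talking.png"),
    FaceMaterial("Anchor B", "emotion_expression", "anchor_b_smiling.png"),
])
```

Indexing by keyword keeps all angle and expression variants of one person together, so a later query can return one piece of text information for any of them.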
Wherein, the step of establishing the face material library specifically includes: receiving a three-dimensional face material model, the three-dimensional face material model including: face control point model information and the corresponding three-dimensional face material model text information; and establishing the face material library from the three-dimensional face material model.
Wherein, the face picture information further includes a brightness information attribute.
Wherein, the step of obtaining key frame face pictures from the key frame data pictures specifically includes:
Obtaining shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information from the key frame data picture; performing face extraction on the key frame data picture to obtain the key frame face picture; and obtaining key frame face picture information from the shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information.
Wherein, the step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes: querying the face picture information of the face material library with both the key frame face picture and the key frame face picture information to obtain the matching face material text information.
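One way to read the query step above is that the key frame face picture information (shooting angle, brightness) narrows the candidates before the pictures themselves are compared. The sketch below illustrates that reading under explicit assumptions: faces are represented by a hypothetical numeric feature vector, and `LibraryFace`, `query_name`, and all thresholds are invented for illustration; the source does not specify a matching algorithm.

```python
import math
from dataclasses import dataclass


@dataclass
class LibraryFace:
    name: str          # face material text information (a name)
    angle_deg: float   # shooting angle of the stored picture
    brightness: float  # brightness attribute, assumed to lie in 0..1
    features: tuple    # hypothetical feature vector of the face picture


def query_name(library, features, angle_deg, brightness,
               max_angle_gap=30.0, max_brightness_gap=0.3, max_distance=0.5):
    """Return the name of the closest comparable library face, or None."""
    best_name, best_distance = None, max_distance
    for face in library:
        # Skip stored pictures taken under clearly different conditions.
        if abs(face.angle_deg - angle_deg) > max_angle_gap:
            continue
        if abs(face.brightness - brightness) > max_brightness_gap:
            continue
        distance = math.dist(face.features, features)
        if distance <= best_distance:
            best_name, best_distance = face.name, distance
    return best_name


faces = [
    LibraryFace("Host A", 0.0, 0.6, (0.1, 0.9)),
    LibraryFace("Host A", 45.0, 0.6, (0.4, 0.7)),
    LibraryFace("Anchor B", 0.0, 0.5, (0.8, 0.2)),
]
match = query_name(faces, (0.12, 0.88), angle_deg=5.0, brightness=0.55)
no_match = query_name(faces, (0.5, 0.5), angle_deg=90.0, brightness=0.5)
```

Here `match` is `"Host A"` and `no_match` is `None`, because no stored picture was taken within 30 degrees of the second probe's angle.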
Wherein, the face material text information specifically includes: name information.
Wherein, the step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes: obtaining face control point model information from the key frame face picture; and querying the three-dimensional face material models of the face material library with the face control point model information to obtain the text information of the matching three-dimensional face material model.
Wherein, the face control point model information specifically includes: face boundary control point model information and facial feature control point model information.
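Matching by control points can be illustrated with a simple comparison of point sets. The following is a sketch under stated assumptions, not the method of the source: control points are taken to be corresponding 3D coordinates, they are normalized for position and scale only (no rotation alignment), and `normalize`, `model_distance`, and `match_model` are hypothetical helper names.

```python
import math


def normalize(points):
    """Center the control points and scale them to unit RMS radius."""
    count = len(points)
    centroid = [sum(p[i] for p in points) / count for i in range(3)]
    centered = [[p[i] - centroid[i] for i in range(3)] for p in points]
    scale = math.sqrt(sum(c * c for p in centered for c in p) / count)
    if scale == 0:
        raise ValueError("degenerate control point set")
    return [[c / scale for c in p] for p in centered]


def model_distance(a, b):
    """Mean distance between corresponding normalized control points."""
    if len(a) != len(b):
        raise ValueError("control point sets must have the same length")
    na, nb = normalize(a), normalize(b)
    return sum(math.dist(p, q) for p, q in zip(na, nb)) / len(na)


def match_model(library, boundary, features):
    """library maps model text information to (boundary, feature) point lists."""
    probe = boundary + features
    return min(
        library,
        key=lambda name: model_distance(library[name][0] + library[name][1], probe),
    )


models = {
    "Host A": ([(0, 0, 0), (4, 0, 0), (4, 6, 0), (0, 6, 0)],
               [(1, 4, 1), (3, 4, 1), (2, 3, 2), (2, 1, 1)]),
    "Anchor B": ([(0, 0, 0), (6, 0, 0), (6, 5, 0), (0, 5, 0)],
                 [(2, 3, 1), (4, 3, 1), (3, 2, 3), (3, 1, 1)]),
}
# A probe that is "Host A" at twice the size and shifted in space.
probe_boundary = [(2 * x + 10, 2 * y + 10, 2 * z + 10) for x, y, z in models["Host A"][0]]
probe_features = [(2 * x + 10, 2 * y + 10, 2 * z + 10) for x, y, z in models["Host A"][1]]
matched = match_model(models, probe_boundary, probe_features)
```

Boundary points and facial feature points are concatenated so that both the outline and the inner features contribute to the score. A practical system would also need to align rotation and reject poor matches with a threshold.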
Wherein, the step of obtaining key frame records and the corresponding key frame data pictures from the video file specifically includes: receiving shooting brightness information; adjusting the video file according to the shooting brightness information; and obtaining the key frame records and the corresponding key frame data pictures from the adjusted video file.
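The sequence of receiving the shooting brightness (luminance) information, adjusting the video, and then extracting key frames can be sketched as follows. The source does not say how key frames are chosen; this example assumes a simple frame-difference rule, represents each frame as a flat list of grayscale values, and uses hypothetical names (`adjust_brightness`, `extract_key_frames`) and an arbitrary threshold.

```python
def adjust_brightness(frame, shooting_brightness, target=128.0):
    """Scale pixel values so the reported shooting brightness maps to target."""
    if shooting_brightness <= 0:
        raise ValueError("shooting brightness must be positive")
    gain = target / shooting_brightness
    return [min(255.0, pixel * gain) for pixel in frame]


def mean_abs_diff(a, b):
    """Average absolute pixel difference between two frames of equal size."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def extract_key_frames(frames, fps, shooting_brightness, threshold=30.0):
    """Return (key frame records, key frame data pictures)."""
    records, pictures = [], []
    last_key = None
    for index, frame in enumerate(frames):
        adjusted = adjust_brightness(frame, shooting_brightness)
        # A frame becomes a key frame when it differs enough from the last one.
        if last_key is None or mean_abs_diff(adjusted, last_key) > threshold:
            records.append({"frame": index, "time_s": index / fps})
            pictures.append(adjusted)
            last_key = adjusted
    return records, pictures


frames = [[10] * 4, [12] * 4, [100] * 4, [101] * 4]
records, pictures = extract_key_frames(frames, fps=2, shooting_brightness=64.0)
```

With these inputs the gain is 2, so frames 0 and 2 are selected and `records` holds their indices and timestamps (0.0 s and 1.0 s). Adjusting before differencing keeps the threshold meaningful across footage shot at different exposure levels.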
Wherein, after the catalog file is obtained, the method further includes: obtaining a subtitle file from the catalog file; and a playout control system broadcasting according to the subtitle file.
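Deriving a subtitle file from the catalog file can be illustrated with the widely used SubRip (SRT) layout. The source does not name a subtitle format, so SRT is an assumption here, as are the catalog entry fields (`start_s`, `end_s`, `name`, `text`) and the function names.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as the SRT timestamp HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def catalog_to_srt(entries):
    """Render catalog entries as SRT cues, prefixing the recognized name."""
    cues = []
    for number, entry in enumerate(entries, start=1):
        speaker = f"{entry['name']}: " if entry.get("name") else ""
        cues.append(
            f"{number}\n"
            f"{srt_timestamp(entry['start_s'])} --> {srt_timestamp(entry['end_s'])}\n"
            f"{speaker}{entry['text']}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(cues)


srt_text = catalog_to_srt([
    {"start_s": 0, "end_s": 2.5, "name": "Host A", "text": "Good evening."},
    {"start_s": 2.5, "end_s": 5, "name": None, "text": "Tonight's headlines."},
])
```

Because the catalog file already ties names and recognized text to key frame times, the subtitle file is a straightforward re-rendering of the same entries.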
Compared with the prior art, the embodiments of the present invention have the following advantages. The audio and video content of the multimedia material is separated. On the one hand, key frame pictures are captured from the video file, face images are extracted from the key frame pictures and matched against the face pictures in the previously established face library, and the person information corresponding to each face is thereby obtained. On the other hand, the corresponding speech is recognized to obtain the corresponding text information. The person information obtained by face recognition and the text information are then merged according to the keyword information, so that the catalog file is generated automatically. The present invention therefore no longer requires manual participation, which improves the efficiency of cataloging, synthesizing, and processing multimedia program material. Having a computer automatically analyze the inherent biometric information of faces as the primary catalog information of the video saves a large amount of manual cataloging work. The invention improves the precision and flexibility of catalog file generation and processing, saves system cost, reduces the error rate, and has wider applicability.
Brief description of the drawings
To describe the embodiments of the present invention or the technical solutions of the prior art more clearly, the accompanying drawings needed for describing the embodiments or the prior art are briefly introduced below. The drawings described below are evidently only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic diagram of an automatic cataloging method based on face recognition technology in Embodiment 1 of the present invention;
Fig. 2 is a schematic diagram of another automatic cataloging method based on face recognition technology in Embodiment 2 of the present invention.
Detailed description of the embodiments
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are evidently only some, rather than all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art from the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
Embodiment 1 of the present invention provides an automatic cataloging method based on face recognition technology. As shown in Fig. 1, it includes the following steps:
S101: Receive the face material library;
This step specifically includes: receiving a face material library, the face material library specifically including: face picture information and face material text information;
S102: Receive the multimedia file;
This step specifically includes: receiving a multimedia file, the multimedia file including: a video file and an audio file;
S103: Obtain the key frame records and the corresponding key frame data pictures;
This step specifically includes: obtaining key frame records and the corresponding key frame data pictures from the video file;
S104: Obtain the key frame face pictures;
This step specifically includes: obtaining key frame face pictures from the key frame data pictures;
S105: Obtain the matching face material text information;
This step specifically includes: querying the face picture information of the face material library with the key frame face pictures to obtain the matching face material text information;
S106: Obtain the key frame cataloging text;
This step specifically includes: performing speech recognition on the audio file according to the key frame records to obtain the key frame cataloging text;
S107: Merge the face material text information to obtain the catalog file;
This step specifically includes: merging the face material text information into the key frame cataloging text according to the key frame records to obtain the catalog file.
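The final merge step (S107) can be sketched as a join on the key frame record. This is an illustrative assumption about the data layout, not the source's file format: key frame records are dictionaries with a frame index and a time, the outputs of face matching (S105) and speech recognition (S106) are dictionaries keyed by that frame index, and `merge_catalog` is a hypothetical name.

```python
def merge_catalog(key_frame_records, cataloging_text, face_names):
    """Join recognized text and matched names on the key frame record.

    cataloging_text: frame index -> recognized speech text (S106 output)
    face_names:      frame index -> list of matched names (S105 output)
    """
    catalog = []
    for record in key_frame_records:
        frame = record["frame"]
        catalog.append({
            "frame": frame,
            "time_s": record["time_s"],
            # Deduplicate names in case one person is matched more than once.
            "people": sorted(set(face_names.get(frame, []))),
            "text": cataloging_text.get(frame, ""),
        })
    return catalog


key_frame_records = [{"frame": 0, "time_s": 0.0}, {"frame": 50, "time_s": 2.0}]
cataloging_text = {0: "Good evening.", 50: "Now to sport."}
face_names = {0: ["Host A", "Host A"], 50: ["Anchor B", "Host A"]}
catalog = merge_catalog(key_frame_records, cataloging_text, face_names)
```

Using the key frame record as the join key is what lets the two independent branches, video and audio, be processed separately and still line up in the catalog file.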
Embodiment 2 of the present invention provides another automatic cataloging method based on face recognition technology. As shown in Fig. 2, it includes the following steps:
S201: Establish the face material library;
This step is performed before the step of receiving the face material library;
The step of establishing the face material library specifically includes: receiving face materials, each face material being identified by a face material keyword, a single face material including: multi-angle material, emotion-expression material, and speaking-expression material; and establishing the face material library from the face material keywords and the corresponding face materials;
The step of establishing the face material library specifically includes: receiving a three-dimensional face material model, the three-dimensional face material model including: face control point model information and the corresponding three-dimensional face material model text information; and establishing the face material library from the three-dimensional face material model;
S202: Receive the face material library;
This step specifically includes: receiving a face material library, the face material library specifically including: face picture information and face material text information;
The face material text information specifically includes: name information;
S203: Receive the multimedia file;
This step specifically includes: receiving a multimedia file, the multimedia file including: a video file and an audio file;
S204: Obtain the key frame records and the corresponding key frame data pictures;
This step specifically includes: obtaining key frame records and the corresponding key frame data pictures from the video file;
The step of obtaining key frame records and the corresponding key frame data pictures from the video file specifically includes: receiving shooting brightness information; adjusting the video file according to the shooting brightness information;
and obtaining the key frame records and the corresponding key frame data pictures from the adjusted video file;
S205: Obtain the key frame face pictures;
This step specifically includes: obtaining key frame face pictures from the key frame data pictures;
The face picture information further includes a brightness information attribute;
The step of obtaining key frame face pictures from the key frame data pictures specifically includes:
obtaining shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information from the key frame data picture;
performing face extraction on the key frame data picture to obtain the key frame face picture;
and obtaining key frame face picture information from the shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information;
S206: Obtain the matching face material text information;
This step specifically includes: querying the face picture information of the face material library with the key frame face pictures to obtain the matching face material text information;
The step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes:
querying the face picture information of the face material library with both the key frame face picture and the key frame face picture information to obtain the matching face material text information;
The step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes:
obtaining face control point model information from the key frame face picture;
and querying the three-dimensional face material models of the face material library with the face control point model information to obtain the text information of the matching three-dimensional face material model;
The face control point model information specifically includes: face boundary control point model information and facial feature control point model information;
S207: Obtain the key frame cataloging text;
This step specifically includes: performing speech recognition on the audio file according to the key frame records to obtain the key frame cataloging text;
S208: Merge the face material text information to obtain the catalog file;
This step specifically includes: merging the face material text information into the key frame cataloging text according to the key frame records to obtain the catalog file;
S209: Obtain the subtitle file and broadcast;
After the catalog file is obtained, the method further includes: obtaining a subtitle file from the catalog file; and a playout control system broadcasting according to the subtitle file.
From the above description of the embodiments, a person skilled in the art can clearly understand that the present invention may be implemented in hardware, or in software together with a necessary general-purpose hardware platform. On this understanding, the technical solution of the present invention may be embodied in the form of a software product. The software product may be stored in a non-volatile storage medium (such as a CD-ROM, a USB flash drive, or a removable hard disk) and includes a number of instructions that cause a computer device (such as a personal computer, a server, or a network device) to perform the methods described in the embodiments of the present invention.
A person skilled in the art will appreciate that the accompanying drawings are schematic diagrams of a preferred embodiment, and that the modules or flows in the drawings are not necessarily required for implementing the present invention.
A person skilled in the art will appreciate that the modules of the device in an embodiment may be distributed in the device as described in the embodiment, or may be changed accordingly and located in one or more devices different from that of the embodiment. The modules of the above embodiments may be combined into one module or further split into multiple submodules.
The numbering of the above embodiments is for description only and does not indicate the relative merits of the embodiments.
The above discloses only several specific embodiments of the present invention. The present invention is not limited to them, and any change that a person skilled in the art can conceive of shall fall within the protection scope of the present invention.
Claims (12)
- 1. An automatic cataloging method based on face recognition technology, characterized by comprising: receiving a face material library, the face material library specifically including: face picture information and face material text information; receiving a multimedia file, the multimedia file including: a video file and an audio file; obtaining key frame records and the corresponding key frame data pictures from the video file; obtaining key frame face pictures from the key frame data pictures; querying the face picture information of the face material library with the key frame face pictures to obtain the matching face material text information; performing speech recognition on the audio file according to the key frame records to obtain key frame cataloging text; and merging the face material text information into the key frame cataloging text according to the key frame records to obtain a catalog file.
- 2. The method of claim 1, characterized in that, before the step of receiving the face material library, the method further includes: establishing the face material library.
- 3. The method of claim 2, characterized in that the step of establishing the face material library specifically includes: receiving face materials, each face material being identified by a face material keyword, a single face material including: multi-angle material, emotion-expression material, and speaking-expression material; and establishing the face material library from the face material keywords and the corresponding face materials.
- 4. The method of claim 2, characterized in that the step of establishing the face material library specifically includes: receiving a three-dimensional face material model, the three-dimensional face material model including: face control point model information and the corresponding three-dimensional face material model text information; and establishing the face material library from the three-dimensional face material model.
- 5. The method of claim 1, characterized in that the face picture information further includes a brightness information attribute.
- 6. The method of claim 1 or 5, characterized in that the step of obtaining key frame face pictures from the key frame data pictures specifically includes: obtaining shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information from the key frame data picture; performing face extraction on the key frame data picture to obtain the key frame face picture; and obtaining key frame face picture information from the shooting angle information, shooting brightness information, emotion-expression material information and/or speaking-expression material information.
- 7. The method of claim 6, characterized in that the step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes: querying the face picture information of the face material library with both the key frame face picture and the key frame face picture information to obtain the matching face material text information.
- 8. The method of claim 1, characterized in that the face material text information specifically includes: name information.
- 9. The method of claim 4, characterized in that the step of querying the face picture information of the face material library with the key frame face picture to obtain the matching face material text information specifically includes: obtaining face control point model information from the key frame face picture; and querying the three-dimensional face material models of the face material library with the face control point model information to obtain the text information of the matching three-dimensional face material model.
- 10. The method of claim 9, characterized in that the face control point model information specifically includes: face boundary control point model information and facial feature control point model information.
- 11. The method of claim 1, characterized in that the step of obtaining key frame records and the corresponding key frame data pictures from the video file specifically includes: receiving shooting brightness information; adjusting the video file according to the shooting brightness information; and obtaining the key frame records and the corresponding key frame data pictures from the adjusted video file.
- 12. The method of claim 1, characterized in that, after the catalog file is obtained, the method further includes: obtaining a subtitle file from the catalog file; and a playout control system broadcasting according to the subtitle file.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110453762.3A CN102752540B (en) | 2011-12-30 | 2011-12-30 | A kind of automated cataloging method based on face recognition technology |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110453762.3A CN102752540B (en) | 2011-12-30 | 2011-12-30 | A kind of automated cataloging method based on face recognition technology |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102752540A CN102752540A (en) | 2012-10-24 |
CN102752540B (en) | 2017-12-29 |
Family
ID=47032421
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110453762.3A Active CN102752540B (en) | 2011-12-30 | 2011-12-30 | A kind of automated cataloging method based on face recognition technology |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN102752540B (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103488764B (en) * | 2013-09-26 | 2016-08-17 | 天脉聚源(北京)传媒科技有限公司 | Individualized video content recommendation method and system |
CN103530652B (en) * | 2013-10-23 | 2016-09-14 | 北京中视广信科技有限公司 | A kind of video categorization based on face cluster, search method and system thereof |
CN104618803B (en) * | 2014-02-26 | 2018-05-08 | 腾讯科技(深圳)有限公司 | Information-pushing method, device, terminal and server |
CN103870559A (en) * | 2014-03-06 | 2014-06-18 | 海信集团有限公司 | Method and equipment for obtaining information based on played video |
CN105447846B (en) * | 2014-08-25 | 2020-06-23 | 联想(北京)有限公司 | Image processing method and electronic equipment |
CN104410882A (en) * | 2014-11-28 | 2015-03-11 | 苏州福丰科技有限公司 | Smart television with three-dimensional face scanning function |
CN105512348B (en) * | 2016-01-28 | 2019-03-26 | 北京旷视科技有限公司 | For handling the method and apparatus and search method and device of video and related audio |
CN107241616B (en) * | 2017-06-09 | 2018-10-26 | 腾讯科技(深圳)有限公司 | video lines extracting method, device and storage medium |
CN108229322B (en) * | 2017-11-30 | 2021-02-12 | 北京市商汤科技开发有限公司 | Video-based face recognition method and device, electronic equipment and storage medium |
CN110855875A (en) * | 2018-08-20 | 2020-02-28 | 珠海格力电器股份有限公司 | Method and device for acquiring background information of image |
CN109684913A (en) * | 2018-11-09 | 2019-04-26 | 长沙小钴科技有限公司 | A kind of video human face mask method and system based on community discovery cluster |
CN112818906B (en) * | 2021-02-22 | 2023-07-11 | 浙江传媒学院 | Intelligent cataloging method of all-media news based on multi-mode information fusion understanding |
CN113284256B (en) * | 2021-05-25 | 2023-10-31 | 成都威爱新经济技术研究院有限公司 | MR (magnetic resonance) mixed reality three-dimensional scene material library generation method and system |
CN113591623A (en) * | 2021-07-16 | 2021-11-02 | 青岛新奥胶南燃气工程有限公司 | Intelligent perimeter detection method and equipment |
CN113656643B (en) * | 2021-08-20 | 2024-05-03 | 珠海九松科技有限公司 | Method for analyzing film viewing mood by using AI |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101273351A (en) * | 2005-09-30 | 2008-09-24 | 皇家飞利浦电子股份有限公司 | Face annotation in streaming video |
CN101506828A (en) * | 2006-06-09 | 2009-08-12 | 索尼爱立信移动通讯股份有限公司 | Media identification |
CN102075695A (en) * | 2010-12-30 | 2011-05-25 | 中国科学院自动化研究所 | New generation intelligent cataloging system and method facing large amount of broadcast television programs |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20070294273A1 (en) * | 2006-06-16 | 2007-12-20 | Motorola, Inc. | Method and system for cataloging media files |
Also Published As
Publication number | Publication date |
---|---|
CN102752540A (en) | 2012-10-24 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |