CN107341139A - Multimedia processing method and device, electronic equipment and storage medium - Google Patents
Multimedia processing method and device, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN107341139A CN107341139A CN201710525763.1A CN201710525763A CN107341139A CN 107341139 A CN107341139 A CN 107341139A CN 201710525763 A CN201710525763 A CN 201710525763A CN 107341139 A CN107341139 A CN 107341139A
- Authority
- CN
- China
- Prior art keywords
- multimedia
- destination
- configured information
- default
- property value
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/40—Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
- G06F16/43—Querying
- G06F16/432—Query formulation
- G06F16/434—Query formulation using image data, e.g. images, photos, pictures taken by a user
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/40—Scenes; Scene-specific elements in video content
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Multimedia (AREA)
- Library & Information Science (AREA)
- Mathematical Physics (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The embodiment of the invention provides a multimedia processing method, a multimedia processing device, electronic equipment and a storage medium, wherein the method comprises the following steps: identifying a target object contained in the target multimedia; acquiring an attribute value corresponding to the target object; and generating indication information corresponding to the attribute value, and adding the indication information into the target multimedia. By adopting the invention, the multimedia editing processing form can be enriched, and the diversity of the multimedia editing processing is increased.
Description
Technical field
The present invention relates to electronic technology field, more particularly to a kind of multi-media processing method, device, electronic equipment and storage
Medium.
Background technology
When multimedia is shared, people like carrying out multimedia various personalized editing and processing, such as increase captions, increasing
Add icon, increase logo, scribble etc..These editing and processing can greatly enrich content of multimedia, meet the personalization of user
Demand.
Current multi-media edit is typically to use post processing mode, that is, has been shot after multimedia by user terminal
Software for editing carries out personalized editing and processing to multimedia, but due to each attribute included in content of multimedia can not be known
Information, and be only that multimedia is handled in itself so that the form of multi-media edit processing compares limitation, reduces more matchmakers
The diversity of body editing and processing.
The content of the invention
The embodiment of the present invention provides a kind of multi-media processing method, device, electronic equipment and storage medium, can solve more
The problem of media editing processing form is single.
First aspect of the embodiment of the present invention provides a kind of multi-media processing method, including:
The destination object included in identification destination multimedia;
Obtain property value corresponding to the destination object;
Generation configured information corresponding with the property value, and the configured information is added to the destination multimedia
In.
Optionally, property value corresponding to the acquisition destination object, including:
According to default object and the corresponding relation of preset attribute value, the destination object pair is searched in default database
The property value answered.
Optionally, it is described the configured information is added in the destination multimedia after, in addition to:
The configured information, the default display mode bag are shown in the destination multimedia using default display mode
Include default display location and default display effect.
Optionally, the destination multimedia includes multiple image;
The destination object included in the identification destination multimedia, including:
The destination object included in every two field picture in the multiple image is known respectively using image recognition algorithm
Not.
Optionally, methods described also includes:
The operational order for the configured information is received, the operational order includes amplification instruction, reduces instruction, modification
Any of instruction and deletion instruction;
The configured information is operated according to the operational order.
Second aspect of the embodiment of the present invention provides a kind of multimedia processing apparatus, and described device includes:
Object Identification Module, for identifying the destination object included in destination multimedia;
Data obtaining module, for obtaining property value corresponding to the destination object;
Information add module, for generating configured information corresponding with the property value, and the configured information is added
Into the destination multimedia.
Optionally, described information acquisition module is specifically used for:
According to default object and the corresponding relation of preset attribute value, the destination object pair is searched in default database
The property value answered.
Optionally, described device also includes:
Information display module, for showing the configured information in the destination multimedia using default display mode,
The default display mode includes default display location and default display effect.
Optionally, the destination multimedia includes multiple image;
The Object Identification Module is specifically used for:
The destination object included in every two field picture in the multiple image is known respectively using image recognition algorithm
Not.
Optionally, described device also includes:
Command reception module, for receiving the operational order for the configured information, the operational order includes amplification
Instruction, reduce instruction, modification instruction and delete any of instruction;
Operation executing module, for being operated according to the operational order to the configured information.
The third aspect of the embodiment of the present invention provides a kind of computer-readable storage medium, it is characterised in that the computer storage
Media storage has a plurality of instruction, and the instruction is suitable to the method for being loaded by processor and performing above-mentioned first aspect.
Fourth aspect of the embodiment of the present invention provides a kind of electronic equipment, including:Processor and memory;Wherein, it is described to deposit
Reservoir is stored with computer program, the method for realizing above-mentioned first aspect described in the computing device during computer program.
The aspect of the embodiment of the present invention the 5th provides a kind of application program, including programmed instruction, and described program instruction, which is worked as, is held
Method during row for performing above-mentioned first aspect.
In the present invention is implemented, destination object that multimedia processing apparatus is included by identifying in destination multimedia, and obtain
Take property value corresponding to destination object, generate and the configured information is added to the more matchmakers of target after configured information corresponding with property value
In body.In the prior art due to the various attribute informations included in content of multimedia can not be known, and it is only capable of to multimedia
Itself is handled, and compared with prior art, the present invention can be believed with each attribute included in automatic data collection content of multimedia
Breath, and these attribute informations can be added in multimedia, multi-media edit processing form is enriched, adds multi-media edit
The diversity of processing.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing
There is the required accompanying drawing used in technology description to be briefly described, it should be apparent that, drawings in the following description are only this
Some embodiments of invention, for those of ordinary skill in the art, on the premise of not paying creative work, can be with
Other accompanying drawings are obtained according to these accompanying drawings.
Fig. 1 is a kind of schematic flow sheet of multi-media processing method provided in an embodiment of the present invention;
Fig. 2 is the schematic flow sheet of another multi-media processing method provided in an embodiment of the present invention;
Fig. 3 is a kind of interface schematic diagram of destination multimedia provided in an embodiment of the present invention;
Fig. 4 (a) is a kind of interface schematic diagram of configured information display mode provided in an embodiment of the present invention;
Fig. 4 (b) is the interface schematic diagram of another configured information display mode provided in an embodiment of the present invention;
Fig. 5 is a kind of structural representation of multimedia processing apparatus provided in an embodiment of the present invention;
Fig. 6 is the structural representation of another multimedia processing apparatus provided in an embodiment of the present invention;
Fig. 7 is the structural representation of a kind of electronic equipment provided in an embodiment of the present invention.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out clear, complete
Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, rather than whole embodiments.It is based on
Embodiment in the present invention, those of ordinary skill in the art are obtained every other under the premise of creative work is not paid
Embodiment, belong to the scope of protection of the invention.
It should be noted that the term used in embodiments of the present invention is only merely for the mesh of description specific embodiment
, and it is not intended to be limiting the present invention." one of singulative used in the embodiment of the present invention and appended claims
Kind ", " described " and "the" are also intended to including most forms, unless context clearly shows that other implications.It is also understood that this
Term "and/or" used herein refers to and comprising any or all possible group associated of list items purpose of one or more
Close.In addition, the term " first ", " second ", " the 3rd " in description and claims of this specification and above-mentioned accompanying drawing and "
Four " etc. be to be used to distinguish different objects, rather than for describing particular order.In addition, term " comprising " and " having " and it
Any deformation, it is intended that cover non-exclusive include.Such as contain the process of series of steps or unit, method, be
The step of system, product or equipment are not limited to list or unit, but alternatively also including the step of not listing or list
Member, or alternatively also include for the intrinsic other steps of these processes, method, product or equipment or unit.
Multi-media processing method provided in an embodiment of the present invention can apply to multi-media personal editor's application scenarios, example
Such as:The destination object that multimedia processing apparatus is included by identifying in destination multimedia, and obtain attribute corresponding to destination object
Value, the configured information is added in destination multimedia after generating configured information corresponding with property value.In the prior art due to
The various attribute informations included in content of multimedia can not be known, and be only capable of handling multimedia in itself, it is and existing
Technology is compared, and the present invention can be with the various attribute informations included in automatic data collection content of multimedia, and can believe these attributes
Breath is added in multimedia, enriches multi-media edit processing form, adds the diversity of multi-media edit processing.
The present embodiments relate to multimedia processing apparatus can be it is any possess storage and communication function equipment, example
Such as:It is tablet personal computer, mobile phone, electronic reader, personal computer (Personal Computer, PC), notebook computer, vehicle-mounted
The equipment such as equipment, Web TV, wearable device.
Below in conjunction with accompanying drawing 1- accompanying drawings 4, multi-media processing method provided in an embodiment of the present invention is described in detail.
Fig. 1 is referred to, for the embodiments of the invention provide a kind of schematic flow sheet of multi-media processing method.Such as Fig. 1 institutes
Show, the methods described of the embodiment of the present invention may comprise steps of S101- steps S103.
S101, identify the destination object included in destination multimedia.
Specifically, the destination multimedia can be captured picture or video, can include in destination multimedia
Background area and object.
The destination object included in the identification destination multimedia, it is to be understood that multimedia processing apparatus can use
Image recognition technology identifies destination object.Wherein, image recognition refers to handle image using computer, analyzed and managed
Solution, to identify the target of various different modes and the technology to picture.General industry in use, using industrial camera shoot picture,
Then software is recycled to do further identifying processing according to picture ash jump, the external representative of image recognition software has Cognex
Deng, what the country represented has figure intelligent etc..
Wherein, a kind of common image recognition technology is " general evil spirit " identification model, and it is one kind based on signature analysis
Image identification system.The image recognition of " general evil spirit " identification model system shares 4 levels.First layer is to perform most simple task
" map ghost ", they only record extraneous original images, and erect image retina obtains the map of environmental stimuli, then by " feature
Ghost " further analyzes this map.During analysis, each " feature ghost " looks for the characteristics of image relevant with oneself.
For example, when identifying English alphabet, each feature ghost is responsible for a kind of feature and its quantity of report letter, such as vertical line, level
Line, oblique line, right angle, acute angle, discontinuous curve and full curve etc.;Again by the reaction of " cognition ghost " receptions " feature is terrible ", each
" cognition ghost " all finds the image-related feature with oneself being responsible for identification from the reaction of " feature ghost ", it was found that this feature
When it just " shout ", the feature of discovery is more, and " shout " sound is bigger;Finally, " decision-making ghost " is according to many " cognition ghost " " shouts "
The size of sound, the reaction of cry maximum " cognition ghost " is selected as the image to be identified.
For example, when identifying letter r, " map ghost " first encodes to R, conveys information to " feature ghost " and makees further
Processing, at this moment have 5 " features ghost " and report a vertical line, two horizontal lines included by image respectively, an oblique line, three
Right angle and a discontinuous curve.Then many " cognition ghosts " then identify whether according to these features and its quantity reported
It is oneself responsible letter.At this moment D, P, R ghost can all have reaction, but P ghosts only have 4 features to meet with it, and have a feature (tiltedly
Line) do not met with it;D ghosts only have 3 features to meet with it, and have two features (oblique line, right angle) not met with it;Only R
Ghost has 5 features to meet with it, and this 5 features include R whole features again, so the cry of R ghosts is maximum, therefore
" decision-making ghost " R that just easily makes a choice decision.
In addition, form fit algorithm is also a kind of common image recognition technology, shape is for the important of target identification
Feature, and the expression of the bianry image to target zone.Its usual representation is divided to two classes, coded system, such as chain code, the distance of swimming
Code, freeman codes etc.;Simplified way, such as difference, multinomial, polygonal segments and feature point detection.Pass through feature calculation
The target of given shape in image can be extracted.There are many ripe algorithms easily to extract circular, square, triangle at present
The targets such as shape.
A kind of for example, circle detection algorithm based on adding window Hough transform.Cleaning Principle is:Detect it is round-shaped it
Afterwards, round radius value is obtained, and the round-shaped radius value of target carries out similitude comparison.
A kind of for another example arbitrary triangle detection algorithm based on adding window Hough transform.Cleaning Principle is:In the picture
Appropriately sized window is selected, makees Hough changes to image in window by the origin of coordinates of window center, in the Hough of image
Detection of straight lines section in domain, sliding window, the line segment combination for meeting triangle condition, Ran Houding are found out from the straightway detected
The triangle that these line segments of position are formed.The length condition or angle conditions for changing line segment can also detect right angled triangle, etc.
The special triangles such as lumbar triangle shape, equilateral triangle.
For another example whether there is the algorithm of triangle in a kind of detection image of island school.This method utilizes area filling and triangle
Relational implementation triangular day mark detection between the length and area on the side of shape three.
Optionally, the destination multimedia includes multiple image, then using image recognition algorithm respectively to the multiframe figure
The destination object included in every two field picture as in is identified.
Optionally, if including identical destination object in the destination multimedia, using it is therein any one as mesh
Mark object.
S102, obtain property value corresponding to the destination object.
Specifically, the destination object can be object identity or object address.Wherein, the object identity can be pair
The shape or title of elephant, the object address are the storage address on destination object on the server, as unified resource positions
Accord with (Uniform Resource Locator, URL).The property value can include object calorie value, object type, object chi
The relevant informations such as very little, the object place of production, object functionality.
In a kind of feasible embodiment, the multimedia processing apparatus is according to default object and pair of preset attribute value
It should be related to, property value corresponding to the destination object is searched in default database.In another feasible embodiment,
Info web corresponding to the storage address of the multimedia processing apparatus access target object, the info web is parsed with
Extract the property value of destination object.In another feasible embodiment, the multimedia processing apparatus is to the webserver
The property value search request of destination object is sent, and receives the lookup result of webserver feedback.
Optionally, the object identity of the destination object can be carried in the search request, so that the webserver is searched
Property value corresponding to object identity.
S103, corresponding with property value configured information is generated, and it is more that the configured information is added into the target
In media.
Specifically, after the configured information of multimedia processing apparatus generation identity property value, the configured information is added to mesh
Mark in multimedia.The configured information can be label, or list, the configured information can be added into the target
Position or predeterminated position where object.If including multiple destination objects in same destination multimedia, by the instruction of generation
Information be respectively added to corresponding to position where object or predeterminated position.
In the present invention is implemented, destination object that multimedia processing apparatus is included by identifying in destination multimedia, and obtain
Take property value corresponding to destination object, generate and the configured information is added to the more matchmakers of target after configured information corresponding with property value
In body.In the prior art due to the various attribute informations included in content of multimedia can not be known, and it is only capable of to multimedia
Itself is handled, and compared with prior art, the present invention can be believed with each attribute included in automatic data collection content of multimedia
Breath, and these attribute informations can be added in multimedia, multi-media edit processing form is enriched, adds multi-media edit
The diversity of processing.
Fig. 2 is referred to, for the embodiments of the invention provide the schematic flow sheet of another multi-media processing method.Such as Fig. 2
Shown, the methods described of the embodiment of the present invention may comprise steps of S201- steps S206.
S201, identify the destination object included in destination multimedia.
Specifically, the destination multimedia can be captured picture or video, can include in destination multimedia
Background area and object.Picture A as shown in Figure 3, wherein A1, A2 and A3 be A included in destination object, remainder
For background area.
In specific implementation, multimedia processing apparatus can use image recognition technology identification destination object.Wherein, image recognition
Refer to handle image using computer, analyzed and understood, to identify the target of various different modes and the technology to picture.
General industry using industrial camera in use, shoot picture, and then recycling software does further identification according to picture ash jump
Processing, the external representative of image recognition software have Cognex etc., and what the country represented has figure intelligence etc..
Wherein, a kind of common image recognition technology is " general evil spirit " identification model, and it is one kind based on signature analysis
Image identification system.The image recognition of " general evil spirit " identification model system shares 4 levels.First layer is to perform most simple task
" map ghost ", they only record extraneous original images, and erect image retina obtains the map of environmental stimuli, then by " feature
Ghost " further analyzes this map.During analysis, each " feature ghost " looks for the characteristics of image relevant with oneself.
For example, when identifying English alphabet, each feature ghost is responsible for a kind of feature and its quantity of report letter, such as vertical line, level
Line, oblique line, right angle, acute angle, discontinuous curve and full curve etc.;Again by the reaction of " cognition ghost " receptions " feature is terrible ", each
" cognition ghost " all finds the image-related feature with oneself being responsible for identification from the reaction of " feature ghost ", it was found that this feature
When it just " shout ", the feature of discovery is more, and " shout " sound is bigger;Finally, " decision-making ghost " is according to many " cognition ghost " " shouts "
The size of sound, the reaction of cry maximum " cognition ghost " is selected as the image to be identified.
For example, when identifying letter r, " map ghost " first encodes to R, conveys information to " feature ghost " and makees further
Processing, at this moment have 5 " features ghost " and report a vertical line, two horizontal lines included by image respectively, an oblique line, three
Right angle and a discontinuous curve.Then many " cognition ghosts " then identify whether according to these features and its quantity reported
It is oneself responsible letter.At this moment D, P, R ghost can all have reaction, but P ghosts only have 4 features to meet with it, and have a feature (tiltedly
Line) do not met with it;D ghosts only have 3 features to meet with it, and have two features (oblique line, right angle) not met with it;Only R
Ghost has 5 features to meet with it, and this 5 features include R whole features again, so the cry of R ghosts is maximum, therefore
" decision-making ghost " R that just easily makes a choice decision.
In addition, form fit algorithm is also a kind of common image recognition technology, shape is for the important of target identification
Feature, and the expression of the bianry image to target zone.Its usual representation is divided to two classes, coded system, such as chain code, the distance of swimming
Code, freeman codes etc.;Simplified way, such as difference, multinomial, polygonal segments and feature point detection.Pass through feature calculation
The target of given shape in image can be extracted.There are many ripe algorithms easily to extract circular, square, triangle at present
The targets such as shape.
A kind of for example, circle detection algorithm based on adding window Hough transform.Cleaning Principle is:Detect it is round-shaped it
Afterwards, round radius value is obtained, and the round-shaped radius value of target carries out similitude comparison.
A kind of for another example arbitrary triangle detection algorithm based on adding window Hough transform.Cleaning Principle is:In the picture
Appropriately sized window is selected, makees Hough changes to image in window by the origin of coordinates of window center, in the Hough of image
Detection of straight lines section in domain, sliding window, the line segment combination for meeting triangle condition, Ran Houding are found out from the straightway detected
The triangle that these line segments of position are formed.The length condition or angle conditions for changing line segment can also detect right angled triangle, etc.
The special triangles such as lumbar triangle shape, equilateral triangle.
For another example whether there is the algorithm of triangle in a kind of detection image of island school.This method utilizes area filling and triangle
Relational implementation triangular day mark detection between the length and area on the side of shape three.
Optionally, the destination multimedia includes multiple image, then using image recognition algorithm respectively to the multiframe figure
The destination object included in every two field picture as in is identified.
S202, according to default object and the corresponding relation of preset attribute value, the target is searched in default database
Property value corresponding to object.
Specifically, the default object can be object identity, object address.Wherein, the object identity can be pair
The shape or title of elephant, the object address are the storage address on destination object, such as uniform resource position mark URL.
It is caloric value to take property value, is as shown in table 1 default object and the mapping table of default caloric value, this reflects
Relation table is penetrated to be stored in default database, then can be according to the mapping when multimedia processing apparatus gets destination object
Table search is to corresponding caloric value.For example, if destination object is apple, corresponding caloric value is 52cal.
Table 1
Object | Caloric value (cal) |
Cauliflower | 24 |
Egg | 144 |
Milk | 54 |
Apple | 52 |
S203, corresponding with property value configured information is generated, and it is more that the configured information is added into the target
In media.
Specifically, after the configured information of multimedia processing apparatus generation identity property value, the configured information is added to mesh
Mark in multimedia.The configured information can be label, or list, the configured information can be added into the target
Position or default specified location where object.If including multiple destination objects in same destination multimedia, by generation
Configured information be respectively added to corresponding to position where object or default position.
For example, such as Fig. 4 (a) show a kind of configured information display mode therein, Fig. 4 (b) is another configured information
Display mode.
S204, the configured information is shown in the destination multimedia using default display mode.
Specifically, the multimedia processing apparatus is shown using default display mode to the configured information.Wherein,
The default display mode is the self-defined setting of the multimedia processing apparatus, including default display location and default display are imitated
Fruit.
Optionally, the multimedia processing apparatus presses the configured information according to the time for generating the configured information
Shown according to time order and function order in default viewing area.Optionally, default viewing area can be superimposed upon multimedia and show
Show overlying regions, for example, default viewing area is superimposed upon multimedia display overlying regions in a transparent way, can so make more matchmakers
The display size of body viewing area and default viewing area reaches maximization;Or multimedia display region and default viewing area
It can specifically not limited in the diverse location of display interface.
S205, the operational order for the configured information is received, the operational order includes amplification instruction, diminution refers to
Make, change instruction and delete any of instruction.
S206, the configured information is operated according to the operational order.
Specifically, when user is operated for the configured information, multimedia processing apparatus then receives operation and referred to
Order, and operated corresponding to execute instruction.User can so be improved in multi-media edit to the operability of configured information.
For example, if user wants the display mode of change configured information, display mode modification operation is carried out, if current
Display mode rolls the position tapered into simultaneously to where destination object to the left to be shown from the lower right of display interface
Put, now then switch to and first pass through amplification mode and highlight preset time, preset time is contracted to the position where destination object later
Put.
In another example if multimedia processing apparatus receives the deletion instruction for configured information, corresponding indicate is deleted
Information.
In the present invention is implemented, destination object that multimedia processing apparatus is included by identifying in destination multimedia, and
Property value corresponding to destination object is found in presetting database, believes the instruction after generating configured information corresponding with property value
Breath is added in destination multimedia, shows the configured information by using default display mode, while can also refer to according to for this
Show that the operations such as information input addition, deletion, modification are handled configured information.In the prior art due to multimedia can not be known
Various attribute informations included in content, and be only capable of handling multimedia in itself, compared with prior art, the present invention
Can be with the various attribute informations included in automatic data collection content of multimedia, and these attribute informations can be added to multimedia
In, multi-media edit processing form is enriched, adds the diversity of multi-media edit processing.
Fig. 5 is referred to, for the embodiments of the invention provide a kind of structural representation of multimedia processing apparatus.Such as Fig. 5 institutes
Show, the multimedia processing apparatus 1 of the embodiment of the present invention can include:Object Identification Module 11, the and of data obtaining module 12
Information add module 13.
Object Identification Module 11, for identifying the destination object included in destination multimedia.
The destination multimedia can be captured picture or video, and background area can be included in destination multimedia
And object.Picture A as shown in Figure 3, wherein A1, A2 and A3 are the destination object included in A, and remainder is background area
Domain.
In specific implementation, Object Identification Module can use image recognition technology identification destination object.Wherein, image recognition is
Finger is handled image, analyzed and understood using computer, to identify the target of various different modes and the technology to picture.One
As in industrial application, picture is shot using industrial camera, then recycles software to be done according to picture ash jump at further identification
Reason, the external representative of image recognition software have Cognex etc., and what the country represented has figure intelligence etc..
Wherein, a kind of common image recognition technology is " general evil spirit " identification model, and it is one kind based on signature analysis
Image identification system.The image recognition of " general evil spirit " identification model system shares 4 levels.First layer is to perform most simple task
" map ghost ", they only record extraneous original images, and erect image retina obtains the map of environmental stimuli, then by " feature
Ghost " further analyzes this map.During analysis, each " feature ghost " looks for the characteristics of image relevant with oneself.
For example, when identifying English alphabet, each feature ghost is responsible for a kind of feature and its quantity of report letter, such as vertical line, level
Line, oblique line, right angle, acute angle, discontinuous curve and full curve etc.;Again by the reaction of " cognition ghost " receptions " feature is terrible ", each
" cognition ghost " all finds the image-related feature with oneself being responsible for identification from the reaction of " feature ghost ", it was found that this feature
When it just " shout ", the feature of discovery is more, and " shout " sound is bigger;Finally, " decision-making ghost " is according to many " cognition ghost " " shouts "
The size of sound, the reaction of cry maximum " cognition ghost " is selected as the image to be identified.
For example, when identifying letter r, " map ghost " first encodes to R, conveys information to " feature ghost " and makees further
Processing, at this moment have 5 " features ghost " and report a vertical line, two horizontal lines included by image respectively, an oblique line, three
Right angle and a discontinuous curve.Then many " cognition ghosts " then identify whether according to these features and its quantity reported
It is oneself responsible letter.At this moment D, P, R ghost can all have reaction, but P ghosts only have 4 features to meet with it, and have a feature (tiltedly
Line) do not met with it;D ghosts only have 3 features to meet with it, and have two features (oblique line, right angle) not met with it;Only R
Ghost has 5 features to meet with it, and this 5 features include R whole features again, so the cry of R ghosts is maximum, therefore
" decision-making ghost " R that just easily makes a choice decision.
In addition, form fit algorithm is also a kind of common image recognition technology, shape is for the important of target identification
Feature, and the expression of the bianry image to target zone.Its usual representation is divided to two classes, coded system, such as chain code, the distance of swimming
Code, freeman codes etc.;Simplified way, such as difference, multinomial, polygonal segments and feature point detection.Pass through feature calculation
The target of given shape in image can be extracted.There are many ripe algorithms easily to extract circular, square, triangle at present
The targets such as shape.
A kind of for example, circle detection algorithm based on adding window Hough transform.Cleaning Principle is:Detect it is round-shaped it
Afterwards, round radius value is obtained, and the round-shaped radius value of target carries out similitude comparison.
A kind of for another example arbitrary triangle detection algorithm based on adding window Hough transform.Cleaning Principle is:In the picture
Appropriately sized window is selected, makees Hough changes to image in window by the origin of coordinates of window center, in the Hough of image
Detection of straight lines section in domain, sliding window, the line segment combination for meeting triangle condition, Ran Houding are found out from the straightway detected
The triangle that these line segments of position are formed.The length condition or angle conditions for changing line segment can also detect right angled triangle, etc.
The special triangles such as lumbar triangle shape, equilateral triangle.
For another example whether there is the algorithm of triangle in a kind of detection image of island school.This method utilizes area filling and triangle
Relational implementation triangular day mark detection between the length and area on the side of shape three.
Optionally, the destination multimedia includes multiple image, then using image recognition algorithm respectively to the multiframe figure
The destination object included in every two field picture as in is identified.
Optionally, the destination multimedia includes multiple image;
The Object Identification Module 11 is specifically used for:
The destination object included in every two field picture in the multiple image is known respectively using image recognition algorithm
Not.
Data obtaining module 12, for obtaining property value corresponding to the destination object.
Optionally, described information acquisition module 12 is specifically used for:
According to default object and the corresponding relation of preset attribute value, the destination object pair is searched in default database
The property value answered.
Optionally, data obtaining module is specifically used for info web corresponding to the storage address of access target object, to this
Info web is parsed to extract the property value of destination object.Optionally, data obtaining module is specifically used for network service
Device sends the property value search request of destination object, and receives the lookup result of webserver feedback.
Information add module 13, for generating configured information corresponding with the property value, and the configured information is added
It is added in the destination multimedia.
Specifically, after the configured information of information add module generation identity property value, the configured information is added to target
In multimedia.The configured information can be label, or list, the configured information can be added into the target pair
As the position at place.If including multiple destination objects in same destination multimedia, the configured information of generation is added respectively
Position to where corresponding object.
Optionally, as shown in fig. 6, described device 1 also includes:
Information display module 14, for showing the instruction letter in the destination multimedia using default display mode
Breath, the default display mode include default display location and default display effect.
Specifically, described information display module is shown using default display mode to the configured information.Wherein, institute
It is the self-defined setting of the multimedia processing apparatus to state default display mode, including default display location and default display are imitated
Fruit.
Optionally, described information display module is according to the time for generating the configured information, by the configured information according to
Time order and function order is shown in default viewing area.Optionally, default viewing area can be superimposed upon multimedia display
Overlying regions, for example, default viewing area is superimposed upon multimedia display overlying regions in a transparent way, it can so make multimedia
The display size of viewing area and default viewing area reaches maximization;Or multimedia display region and default viewing area can
In the diverse location of display interface, not limit specifically.
Optionally, as shown in fig. 6, described device 1 also includes:
Command reception module 15, for receiving the operational order for the configured information, the operational order includes putting
Big instruction, reduce instruction, modification instruction and delete any of instruction;
Operation executing module 16, for being operated according to the operational order to the configured information.
Specifically, when user is operated for the configured information, command reception module then receives operational order,
Operated corresponding to operation executing module execute instruction.User can so be improved in net cast group to interaction image data
Operability.
For example, if user wants the display mode of change configured information, display mode modification operation is carried out, if current
Display mode rolls the position tapered into simultaneously to where destination object to the left to be shown from the lower right of display interface
Put, now then switch to and first pass through amplification mode and highlight preset time, preset time is contracted to the position where destination object later
Put.
In another example if command reception module receives the deletion instruction for configured information, operation executing module is then deleted
Corresponding configured information.
In the present invention is implemented, destination object that multimedia processing apparatus is included by identifying in destination multimedia, and
Property value corresponding to destination object is found in presetting database, believes the instruction after generating configured information corresponding with property value
Breath is added in destination multimedia, shows the configured information by using default display mode, while can also refer to according to for this
Show that the operations such as information input addition, deletion, modification are handled configured information.In the prior art due to multimedia can not be known
Various attribute informations included in content, and be only capable of handling multimedia in itself, compared with prior art, the present invention
Can be with the various attribute informations included in automatic data collection content of multimedia, and these attribute informations can be added to multimedia
In, multi-media edit processing form is enriched, adds the diversity of multi-media edit processing.
Fig. 7 is referred to, for the embodiments of the invention provide the structural representation of a kind of electronic equipment.It is as shown in fig. 7, described
Electronic equipment 1000 can include:At least one processor 1001, such as CPU, at least one network interface 1004, user interface
1003, memory 1005, at least one communication bus 1002.Wherein, communication bus 1002 is used to realize between these components
Connection communication.Wherein, user interface 1003 can include display screen (Display), keyboard (Keyboard), optional user interface
1003 can also include wireline interface, the wave point of standard.Network interface 1004 can optionally connect including the wired of standard
Mouth, wave point (such as WI-FI interfaces).Memory 1005 can be high-speed RAM memory or non-labile storage
Device (non-volatile memory), for example, at least a magnetic disk storage.Memory 1005 optionally can also be at least one
The individual storage device for being located remotely from aforementioned processor 1001.As shown in fig. 7, as a kind of memory of computer-readable storage medium
Operating system, network communication module, Subscriber Interface Module SIM and multi-media processing application program can be included in 1005.
In the electronic equipment 1000 shown in Fig. 7, user interface 1003 is mainly used in providing the user the interface of input;And
Processor 1001 can be used for calling the multi-media processing application program stored in memory 1005, and specifically perform following grasp
Make:
The destination object included in identification destination multimedia;
Obtain property value corresponding to the destination object;
Generation configured information corresponding with the property value, and the configured information is added to the destination multimedia
In.
In one embodiment, the processor 1001 is when performing property value corresponding to the acquisition destination object, tool
Body performs following steps:
According to default object and the corresponding relation of preset attribute value, the destination object pair is searched in default database
The property value answered.
In one embodiment, the configured information is added to the destination multimedia by the processor 1001 in execution
In after, specifically perform following steps:
The configured information, the default display mode bag are shown in the destination multimedia using default display mode
Include default display location and default display effect.
In one embodiment, the destination multimedia includes multiple image;
The processor 1001 is specific to perform following walk when performing the destination object for identifying and being included in destination multimedia
Suddenly:
The destination object included in every two field picture in the multiple image is known respectively using image recognition algorithm
Not.
In one embodiment, the processor 1001 also performs following steps:
The operational order for the configured information is received, the operational order includes amplification instruction, reduces instruction, modification
Any of instruction and deletion instruction;
The configured information is operated according to the operational order.
The embodiment of the present invention also provides a kind of computer-readable storage medium (non-transitorycomputer readable storage medium), described
Computer-readable storage medium is stored with computer program, and the computer program includes program signaling, and described program signaling, which is worked as, to be counted
Calculation machine makes the computer perform method as in the foregoing embodiment when performing, the computer can be mentioned above more
A part for media processor or electronic equipment.
Above-mentioned non-transitorycomputer readable storage medium can use appointing for one or more computer-readable media
Meaning combination.Computer-readable medium can be computer-readable signal media or computer-readable recording medium.Computer can
Read storage medium and for example may be-but not limited to-the system of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, device
Or device, or any combination above.The more specifically example (non exhaustive list) of computer-readable recording medium includes:
Electrical connection, portable computer diskette, hard disk, random access memory (RAM), read-only storage with one or more wires
Device (Read Only Memory;Hereinafter referred to as:ROM), erasable programmable read only memory (Erasable
Programmable Read Only Memory;Hereinafter referred to as:EPROM) or flash memory, optical fiber, portable compact disc are read-only deposits
Reservoir (CD-ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer
Readable storage medium storing program for executing can be any includes or the tangible medium of storage program, the program can be commanded execution system, device
Either device use or in connection.
Computer-readable signal media can include in a base band or as carrier wave a part propagation data-signal,
Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including --- but
It is not limited to --- electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be
Any computer-readable medium beyond computer-readable recording medium, the computer-readable medium can send, propagate or
Transmit for by instruction execution system, device either device use or program in connection.
The program code included on computer-readable medium can be transmitted with any appropriate medium, including --- but it is unlimited
In --- wireless, electric wire, optical cable, RF etc., or above-mentioned any appropriate combination.
Can with one or more programming languages or its combination come write for perform the application operation computer
Program code, described program design language include object oriented program language-such as Java, Smalltalk, C++,
Also include conventional procedural programming language-such as " C " language or similar programming language.Program code can be with
Fully perform, partly perform on the user computer on the user computer, the software kit independent as one performs, portion
Divide and partly perform or performed completely on remote computer or server on the remote computer on the user computer.
It is related in the situation of remote computer, remote computer can pass through the network of any kind --- including LAN (Local
Area Network;Hereinafter referred to as:) or wide area network (Wide Area Network LAN;Hereinafter referred to as:WAN) it is connected to user
Computer, or, it may be connected to outer computer (such as passing through Internet connection using ISP).
The embodiment of the present application also provides a kind of computer program product, when the instruction in above computer program product by
When managing device execution, it is possible to achieve the application Fig. 1 or the multi-media processing method of embodiment illustrated in fig. 2 offer.
Through the above description of the embodiments, it is apparent to those skilled in the art that, for description
It is convenient and succinct, can as needed will be upper only with the division progress of above-mentioned each functional module for example, in practical application
State function distribution to be completed by different functional modules, i.e., the internal structure of device is divided into different functional modules, to complete
All or part of function described above.The specific work process of the system, apparatus, and unit of foregoing description, before may be referred to
The corresponding process in embodiment of the method is stated, will not be repeated here.
In several embodiments provided herein, it should be understood that disclosed system, apparatus and method can be with
Realize by another way.For example, device embodiment described above is only schematical, for example, the module or
The division of unit, only a kind of division of logic function, can there are other dividing mode, such as multiple units when actually realizing
Or component can combine or be desirably integrated into another system, or some features can be ignored, or not perform.It is another, institute
Display or the mutual coupling discussed or direct-coupling or communication connection can be by some interfaces, device or unit
INDIRECT COUPLING or communication connection, can be electrical, mechanical or other forms.
The unit illustrated as separating component can be or may not be physically separate, show as unit
The part shown can be or may not be physical location, you can with positioned at a place, or can also be distributed to multiple
On NE.Some or all of unit therein can be selected to realize the mesh of this embodiment scheme according to the actual needs
's.
In addition, each functional unit in each embodiment of the application can be integrated in a processing unit, can also
That unit is individually physically present, can also two or more units it is integrated in a unit.Above-mentioned integrated list
Member can both be realized in the form of hardware, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is used as independent production marketing or use
When, it can be stored in a computer read/write memory medium.Based on such understanding, the technical scheme of the application is substantially
The part to be contributed in other words to prior art or all or part of the technical scheme can be in the form of software products
Embody, the computer software product is stored in a storage medium, including some instructions are causing a computer
It is each that equipment (can be personal computer, server, or network equipment etc.) or processor (processor) perform the application
The all or part of step of embodiment methods described.And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage
(Read Only Memory;Hereinafter referred to as:ROM), random access memory (Random Access Memory;Hereinafter referred to as:
RAM), magnetic disc or CD etc. are various can be with the medium of store program codes.
Described above, the only embodiment of the application, but the protection domain of the application is not limited thereto is any
Those familiar with the art can readily occur in change or replacement in the technical scope that the application discloses, and should all contain
Cover within the protection domain of the application.Therefore, the protection domain of the application should be based on the protection scope of the described claims.
Claims (10)
1. a kind of multi-media processing method, it is characterised in that methods described includes:
The destination object included in identification destination multimedia;
Obtain property value corresponding to the destination object;
Generation configured information corresponding with the property value, and the configured information is added in the destination multimedia.
2. according to the method for claim 1, it is characterised in that described to obtain property value corresponding to the destination object, bag
Include:
According to default object and the corresponding relation of preset attribute value, searched in default database corresponding to the destination object
Property value.
3. according to the method for claim 1, it is characterised in that described that the configured information is added to the more matchmakers of the target
After in body, in addition to:
The configured information is shown in the destination multimedia using default display mode, the default display mode includes pre-
If display location and default display effect.
4. according to the method for claim 1, it is characterised in that the destination multimedia includes multiple image;
The destination object included in the identification destination multimedia, including:
The destination object included in every two field picture in the multiple image is identified respectively using image recognition algorithm.
5. according to the method for claim 1, it is characterised in that methods described also includes:
The operational order for the configured information is received, the operational order includes amplification instruction, reduces instruction, modification instruction
And delete any of instruction;
The configured information is operated according to the operational order.
6. a kind of multimedia processing apparatus, it is characterised in that described device includes:
Object Identification Module, for identifying the destination object included in destination multimedia;
Data obtaining module, for obtaining property value corresponding to the destination object;
Information add module, for generating configured information corresponding with the property value, and the configured information is added to institute
State in destination multimedia.
7. device according to claim 6, it is characterised in that described information acquisition module is specifically used for:
According to default object and the corresponding relation of preset attribute value, searched in default database corresponding to the destination object
Property value.
8. device according to claim 6, it is characterised in that described device also includes:
Information display module, it is described for showing the configured information in the destination multimedia using default display mode
Default display mode includes default display location and default display effect.
9. a kind of computer-readable storage medium, it is characterised in that the computer-readable storage medium is stored with a plurality of instruction, the instruction
Suitable for being loaded by processor and being performed such as any one of claim 1 to 5 methods described.
10. a kind of electronic equipment, it is characterised in that including:Processor and memory;Wherein, the memory storage has calculating
Machine program, realized described in the computing device during computer program such as any one of claim 1 to 5 methods described.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710525763.1A CN107341139A (en) | 2017-06-30 | 2017-06-30 | Multimedia processing method and device, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710525763.1A CN107341139A (en) | 2017-06-30 | 2017-06-30 | Multimedia processing method and device, electronic equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107341139A true CN107341139A (en) | 2017-11-10 |
Family
ID=60218279
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710525763.1A Pending CN107341139A (en) | 2017-06-30 | 2017-06-30 | Multimedia processing method and device, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107341139A (en) |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108619717A (en) * | 2018-03-21 | 2018-10-09 | 腾讯科技(深圳)有限公司 | Determination method, apparatus, storage medium and the electronic device of operation object |
CN109391849A (en) * | 2018-09-30 | 2019-02-26 | 联想(北京)有限公司 | Processing method and system, multi-media output device and memory |
CN109886258A (en) * | 2019-02-19 | 2019-06-14 | 新华网(北京)科技有限公司 | The method, apparatus and electronic equipment of the related information of multimedia messages are provided |
CN110287345A (en) * | 2019-06-28 | 2019-09-27 | 北京金山安全软件有限公司 | Method and device for detecting image source, electronic equipment and storage medium |
CN112528915A (en) * | 2020-12-18 | 2021-03-19 | 北京华如科技股份有限公司 | Intelligent plotting method based on 'pan magic' recognition model and storage medium thereof |
CN112597648A (en) * | 2020-12-18 | 2021-04-02 | 北京华如科技股份有限公司 | Simulation scenario generation method based on 'pan magic' recognition model and storage medium |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105096180A (en) * | 2015-07-20 | 2015-11-25 | 北京易讯理想科技有限公司 | Commodity information display method and apparatus based augmented reality |
CN105117463A (en) * | 2015-08-24 | 2015-12-02 | 北京旷视科技有限公司 | Information processing method and information processing device |
US20170004386A1 (en) * | 2015-07-02 | 2017-01-05 | Agt International Gmbh | Multi-camera vehicle identification system |
-
2017
- 2017-06-30 CN CN201710525763.1A patent/CN107341139A/en active Pending
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20170004386A1 (en) * | 2015-07-02 | 2017-01-05 | Agt International Gmbh | Multi-camera vehicle identification system |
CN105096180A (en) * | 2015-07-20 | 2015-11-25 | 北京易讯理想科技有限公司 | Commodity information display method and apparatus based augmented reality |
CN105117463A (en) * | 2015-08-24 | 2015-12-02 | 北京旷视科技有限公司 | Information processing method and information processing device |
Non-Patent Citations (2)
Title |
---|
勒普顿 等: "《图形设计新元素》", 30 September 2009 * |
秦亚军 等: "《FLASH动画设计项目实践》", 31 August 2015, 西南交通大学出版社 * |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108619717A (en) * | 2018-03-21 | 2018-10-09 | 腾讯科技(深圳)有限公司 | Determination method, apparatus, storage medium and the electronic device of operation object |
CN109391849A (en) * | 2018-09-30 | 2019-02-26 | 联想(北京)有限公司 | Processing method and system, multi-media output device and memory |
CN109391849B (en) * | 2018-09-30 | 2020-11-20 | 联想(北京)有限公司 | Processing method and system, multimedia output device and memory |
CN109886258A (en) * | 2019-02-19 | 2019-06-14 | 新华网(北京)科技有限公司 | The method, apparatus and electronic equipment of the related information of multimedia messages are provided |
CN110287345A (en) * | 2019-06-28 | 2019-09-27 | 北京金山安全软件有限公司 | Method and device for detecting image source, electronic equipment and storage medium |
CN110287345B (en) * | 2019-06-28 | 2023-08-15 | 北京乐蜜科技有限责任公司 | Image source detection method and device, electronic equipment and storage medium |
CN112528915A (en) * | 2020-12-18 | 2021-03-19 | 北京华如科技股份有限公司 | Intelligent plotting method based on 'pan magic' recognition model and storage medium thereof |
CN112597648A (en) * | 2020-12-18 | 2021-04-02 | 北京华如科技股份有限公司 | Simulation scenario generation method based on 'pan magic' recognition model and storage medium |
CN112597648B (en) * | 2020-12-18 | 2023-09-22 | 北京华如科技股份有限公司 | Simulation design generation method based on 'general magic' recognition model and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107341139A (en) | Multimedia processing method and device, electronic equipment and storage medium | |
CN102687140B (en) | For contributing to the method and apparatus of CBIR | |
Rukhovich et al. | Iterdet: iterative scheme for object detection in crowded environments | |
JP6240199B2 (en) | Method and apparatus for identifying object in image | |
CN107633066A (en) | Information display method and device, electronic equipment and storage medium | |
CN112464814A (en) | Video processing method and device, electronic equipment and storage medium | |
US9313444B2 (en) | Relational display of images | |
US10963700B2 (en) | Character recognition | |
US20130179436A1 (en) | Display apparatus, remote control apparatus, and searching methods thereof | |
CN111259751A (en) | Video-based human behavior recognition method, device, equipment and storage medium | |
CN111541943B (en) | Video processing method, video operation method, device, storage medium and equipment | |
US9851873B2 (en) | Electronic album creating apparatus and method of producing electronic album | |
JP6787831B2 (en) | Target detection device, detection model generation device, program and method that can be learned by search results | |
CN111309200B (en) | Method, device, equipment and storage medium for determining extended reading content | |
CN111738263A (en) | Target detection method and device, electronic equipment and storage medium | |
US20180276471A1 (en) | Information processing device calculating statistical information | |
US8498978B2 (en) | Slideshow video file detection | |
CN110929057A (en) | Image processing method, device and system, storage medium and electronic device | |
CN107451194A (en) | A kind of image searching method and device | |
US11068121B2 (en) | System and method for visual exploration of subnetwork patterns in two-mode networks | |
US20150026013A1 (en) | System and methods for cognitive visual product search | |
US11048713B2 (en) | System and method for visual exploration of search results in two-mode networks | |
CN111291756A (en) | Method and device for detecting text area in image, computer equipment and computer storage medium | |
CN111539390A (en) | Small target image identification method, equipment and system based on Yolov3 | |
CN111506754A (en) | Picture retrieval method and device, storage medium and processor |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171110 |
|
RJ01 | Rejection of invention patent application after publication |