CN107341139A - Multimedia processing method and device, electronic equipment and storage medium - Google Patents

Info

Publication number
CN107341139A
Authority
CN
China
Prior art keywords
multimedia
destination
configured information
default
property value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710525763.1A
Other languages
Chinese (zh)
Inventor
韩沁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Internet Security Software Co Ltd
Original Assignee
Beijing Kingsoft Internet Security Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Internet Security Software Co Ltd filed Critical Beijing Kingsoft Internet Security Software Co Ltd
Priority to CN201710525763.1A priority Critical patent/CN107341139A/en
Publication of CN107341139A publication Critical patent/CN107341139A/en
Pending legal-status Critical Current

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/40 Information retrieval; Database structures therefor; File system structures therefor of multimedia data, e.g. slideshows comprising image and additional audio data
    • G06F16/43 Querying
    • G06F16/432 Query formulation
    • G06F16/434 Query formulation using image data, e.g. images, photos, pictures taken by a user
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50 Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06V IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00 Scenes; Scene-specific elements
    • G06V20/40 Scenes; Scene-specific elements in video content

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Multimedia (AREA)
  • Library & Information Science (AREA)
  • Mathematical Physics (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention provides a multimedia processing method, a multimedia processing device, electronic equipment and a storage medium, wherein the method comprises the following steps: identifying a target object contained in the target multimedia; acquiring an attribute value corresponding to the target object; and generating indication information corresponding to the attribute value, and adding the indication information into the target multimedia. By adopting the invention, the multimedia editing processing form can be enriched, and the diversity of the multimedia editing processing is increased.

Description

Multimedia processing method and device, electronic equipment and storage medium
Technical field
The present invention relates to the field of electronic technology, and in particular to a multimedia processing method and device, electronic equipment, and a storage medium.
Background
When sharing multimedia, people like to apply various personalized edits to it, such as adding captions, icons, or logos, or doodling on it. Such edits can greatly enrich the multimedia content and meet users' personalization needs.
Current multimedia editing is typically done as post-processing: after the multimedia has been captured, editing software on the user terminal applies personalized edits to it. However, because the attribute information contained in the multimedia content cannot be known, only the multimedia itself is processed. This limits the forms that multimedia editing can take and reduces its diversity.
Summary of the invention
The embodiments of the present invention provide a multimedia processing method and device, electronic equipment, and a storage medium, which can solve the problem that multimedia editing takes only a single form.
A first aspect of the embodiments of the present invention provides a multimedia processing method, including:
identifying a target object contained in target multimedia;
acquiring an attribute value corresponding to the target object;
generating indication information corresponding to the attribute value, and adding the indication information to the target multimedia.
Optionally, acquiring the attribute value corresponding to the target object includes:
looking up the attribute value corresponding to the target object in a preset database according to a correspondence between preset objects and preset attribute values.
Optionally, after adding the indication information to the target multimedia, the method further includes:
displaying the indication information in the target multimedia in a preset display mode, the preset display mode including a preset display position and a preset display effect.
Optionally, the target multimedia includes multiple frames of images;
identifying the target object contained in the target multimedia includes:
using an image recognition algorithm to identify, frame by frame, the target object contained in each of the multiple frames.
Optionally, the method further includes:
receiving an operation instruction for the indication information, the operation instruction including any one of a zoom-in instruction, a zoom-out instruction, a modification instruction, and a deletion instruction;
operating on the indication information according to the operation instruction.
A second aspect of the embodiments of the present invention provides a multimedia processing device, the device including:
an object identification module, configured to identify a target object contained in target multimedia;
an information acquisition module, configured to acquire an attribute value corresponding to the target object;
an information adding module, configured to generate indication information corresponding to the attribute value and add the indication information to the target multimedia.
Optionally, the information acquisition module is specifically configured to:
look up the attribute value corresponding to the target object in a preset database according to a correspondence between preset objects and preset attribute values.
Optionally, the device further includes:
an information display module, configured to display the indication information in the target multimedia in a preset display mode, the preset display mode including a preset display position and a preset display effect.
Optionally, the target multimedia includes multiple frames of images;
the object identification module is specifically configured to:
use an image recognition algorithm to identify, frame by frame, the target object contained in each of the multiple frames.
Optionally, the device further includes:
an instruction receiving module, configured to receive an operation instruction for the indication information, the operation instruction including any one of a zoom-in instruction, a zoom-out instruction, a modification instruction, and a deletion instruction;
an operation execution module, configured to operate on the indication information according to the operation instruction.
A third aspect of the embodiments of the present invention provides a computer storage medium, wherein the computer storage medium stores a plurality of instructions, and the instructions are suitable for being loaded by a processor to execute the method of the first aspect.
A fourth aspect of the embodiments of the present invention provides electronic equipment, including a processor and a memory, wherein the memory stores a computer program, and the processor implements the method of the first aspect when executing the computer program.
A fifth aspect of the embodiments of the present invention provides an application program, including program instructions which, when executed, perform the method of the first aspect.
In the embodiments of the present invention, the multimedia processing device identifies a target object contained in target multimedia, acquires the attribute value corresponding to the target object, generates indication information corresponding to the attribute value, and adds the indication information to the target multimedia. In the prior art, the attribute information contained in multimedia content cannot be known, so only the multimedia itself can be processed. By contrast, the present invention can automatically collect the attribute information contained in multimedia content and add it to the multimedia, which enriches the forms of multimedia editing and increases its diversity.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the drawings needed for describing the embodiments or the prior art are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a schematic flowchart of a multimedia processing method provided by an embodiment of the present invention;
Fig. 2 is a schematic flowchart of another multimedia processing method provided by an embodiment of the present invention;
Fig. 3 is a schematic interface diagram of target multimedia provided by an embodiment of the present invention;
Fig. 4(a) is a schematic interface diagram of one indication information display mode provided by an embodiment of the present invention;
Fig. 4(b) is a schematic interface diagram of another indication information display mode provided by an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a multimedia processing device provided by an embodiment of the present invention;
Fig. 6 is a schematic structural diagram of another multimedia processing device provided by an embodiment of the present invention;
Fig. 7 is a schematic structural diagram of electronic equipment provided by an embodiment of the present invention.
Detailed description
The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present invention without creative effort fall within the protection scope of the present invention.
It should be noted that the terms used in the embodiments of the present invention are only for the purpose of describing specific embodiments and are not intended to limit the present invention. The singular forms "a", "said", and "the" used in the embodiments of the present invention and the appended claims are also intended to include plural forms, unless the context clearly indicates otherwise. It should also be understood that the term "and/or" as used herein refers to and encompasses any or all possible combinations of one or more of the associated listed items. In addition, the terms "first", "second", "third", "fourth", and so on in the specification, claims, and above drawings of the present invention are used to distinguish different objects rather than to describe a particular order. Furthermore, the terms "include" and "have" and any variations thereof are intended to cover non-exclusive inclusion. For example, a process, method, system, product, or device that contains a series of steps or units is not limited to the listed steps or units, but optionally also includes steps or units that are not listed, or optionally also includes other steps or units inherent to the process, method, product, or device.
The multimedia processing method provided by the embodiments of the present invention can be applied to personalized multimedia editing scenarios. For example, the multimedia processing device identifies a target object contained in target multimedia, acquires the attribute value corresponding to the target object, generates indication information corresponding to the attribute value, and adds the indication information to the target multimedia. In the prior art, the attribute information contained in multimedia content cannot be known, so only the multimedia itself can be processed. By contrast, the present invention can automatically collect the attribute information contained in multimedia content and add it to the multimedia, which enriches the forms of multimedia editing and increases its diversity.
The multimedia processing device involved in the embodiments of the present invention may be any device with storage and communication functions, for example a tablet computer, mobile phone, e-reader, personal computer (PC), notebook computer, in-vehicle device, network TV, or wearable device.
The multimedia processing method provided by the embodiments of the present invention is described in detail below with reference to Figs. 1 to 4.
Refer to Fig. 1, which is a schematic flowchart of a multimedia processing method provided by an embodiment of the present invention. As shown in Fig. 1, the method of this embodiment may include the following steps S101 to S103.
S101, identify the target object contained in the target multimedia.
Specifically, the target multimedia may be a captured picture or video, and may contain a background area and objects.
Regarding identifying the target object contained in the target multimedia, it can be understood that the multimedia processing device may use image recognition technology to identify the target object. Image recognition refers to technology that uses a computer to process, analyze, and understand images in order to identify targets and objects of various patterns. In typical industrial use, an industrial camera captures a picture, and software then performs further recognition processing based on the grayscale differences in the picture. Representative image recognition software includes Cognex abroad and Figure Intelligent domestically.
One common image recognition technique is the "Pandemonium" recognition model, an image recognition system based on feature analysis. The Pandemonium model has four levels. The first level consists of "image demons", which perform the simplest task: they merely record the original image from the outside world, much as the retina captures an image of an external stimulus. "Feature demons" then analyze this image further; during analysis, each feature demon looks for the image features it is responsible for. For example, when recognizing English letters, each feature demon reports one kind of feature of the letter and its count, such as vertical lines, horizontal lines, oblique lines, right angles, acute angles, discontinuous curves, and continuous curves. Next, "cognitive demons" receive the responses of the feature demons. Each cognitive demon searches those responses for the features of the image it is responsible for recognizing and "shouts" when it finds them; the more features it finds, the louder it shouts. Finally, the "decision demon" compares the loudness of the cognitive demons' shouts and selects the response of the loudest one as the recognized image.
For example, when recognizing the letter R, the image demon first encodes R and passes the information to the feature demons for further processing. Five feature demons then report, respectively, that the image contains one vertical line, two horizontal lines, one oblique line, three right angles, and one discontinuous curve. The cognitive demons then each judge, from the reported features and their counts, whether the image is the letter they are responsible for. The D, P, and R demons all respond, but for the P demon only 4 features match and one (the oblique line) does not; for the D demon only 3 features match and two (the oblique line and the right angles) do not. Only the R demon has all 5 features match, and these 5 features also cover all the features of R, so the R demon shouts loudest and the decision demon readily chooses R.
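The letter R example can be sketched as a small voting scheme. This is only an illustration: the feature counts for R come from the text, while the templates for P and D, the feature names, and the functions `shout` and `decide` are assumptions chosen so that P matches 4 features and D matches 3, as the text describes.

```python
# Feature counts reported by the feature demons for the input image
# (taken from the letter R example in the text).
OBSERVED = {"vertical": 1, "horizontal": 2, "oblique": 1,
            "right_angle": 3, "open_curve": 1}

# Hypothetical templates held by each cognitive demon. Only R's counts
# come from the text; P and D are assumed for illustration.
TEMPLATES = {
    "R": {"vertical": 1, "horizontal": 2, "oblique": 1,
          "right_angle": 3, "open_curve": 1},
    "P": {"vertical": 1, "horizontal": 2, "oblique": 0,
          "right_angle": 3, "open_curve": 1},
    "D": {"vertical": 1, "horizontal": 2, "oblique": 0,
          "right_angle": 2, "open_curve": 1},
}


def shout(template, observed):
    """Loudness of one cognitive demon: how many feature counts match."""
    return sum(1 for name, count in observed.items()
               if template.get(name) == count)


def decide(observed, templates=TEMPLATES):
    """Decision demon: pick the letter whose cognitive demon shouts loudest."""
    return max(templates, key=lambda letter: shout(templates[letter], observed))


scores = {letter: shout(t, OBSERVED) for letter, t in TEMPLATES.items()}
winner = decide(OBSERVED)
```

With these assumed templates, the R demon matches all five reported features, so the decision demon selects R.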
In addition, shape matching algorithms are another common image recognition technique. Shape is an important feature for target recognition and is a representation of the binary image of the target's extent. Shape representations usually fall into two classes: encoding approaches, such as chain codes, run-length codes, and Freeman codes; and simplification approaches, such as differencing, polynomials, polygonal approximation, and feature point detection. Targets of a given shape can be extracted from an image through feature computation. Many mature algorithms can already readily extract targets such as circles, squares, and triangles.
One example is a circle detection algorithm based on the windowed Hough transform. Its detection principle is: after a circular shape is detected, its radius is obtained and compared for similarity with the radius of the target circular shape.
Another example is an arbitrary-triangle detection algorithm based on the windowed Hough transform. Its detection principle is: select a window of appropriate size in the image; apply the Hough transform to the image within the window, taking the window center as the coordinate origin; detect straight line segments in the Hough domain of the image; slide the window; from the detected line segments, find combinations that satisfy the triangle condition; and then locate the triangles formed by those segments. By changing the length or angle conditions on the segments, special triangles such as right triangles, isosceles triangles, and equilateral triangles can also be detected.
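The text does not spell out the "triangle condition" or the length conditions for special triangles. A minimal sketch, assuming the condition on lengths is the triangle inequality and that the three segment lengths have already been measured, might look like the following; `is_triangle`, `classify_triangle`, and the tolerance are hypothetical, and the sketch ignores the Hough-transform stage and whether the segments actually meet at their endpoints.

```python
import math


def is_triangle(a, b, c):
    """Triangle inequality on three segment lengths."""
    x, y, z = sorted((a, b, c))
    return x > 0 and x + y > z


def classify_triangle(a, b, c, tol=1e-6):
    """Return a label for the triangle, or None if the lengths cannot form one."""
    if not is_triangle(a, b, c):
        return None
    x, y, z = sorted((a, b, c))
    if math.isclose(x, z, rel_tol=tol):
        return "equilateral"
    if math.isclose(x * x + y * y, z * z, rel_tol=tol):
        return "right"
    if math.isclose(x, y, rel_tol=tol) or math.isclose(y, z, rel_tol=tol):
        return "isosceles"
    return "general"
```

In practice the tolerance would need to be much looser, since segment lengths measured from an image are noisy.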
A further example is an algorithm for detecting whether a triangle is present in an image. This method detects triangular targets by using region filling together with the relationship between a triangle's three side lengths and its area.
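The text does not say which relationship between side lengths and area is used. One plausible reading, offered here only as an assumption, is Heron's formula: if the area of the filled region is close to the area implied by the three measured side lengths, the region is likely a triangle. The function names and the 5% tolerance below are hypothetical.

```python
import math


def heron_area(a, b, c):
    """Area of a triangle with side lengths a, b, c (Heron's formula)."""
    s = (a + b + c) / 2.0
    return math.sqrt(max(s * (s - a) * (s - b) * (s - c), 0.0))


def looks_like_triangle(filled_area, a, b, c, rel_tol=0.05):
    """Compare the filled region's area with the area implied by its sides."""
    expected = heron_area(a, b, c)
    return expected > 0 and abs(filled_area - expected) <= rel_tol * expected
```

For example, a filled region of about 6 pixels² bounded by sides of length 3, 4, and 5 passes the check, whereas a 3 by 4 rectangle (area 12) does not.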
Optionally, if the target multimedia includes multiple frames of images, an image recognition algorithm is used to identify, frame by frame, the target object contained in each of the frames.
Optionally, if the target multimedia contains identical target objects, any one of them is used as the target object.
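Frame-by-frame identification with duplicate objects collapsed can be sketched as follows. The recognizer is passed in as a callable because the text does not fix a particular algorithm; `identify_targets` and the toy frames are hypothetical stand-ins.

```python
from typing import Callable, Iterable, List


def identify_targets(frames: Iterable[object],
                     recognize: Callable[[object], List[str]]) -> List[str]:
    """Run the recognizer on every frame and keep each object only once."""
    seen = set()
    targets = []
    for frame in frames:
        for name in recognize(frame):
            if name not in seen:  # identical objects: keep the first one found
                seen.add(name)
                targets.append(name)
    return targets


# Toy stand-in: each "frame" is already a list of object names.
fake_frames = [["apple", "milk"], ["apple"], ["egg", "milk"]]
result = identify_targets(fake_frames, recognize=lambda frame: list(frame))
```

Keeping the first occurrence is one way to honor "any one of them"; any other choice among identical objects would satisfy the text equally well.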
S102, acquire the attribute value corresponding to the target object.
Specifically, the target object may be represented by an object identifier or an object address. The object identifier may be the shape or name of the object; the object address is the storage address of the target object on a server, such as a uniform resource locator (URL). The attribute value may include related information such as the object's calorie value, type, size, place of origin, and function.
In one feasible implementation, the multimedia processing device looks up the attribute value corresponding to the target object in a preset database according to the correspondence between preset objects and preset attribute values. In another feasible implementation, the multimedia processing device accesses the web page at the storage address of the target object and parses the page to extract the attribute value of the target object. In yet another feasible implementation, the multimedia processing device sends an attribute value lookup request for the target object to a network server and receives the lookup result returned by the network server.
Optionally, the lookup request may carry the object identifier of the target object, so that the network server looks up the attribute value corresponding to that identifier.
S103, generate indication information corresponding to the attribute value, and add the indication information to the target multimedia.
Specifically, after generating indication information that identifies the attribute value, the multimedia processing device adds the indication information to the target multimedia. The indication information may be a label or a list, and may be added at the position of the target object or at a preset position. If the same target multimedia contains multiple target objects, the generated indication information is added at the position of each corresponding object or at a preset position.
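A minimal sketch of step S103, assuming the indication information is a text label with pixel coordinates. The classes `Indication` and `Multimedia`, the function `add_indication`, and the default position `(0, 0)` are all hypothetical; the text does not define a data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

DEFAULT_POSITION = (0, 0)  # assumed preset position


@dataclass
class Indication:
    text: str
    position: Tuple[int, int]


@dataclass
class Multimedia:
    indications: List[Indication] = field(default_factory=list)


def add_indication(media: Multimedia, name: str, value: str,
                   object_position: Optional[Tuple[int, int]] = None) -> Indication:
    """Generate a label for the attribute value and attach it to the media.

    The label goes at the object's position when it is known, otherwise
    at the preset position.
    """
    position = object_position if object_position is not None else DEFAULT_POSITION
    label = Indication(text=f"{name}: {value}", position=position)
    media.indications.append(label)
    return label


media = Multimedia()
add_indication(media, "apple", "52 cal", object_position=(120, 80))
add_indication(media, "milk", "54 cal")  # falls back to the preset position
```

Calling `add_indication` once per target object covers the case where one piece of multimedia contains several objects.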
In the embodiments of the present invention, the multimedia processing device identifies a target object contained in target multimedia, acquires the attribute value corresponding to the target object, generates indication information corresponding to the attribute value, and adds the indication information to the target multimedia. In the prior art, the attribute information contained in multimedia content cannot be known, so only the multimedia itself can be processed. By contrast, the present invention can automatically collect the attribute information contained in multimedia content and add it to the multimedia, which enriches the forms of multimedia editing and increases its diversity.
Refer to Fig. 2, which is a schematic flowchart of another multimedia processing method provided by an embodiment of the present invention. As shown in Fig. 2, the method of this embodiment may include the following steps S201 to S206.
S201, identify the target object contained in the target multimedia.
Specifically, the target multimedia may be a captured picture or video, and may contain a background area and objects. For example, in picture A shown in Fig. 3, A1, A2, and A3 are the target objects contained in A, and the remainder is the background area.
In a specific implementation, the multimedia processing device may use image recognition technology to identify the target object. Image recognition refers to technology that uses a computer to process, analyze, and understand images in order to identify targets and objects of various patterns. In typical industrial use, an industrial camera captures a picture, and software then performs further recognition processing based on the grayscale differences in the picture. Representative image recognition software includes Cognex abroad and Figure Intelligent domestically.
One common image recognition technique is the "Pandemonium" recognition model, an image recognition system based on feature analysis. The Pandemonium model has four levels. The first level consists of "image demons", which perform the simplest task: they merely record the original image from the outside world, much as the retina captures an image of an external stimulus. "Feature demons" then analyze this image further; during analysis, each feature demon looks for the image features it is responsible for. For example, when recognizing English letters, each feature demon reports one kind of feature of the letter and its count, such as vertical lines, horizontal lines, oblique lines, right angles, acute angles, discontinuous curves, and continuous curves. Next, "cognitive demons" receive the responses of the feature demons. Each cognitive demon searches those responses for the features of the image it is responsible for recognizing and "shouts" when it finds them; the more features it finds, the louder it shouts. Finally, the "decision demon" compares the loudness of the cognitive demons' shouts and selects the response of the loudest one as the recognized image.
For example, when recognizing the letter R, the image demon first encodes R and passes the information to the feature demons for further processing. Five feature demons then report, respectively, that the image contains one vertical line, two horizontal lines, one oblique line, three right angles, and one discontinuous curve. The cognitive demons then each judge, from the reported features and their counts, whether the image is the letter they are responsible for. The D, P, and R demons all respond, but for the P demon only 4 features match and one (the oblique line) does not; for the D demon only 3 features match and two (the oblique line and the right angles) do not. Only the R demon has all 5 features match, and these 5 features also cover all the features of R, so the R demon shouts loudest and the decision demon readily chooses R.
In addition, shape matching algorithms are another common image recognition technique. Shape is an important feature for target recognition and is a representation of the binary image of the target's extent. Shape representations usually fall into two classes: encoding approaches, such as chain codes, run-length codes, and Freeman codes; and simplification approaches, such as differencing, polynomials, polygonal approximation, and feature point detection. Targets of a given shape can be extracted from an image through feature computation. Many mature algorithms can already readily extract targets such as circles, squares, and triangles.
One example is a circle detection algorithm based on the windowed Hough transform. Its detection principle is: after a circular shape is detected, its radius is obtained and compared for similarity with the radius of the target circular shape.
Another example is an arbitrary-triangle detection algorithm based on the windowed Hough transform. Its detection principle is: select a window of appropriate size in the image; apply the Hough transform to the image within the window, taking the window center as the coordinate origin; detect straight line segments in the Hough domain of the image; slide the window; from the detected line segments, find combinations that satisfy the triangle condition; and then locate the triangles formed by those segments. By changing the length or angle conditions on the segments, special triangles such as right triangles, isosceles triangles, and equilateral triangles can also be detected.
A further example is an algorithm for detecting whether a triangle is present in an image. This method detects triangular targets by using region filling together with the relationship between a triangle's three side lengths and its area.
Optionally, if the target multimedia includes multiple frames of images, an image recognition algorithm is used to identify, frame by frame, the target object contained in each of the frames.
S202, look up the attribute value corresponding to the target object in a preset database according to the correspondence between preset objects and preset attribute values.
Specifically, the preset object may be represented by an object identifier or an object address. The object identifier may be the shape or name of the object; the object address is the storage address of the target object, such as a uniform resource locator (URL).
Taking a caloric value as the attribute value, Table 1 shows the mapping between preset objects and preset caloric values. This mapping table is stored in the preset database, so that once the multimedia processing device has identified a target object, it can look up the corresponding caloric value in the table. For example, if the target object is an apple, the corresponding caloric value is 52 cal.
Table 1
Object         Caloric value (cal)
Cauliflower    24
Egg            144
Milk           54
Apple          52
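The lookup in step S202 can be sketched with an in-memory mapping standing in for the preset database. The values are those of Table 1; the dictionary name `PRESET_CALORIES`, the function `lookup_attribute`, and the case-insensitive matching are assumptions made for illustration.

```python
from typing import Optional

# In-memory stand-in for the preset database; values taken from Table 1.
PRESET_CALORIES = {
    "cauliflower": 24,
    "egg": 144,
    "milk": 54,
    "apple": 52,
}


def lookup_attribute(target: str) -> Optional[int]:
    """Return the preset caloric value for the object, or None if unknown."""
    return PRESET_CALORIES.get(target.strip().lower())
```

An object that is absent from the table yields `None`, which a caller could treat as "no indication information to add".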
S203, corresponding with property value configured information is generated, and it is more that the configured information is added into the target In media.
Specifically, after the configured information of multimedia processing apparatus generation identity property value, the configured information is added to mesh Mark in multimedia.The configured information can be label, or list, the configured information can be added into the target Position or default specified location where object.If including multiple destination objects in same destination multimedia, by generation Configured information be respectively added to corresponding to position where object or default position.
For example, such as Fig. 4 (a) show a kind of configured information display mode therein, Fig. 4 (b) is another configured information Display mode.
S204: Display the indication information in the target multimedia using a preset display mode.
Specifically, the multimedia processing apparatus displays the indication information using a preset display mode. The preset display mode is a custom setting of the multimedia processing apparatus and includes a preset display position and a preset display effect.
Optionally, the multimedia processing apparatus displays the pieces of indication information in a preset display area in chronological order of the time at which each was generated. Optionally, the preset display area may be superimposed on the multimedia display area, for example transparently, so that both the multimedia display area and the preset display area can be shown at maximum size; alternatively, the multimedia display area and the preset display area may occupy different positions of the display interface. No specific limitation is imposed here.
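The chronological display order described above amounts to sorting the indication information by generation time. A minimal sketch follows, assuming each piece of indication information carries a hypothetical `created_at` timestamp in seconds:

```python
from dataclasses import dataclass


@dataclass
class Indication:
    text: str          # e.g. "Apple: 52 cal"
    created_at: float  # generation time in seconds (hypothetical field)


def display_order(indications):
    """Return the indications sorted from earliest to latest generation time."""
    return sorted(indications, key=lambda item: item.created_at)


queue = [Indication("Milk: 54 cal", 12.5), Indication("Apple: 52 cal", 3.0)]
ordered = display_order(queue)
```

Because `sorted` is stable, pieces generated at the same instant keep their original relative order.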
S205: Receive an operation instruction for the indication information, the operation instruction including any one of an enlarge instruction, a shrink instruction, a modify instruction, and a delete instruction.
S206: Operate on the indication information according to the operation instruction.
Specifically, when a user operates on the indication information, the multimedia processing apparatus receives the operation instruction and performs the corresponding operation. This improves the user's ability to manipulate the indication information during multimedia editing.
For example, if the user wants to change how the indication information is displayed, the user performs a display-mode modification operation. Suppose the current display mode scrolls the indication information leftward from the lower right of the display interface to the position of the target object while gradually shrinking it; it can then be switched to a mode that first highlights the indication information in enlarged form for a preset time and then shrinks it to the position of the target object.
As another example, if the multimedia processing apparatus receives a delete instruction for the indication information, it deletes the corresponding indication information.
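Steps S205 and S206 can be sketched as a small dispatcher over the four instruction types. The instruction names, the `scale` field, and the default zoom factor are assumptions made for illustration; the text does not define how the instructions are encoded.

```python
from dataclasses import dataclass


@dataclass
class Label:
    text: str
    scale: float = 1.0  # hypothetical display scale of the label


def apply_instruction(labels, index, instruction, **params):
    """Apply one operation instruction to labels[index] and return the list."""
    label = labels[index]
    if instruction == "enlarge":
        label.scale *= params.get("factor", 1.25)
    elif instruction == "shrink":
        label.scale /= params.get("factor", 1.25)
    elif instruction == "modify":
        label.text = params["text"]
    elif instruction == "delete":
        del labels[index]
    else:
        raise ValueError(f"unsupported instruction: {instruction}")
    return labels


labels = [Label("Apple: 52 cal")]
apply_instruction(labels, 0, "enlarge", factor=2.0)
```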
In this embodiment of the present invention, the multimedia processing apparatus identifies the target object contained in the target multimedia, finds the attribute value corresponding to the target object in the preset database, generates indication information corresponding to the attribute value, and adds that indication information to the target multimedia. It displays the indication information using a preset display mode, and can also process the indication information according to operations input for it, such as addition, deletion, and modification. In the prior art, the various attribute information contained in multimedia content cannot be obtained, so only the multimedia itself can be processed. By contrast, the present invention can automatically collect the various attribute information contained in multimedia content and add it to the multimedia, which enriches the forms of multimedia editing and increases its diversity.
Referring to Fig. 5, a schematic structural diagram of a multimedia processing apparatus is provided according to an embodiment of the present invention. As shown in Fig. 5, the multimedia processing apparatus 1 of this embodiment may include an object identification module 11, an information acquisition module 12, and an information adding module 13.
The object identification module 11 is configured to identify the target object contained in the target multimedia.
The target multimedia may be a captured picture or video, and may contain a background area and objects. In picture A shown in Fig. 3, for example, A1, A2, and A3 are the target objects contained in A, and the remainder is the background area.
In a specific implementation, the object identification module may use image recognition technology to identify the target object. Image recognition refers to the technology of using a computer to process, analyze, and understand images in order to identify targets and objects of various patterns. In typical industrial use, an industrial camera captures a picture, and software then performs further recognition processing based on the gray-level differences in the picture. Representative image recognition software from abroad includes Cognex, and representative domestic software includes 图智能 ("figure intelligence").
One common image recognition technique is the "Pandemonium" recognition model, an image recognition system based on feature analysis. The model has four levels. The first level consists of "image demons", which perform the simplest task: they merely record the raw external image, just as the retina captures an image of an external stimulus. The "feature demons" then analyze this image further; during analysis, each feature demon looks for the image features that concern it. In recognizing English letters, for example, each feature demon is responsible for reporting one kind of feature of the letter and its count, such as vertical lines, horizontal lines, oblique lines, right angles, acute angles, discontinuous curves, and continuous curves. Next, the "cognitive demons" receive the responses of the feature demons. Each cognitive demon searches those responses for the features of the image it is responsible for recognizing, and "shouts" when it finds one; the more features it finds, the louder it shouts. Finally, the "decision demon" selects, based on the loudness of the shouts of the many cognitive demons, the response of the loudest one as the image to be recognized.
For example, in recognizing the letter R, the image demon first encodes R and passes the information to the feature demons for further processing. Five feature demons then report that the image contains one vertical line, two horizontal lines, one oblique line, three right angles, and one discontinuous curve, respectively. Each of the many cognitive demons then judges, from the reported features and their counts, whether the letter is the one it is responsible for. The D, P, and R demons all respond. However, only 4 features match the P demon, and one feature (the oblique line) does not; only 3 features match the D demon, and two features (the oblique line and the right angles) do not. Only the R demon has all 5 features matching, and these 5 features comprise all of R's features, so the R demon shouts loudest and the decision demon readily chooses R.
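The letter R example can be replayed in a few lines of code. The observed feature counts below are the ones reported in the text; the per-letter templates for P and D are illustrative assumptions chosen so that the number of matching features agrees with the text (4 for P, 3 for D, 5 for R), not values taken from the source.

```python
FEATURES = ("vertical", "horizontal", "oblique", "right_angle", "discontinuous_curve")

# What the feature demons report for the input image "R" (from the text).
observed_r = {"vertical": 1, "horizontal": 2, "oblique": 1,
              "right_angle": 3, "discontinuous_curve": 1}

# Feature counts each cognitive demon expects (assumed for illustration).
TEMPLATES = {
    "R": {"vertical": 1, "horizontal": 2, "oblique": 1,
          "right_angle": 3, "discontinuous_curve": 1},
    "P": {"vertical": 1, "horizontal": 2, "oblique": 0,
          "right_angle": 3, "discontinuous_curve": 1},
    "D": {"vertical": 1, "horizontal": 2, "oblique": 0,
          "right_angle": 2, "discontinuous_curve": 1},
}


def shout(template, observed):
    """Cognitive demon: count the features whose reported count matches the template."""
    return sum(1 for name in FEATURES if template[name] == observed[name])


def decide(observed, templates=TEMPLATES):
    """Decision demon: choose the letter whose cognitive demon shouts loudest."""
    shouts = {letter: shout(template, observed) for letter, template in templates.items()}
    return max(shouts, key=shouts.get), shouts


letter, shouts = decide(observed_r)
```

Here the loudness of a shout is simply the number of matching features; a fuller model could weight features differently, which the text does not discuss.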
In addition, shape matching is another common image recognition technique. Shape is an important feature for target recognition and is an expression of the binary image of the target's extent. Shape representations usually fall into two classes: coding schemes, such as chain codes, run-length codes, and Freeman codes; and simplification schemes, such as difference, polynomial, and polygon approximation and feature point detection. Targets of a given shape can be extracted from an image through feature computation, and many mature algorithms currently exist that readily extract circular, square, triangular, and other targets.
One example is a circle detection algorithm based on the windowed Hough transform. Its principle is as follows: after a circular shape is detected, its radius is obtained and compared for similarity with the radius of the target circular shape.
Another example is an arbitrary-triangle detection algorithm based on the windowed Hough transform. Its principle is as follows: select a window of appropriate size in the image; apply the Hough transform to the image inside the window, taking the window center as the coordinate origin; detect straight line segments in the Hough domain of the image; slide the window; find, among the detected segments, combinations of segments that satisfy the triangle condition; and then locate the triangles formed by those segments. By changing the length or angle conditions on the segments, special triangles such as right triangles, isosceles triangles, and equilateral triangles can also be detected.
A further example is an algorithm for detecting whether a triangle exists in an image. This method detects triangular targets by using region filling together with the relationship between the lengths of a triangle's three sides and its area.
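One plausible reading of the side-length and area relationship is Heron's formula: the area computed from three candidate side lengths should agree with the area of the filled region if that region is really a triangle. The sketch below works under that assumption; the function names, the way corner points are supplied, and the 5% tolerance are hypothetical and not taken from the text.

```python
import math


def heron_area(a, b, c):
    """Area of a triangle with side lengths a, b, c (Heron's formula)."""
    s = (a + b + c) / 2.0
    return math.sqrt(max(s * (s - a) * (s - b) * (s - c), 0.0))


def looks_like_triangle(corners, filled_area, tolerance=0.05):
    """Compare the filled region's area with the area implied by its three corners."""
    p, q, r = corners
    a, b, c = math.dist(p, q), math.dist(q, r), math.dist(r, p)
    expected = heron_area(a, b, c)
    if expected == 0.0:
        return False  # degenerate case: the three corners are collinear
    return abs(filled_area - expected) / expected <= tolerance


corners = [(0, 0), (4, 0), (0, 3)]
is_triangle = looks_like_triangle(corners, filled_area=6.0)
```

A region whose filled area is far from the Heron area, such as a rectangle spanning the same corners, is rejected by the same check.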
Optionally, the target multimedia includes multiple frames of images, in which case an image recognition algorithm is used to identify the target object contained in each frame.
Optionally, the target multimedia includes multiple frames of images;
The object identification module 11 is specifically configured to:
Identify, using an image recognition algorithm, the target object contained in each frame of the multiple frames of images.
The information acquisition module 12 is configured to obtain the attribute value corresponding to the target object.
Optionally, the information acquisition module 12 is specifically configured to:
Look up, according to the correspondence between preset objects and preset attribute values, the attribute value corresponding to the target object in a preset database.
Optionally, the information acquisition module is specifically configured to access the web page information corresponding to the storage address of the target object and parse that web page information to extract the attribute value of the target object. Optionally, the information acquisition module is specifically configured to send an attribute value lookup request for the target object to a network server and receive the lookup result returned by the network server.
The information adding module 13 is configured to generate indication information corresponding to the attribute value and add the indication information to the target multimedia.
Specifically, after generating indication information that identifies the attribute value, the information adding module adds it to the target multimedia. The indication information may be a label or a list, and may be added at the position of the target object. If the same target multimedia contains multiple target objects, each piece of generated indication information is added at the position of its corresponding object.
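The cooperation of modules 11, 12, and 13 can be sketched as three small classes. This is a structural illustration only: the class and method names are invented for the example, the recognizer is a stand-in for a real image recognition algorithm, and a plain dictionary plays the role of the preset database.

```python
class ObjectIdentificationModule:
    """Module 11: identifies the target objects in each frame."""

    def __init__(self, recognizer):
        # Hypothetical callable: frame -> [(name, (x, y)), ...]
        self.recognizer = recognizer

    def identify(self, frames):
        return [self.recognizer(frame) for frame in frames]


class InformationAcquisitionModule:
    """Module 12: obtains the attribute value for a target object."""

    def __init__(self, mapping):
        self.mapping = mapping  # preset object -> preset attribute value

    def acquire(self, name):
        return self.mapping.get(name)


class InformationAddingModule:
    """Module 13: generates indication information and attaches it per frame."""

    def add(self, detections, acquisition):
        annotated = []
        for frame_objects in detections:
            labels = []
            for name, position in frame_objects:
                value = acquisition.acquire(name)
                if value is not None:  # skip objects without a known attribute value
                    labels.append({"text": f"{name}: {value}", "position": position})
            annotated.append(labels)
        return annotated


def fake_recognizer(frame):
    # Stand-in recognizer: each "frame" is already a list of (name, position) pairs.
    return list(frame)


frames = [[("Apple", (10, 20))], [("Egg", (5, 5)), ("Spoon", (9, 9))]]
detections = ObjectIdentificationModule(fake_recognizer).identify(frames)
result = InformationAddingModule().add(
    detections, InformationAcquisitionModule({"Apple": 52, "Egg": 144})
)
```

Passing the recognizer in as a callable keeps the sketch independent of any particular image recognition library.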
Optionally, as shown in Fig. 6, the apparatus 1 further includes:
An information display module 14, configured to display the indication information in the target multimedia using a preset display mode, the preset display mode including a preset display position and a preset display effect.
Specifically, the information display module displays the indication information using a preset display mode. The preset display mode is a custom setting of the multimedia processing apparatus and includes a preset display position and a preset display effect.
Optionally, the information display module displays the pieces of indication information in a preset display area in chronological order of the time at which each was generated. Optionally, the preset display area may be superimposed on the multimedia display area, for example transparently, so that both the multimedia display area and the preset display area can be shown at maximum size; alternatively, the multimedia display area and the preset display area may occupy different positions of the display interface. No specific limitation is imposed here.
Optionally, as shown in Fig. 6, the apparatus 1 further includes:
An instruction receiving module 15, configured to receive an operation instruction for the indication information, the operation instruction including any one of an enlarge instruction, a shrink instruction, a modify instruction, and a delete instruction;
An operation execution module 16, configured to operate on the indication information according to the operation instruction.
Specifically, when a user operates on the indication information, the instruction receiving module receives the operation instruction and the operation execution module performs the corresponding operation. This improves the user's ability to manipulate the indication information during multimedia editing.
For example, if the user wants to change how the indication information is displayed, the user performs a display-mode modification operation. Suppose the current display mode scrolls the indication information leftward from the lower right of the display interface to the position of the target object while gradually shrinking it; it can then be switched to a mode that first highlights the indication information in enlarged form for a preset time and then shrinks it to the position of the target object.
As another example, if the instruction receiving module receives a delete instruction for the indication information, the operation execution module deletes the corresponding indication information.
In this embodiment of the present invention, the multimedia processing apparatus identifies the target object contained in the target multimedia, finds the attribute value corresponding to the target object in the preset database, generates indication information corresponding to the attribute value, and adds that indication information to the target multimedia. It displays the indication information using a preset display mode, and can also process the indication information according to operations input for it, such as addition, deletion, and modification. In the prior art, the various attribute information contained in multimedia content cannot be obtained, so only the multimedia itself can be processed. By contrast, the present invention can automatically collect the various attribute information contained in multimedia content and add it to the multimedia, which enriches the forms of multimedia editing and increases its diversity.
Referring to Fig. 7, a schematic structural diagram of an electronic device is provided according to an embodiment of the present invention. As shown in Fig. 7, the electronic device 1000 may include at least one processor 1001 (such as a CPU), at least one network interface 1004, a user interface 1003, a memory 1005, and at least one communication bus 1002. The communication bus 1002 is used to implement connection and communication between these components. The user interface 1003 may include a display and a keyboard, and optionally may also include a standard wired interface and a standard wireless interface. The network interface 1004 may optionally include a standard wired interface and a wireless interface (such as a Wi-Fi interface). The memory 1005 may be a high-speed RAM or a non-volatile memory, for example at least one disk memory. Optionally, the memory 1005 may also be at least one storage device located remotely from the aforementioned processor 1001. As shown in Fig. 7, the memory 1005, as a computer storage medium, may contain an operating system, a network communication module, a user interface module, and a multimedia processing application.
In the electronic device 1000 shown in Fig. 7, the user interface 1003 is mainly used to provide an input interface for the user, and the processor 1001 may be used to invoke the multimedia processing application stored in the memory 1005 and specifically perform the following operations:
Identify the target object contained in the target multimedia;
Obtain the attribute value corresponding to the target object;
Generate indication information corresponding to the attribute value, and add the indication information to the target multimedia.
In one embodiment, when obtaining the attribute value corresponding to the target object, the processor 1001 specifically performs the following step:
Look up, according to the correspondence between preset objects and preset attribute values, the attribute value corresponding to the target object in a preset database.
In one embodiment, after adding the indication information to the target multimedia, the processor 1001 further performs the following step:
Display the indication information in the target multimedia using a preset display mode, the preset display mode including a preset display position and a preset display effect.
In one embodiment, the target multimedia includes multiple frames of images;
When identifying the target object contained in the target multimedia, the processor 1001 specifically performs the following step:
Identify, using an image recognition algorithm, the target object contained in each frame of the multiple frames of images.
In one embodiment, the processor 1001 further performs the following steps:
Receive an operation instruction for the indication information, the operation instruction including any one of an enlarge instruction, a shrink instruction, a modify instruction, and a delete instruction;
Operate on the indication information according to the operation instruction.
An embodiment of the present invention further provides a computer storage medium (a non-transitory computer-readable storage medium). The computer storage medium stores a computer program comprising program instructions that, when executed by a computer, cause the computer to perform the method described in the foregoing embodiments. The computer may be part of the multimedia processing apparatus or electronic device mentioned above.
The above non-transitory computer-readable storage medium may use any combination of one or more computer-readable media. A computer-readable medium may be a computer-readable signal medium or a computer-readable storage medium. A computer-readable storage medium may be, for example but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (a non-exhaustive list) of computer-readable storage media include: an electrical connection having one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) or flash memory, an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. In this document, a computer-readable storage medium may be any tangible medium that contains or stores a program for use by or in connection with an instruction execution system, apparatus, or device.
A computer-readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, in which computer-readable program code is carried. Such a propagated data signal may take various forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A computer-readable signal medium may also be any computer-readable medium other than a computer-readable storage medium that can send, propagate, or transmit a program for use by or in connection with an instruction execution system, apparatus, or device.
Program code contained on a computer-readable medium may be transmitted using any appropriate medium, including but not limited to wireless, wire, optical cable, RF, and the like, or any suitable combination of the above.
Computer program code for carrying out the operations of the present application may be written in one or more programming languages or a combination thereof, including object-oriented programming languages such as Java, Smalltalk, and C++, as well as conventional procedural programming languages such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer or server. Where a remote computer is involved, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (for example, through the Internet using an Internet service provider).
An embodiment of the present application further provides a computer program product. When the instructions in the computer program product are executed by a processor, the multimedia processing method provided by the embodiment shown in Fig. 1 or Fig. 2 of the present application can be implemented.
From the above description of the embodiments, those skilled in the art will clearly understand that, for convenience and brevity of description, the division into the functional modules described above is used only as an example. In practical applications, the functions described above may be assigned to different functional modules as needed; that is, the internal structure of the apparatus may be divided into different functional modules to complete all or part of the functions described above. For the specific working processes of the systems, apparatuses, and units described above, reference may be made to the corresponding processes in the foregoing method embodiments, which are not repeated here.
In the several embodiments provided in the present application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into modules or units is only a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application, in essence, or the part that contributes to the prior art, or all or part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) or a processor to perform all or part of the steps of the methods described in the embodiments of the present application. The aforementioned storage medium includes various media that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
The above are merely specific embodiments of the present application, but the protection scope of the present application is not limited thereto. Any change or replacement that a person skilled in the art can readily conceive of within the technical scope disclosed in the present application shall fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (10)

1. A multimedia processing method, characterized in that the method comprises:
identifying a target object contained in target multimedia;
obtaining an attribute value corresponding to the target object;
generating indication information corresponding to the attribute value, and adding the indication information to the target multimedia.
2. The method according to claim 1, characterized in that obtaining the attribute value corresponding to the target object comprises:
looking up, according to a correspondence between preset objects and preset attribute values, the attribute value corresponding to the target object in a preset database.
3. The method according to claim 1, characterized in that, after adding the indication information to the target multimedia, the method further comprises:
displaying the indication information in the target multimedia using a preset display mode, the preset display mode including a preset display position and a preset display effect.
4. The method according to claim 1, characterized in that the target multimedia includes multiple frames of images;
identifying the target object contained in the target multimedia comprises:
identifying, using an image recognition algorithm, the target object contained in each frame of the multiple frames of images.
5. The method according to claim 1, characterized in that the method further comprises:
receiving an operation instruction for the indication information, the operation instruction including any one of an enlarge instruction, a shrink instruction, a modify instruction, and a delete instruction;
operating on the indication information according to the operation instruction.
6. A multimedia processing apparatus, characterized in that the apparatus comprises:
an object identification module, configured to identify a target object contained in target multimedia;
an information acquisition module, configured to obtain an attribute value corresponding to the target object;
an information adding module, configured to generate indication information corresponding to the attribute value and add the indication information to the target multimedia.
7. The apparatus according to claim 6, characterized in that the information acquisition module is specifically configured to:
look up, according to a correspondence between preset objects and preset attribute values, the attribute value corresponding to the target object in a preset database.
8. The apparatus according to claim 6, characterized in that the apparatus further comprises:
an information display module, configured to display the indication information in the target multimedia using a preset display mode, the preset display mode including a preset display position and a preset display effect.
9. A computer storage medium, characterized in that the computer storage medium stores a plurality of instructions, the instructions being adapted to be loaded by a processor to perform the method according to any one of claims 1 to 5.
10. An electronic device, characterized by comprising a processor and a memory, wherein the memory stores a computer program, and the processor, when executing the computer program, implements the method according to any one of claims 1 to 5.
CN201710525763.1A 2017-06-30 2017-06-30 Multimedia processing method and device, electronic equipment and storage medium Pending CN107341139A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710525763.1A CN107341139A (en) 2017-06-30 2017-06-30 Multimedia processing method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN107341139A true CN107341139A (en) 2017-11-10

Family

ID=60218279

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710525763.1A Pending CN107341139A (en) 2017-06-30 2017-06-30 Multimedia processing method and device, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN107341139A (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105096180A (en) * 2015-07-20 2015-11-25 北京易讯理想科技有限公司 Commodity information display method and apparatus based augmented reality
CN105117463A (en) * 2015-08-24 2015-12-02 北京旷视科技有限公司 Information processing method and information processing device
US20170004386A1 (en) * 2015-07-02 2017-01-05 Agt International Gmbh Multi-camera vehicle identification system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
勒普顿 等: "《图形设计新元素》", 30 September 2009 *
秦亚军 等: "《FLASH动画设计项目实践》", 31 August 2015, 西南交通大学出版社 *

Cited By (9)

Publication number Priority date Publication date Assignee Title
CN108619717A (en) * 2018-03-21 2018-10-09 腾讯科技(深圳)有限公司 Method and apparatus for determining an operation object, storage medium and electronic device
CN109391849A (en) * 2018-09-30 2019-02-26 联想(北京)有限公司 Processing method and system, multi-media output device and memory
CN109391849B (en) * 2018-09-30 2020-11-20 联想(北京)有限公司 Processing method and system, multimedia output device and memory
CN109886258A (en) * 2019-02-19 2019-06-14 新华网(北京)科技有限公司 Method, apparatus and electronic equipment for providing related information of multimedia messages
CN110287345A (en) * 2019-06-28 2019-09-27 北京金山安全软件有限公司 Method and device for detecting image source, electronic equipment and storage medium
CN110287345B (en) * 2019-06-28 2023-08-15 北京乐蜜科技有限责任公司 Image source detection method and device, electronic equipment and storage medium
CN112528915A (en) * 2020-12-18 2021-03-19 北京华如科技股份有限公司 Intelligent plotting method based on 'pan magic' recognition model and storage medium thereof
CN112597648A (en) * 2020-12-18 2021-04-02 北京华如科技股份有限公司 Simulation scenario generation method based on 'pan magic' recognition model and storage medium
CN112597648B (en) * 2020-12-18 2023-09-22 北京华如科技股份有限公司 Simulation design generation method based on 'general magic' recognition model and storage medium

Similar Documents

Publication Publication Date Title
CN107341139A (en) Multimedia processing method and device, electronic equipment and storage medium
CN102687140B (en) Method and apparatus for facilitating content-based image retrieval
Rukhovich et al. Iterdet: iterative scheme for object detection in crowded environments
JP6240199B2 (en) Method and apparatus for identifying object in image
CN107633066A (en) Information display method and device, electronic equipment and storage medium
CN112464814A (en) Video processing method and device, electronic equipment and storage medium
US9313444B2 (en) Relational display of images
US10963700B2 (en) Character recognition
US20130179436A1 (en) Display apparatus, remote control apparatus, and searching methods thereof
CN111259751A (en) Video-based human behavior recognition method, device, equipment and storage medium
CN111541943B (en) Video processing method, video operation method, device, storage medium and equipment
US9851873B2 (en) Electronic album creating apparatus and method of producing electronic album
JP6787831B2 (en) Target detection device, detection model generation device, program and method that can be learned by search results
CN111309200B (en) Method, device, equipment and storage medium for determining extended reading content
CN111738263A (en) Target detection method and device, electronic equipment and storage medium
US20180276471A1 (en) Information processing device calculating statistical information
US8498978B2 (en) Slideshow video file detection
CN110929057A (en) Image processing method, device and system, storage medium and electronic device
CN107451194A (en) Image searching method and device
US11068121B2 (en) System and method for visual exploration of subnetwork patterns in two-mode networks
US20150026013A1 (en) System and methods for cognitive visual product search
US11048713B2 (en) System and method for visual exploration of search results in two-mode networks
CN111291756A (en) Method and device for detecting text area in image, computer equipment and computer storage medium
CN111539390A (en) Small target image identification method, equipment and system based on Yolov3
CN111506754A (en) Picture retrieval method and device, storage medium and processor

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20171110)