CN109409423A - A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing - Google Patents

A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing Download PDF

Info

Publication number
CN109409423A
CN109409423A CN201811196560.3A CN201811196560A CN109409423A CN 109409423 A CN109409423 A CN 109409423A CN 201811196560 A CN201811196560 A CN 201811196560A CN 109409423 A CN109409423 A CN 109409423A
Authority
CN
China
Prior art keywords
plot
target image
image
information
image set
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811196560.3A
Other languages
Chinese (zh)
Inventor
林依柳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gree Electric Appliances Inc of Zhuhai
Original Assignee
Gree Electric Appliances Inc of Zhuhai
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gree Electric Appliances Inc of Zhuhai filed Critical Gree Electric Appliances Inc of Zhuhai
Priority to CN201811196560.3A priority Critical patent/CN109409423A/en
Publication of CN109409423A publication Critical patent/CN109409423A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a kind of image-recognizing method, device, terminal and readable storage medium storing program for executing, this method comprises: obtaining target image set to be identified;The target image set is input in the image recognition model that training is completed in advance, is based on described image identification model, determines at least one corresponding plot of the target image set;Export at least one described plot, the image recognition model completed in the present invention based on preparatory training, it can determine at least one corresponding plot of target image set to be identified and export, realize and help user's especially special population quick and precisely to obtain the meaning or content wanting to convey in image.

Description

A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing
Technical field
The present invention relates to computer vision recognition technology field more particularly to a kind of image-recognizing method, device, terminal and Readable storage medium storing program for executing.
Background technique
With the development of artificial intelligence, image recognition rate is higher and higher, and public security has passed through Identification of Images and arrested multiple criminals Guilty object greatly improves case-solving rate, and portrait recognition technology is more and more mature, and brings great convenience for society Property.Computer Vision Recognition is one core of artificial intelligence field, is constantly subjected to the extensive of academia and industrial circle for a long time Concern.The positioning or identification of object, object detection, vision object detection all have evolved to the height of a relative maturity.
Deep learning refer to machine the mechanism of human brain simulated by deep neural network come learn, judge, decision, and it is deep Degree study is applied to many fields, identification, recognition of face, person's handwriting identification, automatic Pilot including voice etc., and shows There is the realization for the image recognition for applying computer vision recognition technology in technology to be mainly based upon deep learning also to complete.Example If AlphaGo study go is also same principle, a large amount of go case data is inputted to machine by the mankind, machine uses Depth neural network handles image, finally carries out drawing inferences about other cases from one instance for go, makes accurate judgment.
Although the prior art can be realized the identification of portrait in image or object etc., but people can not according to image The meaning or content for wanting to convey in image are rapidly and accurately captured, particularly with certain special populations, such as group visually impaired is deposited In the poor crowd etc. of the crowd of Dyslexia or understandability, ability to express, can rapidly and accurately learn in image The meaning for wanting to convey just is particularly important.
Summary of the invention
The present invention provides a kind of image-recognizing method, device, terminal and readable storage medium storing program for executing, to solve the prior art Middle user cannot rapidly and accurately capture the problem of meaning or content conveyed are wanted in image according to image.
The present invention provides image-recognizing methods, are applied to terminal, this method comprises:
Obtain target image set to be identified;
The target image set is input in the image recognition model that training is completed in advance, is identified based on described image Model determines at least one corresponding plot of the target image set;
Export at least one described plot.
Further, it is described be based on described image identification model, determine the target image set it is corresponding at least one Plot includes:
Based on described image identification model, the fisrt feature information that target image includes in the target image set is determined With second feature information, wherein the fisrt feature information includes shooting time information and shooting location information;
For each target image in the target image set, judge in the target image set with the presence or absence of with Remaining target image of the fisrt feature information association of the target image;If so, according to the sequencing of shooting time, according to The target image and shooting location information and second feature information in remaining described target image, generate a plot; If not, generating a plot according to the shooting location information and second feature information of the target image.
Further, the second feature information comprises at least one of the following: person characteristic information, scenery characteristic information, Object features information, animal character information, seasonal characteristic information and weather characteristics information.
Further, at least one plot described in the output includes:
For each plot at least one described plot, by the plot and the story feelings are generated The corresponding output of the target image of section.
Further, at least one plot described in the output includes:
At least one described plot is exported in a manner of text segment;And/or
At least one described plot is exported in a manner of voice broadcast.
Further, if exporting at least one described plot in a manner of text segment, described in the output extremely After a few plot, the method also includes:
Judge whether to receive user to the edit requests of text segment, wherein carrying in the edit requests to be edited Story in the first text set and edited second text set;
If so, the first text set, which is updated to second character set, merges display.
Further, the training process of described image identification model includes:
For each sample image set in training set, the every of the corresponding handmarking of each sample image set is obtained The corresponding plot of story of the mark and the mark of the story that sample image belongs in a sample image set;
According in the corresponding sample image set of the sample image set, the sample image set got Sample image ownership story mark and the mark the corresponding plot of story, to described image identification model into Row training.
The present invention provides a kind of pattern recognition devices, are applied to terminal, which includes:
Module is obtained, for obtaining target image set to be identified;
Determining module, for the target image set to be input in the image recognition model that training is completed in advance, base In described image identification model, at least one corresponding plot of the target image set is determined;
Output module, for exporting at least one described plot.
Further, the determining module is specifically used for being based on described image identification model, determines the target image set The fisrt feature information and second feature information that target image includes in conjunction, wherein the fisrt feature information includes shooting time Information and shooting location information;For each target image in the target image set, the target image set is judged In with the presence or absence of remaining target image with the fisrt feature information association of the target image;If so, according to shooting time Sequencing is generated according to the shooting location information and second feature information in the target image and remaining described target image One plot;If not, generating a story feelings according to the shooting location information and second feature information of the target image Section.
Further, the second feature information comprises at least one of the following: person characteristic information, scenery characteristic information, Object features information, animal character information, seasonal characteristic information and weather characteristics information.
Further, the output module, specifically for for each story feelings at least one described plot The plot and the target image for generating the plot are corresponded to output by section.
Further, the output module, specifically for exporting at least one described story feelings in a manner of text segment Section;And/or at least one described plot is exported in a manner of voice broadcast.
Further, the output module, if being also used to export at least one described story in a manner of text segment Plot judges whether to receive user to the edit requests of text segment, wherein carrying in the edit requests to be edited The first text set and edited second text set in story;If so, the first text set is updated to institute It states the second character set and merges display.
Further, described device further include:
Training module, for it is corresponding to obtain each sample image set for each sample image set in training set Handmarking each sample image set in sample image ownership story mark and the mark story it is corresponding Plot;According to the corresponding sample image set of the sample image set, the sample image set got The corresponding plot of story of the mark and the mark of the story of middle sample image ownership, to described image identification model It is trained.
The present invention provides a kind of terminals, comprising: processor, communication interface, memory and communication bus, wherein processing Device, communication interface, memory complete mutual communication by communication bus;
It is stored with computer program in the memory, when described program is executed by the processor, so that the place Manage the step of device executes any of the above-described the method.
The present invention provides a kind of computer readable storage medium, it is stored with the computer program that can be executed by terminal, When described program is run on the terminal, so that the step of terminal executes any of the above-described the method.
The present invention provides a kind of image-recognizing method, device, terminal and readable storage medium storing program for executing, this method comprises: obtaining Target image set to be identified;The target image set is input in the image recognition model that training is completed in advance, base In described image identification model, at least one corresponding plot of the target image set is determined;Output described at least one A plot.The image recognition model completed in the present invention based on preparatory training, can determine target image set to be identified It closes at least one corresponding plot and exports, realize and user's especially special population is helped quick and precisely to obtain image In want convey the meaning or content.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.
Fig. 1 is a kind of schematic diagram for image recognition processes that the embodiment of the present invention 1 provides;
Fig. 2 is a kind of schematic diagram for image recognition process that the embodiment of the present invention 5 provides;
Fig. 3 is a kind of structural schematic diagram for terminal that the embodiment of the present invention 7 provides;
Fig. 4 is a kind of pattern recognition device schematic diagram provided in an embodiment of the present invention.
Specific embodiment
The meaning or content for wanting to convey in image to help user quick and precisely to obtain, the embodiment of the invention provides A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing.
Image-recognizing method provided in an embodiment of the present invention, can be applied in terminal, which includes hardware layer, operation Operating system layer on hardware layer, and the application layer of operation on an operating system.
The hardware layer include central processing unit (CPU, Central Processing Unit), memory management unit (MMU, Memory Management Unit) and the hardware such as memory.
The operating system can be any one or more computer behaviour that image recognition is realized by process (Process) Make system, for example, (SuSE) Linux OS, Unix operating system, Android operation system, iOS operating system or windows behaviour Make system etc..
And the terminal can be the handheld devices such as smart phone, tablet computer in embodiments of the present invention, be also possible to The terminal devices such as desktop computer, portable computer, server in the embodiment of the present invention and are not particularly limited, as long as can There is the program of the code of the image-recognizing method in the embodiment of the present invention by log, realizes image recognition.
The executing subject of image recognition in the embodiment of the present invention can be being capable of caller in terminal or terminal And execute the functional module of program.
To make the objectives, technical solutions, and advantages of the present invention clearer, make below in conjunction with the attached drawing present invention into one Step ground detailed description, it is clear that described embodiment is only a part of the embodiments of the present invention, rather than whole implementation Example.Based on the embodiments of the present invention, obtained by those of ordinary skill in the art without making creative efforts Every other embodiment, shall fall within the protection scope of the present invention.
Embodiment 1:
Fig. 1 be a kind of schematic diagram of image recognition processes provided in an embodiment of the present invention, the process the following steps are included:
S101: target image set to be identified is obtained.
Image-recognizing method provided in an embodiment of the present invention is applied to terminal, and the terminal is available to arrive target to be identified Image collection.
The process that the terminal obtains target image set to be identified can be terminal acquisition image, by collected figure As the target image being determined as in target image set, the target image set that the terminal is got in other equipment can be It closes, the target image set that the target image set in the other equipment can be constituted for the other equipment acquired image, It can be the target image set etc. pre-saved in the other equipment.
It may include one or more target image in target image set to be identified.
Terminal obtains after obtaining target image set to be identified and can be the image recognition request for receiving user, can be with It is acquisition when receiving the target image set of other equipment transmission, is also possible to obtain at set time intervals, such as Fruit is to obtain at set time intervals, then terminal preserves the time interval of the setting, and the time interval of the setting is any, Such as can be 5 seconds, 10 minutes, 30 minutes etc., the time interval of the setting can be developer's setting, can be by user According to the demand of itself and habit setting etc..
S102: the target image set is input in the image recognition model that training is completed in advance, is based on the figure As identification model, at least one corresponding plot of the target image set is determined.
The mark of the story by belonging to according to sample image in sample image set, each sample image set is saved in terminal Know and image recognition model that the story corresponding plot training of the mark is completed, therefore according to target image set, Based on image recognition model, the corresponding at least one plot of the target image set is determined.
The image recognition model that the training is completed can determine image recognition model for sample data under big data analysis.
The target image for including at least one corresponding plot of the target image set and the target image set It is related.
S103: at least one described plot is exported.
It, can be defeated by least one plot after terminal determines at least one plot based on image recognition model Out, user is helped quick and precisely to understand the meaning or content that the target image in target image set is intended by.
When terminal exports at least one plot, it can be and exported in a manner of text or voice etc., which can To be to be selected by user according to the use demand of itself or habit, it can be and preserve the preset way of output of default in terminal.
The image-recognizing method of the embodiment of the present invention can be applied to several scenes, is such as directed to group visually impaired, there is reading The crowd of obstacle can be exported by way of voice, allow it will also be appreciated that the meaning in picture;For understandability, The poor crowd of ability to express can solve the problem that it is difficult that it is started writing, realize checking for the leading meaning of image;It can certainly It improves efficiency, user directly can be turned over image when sending out wechat circle of friends, QQ having a talk about or microblogging by the image-recognizing method It is translated into text, the narration of story can be carried out with intelligent recognition plot, without browsing multiple images, that is, may know that image In the content that is intended by.
The image recognition model completed in the embodiment of the present invention based on preparatory training, can determine target image to be identified Gather at least one corresponding plot and export, realizes and user's especially special population is helped quick and precisely to obtain figure The meaning or content conveyed are wanted as in.
Embodiment 2:
On the basis of the above embodiments, in the embodiment of the present invention, it is described be based on described image identification model, determine described in At least one corresponding plot of target image set includes:
Based on described image identification model, the fisrt feature information that target image includes in the target image set is determined With second feature information, wherein the fisrt feature information includes shooting time information and shooting location information;
For each target image in the target image set, judge in the target image set with the presence or absence of with Remaining target image of the fisrt feature information association of the target image;If so, according to the sequencing of shooting time, according to The target image and shooting location information and second feature information in remaining described target image, generate a plot; If not, generating a plot according to the shooting location information and second feature information of the target image.
The second feature information comprises at least one of the following: person characteristic information, scenery characteristic information, object features letter Breath, animal character information, seasonal characteristic information and weather characteristics information.
In order to realize the identification of image, terminal can be based on image recognition model, determine target image set to be identified At least one corresponding plot.
The image recognition model can determine the fisrt feature information and second that target image includes in target image set Characteristic information.
The fisrt feature information includes shooting time information and shooting location information, when which is specific Between information, the shooting location information is available to the better addresses information such as country, province, city be at county level.
Usual shooting time information and shooting location information carry in the picture, therefore can directly extract the in the picture One characteristic information.
The second feature information includes person characteristic information, scenery characteristic information, object features information, animal character letter At least one of breath, seasonal characteristic information and weather characteristics information.The person characteristic information can pass through intelligent measurement personage Role analyzes, specific available to information such as number, gender, crowd, clothing, expression or titles, is described by correlation The description of word, user can also be manually entered person names in advance to realize the subsequent identification of person characteristic information, crowd tool Body can include adult, old age, juvenile, baby etc. for age-colony.The object features information can be used for distinguishing scene, foundation The object of shooting, analyzes the daily object and which place appeared in carries out analysis determination and be currently under which scene, specifically Including indoor and outdoor, parlor, room, office, meeting room, school, hospital, highway, open air, rock-climbing place etc..The animal is special Reference ceases available kind, title etc. to animal.The complete of the plot generated may be implemented in the seasonal characteristic information Property.The weather characteristics information specifically includes fine day, the cloudy day, the rainy day, snows, the description of the associated weathers such as sandstorm.
Usual second feature information will not directly carry in the picture, it is therefore desirable to carry out image analysis in the picture and determine Second feature information.
Terminal determines the fisrt feature information and second feature information of target image based on image recognition model, may be implemented The image of shooting is subjected to intelligent measurement identification, including identification personage, scenery, object, animal, season, weather, time, place Etc., the layout of text is carried out by deep learning, that is, image recognition model, is organized into and is met the image fact and have story logic Plot.
In order to realize that the logicality and integrality of plot, terminal can be for the target images in target image set Whether belong to same story to be determined, specific terminal is directed to each target image, judges whether deposit in target image set In remaining target image of the fisrt feature information association with the target image, when fisrt feature information association can refer to shooting Between information and shooting location information it is identical, can refer to that shooting location information is identical, shooting time is continuous, if it is shooting time Continuously, then the time difference between adjacent shooting time is no more than the time difference threshold value of setting, the time difference threshold value can be by Developer's setting can be and be set by user according to the demand of itself.
If in target image set only including a target image, in target image set necessarily there is no with this Remaining target image of the fisrt feature information association of one target image;If schemed in target image set including at least two Picture is then directed to each target image, in target image set there may be, may there is no the fisrt feature with the target image Remaining target image of information association.
Further, in order to improve the accuracy that plot determines, the whether associated base of fisrt feature information is being determined On plinth, can also judging second feature information, whether there is also associations, and when simultaneously, fisrt feature information and second feature information are deposited It determines there is association in association, when judging second feature information with the presence or absence of association, can be according to recognition of face, clothing Dress ornament and ambient enviroment carry out analysis identification, such as whether the dress for judging that personage wears in the target image can be with it The clothing of personage decorates matching in remaining target image, that is, judges whether clothes attachment matches;Judge that personage is in the target image It is no to be matched with personage in remaining target image, that is, judge whether the face recognized matches;Judge around in the target image Environment, that is, object features information whether can with whether match in the ambient enviroment in remaining target image, i.e. judgement recognizes Whether object features information matches.Remaining target image, which can be, in this step is different from the mesh in target image set Remaining target image of logo image can be remaining target image with the fisrt feature information association of the target image.
For each target image, if there is the fisrt feature information association with the target image in target image set Remaining target image, then a plot can be generated according to the target image and remaining target image, in order to guarantee The continuity of plot generates plot according to the sequencing of shooting time in the embodiment of the present invention, in order to guarantee therefore The logicality of thing section, according to the shooting location information and second of the target image and remaining target image in the embodiment of the present invention Characteristic information generates plot, i.e., according to the shooting location information of the target image and second feature information and remaining mesh The shooting location information and second feature information of logo image generate plot.
For each target image, if there is no close with the fisrt feature information of the target image in target image set Remaining target image of connection then directly can generate a plot according to the target image, in order to guarantee story request Logicality generates plot according to the shooting location information and second feature information of the target image in the embodiment of the present invention.
In order to realize generation plot accuracy, on the basis according to shooting location information and second feature information On, plot can also be generated according to shooting time information, i.e., when there are remaining associated target image, when according to shooting Between sequencing, according to shooting location information, shooting time information and the second feature of target image and remaining target image Information generates a story request;When not depositing remaining associated target image, according to the shooting location information of target image, clap It takes the photograph temporal information and second feature information generates the request of the first story.
The process for generating story request is completed based on image recognition model, therefore is not illustrated herein.
Plot can be generated according to the target image in target image set in the embodiment of the present invention, realize image It identifies, and the logicality and continuity of the plot generated.
Embodiment 3:
On the basis of the various embodiments described above, in the embodiment of the present invention, at least one plot packet described in the output It includes:
For each plot at least one described plot, by the plot and the story feelings are generated The corresponding output of the target image of section.
Since plot is generated according to the target image in target image set, in order to obtain user more accurately Know the plot that target image is intended by, therefore can be defeated simultaneously by the target image and plot that generate plot Out.
Terminal generates plot according to the target image in target image set, and terminal can be according to the story feelings of generation Section determines the target image for generating the plot, therefore terminal is directed at least one corresponding plot of target image set In each plot, determine the target image of the generation plot, and then by the plot and generate the event The corresponding output of the target image of thing section.
Specifically in output, in plot and the identical mark of target image elder generation of the plot can be generated It is marked, can be in output, plot and the target image for generating the plot are continuously exported, with label The plot is by the target image generation etc. for being contiguous therewith output, and there are many modes of corresponding output, in the embodiment of the present invention In be not listed one by one.
By the corresponding output of the target image of plot and generation plot in the embodiment of the present invention, it is able to use family It can accurately know the plot that target image is intended by.
Embodiment 4:
On the basis of the various embodiments described above, in the embodiment of the present invention, at least one plot packet described in the output It includes:
At least one described plot is exported in a manner of text segment;And/or
At least one described plot is exported in a manner of voice broadcast.
In order to facilitate the use of different user group, a variety of output sides of plot are provided in the embodiment of the present invention Formula.
Specific terminal can be and be exported in a manner of text segment and/or with language when exporting at least one plot The mode of sound casting exports.
For disturbance people or reading, there are the crowds of obstacle, can preferentially select the way of output of voice broadcast.
For in the public situation either poor crowd of understandability ability to express, can preferential text segment it is defeated Mode out.
Certain user also can choose to be exported in a manner of voice broadcast while being exported in a manner of text segment.
Terminal shows that text and the process of casting voice belong to the prior art, does not repeat them here in embodiments of the present invention.
A variety of way of outputs of plot are provided in the embodiment of the present invention, facilitate the use of different groups, are improved The popularity and diversity of image-recognizing method user group.
Embodiment 5:
On the basis of the various embodiments described above, in the embodiment of the present invention, if described in being exported in a manner of text segment extremely A few plot, after at least one plot described in the output, the method also includes:
Judge whether to receive user to the edit requests of text segment, wherein carrying in the edit requests to be edited Story in the first text set and edited second text set;
If so, the first text set, which is updated to second character set, merges display.
In order to improve the flexibility of image-recognizing method, the embodiment of the present invention also provides editting function for user, for Family writes text and provides a text template, solves the problems, such as hardly possible of starting writing, user can modify or mend on this basis It fills and is more suitable for the open of thinking.
When terminal exports corresponding at least one plot of target image set in a manner of text segment, if with When at least one plot that terminal exports is edited at family, user can be edited the text segment that terminal is shown Operation, terminal can receive user to the edit requests of text segment at this time, therefore terminal can be by judging whether to receive User determines whether user edits the text segment of display to the edit requests of text segment.
If the terminal determine that receiving user to the edit requests of text segment, terminal can be identified in edit requests and be carried Story to be edited in the first text set and edited second text set, which is updated to Two text set, and the corresponding text segment of updated plot is shown.
If user wants to modify, which can be whole texts or part text in text segment Word;If user wants to supplement, which can be whole texts in text segment or can see Work is blank character.
The various embodiments described above are illustrated with a specific embodiment below, as shown in Fig. 2, the terminal has shooting Function, the terminal shoot photo, and user chooses a photo or multiple pictures in the photo of shooting, and the photo that user chooses can To regard the target image in the target image set with identification as, terminal is based on image recognition model, turns according to picture recognition It is melted into explanatory note, that is, determines at least one corresponding plot in target image set, it is automatic to export related verbal description, At least one plot is exported in the form of text segment, exporting in the form of text segment can edit in order to user Or supplement.After user compiles function to text segment, the text segment after editing successfully can be used for carrying out correlation analysis or multiple System is pasted, and is convenient for the user to use, is shared.
The terminal of that embodiment of the invention provides the editting function of user, improves the spirit of image recognition and plot output Activity.
Embodiment 6:
On the basis of the various embodiments described above, in the embodiment of the present invention, the training process of described image identification model includes:
For each sample image set in training set, the every of the corresponding handmarking of each sample image set is obtained The corresponding plot of story of the mark and the mark of the story that sample image belongs in a sample image set;
According in the corresponding sample image set of the sample image set, the sample image set got Sample image ownership story mark and the mark the corresponding plot of story, to described image identification model into Row training.
Image recognition model provided in an embodiment of the present invention can be the model based on deep learning training completion.
It specifically, include a large amount of sample image set in training set, the sample image collection that training set includes, which is combined into, to be used for The sample for carrying out model training includes one in each sample image set for each sample image set in training set Or multiple sample images, terminal can obtain sample in each sample image set of the corresponding handmarking of sample image set The mark of the story of image ownership, and the corresponding plot of story of mark.
Sample image belongs in the corresponding sample image set of sample image set, sample image set that will acquire Story mark, and the corresponding plot of story of mark, are trained image recognition model.
The process that model is trained can be realized using the prior art according to sample data, in embodiments of the present invention It does not repeat them here.
The embodiment of the present invention ensure that in image recognition by being trained to image recognition model, complete according to training At image recognition model can be realized image is accurately identified.
Embodiment 7:
On the basis of the various embodiments described above, the embodiment of the invention also provides a kind of terminals 300, as shown in figure 3, packet It includes: processor 301, communication interface 302, memory 303 and communication bus 304, wherein processor 301, communication interface 302 are deposited Reservoir 303 completes mutual communication by communication bus 304;
It is stored with computer program in the memory 303, when described program is executed by the processor 301, so that The processor 301 executes following steps:
Obtain target image set to be identified;
The target image set is input in the image recognition model that training is completed in advance, is identified based on described image Model determines at least one corresponding plot of the target image set;
Export at least one described plot.
Image-recognizing method provided in an embodiment of the present invention is applied to terminal.
The communication bus that above-mentioned terminal is mentioned can be Peripheral Component Interconnect standard (Peripheral Component Interconnect, PCI) bus or expanding the industrial standard structure (Extended Industry Standard Architecture, EISA) bus etc..The communication bus can be divided into address bus, data/address bus, control bus etc..For just It is only indicated with a thick line in expression, figure, it is not intended that an only bus or a type of bus.
Communication interface 302 is for the communication between above-mentioned terminal and other equipment.
Memory may include random access memory (Random Access Memory, RAM), also may include non-easy The property lost memory (Non-Volatile Memory, NVM), for example, at least a magnetic disk storage.Optionally, memory may be used also To be storage device that at least one is located remotely from aforementioned processor.
Above-mentioned processor can be general processor, including central processing unit, network processing unit (Network Processor, NP) etc.;It can also be digital command processor (Digital Signal Processing, DSP), dedicated collection At circuit, field programmable gate array or other programmable logic device, discrete gate or transistor logic, discrete hard Part component etc..
In embodiments of the present invention, it when processor executes the program stored on memory, realizes and is based on having trained in advance At image recognition model, can determine at least one corresponding plot of target image set to be identified and export, it is real Show and has helped user's especially special population quick and precisely to obtain the meaning or content wanting to convey in image.
Embodiment 8:
On the basis of the various embodiments described above, the embodiment of the invention also provides a kind of computers to store readable storage medium Matter is stored with the computer program that can be executed by terminal in the computer readable storage medium, when described program is at the end When being run on end, so that the terminal realizes following steps when executing:
Obtain target image set to be identified;
The target image set is input in the image recognition model that training is completed in advance, is identified based on described image Model determines at least one corresponding plot of the target image set;
Export at least one described plot.
Above-mentioned computer readable storage medium can be any usable medium or number that the processor in terminal can access Such as according to storage equipment, including but not limited to magnetic storage such as floppy disk, hard disk, tape, magneto-optic disk (MO) etc., optical memory CD, DVD, BD, HVD etc. and semiconductor memory such as ROM, EPROM, EEPROM, nonvolatile memory (NAND FLASH), solid state hard disk (SSD) etc..
Computer program, computer program are provided in the computer readable storage medium provided in embodiments of the present invention When being executed by processor, realizes the image recognition model completed based on preparatory training, can determine target image set to be identified It closes at least one corresponding plot and exports, realize and user's especially special population is helped quick and precisely to obtain image In want convey the meaning or content.
Fig. 4 is a kind of 400 schematic diagram of pattern recognition device provided in an embodiment of the present invention, is applied to terminal, the device packet It includes:
Module 401 is obtained, for obtaining target image set to be identified;
Determining module 402, for the target image set to be input in the image recognition model that training is completed in advance, Based on described image identification model, at least one corresponding plot of the target image set is determined;
Output module 403, for exporting at least one described plot.
The determining module 402 is specifically used for being based on described image identification model, determines mesh in the target image set The fisrt feature information and second feature information that logo image includes, wherein the fisrt feature information include shooting time information and Shooting location information;For each target image in the target image set, judge in the target image set whether In the presence of remaining target image of the fisrt feature information association with the target image;If so, according to the successive suitable of shooting time Sequence generates an event according to the shooting location information and second feature information in the target image and remaining described target image Thing section;If not, generating a plot according to the shooting location information and second feature information of the target image.
The second feature information comprises at least one of the following: person characteristic information, scenery characteristic information, object features letter Breath, animal character information, seasonal characteristic information and weather characteristics information.
The output module 403, specifically for each plot at least one plot for described in, by this Plot and the corresponding output of the target image for generating the plot.
The output module 403, specifically for exporting at least one described plot in a manner of text segment;With/ Or at least one described plot is exported in a manner of voice broadcast.
The output module 403 is sentenced if being also used to export at least one described plot in a manner of text segment It is disconnected whether to receive user to the edit requests of text segment, wherein being carried in story to be edited in the edit requests First text set and edited second text set;If so, the first text set is updated to second text Word collection merges display.
Described device further include:
Training module 404, for obtaining each sample image set pair for each sample image set in training set The mark for the story that sample image belongs in each sample image set of the handmarking answered and the story of the mark are corresponding Plot;According to the corresponding sample graph image set of the sample image set, the sample image set got The corresponding plot of story of the mark and the mark of the story that sample image belongs in conjunction identifies mould to described image Type is trained.
The image recognition model completed in the embodiment of the present invention based on preparatory training, can determine target image to be identified Gather at least one corresponding plot and export, realizes and user's especially special population is helped quick and precisely to obtain figure The meaning or content conveyed are wanted as in.
For systems/devices embodiment, since it is substantially similar to the method embodiment, so the comparison of description is simple Single, the relevent part can refer to the partial explaination of embodiments of method.
It should be noted that, in this document, relational terms such as first and second and the like are used merely to a reality Body or an operation are distinguished with another entity or another operation, without necessarily requiring or implying these entities Or there are any actual relationship or orders between operation.
It should be understood by those skilled in the art that, embodiments herein can provide as method, system or computer program Product.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the application Apply the form of example.Moreover, it wherein includes the computer of computer usable program code that the application, which can be used in one or more, The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) produces The form of product.
The application is referring to method, the process of equipment (system) and computer program product according to the embodiment of the present application Figure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructions The combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programs Instruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produce A raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for real The device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates, Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram or The function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that counting Series of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer or The instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram one The step of function of being specified in a box or multiple boxes.
Although the preferred embodiment of the application has been described, it is created once a person skilled in the art knows basic Property concept, then additional changes and modifications can be made to these embodiments.So it includes excellent that the following claims are intended to be interpreted as It selects embodiment and falls into all change and modification of the application range.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the art Mind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologies Within, then the present invention is also intended to include these modifications and variations.

Claims (16)

1. a kind of image-recognizing method, which is characterized in that it is applied to terminal, this method comprises:
Obtain target image set to be identified;
The target image set is input in the image recognition model that training is completed in advance, mould is identified based on described image Type determines at least one corresponding plot of the target image set;
Export at least one described plot.
2. the method as described in claim 1, which is characterized in that it is described to be based on described image identification model, determine the target At least one corresponding plot of image collection includes:
Based on described image identification model, the fisrt feature information and that target image includes in the target image set are determined Two characteristic informations, wherein the fisrt feature information includes shooting time information and shooting location information;
For each target image in the target image set, judge to whether there is and the mesh in the target image set Remaining target image of the fisrt feature information association of logo image;If so, according to the sequencing of shooting time, according to the mesh Logo image and shooting location information and second feature information in remaining described target image, generate a plot;If It is no, according to the shooting location information and second feature information of the target image, generate a plot.
3. method according to claim 2, which is characterized in that the second feature information comprises at least one of the following: personage Characteristic information, scenery characteristic information, object features information, animal character information, seasonal characteristic information and weather characteristics information.
4. method according to claim 2, which is characterized in that at least one plot described in the output includes:
For each plot at least one described plot, by the plot and the plot is generated The corresponding output of target image.
5. method as described in claim 1 or 4, which is characterized in that at least one plot described in the output includes:
At least one described plot is exported in a manner of text segment;And/or
At least one described plot is exported in a manner of voice broadcast.
6. method as claimed in claim 5, which is characterized in that if exporting at least one described event in a manner of text segment Thing section, after at least one plot described in the output, the method also includes:
Judge whether to receive user to the edit requests of text segment, wherein carrying event to be edited in the edit requests The first text set and edited second text set in thing;
If so, the first text set, which is updated to second character set, merges display.
7. method according to claim 1 or 2, which is characterized in that the training process of described image identification model includes:
For each sample image set in training set, each sample of the corresponding handmarking of each sample image set is obtained The corresponding plot of story of the mark and the mark of the story that sample image belongs in this image collection;
According to sample in the corresponding sample image set of the sample image set, the sample image set got The corresponding plot of story of the mark and the mark of the story of image ownership, instructs described image identification model Practice.
8. a kind of pattern recognition device, which is characterized in that be applied to terminal, which includes:
Module is obtained, for obtaining target image set to be identified;
Determining module is based on institute for the target image set to be input in the image recognition model that training is completed in advance Image recognition model is stated, determines at least one corresponding plot of the target image set;
Output module, for exporting at least one described plot.
9. device as claimed in claim 8, which is characterized in that the determining module is specifically used for identifying based on described image Model determines target image includes in the target image set fisrt feature information and second feature information, wherein described Fisrt feature information includes shooting time information and shooting location information;For each target figure in the target image set Picture judges in the target image set with the presence or absence of remaining target figure with the fisrt feature information association of the target image Picture;If so, according to the sequencing of shooting time, according to the shooting location in the target image and remaining described target image Information and second feature information generate a plot;If not, according to the shooting location information of the target image and second Characteristic information generates a plot.
10. device as claimed in claim 9, which is characterized in that the second feature information comprises at least one of the following: personage Characteristic information, scenery characteristic information, object features information, animal character information, seasonal characteristic information and weather characteristics information.
11. device as claimed in claim 9, which is characterized in that the output module, be specifically used for for it is described at least one The plot and the target image for generating the plot are corresponded to output by each plot in plot.
12. the device as described in claim 8 or 11, which is characterized in that the output module, specifically for text segment Mode exports at least one described plot;And/or at least one described plot is exported in a manner of voice broadcast.
13. device as claimed in claim 12, which is characterized in that the output module, if be also used to text segment Mode exports at least one described plot, judges whether to receive user to the edit requests of text segment, wherein described The the first text set and edited second text set in story to be edited are carried in edit requests;If so, will The first text set is updated to second character set and merges display.
14. device as claimed in claim 8 or 9, which is characterized in that described device further include:
Training module, for obtaining the corresponding people of each sample image set for each sample image set in training set The corresponding story of story of the mark and the mark of the story of sample image ownership in each sample image set of work label Plot;According to sample in the corresponding sample image set of the sample image set, the sample image set got The corresponding plot of story of the mark and the mark of the story of this image ownership, carries out described image identification model Training.
15. a kind of terminal characterized by comprising processor, communication interface, memory and communication bus, wherein processor, Communication interface, memory complete mutual communication by communication bus;
It is stored with computer program in the memory, when described program is executed by the processor, so that the processor Perform claim requires the step of any one of 1~7 the method.
16. a kind of computer readable storage medium, which is characterized in that it is stored with the computer program that can be executed by terminal, when When described program is run on the terminal, so that the step of terminal perform claim requires any one of 1~7 the method.
CN201811196560.3A 2018-10-15 2018-10-15 A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing Pending CN109409423A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811196560.3A CN109409423A (en) 2018-10-15 2018-10-15 A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811196560.3A CN109409423A (en) 2018-10-15 2018-10-15 A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing

Publications (1)

Publication Number Publication Date
CN109409423A true CN109409423A (en) 2019-03-01

Family

ID=65467195

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811196560.3A Pending CN109409423A (en) 2018-10-15 2018-10-15 A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing

Country Status (1)

Country Link
CN (1) CN109409423A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902759A (en) * 2019-03-12 2019-06-18 上海云从企业发展有限公司 Pictures describe method and device
CN110162639A (en) * 2019-04-16 2019-08-23 深圳壹账通智能科技有限公司 Knowledge figure knows the method, apparatus, equipment and storage medium of meaning
CN110378303A (en) * 2019-07-25 2019-10-25 杭州睿琪软件有限公司 Method and system for Object identifying

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105095919A (en) * 2015-09-08 2015-11-25 北京百度网讯科技有限公司 Image recognition method and image recognition device
CN107274884A (en) * 2017-02-15 2017-10-20 赵思聪 A kind of information acquisition method based on text resolution and phonetic synthesis
CN107710188A (en) * 2015-05-05 2018-02-16 斯纳普公司 Automate local story generation and plan exhibition
CN107918782A (en) * 2016-12-29 2018-04-17 中国科学院计算技术研究所 A kind of method and system for the natural language for generating description picture material
CN108235765A (en) * 2017-12-05 2018-06-29 华为技术有限公司 A kind of display methods and device of story photograph album

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107710188A (en) * 2015-05-05 2018-02-16 斯纳普公司 Automate local story generation and plan exhibition
CN105095919A (en) * 2015-09-08 2015-11-25 北京百度网讯科技有限公司 Image recognition method and image recognition device
CN107918782A (en) * 2016-12-29 2018-04-17 中国科学院计算技术研究所 A kind of method and system for the natural language for generating description picture material
CN107274884A (en) * 2017-02-15 2017-10-20 赵思聪 A kind of information acquisition method based on text resolution and phonetic synthesis
CN108235765A (en) * 2017-12-05 2018-06-29 华为技术有限公司 A kind of display methods and device of story photograph album

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109902759A (en) * 2019-03-12 2019-06-18 上海云从企业发展有限公司 Pictures describe method and device
CN110162639A (en) * 2019-04-16 2019-08-23 深圳壹账通智能科技有限公司 Knowledge figure knows the method, apparatus, equipment and storage medium of meaning
CN110378303A (en) * 2019-07-25 2019-10-25 杭州睿琪软件有限公司 Method and system for Object identifying
CN110378303B (en) * 2019-07-25 2021-07-09 杭州睿琪软件有限公司 Method and system for object recognition
US11335087B2 (en) 2019-07-25 2022-05-17 Hangzhou Glority Software Limited Method and system for object identification

Similar Documents

Publication Publication Date Title
CN105808782B (en) A kind of adding method and device of picture tag
CN110119711B (en) Method and device for acquiring character segments of video data and electronic equipment
US10679063B2 (en) Recognizing salient video events through learning-based multimodal analysis of visual features and audio-based analytics
CN107993191A (en) A kind of image processing method and device
JP2019114243A (en) Imaging device and learning method
CN109074358A (en) Geographical location related with user interest is provided
CN109409423A (en) A kind of image-recognizing method, device, terminal and readable storage medium storing program for executing
CN102799890A (en) Image clustering method
CN110914872A (en) Navigating video scenes with cognitive insights
JP2017536635A (en) Picture scene determination method, apparatus and server
CN106528834A (en) Picture resource pushing method and device based on artificial intelligence
CN110266879A (en) Broadcast interface display methods, device, terminal and storage medium
CN103870485A (en) Method and device for achieving augmented reality application
CN112040273B (en) Video synthesis method and device
CN109040605A (en) Shoot bootstrap technique, device and mobile terminal and storage medium
WO2023197979A1 (en) Data processing method and apparatus, and computer device and storage medium
CN114861836B (en) Model deployment method based on artificial intelligence platform and related equipment
WO2018192245A1 (en) Automatic scoring method for photo based on aesthetic assessment
CN113627402B (en) Image identification method and related device
CN106802913A (en) One kind plays content recommendation method and its device
CN110121033A (en) Video categorization and device
CN109446946A (en) A kind of multi-cam real-time detection method based on multithreading
CN116595438A (en) Picture creation method, device, equipment and storage medium
US11200650B1 (en) Dynamic image re-timing
WO2021092935A1 (en) Image data-based message pushing method and device, and computer storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20190301

RJ01 Rejection of invention patent application after publication