CN109697239A - Method for generating the method for graph text information and for generating image data base - Google Patents

Method for generating the method for graph text information and for generating image data base Download PDF

Info

Publication number
CN109697239A
CN109697239A CN201811407298.2A CN201811407298A CN109697239A CN 109697239 A CN109697239 A CN 109697239A CN 201811407298 A CN201811407298 A CN 201811407298A CN 109697239 A CN109697239 A CN 109697239A
Authority
CN
China
Prior art keywords
image
keyword
information
data base
description
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811407298.2A
Other languages
Chinese (zh)
Other versions
CN109697239B (en
Inventor
齐镗泉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nanjing Shangwang Network Technology Co ltd
Original Assignee
Lianshang Xinchang Network Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lianshang Xinchang Network Technology Co Ltd filed Critical Lianshang Xinchang Network Technology Co Ltd
Priority to CN201811407298.2A priority Critical patent/CN109697239B/en
Publication of CN109697239A publication Critical patent/CN109697239A/en
Priority to PCT/CN2019/119914 priority patent/WO2020103899A1/en
Application granted granted Critical
Publication of CN109697239B publication Critical patent/CN109697239B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51Indexing; Data structures therefor; Storage structures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/53Querying
    • G06F16/538Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Software Systems (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The embodiment of the present application discloses the method for generating graph text information and the method for generating image data base.The specific embodiment for being used to generate graph text information method includes: that title keyword and content topic are obtained to figure information;It is chosen and title keyword and the matched image of content topic from image data base, wherein image data base is generated based on iamge description;By the image insertion of selection to generate target graph text information in figure information.The embodiment helps to reduce figure cost.

Description

Method for generating the method for graph text information and for generating image data base
Technical field
This application involves field of computer technology, and in particular to for generating the method for graph text information and for generation figure As the method for database.
Background technique
Fast development and the universal of mobile terminal of Internet technology change traditional information (such as news) display platform Show form with content.Compared with traditional plain text information, the information that both pictures and texts are excellent more can reader note Meaning.
In the related technology by the artificial figure realization information that both pictures and texts are excellent, however artificial figure is one time-consuming Laborious and inefficient work.
Summary of the invention
The embodiment of the present application proposes the method for generating graph text information and the method for generating image data base.
In a first aspect, some embodiments of the present application provide a kind of method for generating graph text information, this method packet It includes: obtaining title keyword and content topic to figure information;It is chosen and title keyword and interior from image data base Hold the matched image of theme, wherein image data base is generated based on iamge description;The image of selection is inserted into figure information In, generate target graph text information.
Second aspect, some embodiments of the present application provide a kind of method for generating graph text information, this method packet It includes: obtaining title keyword and content topic to figure information;It is determined based on title keyword and content topic wait choose The classification of image;The image data base with categorical match is chosen from least one image data base;From the image data of selection It is chosen and title keyword and the matched image of content topic in library, wherein at least one image data base is based on image Description generates;By the image insertion of selection to generate target graph text information in figure information.
The third aspect, some embodiments of the present application provide a kind of method for generating image data base, this method It include: acquisition image collection, the image in image collection is the figure in graph text information;For the image in image collection, obtain The description information for taking the image, the image that at least one keyword is extracted from the description information of the image as the image close Keyword;Based on image collection and extracted image keyword, image data base is generated.
Fourth aspect, some embodiments of the present application provide a kind of electronic equipment, comprising: one or more processors; Storage device is stored thereon with one or more programs, when one or more programs are executed by one or more processors, makes Obtain method of the one or more processors realization as described in first aspect to the third aspect.
5th aspect, some embodiments of the present application provide a kind of computer-readable medium, are stored thereon with computer Program realizes the method as described in first aspect to the third aspect when computer program is executed by processor.
Method provided by the embodiments of the present application for generating graph text information and the method for generating image data base, By obtaining title keyword and content topic to figure information, later from the image data generated based on iamge description Selection and title keyword and the matched image of content topic in library, finally by the image insertion of selection to raw in figure information At target graph text information, to help to reduce figure cost.
Detailed description of the invention
By reading a detailed description of non-restrictive embodiments in the light of the attached drawings below, the application its Its feature, objects and advantages will become more apparent upon:
Fig. 1 is that some embodiments of the application can be applied to exemplary system architecture figure therein;
Fig. 2 is the flow chart according to one embodiment of the method for generating image data base of the application;
Fig. 3 is the schematic diagram according to an application scenarios of the method for generating image data base of the application;
Fig. 4 is the flow chart according to one embodiment of the method for generating graph text information of the application;
Fig. 5 is the flow chart according to one embodiment of the method for generating graph text information of the application;
Fig. 6 is the flow chart according to another embodiment of the method for generating graph text information of the application;
Fig. 7 is adapted for the structural schematic diagram for the computer system for realizing the electronic equipment of the embodiment of the present application.
Specific embodiment
The application is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining related invention, rather than the restriction to the invention.It also should be noted that being Convenient for description, part relevant to related invention is illustrated only in attached drawing.
It should be noted that in the absence of conflict, the features in the embodiments and the embodiments of the present application can phase Mutually combination.The application is described in detail below with reference to the accompanying drawings and in conjunction with the embodiments.
As shown in Figure 1, system architecture 100 may include server 101,103 and network 102.Network 102 is to take It is engaged in providing the medium of communication link between device 101 and 103.Network 102 may include various connection types, such as wired, wireless Communication link or glass fiber cables etc..
Server 101 can be the server for generating image data base.The available image collection of server 101 (for example, obtaining image collection from server 103) obtains the description information of each image later and therefrom extracts image key Word generates image image data base corresponding with figure related term.
Server 101 can also be the server that figure is carried out to information.Server 101 can be from information Middle acquisition title keyword and content topic, and matched image is selected from the image data base generated based on iamge description It is inserted into information and generates the information that both pictures and texts are excellent.
Server 102 can be the server for storing image or graph text information.
It should be noted that for generating the method for image data base or for generating provided by the embodiment of the present application The method of graph text information is generally executed by server 101.
It should be noted that server 101,103 can be hardware, it is also possible to software.When server 101,103 is When hardware, the distributed server cluster of multiple server compositions may be implemented into, individual server also may be implemented into.When When server is software, multiple softwares or software module (such as providing Distributed Services) may be implemented into, it can also be real Ready-made single software or software module.It is not specifically limited herein.
It should be pointed out that the local of server 101 can also directly store image collection, server 101 can be direct Obtain local image collection.At this point, exemplary system architecture 100 can not include server 103.
It should be understood that the number of network and server in Fig. 1 is only schematical.According to needs are realized, can have There are the network and server of any suitable number.
With continued reference to Fig. 2, one embodiment of the method for generating image data base according to the application is shown Process 200.The method for being used to generate image data base, may comprise steps of 201~203.
Step 201, image collection is obtained.
In the present embodiment, for generating the executing subject (such as server 101 of Fig. 1) of the method for image data base Image collection can be obtained from Local or Remote.Wherein, the image in image collection can be the figure in graph text information.Figure Literary information may include content of text and figure.For example, graph text information can be the informations such as news, blog article.
Here, the image in image collection can be above-mentioned executing subject or the server different from above-mentioned executing subject It is obtained from the server for storing image or graph text information.For example, being climbed from the webpage in web page server by web crawlers Take figure.
Step 202, for each image in image collection, the description information of the image is obtained, from the description of the image Image keyword of at least one keyword as the image is extracted in information.
In the present embodiment, for each image in image collection, the execution of the method for generating image data base Main body (such as server 101 of Fig. 1) can obtain the description information of the image first, then from the description information of acquisition Extract image keyword of at least one keyword as the image.The description information of image can be includes in image The character express of feature (for example, scene, conspicuousness object etc. in image).Here it is possible to using participle technique from description Keyword is extracted in information.
In some optional implementations of the present embodiment, the description information of image can be obtained as follows: Image recognition (such as passing through image recognition model) is carried out to image, the description of image is then generated according to image recognition result Information.As an example, by image recognition, can determine that the scene of image is in parlor and image for a certain image Conspicuousness object is one chair, then the description information of the image can be " having a chair in parlor ".
In some optional implementations of the present embodiment, iamge description technology (Image Caption can be passed through Technique) description information of image is obtained.For example, can input an image into trained image description model, Obtain description information corresponding with input picture.Here, image description model can be used for characterizing input picture and description information Between corresponding relationship.In the example, LSTM (Long Short-Term Memory, shot and long term memory network) can be used The encoder frame of unit is as image description model.
Optionally, above-mentioned image description model can be trained as follows and be obtained:
Firstly, obtaining multiple images from pre-generated image description data library and corresponding with each image retouching State information.Wherein, image and description information corresponding with image are stored in image description data library.
Then, using each image in above-mentioned multiple images as input, description information corresponding with input picture is made For output, initial pictures descriptive model is trained using the method for machine learning, obtains image description model.
Optionally, above-mentioned image description data library can generate as follows:
Firstly, obtaining graph text information set from Local or Remote.
Later, for each graph text information in above-mentioned graph text information set, obtain figure in the graph text information and Description information corresponding with figure is extracted from the content of text of the graph text information.
Finally, figure and its corresponding description information are performed in accordance with storage, image description data library is established.
In some optional implementations of the present embodiment, the description information of each image can be natural language and retouch The sentence stated.At least one above-mentioned keyword may include noun included in the sentence of natural language description and/or describe Word.Accordingly, for each image in image collection, at least one keyword work is extracted from the description information of the image May include following two step for the image keyword of the image:
The first step segments the sentence of natural language description, and the sentence for obtaining the natural language description is included Noun and/or adjective.
Above-mentioned noun and/or adjective are extracted as the image keyword of the image by second step.
As an example, the description information of an image is " having a chair in parlor ", word segmentation processing is carried out to the sentence, Noun wherein included " parlor ", " chair " are obtained, then can be the figure by " parlor ", " chair " the two keyword extractions The image keyword of picture.
Step 203, it is based on image collection and extracted image keyword, generates image data base.
In the present embodiment, for generating the executing subject (such as server 101 of Fig. 1) of the method for image data base Image collection can be used and the extracted image keyword of step 202 establishes image data base.As an example, can will be upper The each image and its image keyword stated in image collection are performed in accordance with storage, establish image data base.
With continued reference to Fig. 3, it illustrates an applied fields according to the method for generating image data base of the application Scape 300.In the application scenarios 300 of Fig. 3, server 302 obtains image collection from local, and wherein image 301 is image collection In an image.Later, image 301 is input in trained image description model, obtains a description information " Huang The flower of color rises sheer from wilderness ".Word segmentation processing is carried out to foregoing description information, obtains noun therein " flower ", " wilderness " and shape Hold word " yellow ", and by above three crucial phrase at the image keyword (that is, " flower ", " wilderness ", " yellow ") of image 301. Then, it is used as a data record storage into database image 301 and image keyword " flower ", " wilderness ", " yellow ". Similarly, other images in image collection are similarly handled and is stored, finally obtain image data base.
Method provided by the embodiments of the present application for generating image data base, by collecting the figure in graph text information, Obtain the description information of each figure later, and from description information extract image keyword, finally the figure based on collection and The image keyword of extraction generates image data base, to help to reduce figure cost.
With further reference to Fig. 4, another embodiment of the method for generating image data base according to the application is shown Process 400.The method for being used to generate image data base, may comprise steps of 401~405.
Step 401, image collection is obtained.
In the present embodiment, for generating the executing subject (such as server 101 of Fig. 1) of the method for image data base Image collection can be obtained from Local or Remote.Wherein, the image in image collection can be the figure in graph text information.Figure Literary information may include content of text and figure.For example, graph text information can be the informations such as news, blog article.
Step 402, for each image in image collection, the description information of the image is obtained, from the description of the image Image keyword of at least one keyword as the image is extracted in information.
In the present embodiment, for each image in image collection, the execution of the method for generating image data base Main body (such as server 101 of Fig. 1) can obtain the description information of the image first, then from the description information of acquisition Extract image keyword of at least one keyword as the image.The description information of image can be includes in image The character express of feature (for example, scene, conspicuousness object etc. in image).Here it is possible to using participle technique from description Keyword is extracted in information.
Step 403, the similarity between the image keyword of different images in image collection is determined.
In the present embodiment, for generating the executing subject (such as server 101 of Fig. 1) of the method for image data base It can determine the similarity between the image keyword of different images in image collection.As an example, in image collection Any two image A and B, can calculate the similarity between the image keyword of image A and the image keyword of image B.
Step 404, the similarity between the image keyword based on different images carries out the image in image collection Classification, obtains at least one set of sub-images.
In the present embodiment, for generating the executing subject (such as server 101 of Fig. 1) of the method for image data base The similarity size that can use between the image keyword of different images divides each image in above-mentioned image collection Class (for example, being classified by cluster), obtaining at least one set of sub-images, (each set of sub-images represents a classification Image).
Step 405, at least one image data base corresponding at least one set of sub-images is generated.
In the present embodiment, for each set of sub-images at least one above-mentioned set of sub-images, for generating figure As the method for database executing subject (such as server 101 of Fig. 1) can by the set of sub-images each image and Its image keyword is performed in accordance with storage, obtains image data base corresponding with the set of sub-images.
As an example, image collection includes 1,000,000 images, it can should using the similarity between image keyword Image collection is divided into 1000 set of sub-images (it is assumed that each set of sub-images includes 1000 images), and then establishes 1000 image data bases.If only establish an image data base, obtains an image and at most need to match 1,000,000 times. In contrast, an image is obtained in the present embodiment at most to need to match 2000 times and (determine that matched image data base is most Need to match 1000 times, retrieve matched database and at most need to match 1000 times), to effectively increase retrieval image Efficiency.
Figure 4, it is seen that being used to generate image data base in the present embodiment compared with the corresponding embodiment of Fig. 2 Method process 400 embody using the similarity between image keyword to image carry out classification and according to inhomogeneity Other image establishes the step of different image data bases.The scheme of the present embodiment description can effectively improve retrieval figure as a result, The efficiency of picture.
With further reference to Fig. 5, it illustrates one embodiment according to the method for generating graph text information of the application Process 500.The method for being used to generate graph text information may comprise steps of 501~503.
Step 501, title keyword and content topic are obtained to figure information.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information To obtain title keyword and content topic to figure information.Here, it can be to figure information and need to carry out figure Information (for example, only including the news of content of text).
In some optional implementations of the present embodiment, step 501 can specifically include following steps:
Firstly, obtaining the title and content of text to figure information.
Later, keyword is extracted from title using participle technique as title keyword.
Then, content of text is input in trained text subject model, obtains the content master to figure information Topic.Wherein, content topic may include subject key words.Here, text subject model can characterize content of text and content master Corresponding relationship between topic.
Text subject model is to a kind of modeling method for implying theme in text.It is generally acknowledged that each of article Word is all by " with some theme of certain probability selection, and with some word of certain probability selection from this theme ".Text This topic model training method may include LSA (Latent semantic analysis, Latent Semantic analysis), pLSA (Probabilistic latent semantic analysis, the analysis of probability Latent Semantic), LDA (Latent Dirichlet allocation, hidden Di Li Cray distribution) etc..The training of text subject model is to study and answer extensively at present Well-known technique, details are not described herein.
Step 502, it is chosen and title keyword and the matched image of content topic from image data base.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information To choose the image to match with the title keyword of step 502 acquisition and content topic in image data base.Wherein, image Database is generated based on iamge description.
In some optional implementations of the present embodiment, step 502 can specifically include following steps: firstly, really Determine the image keyword of image included by image data base and the similarity of above-mentioned title keyword and above content theme; Then, the image that similarity is greater than or equal to preset threshold is chosen.As an example, image can be chosen from image data base Keyword and title keyword and the similarity of content topic are greater than or equal to the image of similarity threshold (such as 90%).
In some optional implementations of the present embodiment, image data base can specifically pass through following three step It generates:
The first step obtains image collection.Wherein, the image in image collection is the figure in graph text information.
Second step obtains the description information of the image for each image in image collection, from the description of the image Image keyword of at least one keyword as the image is extracted in information.
Third step is based on image collection and extracted image keyword, generates image data base.
The specific descriptions of the above-mentioned first step to third step can refer in the corresponding embodiment of Fig. 2 about step 201~step Rapid 203 specific descriptions, this will not be repeated here.
It is alternatively possible to obtain the description of image by iamge description technology (Image Caption Technique) Information.For example, can input an image into trained image description model, description letter corresponding with input picture is obtained Breath.Here, image description model can be used for characterizing the corresponding relationship between input picture and description information.It, can in the example To be retouched using the encoder frame of LSTM (Long Short-Term Memory, shot and long term memory network) unit as image State model.
Optionally, above-mentioned image description model can be trained as follows and be obtained:
Firstly, obtaining multiple images from pre-generated image description data library and corresponding with each image retouching State information.Wherein, image and description information corresponding with image are stored in image description data library.
Then, using each image in above-mentioned multiple images as input, description information corresponding with input picture is made For output, initial pictures descriptive model is trained using the method for machine learning, obtains image description model.
Optionally, above-mentioned image description data library can generate as follows:
Firstly, obtaining graph text information set from Local or Remote.
Later, for each graph text information in above-mentioned graph text information set, obtain figure in the graph text information and Description information corresponding with figure is extracted from the content of text of the graph text information.
Finally, figure and its corresponding description information are performed in accordance with storage, image description data library is established.
In some optional implementations of the present embodiment, the description information of each image can be natural language and retouch The sentence stated.At least one above-mentioned keyword may include noun included in the sentence of natural language description and/or describe Word.Accordingly, for each image in image collection, at least one keyword work is extracted from the description information of the image May include following two step for the image keyword of the image:
The first step segments the sentence of natural language description, and the sentence for obtaining the natural language description is included Noun and/or adjective.
Above-mentioned noun and/or adjective are extracted as the image keyword of the image by second step.
As an example, the description information of an image is " having a chair in parlor ", word segmentation processing is carried out to the sentence, Noun wherein included " parlor ", " chair " are obtained, then can be the figure by " parlor ", " chair " the two keyword extractions The image keyword of picture.
Step 503, by the image insertion of selection to generate target graph text information in figure information.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information With by choose image according to it is preset insertion rule (for example, insertion image keyword where paragraph after etc.) be inserted into To generate the information that both pictures and texts are excellent in figure information.
Method provided by the embodiments of the present application for generating graph text information, by obtaining the title pass to figure information Keyword and content topic choose matched image from the image data base generated based on iamge description later, will finally choose Image insertion in figure information generate target graph text information, thus facilitate reduce figure cost.
With further reference to Fig. 6, it illustrates another embodiments according to the method for generating graph text information of the application Process 600.The method for being used to generate graph text information may comprise steps of 601~605.
Step 601, title keyword and content topic are obtained to figure information.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information To obtain title keyword and content topic to figure information.Here, it can be to figure information and need to carry out figure Information (for example, only including the news of content of text).
The specific steps for obtaining title keyword and content topic can refer in the corresponding embodiment of Fig. 5 about acquisition The description of title keyword and the specific steps of content topic, details are not described herein.
Step 602, the classification of image to be chosen is determined based on title keyword and content topic.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information To determine the classification of image to be chosen based on title keyword and content topic.As an example, step 601 can be obtained Title keyword and content topic are merged, and determine the classification of image to be chosen.
Step 603, the image data base with categorical match is chosen from least one image data base.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information To choose the image data base that the classification determined with step 602 matches from least one image data base.Wherein, above-mentioned At least one image data base is generated based on iamge description.
In some optional implementations of the present embodiment, image data base can specifically pass through following five steps It generates:
The first step obtains image collection.Wherein, the image in image collection is the figure in graph text information.
Second step obtains the description information of the image for each image in image collection, from the description of the image Image keyword of at least one keyword as the image is extracted in information.
Third step determines the similarity between the image keyword of different images in image collection.
4th step, the similarity between the image keyword based on different images, divides the image in image collection Class obtains at least one set of sub-images.
5th step generates and at least one corresponding image data base at least one above-mentioned set of sub-images.
The specific descriptions of the above-mentioned first step to the 5th step can refer in the corresponding embodiment of Fig. 4 about step 401~step Rapid 405 specific descriptions, this will not be repeated here.
Step 604, it is chosen and title keyword and the matched image of content topic from the image data base of selection.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information It is chosen and title keyword and the matched image of content topic in the image data base chosen with step 603.As an example, can It is greater than or equal to choosing image keyword and the similarity of title keyword and content topic from the image data base of selection The image of similarity threshold (such as 90%).
Step 605, by the image insertion of selection to generate target graph text information in figure information.
It in the present embodiment, can for generating the executing subject (such as server 101 of Fig. 1) of the method for graph text information With by choose image according to it is preset insertion rule (for example, insertion image keyword where paragraph after etc.) be inserted into To generate the information that both pictures and texts are excellent in figure information.
From fig. 6 it can be seen that being used to generate graph text information in the present embodiment compared with the corresponding embodiment of Fig. 5 The process 600 of method embody using the classification that title keyword and content topic determine choose matched image data base with And the step of image is chosen from matched image data base.The scheme of the present embodiment description can effectively improve figure as a result, Efficiency.
Below with reference to Fig. 7, it illustrates the electronic equipment for being suitable for being used to realize the embodiment of the present application (such as the services of Fig. 1 Device 101) computer system 700 structural schematic diagram.Electronic equipment shown in Fig. 7 is only an example, should not be to this Shen Please embodiment function and use scope bring any restrictions.
As shown in fig. 7, computer system 700 includes one or more central processing unit (CPU) 701, it can basis The program that is stored in read-only memory (ROM) 702 is loaded into random access storage device (RAM) from storage section 708 Program in 703 and execute various movements appropriate and processing.In RAM 703, be also stored with system 700 operate it is required Various programs and data.CPU 701, ROM 702 and RAM 703 are connected with each other by bus 704.Input/output (I/O) Interface 705 is also connected to bus 704.
I/O interface 705 is connected to lower component: the importation 706 including mouse, keyboard etc.;Including such as organic hair The output par, c 707 of optical diode (OLED) display, liquid crystal display (LCD) etc. and loudspeaker etc.;Including hard disk etc. Storage section 708;And the communications portion 709 of the network interface card including LAN card, modem etc..Communications portion 709 execute communication process via the network of such as internet.Driver 710 is also connected to I/O interface 705 as needed.It is removable Medium 711, such as disk, CD, magneto-optic disk, semiconductor memory etc. are unloaded, is mounted on driver 710 as needed, with Convenient for being mounted into storage section 708 as needed from the computer program read thereon.
Particularly, in accordance with an embodiment of the present disclosure, it may be implemented as computer above with reference to the process of flow chart description Software program.For example, embodiment of the disclosure includes a kind of computer program product comprising be carried on computer-readable Jie Computer program in matter, the computer program include the program code for method shown in execution flow chart.Such In embodiment, which can be downloaded and installed from network by communications portion 709, and/or from detachable Medium 711 is mounted.When the computer program is executed by central processing unit (CPU) 701, execute in the present processes The above-mentioned function of limiting.
It should be noted that computer-readable medium described herein can be computer-readable signal media or Computer readable storage medium either the two any combination.Computer readable storage medium for example can be --- But be not limited to --- electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor system, device or device, or any above group It closes.The more specific example of computer readable storage medium can include but is not limited to: have the electricity of one or more conducting wires Connection, portable computer diskette, hard disk, random access storage device (RAM), read-only memory (ROM), erasable type are programmable Read-only memory (EPROM or flash memory), optical fiber, portable compact disc read-only memory (CD-ROM), light storage device, magnetic Memory device or above-mentioned any appropriate combination.In this application, computer readable storage medium can be any packet Contain or store the tangible medium of program, which can be commanded execution system, device or device use or in connection It uses.And in this application, computer-readable signal media may include propagating in a base band or as carrier wave a part Data-signal, wherein carrying computer-readable program code.The data-signal of this propagation can use a variety of shapes Formula, including but not limited to electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media is also It can be any computer-readable medium other than computer readable storage medium, which can send, pass It broadcasts or transmits for by the use of instruction execution system, device or device or program in connection.Computer The program code for including on readable medium can transmit with any suitable medium, including but not limited to: wireless, electric wire, light Cable, RF etc. or above-mentioned any appropriate combination.
The calculating of the operation for executing the application can be write with one or more programming languages or combinations thereof Machine program code, described program design language include object oriented program language-such as Java, Smalltalk, C+ +, it further include conventional procedural programming language-such as " C " language or similar programming language.Program code can Fully to execute, partly execute on the user computer on the user computer, be held as an independent software package Part executes on the remote computer or holds on a remote computer or server completely on the user computer for row, part Row.In situations involving remote computers, remote computer can pass through the network of any kind --- including local area network (LAN) or wide area network (WAN)-is connected to subscriber computer, or, it may be connected to outer computer (such as using because of spy Service provider is netted to connect by internet).
Flow chart and block diagram in attached drawing are illustrated according to the system of the various embodiments of the application, method and computer journey The architecture, function and operation in the cards of sequence product.In this regard, each box in flowchart or block diagram can be with A part of a module, program segment or code is represented, a part of the module, program segment or code includes one or more A executable instruction for implementing the specified logical function.It should also be noted that in some implementations as replacements, box Middle marked function can also occur in a different order than that indicated in the drawings.For example, two boxes succeedingly indicated Can actually be basically executed in parallel, they can also be executed in the opposite order sometimes, this according to related function and It is fixed.It is also noted that the group of each box in block diagram and or flow chart and the box in block diagram and or flow chart It closes, can be realized with the dedicated hardware based system for executing defined functions or operations, or specialized hardware can be used Combination with computer instruction is realized.
As on the other hand, present invention also provides a kind of computer-readable medium, which be can be Included in server described in above-described embodiment;It is also possible to individualism, and without in the supplying server.On It states computer-readable medium and carries one or more program, when said one or multiple programs are executed by the server When, so that the server: obtaining title keyword and content topic to figure information;From image data base choose with Title keyword and the matched image of content topic, wherein image data base is generated based on iamge description;By the image of selection Insertion is to generate target graph text information in figure information.
Above description is only the preferred embodiment of the application and the explanation to institute's application technology principle.Art technology Personnel should be appreciated that invention scope involved in the application, however it is not limited to skill made of the specific combination of above-mentioned technical characteristic Art scheme, while should also cover in the case where not departing from foregoing invention design, by above-mentioned technical characteristic or its equivalent feature into Row any combination and the other technical solutions formed.Such as features described above and (but being not limited to) disclosed herein have class Technical characteristic like function is replaced mutually and the technical solution that is formed.

Claims (15)

1. a kind of method for generating graph text information characterized by comprising
Title keyword and content topic are obtained to figure information;
It is chosen and the title keyword and the matched image of the content topic from image data base, wherein described image Database is generated based on iamge description;
The image insertion of selection is described to generate target graph text information in figure information.
2. the method according to claim 1, wherein described image database is generated especially by following steps:
Image collection is obtained, the image in described image set is the figure in graph text information;
For the image in described image set, obtain the description information of the image, from the description information of the image extract to Few image keyword of the keyword as the image;
Based on described image set and extracted image keyword, image data base is generated.
3. according to the method described in claim 2, it is characterized in that, the description information for obtaining the image, comprising:
By in image input image description model trained in advance, the description information of the image is obtained, wherein described image is retouched Model is stated for characterizing the corresponding relationship between input picture and description information.
4. according to the method described in claim 3, it is characterized in that, described image descriptive model is trained as follows It arrives:
Multiple images and description information corresponding with described multiple images are obtained from pre-generated image description data library;
Using the image in described multiple images as input, description information corresponding with input picture is used as and is exported, trained To described image descriptive model.
5. according to the method described in claim 4, it is characterized in that, described image descriptive data base generates as follows:
Obtain graph text information set;
For the graph text information in graph text information set, the figure and description corresponding with figure letter in the graph text information are obtained Breath;
Based on acquired figure and description information corresponding with figure, described image descriptive data base is generated.
6. the method according to any one of claim 2 to 5, which is characterized in that for the image in described image set, The description information of the image is the sentence of natural language description, at least one described keyword includes the natural language description The noun and/or adjective that sentence is included;The conduct of at least one keyword is extracted in the description information from the image should The image keyword of image, comprising:
The sentence of the natural language description is segmented, the noun that the sentence of the natural language description is included is obtained And/or adjective;
The noun and/or adjective are extracted as to the image keyword of the image.
7. the method according to any one of claim 2 to 6, which is characterized in that described to be based on described image set and institute The image keyword of extraction generates image data base, comprising:
Determine the similarity between the image keyword of different images in described image set;
Similarity between image keyword based on different images, classifies to the image in described image set, obtains At least one set of sub-images;
Generate at least one image data base corresponding at least one described set of sub-images.
8. method according to any one of claim 1 to 7, which is characterized in that it is described from image data base choose with The title keyword and the matched image of the content topic, comprising:
Determine the image keyword and the title keyword and the content topic of image included by described image database Similarity;
Choose the image that the similarity is greater than or equal to preset threshold.
9. method according to any one of claim 1 to 8, which is characterized in that described to obtain mark to figure information Inscribe keyword and content topic, comprising:
Obtain the title and content of text to figure information;
The title keyword is extracted from the title;
The content of text is input in text subject model trained in advance, obtains the content topic.
10. a kind of method for generating graph text information characterized by comprising
Title keyword and content topic are obtained to figure information;
The classification of image to be chosen is determined based on the title keyword and the content topic;
The image data base with the categorical match is chosen from least one image data base, wherein at least one described figure As database is generated based on iamge description;
It is chosen and the title keyword and the matched image of the content topic from the image data base of selection;
The image insertion of selection is described to generate target graph text information in figure information.
11. according to the method described in claim 10, it is characterized in that, at least one described image data base as follows It generates:
Image collection is obtained, the image in described image set is the figure in graph text information;
For the image in described image set, obtain the description information of the image, from the description information of the image extract to Few image keyword of the keyword as the image;
Determine the similarity between the image keyword of different images in described image set;
Similarity between image keyword based on different images, classifies to the image in described image set, obtains At least one set of sub-images;
Generate at least one image data base corresponding at least one described set of sub-images.
12. a kind of method for generating image data base characterized by comprising
Image collection is obtained, the image in described image set is the figure in graph text information;
For the image in described image set, obtain the description information of the image, from the description information of the image extract to Few image keyword of the keyword as the image;
Based on described image set and extracted image keyword, image data base is generated.
13. according to the method for claim 12, described be based on described image set and extracted image keyword, generate Image data base, comprising:
Determine the similarity between the image keyword of different images;
Similarity between image keyword based on different images, classifies to the image in described image set, obtains At least one set of sub-images;
Generate at least one image data base corresponding at least one described set of sub-images.
14. a kind of electronic equipment, comprising:
One or more processors;
Storage device is stored thereon with one or more programs;
When one or more of programs are executed by one or more of processors, so that one or more of processors are real The now method as described in any one of claims 1 to 13.
15. a kind of computer-readable medium, is stored thereon with computer program, wherein real when described program is executed by processor The now method as described in any one of claims 1 to 13.
CN201811407298.2A 2018-11-23 2018-11-23 Method for generating teletext information Active CN109697239B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201811407298.2A CN109697239B (en) 2018-11-23 2018-11-23 Method for generating teletext information
PCT/CN2019/119914 WO2020103899A1 (en) 2018-11-23 2019-11-21 Method for generating inforgraphic information and method for generating image database

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811407298.2A CN109697239B (en) 2018-11-23 2018-11-23 Method for generating teletext information

Publications (2)

Publication Number Publication Date
CN109697239A true CN109697239A (en) 2019-04-30
CN109697239B CN109697239B (en) 2022-05-31

Family

ID=66230118

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811407298.2A Active CN109697239B (en) 2018-11-23 2018-11-23 Method for generating teletext information

Country Status (2)

Country Link
CN (1) CN109697239B (en)
WO (1) WO2020103899A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110209862A (en) * 2019-05-22 2019-09-06 招商局金融科技有限公司 Text matches drawing method, electronic device and computer readable storage medium
CN110222189A (en) * 2019-06-19 2019-09-10 北京百度网讯科技有限公司 Method and apparatus for output information
WO2020103899A1 (en) * 2018-11-23 2020-05-28 连尚(新昌)网络科技有限公司 Method for generating inforgraphic information and method for generating image database
CN111753078A (en) * 2019-07-12 2020-10-09 北京京东尚科信息技术有限公司 Image paragraph description generation method, device, medium and electronic equipment
CN112015936A (en) * 2020-08-27 2020-12-01 北京字节跳动网络技术有限公司 Method, device, electronic equipment and medium for generating article display diagram
WO2021008213A1 (en) * 2019-07-12 2021-01-21 智慧芽信息科技(苏州)有限公司 Image database establishing method, searching method, electronic device, and storage medium
CN112287173A (en) * 2020-10-30 2021-01-29 北京有竹居网络技术有限公司 Method and apparatus for generating information
CN112613293A (en) * 2020-12-29 2021-04-06 北京中科闻歌科技股份有限公司 Abstract generation method and device, electronic equipment and storage medium

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010021937A1 (en) * 2000-02-28 2001-09-13 Lorenzo Cicchitelli Automatically selecting fonts
CN102693231A (en) * 2011-03-23 2012-09-26 百度在线网络技术(北京)有限公司 Method, device and device for confirming atlas according to images from network
CN104657375A (en) * 2013-11-20 2015-05-27 中国科学院深圳先进技术研究院 Image-text theme description method, device and system
CN107103073A (en) * 2017-04-21 2017-08-29 北京恒冠网络数据处理有限公司 A kind of image indexing system
US20180173681A1 (en) * 2016-12-21 2018-06-21 Ten-X, Llc System and method for generating content pertaining to real property assets
CN108304431A (en) * 2017-06-14 2018-07-20 腾讯科技(深圳)有限公司 A kind of image search method and device, equipment, storage medium
CN108733779A (en) * 2018-05-04 2018-11-02 百度在线网络技术(北京)有限公司 The method and apparatus of text figure

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090112830A1 (en) * 2007-10-25 2009-04-30 Fuji Xerox Co., Ltd. System and methods for searching images in presentations
CN106951493A (en) * 2017-03-14 2017-07-14 北京搜狐新媒体信息技术有限公司 Automatic figure methods of exhibiting and device without figure news
CN107193962B (en) * 2017-05-24 2021-06-11 百度在线网络技术(北京)有限公司 Intelligent map matching method and device for Internet promotion information
CN109697239B (en) * 2018-11-23 2022-05-31 南京尚网网络科技有限公司 Method for generating teletext information

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20010021937A1 (en) * 2000-02-28 2001-09-13 Lorenzo Cicchitelli Automatically selecting fonts
CN102693231A (en) * 2011-03-23 2012-09-26 百度在线网络技术(北京)有限公司 Method, device and device for confirming atlas according to images from network
CN104657375A (en) * 2013-11-20 2015-05-27 中国科学院深圳先进技术研究院 Image-text theme description method, device and system
US20180173681A1 (en) * 2016-12-21 2018-06-21 Ten-X, Llc System and method for generating content pertaining to real property assets
CN107103073A (en) * 2017-04-21 2017-08-29 北京恒冠网络数据处理有限公司 A kind of image indexing system
CN108304431A (en) * 2017-06-14 2018-07-20 腾讯科技(深圳)有限公司 A kind of image search method and device, equipment, storage medium
CN108733779A (en) * 2018-05-04 2018-11-02 百度在线网络技术(北京)有限公司 The method and apparatus of text figure

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020103899A1 (en) * 2018-11-23 2020-05-28 连尚(新昌)网络科技有限公司 Method for generating inforgraphic information and method for generating image database
CN110209862A (en) * 2019-05-22 2019-09-06 招商局金融科技有限公司 Text matches drawing method, electronic device and computer readable storage medium
CN110209862B (en) * 2019-05-22 2021-06-25 招商局金融科技有限公司 Text matching method, electronic device and computer readable storage medium
CN110222189A (en) * 2019-06-19 2019-09-10 北京百度网讯科技有限公司 Method and apparatus for output information
CN111753078A (en) * 2019-07-12 2020-10-09 北京京东尚科信息技术有限公司 Image paragraph description generation method, device, medium and electronic equipment
WO2021008213A1 (en) * 2019-07-12 2021-01-21 智慧芽信息科技(苏州)有限公司 Image database establishing method, searching method, electronic device, and storage medium
CN111753078B (en) * 2019-07-12 2024-02-02 北京京东尚科信息技术有限公司 Image paragraph description generation method, device, medium and electronic equipment
CN112015936A (en) * 2020-08-27 2020-12-01 北京字节跳动网络技术有限公司 Method, device, electronic equipment and medium for generating article display diagram
CN112015936B (en) * 2020-08-27 2021-10-26 北京字节跳动网络技术有限公司 Method, device, electronic equipment and medium for generating article display diagram
CN112287173A (en) * 2020-10-30 2021-01-29 北京有竹居网络技术有限公司 Method and apparatus for generating information
CN112613293A (en) * 2020-12-29 2021-04-06 北京中科闻歌科技股份有限公司 Abstract generation method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN109697239B (en) 2022-05-31
WO2020103899A1 (en) 2020-05-28

Similar Documents

Publication Publication Date Title
CN109697239A (en) Method for generating the method for graph text information and for generating image data base
CN107679039B (en) Method and device for determining statement intention
CN110287479B (en) Named entity recognition method, electronic device and storage medium
CN107491547B (en) Search method and device based on artificial intelligence
CN107491534B (en) Information processing method and device
CN107346336B (en) Information processing method and device based on artificial intelligence
CN109543058B (en) Method, electronic device, and computer-readable medium for detecting image
CN109117777A (en) The method and apparatus for generating information
CN108171276A (en) For generating the method and apparatus of information
CN108121699B (en) Method and apparatus for outputting information
CN106919711B (en) Method and device for labeling information based on artificial intelligence
CN109241286A (en) Method and apparatus for generating text
CN109408824A (en) Method and apparatus for generating information
CN110275963A (en) Method and apparatus for output information
CN109299477A (en) Method and apparatus for generating text header
CN112131881B (en) Information extraction method and device, electronic equipment and storage medium
CN109325121A (en) Method and apparatus for determining the keyword of text
US20240143684A1 (en) Information presentation method and apparatus, and device and medium
CN109583389A (en) Draw this recognition methods and device
CN109284367A (en) Method and apparatus for handling text
CN109271624A (en) A kind of target word determines method, apparatus and storage medium
CN110019948A (en) Method and apparatus for output information
Fuad et al. Analysis and classification of mobile apps using topic modeling: A case study on Google Play Arabic apps
CN113011126B (en) Text processing method, text processing device, electronic equipment and computer readable storage medium
CN111800650B (en) Video dubbing method and device, electronic equipment and computer readable medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20200702

Address after: Room 909, No. 12, Huangshan Road, Jianye District, Nanjing City, Jiangsu Province

Applicant after: Nanjing Shangwang Network Technology Co.,Ltd.

Address before: 312599 B-2, 3 floor, Jiayi Plaza, 127 East Renmin Road, Nanming street, Xinchang, Shaoxing, Zhejiang

Applicant before: LIANSHANG (XINCHANG) NETWORK TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant