WO2024000554A1 - Artificial intelligence-based method for configuring image for information, device, medium, and program product - Google Patents

Artificial intelligence-based method for configuring image for information, device, medium, and program product Download PDF

Info

Publication number
WO2024000554A1
WO2024000554A1 PCT/CN2022/103226 CN2022103226W WO2024000554A1 WO 2024000554 A1 WO2024000554 A1 WO 2024000554A1 CN 2022103226 W CN2022103226 W CN 2022103226W WO 2024000554 A1 WO2024000554 A1 WO 2024000554A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
information content
tag
artificial intelligence
topic tag
Prior art date
Application number
PCT/CN2022/103226
Other languages
French (fr)
Chinese (zh)
Inventor
李明宇
庄建家
王嘉楠
吴志伟
郑泽伟
Original Assignee
富途网络科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 富途网络科技(深圳)有限公司 filed Critical 富途网络科技(深圳)有限公司
Priority to PCT/CN2022/103226 priority Critical patent/WO2024000554A1/en
Priority to CN202280002306.3A priority patent/CN115298660A/en
Publication of WO2024000554A1 publication Critical patent/WO2024000554A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/5866Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, manually generated location and time information
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/353Clustering; Classification into predefined classes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295Named entity recognition

Definitions

  • This application relates to the field of computer technology, specifically to an information mapping method, device, equipment, medium and program product based on artificial intelligence.
  • the current information matching methods mainly include two types: random matching of cover pictures and manual configuration of cover pictures.
  • random matching of cover pictures some pictures without a specific theme are randomly used as the cover image.
  • the main disadvantage is that the theme of the picture is not clear and it seems to have nothing to do with the cover image. The user feels that the picture and text have nothing to do with it, and the list reading effect is not good. good.
  • When manually configuring the cover image you manually find pictures related to the theme of a single article and upload them as the information cover image.
  • the main disadvantage is that the work is too repetitive and cumbersome, and the matching of pictures for a large amount of information requires a lot of manpower.
  • Embodiments of the present application provide an information mapping method, device, equipment, media and program product based on artificial intelligence, which can realize intelligent configuration of cover images, reduce labor costs, enhance the degree of fit between information content and graphics, and improve the quality of information The efficiency and effectiveness of illustrations.
  • inventions of the present application provide an information mapping method based on artificial intelligence.
  • the method includes:
  • the picture corresponding to the target topic tag is used as the cover image of the information content.
  • inventions of the present application provide an information mapping device based on artificial intelligence.
  • the device includes:
  • Acquisition unit used to obtain information content
  • a determining unit configured to determine the information topic tag of the information content based on a preset algorithm model
  • a judging unit configured to judge whether there is a target topic tag matching the information topic tag in the first gallery, wherein the pictures in the first gallery are marked with corresponding topic tags;
  • a processing unit configured to, if there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as a cover image of the information content.
  • inventions of the present application provide a computer device.
  • the computer device includes a processor and a memory.
  • a computer program is stored in the memory.
  • the processor calls the computer program stored in the memory. Used to execute the information mapping method based on artificial intelligence as described in any of the above embodiments.
  • embodiments of the present application provide a computer-readable storage medium that stores a computer program, and the computer program is suitable for loading by a processor to execute the steps described in any of the above embodiments.
  • Information mapping method based on artificial intelligence.
  • embodiments of the present application provide a computer program product that includes computer instructions.
  • the computer instructions are executed by a processor, the artificial intelligence-based information mapping method described in any of the above embodiments is implemented.
  • the embodiment of the present application obtains information content; determines the information topic tag of the information content based on a preset algorithm model; and determines whether there is a target topic tag matching the information topic tag in the first gallery, where the pictures in the first gallery are marked with The corresponding topic tag; if there is a target topic tag matching the information topic tag in the first gallery, the picture corresponding to the target topic tag will be used as the cover image of the information content.
  • an algorithm based on artificial intelligence is used to identify information topic tags, which improves the accuracy of information positioning for information of various contents; and tag retrieval and matching is performed in the first image gallery, and the information in the first image gallery is
  • the picture corresponding to the target topic tag that matches the topic tag is used as the cover image of the information content, which can realize intelligent configuration of the cover image, reduce labor costs, enhance the fit between the information content and the accompanying images, and improve the efficiency and effect of the information matching.
  • Figure 1 is a schematic flowchart of an information mapping method based on artificial intelligence provided by an embodiment of the present application.
  • Figure 2 is a schematic diagram of the first application scenario provided by the embodiment of the present application.
  • Figure 3 is a schematic diagram of the second application scenario provided by the embodiment of the present application.
  • Figure 4 is a schematic diagram of the third application scenario provided by the embodiment of the present application.
  • Figure 5 is a schematic diagram of the fourth application scenario provided by the embodiment of the present application.
  • Figure 6 is a schematic diagram of the fifth application scenario provided by the embodiment of the present application.
  • Figure 7 is a flow sequence diagram of the artificial intelligence-based information mapping method provided by the embodiment of the present application.
  • Figure 8 is a schematic structural diagram of an information mapping device based on artificial intelligence provided by an embodiment of the present application.
  • Figure 9 is a schematic structural diagram of a server provided by an embodiment of the present application.
  • Embodiments of the present application provide an information mapping method, device, terminal equipment and storage medium based on artificial intelligence.
  • the information mapping method based on artificial intelligence in the embodiment of the present application can be executed by a computer device, where the computer device can be a terminal or a server.
  • the terminal can be a smartphone, tablet, laptop, desktop computer, smart TV, smart speaker, wearable smart device, smart vehicle terminal and other devices.
  • the terminal can also include a client, which can be a financial client, browser server client or instant messaging client, etc.
  • the server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers. It can also provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, and middleware.
  • Cloud servers include software services, domain name services, security services, content distribution network services, and basic cloud computing services such as big data and artificial intelligence platforms, but are not limited to these.
  • Figure 1 is a schematic flow chart of the artificial intelligence-based information mapping method provided by the embodiment of the present application.
  • Figures 2 to 6 are schematic diagrams of application scenarios provided by the embodiment of the present application.
  • Figure 7 is a schematic diagram of the application scenario.
  • the information mapping method based on artificial intelligence in the embodiment of the present application can be applied to the server. The method includes the following steps:
  • Step 110 Obtain information content.
  • the obtained information content includes:
  • the information content is obtained, and the information content is stored in the information content library.
  • the acquisition method of an information content can include crawling the content source, automatically reviewing it, or manually creating a new one and adding it to the database.
  • crawler tools are used to crawl information content from the information content source.
  • the crawler tool is a program or script that automatically crawls World Wide Web information according to certain rules.
  • the crawler tool initiates a request to the target site through the HTTP library, that is, it sends a Request.
  • the request can contain additional headers and other information and waits for the server to respond; if the server can respond normally, it will get a Response, and the content of the Response is the content of the page to be obtained.
  • the type may include HTML, Json string, binary data (such as pictures and videos), etc.;
  • the obtained content can be HTML, which can be parsed using regular expressions and web page parsing libraries;
  • the obtained content can also be Json, which can be directly converted It is parsed for Json objects, usually binary data, which can be saved or further processed;
  • the information crawled by the crawler tool can be saved as text, saved to a database, or saved as a file in a specific format.
  • the information content can also be manually added to the database, and the information content can be obtained in response to a database entry request for the information content sent by the background management device, and the information content can be stored in the information content database.
  • the method further includes:
  • the step of determining the information topic tag of the information content based on the preset algorithm model is performed.
  • the method further includes:
  • a first prompt message indicating that the review fails is generated, and the information content and the first prompt message are sent to the backend management equipment.
  • the information content crawled by the content source will be stored in the information content library first. Before step 120, the information content needs to be subjected to information review processes such as sensitive word matching and filtering rule verification through the information review process.
  • the sensitive thesaurus and the filtering rule thesaurus are pre-selected and built thesaurus.
  • text matching is performed on the title, information source, text and other fields of the information content. If the sensitive word or the filter word is hit, Then it is automatically determined that the review has failed, and the operator needs to conduct manual review before determining whether to further perform step 120. If the sensitive word is not hit and the filter word is not hit, it is automatically determined that the review is passed, and step 120 is further executed.
  • Step 120 Determine the information topic tag of the information content based on a preset algorithm model.
  • determining the information topic tag of the information content based on a preset algorithm model includes:
  • the preset algorithm model includes an entity extraction model and a keyword extraction model, and the information title text is processed based on the preset algorithm model to determine the information topic tag of the information content.
  • the information title text is processed based on an entity extraction model to obtain the entity type label of the information content, wherein the entity extraction model is used to extract company, industry and person name information in the information title text;
  • the information topic tag of the information content is determined.
  • determining the information topic tag of the information content based on the entity type tag and the keyword tag includes:
  • the first-ranked tag in the tag list is determined as the information topic tag of the information content.
  • an information topic tag API interface can be provided on the preset algorithm model side.
  • the entity extraction model (denoted as M1) and the keyword extraction model (denoted as M2 ), ultimately returning a list of tags.
  • the first-ranked tag in the backend selection list is displayed as the information topic tag.
  • the entity extraction model and keyword extraction model of the information topic tag API interface share the same model structure from the model level, but they design different entity tags in the data collection stage, so the training data constructed are different, and the final extraction Ability to achieve different results.
  • the entity extraction model mainly performs data annotation on the three most important entity types that are concerned in the financial field: companies, industries, and names of people.
  • the keyword extraction model mainly annotates data for common and important words in financial information, such as: inflation, stock market, Hong Kong stocks, Federal Reserve, etc.
  • the Bert model is composed of an Embedding layer and 12 Transformer layers, with a total of 110 million parameters.
  • the model parameters are very large.
  • the Bert model serves as text_encoder to extract features from the input information title text.
  • the input information title text will first go through the tokenizer (Tokenizer) in Bert to obtain a token sequence of length L.
  • the token sequence further converts the token text into word id according to the mapping relationship in vocab, obtaining [1,L ], and then input it into the Bert model.
  • Bert serves as the encoder.
  • the tensor of [1, L] passes through the Embedding layer to obtain the tensor of [L, D] dimensions.
  • the function of the full pointer layer is to use text_encoder to extract rich semantic information of the entity, and indicate the head and tail of the entity through a pointer matrix at one time, so that the position of the entity in the original text can be quickly located and directly extracted.
  • a simplified version of the Multi-Head Attention module is used to implement this function.
  • the Multi-Head Attention module uses three matrices Q (query), K (all keys), and V (values) to perform matrix calculations, and then performs Scaled Dot-Product Attention calculations.
  • the relevant formulas are as follows:
  • S ⁇ (i, j) represents the pointer square matrix of the ⁇ th type entity, and its shape is [L, L].
  • n_labels classes each entity category will calculate such a pointer square matrix, so the entire
  • the output of the full pointer layer is a tensor of [n_labels,L,L].
  • the rows of S ⁇ represent the head position of the entity, and the columns represent the tail position of the entity. Therefore, although S ⁇ is a square matrix, only the upper triangular part has practical significance, and the output of the lower triangle is not considered directly.
  • the function of the classification output layer is to extract entities from the output pointer square matrix. Values greater than 0 in [n_labels, L, L] are considered to be the head and tail of activated entities. Therefore, this layer converts the logits output by the model into 0/1 In a binary square matrix, the head and tail of the activated entity are set to 1, and the rest are set to 0.
  • the API interface After the API interface receives the information title text, it first passes the text into the model M1. If the model does not return a result, that is, the entity extraction model M1 does not extract any company, industry, or person name information, it will continue to pass the title into the keywords. Extract model M2 and further extract keywords.
  • the obtained information text data is manually annotated. For example, by performing data annotation on the three most important entity types in the information text data: company, industry, and person's name. Get the first labeled data. By performing data annotation on the industry keywords in the information text data, the second annotation data is obtained.
  • the first annotated data is input into the preset algorithm model for model training.
  • the first annotated data and the Bert model are used to pre-train multiple transformers layers, global pointer layers and classification output layers, and the entity extraction model is obtained as M1).
  • Input the second annotated data into the preset algorithm model for model training. For example, use the second annotated data and the Bert model to pre-train multiple transformers layers, global pointer layers and classification output layers to obtain the keyword extraction model M2).
  • the information title text is input into the entity extraction model M1 for the first time to extract the company, industry and person name information in the information title text, and output the first result set; then, determine whether the first result set is empty , if the first result set is not empty, then the entity type tags contained in the first result set are included in the final result set; if the first result set is empty, the second input keyword of the information title text is retrieved.
  • model M2 the industry keyword information in the information title text is extracted, and a second result set is output, and the keyword tags contained in the second result set are included in the final result set.
  • a tag list can be constructed, and the back-end selects the first-ranked tag in the list to be displayed as the information topic tag. That is, when the algorithm provides multiple tags with the same frequency, the information side can select the first tag for image matching.
  • the above method obtains the entity extraction model M1 and the keyword extraction model M2 based on artificial intelligence training, and finally returns a tag list, and displays the tag ranked first in the back-end selection list as the information topic tag, which improves the determination of information topic tags. accuracy.
  • Step 130 Determine whether there is a target topic tag matching the information topic tag in the first image gallery, where the pictures in the first image gallery are annotated with the corresponding topic tag.
  • a search and matching is performed in the first image gallery to find whether there is a target topic tag matching the information topic tag in the first image gallery.
  • an information management platform interface is displayed on the background management device.
  • the first image gallery (the label image gallery as shown in Figure 3) can be maintained, in which the information
  • the thumbnail, ID information, storage time, corresponding topic tag, number of hits and other information of the picture can be displayed.
  • Step 140 If there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as the cover image of the information content.
  • the intelligent matching can extract the information theme tags of the information content according to the preset algorithm model, and match the pictures in the gallery according to the information theme tags to complete the intelligent matching of the cover image of the information content. . If there is a target topic tag matching the information topic tag in the first image gallery, the picture corresponding to the target topic tag will be used as the cover image of the information content.
  • the picture corresponding to the target topic tag as the cover image of the information content, including:
  • one picture can be randomly selected from the multiple pictures corresponding to the target topic label as the cover image of the information content. For example, according to the number of hits, you can select the picture with the most hits from the multiple pictures corresponding to the target topic tag as the cover image of the information content, so as to select the most commonly used pictures in the target topic tag for illustration, so as to fit the public as much as possible. aesthetic. For example, based on the number of hits, you can select the one with the least number of hits from the multiple pictures corresponding to the target topic tag as the cover image of the information content, so as to select the rarely used pictures in the target topic tag for illustration to increase the freshness of the illustration. .
  • the method further includes:
  • a random picture is obtained from the second image gallery as the cover image of the information content, wherein the picture in the first image gallery is Hashtags not tagged.
  • an information management platform interface is displayed on the background management device.
  • a second image gallery (the ordinary image gallery as shown in Figure 3) can be maintained, in which the information
  • the thumbnail and ID information of the picture can be displayed.
  • abstract pictures without specific themes in the second image gallery can be used as the cover image of the information content.
  • the method further includes:
  • the cover image of the information content is determined according to the configuration information carried in the cover image configuration request.
  • the server receives the cover image configuration request for the information content sent by the background management device, it determines the cover image for the information content based on the configuration information carried in the cover image configuration request. This can greatly increase the number of images. flexibility and improve operational efficiency.
  • the method further includes:
  • the method further includes:
  • the information stream When receiving the triggering operation for the cover image sent by the client, send the information flow of the information content to the client, so as to display the information of the information content on the information preview interface of the client.
  • the information stream includes at least one of information text data and information multimedia data corresponding to the information content.
  • the information flow may include information text data, information multimedia data, etc.
  • the information multimedia data may include multimedia resources such as images, animations, audios, and videos.
  • the information text data may include information title, ID, release time, information content details, information content release end information, etc.
  • the method further includes:
  • each of the image matching records includes a pair of information theme tags and an artificial intelligence-based information matching method corresponding to the information theme tag;
  • mapping record data count the data proportion of each artificial intelligence-based information mapping method within the preset period to obtain mapping statistics
  • the label statistical data and the accompanying picture statistical data are sent to a backend management device, so that the label statistical data and the accompanying picture statistical data are displayed on the backend management device.
  • a picture record will be recorded, and label statistics and picture statistics will use this as the data source statistical result. That is, data statistics are performed based on the picture recording data recorded within a preset period of time to determine the label statistics and picture statistics.
  • the preset time period may be one of the time periods of the last week, the last month, the last three months, the last year, etc.
  • Tag statistical data as shown in the application scenario diagram in Figure 5, when operators query tag statistics, they can enter a preset period, such as the last month, through the information management platform interface displayed on the background management device, and add the preset period Set a time period and send it to the server.
  • the server queries the single-day tag statistics within the preset time period according to the preset time period and aggregates them into tag statistics within the preset time period, and returns the tag statistics within the preset time period to the backend management. device to display.
  • tags By displaying tag statistical data on the background management device, operators can perform targeted statistics on frequently occurring tags when maintaining the first image gallery (the tag image gallery shown in Figure 3), and cover high-frequency tags as much as possible.
  • the information content of the information theme makes the gallery operation more efficient. For example, the operation staff recently discovered that "oil price” is a label that appears frequently, so they maintain multiple related cover images for the "oil price” label. In this way, when the information content of the same label appears again in the future, the intelligent image matching will take effect.
  • the frequency of occurrence of information topic tags is also counted to facilitate targeted maintenance of images.
  • the picture statistics data due to the large amount of picture record data, the required statistical results cannot be obtained simply by relying on the aggregation statistics of the database. Therefore, the picture statistics data within the previous 30 days need to be calculated daily and stored in the database.
  • the preset period is sent to the server.
  • the server queries the single-day picture matching statistics within the preset time period according to the preset time period and aggregates the picture matching statistics within the preset time period, and aggregates the picture matching statistics within the preset time period.
  • the graph statistical data is returned to the background management device for display.
  • the actual data proportion (coverage) of various types of graphics online can be displayed through the graphics statistical data, so as to have a more objective data evaluation of the intelligent graphics mechanism and provide a truly quantifiable statistical library coverage effect.
  • the types of graphics can include label mapping, random mapping, and manual mapping. In the mapping statistics shown in Figure 6, the coverage rate of label mapping is 3.40%, and the coverage rate of random mapping is 96.40%. Manual illustration coverage is 0.20%.
  • the method further includes:
  • the operator can maintain the first image gallery through the information management platform interface displayed on the background management device, input an update request for the first image gallery through the information management platform interface, and send the update request to the server, so that the server updates the first image gallery according to the update request.
  • At least one of the pictures and hashtags in if the update request is to update the first topic tag corresponding to the first picture, then the first topic tag corresponding to the first picture in the first gallery is updated according to the update request.
  • the update request is to update the second picture corresponding to the second topic tag, then the second picture corresponding to the second topic tag in the first gallery is updated according to the update request.
  • the update request is to add a third picture and its corresponding third topic tag, then the third picture and its corresponding third topic tag are added to the first gallery according to the update request.
  • the method further includes:
  • one image cannot create multiple identical tags. If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, a second prompt message indicating duplicate tags will be generated, and the update of the first image gallery will be rejected. gallery, and sends the second prompt information to the background management device to display the second prompt information on the background management device to remind the operator that the same tag already exists.
  • the server obtains information content
  • the server crawls the information content based on the information content source and stores the information content into the information content library;
  • the server responds to the storage request for information content sent by the background management device, obtains the information content, and stores the information content into the information content library;
  • the server audits the information content based on the preset audit rules.
  • the preset audit rules at least include sensitive word matching and filtering rule verification;
  • the server determines that the information review fails, generates the first prompt message that the review fails, and sends the information content and the first prompt message to the background.
  • Manage equipment
  • the background management device displays the first prompt message
  • step S1.6 If the information content does not match the sensitive words and the information content does not match the filtering rules, the server determines that the information review has passed and further proceeds to step S2;
  • the server determines the information topic tag of the information content based on the preset algorithm model
  • the server processes the information title text of the information content based on the entity extraction model to obtain the entity type label of the information content.
  • the entity extraction model is used to extract the company, industry and person name information in the information title text;
  • the server processes the information title text of the information content based on the keyword extraction model to obtain the keyword tags of the information content.
  • the keyword extraction model is used to extract industry keyword information in the information title text;
  • the server determines the information topic tag of the information content based on the entity type tag and keyword tag;
  • the server determines whether there is a target topic tag matching the information topic tag in the first gallery, and the pictures in the first gallery are annotated with the corresponding topic tag;
  • the server will use the picture corresponding to the target topic tag as the cover image of the information content;
  • the server obtains a random picture from the second image gallery as the cover image of the information content.
  • the pictures in the first image gallery are not labeled with topic tags;
  • the server receives the cover image configuration request for the information content sent by the background management device, it determines the cover image of the information content based on the configuration information carried in the cover image configuration request;
  • the information flow includes at least one of information text data and information multimedia data corresponding to the information content;
  • the server obtains the picture record data within the preset period.
  • Each picture record contains a pair of information topic tags and a method for determining the cover image corresponding to the information topic tag;
  • the server records the data based on the picture and counts the frequency of occurrence of each information topic tag within the preset period to obtain tag statistics;
  • the server records the data of the accompanying images and counts the data proportion of the determination method of each cover image within the preset period to obtain the statistical data of the accompanying images;
  • the server sends label statistics and picture statistics to the background management device
  • the background management device displays label statistics and picture statistics
  • the server When the server receives an update request for the first gallery sent by the background management device, it updates at least one of the pictures and topic tags in the first gallery according to the update request;
  • the server If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, the server generates a second prompt message indicating that the tag is repeated, and refuses to update the first gallery; in step S17, also The second prompt information can be sent to the background management device.
  • the background management device displays the second prompt information.
  • the artificial intelligence-based information mapping method obtained by the embodiment of the present application obtains information content; determines the information topic tag of the information content based on a preset algorithm model; and determines whether there is a target topic tag matching the information topic tag in the first image gallery. , where the pictures in the first gallery are annotated with corresponding topic tags; if there is a target topic tag matching the information topic tag in the first gallery, the picture corresponding to the target topic tag is used as the cover image of the information content.
  • information topic tags are identified through an algorithm, tag retrieval and matching is performed in the first gallery, and the picture corresponding to the target topic tag that matches the information topic tag in the first gallery is used as the cover image of the information content, which can be achieved Intelligent configuration of cover images reduces labor costs, enhances the fit between information content and accompanying images, and improves the efficiency and effect of information accompanying images.
  • the embodiment of the present application also provides a client.
  • Figure 8 is a schematic structural diagram of an artificial intelligence-based information mapping device provided by an embodiment of the present application.
  • the information mapping device 200 based on artificial intelligence may include:
  • Obtaining unit 210 is used to obtain information content
  • the determining unit 220 is configured to determine the information topic tag of the information content based on a preset algorithm model
  • the judging unit 230 is used to judge whether there is a target theme tag matching the information theme tag in the first gallery, wherein the pictures in the first gallery are marked with corresponding theme tags;
  • the processing unit 240 is configured to, if there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as the cover image of the information content.
  • the determining unit 220 is specifically configured to: obtain the information title text corresponding to the information text data in the information content, and process the information title text based on the preset algorithm model to determine The information topic tag of the information content.
  • the preset algorithm model includes an entity extraction model and a keyword extraction model
  • the determination unit 220 processes the information title text based on the preset algorithm model to determine the information content.
  • the information topic label is specifically used to: process the information title text based on an entity extraction model to obtain the entity type label of the information content, wherein the entity extraction model is used to extract the information title text.
  • Company, industry and person name information the information title text is processed based on a keyword extraction model to obtain the keyword tags of the information content, wherein the keyword extraction model is used to extract the industry in the information title text Keyword information: determine the information topic tag of the information content according to the entity type tag and the keyword tag.
  • the determining unit 220 when determining the information topic tag of the information content based on the entity type tag and the keyword tag, is specifically configured to: based on the entity type tag and the key word tag The word tag generates a tag list; the tag ranked first in the tag list is determined as the information topic tag of the information content.
  • the acquisition unit 210 is specifically configured to: crawl information content based on the information content source and store the information content into the information content library; or respond to the information sent by the background management device for the information content.
  • the information content warehousing request obtains the information content and stores the information content into the information content library.
  • the information mapping device 200 based on artificial intelligence is used to obtain information content, it is also used to: conduct information review of the information content based on preset review rules, wherein the preset review The rules at least include sensitive word matching and filtering rule verification; if the information content does not match the sensitive word, and the information content does not match the filtering rule, then perform the step of determining the information topic tag of the information content based on the preset algorithm model .
  • the artificial intelligence-based information mapping device 200 is also configured to: if the information content hits a sensitive word, and/or the information content hits a filtering rule, generate a first review failed Prompt information, and send the information content and the first prompt information to the background management device.
  • the processing unit 240 is also configured to: if there is no target topic tag matching the information topic tag in the first image gallery, obtain a random picture from the second image gallery as the The cover picture of the information content, wherein the pictures in the first gallery are not labeled with topic tags.
  • the processing unit 240 is further configured to: when receiving a cover image configuration request for the information content sent by the background management device, determine the cover image configuration request based on the configuration information carried in the cover image configuration request. Cover image of the information content.
  • the processing unit 240 is further configured to: when receiving a cover image display request for the information content sent by the client, send the cover image of the information content to the client, so as to Display the cover image of the information content on the information preview interface of the client.
  • the processing unit 240 is further configured to: when receiving a triggering operation for the cover image sent by the client, send the information stream of the information content to the client, so as to The information flow of the information content is displayed on the information preview interface of the client.
  • the information flow includes at least one of information text data and information multimedia data corresponding to the information content.
  • the processing unit 240 is also configured to: obtain image record data within a preset period, wherein each image record includes a pair of information topic tags corresponding to the information topic tag. How to determine the cover image; according to the accompanying image record data, count the frequency of occurrence of each information theme tag within the preset period to obtain tag statistical data; according to the accompanying image record data, count the The data proportion of the determined method of each cover image within the preset period is used to obtain the accompanying image statistical data; the label statistical data and the accompanying image statistical data are sent to the background management device for management in the background The label statistical data and the accompanying picture statistical data are displayed on the device.
  • the processing unit 240 is further configured to: when receiving an update request for the first image gallery sent by the background management device, update the information in the first image gallery according to the update request. At least one of pictures and hashtags.
  • the processing unit 240 is further configured to: if it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, generate a third tag indicating that the tag is repeated. The second prompt message and refusal to update the first gallery.
  • the processing unit 240 is configured to: if there is a target topic tag matching the information topic tag in the first gallery, and there are multiple pictures corresponding to the target topic tag, then from Select one picture from the plurality of pictures corresponding to the target topic tag as the cover picture of the information content.
  • the artificial intelligence-based information mapping device embodiments and method embodiments may correspond to each other, and similar descriptions may refer to the method embodiments. To avoid repetition, they will not be repeated here.
  • the artificial intelligence-based information mapping device shown in Figure 8 can execute the above-mentioned artificial intelligence-based information mapping method embodiment, and the aforementioned and other operations of each unit in the artificial intelligence-based information mapping device and/or or functions respectively implement the corresponding processes of the above method embodiments. For the sake of simplicity, they will not be described again here.
  • this application also provides a computer device, including a memory and a processor.
  • a computer program is stored in the memory.
  • the processor executes the computer program, it implements the steps in the above method embodiments.
  • FIG. 9 is a schematic structural diagram of a computer device provided by an embodiment of the present application.
  • the computer device may be a terminal or a server.
  • the computer device 300 may include: a communication interface 301 , a memory 302 , a processor 303 and a communication bus 304 .
  • the communication interface 301, the memory 302, and the processor 303 realize communication with each other through the communication bus 304.
  • the communication interface 301 is used for data communication between the computer device 300 and external devices.
  • the memory 302 can be used to store software programs and modules, and the processor 303 runs the software programs and modules stored in the memory 302, such as the software programs for corresponding operations in the foregoing method embodiments.
  • the processor 303 can call software programs and modules stored in the memory 302 to perform the following operations: obtain information content; determine the information topic tag of the information content; determine whether the information topic tag exists in the first gallery Matching target topic tags, wherein the pictures in the first gallery are marked with corresponding topic tags; if there is a target topic tag matching the information topic tag in the first gallery, then the target The picture corresponding to the topic tag serves as the cover image of the information content.
  • embodiments of the present application provide a computer-readable storage medium in which multiple computer programs are stored.
  • the computer programs can be loaded by the processor to execute any of the artificial intelligence-based methods provided by the embodiments of the present application. Steps in the information mapping method.
  • steps in the information mapping method For the specific implementation of each of the above operations, please refer to the previous embodiments and will not be described again here.
  • the storage medium may include: read-only memory (Read Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk, etc.
  • the computer program stored in the storage medium can execute the steps in any of the artificial intelligence-based information mapping methods provided by the embodiments of the present application, it is possible to implement any of the artificial intelligence-based information mapping methods provided by the embodiments of the present application.
  • the beneficial effects that the artificial intelligence information mapping method can achieve are detailed in the previous embodiments and will not be described again here.
  • Embodiments of the present application also provide a computer program product.
  • the computer program product includes computer instructions, and the computer instructions are stored in a computer-readable storage medium.
  • the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, causing the computer device to execute the corresponding process in any of the artificial intelligence-based information mapping methods in the embodiments of the present application, For the sake of brevity, no further details will be given here.
  • An embodiment of the present application also provides a computer program.
  • the computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium.
  • the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, causing the computer device to execute the corresponding process in any of the artificial intelligence-based information mapping methods in the embodiments of the present application, For the sake of brevity, no further details will be given here.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Library & Information Science (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Stored Programmes (AREA)

Abstract

The present application discloses an artificial intelligence-based method and apparatus for configuring an image for information, a device, a medium, and a program product. The method comprises: obtaining information content; determining an information theme label of the information content on the basis of a preset algorithm model; determining whether a target theme label matching the information theme label is present in a first image library, wherein images in the first image library are labeled with corresponding theme labels; and if a target theme label matching the information theme label is present in the first image library, using an image corresponding to the target theme label as a cover image of the information content.

Description

基于人工智能的资讯配图方法、设备、介质及程序产品Information mapping methods, equipment, media and program products based on artificial intelligence 技术领域Technical field
本申请涉及计算机技术领域,具体涉及一种基于人工智能的资讯配图方法、装置、设备、介质及程序产品。This application relates to the field of computer technology, specifically to an information mapping method, device, equipment, medium and program product based on artificial intelligence.
背景技术Background technique
在财经资讯运营工作中,配置封面图的工作,往往比较繁琐,每天面对海量资讯,运营人员需要找到版权及质量都合适的图片来配置封面图,工作重复且收效不高。In financial information operations, the work of configuring cover images is often tedious. Faced with massive amounts of information every day, operators need to find pictures with suitable copyright and quality to configure cover images. The work is repetitive and ineffective.
目前的资讯配封面图方式,主要包括两种随机配封面图与人工配置封面图。其中,当随机配封面图时,使用一些无特定主题的图片随机用作封面图,主要缺点是图片主题不明确,看起来与封面图没有关系,用户感觉“图文无关”,列表阅读效果不好。当人工配置封面图时,通过人工找出与单篇资讯主题相关的图片并上传为资讯封面图,主要缺点是工作过于重复繁琐,大量资讯的配图需要耗费很多人力。The current information matching methods mainly include two types: random matching of cover pictures and manual configuration of cover pictures. Among them, when randomly matching the cover image, some pictures without a specific theme are randomly used as the cover image. The main disadvantage is that the theme of the picture is not clear and it seems to have nothing to do with the cover image. The user feels that the picture and text have nothing to do with it, and the list reading effect is not good. good. When manually configuring the cover image, you manually find pictures related to the theme of a single article and upload them as the information cover image. The main disadvantage is that the work is too repetitive and cumbersome, and the matching of pictures for a large amount of information requires a lot of manpower.
发明内容Contents of the invention
本申请实施例提供一种基于人工智能的资讯配图方法、装置、设备、介质及程序产品,可以实现智能配置封面图,降低人工成本,增强资讯内容与配图的贴合程度,提高了资讯配图的效率和效果。Embodiments of the present application provide an information mapping method, device, equipment, media and program product based on artificial intelligence, which can realize intelligent configuration of cover images, reduce labor costs, enhance the degree of fit between information content and graphics, and improve the quality of information The efficiency and effectiveness of illustrations.
一方面,本申请实施例提供一种基于人工智能的资讯配图方法,所述方法包括:On the one hand, embodiments of the present application provide an information mapping method based on artificial intelligence. The method includes:
获取资讯内容;Obtain information content;
基于预设算法模型确定所述资讯内容的资讯主题标签;Determine the information topic tag of the information content based on a preset algorithm model;
判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;Determine whether there is a target topic tag matching the information topic tag in the first image gallery, wherein the pictures in the first image gallery are marked with corresponding topic tags;
若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。If there is a target topic tag matching the information topic tag in the first image gallery, the picture corresponding to the target topic tag is used as the cover image of the information content.
另一方面,本申请实施例提供一种基于人工智能的资讯配图装置,所述装置包括:On the other hand, embodiments of the present application provide an information mapping device based on artificial intelligence. The device includes:
获取单元,用于获取资讯内容;Acquisition unit, used to obtain information content;
确定单元,用于基于预设算法模型确定所述资讯内容的资讯主题标签;A determining unit, configured to determine the information topic tag of the information content based on a preset algorithm model;
判断单元,用于判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;A judging unit configured to judge whether there is a target topic tag matching the information topic tag in the first gallery, wherein the pictures in the first gallery are marked with corresponding topic tags;
处理单元,用于若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。A processing unit configured to, if there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as a cover image of the information content.
另一方面,本申请实施例提供一种计算机设备,所述计算机设备包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器通过调用所述存储器中存储的所述计算机程序,用于执行如上任一实施例所述的基于人工智能的资讯配图方法。On the other hand, embodiments of the present application provide a computer device. The computer device includes a processor and a memory. A computer program is stored in the memory. The processor calls the computer program stored in the memory. Used to execute the information mapping method based on artificial intelligence as described in any of the above embodiments.
另一方面,本申请实施例提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序适于处理器进行加载,以执行如上任一实施例所述的基于人工智能的资讯配图方法。On the other hand, embodiments of the present application provide a computer-readable storage medium that stores a computer program, and the computer program is suitable for loading by a processor to execute the steps described in any of the above embodiments. Information mapping method based on artificial intelligence.
另一方面,本申请实施例提供一种计算机程序产品,包括计算机指令,所述计算机指令被处理器执行时实现如上任一实施例所述的基于人工智能的资讯配图方法。On the other hand, embodiments of the present application provide a computer program product that includes computer instructions. When the computer instructions are executed by a processor, the artificial intelligence-based information mapping method described in any of the above embodiments is implemented.
本申请实施例通过获取资讯内容;基于预设算法模型确定资讯内容的资讯主题标签;判断第一图库中是否存在与资讯主题标签相匹配的目标主题标签,其中,第一图库中的图片标注有对应的主题标签;若第一图库中存在与资讯主题标签相匹配的目标主题标签,则将目标主题标签对应的图片作为资讯内容的封面图。本申请实施例,通过基于人工智能的算法识别资讯主题标签,对于各种内容的资讯来说提高了资讯定位的精确性;并在第一图库中进行标签检索匹配,将第一图库中与资讯主题标签相匹配的目标主题标签对应的图片作为资讯内容的封面图,可以实现智能配置封面图,降低人工成本,增强资讯内容与配图的贴合程度,提高了资讯配图的效率和效果。The embodiment of the present application obtains information content; determines the information topic tag of the information content based on a preset algorithm model; and determines whether there is a target topic tag matching the information topic tag in the first gallery, where the pictures in the first gallery are marked with The corresponding topic tag; if there is a target topic tag matching the information topic tag in the first gallery, the picture corresponding to the target topic tag will be used as the cover image of the information content. In the embodiment of the present application, an algorithm based on artificial intelligence is used to identify information topic tags, which improves the accuracy of information positioning for information of various contents; and tag retrieval and matching is performed in the first image gallery, and the information in the first image gallery is The picture corresponding to the target topic tag that matches the topic tag is used as the cover image of the information content, which can realize intelligent configuration of the cover image, reduce labor costs, enhance the fit between the information content and the accompanying images, and improve the efficiency and effect of the information matching.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings needed to be used in the description of the embodiments will be briefly introduced below. Obviously, the drawings in the following description are only some embodiments of the present application. For those skilled in the art, other drawings can also be obtained based on these drawings without exerting creative efforts.
图1为本申请实施例提供的基于人工智能的资讯配图方法的流程示意图。Figure 1 is a schematic flowchart of an information mapping method based on artificial intelligence provided by an embodiment of the present application.
图2为本申请实施例提供的第一应用场景示意图。Figure 2 is a schematic diagram of the first application scenario provided by the embodiment of the present application.
图3为本申请实施例提供的第二应用场景示意图。Figure 3 is a schematic diagram of the second application scenario provided by the embodiment of the present application.
图4为本申请实施例提供的第三应用场景示意图。Figure 4 is a schematic diagram of the third application scenario provided by the embodiment of the present application.
图5为本申请实施例提供的第四应用场景示意图。Figure 5 is a schematic diagram of the fourth application scenario provided by the embodiment of the present application.
图6为本申请实施例提供的第五应用场景示意图。Figure 6 is a schematic diagram of the fifth application scenario provided by the embodiment of the present application.
图7为本申请实施例提供的基于人工智能的资讯配图方法的流程时序图。Figure 7 is a flow sequence diagram of the artificial intelligence-based information mapping method provided by the embodiment of the present application.
图8为本申请实施例提供的基于人工智能的资讯配图装置的结构示意图。Figure 8 is a schematic structural diagram of an information mapping device based on artificial intelligence provided by an embodiment of the present application.
图9为本申请实施例提供的服务器的结构示意图。Figure 9 is a schematic structural diagram of a server provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only some of the embodiments of the present application, rather than all of the embodiments. Based on the embodiments in this application, all other embodiments obtained by those skilled in the art without making creative efforts fall within the scope of protection of this application.
本申请实施例提供一种基于人工智能的资讯配图方法、装置、终端设备和存储介质。具体地,本申请实施例的基于人工智能的资讯配图方法可以由计算机设备执行,其中,该计算机设备可以为终端或者服务器等设备。该终端可以为智能手机、平板电脑、笔记本电脑、台式计算机、智能电视、智能音箱、穿戴式智能设备、智能车载终端等设备,终端还可以包括客户端,该客户端可以是金融客户端、浏览器客户端或即时通信客户端等。服务器可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、内容分发网络服务、以及大数据和人工智能平台等基础云计算服务的云服务器,但并不局限于此。Embodiments of the present application provide an information mapping method, device, terminal equipment and storage medium based on artificial intelligence. Specifically, the information mapping method based on artificial intelligence in the embodiment of the present application can be executed by a computer device, where the computer device can be a terminal or a server. The terminal can be a smartphone, tablet, laptop, desktop computer, smart TV, smart speaker, wearable smart device, smart vehicle terminal and other devices. The terminal can also include a client, which can be a financial client, browser server client or instant messaging client, etc. The server can be an independent physical server, or a server cluster or distributed system composed of multiple physical servers. It can also provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, and middleware. Cloud servers include software services, domain name services, security services, content distribution network services, and basic cloud computing services such as big data and artificial intelligence platforms, but are not limited to these.
以下分别进行详细说明。需说明的是,以下实施例的描述顺序不作为对实施例优先顺序的限定。Each is explained in detail below. It should be noted that the description order of the following embodiments is not used to limit the priority order of the embodiments.
请参阅图1至图7,图1为本申请实施例提供的基于人工智能的资讯配图方法的流程示意图,图2至图6均为本申请实施例提供的应用场景示意图,图7为本申请实施例提供的基于人工智能的资讯配图方法的流程时序图。本申请实施例的基于人工智能的资讯配图方法可应用于服务器。该方法包括以下步骤:Please refer to Figures 1 to 7. Figure 1 is a schematic flow chart of the artificial intelligence-based information mapping method provided by the embodiment of the present application. Figures 2 to 6 are schematic diagrams of application scenarios provided by the embodiment of the present application. Figure 7 is a schematic diagram of the application scenario. A flow sequence diagram of the artificial intelligence-based information mapping method provided in the application embodiment. The information mapping method based on artificial intelligence in the embodiment of the present application can be applied to the server. The method includes the following steps:
步骤110,获取资讯内容。Step 110: Obtain information content.
在一些实施例中,所述获取资讯内容,包括:In some embodiments, the obtained information content includes:
基于资讯内容源爬取资讯内容,并将所述资讯内容入库至资讯内容库中;或者Crawl the information content based on the information content source and store the information content into the information content library; or
响应于后台管理设备发送的针对所述资讯内容的入库请求,获取所述资讯内容,并将所述资讯内容入库至资讯内容库中。In response to the storage request for the information content sent by the background management device, the information content is obtained, and the information content is stored in the information content library.
其中,资讯运营不可缺少的一部分就是资讯配图,一篇资讯内容的获取方式可以包括内容源爬取自动审核入库或者人工新建入库。Among them, an indispensable part of information operations is information mapping. The acquisition method of an information content can include crawling the content source, automatically reviewing it, or manually creating a new one and adding it to the database.
例如,在与资讯原网站达成版权合作的基础上,基于爬虫工具从资讯内容源爬取资讯内容。该爬虫工具是一种按照必定的规则,自动地抓取万维网信息的程序或者脚本。爬虫工具经过HTTP库向目标站点发起请求,即发送一个Request,请求能够包含额外的headers等信息,等待服务器响应;若是服务器能正常响应,会获得一个Response,Response的内容即是所要获取的页面内容,类型可能有HTML,Json字符串,二进制数 据(如图片视频)等类型;获得的内容可以是HTML,能够用正则表达式、网页解析库进行解析;获取的内容也可以是Json,能够直接转为Json对象解析,一般为二进制数据,能够作保存或者进一步的处理;爬虫工具爬取的资讯内容能够存为文本,也能够保存至数据库,或者保存特定格式的文件。For example, on the basis of copyright cooperation with the original information website, crawler tools are used to crawl information content from the information content source. The crawler tool is a program or script that automatically crawls World Wide Web information according to certain rules. The crawler tool initiates a request to the target site through the HTTP library, that is, it sends a Request. The request can contain additional headers and other information and waits for the server to respond; if the server can respond normally, it will get a Response, and the content of the Response is the content of the page to be obtained. , the type may include HTML, Json string, binary data (such as pictures and videos), etc.; the obtained content can be HTML, which can be parsed using regular expressions and web page parsing libraries; the obtained content can also be Json, which can be directly converted It is parsed for Json objects, usually binary data, which can be saved or further processed; the information crawled by the crawler tool can be saved as text, saved to a database, or saved as a file in a specific format.
例如,也可以对资讯内容进行人工新建入库,响应于后台管理设备发送的针对资讯内容的入库请求,获取资讯内容,并将资讯内容入库至资讯内容库中。For example, the information content can also be manually added to the database, and the information content can be obtained in response to a database entry request for the information content sent by the background management device, and the information content can be stored in the information content database.
在一些实施例中,在所述获取资讯内容之后,还包括:In some embodiments, after obtaining the information content, the method further includes:
基于预设审核规则对所述资讯内容进行资讯审核,其中,所述预设审核规则至少包括敏感词匹配与过滤规则校验;Conduct information review on the information content based on preset review rules, where the preset review rules at least include sensitive word matching and filtering rule verification;
若所述资讯内容未命中敏感词,且所述资讯内容未命中过滤规则,则执行基于预设算法模型确定所述资讯内容的资讯主题标签的步骤。If the information content does not match the sensitive word and the information content does not match the filtering rule, then the step of determining the information topic tag of the information content based on the preset algorithm model is performed.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
若所述资讯内容命中敏感词,和/或所述资讯内容命中过滤规则,则生成审核不通过的第一提示信息,并将所述资讯内容与所述第一提示信息发送至所述后台管理设备。If the information content hits a sensitive word, and/or the information content hits a filtering rule, a first prompt message indicating that the review fails is generated, and the information content and the first prompt message are sent to the backend management equipment.
例如,以资讯内容源爬取获取的资讯内容为例,资讯内容源爬取占据了一大部分资讯内容的来源,通过这种方式入库的资讯大多数是没有配图的,则后续需要根据资讯内容获取资讯主题标签,自动生成配图。For example, take the information content obtained by crawling the information content source as an example. The crawling of the information content source occupies a large part of the source of the information content. Most of the information stored in this way does not have pictures, so the follow-up needs to be based on Information content obtains information topic tags and automatically generates accompanying images.
内容源爬取入库的资讯内容会先入库至资讯内容库,在步骤120之前,需通过资讯审核流程对资讯内容进行敏感词匹配、过滤规则校验等资讯审核流程。The information content crawled by the content source will be stored in the information content library first. Before step 120, the information content needs to be subjected to information review processes such as sensitive word matching and filtering rule verification through the information review process.
其中,敏感词库和过滤规则词库,是预选搭建的词库,在资讯内容入库时通过对资讯内容的标题、资讯来源、正文等字段进行文本匹配,若命中敏感词或命中过滤词,则自动判断为审核不通过,需要运营人员再进行人工审核后才能确定是否进一步执行步骤120。若未命中敏感词且未命中过滤词,则自动判断为审核通过,则进一步执行步骤120。Among them, the sensitive thesaurus and the filtering rule thesaurus are pre-selected and built thesaurus. When the information content is entered into the database, text matching is performed on the title, information source, text and other fields of the information content. If the sensitive word or the filter word is hit, Then it is automatically determined that the review has failed, and the operator needs to conduct manual review before determining whether to further perform step 120. If the sensitive word is not hit and the filter word is not hit, it is automatically determined that the review is passed, and step 120 is further executed.
步骤120,基于预设算法模型确定所述资讯内容的资讯主题标签。Step 120: Determine the information topic tag of the information content based on a preset algorithm model.
在一些实施例中,所述基于预设算法模型确定所述资讯内容的资讯主题标签,包括:In some embodiments, determining the information topic tag of the information content based on a preset algorithm model includes:
获取所述资讯内容中资讯文本数据对应的资讯标题文本,并基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签。Obtain the information title text corresponding to the information text data in the information content, and process the information title text based on the preset algorithm model to determine the information topic tag of the information content.
在一些实施例中,所述预设算法模型包括实体抽取模型和关键词抽取模型,所述基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签,包括:In some embodiments, the preset algorithm model includes an entity extraction model and a keyword extraction model, and the information title text is processed based on the preset algorithm model to determine the information topic tag of the information content. ,include:
基于实体抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的实体类型标签,其中,所述实体抽取模型用于抽取所述资讯标题文本中的公司、行业与人名信息;The information title text is processed based on an entity extraction model to obtain the entity type label of the information content, wherein the entity extraction model is used to extract company, industry and person name information in the information title text;
基于关键词抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的关键词标签,其中,所述关键词抽取模型用于抽取所述资讯标题文本中的行业关键词信息;Process the information title text based on a keyword extraction model to obtain keyword tags for the information content, where the keyword extraction model is used to extract industry keyword information in the information title text;
根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签。According to the entity type tag and the keyword tag, the information topic tag of the information content is determined.
在一些实施例中,所述根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签,包括:In some embodiments, determining the information topic tag of the information content based on the entity type tag and the keyword tag includes:
根据所述实体类型标签与所述关键词标签生成标签列表;Generate a tag list according to the entity type tag and the keyword tag;
将所述标签列表中排名第一的标签确定为所述资讯内容的资讯主题标签。The first-ranked tag in the tag list is determined as the information topic tag of the information content.
在本申请实施例中,可以在预设算法模型侧提供一个资讯主题标签API接口,根据后端传来的资讯标题文本,经过实体抽取模型(记为M1)、关键词抽取模型(记为M2),最终返回一个标签列表。后端选择列表中排名第一的标签作为资讯主题标签进行展示。In the embodiment of this application, an information topic tag API interface can be provided on the preset algorithm model side. According to the information title text transmitted from the back end, the entity extraction model (denoted as M1) and the keyword extraction model (denoted as M2 ), ultimately returning a list of tags. The first-ranked tag in the backend selection list is displayed as the information topic tag.
该资讯主题标签API接口的实体抽取模型和关键词抽取模型,从模型层面上共用同 一个模型结构,但两者在收集数据阶段,设计不同实体标签,因此构造出的训练数据不同,最终拥有抽取不同结果的能力。The entity extraction model and keyword extraction model of the information topic tag API interface share the same model structure from the model level, but they design different entity tags in the data collection stage, so the training data constructed are different, and the final extraction Ability to achieve different results.
例如,实体抽取模型主要针对金融领域关心的公司、行业、人名等三类最重要的实体类型进行数据标注。For example, the entity extraction model mainly performs data annotation on the three most important entity types that are concerned in the financial field: companies, industries, and names of people.
例如,关键词抽取模型主要针对金融资讯中常见且重要的词进行数据标注,例如:通胀、股市、港股、美联储等。For example, the keyword extraction model mainly annotates data for common and important words in financial information, such as: inflation, stock market, Hong Kong stocks, Federal Reserve, etc.
在收集到相应的标注数据后,可以利用自然语言处理预训练模型Bert和全局指针模块,搭建命名实体识别模型。利用不同来源的标注数据,分别对模型进行微调,最终得到2个结果模型,即上述的M1和M2。After collecting the corresponding annotation data, you can use the natural language processing pre-training model Bert and the global pointer module to build a named entity recognition model. The annotated data from different sources were used to fine-tune the models respectively, and finally two result models were obtained, namely the above-mentioned M1 and M2.
其中,Bert模型由嵌入(Embedding)层、12个变换(Transformer)层搭建而成,共有1.1亿参数,模型参数非常庞大。在本项目中,Bert模型作为text_encoder对输入的资讯标题文本进行特征提取。输入的资讯标题文本首先会经过Bert中的分词器(Tokenizer),得到长度为L的标记(tokens)序列,tokens序列进一步根据vocab中的映射关系将token文本转成word id,得到[1,L]的输入张量(tensor),接着输入到Bert模型中,Bert作为encoder,[1,L]的tensor经过Embedding层,得到[L,D]维度的tensor,该tensor记为R(由于是Bert模型,则D=768),接着,该tensor输入全指针层,输出一个[n_labels,L,L],n_labels是实体总类别数,例如:对于同时抽取公司、行业、人名的实体抽取模型M1,n_labels=3;仅抽取关键词的实体抽取模型M2,其n_labels=1。Among them, the Bert model is composed of an Embedding layer and 12 Transformer layers, with a total of 110 million parameters. The model parameters are very large. In this project, the Bert model serves as text_encoder to extract features from the input information title text. The input information title text will first go through the tokenizer (Tokenizer) in Bert to obtain a token sequence of length L. The token sequence further converts the token text into word id according to the mapping relationship in vocab, obtaining [1,L ], and then input it into the Bert model. Bert serves as the encoder. The tensor of [1, L] passes through the Embedding layer to obtain the tensor of [L, D] dimensions. This tensor is recorded as R (because it is Bert model, then D = 768), then, the tensor inputs the full pointer layer and outputs a [n_labels, L, L], n_labels is the total number of entity categories, for example: for the entity extraction model M1 that extracts companies, industries, and personal names at the same time, n_labels=3; the entity extraction model M2 that only extracts keywords has n_labels=1.
全指针层的作用是利用text_encoder提取出实体的丰富语义信息,一次性通过一个指针方阵来指示出实体的头和尾,从而能快速定位到该实体在原文中的位置,进行直接提取。本实施例中采用一个简化版的多头注意力(Multi-Head Attention)模块来实现该功能。Multi-Head Attention模块由三个矩阵Q(查询)、K(所有键)、V(值)进行矩阵计算后,再进行Scaled Dot-Product Attention计算。此处是直接利用Q和K矩阵(均为[D,d]的矩阵),以及上述得到的[L,D]维度的tensor(该tensor记为R),对输入的[L,D]降维到[L,d](d=64通常<<D)的特征空间中,记为q和k,相关公式如下:The function of the full pointer layer is to use text_encoder to extract rich semantic information of the entity, and indicate the head and tail of the entity through a pointer matrix at one time, so that the position of the entity in the original text can be quickly located and directly extracted. In this embodiment, a simplified version of the Multi-Head Attention module is used to implement this function. The Multi-Head Attention module uses three matrices Q (query), K (all keys), and V (values) to perform matrix calculations, and then performs Scaled Dot-Product Attention calculations. Here, the Q and K matrices (both matrices of [D, d]) are directly used, as well as the [L, D] dimension tensor obtained above (the tensor is recorded as R), to reduce the input [L, D] In the feature space with dimensions [L, d] (d=64 usually <<D), they are recorded as q and k. The relevant formulas are as follows:
q=R·Q;k=R·K;
Figure PCTCN2022103226-appb-000001
q=R·Q;k=R·K;
Figure PCTCN2022103226-appb-000001
其中,S α(i,j)表示第α类实体的指针方阵,其形状为[L,L],当有n_labels类时,每个实体类别都将计算得到这样一个指针方阵,因此整个全指针层的输出为[n_labels,L,L]的tensor。需要注意的是,S α的行表示实体头位置,列表示实体尾位置,因此S α虽然是方阵,但只有上三角部分有实际意义,下三角的输出直接不考虑。 Among them, S α (i, j) represents the pointer square matrix of the αth type entity, and its shape is [L, L]. When there are n_labels classes, each entity category will calculate such a pointer square matrix, so the entire The output of the full pointer layer is a tensor of [n_labels,L,L]. It should be noted that the rows of S α represent the head position of the entity, and the columns represent the tail position of the entity. Therefore, although S α is a square matrix, only the upper triangular part has practical significance, and the output of the lower triangle is not considered directly.
分类输出层的作用是将输出的指针方阵进行实体提取,[n_labels,L,L]中大于0的值认为是被激活的实体头尾,因此该层将模型输出的logits转化成0/1二值方阵,被激活的实体头尾被置为1,其余为0。The function of the classification output layer is to extract entities from the output pointer square matrix. Values greater than 0 in [n_labels, L, L] are considered to be the head and tail of activated entities. Therefore, this layer converts the logits output by the model into 0/1 In a binary square matrix, the head and tail of the activated entity are set to 1, and the rest are set to 0.
其中,API接口在接收到资讯标题文本后,将文本先传入模型M1,若模型未返回结果,即实体抽取模型M1未提取到任何公司、行业、人名信息,则继续将标题传入关键词抽取模型M2,进一步抽取关键词。Among them, after the API interface receives the information title text, it first passes the text into the model M1. If the model does not return a result, that is, the entity extraction model M1 does not extract any company, industry, or person name information, it will continue to pass the title into the keywords. Extract model M2 and further extract keywords.
如图2所示的应用场景示意图,在训练阶段,对获取的资讯文本数据进行人工标注,比如,通过针对资讯文本数据中的公司、行业、人名等三类最重要的实体类型进行数据标注,得到第一标注数据。通过针对资讯文本数据中的行业关键词进行数据标注,得到第二标注数据。将第一标注数据输入预设算法模型中进行模型训练,比如采用第一标注数据和Bert模型预训练多个transformers层、全局指针层和分类输出层,得到实体抽取模型为M1)。将第二标注数据输入预设算法模型中进行模型训练,比如采用第二标注数据和Bert模型预训练多个transformers层、全局指针层和分类输出层,得到关键词抽取模型为M2)。As shown in the schematic diagram of the application scenario in Figure 2, during the training phase, the obtained information text data is manually annotated. For example, by performing data annotation on the three most important entity types in the information text data: company, industry, and person's name. Get the first labeled data. By performing data annotation on the industry keywords in the information text data, the second annotation data is obtained. The first annotated data is input into the preset algorithm model for model training. For example, the first annotated data and the Bert model are used to pre-train multiple transformers layers, global pointer layers and classification output layers, and the entity extraction model is obtained as M1). Input the second annotated data into the preset algorithm model for model training. For example, use the second annotated data and the Bert model to pre-train multiple transformers layers, global pointer layers and classification output layers to obtain the keyword extraction model M2).
在应用阶段,将资讯标题文本第1次输入实体抽取模型M1中,以抽取资讯标题文本中的公司、行业与人名信息,并输出第一结果集;然后,判断该第一结果集是否为空, 若该第一结果集不为空,则将第一结果集内包含的实体类型标签归入最终结果集中;若该第一结果集为空,则将资讯标题文本第2次输入关键词取模型M2中,以抽取资讯标题文本中的行业关键词信息,并输出第二结果集,将第二结果集内包含的关键词标签归入最终结果集中。其中,在最终结果集中,可以构建一个标签列表,后端选择列表中排名第一的标签作为资讯主题标签进行展示。即算法提供多个同频率标签时,资讯侧可以选取第一个标签用于配图。In the application stage, the information title text is input into the entity extraction model M1 for the first time to extract the company, industry and person name information in the information title text, and output the first result set; then, determine whether the first result set is empty , if the first result set is not empty, then the entity type tags contained in the first result set are included in the final result set; if the first result set is empty, the second input keyword of the information title text is retrieved. In model M2, the industry keyword information in the information title text is extracted, and a second result set is output, and the keyword tags contained in the second result set are included in the final result set. Among them, in the final result set, a tag list can be constructed, and the back-end selects the first-ranked tag in the list to be displayed as the information topic tag. That is, when the algorithm provides multiple tags with the same frequency, the information side can select the first tag for image matching.
上述方式通过基于人工智能训练得到实体抽取模型M1和关键词抽取模型M2,最终返回一个标签列表,以在后端选择列表中排名第一的标签作为资讯主题标签进行展示,提高了资讯主题标签确定的精确性。The above method obtains the entity extraction model M1 and the keyword extraction model M2 based on artificial intelligence training, and finally returns a tag list, and displays the tag ranked first in the back-end selection list as the information topic tag, which improves the determination of information topic tags. accuracy.
步骤130,判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签。Step 130: Determine whether there is a target topic tag matching the information topic tag in the first image gallery, where the pictures in the first image gallery are annotated with the corresponding topic tag.
例如,在通过预设算法模型生成资讯主题标签后,在第一图库中进行检索匹配,以查找第一图库中是否存在与资讯主题标签相匹配的目标主题标签。For example, after the information topic tag is generated through the preset algorithm model, a search and matching is performed in the first image gallery to find whether there is a target topic tag matching the information topic tag in the first image gallery.
例如,如图3所示的应用场景图,在后台管理设备上显示了一个资讯管理平台界面,在该资讯管理平台界面可以维护第一图库(如图3所示的标签图库),其中,资讯管理平台界面上维护第一图库中的每张图片时,可以显示图片的缩略图、ID信息、入库时间、对应的主题标签、命中次数等信息。For example, in the application scenario diagram shown in Figure 3, an information management platform interface is displayed on the background management device. In this information management platform interface, the first image gallery (the label image gallery as shown in Figure 3) can be maintained, in which the information When maintaining each picture in the first gallery on the management platform interface, the thumbnail, ID information, storage time, corresponding topic tag, number of hits and other information of the picture can be displayed.
步骤140,若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。Step 140: If there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as the cover image of the information content.
其中,一篇资讯从内容源爬取自动审核入库或者人工新建入库之后,需要给资讯内容配一张封面图。智能配图在资讯爬取入库或者人工新建保存后,可以根据预设算法模型提取资讯内容的资讯主题标签,并根据资讯主题标签匹配图库中的图片,完成资讯内容的封面图的智能配图。若第一图库中存在与资讯主题标签相匹配的目标主题标签,则将目标主题标签对应的图片作为资讯内容的封面图。Among them, after a piece of information is crawled from the content source and automatically reviewed and added to the database, or it is manually created and added to the database, a cover image needs to be assigned to the information content. After the information is crawled into the database or manually created and saved, the intelligent matching can extract the information theme tags of the information content according to the preset algorithm model, and match the pictures in the gallery according to the information theme tags to complete the intelligent matching of the cover image of the information content. . If there is a target topic tag matching the information topic tag in the first image gallery, the picture corresponding to the target topic tag will be used as the cover image of the information content.
在一些实施例中,所述若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图,包括:In some embodiments, if there is a target topic tag matching the information topic tag in the first gallery, then using the picture corresponding to the target topic tag as the cover image of the information content, including:
若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,且所述目标主题标签对应的多张图片,则从所述目标主题标签对应的多张图片中选取一张图片所述作为所述资讯内容的封面图。If there is a target topic tag matching the information topic tag in the first gallery, and there are multiple pictures corresponding to the target topic tag, then select one picture from the multiple pictures corresponding to the target topic tag. Said as the cover image of the information content.
例如,当基于资讯主题标签匹配到目标主题标签对应的多张图片时,可以从目标主题标签对应的多张图片中随机选取一张图片作为资讯内容的封面图。例如,可以根据命中次数从目标主题标签对应的多张图片中选取命中次数最多的一张图片作为资讯内容的封面图,以选取该目标主题标签中最常用的图片进行配图,尽量贴合大众审美。例如,可以根据命中次数从目标主题标签对应的多张图片中选取命中次数最少的一张作为资讯内容的封面图,以选取该目标主题标签中少用的图片进行配图,增加配图新鲜感。For example, when multiple pictures corresponding to the target topic label are matched based on the information topic label, one picture can be randomly selected from the multiple pictures corresponding to the target topic label as the cover image of the information content. For example, according to the number of hits, you can select the picture with the most hits from the multiple pictures corresponding to the target topic tag as the cover image of the information content, so as to select the most commonly used pictures in the target topic tag for illustration, so as to fit the public as much as possible. aesthetic. For example, based on the number of hits, you can select the one with the least number of hits from the multiple pictures corresponding to the target topic tag as the cover image of the information content, so as to select the rarely used pictures in the target topic tag for illustration to increase the freshness of the illustration. .
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
若所述第一图库中不存在与所述资讯主题标签相匹配的目标主题标签,则从第二图库中获取随机图片作为所述资讯内容的封面图,其中,所述第一图库中的图片未标注主题标签。If there is no target topic tag matching the information topic tag in the first image gallery, a random picture is obtained from the second image gallery as the cover image of the information content, wherein the picture in the first image gallery is Hashtags not tagged.
例如,如图4所示的应用场景图,在后台管理设备上显示了一个资讯管理平台界面,在该资讯管理平台界面可以维护第二图库(如图3所示的普通图库),其中,资讯管理平台界面上维护第二图库中的每张图片时,可以显示图片的缩略图和ID信息。For example, in the application scenario diagram shown in Figure 4, an information management platform interface is displayed on the background management device. In this information management platform interface, a second image gallery (the ordinary image gallery as shown in Figure 3) can be maintained, in which the information When maintaining each picture in the second gallery on the management platform interface, the thumbnail and ID information of the picture can be displayed.
例如,对于少量在第一图库中没有维护到的资讯主题标签,可以使用第二图库中抽象的、无特定主题的图片作为资讯内容的封面图。For example, for a small number of information topic tags that are not maintained in the first image gallery, abstract pictures without specific themes in the second image gallery can be used as the cover image of the information content.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
当接收到后台管理设备发送的针对所述资讯内容的封面图配置请求时,根据所述封 面图配置请求中携带的配置信息确定所述资讯内容的封面图。When a cover image configuration request for the information content sent by the background management device is received, the cover image of the information content is determined according to the configuration information carried in the cover image configuration request.
例如,在资讯审核入库后,运营人员仍然能够重新编辑资讯内容对应的相关信息。例如,若自动配图产生的封面图仍不符合要求,资讯内容也可以进行人工干预配图,资讯人员可以通过在后台管理设备上输入针对资讯内容的封面图配置请求,该封面图配置请求的配置信息携带有指定的封面图,服务器在收到后台管理设备发送的资讯内容的封面图配置请求时,根据封面图配置请求中携带的配置信息确定资讯内容的封面图,这样可以大大增加配图的灵活度和提高运营的效率。For example, after information is reviewed and stored in the database, operators can still re-edit the relevant information corresponding to the information content. For example, if the cover image generated by automatic image matching still does not meet the requirements, the information content can also be manually intervened in the image matching. The information personnel can enter a cover image configuration request for the information content on the background management device. The cover image configuration request The configuration information carries the specified cover image. When the server receives the cover image configuration request for the information content sent by the background management device, it determines the cover image for the information content based on the configuration information carried in the cover image configuration request. This can greatly increase the number of images. flexibility and improve operational efficiency.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
当接收到客户端发送的针对所述资讯内容的封面图展示请求时,向所述客户端发送所述资讯内容的封面图,以在所述客户端的资讯预览界面上显示所述资讯内容的封面图。When receiving a cover image display request for the information content sent by the client, sending the cover image of the information content to the client to display the cover image of the information content on the information preview interface of the client picture.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
当接收到所述客户端发送的针对所述封面图的触发操作时,向所述客户端发送所述资讯内容的信息流,以在所述客户端的资讯预览界面上显示所述资讯内容的信息流,所述信息流包括所述资讯内容对应的资讯文本数据、资讯多媒体数据中的至少一种。When receiving the triggering operation for the cover image sent by the client, send the information flow of the information content to the client, so as to display the information of the information content on the information preview interface of the client. Stream, the information stream includes at least one of information text data and information multimedia data corresponding to the information content.
例如,当用户对该封面图对应的资讯内容感兴趣时,会通过点击封面图、长按封面图等触发操作,来使得服务器向客户端推送资讯内容的信息流。其中,该信息流可以包括资讯文本数据、资讯多媒体数据等,其中,资讯多媒体数据可以包括图像、动画、音频、视频等多媒体资源。其中,资讯文本数据可以包括资讯标题、ID、发布时间、资讯内容详情、资讯内容的发布端信息等。For example, when a user is interested in the information content corresponding to the cover image, he or she will trigger operations such as clicking on the cover image or long-pressing the cover image, causing the server to push an information flow of information content to the client. The information flow may include information text data, information multimedia data, etc., where the information multimedia data may include multimedia resources such as images, animations, audios, and videos. Among them, the information text data may include information title, ID, release time, information content details, information content release end information, etc.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
获取预设时段内的配图记录数据,其中,每个所述配图记录包含成对的资讯主题标签与所述资讯主题标签对应的基于人工智能的资讯配图方式;Obtain image matching record data within a preset period, wherein each of the image matching records includes a pair of information theme tags and an artificial intelligence-based information matching method corresponding to the information theme tag;
根据所述配图记录数据,统计所述预设时段内每个所述资讯主题标签的出现频率,以得到标签统计数据;According to the picture recording data, count the frequency of occurrence of each information topic tag within the preset period to obtain tag statistics;
根据所述配图记录数据,统计所述预设时段内每个所述基于人工智能的资讯配图方式的数据占比,以得到配图统计数据;According to the mapping record data, count the data proportion of each artificial intelligence-based information mapping method within the preset period to obtain mapping statistics;
将所述标签统计数据与所述配图统计数据发送至后台管理设备,以在所述后台管理设备上显示所述标签统计数据与所述配图统计数据。The label statistical data and the accompanying picture statistical data are sent to a backend management device, so that the label statistical data and the accompanying picture statistical data are displayed on the backend management device.
其中,为了持续丰富标签配图的图库和优化配图的准确性,需要增加统计标签统计和配图统计,运营人员可以根据统计结果持续维护图库以提高智能配图的准确性。对资讯内容的标签统计数据和配图统计数据做单独的后台展示,方便定位复盘问题。Among them, in order to continue to enrich the gallery of tag images and optimize the accuracy of images, it is necessary to add statistical tag statistics and image statistics. Operators can continue to maintain the image gallery based on the statistical results to improve the accuracy of intelligent images. A separate background display is provided for the label statistics and picture statistics of the information content to facilitate locating review issues.
例如,每次配图都会记录一次配图记录,标签统计和配图统计都以此作为数据来源统计结果。即基于取预设时段内记录的配图记录数据进行数据统计,以确定出标签统计数据和配图统计数据。例如,预设时段可以为最近一周、最近一个月、最近三个月、最近一年等时间段中的其中之一者。For example, every time a picture is matched, a picture record will be recorded, and label statistics and picture statistics will use this as the data source statistical result. That is, data statistics are performed based on the picture recording data recorded within a preset period of time to determine the label statistics and picture statistics. For example, the preset time period may be one of the time periods of the last week, the last month, the last three months, the last year, etc.
其中,对于标签统计数据,由于配图记录数据的数据量较大,每日产生的数据量可能会达到几万条,单纯依靠数据库的聚合统计无法到达性能要求,因此需要按日统计出每日的标签统计数据,如图5所示的应用场景图,运营人员在查询标签统计数据时,可以通过后台管理设备上显示的资讯管理平台界面输入预设时段,比如最近一个月,并将该预设时段发送至服务器,服务器根据预设时段查询该预设时段范围内的单日标签统计数据并聚合成预设时段内的标签统计数据,并将预设时段内的标签统计数据返回至后台管理设备进行显示。通过在后台管理设备上显示标签统计数据,可以使得运营人员在维护第一图库(如图3示出的标签图库)时能有针对性的对高频出现的标签进行统计,尽可能覆盖高频资讯主题的资讯内容,从而使图库运营更加高效。例如,运营人员近期发现“油价”为高频出现的标签,即对“油价”标签维护多张相关封面图,这样,后续再出现相同标签的资讯内容时智能配图就将生效。如图5所示,在标签统计时,同时统计 资讯主题标签的出现频率,以方便有针对性行的对图片进行维护。Among them, for label statistics, due to the large amount of data recorded with pictures, the amount of data generated every day may reach tens of thousands. Simply relying on the aggregation statistics of the database cannot meet the performance requirements, so it is necessary to calculate the daily statistics on a daily basis. Tag statistical data, as shown in the application scenario diagram in Figure 5, when operators query tag statistics, they can enter a preset period, such as the last month, through the information management platform interface displayed on the background management device, and add the preset period Set a time period and send it to the server. The server queries the single-day tag statistics within the preset time period according to the preset time period and aggregates them into tag statistics within the preset time period, and returns the tag statistics within the preset time period to the backend management. device to display. By displaying tag statistical data on the background management device, operators can perform targeted statistics on frequently occurring tags when maintaining the first image gallery (the tag image gallery shown in Figure 3), and cover high-frequency tags as much as possible. The information content of the information theme makes the gallery operation more efficient. For example, the operation staff recently discovered that "oil price" is a label that appears frequently, so they maintain multiple related cover images for the "oil price" label. In this way, when the information content of the same label appears again in the future, the intelligent image matching will take effect. As shown in Figure 5, when counting tags, the frequency of occurrence of information topic tags is also counted to facilitate targeted maintenance of images.
其中,对于配图统计数据,由于配图记录数据的数据量较大,无法单纯依靠数据库的聚合统计得到需要的统计结果,因此需要每日统计前30天内的配图统计数据并存储到数据库中,查看配图统计数据时可以根据查询日期到数据库中查询出对应的配图统计数据并返回。如图6所示的应用场景图,运营人员在查询配图统计数据时,可以通过后台管理设备上显示的资讯管理平台界面输入预设时段,比如从2022-5-17往前推7天内,并将该预设时段发送至服务器,服务器根据预设时段查询该预设时段范围内的单日配图统计数据并聚合成预设时段内的配图统计数据,并将预设时段内的配图统计数据返回至后台管理设备进行显示。通过配图统计数据可以展示各类配图类型在线上的实际数据占比(覆盖率),以此对智能配图机制有更客观地数据评估,可以真实可量化的统计图库覆盖效果。例如,配图类型可以包括标签配图、随机配图和人工配图,在图6显示的配图统计数据中,标签配图的覆盖率为3.40%,随机配图的覆盖率为96.40%,人工配图的覆盖率为0.20%。Among them, for the picture statistics data, due to the large amount of picture record data, the required statistical results cannot be obtained simply by relying on the aggregation statistics of the database. Therefore, the picture statistics data within the previous 30 days need to be calculated daily and stored in the database. , when viewing the matching statistics data, you can query the corresponding matching statistics data in the database according to the query date and return it. As shown in the application scenario diagram shown in Figure 6, when operators query image matching statistics, they can enter the preset period through the information management platform interface displayed on the background management device, for example, push forward within 7 days from 2022-5-17. The preset time period is sent to the server. The server queries the single-day picture matching statistics within the preset time period according to the preset time period and aggregates the picture matching statistics within the preset time period, and aggregates the picture matching statistics within the preset time period. The graph statistical data is returned to the background management device for display. The actual data proportion (coverage) of various types of graphics online can be displayed through the graphics statistical data, so as to have a more objective data evaluation of the intelligent graphics mechanism and provide a truly quantifiable statistical library coverage effect. For example, the types of graphics can include label mapping, random mapping, and manual mapping. In the mapping statistics shown in Figure 6, the coverage rate of label mapping is 3.40%, and the coverage rate of random mapping is 96.40%. Manual illustration coverage is 0.20%.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
当接收到所述后台管理设备发送的针对所述第一图库的更新请求时,根据所述更新请求更新所述第一图库中的图片、主题标签中的至少一种。When an update request for the first gallery sent by the background management device is received, at least one of pictures and topic tags in the first gallery is updated according to the update request.
例如,还可以人工维护第一图库,对常见的资讯主题标签维护相关的封面图。运营人员可以通过后台管理设备显示的资讯管理平台界面维护第一图库,通过资讯管理平台界面输入针对第一图库的更新请求,并将更新请求发送至服务器,以使服务器根据更新请求更新第一图库中的图片、主题标签中的至少一种。例如,若更新请求为更新第一图片对应的第一主题标签,则根据更新请求更新第一图库中的第一图片对应的第一主题标签。例如,若更新请求为更新第二主题标签对应的第二图片,则根据更新请求更新第一图库中的第二主题标签对应的第二图片。例如,若更新请求为新增第三图片及其对应的第三主题标签,则根据更新请求将第三图片及其对应的第三主题标签新增至第一图库中。For example, you can also manually maintain the first image gallery and maintain related cover images for common information topic tags. The operator can maintain the first image gallery through the information management platform interface displayed on the background management device, input an update request for the first image gallery through the information management platform interface, and send the update request to the server, so that the server updates the first image gallery according to the update request. At least one of the pictures and hashtags in . For example, if the update request is to update the first topic tag corresponding to the first picture, then the first topic tag corresponding to the first picture in the first gallery is updated according to the update request. For example, if the update request is to update the second picture corresponding to the second topic tag, then the second picture corresponding to the second topic tag in the first gallery is updated according to the update request. For example, if the update request is to add a third picture and its corresponding third topic tag, then the third picture and its corresponding third topic tag are added to the first gallery according to the update request.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
若检测到所述第一图库中已存在与所述更新请求中携带的待更新标签相同的主题标签,则生成表征标签重复的第二提示信息,并拒绝更新所述第一图库。If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first image gallery, a second prompt message indicating a duplication of tags is generated, and the update of the first image gallery is refused.
例如,一个图片不能创建多个相同标签,若检测到第一图库中已存在与更新请求中携带的待更新标签相同的主题标签,则生成表征标签重复的第二提示信息,并拒绝更新第一图库,并将第二提示信息发送至后台管理设备,以在后台管理设备上显示该第二提示信息,以提示运营人员已存在相同标签。For example, one image cannot create multiple identical tags. If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, a second prompt message indicating duplicate tags will be generated, and the update of the first image gallery will be rejected. gallery, and sends the second prompt information to the background management device to display the second prompt information on the background management device to remind the operator that the same tag already exists.
为了更好的说明本申请实施例提供的基于人工智能的资讯配图方法,请参阅图7,本申请实施例提供的基于人工智能的资讯配图方法的流程可总结归纳为下述步骤:In order to better illustrate the artificial intelligence-based information mapping method provided by the embodiment of the present application, please refer to Figure 7. The process of the artificial intelligence-based information mapping method provided by the embodiment of the present application can be summarized into the following steps:
S1.服务器获取资讯内容;S1. The server obtains information content;
S1.1.服务器基于资讯内容源爬取资讯内容,并将资讯内容入库至资讯内容库中;S1.1. The server crawls the information content based on the information content source and stores the information content into the information content library;
S1.2.服务器响应于后台管理设备发送的针对资讯内容的入库请求,获取资讯内容,并将资讯内容入库至资讯内容库中;S1.2. The server responds to the storage request for information content sent by the background management device, obtains the information content, and stores the information content into the information content library;
S1.3.服务器基于预设审核规则对资讯内容进行资讯审核,预设审核规则至少包括敏感词匹配与过滤规则校验;S1.3. The server audits the information content based on the preset audit rules. The preset audit rules at least include sensitive word matching and filtering rule verification;
S1.4.若资讯内容命中敏感词,和/或资讯内容命中过滤规则,则服务器确定资讯审核不通过,生成审核不通过的第一提示信息,并将资讯内容与第一提示信息发送至后台管理设备;S1.4. If the information content hits the sensitive word, and/or the information content hits the filtering rule, the server determines that the information review fails, generates the first prompt message that the review fails, and sends the information content and the first prompt message to the background. Manage equipment;
S1.5.后台管理设备显示第一提示信息;S1.5. The background management device displays the first prompt message;
S1.6.若资讯内容未命中敏感词,且资讯内容未命中过滤规则,则服务器确定资讯审核通过,进一步执行步骤S2;S1.6. If the information content does not match the sensitive words and the information content does not match the filtering rules, the server determines that the information review has passed and further proceeds to step S2;
S2.服务器基于预设算法模型确定资讯内容的资讯主题标签;S2. The server determines the information topic tag of the information content based on the preset algorithm model;
S2.1.服务器基于实体抽取模型对资讯内容资讯标题文本进行处理,得到资讯内容的实体类型标签,实体抽取模型用于抽取资讯标题文本中的公司、行业与人名信息;S2.1. The server processes the information title text of the information content based on the entity extraction model to obtain the entity type label of the information content. The entity extraction model is used to extract the company, industry and person name information in the information title text;
S2.2.服务器基于关键词抽取模型对资讯内容资讯标题文本进行处理,得到资讯内容的关键词标签,关键词抽取模型用于抽取资讯标题文本中的行业关键词信息;S2.2. The server processes the information title text of the information content based on the keyword extraction model to obtain the keyword tags of the information content. The keyword extraction model is used to extract industry keyword information in the information title text;
S2.3.服务器根据实体类型标签与关键词标签,确定资讯内容的资讯主题标签;S2.3. The server determines the information topic tag of the information content based on the entity type tag and keyword tag;
S3.服务器判断第一图库中是否存在与资讯主题标签相匹配的目标主题标签,第一图库中的图片标注有对应的主题标签;S3. The server determines whether there is a target topic tag matching the information topic tag in the first gallery, and the pictures in the first gallery are annotated with the corresponding topic tag;
S4.若第一图库中存在与资讯主题标签相匹配的目标主题标签,则服务器将目标主题标签对应的图片作为资讯内容的封面图;S4. If there is a target topic tag matching the information topic tag in the first gallery, the server will use the picture corresponding to the target topic tag as the cover image of the information content;
S5.若第一图库中不存在与资讯主题标签相匹配的目标主题标签,则服务器从第二图库中获取随机图片作为资讯内容的封面图,第一图库中的图片未标注主题标签;S5. If there is no target topic tag matching the information topic tag in the first image gallery, the server obtains a random picture from the second image gallery as the cover image of the information content. The pictures in the first image gallery are not labeled with topic tags;
S6.当服务器接收到后台管理设备发送的针对资讯内容的封面图配置请求时,根据封面图配置请求中携带的配置信息确定资讯内容的封面图;S6. When the server receives the cover image configuration request for the information content sent by the background management device, it determines the cover image of the information content based on the configuration information carried in the cover image configuration request;
S7.当服务器接收到客户端发送的针对资讯内容的封面图展示请求时,向客户端发送资讯内容的封面图;S7. When the server receives the cover image display request for the information content sent by the client, it sends the cover image of the information content to the client;
S8.在客户端的资讯预览界面上显示资讯内容的封面图;S8. Display the cover image of the information content on the information preview interface of the client;
S9.当服务器接收到客户端发送的针对封面图的触发操作时,向客户端发送资讯内容的信息流;S9. When the server receives the trigger operation for the cover image sent by the client, it sends the information flow of the information content to the client;
S10.在客户端的资讯预览界面上显示资讯内容的信息流,信息流包括资讯内容对应的资讯文本数据、资讯多媒体数据中的至少一种;S10. Display the information flow of the information content on the client's information preview interface. The information flow includes at least one of information text data and information multimedia data corresponding to the information content;
S11.服务器获取预设时段内的配图记录数据,每个配图记录包含成对的资讯主题标签与资讯主题标签对应的封面图的确定方式;S11. The server obtains the picture record data within the preset period. Each picture record contains a pair of information topic tags and a method for determining the cover image corresponding to the information topic tag;
S12.服务器根据配图记录数据,统计预设时段内每个资讯主题标签的出现频率,以得到标签统计数据;S12. The server records the data based on the picture and counts the frequency of occurrence of each information topic tag within the preset period to obtain tag statistics;
S13.服务器根据配图记录数据,统计预设时段内每个封面图的确定方式的数据占比,以得到配图统计数据;S13. The server records the data of the accompanying images and counts the data proportion of the determination method of each cover image within the preset period to obtain the statistical data of the accompanying images;
S14.服务器将标签统计数据与配图统计数据发送至后台管理设备;S14. The server sends label statistics and picture statistics to the background management device;
S15.后台管理设备显示标签统计数据与配图统计数据;S15. The background management device displays label statistics and picture statistics;
S16.当服务器接收到后台管理设备发送的针对第一图库的更新请求时,根据更新请求更新第一图库中的图片、主题标签中的至少一种;S16. When the server receives an update request for the first gallery sent by the background management device, it updates at least one of the pictures and topic tags in the first gallery according to the update request;
S17.若检测到第一图库中已存在与更新请求中携带的待更新标签相同的主题标签,则服务器生成表征标签重复的第二提示信息,并拒绝更新第一图库;在步骤S17中,还可以将第二提示信息发送至后台管理设备。S17. If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, the server generates a second prompt message indicating that the tag is repeated, and refuses to update the first gallery; in step S17, also The second prompt information can be sent to the background management device.
S18.后台管理设备显示第二提示信息。S18. The background management device displays the second prompt information.
上述所有的技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above technical solutions can be combined in any way to form optional embodiments of the present application, and will not be described again one by one.
本申请实施例提供的基于人工智能的资讯配图方法,通过获取资讯内容;基于预设算法模型确定资讯内容的资讯主题标签;判断第一图库中是否存在与资讯主题标签相匹配的目标主题标签,其中,第一图库中的图片标注有对应的主题标签;若第一图库中存在与资讯主题标签相匹配的目标主题标签,则将目标主题标签对应的图片作为资讯内容的封面图。本申请实施例,通过算法识别资讯主题标签,并在第一图库中进行标签检索匹配,将第一图库中与资讯主题标签相匹配的目标主题标签对应的图片作为资讯内容的封面图,可以实现智能配置封面图,降低人工成本,增强资讯内容与配图的贴合程度,提高了资讯配图的效率和效果。The artificial intelligence-based information mapping method provided by the embodiment of the present application obtains information content; determines the information topic tag of the information content based on a preset algorithm model; and determines whether there is a target topic tag matching the information topic tag in the first image gallery. , where the pictures in the first gallery are annotated with corresponding topic tags; if there is a target topic tag matching the information topic tag in the first gallery, the picture corresponding to the target topic tag is used as the cover image of the information content. In the embodiment of this application, information topic tags are identified through an algorithm, tag retrieval and matching is performed in the first gallery, and the picture corresponding to the target topic tag that matches the information topic tag in the first gallery is used as the cover image of the information content, which can be achieved Intelligent configuration of cover images reduces labor costs, enhances the fit between information content and accompanying images, and improves the efficiency and effect of information accompanying images.
为便于更好的实施本申请实施例的基于人工智能的资讯配图方法,本申请实施例还提供一种客户端。请参阅图8,图8为本申请实施例提供的基于人工智能的资讯配图装 置的结构示意图。其中,该基于人工智能的资讯配图装置200可以包括:In order to facilitate better implementation of the artificial intelligence-based information mapping method in the embodiment of the present application, the embodiment of the present application also provides a client. Please refer to Figure 8, which is a schematic structural diagram of an artificial intelligence-based information mapping device provided by an embodiment of the present application. Among them, the information mapping device 200 based on artificial intelligence may include:
获取单元210,用于获取资讯内容;Obtaining unit 210 is used to obtain information content;
确定单元220,用于基于预设算法模型确定所述资讯内容的资讯主题标签;The determining unit 220 is configured to determine the information topic tag of the information content based on a preset algorithm model;
判断单元230,用于判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;The judging unit 230 is used to judge whether there is a target theme tag matching the information theme tag in the first gallery, wherein the pictures in the first gallery are marked with corresponding theme tags;
处理单元240,用于若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。The processing unit 240 is configured to, if there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as the cover image of the information content.
在一些实施例中,所述确定单元220,具体用于:获取所述资讯内容中资讯文本数据对应的资讯标题文本,并基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签。In some embodiments, the determining unit 220 is specifically configured to: obtain the information title text corresponding to the information text data in the information content, and process the information title text based on the preset algorithm model to determine The information topic tag of the information content.
在一些实施例中,所述预设算法模型包括实体抽取模型和关键词抽取模型,所述确定单元220在基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签时,具体用于:基于实体抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的实体类型标签,其中,所述实体抽取模型用于抽取所述资讯标题文本中的公司、行业与人名信息;基于关键词抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的关键词标签,其中,所述关键词抽取模型用于抽取所述资讯标题文本中的行业关键词信息;根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签。In some embodiments, the preset algorithm model includes an entity extraction model and a keyword extraction model, and the determination unit 220 processes the information title text based on the preset algorithm model to determine the information content. When the information topic label is used, it is specifically used to: process the information title text based on an entity extraction model to obtain the entity type label of the information content, wherein the entity extraction model is used to extract the information title text. Company, industry and person name information; the information title text is processed based on a keyword extraction model to obtain the keyword tags of the information content, wherein the keyword extraction model is used to extract the industry in the information title text Keyword information: determine the information topic tag of the information content according to the entity type tag and the keyword tag.
在一些实施例中,所述确定单元220在根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签时,具体用于:根据所述实体类型标签与所述关键词标签生成标签列表;将所述标签列表中排名第一的标签确定为所述资讯内容的资讯主题标签。In some embodiments, when determining the information topic tag of the information content based on the entity type tag and the keyword tag, the determining unit 220 is specifically configured to: based on the entity type tag and the key word tag The word tag generates a tag list; the tag ranked first in the tag list is determined as the information topic tag of the information content.
在一些实施例中,所述获取单元210,具体用于:基于资讯内容源爬取资讯内容,并将所述资讯内容入库至资讯内容库中;或者响应于后台管理设备发送的针对所述资讯内容的入库请求,获取所述资讯内容,并将所述资讯内容入库至资讯内容库中。In some embodiments, the acquisition unit 210 is specifically configured to: crawl information content based on the information content source and store the information content into the information content library; or respond to the information sent by the background management device for the information content. The information content warehousing request obtains the information content and stores the information content into the information content library.
在一些实施例中,所述基于人工智能的资讯配图装置200在用于获取资讯内容之后,还用于:基于预设审核规则对所述资讯内容进行资讯审核,其中,所述预设审核规则至少包括敏感词匹配与过滤规则校验;若所述资讯内容未命中敏感词,且所述资讯内容未命中过滤规则,则执行基于预设算法模型确定所述资讯内容的资讯主题标签的步骤。In some embodiments, after the information mapping device 200 based on artificial intelligence is used to obtain information content, it is also used to: conduct information review of the information content based on preset review rules, wherein the preset review The rules at least include sensitive word matching and filtering rule verification; if the information content does not match the sensitive word, and the information content does not match the filtering rule, then perform the step of determining the information topic tag of the information content based on the preset algorithm model .
在一些实施例中,所述基于人工智能的资讯配图装置200,还用于:若所述资讯内容命中敏感词,和/或所述资讯内容命中过滤规则,则生成审核不通过的第一提示信息,并将所述资讯内容与所述第一提示信息发送至所述后台管理设备。In some embodiments, the artificial intelligence-based information mapping device 200 is also configured to: if the information content hits a sensitive word, and/or the information content hits a filtering rule, generate a first review failed Prompt information, and send the information content and the first prompt information to the background management device.
在一些实施例中,所述处理单元240,还用于:若所述第一图库中不存在与所述资讯主题标签相匹配的目标主题标签,则从第二图库中获取随机图片作为所述资讯内容的封面图,其中,所述第一图库中的图片未标注主题标签。In some embodiments, the processing unit 240 is also configured to: if there is no target topic tag matching the information topic tag in the first image gallery, obtain a random picture from the second image gallery as the The cover picture of the information content, wherein the pictures in the first gallery are not labeled with topic tags.
在一些实施例中,所述处理单元240,还用于:当接收到后台管理设备发送的针对所述资讯内容的封面图配置请求时,根据所述封面图配置请求中携带的配置信息确定所述资讯内容的封面图。In some embodiments, the processing unit 240 is further configured to: when receiving a cover image configuration request for the information content sent by the background management device, determine the cover image configuration request based on the configuration information carried in the cover image configuration request. Cover image of the information content.
在一些实施例中,所述处理单元240,还用于:当接收到客户端发送的针对所述资讯内容的封面图展示请求时,向所述客户端发送所述资讯内容的封面图,以在所述客户端的资讯预览界面上显示所述资讯内容的封面图。In some embodiments, the processing unit 240 is further configured to: when receiving a cover image display request for the information content sent by the client, send the cover image of the information content to the client, so as to Display the cover image of the information content on the information preview interface of the client.
在一些实施例中,所述处理单元240,还用于:当接收到所述客户端发送的针对所述封面图的触发操作时,向所述客户端发送所述资讯内容的信息流,以在所述客户端的资讯预览界面上显示所述资讯内容的信息流,所述信息流包括所述资讯内容对应的资讯文本数据、资讯多媒体数据中的至少一种。In some embodiments, the processing unit 240 is further configured to: when receiving a triggering operation for the cover image sent by the client, send the information stream of the information content to the client, so as to The information flow of the information content is displayed on the information preview interface of the client. The information flow includes at least one of information text data and information multimedia data corresponding to the information content.
在一些实施例中,所述处理单元240,还用于:获取预设时段内的配图记录数据, 其中,每个所述配图记录包含成对的资讯主题标签与所述资讯主题标签对应的封面图的确定方式;根据所述配图记录数据,统计所述预设时段内每个所述资讯主题标签的出现频率,以得到标签统计数据;根据所述配图记录数据,统计所述预设时段内每个所述封面图的确定方式的数据占比,以得到配图统计数据;将所述标签统计数据与所述配图统计数据发送至后台管理设备,以在所述后台管理设备上显示所述标签统计数据与所述配图统计数据。In some embodiments, the processing unit 240 is also configured to: obtain image record data within a preset period, wherein each image record includes a pair of information topic tags corresponding to the information topic tag. How to determine the cover image; according to the accompanying image record data, count the frequency of occurrence of each information theme tag within the preset period to obtain tag statistical data; according to the accompanying image record data, count the The data proportion of the determined method of each cover image within the preset period is used to obtain the accompanying image statistical data; the label statistical data and the accompanying image statistical data are sent to the background management device for management in the background The label statistical data and the accompanying picture statistical data are displayed on the device.
在一些实施例中,所述处理单元240,还用于:当接收到所述后台管理设备发送的针对所述第一图库的更新请求时,根据所述更新请求更新所述第一图库中的图片、主题标签中的至少一种。In some embodiments, the processing unit 240 is further configured to: when receiving an update request for the first image gallery sent by the background management device, update the information in the first image gallery according to the update request. At least one of pictures and hashtags.
在一些实施例中,所述处理单元240,还用于:若检测到所述第一图库中已存在与所述更新请求中携带的待更新标签相同的主题标签,则生成表征标签重复的第二提示信息,并拒绝更新所述第一图库。In some embodiments, the processing unit 240 is further configured to: if it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first gallery, generate a third tag indicating that the tag is repeated. The second prompt message and refusal to update the first gallery.
在一些实施例中,所述处理单元240,用于:若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,且所述目标主题标签对应的多张图片,则从所述目标主题标签对应的多张图片中选取一张图片所述作为所述资讯内容的封面图。In some embodiments, the processing unit 240 is configured to: if there is a target topic tag matching the information topic tag in the first gallery, and there are multiple pictures corresponding to the target topic tag, then from Select one picture from the plurality of pictures corresponding to the target topic tag as the cover picture of the information content.
上述所有的技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All the above technical solutions can be combined in any way to form optional embodiments of the present application, and will not be described again one by one.
应理解的是,基于人工智能的资讯配图装置实施例与方法实施例可以相互对应,类似的描述可以参照方法实施例。为避免重复,此处不再赘述。具体地,图8所示的基于人工智能的资讯配图装置可以执行上述基于人工智能的资讯配图方法实施例,并且基于人工智能的资讯配图装置中的各个单元的前述和其它操作和/或功能分别实现上述方法实施例的相应流程,为了简洁,在此不再赘述。It should be understood that the artificial intelligence-based information mapping device embodiments and method embodiments may correspond to each other, and similar descriptions may refer to the method embodiments. To avoid repetition, they will not be repeated here. Specifically, the artificial intelligence-based information mapping device shown in Figure 8 can execute the above-mentioned artificial intelligence-based information mapping method embodiment, and the aforementioned and other operations of each unit in the artificial intelligence-based information mapping device and/or or functions respectively implement the corresponding processes of the above method embodiments. For the sake of simplicity, they will not be described again here.
可选的,本申请还提供了一种计算机设备,包括存储器和处理器,存储器中存储有计算机程序,该处理器执行计算机程序时实现上述各方法实施例中的步骤。Optionally, this application also provides a computer device, including a memory and a processor. A computer program is stored in the memory. When the processor executes the computer program, it implements the steps in the above method embodiments.
图9为本申请实施例提供的计算机设备的结构示意图,该计算机设备可以是终端或服务器。如图9所示,该计算机设备300可以包括:通信接口301,存储器302,处理器303和通信总线304。通信接口301,存储器302,处理器303通过通信总线304实现相互间的通信。通信接口301用于计算机设备300与外部设备进行数据通信。存储器302可用于存储软件程序以及模块,处理器303通过运行存储在存储器302的软件程序以及模块,例如前述方法实施例中的相应操作的软件程序。Figure 9 is a schematic structural diagram of a computer device provided by an embodiment of the present application. The computer device may be a terminal or a server. As shown in FIG. 9 , the computer device 300 may include: a communication interface 301 , a memory 302 , a processor 303 and a communication bus 304 . The communication interface 301, the memory 302, and the processor 303 realize communication with each other through the communication bus 304. The communication interface 301 is used for data communication between the computer device 300 and external devices. The memory 302 can be used to store software programs and modules, and the processor 303 runs the software programs and modules stored in the memory 302, such as the software programs for corresponding operations in the foregoing method embodiments.
可选的,该处理器303可以调用存储在存储器302的软件程序以及模块执行如下操作:获取资讯内容;确定所述资讯内容的资讯主题标签;判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。Optionally, the processor 303 can call software programs and modules stored in the memory 302 to perform the following operations: obtain information content; determine the information topic tag of the information content; determine whether the information topic tag exists in the first gallery Matching target topic tags, wherein the pictures in the first gallery are marked with corresponding topic tags; if there is a target topic tag matching the information topic tag in the first gallery, then the target The picture corresponding to the topic tag serves as the cover image of the information content.
本领域普通技术人员可以理解,上述实施例的各种方法中的全部或部分步骤可以通过指令来完成,或通过指令控制相关的硬件来完成,该指令可以存储于一计算机可读存储介质中,并由处理器进行加载和执行。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructions, or by controlling relevant hardware through instructions. The instructions can be stored in a computer-readable storage medium, and loaded and executed by the processor.
为此,本申请实施例提供一种计算机可读存储介质,其中存储有多条计算机程序,该计算机程序能够被处理器进行加载,以执行本申请实施例所提供的任一种基于人工智能的资讯配图方法中的步骤。以上各个操作的具体实施可参见前面的实施例,在此不再赘述。To this end, embodiments of the present application provide a computer-readable storage medium in which multiple computer programs are stored. The computer programs can be loaded by the processor to execute any of the artificial intelligence-based methods provided by the embodiments of the present application. Steps in the information mapping method. For the specific implementation of each of the above operations, please refer to the previous embodiments and will not be described again here.
其中,该存储介质可以包括:只读存储器(Read Only Memory,ROM)、随机存取记忆体(Random Access Memory,RAM)、磁盘或光盘等。Among them, the storage medium may include: read-only memory (Read Only Memory, ROM), random access memory (Random Access Memory, RAM), magnetic disk or optical disk, etc.
由于该存储介质中所存储的计算机程序,可以执行本申请实施例所提供的任一种基于人工智能的资讯配图方法中的步骤,因此,可以实现本申请实施例所提供的任一种基 于人工智能的资讯配图方法所能实现的有益效果,详见前面的实施例,在此不再赘述。Since the computer program stored in the storage medium can execute the steps in any of the artificial intelligence-based information mapping methods provided by the embodiments of the present application, it is possible to implement any of the artificial intelligence-based information mapping methods provided by the embodiments of the present application. The beneficial effects that the artificial intelligence information mapping method can achieve are detailed in the previous embodiments and will not be described again here.
本申请实施例还提供了一种计算机程序产品,该计算机程序产品包括计算机指令,该计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得计算机设备执行本申请实施例中的任一种基于人工智能的资讯配图方法中的相应流程,为了简洁,在此不再赘述。Embodiments of the present application also provide a computer program product. The computer program product includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, causing the computer device to execute the corresponding process in any of the artificial intelligence-based information mapping methods in the embodiments of the present application, For the sake of brevity, no further details will be given here.
本申请实施例还提供了一种计算机程序,该计算机程序包括计算机指令,计算机指令存储在计算机可读存储介质中。计算机设备的处理器从计算机可读存储介质读取该计算机指令,处理器执行该计算机指令,使得计算机设备执行本申请实施例中的任一种基于人工智能的资讯配图方法中的相应流程,为了简洁,在此不再赘述。An embodiment of the present application also provides a computer program. The computer program includes computer instructions, and the computer instructions are stored in a computer-readable storage medium. The processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions, causing the computer device to execute the corresponding process in any of the artificial intelligence-based information mapping methods in the embodiments of the present application, For the sake of brevity, no further details will be given here.
以上对本申请实施例所提供的一种基于人工智能的资讯配图方法、客户端、服务器、股权激励系统及存储介质进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The above is a detailed introduction to an artificial intelligence-based information mapping method, client, server, equity incentive system and storage medium provided by the embodiments of this application. This article uses specific examples to illustrate the principles and implementation methods of this application. The description of the above embodiments is only used to help understand the method and the core idea of the present application; at the same time, for those skilled in the art, there will be changes in the specific implementation and application scope based on the ideas of the present application. In summary, the content of this specification should not be construed as a limitation on this application.

Claims (19)

  1. 一种基于人工智能的资讯配图方法,其特征在于,所述方法包括:An information mapping method based on artificial intelligence, characterized in that the method includes:
    获取资讯内容;Obtain information content;
    基于预设算法模型确定所述资讯内容的资讯主题标签;Determine the information topic tag of the information content based on a preset algorithm model;
    判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;Determine whether there is a target topic tag matching the information topic tag in the first image gallery, wherein the pictures in the first image gallery are marked with corresponding topic tags;
    若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。If there is a target topic tag matching the information topic tag in the first image gallery, the picture corresponding to the target topic tag is used as the cover image of the information content.
  2. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述基于预设算法模型确定所述资讯内容的资讯主题标签,包括:The information mapping method based on artificial intelligence according to claim 1, wherein the information topic tags of the information content are determined based on a preset algorithm model, including:
    获取所述资讯内容中资讯文本数据对应的资讯标题文本,并基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签。Obtain the information title text corresponding to the information text data in the information content, and process the information title text based on the preset algorithm model to determine the information topic tag of the information content.
  3. 如权利要求2所述的基于人工智能的资讯配图方法,其特征在于,所述预设算法模型包括实体抽取模型和关键词抽取模型,所述基于所述预设算法模型对所述资讯标题文本进行处理,以确定所述资讯内容的资讯主题标签,包括:The information mapping method based on artificial intelligence according to claim 2, characterized in that the preset algorithm model includes an entity extraction model and a keyword extraction model, and the information title is extracted based on the preset algorithm model. The text is processed to determine the information topic tag of the information content, including:
    基于实体抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的实体类型标签,其中,所述实体抽取模型用于抽取所述资讯标题文本中的公司、行业与人名信息;The information title text is processed based on an entity extraction model to obtain the entity type label of the information content, wherein the entity extraction model is used to extract company, industry and person name information in the information title text;
    基于关键词抽取模型对所述资讯标题文本进行处理,得到所述资讯内容的关键词标签,其中,所述关键词抽取模型用于抽取所述资讯标题文本中的行业关键词信息;Process the information title text based on a keyword extraction model to obtain keyword tags for the information content, where the keyword extraction model is used to extract industry keyword information in the information title text;
    根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签。According to the entity type tag and the keyword tag, the information topic tag of the information content is determined.
  4. 如权利要求3所述的基于人工智能的资讯配图方法,其特征在于,所述根据所述实体类型标签与所述关键词标签,确定所述资讯内容的资讯主题标签,包括:The artificial intelligence-based information mapping method according to claim 3, wherein determining the information topic tag of the information content based on the entity type tag and the keyword tag includes:
    根据所述实体类型标签与所述关键词标签生成标签列表;Generate a tag list according to the entity type tag and the keyword tag;
    将所述标签列表中排名第一的标签确定为所述资讯内容的资讯主题标签。The first-ranked tag in the tag list is determined as the information topic tag of the information content.
  5. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述获取资讯内容,包括:The information mapping method based on artificial intelligence according to claim 1, characterized in that said obtaining information content includes:
    基于资讯内容源爬取资讯内容,并将所述资讯内容入库至资讯内容库中;或者Crawl the information content based on the information content source and store the information content into the information content library; or
    响应于后台管理设备发送的针对所述资讯内容的入库请求,获取所述资讯内容,并将所述资讯内容入库至资讯内容库中。In response to the storage request for the information content sent by the background management device, the information content is obtained, and the information content is stored in the information content library.
  6. 如权利要求5所述的基于人工智能的资讯配图方法,其特征在于,在所述获取资讯内容之后,还包括:The information mapping method based on artificial intelligence according to claim 5, characterized in that after obtaining the information content, it also includes:
    基于预设审核规则对所述资讯内容进行资讯审核,其中,所述预设审核规则至少包括敏感词匹配与过滤规则校验;Conduct information review on the information content based on preset review rules, where the preset review rules at least include sensitive word matching and filtering rule verification;
    若所述资讯内容未命中敏感词,且所述资讯内容未命中过滤规则,则执行基于预设算法模型确定所述资讯内容的资讯主题标签的步骤。If the information content does not match the sensitive word and the information content does not match the filtering rule, then the step of determining the information topic tag of the information content based on the preset algorithm model is performed.
  7. 如权利要求6所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence according to claim 6, characterized in that the method further includes:
    若所述资讯内容命中敏感词,和/或所述资讯内容命中过滤规则,则生成审核不通过的第一提示信息,并将所述资讯内容与所述第一提示信息发送至所述后台管理设备。If the information content hits a sensitive word, and/or the information content hits a filtering rule, a first prompt message indicating that the review fails is generated, and the information content and the first prompt message are sent to the backend management equipment.
  8. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence as claimed in claim 1, characterized in that the method further includes:
    若所述第一图库中不存在与所述资讯主题标签相匹配的目标主题标签,则从第二图库中获取随机图片作为所述资讯内容的封面图,其中,所述第一图库中的图片未标注主题标签。If there is no target topic tag matching the information topic tag in the first image gallery, a random picture is obtained from the second image gallery as the cover image of the information content, wherein the picture in the first image gallery is Hashtags not tagged.
  9. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包 括:The information mapping method based on artificial intelligence as claimed in claim 1, characterized in that the method also includes:
    当接收到后台管理设备发送的针对所述资讯内容的封面图配置请求时,根据所述封面图配置请求中携带的配置信息确定所述资讯内容的封面图。When a cover image configuration request for the information content sent by the background management device is received, the cover image of the information content is determined according to the configuration information carried in the cover image configuration request.
  10. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence as claimed in claim 1, characterized in that the method further includes:
    当接收到客户端发送的针对所述资讯内容的封面图展示请求时,向所述客户端发送所述资讯内容的封面图,以在所述客户端的资讯预览界面上显示所述资讯内容的封面图。When receiving a cover image display request for the information content sent by the client, sending the cover image of the information content to the client to display the cover image of the information content on the information preview interface of the client picture.
  11. 如权利要求10所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence according to claim 10, characterized in that the method further includes:
    当接收到所述客户端发送的针对所述封面图的触发操作时,向所述客户端发送所述资讯内容的信息流,以在所述客户端的资讯预览界面上显示所述资讯内容的信息流,所述信息流包括所述资讯内容对应的资讯文本数据、资讯多媒体数据中的至少一种。When receiving the triggering operation for the cover image sent by the client, send the information flow of the information content to the client, so as to display the information of the information content on the information preview interface of the client. Stream, the information stream includes at least one of information text data and information multimedia data corresponding to the information content.
  12. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence as claimed in claim 1, characterized in that the method further includes:
    获取预设时段内的配图记录数据,其中,每个所述配图记录包含成对的资讯主题标签与所述资讯主题标签对应的封面图的确定方式;Acquire the picture record data within a preset period, wherein each picture record contains a pair of information topic tags and a method for determining the cover image corresponding to the information topic tag;
    根据所述配图记录数据,统计所述预设时段内每个所述资讯主题标签的出现频率,以得到标签统计数据;According to the picture recording data, count the frequency of occurrence of each information topic tag within the preset period to obtain tag statistics;
    根据所述配图记录数据,统计所述预设时段内每个所述封面图的确定方式的数据占比,以得到配图统计数据;According to the picture matching record data, count the data proportion of the determination method of each cover image within the preset period to obtain picture matching statistics;
    将所述标签统计数据与所述配图统计数据发送至后台管理设备,以在所述后台管理设备上显示所述标签统计数据与所述配图统计数据。The label statistical data and the accompanying picture statistical data are sent to a backend management device, so that the label statistical data and the accompanying picture statistical data are displayed on the backend management device.
  13. 如权利要求12所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence according to claim 12, characterized in that the method further includes:
    当接收到所述后台管理设备发送的针对所述第一图库的更新请求时,根据所述更新请求更新所述第一图库中的图片、主题标签中的至少一种。When an update request for the first gallery sent by the background management device is received, at least one of pictures and topic tags in the first gallery is updated according to the update request.
  14. 如权利要求13所述的基于人工智能的资讯配图方法,其特征在于,所述方法还包括:The information mapping method based on artificial intelligence according to claim 13, characterized in that the method further includes:
    若检测到所述第一图库中已存在与所述更新请求中携带的待更新标签相同的主题标签,则生成表征标签重复的第二提示信息,并拒绝更新所述第一图库。If it is detected that the same topic tag as the tag to be updated carried in the update request already exists in the first image gallery, a second prompt message indicating a duplication of tags is generated, and the update of the first image gallery is refused.
  15. 如权利要求1所述的基于人工智能的资讯配图方法,其特征在于,所述若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图,包括:The information mapping method based on artificial intelligence according to claim 1, characterized in that if there is a target topic tag matching the information topic tag in the first gallery, the target topic tag will be The corresponding picture serves as the cover image of the information content, including:
    若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,且所述目标主题标签对应的多张图片,则从所述目标主题标签对应的多张图片中选取一张图片所述作为所述资讯内容的封面图。If there is a target topic tag matching the information topic tag in the first gallery, and there are multiple pictures corresponding to the target topic tag, then select one picture from the multiple pictures corresponding to the target topic tag. Said as the cover image of the information content.
  16. 一种基于人工智能的资讯配图装置,其特征在于,所述装置包括:An information mapping device based on artificial intelligence, characterized in that the device includes:
    获取单元,用于获取资讯内容;Acquisition unit, used to obtain information content;
    确定单元,用于基于预设算法模型确定所述资讯内容的资讯主题标签;A determining unit, configured to determine the information topic tag of the information content based on a preset algorithm model;
    判断单元,用于判断第一图库中是否存在与所述资讯主题标签相匹配的目标主题标签,其中,所述第一图库中的图片标注有对应的主题标签;A judging unit configured to judge whether there is a target topic tag matching the information topic tag in the first gallery, wherein the pictures in the first gallery are marked with corresponding topic tags;
    处理单元,用于若所述第一图库中存在与所述资讯主题标签相匹配的目标主题标签,则将所述目标主题标签对应的图片作为所述资讯内容的封面图。A processing unit configured to, if there is a target topic tag matching the information topic tag in the first gallery, use the picture corresponding to the target topic tag as a cover image of the information content.
  17. 一种计算机设备,其特征在于,所述计算机设备包括处理器和存储器,所述存储器中存储有计算机程序,所述处理器通过调用所述存储器中存储的所述计算机程序, 用于执行权利要求1-15任一项所述的基于人工智能的资讯配图方法。A computer device, characterized in that the computer device includes a processor and a memory, a computer program is stored in the memory, and the processor is used to execute the claims by calling the computer program stored in the memory. The artificial intelligence-based information mapping method described in any one of 1-15.
  18. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有计算机程序,所述计算机程序适于处理器进行加载,以执行如权利要求1-15任一项所述的基于人工智能的资讯配图方法。A computer-readable storage medium, characterized in that the computer-readable storage medium stores a computer program, and the computer program is suitable for loading by a processor to execute the method based on any one of claims 1-15. Artificial intelligence information mapping method.
  19. 一种计算机程序产品,包括计算机指令,其特征在于,所述计算机指令被处理器执行时实现如权利要求1-15任一项所述的基于人工智能的资讯配图方法。A computer program product, including computer instructions, characterized in that when the computer instructions are executed by a processor, the information mapping method based on artificial intelligence as described in any one of claims 1-15 is implemented.
PCT/CN2022/103226 2022-07-01 2022-07-01 Artificial intelligence-based method for configuring image for information, device, medium, and program product WO2024000554A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
PCT/CN2022/103226 WO2024000554A1 (en) 2022-07-01 2022-07-01 Artificial intelligence-based method for configuring image for information, device, medium, and program product
CN202280002306.3A CN115298660A (en) 2022-07-01 2022-07-01 Information mapping method, apparatus, medium, and program product based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/103226 WO2024000554A1 (en) 2022-07-01 2022-07-01 Artificial intelligence-based method for configuring image for information, device, medium, and program product

Publications (1)

Publication Number Publication Date
WO2024000554A1 true WO2024000554A1 (en) 2024-01-04

Family

ID=83819277

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/103226 WO2024000554A1 (en) 2022-07-01 2022-07-01 Artificial intelligence-based method for configuring image for information, device, medium, and program product

Country Status (2)

Country Link
CN (1) CN115298660A (en)
WO (1) WO2024000554A1 (en)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239535A (en) * 2014-09-22 2014-12-24 重庆邮电大学 Method and system for matching pictures with characters, server and terminal
CN108733779A (en) * 2018-05-04 2018-11-02 百度在线网络技术(北京)有限公司 The method and apparatus of text figure
CN112749291A (en) * 2020-08-25 2021-05-04 腾讯科技(深圳)有限公司 Image-text matching method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104239535A (en) * 2014-09-22 2014-12-24 重庆邮电大学 Method and system for matching pictures with characters, server and terminal
CN108733779A (en) * 2018-05-04 2018-11-02 百度在线网络技术(北京)有限公司 The method and apparatus of text figure
CN112749291A (en) * 2020-08-25 2021-05-04 腾讯科技(深圳)有限公司 Image-text matching method and device

Also Published As

Publication number Publication date
CN115298660A (en) 2022-11-04

Similar Documents

Publication Publication Date Title
US8099406B2 (en) Method for human editing of information in search results
US9990422B2 (en) Contextual analysis engine
US8725717B2 (en) System and method for identifying topics for short text communications
US10235681B2 (en) Text extraction module for contextual analysis engine
CA3138730C (en) Public-opinion analysis method and system for providing early warning of enterprise risks
US10713291B2 (en) Electronic document generation using data from disparate sources
US10255363B2 (en) Refining search query results
CN113079417B (en) Method, device and equipment for generating bullet screen and storage medium
CN101568921A (en) Dynamic pricing models for digital content
US20180189359A1 (en) Systems and Methods for Harvesting Data Associated with Fraudulent Content in a Networked Environment
US20190213208A1 (en) Interactive patent visualization systems and methods
US11100159B2 (en) Image metadata enrichment generator
CN112632405A (en) Recommendation method, device, equipment and storage medium
JP2020524833A (en) Method and apparatus for displaying textual information
US10733240B1 (en) Predicting contract details using an unstructured data source
CN110795613A (en) Commodity searching method, device and system and electronic equipment
CN112100216A (en) Creative keyword processing method and device
WO2024000554A1 (en) Artificial intelligence-based method for configuring image for information, device, medium, and program product
KR20100136438A (en) Method and apparutus for automatic contents generation
CN113407678A (en) Knowledge graph construction method, device and equipment
Anh Web Scraping: A Big Data Building Tool And Its Status In The Fintech Sector In Viet Nam
CN106716403A (en) Automated generation of web site entry pages
CN112991033A (en) Method and device for determining value attribute of article
US11886485B1 (en) Computer-implemented visual query matching methods, and systems for implementing thereof
US20240176810A1 (en) Computer-Implemented Visual Query Matching Methods, and Systems for Implementing Thereof

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22948655

Country of ref document: EP

Kind code of ref document: A1