CN104077320B - method and device for generating information to be issued - Google Patents

Info

Publication number
CN104077320B
Authority
CN
China
Prior art keywords
information
keyword
source
demonstration
keywords
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201310107953.3A
Other languages
Chinese (zh)
Other versions
CN104077320A (en)
Inventor
钟淑仪
苏亮
徐明泉
顾忻语
刘艳
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201310107953.3A priority Critical patent/CN104077320B/en
Publication of CN104077320A publication Critical patent/CN104077320A/en
Application granted granted Critical
Publication of CN104077320B publication Critical patent/CN104077320B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

The invention provides a method and a device for generating information to be published. The method comprises the following steps: acquiring a central word corresponding to at least one source keyword according to the at least one source keyword; determining demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information comprises demonstration information to be published and a demonstration keyword; determining corresponding structured information based on the demonstration information to be published and the demonstration keyword; and generating, based on the determined structured information, each piece of information to be published corresponding to the at least one source keyword. The invention can provide high-quality demonstration keywords and demonstration information to be published that suit a plurality of keywords, so that the user obtains more effective structured information and can quickly generate the information to be published for the plurality of keywords, which improves the user's operation efficiency.

Description

Method and device for generating information to be issued
Technical Field
The present invention relates to the field of computer technologies, and in particular, to a method and an apparatus for generating information to be published.
Background
In the prior art, when structured information such as a template is recommended to a user, only a template in a fixed format can be provided to the user, and a template meeting the characteristics of the user cannot be recommended according to the actual requirements of the user. Therefore, the recommendation method in the prior art has low efficiency and poor effect.
Disclosure of Invention
The invention aims to provide a method and a device for generating information to be published.
According to one aspect of the invention, a method for generating information to be published is provided, wherein the method comprises the following steps:
a. acquiring a central word corresponding to at least one source keyword according to the at least one source keyword;
b. determining demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information comprises demonstration information to be published and a demonstration keyword;
c. determining corresponding structured information based on the demonstration information to be published and the demonstration keyword; and
d. generating, based on the determined structured information, each piece of information to be published corresponding to the at least one source keyword.
According to another aspect of the present invention, there is provided an information generating apparatus for generating information to be published, wherein the information generating apparatus includes:
first acquiring means for acquiring a central word corresponding to at least one source keyword according to the at least one source keyword;
first determining means for determining demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information comprises demonstration information to be published and a demonstration keyword;
second determining means for determining corresponding structured information based on the demonstration information to be published and the demonstration keyword; and
first generating means for generating, based on the determined structured information, each piece of information to be published corresponding to the at least one source keyword.
Compared with the prior art, the invention has the following advantages: it can provide high-quality demonstration keywords and demonstration information to be published that suit a plurality of keywords, so that the user obtains more effective structured information and can quickly generate the information to be published for the plurality of keywords, which improves the user's operation efficiency.
Drawings
Other features, objects and advantages of the invention will become more apparent upon reading of the detailed description of non-limiting embodiments made with reference to the following drawings:
FIG. 1 illustrates a flow chart of a method for generating information to be published;
FIG. 2 is a schematic structural diagram of an information generating apparatus for generating information to be published.
The same or similar reference numbers in the drawings identify the same or similar elements.
Detailed Description
The present invention is described in further detail below with reference to the attached drawing figures.
FIG. 1 illustrates a flow chart of a method for generating information to be published. The method according to the present invention includes step S1, step S2, step S3, and step S4.
The method according to the invention is implemented by an information generating apparatus in a computer device. The computer device is an electronic device capable of automatically performing numerical calculation and/or information processing according to preset or stored instructions; its hardware includes, but is not limited to, a microprocessor, an application-specific integrated circuit (ASIC), a field-programmable gate array (FPGA), a digital signal processor (DSP), an embedded device, and the like. The computer device includes network devices and user equipment. The network device includes, but is not limited to, a single network server, a server group consisting of a plurality of network servers, or a cloud-computing-based cloud consisting of a large number of hosts or network servers, where cloud computing is a form of distributed computing: a super virtual computer consisting of a group of loosely coupled computers. The user equipment includes, but is not limited to, any electronic product that can interact with a user through a keyboard, a mouse, a remote controller, a touch panel, or a voice control device, for example, a personal computer, a tablet computer, a smart phone, a PDA, a game console, or an IPTV. The network in which the user equipment and the network device are located includes, but is not limited to, the Internet, a wide area network, a metropolitan area network, a local area network, a VPN, and the like.
It should be noted that the user equipment, the network device and the network described above are only examples; other existing or future user equipment, network devices and networks are also included in the scope of the present invention and are incorporated herein by reference.
Referring to FIG. 1, in step S1, the information generating apparatus acquires a central word corresponding to at least one source keyword according to the at least one source keyword.
The information to be published comprises information which is generated based on the keywords and is used for publishing in the network. Preferably, the information to be published includes keywords selected by the user, and the source keywords include keywords for determining structured information.
Specifically, the information generating apparatus extracts, as the central word, a word that is contained in some or all of the at least one source keyword.
Preferably, the information generating apparatus selects one or more words as the central words from the words appearing in the obtained plurality of source keywords, according to how frequently each word recurs.
More preferably, the manner in which the information generating apparatus selects one or more words as the central words from the words appearing in the plurality of source keywords includes any of:
- selecting, as the central word, the word with the highest frequency of repeated occurrence among the plurality of words;
- selecting, as the central word, each word among the plurality of words whose frequency of repeated occurrence is above a predetermined threshold.
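The two selection modes above can be sketched as follows. This is only an illustrative sketch, not the patented implementation: it assumes keywords are whitespace-separated, treats every contiguous run of up to three words as a candidate, and counts each candidate once per keyword. The function names and the sample keywords are hypothetical.

```python
from collections import Counter


def candidate_terms(keyword, max_len=3):
    """Return the distinct contiguous word runs (1..max_len words) in one keyword."""
    words = keyword.lower().split()
    terms = set()
    for n in range(1, max_len + 1):
        for i in range(len(words) - n + 1):
            terms.add(" ".join(words[i:i + n]))
    return terms


def term_frequencies(source_keywords, max_len=3):
    """Count in how many source keywords each candidate term appears."""
    counts = Counter()
    for keyword in source_keywords:
        counts.update(candidate_terms(keyword, max_len))
    return counts


def most_frequent_term(source_keywords, max_len=3):
    """Mode 1: the candidate with the highest frequency (ties: longer term first)."""
    counts = term_frequencies(source_keywords, max_len)
    if not counts:
        return None
    return max(counts, key=lambda t: (counts[t], len(t.split()), t))


def terms_above_threshold(source_keywords, threshold, max_len=3):
    """Mode 2: every candidate whose frequency is above the threshold."""
    counts = term_frequencies(source_keywords, max_len)
    return sorted(t for t, c in counts.items() if c > threshold)


# Hypothetical source keywords, used only for illustration.
keywords = [
    "mobile phone quote",
    "iphone mobile phone quote",
    "latest model mobile phone quote",
    "popular mobile phone quote",
    "cheap mobile phone quote",
]
print(most_frequent_term(keywords))
print(terms_above_threshold(keywords, 4))
```

Preferring the longer term on a tie is a design choice of this sketch; it keeps a phrase such as "mobile phone quote" from being displaced by its own sub-words, which necessarily occur at least as often.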
According to a first example of the present invention, the currently existing source keywords are shown in table 1 below:
TABLE 1
The information generating apparatus acquires, as the central word, the word "mobile phone quote", which recurs more than 4 times across all the source keywords in Table 1.
It should be noted that the foregoing examples are only intended to better illustrate the technical solutions of the present invention and do not limit the present invention; those skilled in the art should understand that any implementation manner of obtaining the central word corresponding to at least one source keyword according to the at least one source keyword should be included in the scope of the present invention.
Next, in step S2, the information generating device determines demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information includes demonstration information to be issued and demonstration keywords.
Preferably, the demonstration information to be published includes demonstration advertisement information.
Specifically, the information generating apparatus first obtains demonstration information to be published corresponding to the central word, and then selects one source keyword from the at least one source keyword as the demonstration keyword.
Preferably, the manner in which the information generating apparatus acquires the demonstration information to be published corresponding to the central word includes, but is not limited to: performing a matching search for the central word among a plurality of pieces of information to be published corresponding to the at least one source keyword, to obtain demonstration information to be published that contains the central word.
For example, the information generating apparatus performs a matching query in a predetermined advertisement information base according to the obtained at least one central word, to obtain one or more pieces of advertisement information containing the at least one central word. Preferably, the predetermined advertisement information base may be divided into a plurality of bases according to the industries to which the advertisement information corresponds. Preferably, the advertisement information base is updated periodically.
Preferably, the manner in which the information generating apparatus selects one source keyword from the at least one source keyword as the demonstration keyword includes, but is not limited to: selecting, from the at least one source keyword, the longest source keyword that contains the central word as the demonstration keyword.
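As a minimal sketch of step S2, the matching search and the choice of the demonstration keyword might look like the code below. It assumes that "matching" means case-insensitive substring containment and that "longest" is measured in characters; both are assumptions, as are the function names and the sample data.

```python
def find_demonstration_information(central_word, library):
    """Return every library entry that contains the central word (case-insensitive)."""
    needle = central_word.lower()
    return [entry for entry in library if needle in entry.lower()]


def pick_demonstration_keyword(central_word, source_keywords):
    """Return the longest source keyword containing the central word, or None."""
    needle = central_word.lower()
    candidates = [kw for kw in source_keywords if needle in kw.lower()]
    if not candidates:
        return None
    # max() keeps the first candidate when several share the maximum length.
    return max(candidates, key=len)


# Hypothetical library and keywords, used only for illustration.
library = [
    "Popular mobile phone quote latest update",
    "Spring suits at special prices",
    "Mobile phone quote for every brand",
]
source_keywords = [
    "mobile phone quote",
    "iphone mobile phone quote",
    "latest model mobile phone quote",
]

print(find_demonstration_information("mobile phone quote", library))
print(pick_demonstration_keyword("mobile phone quote", source_keywords))
```

A per-industry library, as the text suggests, could be modelled by passing a different `library` list for each industry.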
Continuing with the first example, the information to be published in the predetermined library of information to be published is shown in Table 2 below:
TABLE 2
The information generating apparatus matches the central word "mobile phone quote" obtained in step S1 against each piece of information to be published in Table 2, and obtains the following demonstration information to be published that matches the central word:
"Latest-model mobile phone quotes for multiple mobile phone brands, all in BB mall!"
"Popular mobile phone quote latest update".
The information generating apparatus selects the longest source keyword, "the most popular Apple mobile phone price", from the source keywords in Table 1 as the demonstration keyword.
It should be noted that the foregoing examples are only for better illustrating the technical solutions of the present invention, and not for limiting the present invention, and those skilled in the art should understand that any implementation manner for determining the exemplary information corresponding to the central word and the at least one source keyword should be included in the scope of the present invention.
Next, in step S3, the information generating apparatus determines corresponding structured information based on the demonstration to-be-issued information and the demonstration keyword.
Specifically, the information generating apparatus recommends the obtained demonstration information to be published and demonstration keyword to the user, and determines the structured information corresponding to them according to the corresponding user operation.
Continuing with the foregoing first example, in step S2 the information generating apparatus obtains the demonstration keyword "the most popular Apple mobile phone price" and the following two pieces of demonstration information to be published:
"Latest-model mobile phone quotes for multiple mobile phone brands, all in BB mall!"
"Popular mobile phone quote latest update".
The information generating apparatus presents the obtained demonstration keyword and demonstration information to be published to the user. It detects that the user selects one piece of demonstration information to be published, "popular mobile phone quote latest update", and replaces the part "popular mobile phone quote" in it with the recommended demonstration keyword "the most popular Apple mobile phone price", which yields the new information to be published "the most popular Apple mobile phone price latest update". Based on this new information, the apparatus determines the structured information to be "# latest update", where "#" is a wildcard that can be replaced by other keywords.
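The substitution and wildcard steps of this example can be sketched as plain string operations. The sketch assumes the user's choices (which text is selected and which part is replaced) arrive as ordinary strings; the function names are hypothetical.

```python
WILDCARD = "#"


def substitute_keyword(demo_information, replaced_part, demo_keyword):
    """Replace the selected part of the demonstration text with the demonstration keyword."""
    if replaced_part not in demo_information:
        raise ValueError("the part to replace does not occur in the demonstration text")
    return demo_information.replace(replaced_part, demo_keyword, 1)


def derive_structured_information(information, keyword_part):
    """Turn the keyword part of a text into the wildcard, yielding a template."""
    if keyword_part not in information:
        raise ValueError("the keyword part does not occur in the text")
    return information.replace(keyword_part, WILDCARD, 1)


demo = "popular mobile phone quote latest update"
keyword = "the most popular Apple mobile phone price"

new_information = substitute_keyword(demo, "popular mobile phone quote", keyword)
template = derive_structured_information(new_information, keyword)
print(new_information)
print(template)
```

Only the first occurrence is replaced in each call, so a text that happens to repeat the keyword does not end up with more than one wildcard.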
Preferably, when the demonstration to-be-published information in each demonstration information is modified, the information generating device generates structured information corresponding to the demonstration information based on the modified demonstration to-be-published information.
The information generation device detects modification operation of a user on demonstration information to be published, and generates structured information based on the modified demonstration information to be published.
For example, suppose the demonstration information obtained in step S2 includes the demonstration information to be published "2011 spring suit special price exhibition and sales". The information generating apparatus detects that the user modifies this demonstration information to be published into "2013 spring suit special price exhibition and sales", and detects that the user selects "suit special price exhibition and sales" in it to correspond to the wildcard "#". The information generating apparatus then determines that the structured information corresponding to the demonstration information to be published includes "2013 spring #".
It should be noted that, the above examples are only for better illustrating the technical solutions of the present invention, and not for limiting the present invention, and those skilled in the art should understand that any implementation manner for determining the corresponding structured information based on the exemplary to-be-released information and the exemplary keywords should be included in the scope of the present invention.
Next, in step S4, the information generating apparatus generates, based on the determined structured information, each piece of information to be published corresponding to the at least one source keyword.
Specifically, the information generating apparatus combines each of the at least one source keyword with the structured information determined in step S3 to generate the respective pieces of information to be published.
Continuing with the foregoing first example, the information generating apparatus replaces the symbol "#" in the structured information "# latest update" with each source keyword in Table 1 in turn, and obtains the pieces of information to be published shown in Table 3 below:
TABLE 3
Serial number | Information to be published
1 | Latest model iPhone latest update
2 | Mobile phone quote latest update
3 | Lowest mobile phone price latest update
4 | Mobile phone price latest update
5 | iPhone mobile phone quote latest update
6 | Latest model mobile phone quote latest update
7 | Popular mobile phone brand price latest update
8 | The most popular Apple mobile phone price latest update
9 | Popular mobile phone quote latest update
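Step S4 amounts to filling the wildcard once per source keyword. The following is a minimal sketch under the assumption that the structured information is a string containing the wildcard "#"; the function name and the sample keywords are hypothetical.

```python
def generate_information(structured_information, source_keywords, wildcard="#"):
    """Produce one piece of information to be published per source keyword."""
    if wildcard not in structured_information:
        raise ValueError("the structured information contains no wildcard")
    return [structured_information.replace(wildcard, kw) for kw in source_keywords]


# Hypothetical source keywords, used only for illustration.
results = generate_information(
    "# latest update",
    ["latest model iPhone", "mobile phone quote", "popular mobile phone quote"],
)
for number, text in enumerate(results, start=1):
    print(number, text)
```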
It should be noted that, the foregoing examples are only for better illustrating the technical solutions of the present invention, and are not limiting to the present invention, and those skilled in the art should understand that any implementation manner for generating each to-be-published information corresponding to the at least one source keyword based on the determined structured information should be included in the scope of the present invention.
According to one of the preferred embodiments of the present invention, the at least one source keyword has the same language structure, and the method according to the present invention further comprises step S5 (not shown), wherein the step S1 comprises step S101 (not shown).
In step S5, the information generating apparatus detects the language structure of the at least one source keyword.
Specifically, the information generating apparatus determines the language structure of a keyword by applying text processing techniques, such as word segmentation, to the keyword.
The language structure indicates information such as the part of speech and the connection order of each word or phrase contained in the keyword.
For example, the information generating apparatus segments the keyword and analyzes the part of speech of each resulting word, and thereby identifies the language structure of the keyword as "adverb + verb + noun".
As another example, the information generating apparatus performs text processing on the keyword and determines its language structure to be "noun phrase + verb phrase + noun".
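To make the idea of a language structure concrete, the sketch below maps each word of a keyword to a part of speech and joins the results in order. It is a toy illustration only: the tiny lexicon is hypothetical, whitespace splitting stands in for real word segmentation, and a practical system would use a proper segmenter and part-of-speech tagger instead.

```python
# Hypothetical toy lexicon; a real system would use a part-of-speech tagger.
TOY_LEXICON = {
    "quickly": "adverb",
    "cheaply": "adverb",
    "buy": "verb",
    "rent": "verb",
    "phone": "noun",
    "car": "noun",
    "cheap": "adjective",
}


def language_structure(keyword, lexicon=TOY_LEXICON):
    """Return the keyword's structure, e.g. 'adverb + verb + noun'."""
    words = keyword.lower().split()
    return " + ".join(lexicon.get(word, "unknown") for word in words)


def share_structure(keywords, lexicon=TOY_LEXICON):
    """True when every keyword has the same language structure."""
    structures = {language_structure(kw, lexicon) for kw in keywords}
    return len(structures) == 1


print(language_structure("quickly buy phone"))
print(share_structure(["quickly buy phone", "cheaply rent car"]))
```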
Next, in step S101, the information generating apparatus acquires a central word included in the at least one source keyword and having the same language structure as the at least one source keyword, according to the at least one source keyword.
The manner in which the information generating apparatus acquires the central word that is contained in the at least one source keyword and has the same language structure as the at least one source keyword is the same as or similar to the manner, described above, in which it acquires the central word corresponding to the at least one source keyword according to the at least one source keyword, and is not repeated here.
According to a further preferred embodiment of the present invention, the plurality of source keywords respectively belong to a plurality of keyword groups, wherein each keyword group contains at least one source keyword. The method according to the present embodiment includes step S6 (not shown), step S7 (not shown), and step S8 (not shown).
In step S6, the information generating apparatus acquires similarity information between the respective keyword groups.
Specifically, the information generating apparatus compares the keywords in the respective keyword groups to obtain the similarity between every two keyword groups.
Next, in step S7, the information generating apparatus selects one or more keyword groups similar to the group to be processed, based on the similarity information between the respective keyword groups.
Specifically, the information generating means selects one or more keyword groups similar to the group to be processed, based on the similarity information between the group to be processed and each of the other keyword groups obtained in step S6.
Preferably, the information generating apparatus judges whether the similarity information between the group to be processed and each of the other keyword groups is greater than a predetermined threshold, and determines that the group to be processed is similar to another keyword group when the similarity information between them is greater than the predetermined threshold.
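The text does not fix a particular similarity measure. As one possible sketch of steps S6 and S7, the code below uses the Jaccard overlap of the words appearing in two groups; this measure, the function names, and the sample groups are all assumptions made for illustration.

```python
def group_similarity(group_a, group_b):
    """Jaccard overlap of the word sets of two keyword groups (an assumed measure)."""
    words_a = {word for kw in group_a for word in kw.lower().split()}
    words_b = {word for kw in group_b for word in kw.lower().split()}
    union = words_a | words_b
    if not union:
        return 0.0
    return len(words_a & words_b) / len(union)


def similar_groups(target_name, groups, threshold):
    """Names of the other groups whose similarity to the target exceeds the threshold."""
    target = groups[target_name]
    return [
        name
        for name, keywords in groups.items()
        if name != target_name and group_similarity(target, keywords) > threshold
    ]


# Hypothetical keyword groups, used only for illustration.
groups = {
    "phones": ["mobile phone quote", "iphone price"],
    "handsets": ["mobile phone price", "iphone quote"],
    "suits": ["spring suit sale"],
}
print(similar_groups("phones", groups, 0.5))
```

Step S8 would then reuse the demonstration information already stored for each group returned by `similar_groups`, if any exists.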
Next, in step S8, when the selected keyword group has corresponding demonstration information, the information generating apparatus takes the demonstration information corresponding to the keyword group as the demonstration information of the currently processed keyword group.
It should be noted that, according to the method of the present embodiment, steps S6 to S7 are executed before step S1, and step S8 is executed before step S3; there is no required order between steps S1 to S2 and step S8.
For example, the information generating apparatus first performs steps S6 to S7 to acquire a plurality of other keyword groups similar to the group to be processed. It then performs steps S1 and S2 to acquire the demonstration information of the group to be processed, and at the same time performs step S8 to acquire the demonstration information of each other keyword group similar to the group to be processed, and presents all the acquired demonstration information to the user. The information generating apparatus then continues with steps S3 and S4 to obtain, in batch, the pieces of information to be published generated from the keywords in the group to be processed.
Preferably, the method according to the present embodiment further includes a step S9 (not shown) and a step S10 (not shown) performed before the step S6.
In step S9, the information generating apparatus groups the plurality of source keywords according to a predetermined grouping rule to obtain at least one group of keyword groups containing at least one source keyword.
Wherein the predetermined grouping rule is used for grouping a plurality of source keywords according to their keyword-related information.
Preferably, the information generating means acquires keyword-related information related to each keyword to group each keyword according to the keyword-related information. Wherein the keyword related information includes but is not limited to at least any one of the following:
1) the language structure of the keyword; for example, a keyword formed as "noun (or noun phrase) + verb (or verb phrase)", or a keyword formed as "noun 1 + adjective + noun 2";
2) classification information of the products and/or services corresponding to the keyword, and the like; for example, the product corresponding to the keyword "iPhone" is "mobile phone", and its classification information is "electronic product".
For example, the initial grouping may be performed based on the classification information of the products and/or services corresponding to the keywords, and the results of the initial grouping are further grouped by using the language structure to obtain each keyword group, where each keyword in the obtained keyword groups corresponds to the same classification information of the products and/or services and has the same language structure.
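The two-stage grouping just described can be sketched by keying each keyword on the pair (classification, language structure); grouping on the pair gives the same result as grouping by classification first and then splitting each group by structure. The lookup tables below are hypothetical stand-ins for a real classifier and a real structure analyzer.

```python
from collections import defaultdict


def group_keywords(keywords, classification_of, structure_of):
    """Group keywords that share both classification and language structure."""
    groups = defaultdict(list)
    for keyword in keywords:
        key = (classification_of(keyword), structure_of(keyword))
        groups[key].append(keyword)
    return dict(groups)


# Hypothetical lookup tables, used only for illustration.
classifications = {
    "iphone price": "electronic product",
    "cheap iphone": "electronic product",
    "laptop price": "electronic product",
    "suit price": "clothing",
}
structures = {
    "iphone price": "noun + noun",
    "cheap iphone": "adjective + noun",
    "laptop price": "noun + noun",
    "suit price": "noun + noun",
}

grouped = group_keywords(list(classifications), classifications.get, structures.get)
for key, members in grouped.items():
    print(key, members)
```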
More preferably, the predetermined grouping rule includes, but is not limited to, determining a corresponding keyword grouping based on the manner of grouping the plurality of keywords disclosed in the prior patent CN 201110216772.5.
Next, in step S10, the information generating apparatus selects one keyword group as the group to be processed.
Preferably, the information generating means selects one keyword group as the group to be processed in accordance with a selection operation by the user.
The method can provide high-quality demonstration keywords and demonstration information to be published that suit a plurality of keywords, so that the user obtains more effective structured information and can quickly generate the information to be published for the plurality of keywords, which improves the user's operation efficiency.
FIG. 2 is a schematic structural diagram of an information generating apparatus for generating information to be published. The information generating apparatus according to the present invention includes first acquiring means 1, first determining means 2, second determining means 3, and first generating means 4.
Referring to FIG. 2, the first acquiring means 1 acquires a central word corresponding to at least one source keyword according to the at least one source keyword.
The information to be published comprises information which is generated based on the keywords and is used for publishing in the network. Preferably, the information to be published includes keywords selected by the user, and the source keywords include keywords for determining structured information.
Specifically, the first acquiring means 1 extracts, as the central word, a word that is contained in some or all of the at least one source keyword.
Preferably, the first acquiring means 1 selects one or more words as the central words from the words appearing in the acquired plurality of source keywords, according to how frequently each word recurs.
More preferably, the manner in which the first acquiring means 1 selects one or more words as the central words from the words appearing in the plurality of source keywords includes any of:
- selecting, as the central word, the word with the highest frequency of repeated occurrence among the plurality of words;
- selecting, as the central word, each word among the plurality of words whose frequency of repeated occurrence is above a predetermined threshold.
According to a first example of the present invention, the currently existing source keywords are shown in table 4 below:
TABLE 4
The first acquiring means 1 acquires, as the central word, the word "mobile phone quote", which recurs more than 4 times across all the source keywords in Table 4.
It should be noted that the foregoing examples are only intended to better illustrate the technical solutions of the present invention and do not limit the present invention; those skilled in the art should understand that any implementation manner of obtaining the central word corresponding to at least one source keyword according to the at least one source keyword should be included in the scope of the present invention.
Next, the first determining means 2 determines demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information includes demonstration information to be published and a demonstration keyword.
Preferably, the demonstration information to be published includes demonstration advertisement information.
First, second acquiring means (not shown) in the first determining means 2 obtains demonstration information to be published corresponding to the central word; then, first selecting means (not shown) in the first determining means 2 selects one source keyword from the at least one source keyword as the demonstration keyword.
Preferably, the second acquiring means performs a matching search for the central word among a plurality of pieces of information to be published corresponding to the at least one source keyword, to obtain demonstration information to be published that contains the central word.
For example, the second acquiring means performs a matching query in a predetermined advertisement information base according to the obtained at least one central word, to obtain one or more pieces of advertisement information containing the at least one central word. Preferably, the predetermined advertisement information base may be divided into a plurality of bases according to the industries to which the advertisement information corresponds. Preferably, the advertisement information base is updated periodically.
Preferably, the manner in which the first selecting means selects one source keyword from the at least one source keyword as the demonstration keyword includes, but is not limited to: selecting, from the at least one source keyword, the longest source keyword that contains the central word as the demonstration keyword.
Continuing with the first example, the information to be published in the predetermined information to be published library is shown in table 5 below:
TABLE 5
The second acquiring means matches the central word "mobile phone quote" obtained by the first acquiring means 1 against each piece of information to be published in Table 5, and obtains the following demonstration information to be published that matches the central word:
"Latest-model mobile phone quotes for multiple mobile phone brands, all in BB mall!"
"Popular mobile phone quote latest update".
The first selecting means selects the longest source keyword, "the most popular Apple mobile phone price", from the source keywords in Table 4 as the demonstration keyword.
It should be noted that the foregoing examples are only for better illustrating the technical solutions of the present invention, and not for limiting the present invention, and those skilled in the art should understand that any implementation manner for determining the exemplary information corresponding to the central word and the at least one source keyword should be included in the scope of the present invention.
Then, the second determining device 3 determines the corresponding structured information based on the demonstration to-be-released information and the demonstration keyword.
Specifically, the second determining device 3 recommends the obtained demonstration to-be-published information and demonstration keyword to the user, and determines the structured information corresponding to them according to the corresponding user operations.
Continuing with the foregoing first example, the first determining device 2 obtains the exemplary keyword "the most popular apple mobile phone price" and the following two pieces of exemplary information to be published:
"the latest model of handset quotes for multiple handset brands, all in BB mall!"
"popular mobile phone quoted price was updated recently".
The second determining device 3 presents the obtained demonstration keyword and demonstration to-be-published information to the user. It detects that the user selects one piece of the demonstration to-be-published information, "popular mobile phone offer latest update", and replaces the portion "popular mobile phone offer" with the recommended demonstration keyword "the most popular apple mobile phone price", yielding the new information to be published "the most popular apple mobile phone price latest update". Based on this new information, the second determining device 3 determines the structured information to be "# latest update", where "#" represents a wildcard that can be replaced by other keywords.
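The derivation of structured information from a user's selection can be sketched as below. This is only an illustration under stated assumptions: the helper names `substitute_keyword` and `to_structured_info` are hypothetical, the selected portion and the keyword are assumed to occur as contiguous substrings, and only the first occurrence is replaced.

```python
WILDCARD = "#"


def substitute_keyword(info, selected_part, demo_keyword):
    """Replace the user-selected part of the information with the demonstration keyword."""
    if selected_part not in info:
        raise ValueError("selected part does not occur in the information")
    return info.replace(selected_part, demo_keyword, 1)


def to_structured_info(new_info, demo_keyword, wildcard=WILDCARD):
    """Turn the new information into a template by swapping the keyword for a wildcard."""
    if demo_keyword not in new_info:
        raise ValueError("demonstration keyword does not occur in the information")
    return new_info.replace(demo_keyword, wildcard, 1)
```

With an assumed English rendering such as "popular phone quotes latest update", replacing "popular phone quotes" by a keyword and then swapping that keyword for the wildcard leaves the template "# latest update".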
Preferably, when the exemplary to-be-published information in the respective exemplary information is modified, the second generating means (not shown) in the second determining means 3 generates the structured information corresponding to the exemplary information based on the modified exemplary to-be-published information.
The second generating device detects the modification operation of the user on the demonstration information to be published and generates the structured information based on the modified demonstration information to be published.
For example, the exemplary information obtained by the first determining device 2 includes the exemplary to-be-published information "2011 spring suit-dress special price exhibition and sales". The second generating device detects that the user modifies this information to "2013 spring suit-dress special price exhibition and sales", and further detects that the user marks the portion "suit-dress special price exhibition and sales" as corresponding to the wildcard "#". The second generating device then determines that the structured information corresponding to the exemplary to-be-published information includes "2013 spring #".
It should be noted that, the above examples are only for better illustrating the technical solutions of the present invention, and not for limiting the present invention, and those skilled in the art should understand that any implementation manner for determining the corresponding structured information based on the exemplary to-be-released information and the exemplary keywords should be included in the scope of the present invention.
Next, the first generating device 4 generates, based on the determined structured information, each piece of information to be published corresponding to the at least one source keyword.
Specifically, the first generating device 4 combines each of the at least one source keyword with the structured information determined by the second determining device 3 to generate the respective pieces of information to be published.
Continuing with the foregoing first example, the first generating device 4 replaces the "#" symbol in the structured information "# latest update" with each of the source keywords in Table 4 in turn, to obtain the pieces of information to be published shown in Table 6 below:
TABLE 6
Serial number    Information to be published
1                Latest model iPhone latest update
2                Recent update of mobile phone quoted price
3                Lowest price latest update of mobile phone
4                Latest update of mobile phone price
5                iPhone handset offer recent update
6                Latest model of mobile phone quotation latest update
7                The price of popular mobile phone brand is updated recently
8                The most popular apple mobile phones' price was updated recently
9                Recent update of popular mobile phone quotation
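The batch substitution that produces a table like Table 6 can be sketched as follows. It is a minimal sketch under assumptions: the function name `generate_infos` is hypothetical, the wildcard is the single character "#", and every occurrence of the wildcard is replaced.

```python
def generate_infos(structured_info, source_keywords, wildcard="#"):
    """Fill the wildcard of the structured information with each source keyword.

    Returns a dict mapping each source keyword to its generated information.
    """
    if wildcard not in structured_info:
        raise ValueError("structured information contains no wildcard")
    return {kw: structured_info.replace(wildcard, kw) for kw in source_keywords}
```

Returning a mapping rather than a bare list keeps each generated piece of information tied to the source keyword it was built from.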
It should be noted that, the foregoing examples are only for better illustrating the technical solutions of the present invention, and are not limiting to the present invention, and those skilled in the art should understand that any implementation manner for generating each to-be-published information corresponding to the at least one source keyword based on the determined structured information should be included in the scope of the present invention.
According to a preferred embodiment of the present invention, the at least one source keyword has the same language structure, and the information generating apparatus according to the present invention further comprises a detecting device (not shown), wherein the first obtaining device comprises a sub-obtaining device (not shown).
The detecting means detects a language structure of the at least one source keyword.
Specifically, the detection device determines the language structure of the keyword by performing a text processing technique such as segmentation on the keyword.
The language structure indicates information such as the property (for example, the part of speech) and the connection order of each word or phrase contained in the keyword.
For example, by segmenting the keyword and analyzing the part of speech of each resulting segment, the detection device determines that the keyword adopts a language structure such as "adverb + verb + noun".
For another example, the detection device determines that the keyword adopts a language structure of "nominal phrase + verb phrase + noun" by performing text processing on the keyword.
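To make the notion of a language structure concrete, the toy sketch below tags each token of a keyword and joins the tags in order. Everything in it is an assumption for illustration: the lexicon `TOY_LEXICON`, whitespace tokenization (real Chinese text would need a proper word segmenter and part-of-speech tagger), and the choice to collapse adjacent identical tags.

```python
# Hypothetical hand-written lexicon; a real system would use a trained tagger.
TOY_LEXICON = {
    "quickly": "adverb",
    "buy": "verb",
    "latest": "adjective",
    "cheap": "adjective",
    "phone": "noun",
    "quotes": "noun",
}


def language_structure(keyword, lexicon=TOY_LEXICON):
    """Return a structure string such as 'adverb + verb + noun' for a keyword."""
    tags = [lexicon.get(token, "unknown") for token in keyword.lower().split()]
    # Collapse runs of the same tag so that "phone quotes" counts as one noun unit.
    collapsed = [t for i, t in enumerate(tags) if i == 0 or t != tags[i - 1]]
    return " + ".join(collapsed)


def share_structure(keywords, lexicon=TOY_LEXICON):
    """Check whether all keywords have the same language structure."""
    return len({language_structure(kw, lexicon) for kw in keywords}) <= 1
```

A check like `share_structure` corresponds to the precondition of this embodiment that the at least one source keyword has the same language structure.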
Then, the sub-obtaining device obtains, according to the at least one source keyword, a central word that is contained in the at least one source keyword and has the same language structure as the at least one source keyword.
The manner in which the sub-obtaining device acquires this central word is the same as or similar to the manner, described above, in which the information generating apparatus acquires the central word corresponding to the at least one source keyword, and the details are not repeated here.
According to still another preferred embodiment of the present invention, the plurality of source keywords respectively belong to a plurality of keyword groups, wherein each keyword group contains at least one source keyword. The information generating apparatus according to the present embodiment includes third acquiring means (not shown), second selecting means (not shown), and third determining means (not shown).
The third acquiring means acquires similarity information between the respective keyword groups.
Specifically, the third acquiring means compares the keywords in two keyword groups to obtain the similarity between those two keyword groups.
Then, the second selection means selects one or more keyword groups similar to the group to be processed, based on the similarity information between the respective keyword groups.
Specifically, the second selection means selects one or more keyword groups similar to the group to be processed, according to the similarity information between the group to be processed and each of the other keyword groups obtained by the third acquisition means.
Preferably, the second selecting means determines whether the similarity information between the group to be processed and each of the other keyword groups is greater than a predetermined threshold, and determines that the group to be processed is similar to the other keyword group when the similarity information is greater than the predetermined threshold.
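The text does not fix a particular similarity measure, so the sketch below substitutes a simple one: Jaccard similarity over the words that occur in each group. The function names `group_similarity` and `similar_groups`, the whitespace tokenization, and the default threshold of 0.5 are all assumptions made for illustration.

```python
def group_similarity(group_a, group_b):
    """Jaccard similarity between the sets of words used in two keyword groups."""
    tokens_a = {tok for kw in group_a for tok in kw.lower().split()}
    tokens_b = {tok for kw in group_b for tok in kw.lower().split()}
    union = tokens_a | tokens_b
    return len(tokens_a & tokens_b) / len(union) if union else 0.0


def similar_groups(pending_group, other_groups, threshold=0.5):
    """Return the names of the groups whose similarity to the pending group exceeds the threshold.

    other_groups maps a group name to its list of keywords.
    """
    return [
        name
        for name, keywords in other_groups.items()
        if group_similarity(pending_group, keywords) > threshold
    ]
```

The comparison is strict (`>`), matching the wording "greater than a predetermined threshold".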
Then, when the selected keyword group has corresponding demonstration information, the third determining means takes the demonstration information corresponding to the keyword group as the demonstration information of the currently processed keyword group.
It should be noted that, in the information generating apparatus of the present embodiment, the third acquiring means and the second selecting means operate before the first acquiring device 1, and the third determining means operates before the second determining device 3; there is no required order between the third determining means on the one hand and the first acquiring device 1 and the first determining device 2 on the other.
For example, the third acquiring means and the second selecting means operate first, to obtain a plurality of other keyword groups similar to the group to be processed. Then the first acquiring device 1 and the first determining device 2 operate to obtain the demonstration information of the group to be processed, while the third determining means simultaneously operates to obtain the demonstration information of each of the other keyword groups similar to the group to be processed; all the obtained demonstration information is presented to the user. The second determining device 3 and the first generating device 4 then continue to operate, to obtain the information to be published that is generated in batch based on the keywords in the group to be processed.
Preferably, the information generating apparatus according to the present embodiment further includes grouping means (not shown) and third selecting means (not shown) that perform operations before the third acquiring means.
The grouping means groups the plurality of source keywords according to a predetermined grouping rule to obtain at least one keyword group containing at least one source keyword.
The predetermined grouping rule is used for grouping the plurality of source keywords according to their keyword-related information.
Preferably, the grouping means acquires the keyword-related information of each keyword and groups the keywords according to that information. The keyword-related information includes, but is not limited to, at least any one of the following:
1) The language structure of the keyword; for example, a keyword formed as "noun (or noun phrase) + verb (or verb phrase)", or a keyword formed as "noun 1 + adjective + noun 2";
2) Classification information of the products and/or services corresponding to the keyword. For example, the product corresponding to the keyword "iPhone" is "mobile phone", and its classification information is "electronic product".
For example, an initial grouping may be performed based on the classification information of the products and/or services corresponding to the keywords, and the results of the initial grouping may then be further grouped by language structure to obtain the keyword groups, so that all keywords in a given keyword group correspond to the same product and/or service classification information and have the same language structure.
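The two-stage grouping described above can be sketched as a single pass that keys each keyword by the pair (classification, language structure). This is an illustrative sketch only: `group_keywords` is a hypothetical name, and the two callables `classify` and `structure_of` stand in for whatever classification lookup and structure detection a real system would provide.

```python
from collections import defaultdict


def group_keywords(keywords, classify, structure_of):
    """Group keywords so that each group shares one classification and one language structure.

    classify and structure_of are caller-supplied functions mapping a keyword
    to its classification information and its language structure.
    """
    groups = defaultdict(list)
    for kw in keywords:
        groups[(classify(kw), structure_of(kw))].append(kw)
    return dict(groups)
```

Grouping on the pair in one pass gives the same partition as grouping by classification first and then splitting each result by language structure.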
More preferably, the predetermined grouping rule includes, but is not limited to, determining a corresponding keyword grouping based on the manner of grouping the plurality of keywords disclosed in the prior patent CN 201110216772.5.
Next, the third selection means selects one keyword group as a group to be processed.
Preferably, the third selecting means selects one keyword group as the group to be processed according to a selection operation by the user.
According to the solution of the present invention, high-quality demonstration keywords and demonstration to-be-published information adapted to the keywords can be provided, so that a user can obtain more effective structured information and can quickly generate the information to be published corresponding to the keywords, which improves the user's operating efficiency.
The software program of the present invention can be executed by a processor to implement the steps or functions described above. Likewise, the software program of the present invention (including associated data structures) can be stored in a computer-readable recording medium, such as RAM, a magnetic or optical drive, a diskette, and the like. Additionally, some of the steps or functions of the present invention may be implemented in hardware, for example, as circuitry that cooperates with the processor to perform the various steps or functions.
In addition, a portion of the present invention can be applied as a computer program product, such as computer program instructions which, when executed by a computer, can invoke or provide the method and/or technical solution according to the present invention through the operation of the computer. Program instructions that invoke the methods of the present invention may be stored on a fixed or removable recording medium, and/or transmitted via a data stream on a broadcast or other signal-bearing medium, and/or stored within a working memory of a computer device operating in accordance with the program instructions. An embodiment according to the invention comprises an apparatus comprising a memory for storing computer program instructions and a processor for executing the program instructions, wherein the computer program instructions, when executed by the processor, trigger the apparatus to perform the method and/or technical solution according to the embodiments of the invention described above.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned. Furthermore, it is obvious that the word "comprising" does not exclude other elements or steps, and the singular does not exclude the plural. A plurality of units or means recited in the system claims may also be implemented by one unit or means in software or hardware. The terms first, second, etc. are used to denote names, but not any particular order.

Claims (14)

1. A method for generating information to be published, wherein the method comprises the steps of:
detecting a language structure of at least one source keyword, wherein the at least one source keyword has the same language structure, and the language structure is used for indicating the property and the connection sequence of words or phrases contained in the keywords;
a. acquiring, according to the at least one source keyword, a central word contained in the at least one source keyword and having the same language structure as the at least one source keyword;
b. determining demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information comprises demonstration information to be published and demonstration keywords;
c. determining corresponding structured information based on the demonstration information to be published and the demonstration keywords, wherein the structured information comprises wildcards which can be replaced by other keywords;
d. respectively generating each piece of information to be published corresponding to the at least one source keyword based on the determined structured information.
2. The method of claim 1, wherein the step b comprises the steps of:
b1, acquiring demonstration to-be-issued information corresponding to the central word;
b2 selecting one source keyword from the at least one source keyword as an exemplary keyword.
3. The method of claim 2, wherein the step b1 includes the steps of:
-performing a matching search according to the central word in a plurality of information to be published corresponding to the at least one source keyword to obtain exemplary information to be published containing the central word.
4. The method according to claim 2 or 3, wherein said step b2 comprises the steps of:
-selecting the source keyword with the longest length and including the central word from the at least one source keyword as the exemplary keyword.
5. The method according to any one of claims 1 to 3, wherein the plurality of source keywords belong to a plurality of keyword groups, respectively, wherein each keyword group contains at least one source keyword, wherein the method further comprises the steps of:
-obtaining similarity information between respective keyword groups;
-selecting one or more keyword groups similar to the group to be processed, based on similarity information between the respective keyword groups;
-when the selected keyword group has corresponding demonstration information, taking the demonstration information corresponding to the keyword group as demonstration information of the currently processed keyword group.
6. The method according to claim 5, wherein the method comprises the steps of:
-grouping a plurality of source keywords according to a predetermined grouping rule to obtain at least one group of keywords comprising at least one source keyword;
-selecting a keyword group as the group to be processed.
7. The method according to any one of claims 1 to 3, wherein said step c comprises the steps of:
-generating structured information corresponding to the exemplary information based on the modified exemplary to-be-published information when the exemplary to-be-published information in the respective exemplary information is modified.
8. An information generating apparatus for generating information to be distributed, wherein the information generating apparatus comprises:
detecting means for detecting the language structure of at least one source keyword, wherein the at least one source keyword has the same language structure, and the language structure is used for indicating the property and the connection sequence of words or phrases contained in the keywords;
first obtaining means for obtaining, according to the at least one source keyword, a central word contained in the at least one source keyword and having the same language structure as the at least one source keyword;
first determining means for determining demonstration information corresponding to the central word and the at least one source keyword, wherein the demonstration information comprises demonstration information to be published and demonstration keywords;
second determining means for determining corresponding structured information based on the demonstration information to be published and the demonstration keywords, wherein the structured information comprises wildcards which can be replaced by other keywords;
first generating means for respectively generating each piece of information to be published corresponding to the at least one source keyword based on the determined structured information.
9. The information generating apparatus according to claim 8, wherein the first determining means includes:
second acquiring means for acquiring the demonstration information to be published corresponding to the central word;
first selecting means for selecting one source keyword from the at least one source keyword as an exemplary keyword.
10. The information generating apparatus according to claim 9, wherein the second acquiring means is configured to:
-performing a matching search according to the central word in a plurality of information to be published corresponding to the at least one source keyword to obtain exemplary information to be published containing the central word.
11. The information generating apparatus according to claim 9 or 10, wherein the first selecting means is configured to:
-selecting the source keyword with the longest length and including the central word from the at least one source keyword as the exemplary keyword.
12. The information generating apparatus according to any one of claims 8 to 10, wherein a plurality of source keywords respectively belong to a plurality of keyword groups, wherein each keyword group contains at least one source keyword, wherein the information generating apparatus further comprises:
third obtaining means for obtaining similarity information between the respective keyword groups;
second selecting means for selecting one or more keyword groups similar to the group to be processed according to the similarity information between the respective keyword groups;
third determining means for, when the selected keyword group has corresponding demonstration information, taking the demonstration information corresponding to the keyword group as the demonstration information of the currently processed keyword group.
13. The information generating apparatus according to claim 12, wherein the information generating apparatus further comprises:
Grouping means for grouping the plurality of source keywords according to a predetermined grouping rule to obtain at least one group of keyword groups containing at least one source keyword;
Third selection means for selecting a keyword group as a group to be processed.
14. The information generating apparatus according to any one of claims 8 to 10, wherein the second determining means includes:
second generating means for generating, when the demonstration information to be published in the respective demonstration information is modified, the structured information corresponding to the demonstration information based on the modified demonstration information to be published.
CN201310107953.3A 2013-03-29 2013-03-29 method and device for generating information to be issued Active CN104077320B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310107953.3A CN104077320B (en) 2013-03-29 2013-03-29 method and device for generating information to be issued

Publications (2)

Publication Number Publication Date
CN104077320A CN104077320A (en) 2014-10-01
CN104077320B true CN104077320B (en) 2019-12-17

Family

ID=51598579

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310107953.3A Active CN104077320B (en) 2013-03-29 2013-03-29 method and device for generating information to be issued

Country Status (1)

Country Link
CN (1) CN104077320B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105589954A (en) * 2015-12-21 2016-05-18 北京奇虎科技有限公司 Method and device for determining search suggestion based on central words
CN105677709A (en) * 2015-12-28 2016-06-15 北京搜狗科技发展有限公司 Information processing method and apparatus, and device for processing information
CN109510904B (en) * 2018-12-25 2020-10-27 携程旅游网络技术(上海)有限公司 Method and system for detecting call center outbound record
CN111580921B (en) * 2020-05-15 2021-10-22 北京字节跳动网络技术有限公司 Content creation method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101075320A (en) * 2006-05-16 2007-11-21 申凌 System and method for issuing and inquiring information
CN101283350A (en) * 2005-07-15 2008-10-08 思索软件有限公司 Method and apparatus for providing structured data for free text messages
CN102073725A (en) * 2011-01-11 2011-05-25 百度在线网络技术(北京)有限公司 Method for searching structured data and search engine system for implementing same
CN102214208A (en) * 2011-04-27 2011-10-12 百度在线网络技术(北京)有限公司 Method and equipment for generating structured information entity based on non-structured text
CN102298614A (en) * 2011-07-29 2011-12-28 百度在线网络技术(北京)有限公司 Method for determining collection category of page collection information and device and equipment
CN102937973A (en) * 2012-10-15 2013-02-20 北京百度网讯科技有限公司 Method and device for generating presentation configuration information used for information presentation

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR101042515B1 (en) * 2008-12-11 2011-06-17 주식회사 네오패드 Method for searching information based on user's intention and method for providing information
CN102999496A (en) * 2011-09-09 2013-03-27 北京百度网讯科技有限公司 Method for building requirement analysis formwork and method and device for searching requirement recognition

Also Published As

Publication number Publication date
CN104077320A (en) 2014-10-01

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant