CN109933791A - Material recommended method, device, computer equipment and computer readable storage medium - Google Patents

Material recommended method, device, computer equipment and computer readable storage medium Download PDF

Info

Publication number
CN109933791A
CN109933791A CN201910166860.5A CN201910166860A CN109933791A CN 109933791 A CN109933791 A CN 109933791A CN 201910166860 A CN201910166860 A CN 201910166860A CN 109933791 A CN109933791 A CN 109933791A
Authority
CN
China
Prior art keywords
writing
sentence
viewpoint
sample
type
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910166860.5A
Other languages
Chinese (zh)
Other versions
CN109933791B (en
Inventor
宋巍
付瑞吉
胡国平
秦兵
刘挺
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
iFlytek Co Ltd
Original Assignee
iFlytek Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by iFlytek Co Ltd filed Critical iFlytek Co Ltd
Priority to CN201910166860.5A priority Critical patent/CN109933791B/en
Publication of CN109933791A publication Critical patent/CN109933791A/en
Application granted granted Critical
Publication of CN109933791B publication Critical patent/CN109933791B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Machine Translation (AREA)

Abstract

The embodiment of the invention provides a kind of material recommended method, device, computer equipment and computer readable storage mediums, wherein this method comprises: obtaining theme to be checked;The theme to be checked is matched with the viewpoint deposited;According to the corresponding relationship of the viewpoint and writing material deposited, recommend the corresponding writing material of the viewpoint of successful match.The technical solution is conducive to avoid the problem that recall or screen available material from the indefinite resource of viewpoint in the prior art, or avoid needing author to read the process for screening identical of views material one by one in the prior art, be conducive to that user is time saving, efficiently obtains effective writing material.

Description

Material recommended method, device, computer equipment and computer readable storage medium
Technical field
The present invention relates to natural language processing technique field, in particular to a kind of material recommended method, device, computer are set Standby and computer readable storage medium.
Background technique
Currently, the material of argumentative writing writing is mainly passed through author's accumulation at ordinary times or is searched using keyword in search engine The mode of rope obtains.First way the problem is that: to obtain effective writing material, not only need long-term accumulation and And need therefrom to remember useful material, but the amount of reading of people is limited with memory, which has limited the optional of material Range, while the burden of memory or memory is increased, energy is especially very expended for beginner;Existing for the second way Problem is: many true resources only carry out objective statement to story on internet, have plenty of subjective sight in author's brains Point, and it is frequently present of semantic gap between subjective viewpoint and objective statement, for example, a story can illustrate some reason Or viewpoint, but in the statement of story there is no in the direct description of reason or viewpoint or the statement of story include key Word, but the reason that illustrates of the story or viewpoint may not be inconsistent with the viewpoint of author, just author be needed to read the fact one by one at this time Resource, and the potential reason of each true resource, viewpoint, meaning are rule of thumb inferred to common sense, then judge the fact resource Whether potential reason, viewpoint, meaning are consistent with the viewpoint of author's subjectivity, and then just can determine which true resource for author Subjective viewpoint is effective, available material, so that author expends energy, reduces the efficiency for obtaining material.
Summary of the invention
The embodiment of the invention provides a kind of material recommended methods, to exist when solving and obtaining writing material in the prior art Optional material it is limited, expend the technical issues of energy, low efficiency.This method comprises:
Obtain theme to be checked;
The theme to be checked is matched with the viewpoint deposited;
According to the corresponding relationship of the viewpoint and writing material deposited, recommend the corresponding writing material of the viewpoint of successful match.
The embodiment of the invention also provides a kind of material recommendation apparatus, to deposit when solving and obtaining writing material in the prior art Optional material it is limited, expend the technical issues of energy, low efficiency.The device includes:
Theme obtains module, for obtaining theme to be checked;
Matching module, for matching the theme to be checked with the viewpoint deposited;
Recommending module recommends the viewpoint pair of successful match for the corresponding relationship according to the viewpoint and writing material deposited The writing material answered.
The embodiment of the invention also provides a kind of computer equipments, including memory, processor and storage are on a memory And the computer program that can be run on a processor, the processor realize above-mentioned arbitrary element when executing the computer program Material recommended method, existing optional material is limited when solving to obtain writing material in the prior art, expend energy, low efficiency Technical problem.
The embodiment of the invention also provides a kind of computer readable storage medium, the computer-readable recording medium storage There is the computer program for executing above-mentioned arbitrary material recommended method, it is existing when solving to obtain writing material in the prior art Optional material is limited, expends the technical issues of energy, low efficiency.
In embodiments of the present invention, by matching theme to be checked with the viewpoint deposited, and then according to having deposited Viewpoint and writing material corresponding relationship, recommend the corresponding writing material of the viewpoint of successful match, realize can be directed to it is to be checked Subject recommending writing material corresponding with the matched viewpoint of theme to be checked is ask, so that the writing material and theme to be checked recommended It is corresponding.Be conducive to avoid the problem that recall or screen available material from the indefinite resource of viewpoint in the prior art, or It avoids needing author to read the process for screening identical of views material one by one in the prior art, it is time saving, efficiently to be conducive to user Obtain effective writing material.
Detailed description of the invention
The drawings described herein are used to provide a further understanding of the present invention, constitutes part of this application, not Constitute limitation of the invention.In the accompanying drawings:
Fig. 1 is a kind of flow chart of material recommended method provided in an embodiment of the present invention;
Fig. 2 is a kind of functional block diagram of material recommended method provided in an embodiment of the present invention;
Fig. 3 is a kind of schematic diagram of coding and decoding model provided in an embodiment of the present invention;
Fig. 4 is a kind of structural block diagram of computer equipment provided in an embodiment of the present invention;
Fig. 5 is a kind of structural block diagram of material recommendation apparatus provided in an embodiment of the present invention.
Specific embodiment
To make the objectives, technical solutions, and advantages of the present invention clearer, right below with reference to embodiment and attached drawing The present invention is described in further details.Here, exemplary embodiment and its explanation of the invention is used to explain the present invention, but simultaneously It is not as a limitation of the invention.
In embodiments of the present invention, a kind of material recommended method is provided, as shown in Figure 1, this method comprises:
Step 102: obtaining theme to be checked;
Step 104: the theme to be checked is matched with the viewpoint deposited;
Step 106: according to the corresponding relationship of the viewpoint and writing material deposited, recommending the viewpoint of successful match is corresponding to write Make material.
Process as shown in Figure 1 is it is found that in embodiments of the present invention, by carrying out theme to be checked with the viewpoint deposited Matching, and then according to the corresponding relationship of the viewpoint and writing material deposited, recommend the corresponding writing material of the viewpoint of successful match, Realizing can be for subject recommending to be checked writing material corresponding with the matched viewpoint of theme to be checked, so that recommends writes It is corresponding with theme to be checked to make material.Be conducive to avoid to recall from the indefinite resource of viewpoint in the prior art or screening can The problem of with material, or avoid that author is needed to read the process for screening identical of views material one by one in the prior art, favorably In time saving, efficiently material is effectively write in acquisition.
When it is implemented, above-mentioned material recommended method can read magnanimity material by machinery equipment, and then acquire writing Material simultaneously determines the corresponding viewpoint of writing material, is equivalent to the amount of reading and memory capability for expanding people, writer only needs to input Theme to be checked, it can directly acquire a large amount of writing materials corresponding with theme to be checked, available, allow to efficiently, Accurately recommend effective writing material for writer.Further, it is also possible to targetedly improve the amount of reading of reader and know Storage level is known, for example, reader can obtain largely writing element corresponding with theme to be checked by inputting theme to be checked Material, allow reader for a certain theme expand amount of reading, and by viewpoint with writing material be mapped memory, lay in.
When it is implemented, can recommend and the matched sight pair of theme to be checked for theme to be checked to user to realize The writing material answered, so that the writing material recommended is corresponding with theme to be checked, in the present embodiment, as shown in Fig. 2, pushing away Need to establish the corresponding relationship of writing material and viewpoint (viewpoint can be theme, meaning or position) before recommending, for example, can be with Realize that writing material is corresponding with viewpoint by following steps:
Acquisition writing material;
Using the writing material of acquisition as the input of machine learning component, machine learning component output writing material is corresponding Viewpoint.
When it is implemented, the process of above-mentioned acquisition writing material can use search engine, field website carries out information pumping It takes, to obtain a large amount of writing material, writing material may include true type writing material and/or theoretical type writing material, thing Full mold writing material may include the narrative contents such as story, the description of account property, and theoretical type writing material may include famous person The contents such as well-known saying, ancient poetry, poem.
Specifically, for example, acquisition for true type writing material, it can be used that " celebrity story ", " famous person writes element The keywords such as material " are scanned in search engine, obtain search result web page;It extracts the network address of webpage and is polymerize, obtained The domain list of websites of story V-neck V;Page analysis is carried out to field website and text extracts, the content of story is obtained and is write for true type Make material;The story of all acquisitions is deposited to the acquisition that true type writing material is realized to database.
For the acquisition of theoretical type writing material, keywords such as " famous sayings of famous figures, ancient poetry, poems " can be used and draw in search It holds up and scans for, obtain search result web page;It extracts the network address of webpage and is polymerize, obtain famous sayings of famous figures, ancient poetry, poem neck Domain list of websites;Page analysis is carried out to field website and text extracts, obtaining famous sayings of famous figures text, ancient poetry, poem is theory Type writes material;The theoretical type writing material of all acquisitions is deposited to database, the i.e. acquisition of realization theory type writing material.
It, can also be with when it is implemented, true type writing material and theoretical type writing material be there may be in a database It is respectively stored in two databases, i.e., establishes true type material database and theoretical type material database respectively.
When it is implemented, after acquisition writing material, it is thus necessary to determine that the corresponding viewpoint of writing material, in order to can be efficiently Determine the unseen new corresponding viewpoint of writing material in the past, it is true present applicant proposes being come by way of machine learning component Surely the corresponding viewpoint of writing material, i.e., using the writing material of acquisition as the input of machine learning component, machine learning component is The corresponding viewpoint of exportable writing material.Specifically, the machine learning component can be arbitrary can learning training nerve net Network, machine learning component can have different titles according to particular condition in use, for example, in this application, machine learning component For determining the corresponding viewpoint of writing material, i.e. machine learning component is properly termed as viewpoint annotator;It can also be made according to difference It is named as model, module, learning tool, learning framework etc. with situation, i.e. machine learning component is substantially that can instruct by study Experienced component or structure, the application are not specifically limited the title of machine learning component.
When it is implemented, the machine learning component of the viewpoint of writing material can be determined in order to obtain, in the present embodiment, As shown in Fig. 2, machine learning component can be trained based on following steps:
Sample writing material and corresponding sample viewpoint are obtained, the sample writing material that will acquire and corresponding sample viewpoint As sample data, inputs machine learning component and be trained.
When it is implemented, existing sample writing element can be directly acquired in order to improve the accuracy of machine learning component Material and corresponding sample viewpoint as sample data, can also through this embodiment in following steps obtain sample writing element Material and corresponding sample viewpoint are as sample data, for example, the process for obtaining sample writing material is as follows:
A large amount of data resource is obtained, according to the feature of sentence in data resource, sample writing is extracted from data resource Material and corresponding sample viewpoint.Specifically, the data resource can be arbitrary in the verbal descriptions such as article, record, comment Hold, for example, the data resource may include words type, narrate type etc. different types of data resource.
When it is implemented, can be used existing for the other kinds of data resource except the data resource of words type Method extract sample writing material and corresponding sample viewpoint, for example, manually extract sample writing material and corresponding sample This viewpoint, the application are not specifically limited.For the data resource of words type, the present embodiment proposes a kind of extraction sample writing The method of material, for example, can sentence in the data resource by way of analyzing argumentative writing debate structure according to argumentative writing type Feature, from the data resource of argumentative writing type extract sample write material:
Classify according to argumentative writing debate structure to the sentence in the data resource of argumentative writing type;For example, being divided into introduction The sentence types such as sentence, topic sentence, sub- argument sentence, argument sentence and concluding sentence specifically can carry out sentence using existing classification method Subclassification, the application are not specifically limited.The sentence of each type has different expressions respectively, for example, introduction sentence, is introduced related Material introduces theme;Topic sentence provides article central idea;Never Tongfang is described sub- argument sentence in face of theme;Argument sentence Support purport and argument;Concluding sentence summarizes full text.
Story is searched in the data resource of argumentative writing type and triggers sentence, by the sentence of story triggering sentence and adjacent specified type Merge into true this writing of pattern material;
Sentence in the data resource of argumentative writing type is matched with known theoretical type sentence, by the sentence of successful match Son merges into theoretical this writing of pattern material.
Specifically, it is narrative that story triggering sentence can be characterization during extracting true pattern this writing material Sentence, for example, it may be the sentence of element is described including time, place, personage generation title, name etc., due in an argumentative writing Middle argument is for supporting paper viewpoint, in turn it is considered that the meaning of story and the argument of argumentative writing and its phase in argument It closes, therefore, participle and part-of-speech tagging can be carried out to argument sentence, trigger sentence to search story in argument sentence, find story touching After sending out sentence, story triggering sentence is merged to obtain narrative content with the sentence of adjacent specified type, as true this writing of pattern element Material.Complete in order to obtain, effective narrative content, story can be triggered sentence and adjacent specified type, comprising narration element Sentence merge, for example, story can be triggered sentence and adjacent continuous argument sentence merge into true this writing of pattern material.
Specifically, during extracting theoretical pattern this writing material, by the sentence in the data resource of argumentative writing type It is matched with known theoretical type sentence (for example, well-known saying, ancient poetry etc.), it is theoretical in the data resource to find argumentative writing type Type sentence, in order to it is more acurrate, effectively obtain theoretical pattern this writing material, can be by argument sentence in the data resource of argumentative writing type It is matched with known theoretical type sentence, the sentence of successful match is merged into theoretical this writing of pattern material.
When it is implemented, extracting sample writing material (theoretical this writing of pattern material and this writing of true pattern here Material is referred to as sample writing material) after, sample writing element can also be obtained by way of analyzing argumentative writing debate structure The sample viewpoint of material, to obtain the corresponding data of sample writing material and sample viewpoint as sample.For example, in the present embodiment In, the sentence write according to argumentative writing debate structure to sample in the data resource of the argumentative writing type where material carries out classifying it Afterwards, the classification results obtained include topic sentence and sub- argument sentence, and classification results can also include introduction sentence, argument sentence and concluding sentence Etc. types sentence;The amalgamation result of one of any or any combination item in item following in the data resource of argumentative writing type is made The corresponding sample viewpoint of material: topic, topic sentence and the sub- argument sentence nearest with sample writing material distance is write for sample, Such a sample writing material can correspond to one or more sample viewpoints, and sample writing material is seen with each sample respectively Point corresponds to one<sample writing material, and sample viewpoint>to storing, without merging to multiple sample viewpoints, i.e., correspondence is more The sample writing material of a sample viewpoint may generate multiple<sample writing material, sample viewpoint>right, because of different people pair The understanding of same material may be different.
Specifically, sub- argument sentence passes through the section sequence of section where sub- argument sentence and sub- argument at a distance from sample writing material Sentence sentence sort to indicate, for example, sample writing material where section sub- argument sentence than other sections sub- argument sentence distance more Closely;If section where sample writing material does not have sub- argument sentence, there is sub- argument sentence in other multistages, then the sub- argument sentence ratio in leading portion Sub- argument sentence in back segment is apart from closer;If there is multiple sub- argument sentences in other sections, son of the sub- argument sentence in front than back Argument sentence is apart from closer.
For example, extracting true this writing of pattern material by taking true this writing of pattern material as an example and determining sample viewpoint Process is as follows:
Step 1: the sentence in data resource by way of analyzing argumentative writing debate structure to argumentative writing type divides The analysis result of class is as follows:
[topic sentence]: the life of people is limited, we should make great efforts to create oneself significant life.
[sub- argument sentence]: the meaning of life is to change the world.
[discussion]: human development is the process of a continuous reforming world.
[argument sentence 1]: Qiao Busi changes electronic world, is dedicated to digital revolution throughout one's life.
[argument sentence 2]: his contribution revolutionizes people's lives mode.
Position name from argument sentence first: " Qiao Busi ", i.e. argument sentence 1 are that story triggers sentence;To the generation in argument sentence Word carries out reference resolution, and therefore " he " also refers to Qiao Busi.Argument sentence 1 is closed with the argument sentence 2 for continuously including same personage It and is true pattern this writing material.
Step 2: searching out the nearest sub- argument sentence of distance according to the relationship with true this writing of pattern material distance. Correspond to " meaning of life is to change the world " in this example.
If the sub- argument sentence nearest with true this writing of pattern material distance is considered as fact pattern this writing material Sample viewpoint, the sample writing material indicated in available example 1 as shown in table 1 below.When it is implemented, due to topic sentence It is to summarize full text, topic sentence can also be merged to the viewpoint collectively as sample writing material with apart from nearest sub- argument sentence (or meaning).
By repeating step 1 and step 2, extensive argumentative writing is handled, fairly large < sample can be obtained and write Make material, sample viewpoint > right, the example of as shown in table 1 below 2 true this writing of pattern materials and sample viewpoint.
Table 1
When it is implemented, after obtaining sample writing material and corresponding sample viewpoint as sample data, it can be by sample This writing material and corresponding sample viewpoint input machine learning component are trained, which can be arbitrary Can learning training neural network, for example, in the present embodiment, machine learning component is with the generation model of a sequence to sequence For, the specific can be that classical coding and decoding model as shown in Figure 3, the coding layer of coding and decoding model is using circulation mind Through network, LSTM is may be selected in the basic unit (box shown in Fig. 3 is a basic unit) of Recognition with Recurrent Neural Network (Long Short-Term Memory, shot and long term memory network), two-way LSTM or stacking LSTM (Stacking LSTM) etc. become Kind.
The process of specific training coding and decoding model can be, and write material and corresponding sample viewpoint as sample using sample Notebook data writes material as input, using corresponding sample viewpoint as output, for example, firstly, writing element to sample using sample Each sentence in material is segmented, and by the word vectors after participle, obtains the term vector of each sentence, and by each sentence Input of the term vector as coding layer, the output of coding layer is as the semantic expressiveness for inputting sentence.By the semantic expressiveness of sentence Decoding layer is inputted, decoding layer then generates object statement by word according to the semantic expressiveness of input sentence, which is to input Sample writes the corresponding sample viewpoint of material.
When it is implemented, can be jointly using true this writing of pattern material and corresponding sample viewpoint, theoretical pattern sheet It writes material and corresponding sample viewpoint is one machine learning component of sample training, i.e., write material with different types of sample It comes together to train a machine learning component.
When it is implemented, in order to enable the viewpoint of machine learning component mark is more acurrate, in the present embodiment, for difference Machine learning component can be respectively trained in the sample writing material of type, for example, with true this writing of pattern material and corresponding Sample viewpoint is sample data, and input machine learning component is trained, obtains true type machine learning component, be referred to as True type viewpoint annotator, it is subsequent that true type viewpoint annotator can be used to true type writing material progress viewpoint mark;With Theoretical this writing of pattern material and corresponding sample viewpoint are sample data, and input machine learning component is trained, is managed By type machine learning component, it is referred to as theoretical type viewpoint annotator, it is subsequent that theoretical type viewpoint annotator can be used to reason Viewpoint mark is carried out by type writing material.
When it is implemented, in the present embodiment, sample is write material for the generalization ability of reinforcement machine learning component In proper noun replace with the character of no concrete meaning, the sample writing material and corresponding sample of proper noun is substituted Viewpoint is sample data, and input machine learning component is trained.
Specifically, the proper noun in above-mentioned sample writing material refers to specific or unique people or object, for example, It can be place name, name, building name, country name, unit name, organization name, time etc..It is with true this writing of pattern material Sample can be write specific name in material and be numbered according to appearance sequence, and replace with the word of no concrete meaning by example Symbol, such as PER1, PER2 ....
Such as: Qiao Busi changes electronic world, is that PER1 changes electronic world after replacement name.
PER1 or PER2 ... is considered as a new word and vocabulary is added, also can be by training machine learning object Distributing a vector indicates.
The characteristics of training will make machine learning component more pay attention to event in this way is ignored or weakens proprietary by name etc. Noun bring is different.Such as: it encounters new story and teaches " search that Li Yanhong changes China ", then being based on " PER1 Change " this mode, the characteristics of machine learning component is based only on event marks viewpoint, it is possible to will the story of " Li Yanhong " and The story of " Qiao Busi " connects, and the story of " Li Yanhong " is recommended with the story of " Qiao Busi " as similar writing material To user.
Specific implementation, as shown in Fig. 2, machine learning component training after the completion of, so that it may by machine learning component come Viewpoint is marked to the writing material in database, establishes the corresponding relationship of writing material and viewpoint, for example, can be first by writing element The proper nouns such as the name for including in material replace with PER1, PER2 ... wait characters, then the writing element of proper noun will be substituted Material inputs machine learning component, and the output of machine learning component is the viewpoint of the writing material.At this point, the writing in database Material is no longer simple narration, but establish writing material and viewpoint corresponding relationship, specifically, writing material with it is corresponding Viewpoint can by<writing material, viewpoint>pair mode store.
When it is implemented, the viewpoint vectorization of obtained writing material, the semantic vector for obtaining viewpoint can also be indicated, In database each record can be including<writing material word sequence, viewpoint word sequence, viewpoint semantic vector indicate>etc. Content.Specifically, the process of vectorization can using existing vectorization method realize, as word2vec method, Method or the method for deep neural network of paragraph2vec etc., the application is not specifically limited.
When it is implemented, can be by the corresponding relationship storage of viewpoint and writing material in the database, viewpoint and writing element The corresponding relationship of material can be stored by different modes, for example, with<viewpoint, writing material>pair mode by viewpoint and The specific data of writing material carry out corresponding storage, alternatively, being recorded in the form of table etc. corresponding between viewpoint and writing material The specific data of relationship or incidence relation, viewpoint and writing material can not correspond to storage, etc., as long as can be shown that sight Corresponding relationship between point and writing material, the application do not do viewpoint and the form of expression of the corresponding relationship of writing material It is specific to limit.
When it is implemented, recommending writing material that can also adopt in the present embodiment for the ease of inquiry, quickly to user Viewpoint is indicated with the mode of inverted index and writes the corresponding relationship of material, includes index terms in each record of inverted index And writing material corresponding with index terms, index terms include viewpoint.Specifically, including index terms and element in the list of inverted index Material list two parts, inverted index each record in may include all writing materials corresponding with index terms, when multiple When the corresponding viewpoint of writing material is identical, then the same index terms corresponds to multiple writing materials in a record, for example, writing Make material 1 and write the viewpoint of material 8 to be all " struggle ", as shown in table 2 below, index terms " struggle " corresponds to writing 1 He of material Material 8 is write, similarly, index terms " gratitude " corresponds to writing material 2 and writing material 6.
Specifically, index terms other than including viewpoint, can also include the keyword in the corresponding writing material of viewpoint, It is more with the keyword of theme successful match to be checked when viewpoint and theme successful match to be checked, then it represents that writing element The matching degree of material and theme to be checked is higher.
When it is implemented, when being indexed using inverted index to writing material and corresponding viewpoint, it can also be to writing Material and corresponding viewpoint are segmented, remove stop words, and therefore, each writing material record becomes the sequence of a word, It is equivalent to a document.All writing material records are equivalent to a collection of document.Any inverted index building work can be used Tool (such as Lucene) establishes word to the corresponding relationship of document (writing material) list, as shown in table 2 below.
Table 2
Index terms Material list
…… ……
Struggle Material 1 is write, material 8 ... is write
Feel grateful Material 2 is write, material 6 ... is write
…… ……
When it is implemented, as shown in Fig. 2, line can be carried out after establishing the corresponding relationship of writing material and viewpoint Upper material is recommended.For example, the theme to be checked (or viewpoint character string) of user's input is received, it will be in theme to be checked and database Viewpoint matched (for example, can to user input theme to be checked segment, remove stop words, use remaining word Matched with the viewpoint (or index terms) in database), the corresponding writing material of the viewpoint of successful match is candidate writing Writing material in candidate writing material is recommended user by material.
When it is implemented, recommending the validity of material to recommend successful match in the present embodiment to further increase The corresponding writing material of viewpoint, comprising:
The similarity between theme to be checked and the viewpoint of successful match is calculated, in the corresponding writing of the viewpoint of successful match In material, writing material is recommended according to similarity.
Specifically, vectorization first can be carried out to theme to be checked, the semantic vector for obtaining theme to be checked is indicated, then counts The similarity between the semantic vector expression of theme to be checked and the semantic vector expression of the viewpoint of successful match is calculated, for example, can To calculate similarity using the methods of cosine similarity or Euclidean distance, the application is not specifically limited.Calculate similarity it Afterwards, which can be used as the score of candidate writing material (i.e. the corresponding writing material of the viewpoint of successful match), Jin Ergen Candidate's writing material is ranked up according to the sequence of score from high to low, default writing material for sorting forward can be recommended To user, the writing material after sequence can also all be recommended into user according to sequence.
When it is implemented, may be used also in the present embodiment to meet user to the inquiry needs of different type writing material With the specified type of determination writing material to be recommended;Then in the corresponding writing material of the viewpoint of successful match, recommend to meet The writing material of specified type.
Specifically, the information for the specified type that can be inputted according to user determines the specified type of writing material to be recommended, The writing material for recommending which kind of specified type to the user is determined, for example, the type information of user A input is true type, then In the corresponding writing material of viewpoint of successful match, true type is recommended to write material to user A;It can also be recommended by material Itself setting information of method recommends specified type to designated user to determine the specified type to be recommended for writing material Material is write, for example, the equipment of operation material recommended method or plateform system can input setting information, setting information is for fixed Justice recommends the writing material of specified type to some or certain type of user, and then is assured that be recommended write based on setting information Make the specified type of material, recommend true type writing material for example, realizing to user A, recommends theoretical type writing element to user B Material.
When it is implemented, in the present embodiment, obtaining master to be checked to provide different form, flexible inquiry mode Topic, comprising:
When input content is article (for example, article of argumentative writing type or narrative type), according to argumentative writing debate structure Classify to the sentence in article, for example, classification results include topic sentence and sub- argument sentence, classification results can also include drawing It, will be one of any or any in the theme of article, topic sentence, sub- argument sentence by sentence types such as sentence, argument sentence and concluding sentences Combination is used as theme to be checked.
For example, user can input entire chapter composition after composition writing is completed, operation material recommended method System automatically classifies to the sentence in composition using chapter debate structure analyzer, analyzes theme, the topic sentence of article With sub- argument sentence etc., one of any or any combination in theme, topic sentence and sub- argument sentence that analysis is obtained as to Theme is inquired, it is subsequent that writing material is recommended to user based on theme to be checked.
In the present embodiment, additionally provide a kind of calculation machine equipment, as shown in figure 4, include memory 402, processor 404 and The computer program that can be run on a memory and on a processor is stored, the processor executes real when the computer program Existing above-mentioned arbitrary material recommended method.
Specifically, the computer equipment can be terminal, server or similar arithmetic unit.
In the present embodiment, a kind of computer readable storage medium, the computer-readable recording medium storage are provided There is the computer program for executing above-mentioned arbitrary material recommended method.
Specifically, computer readable storage medium includes permanently and non-permanent, removable and non-removable media can To realize that information is stored by any method or technique.Information can be computer readable instructions, data structure, the module of program Or other data.The example of computer readable storage medium includes, but are not limited to phase change memory (PRAM), static random-access is deposited Reservoir (SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory techniques, CD-ROM are read-only Memory (CD-ROM), digital versatile disc (DVD) or other optical storage, magnetic cassettes, tape magnetic disk storage or Other magnetic storage devices or any other non-transmission medium, can be used for storage can be accessed by a computing device information.According to Herein defines, and computer readable storage medium does not include temporary computer readable media (transitory media), such as adjusts The data-signal and carrier wave of system.
Material recommended method provided in an embodiment of the present invention realizes following technical effect: will by the material recommended method Theme to be checked is matched with the viewpoint deposited, and then according to the corresponding relationship of the viewpoint and writing material deposited, to user The corresponding writing material of viewpoint for recommending successful match, realizing can match for subject recommending to be checked with theme to be checked The corresponding writing material of viewpoint so that recommend writing material it is corresponding with theme to be checked.The material recommended method is advantageous In avoiding the problem that writer recalls from the indefinite resource of viewpoint or screens available material in the prior art, or avoid existing Have and author needed to read the process for screening identical of views material one by one in technology, be conducive to it is time saving, efficiently obtain it is effective Write material.
Based on the same inventive concept, a kind of material recommendation apparatus is additionally provided in the embodiment of the present invention, such as following implementation Described in example.Since the principle that material recommendation apparatus solves the problems, such as is similar to material recommended method, the reality of material recommendation apparatus The implementation that may refer to material recommended method is applied, overlaps will not be repeated.It is used below, term " unit " or " mould The combination of the software and/or hardware of predetermined function may be implemented in block ".Although device described in following embodiment is preferably with soft Part is realized, but the realization of the combination of hardware or software and hardware is also that may and be contemplated.
Fig. 5 is a kind of structural block diagram of the material recommendation apparatus of the embodiment of the present invention, as shown in figure 5, the device includes:
Theme obtains module 502, for obtaining theme to be checked;
Matching module 504, for matching the theme to be checked with the viewpoint deposited;
Recommending module 506 recommends the viewpoint of successful match for the corresponding relationship according to the viewpoint and writing material deposited Corresponding writing material.
In one embodiment, above-mentioned apparatus further include:
Material collection module, for acquiring writing material;
Viewpoint labeling module, input of the writing material as machine learning component for that will acquire, machine learning component The corresponding viewpoint of output writing material.
In one embodiment, above-mentioned apparatus further include:
Training module, for being trained based on following steps to machine learning component:
Sample writing material and corresponding sample viewpoint are obtained, the sample writing material that will acquire and corresponding sample viewpoint As sample data, inputs machine learning component and be trained.
In one embodiment, the training module, comprising:
Data capture unit, for obtaining data resource;
Sample acquisition unit extracts sample writing element for the feature according to sentence in data resource from data resource Material and corresponding sample viewpoint.
In one embodiment, the sample acquisition unit, specifically for being directed to the data resource of argumentative writing type, according to view Paper debate structure classifies to the sentence in the data resource of argumentative writing type;Event is searched in the data resource of argumentative writing type Thing triggers sentence, and the sentence of story triggering sentence and adjacent specified type is merged into true this writing of pattern material;By argumentative writing type Data resource in sentence matched with known theoretical type sentence, the sentence of successful match is merged into theoretical pattern sheet Write material.
In one embodiment, the sample acquisition unit, also particularly useful for for the words where sample writing material The data resource of literary type carries out classification classification to the sentence in the data resource of argumentative writing type according to argumentative writing debate structure, point Class result includes topic sentence and sub- argument sentence;By one of any or any combination in item following in the data resource of argumentative writing type Amalgamation result as the corresponding sample viewpoint of sample writing material: topic, topic sentence and with sample writing material distance Nearest sub- argument sentence.
In one embodiment, the training module, further includes:
Training unit, for inputting machine using true this writing of pattern material and corresponding sample viewpoint as sample data Learning object is trained, and obtains true type machine learning component;
Using theoretical this writing of pattern material and corresponding sample viewpoint as sample data, input machine learning component is instructed Practice, obtains theoretical type machine learning component.
In one embodiment, training unit, the proper noun for being also used to write sample in material are replaced with without specific The character of meaning, the sample proper noun is substituted write material and corresponding sample viewpoint as sample data, input machine Learning object is trained.
In one embodiment, theme obtains module, is also used to when input content is article, according to argumentative writing debate knot Structure classifies to the sentence in article, and classification results include topic sentence and sub- argument sentence, by the theme of article, topic sentence and One of any or any combination in sub- argument sentence is as theme to be checked.
In one embodiment, the recommending module, for calculating between theme to be checked and the viewpoint of successful match Similarity recommends writing material according to similarity in the corresponding writing material of viewpoint of successful match.
In one embodiment, further includes:
Memory module, for indicating viewpoint by the way of inverted index and writing the corresponding relationship of material, inverted index Each record in include index terms and writing material corresponding with index terms, index terms include viewpoint.
In one embodiment, the recommending module, for determining the specified type of writing material to be recommended;Matching at In the corresponding writing material of the viewpoint of function, recommend the writing material for meeting specified type.
Material recommendation apparatus provided in an embodiment of the present invention realizes following technical effect: will by the material recommendation apparatus Theme to be checked is matched with the viewpoint deposited, and then according to the corresponding relationship of the viewpoint and writing material deposited, to user The corresponding writing material of viewpoint for recommending successful match, realizing can match for subject recommending to be checked with theme to be checked The corresponding writing material of viewpoint so that recommend writing material it is corresponding with theme to be checked.The material recommendation apparatus is advantageous In avoiding the problem that writer recalls from the indefinite resource of viewpoint or screens available material in the prior art, or avoid existing Have and author needed to read the process for screening identical of views material one by one in technology, be conducive to it is time saving, efficiently obtain it is effective Write material.
Obviously, those skilled in the art should be understood that each module of the above-mentioned embodiment of the present invention or each step can be with It is realized with general computing device, they can be concentrated on a single computing device, or be distributed in multiple computing devices On composed network, optionally, they can be realized with the program code that computing device can perform, it is thus possible to by it Store and be performed by computing device in the storage device, and in some cases, can be held with the sequence for being different from herein The shown or described step of row, perhaps they are fabricated to each integrated circuit modules or will be multiple in them Module or step are fabricated to single integrated circuit module to realize.In this way, the embodiment of the present invention be not limited to it is any specific hard Part and software combine.
The foregoing is only a preferred embodiment of the present invention, is not intended to restrict the invention, for the skill of this field For art personnel, the embodiment of the present invention can have various modifications and variations.All within the spirits and principles of the present invention, made Any modification, equivalent substitution, improvement and etc. should all be included in the protection scope of the present invention.

Claims (17)

1. a kind of material recommended method characterized by comprising
Obtain theme to be checked;
The theme to be checked is matched with the viewpoint deposited;
According to the corresponding relationship of the viewpoint and writing material deposited, recommend the corresponding writing material of the viewpoint of successful match.
2. material recommended method as described in claim 1, which is characterized in that further include:
Acquisition writing material;
Using the writing material of acquisition as the input of machine learning component, the corresponding sight of machine learning component output writing material Point.
3. material recommended method as claimed in claim 2, which is characterized in that further include:
Machine learning component is trained based on following steps:
Sample writing material and corresponding sample viewpoint are obtained, the sample writing material that will acquire and corresponding sample viewpoint conduct Sample data, input machine learning component are trained.
4. material recommended method as claimed in claim 3, which is characterized in that obtain sample writing material and corresponding sample is seen Point, comprising:
Obtain data resource;
According to the feature of sentence in data resource, sample writing material and corresponding sample viewpoint are extracted from data resource.
5. material recommended method as claimed in claim 4, which is characterized in that according to the feature of sentence in data resource, from number Material is write according to sample is extracted in resource, comprising:
For the data resource of argumentative writing type, the sentence in the data resource of argumentative writing type is carried out according to argumentative writing debate structure Classification;
Story is searched in the data resource of argumentative writing type and triggers sentence, and story triggering sentence is merged with the sentence of adjacent specified type For true this writing of pattern material;
Sentence in the data resource of argumentative writing type is matched with known theoretical type sentence, the sentence of successful match is closed It and is theoretical pattern this writing material.
6. material recommended method as claimed in claim 4, which is characterized in that obtain the corresponding sample of sample writing material and see Point, comprising:
The data resource that the argumentative writing type where material is write for sample, according to argumentative writing debate structure to the number of argumentative writing type Classify according to the sentence in resource, classification results include topic sentence and sub- argument sentence;
The amalgamation result of one of any or any combination item in item following in the data resource of argumentative writing type is write as sample Make the corresponding sample viewpoint of material: topic, topic sentence and the sub- argument sentence nearest with sample writing material distance.
7. material recommended method as claimed in claim 3, which is characterized in that the sample writing material that will acquire and corresponding sample This viewpoint is used as sample data, and input machine learning component is trained, comprising:
Using true this writing of pattern material and corresponding sample viewpoint as sample data, input machine learning component is trained, Obtain true type machine learning component;
Using theoretical this writing of pattern material and corresponding sample viewpoint as sample data, input machine learning component is trained, Obtain theoretical type machine learning component.
8. the material recommended method as described in any one of claims 1 to 7, which is characterized in that obtain theme to be checked, wrap It includes:
When input content is article, classify according to argumentative writing debate structure to the sentence in article, classification results include Topic sentence and sub- argument sentence, using one of any or any combination in the theme of article, topic sentence and sub- argument sentence as to Inquire theme.
9. the material recommended method as described in any one of claims 1 to 7, which is characterized in that further include:
Determine the specified type of writing material to be recommended;
In the corresponding writing material of viewpoint of successful match, recommend the writing material for meeting specified type.
10. a kind of material recommendation apparatus characterized by comprising
Theme obtains module, for obtaining theme to be checked;
Matching module, for matching the theme to be checked with the viewpoint deposited;
Recommending module recommends the viewpoint of successful match corresponding for the corresponding relationship according to the viewpoint and writing material deposited Write material.
11. material recommendation apparatus as claimed in claim 10, which is characterized in that further include:
Material collection module, for acquiring writing material;
Viewpoint labeling module, input of the writing material as machine learning component for that will acquire, machine learning component output Write the corresponding viewpoint of material.
12. material recommendation apparatus as claimed in claim 11, which is characterized in that further include:
Training module, for being trained based on following steps to machine learning component:
Sample writing material and corresponding sample viewpoint are obtained, the sample writing material that will acquire and corresponding sample viewpoint conduct Sample data, input machine learning component are trained.
13. material recommendation apparatus as claimed in claim 12, which is characterized in that the training module, comprising:
Data capture unit, for obtaining data resource;
Sample acquisition unit, for the feature according to sentence in data resource, extracted from data resource sample writing material and Corresponding sample viewpoint.
14. material recommendation apparatus as claimed in claim 13, which is characterized in that the sample acquisition unit is specifically used for, needle To the data resource of argumentative writing type, classify according to argumentative writing debate structure to the sentence in the data resource of argumentative writing type;
Story is searched in the data resource of argumentative writing type and triggers sentence, and story triggering sentence is merged with the sentence of adjacent specified type For true this writing of pattern material;
Sentence in the data resource of argumentative writing type is matched with known theoretical type sentence, the sentence of successful match is closed It and is theoretical pattern this writing material.
15. material recommendation apparatus as claimed in claim 13, which is characterized in that the sample acquisition unit, also particularly useful for, The data resource that the argumentative writing type where material is write for sample is provided according to data of the argumentative writing debate structure to argumentative writing type Sentence in source is classified, and classification results include topic sentence and sub- argument sentence;By item following in the data resource of argumentative writing type In one of any or any combination item amalgamation result as the corresponding viewpoint of writing material: topic, topic sentence and with write Make the nearest sub- argument sentence of material distance.
16. a kind of computer equipment including memory, processor and stores the meter that can be run on a memory and on a processor Calculation machine program, which is characterized in that the processor realizes any one of claims 1 to 9 institute when executing the computer program The material recommended method stated.
17. a kind of computer readable storage medium, which is characterized in that the computer-readable recording medium storage has perform claim It is required that the computer program of material recommended method described in any one of 1 to 9.
CN201910166860.5A 2019-03-06 2019-03-06 Material recommendation method, device, computer equipment and computer readable storage medium Active CN109933791B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910166860.5A CN109933791B (en) 2019-03-06 2019-03-06 Material recommendation method, device, computer equipment and computer readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910166860.5A CN109933791B (en) 2019-03-06 2019-03-06 Material recommendation method, device, computer equipment and computer readable storage medium

Publications (2)

Publication Number Publication Date
CN109933791A true CN109933791A (en) 2019-06-25
CN109933791B CN109933791B (en) 2023-08-01

Family

ID=66986580

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910166860.5A Active CN109933791B (en) 2019-03-06 2019-03-06 Material recommendation method, device, computer equipment and computer readable storage medium

Country Status (1)

Country Link
CN (1) CN109933791B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112182210A (en) * 2020-09-25 2021-01-05 四川华空天行科技有限公司 Language generation model based on composition data feature classifier and writing support method
CN116090473A (en) * 2023-04-06 2023-05-09 北京大学深圳研究生院 Intelligent writing assisting method, device and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188866A1 (en) * 2012-12-31 2014-07-03 Microsoft Corporation Recommendation engine based on conditioned profiles
KR20180036460A (en) * 2016-09-30 2018-04-09 주식회사 솔트룩스 System and method for recommending knowledge actively to write document
CN108763516A (en) * 2018-05-31 2018-11-06 悦未来科技(深圳)有限公司 Write exchange method, device and computer readable storage medium

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140188866A1 (en) * 2012-12-31 2014-07-03 Microsoft Corporation Recommendation engine based on conditioned profiles
KR20180036460A (en) * 2016-09-30 2018-04-09 주식회사 솔트룩스 System and method for recommending knowledge actively to write document
CN108763516A (en) * 2018-05-31 2018-11-06 悦未来科技(深圳)有限公司 Write exchange method, device and computer readable storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
何少华: "新闻主题的提炼与深化", 《新闻爱好者》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112182210A (en) * 2020-09-25 2021-01-05 四川华空天行科技有限公司 Language generation model based on composition data feature classifier and writing support method
CN112182210B (en) * 2020-09-25 2023-11-24 四川华空天行科技有限公司 Language generation model based on composition and theory data feature classifier and composition supporting method
CN116090473A (en) * 2023-04-06 2023-05-09 北京大学深圳研究生院 Intelligent writing assisting method, device and system

Also Published As

Publication number Publication date
CN109933791B (en) 2023-08-01

Similar Documents

Publication Publication Date Title
US10896214B2 (en) Artificial intelligence based-document processing
Lui et al. Accurate language identification of twitter messages
CN104933164B (en) In internet mass data name entity between relationship extracting method and its system
Argyrou et al. Topic modelling on Instagram hashtags: An alternative way to Automatic Image Annotation?
US20200004792A1 (en) Automated website data collection method
CN103853834B (en) Text structure analysis-based Web document abstract generation method
CN107590219A (en) Webpage personage subject correlation message extracting method
Bisandu et al. Data clustering using efficient similarity measures
CN103744981A (en) System for automatic classification analysis for website based on website content
CN113569050B (en) Method and device for automatically constructing government affair field knowledge map based on deep learning
WO2016114790A1 (en) Reading difficulty level based resource recommendation
CN108228612B (en) Method and device for extracting network event keywords and emotional tendency
KR102126911B1 (en) Key player detection method in social media using KeyplayerRank
CN113988057A (en) Title generation method, device, equipment and medium based on concept extraction
CN112015907A (en) Method and device for quickly constructing discipline knowledge graph and storage medium
Baisa et al. Benchmark dataset for propaganda detection in Czech newspaper texts
Antunes et al. Readability of web content
KR102185733B1 (en) Server and method for automatically generating profile
CN104133913B (en) A kind of city retail shop information bank automatic build system being polymerized with search based on video analysis and method
CN109933791A (en) Material recommended method, device, computer equipment and computer readable storage medium
CN109684473A (en) A kind of automatic bulletin generation method and system
Shrestha Detecting fake news with sentiment analysis and network metadata
CN107291686B (en) Method and system for identifying emotion identification
CN110717029A (en) Information processing method and system
CN110019814B (en) News information aggregation method based on data mining and deep learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant