CN108132966A - Knowledge mapping generation method and device - Google Patents

Knowledge mapping generation method and device Download PDF

Info

Publication number
CN108132966A
CN108132966A CN201711217693.XA CN201711217693A CN108132966A CN 108132966 A CN108132966 A CN 108132966A CN 201711217693 A CN201711217693 A CN 201711217693A CN 108132966 A CN108132966 A CN 108132966A
Authority
CN
China
Prior art keywords
keyword
word
basic word
association
knowledge mapping
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711217693.XA
Other languages
Chinese (zh)
Inventor
乔会君
赵瑶宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hongtai Creation Information Technology Co Ltd
Original Assignee
Beijing Hongtai Creation Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Hongtai Creation Information Technology Co Ltd filed Critical Beijing Hongtai Creation Information Technology Co Ltd
Priority to CN201711217693.XA priority Critical patent/CN108132966A/en
Publication of CN108132966A publication Critical patent/CN108132966A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/21Design, administration or maintenance of databases
    • G06F16/215Improving data quality; Data cleansing, e.g. de-duplication, removing invalid entries or correcting typographical errors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/22Indexing; Data structures therefor; Storage structures
    • G06F16/221Column-oriented storage; Management thereof
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/27Replication, distribution or synchronisation of data between databases or within a distributed database system; Distributed database system architectures therefor
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/36Creation of semantic tools, e.g. ontology or thesauri
    • G06F16/374Thesaurus
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Software Systems (AREA)
  • Quality & Reliability (AREA)
  • Computing Systems (AREA)
  • Fuzzy Systems (AREA)
  • Mathematical Physics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a kind of knowledge mapping generation method and devices.Wherein, this method includes:Basic word is captured in predetermined content platform;The keyword with basic word association is obtained on predetermined content platform;According to the frequency that basic word and keyword occur simultaneously, the degree of association between basis word and keyword is determined;Knowledge mapping is generated according to the determining degree of association, wherein, knowledge mapping includes multiple basic words and the keyword with multiple basic word associations, and knowledge mapping feeds back for the search of content information.By the present invention, solve in the related art, when being scanned for the content information of needs, the technical issues of existence time is long, and efficiency is low.

Description

Knowledge mapping generation method and device
Technical field
The present invention relates to data processing field, in particular to a kind of knowledge mapping generation method and device.
Background technology
With the continuous development of information technology, internet information is intricate.People are when carrying out information search, Zhi Nengjian Frame retrieval singly is entered search terms into, system provides content relevant with the search term according to the search term of input and believes later Breath.But many times, people are not aware that the Accurate Expression of the content oneself to be searched for, only know that some are general interior Hold, and at this time, if it is desired to when getting the content information to be searched for, then can only user input possible search term one by one, And possible search result is shown one by one.And when using such operation, when often having searched for repeatedly or having searched for very long Between also search for less than really desired content information.Therefore, in the related art, when being scanned for the content information of needs, The problem of existence time is long, and efficiency is low.
For it is above-mentioned the problem of, currently no effective solution has been proposed.
Invention content
An embodiment of the present invention provides a kind of knowledge mapping generation method and device, at least to solve in the related art, When being scanned for the content information of needs, the technical issues of existence time is long, and efficiency is low.
One side according to embodiments of the present invention provides a kind of knowledge mapping generation method, including:In predetermined content The basic word of platform crawl;The keyword with the basic word association is obtained on the predetermined content platform;According to the basis The frequency that word occurs simultaneously with the keyword determines the degree of association between the basic word and the keyword;According to determining The degree of association generation knowledge mapping, wherein, the knowledge mapping include it is multiple it is described basis words and with the multiple base The keyword of plinth word association, the knowledge mapping feed back for the search of content information.
Optionally, it obtains on the predetermined content platform and includes with the keyword of the basic word association:It determines With the correlation dimension of the keyword and the basic word of the basic word association;On the predetermined content platform obtain with it is described All keywords of the basic word in the correlation dimension.
Optionally, the knowledge mapping is generated according to the determining degree of association to include:The basic word is obtained described The frequency that the frequency and the keyword that predetermined content platform occurs occur in the predetermined content platform;According to the base Plinth word determines the temperature of the basic word and according to the keyword described in the frequency that the predetermined content platform occurs The frequency that predetermined content platform occurs determines the temperature of the keyword;The knowledge graph is generated according to the determining degree of association Spectrum, wherein, include the degree of association of the basic word and the keyword in the knowledge mapping, the temperature of the basis word, The temperature of the keyword.
Optionally, it after the knowledge mapping is generated according to the determining degree of association, further includes:Receive input Search operation and/or clicking operation;According to the operation of the described search of reception and/or clicking operation, to the knowledge graph of generation Spectrum is modified.
Optionally, it after the knowledge mapping is generated according to the determining degree of association, further includes:Receive input For the search term of search;Judge in the knowledge mapping to whether there is and the matched basic word of described search word;Sentencing Disconnected result in the case of being, obtain the basic word content information and in the keyword of the basic word association Hold information;Show the basic content information of word and the content information of the keyword.
Optionally, show that the basic content information of word and the content information of the keyword include:With it is matched The keyword of the basis word association is ranked up multiple keywords according to the size of the degree of association and obtains there are in the case of multiple Obtain ranking results;While the basic word is shown, the content information of the keyword is shown according to ranking results.
Optionally, the basic content information of word and/or the content information of the keyword include:Domestic News, Trend of investment, action message, industry patent, service information, technical documentation, industry Zone Information.
According to another aspect of the present invention, a kind of knowledge mapping generating means are provided, including:Handling module, for The basic word of predetermined content platform crawl;First acquisition module, for being obtained on the predetermined content platform and the basic word Associated keyword;Determining module for the frequency occurred simultaneously with the keyword according to the basic word, determines the base The degree of association between plinth word and the keyword;Generation module, for generating knowledge mapping according to the determining degree of association, In, the knowledge mapping includes multiple basic words and the keyword with the multiple basic word association, the knowledge graph Spectrum is fed back for the search of content information.
Optionally, first acquisition module includes:First determination unit, for determining the pass with the basic word association The correlation dimension of keyword and the basic word;First acquisition unit, for being obtained and the base on the predetermined content platform All keywords of the plinth word in the correlation dimension.
Optionally, the generation module includes:Second acquisition unit, for obtaining the basic word in the predetermined content The frequency that the frequency and the keyword that platform occurs occur in the predetermined content platform;Second determination unit, for root According to the basic word temperature of the basic word is determined in the frequency that the predetermined content platform occurs and according to the key Word determines the temperature of the keyword in the frequency that the predetermined content platform occurs;Generation unit, for according to determining institute It states the degree of association and generates the knowledge mapping, wherein, the pass of the basic word and the keyword is included in the knowledge mapping Connection degree, the temperature of the basis word, the temperature of the keyword.
Optionally, which further includes:First receiving module, for receiving the search operation of input and/or clicking behaviour Make;Correcting module, for according to the operation of the described search of reception and/or clicking operation, being carried out to the knowledge mapping of generation It corrects.
Optionally, which further includes:Second receiving module, for receiving the search term for search of input;Sentence Disconnected module, for judging to whether there is in the knowledge mapping and the matched basic word of described search word;Second obtains mould Block, in the case where the judgment result is yes, obtain the basic word content information and with the basic word association The content information of keyword;Display module, for showing the basic content information of word and the content information of the keyword.
Optionally, the display module includes:Sequencing unit, in the keyword with the matched basic word association In the case of multiple, acquisition ranking results are ranked up to size of multiple keywords according to the degree of association;Display unit is used In while the basic word is shown, the content information of the keyword is shown according to ranking results.
In embodiments of the present invention, knowledge mapping is generated using according to the degree of association between basic word and the keyword, And the mode for feeding back the knowledge mapping for the search of content information so that there are phases between basic word and keyword The incidence relation answered when being scanned for basic word therein, can also readily obtain the key with the basis word association The content information of word has achieved the purpose that show the content information that may be needed as much as possible, it is achieved thereby that search needs Content information when, shorten duration, the technique effect of search efficiency is improved, and then solve in the related art, to needs When content information scans for, the technical issues of existence time is long, and efficiency is low.
Description of the drawings
Attached drawing described herein is used to provide further understanding of the present invention, and forms the part of the application, this hair Bright illustrative embodiments and their description do not constitute improper limitations of the present invention for explaining the present invention.In the accompanying drawings:
Fig. 1 is the flow chart of knowledge mapping generation method according to embodiments of the present invention;
Fig. 2 is the structure diagram of knowledge mapping generating means according to embodiments of the present invention;
Fig. 3 is the structure diagram of the first acquisition module 22 in knowledge mapping generating means according to embodiments of the present invention;
Fig. 4 is the structure diagram of generation module 28 in knowledge mapping generating means according to embodiments of the present invention;
Fig. 5 is the preferred structure block diagram one of knowledge mapping generating means according to embodiments of the present invention;
Fig. 6 is the preferred structure block diagram two of knowledge mapping generating means according to embodiments of the present invention;
Fig. 7 is the preferred structure block diagram of display module 68 in knowledge mapping generating means according to embodiments of the present invention;
Fig. 8 is showing for the database sharing of the knowledge mapping according to the preferred embodiment of the invention towards Intelligent hardware field It is intended to;
Fig. 9 is that the user foreground of the knowledge mapping according to the preferred embodiment of the invention towards Intelligent hardware field was searched for The schematic diagram of journey.
Specific embodiment
In order to which those skilled in the art is made to more fully understand the present invention program, below in conjunction in the embodiment of the present invention The technical solution in the embodiment of the present invention is clearly and completely described in attached drawing, it is clear that described embodiment is only The embodiment of a part of the invention, instead of all the embodiments.Based on the embodiments of the present invention, ordinary skill people Member's all other embodiments obtained without making creative work should all belong to the model that the present invention protects It encloses.
It should be noted that term " first " in description and claims of this specification and above-mentioned attached drawing, " Two " etc. be the object for distinguishing similar, and specific sequence or precedence are described without being used for.It should be appreciated that it uses in this way Data can be interchanged in the appropriate case, so as to the embodiment of the present invention described herein can in addition to illustrating herein or Sequence other than those of description is implemented.In addition, term " comprising " and " having " and their any deformation, it is intended that cover Cover it is non-exclusive include, be not necessarily limited to for example, containing the process of series of steps or unit, method, system, product or equipment Those steps or unit clearly listed, but may include not listing clearly or for these processes, method, product Or the intrinsic other steps of equipment or unit.
According to embodiments of the present invention, a kind of embodiment of the method for knowledge mapping generation method is provided, it should be noted that Step shown in the flowchart of the accompanying drawings can perform in the computer system of such as a group of computer-executable instructions, and And although showing logical order in flow charts, in some cases, can institute be performed with the sequence being different from herein The step of showing or describing.
Fig. 1 is the flow chart of knowledge mapping generation method according to embodiments of the present invention, as shown in Figure 1, this method includes Following steps:
Step S102 captures basic word in predetermined content platform, wherein, which can be scheduled net Page, naturally it is also possible to be the page that some other includes content information;
Step S104 obtains the keyword with basic word association on predetermined content platform;
Step S106 according to the frequency that basic word and keyword occur simultaneously, determines the pass between basis word and keyword Connection degree;
Step S108 generates knowledge mapping according to the determining degree of association, wherein, knowledge mapping include multiple basic words with With the keyword of multiple basic word associations, knowledge mapping feeds back for the search of content information.
By above-mentioned steps, and, will know using according to the degree of association generation knowledge mapping between basic word and keyword Know mode of the collection of illustrative plates for the search feedback of content information so that there are corresponding incidence relation between basic word and keyword, When being scanned for basic word therein, the content information with the keyword of the basis word association can be also readily obtained, Achieve the purpose that show the content information that may be needed as much as possible, it is achieved thereby that during the content information of search needs, Shorten duration, improve the technique effect of search efficiency, and then solve in the related art, the content information of needs is searched The technical issues of Suo Shi, existence time is long, and efficiency is low.
Optionally, when predetermined content platform captures basic word, the basic word to be captured can also be selected or Screening, so that the knowledge mapping that will be generated is more universal or more practical.For example, according to scheduled screening conditions (for example, for avoiding the screening conditions of uncommon word) screens basic word, by screening operation, it is possible to prevente effectively from looking for not To keyword corresponding with basic word, the success rate of generation knowledge mapping is improved.
When obtaining the keyword with basic word association on predetermined content platform, can according to the basis word and keyword it Between correlation dimension, to obtain all keywords with the basis word association.It should be noted that above-mentioned correlation dimension can be For showing to be associated with rank between basic word and keyword, for example, when basic word and keyword are directly linked, it is believed that The basis word is associated with keyword for level-one;When basic word and keyword need when being associated, to recognize by a word With keyword it is that two level is associated with for the basis word;It, can be with when basic word and keyword are needed through two words come when being associated Think that the basis word is associated with keyword for three-level;The rest may be inferred ....Therefore, by correlation dimension, to obtain and the basis During all keywords of word association, the correlation dimension with the keyword of basic word association and basic word can be first determined;Predetermined All keywords in correlation dimension with basic word are obtained on content platform.It, can be flexibly according to specific by above-mentioned processing Needs control the quantity of the keyword got, can be for example, when relatively broad with the keyword of basic word association Correlation dimension controls few, so that the quantity of the keyword obtained is unlikely to too many;When the pass with basic word association When keyword is more rare, correlation dimension can be controlled more, so that the keyword obtained is comprehensive.
Optionally, when generating knowledge mapping according to the determining degree of association, can include in knowledge mapping to content information Useful information is searched for, for example, it may be some necessary information, for example, the degree of association between basic word and keyword, certainly Can also include some auxiliary informations, for example, the basic temperature of word and the temperature of keyword, including auxiliary information For more fully being shown to the information to be searched for.For example, when being shown to the content information searched out, Ke Yixian Show the temperature of the content information, so as to which user be facilitated preferably to know the attention rate of the content information.When being wrapped in knowledge mapping The auxiliary information included includes the temperature of basic word, during the temperature of keyword, can first pass through in the following manner and obtain basic word Temperature, the temperature of keyword:It first obtains frequency that basic word occurs in predetermined content platform and keyword is put down in predetermined content The frequency that platform occurs;Later, according to basic word the frequency that predetermined content platform occurs determine basic word temperature and according to Keyword determines the temperature of keyword in the frequency that predetermined content platform occurs.So as to generate knowledge graph according to the determining degree of association Spectrum, wherein, the degree of association of basic word and keyword, the temperature of basic word, the temperature of keyword are included in knowledge mapping.
Knowledge mapping to search for for content information is more perfect, so that the content information of search is more smart Really, the knowledge mapping of generation can timely be corrected after according to determining degree of association generation knowledge mapping, wherein Modified mode can be a variety of, for example, can be realized according to the operation issued to predetermined content platform, for example, can first connect Receive the search operation and/or clicking operation of input;According to the search operation and/or clicking operation of reception, to the knowledge of generation Collection of illustrative plates is modified.For concrete example, after above-mentioned knowledge mapping is generated, input of the user to a certain basic word is received, The number for inputting the basis word is counted, when the number of statistics reaches input pre-value, it may be determined that the basis word is works as Under heat search word, therefore, directly by the temperature of the basis word be turned up.Equally receiving click behaviour of the user to a certain basic word When making, the number for clicking the basis word is counted, reaches the pre- timing of click in the number of statistics, it may be determined that the basis word For high concern word instantly, therefore, directly the temperature of the basis word is turned up.The temperature after adjustment is adapted to knowledge graph later In spectrum, according to revised knowledge mapping, to realize the search feedback to content information, so that the content information of feedback is more It is accurate.
It should be noted that after according to determining degree of association generation knowledge mapping, content information is scanned for instead During feedback, following processing mode may be used:Receive the search term for search of input;It whether there is in judgemental knowledge collection of illustrative plates With the matched basic word of search term;In the case where the judgment result is yes, obtain basic word content information and with basic word The content information of associated keyword;The content information of display base word and the content information of keyword.Pass through matched mode The content information of the not only content information of display base word, also display and the keyword of basic word association, realizes quick, high Effect accurately feeds back content information.
Optionally, it in the content information of the content information of display base word and keyword, is closed with matched basic word The keyword of connection may be used diversified forms and the content information of multiple keywords shown there are in the case of multiple, example Such as, it may be used and acquisition ranking results first be ranked up to size of multiple keywords according to the degree of association;Showing basic word While, the content information according to ranking results display keyword.It, can be aobvious according to needing flexibly to control when specifically being shown The quantity shown for example, when the content information of keyword is more, can will show the quantity control of the keyword of content information It obtains smaller;And when the content information of keyword is smaller, it can will show that the quantity of the keyword of content information controls It is more.
It should be noted that it is above-mentioned basis the content information of word and/or the content information of keyword may each comprise it is a variety of, For example, at least one of can be included:Domestic News, trend of investment, action message, industry patent, service information, technology text Shelves, industry Zone Information.Classification annotation or right is carried out by the content information of the content information to basic word and/or keyword Above-mentioned included type is finely divided, for example, Domestic News include:Sports news information, entertainment news information etc., investment are dynamic State includes real estate investment dynamic, equity investment dynamic etc., and industry patent includes patent of invention information, utility model patent information Include commerce services information, service for life etc. Deng, service information.By in the content information and/or keyword to basic word Hold information and carry out classification annotation, it would be desirable to which user oneself sees the content for oneself being needed to be distinguished after the page, believes by content Breath feeds back to user and has just carried out corresponding mark before, and having effectively achieved may need the content distinguished to carry out area user Point so that it is more accurate to the information of user feedback by knowledge mapping.
In embodiments of the present invention, a kind of knowledge mapping generating means are additionally provided, Fig. 2 is according to embodiments of the present invention The structure diagram of knowledge mapping generating means, as shown in Fig. 2, the device includes:Handling module 22, the first acquisition module 24, really Cover half block 26 and generation module 28, below illustrate the device.
Handling module 22, for capturing basic word in predetermined content platform;First acquisition module 22, is connected to above-mentioned crawl Module 22, for obtaining the keyword with basic word association on predetermined content platform;Determining module 26 is connected to above-mentioned first Acquisition module 22 for the frequency occurred simultaneously according to basic word and keyword, determines being associated between basis word and keyword Degree;Generation module 28 is connected to above-mentioned determining module 26, for generating knowledge mapping according to the determining degree of association, wherein, knowledge Collection of illustrative plates includes multiple basic words and the keyword with multiple basic word associations, and knowledge mapping is anti-for the search of content information Feedback.
Fig. 3 is the structure diagram of the first acquisition module 22 in knowledge mapping generating means according to embodiments of the present invention, such as Shown in Fig. 3, which includes:First determination unit 32 and first acquisition unit 34 below carry out first acquisition module 22 Explanation.
First determination unit 32, for determining the correlation dimension with the keyword of basic word association and basic word;First obtains Unit 34 is taken, is connected to above-mentioned first determination unit 32, for being obtained on predetermined content platform with basic word in correlation dimension Interior all keywords.
Fig. 4 is the structure diagram of generation module 28 in knowledge mapping generating means according to embodiments of the present invention, such as Fig. 4 institutes Show, which includes:Second acquisition unit 42, the second determination unit 44 and generation unit 46, below to the generation mould Block 28 illustrates.
Second acquisition unit 42, the frequency occurred for obtaining basic word in predetermined content platform and keyword are pre- Determine the frequency of content platform appearance;Second determination unit 44 is connected to above-mentioned second acquisition unit 42, for being existed according to basic word The frequency that the frequency that predetermined content platform occurs determines the temperature of basic word and occurred according to keyword in predetermined content platform Determine the temperature of keyword;Generation unit 46 is connected to above-mentioned second determination unit 44, for according to determining degree of association generation Knowledge mapping, wherein, the degree of association of basic word and keyword, the temperature of basic word, the heat of keyword are included in knowledge mapping Degree.
Fig. 5 is the preferred structure block diagram one of knowledge mapping generating means according to embodiments of the present invention, as shown in figure 5, should Device further includes in addition to including all structures shown in Fig. 2:First receiving module 52 and correcting module 54, it is preferred to this below Structure illustrates.
First receiving module 52 is connected to above-mentioned generation module 28, for receiving the search operation of input and/or click Operation;Correcting module 54 is connected to above-mentioned first receiving module 52, for the search operation and/or clicking operation according to reception, The knowledge mapping of generation is modified.
Fig. 6 is the preferred structure block diagram two of knowledge mapping generating means according to embodiments of the present invention, as shown in fig. 6, should Device further includes in addition to including all structures shown in Fig. 2:Second receiving module 62, judgment module 64, the second acquisition module 66 With display module 68, the preferred structure is illustrated below.
Second receiving module 62 is connected to above-mentioned generation module 28, for receiving the search term for search of input; Judgment module 64 is connected to above-mentioned second receiving module 62, matched with search term for whether there is in judgemental knowledge collection of illustrative plates Basic word;Second acquisition module 66 is connected to above-mentioned judgment module 64, in the case where the judgment result is yes, obtaining base The content information of plinth word and the content information with the keyword of basic word association;Display module 68 is connected to above-mentioned second and obtains Modulus block 66, for the content information of display base word and the content information of keyword.
Fig. 7 is the preferred structure block diagram of display module 68 in knowledge mapping generating means according to embodiments of the present invention, such as Shown in Fig. 7, which includes:Sequencing unit 72 and display unit 74 below illustrate the display module 68.
Sequencing unit 72, for the keyword of matched basic word association there are in the case of multiple, to multiple passes Keyword is ranked up acquisition ranking results according to the size of the degree of association;Display unit 74 is connected to above-mentioned sequencing unit 72, is used for While basic word is shown, the content information of keyword is shown according to ranking results.
For the complicated internet information of solution, user is allowed to search more accurately information, in embodiments of the present invention, Provide a kind of working knowledge interconnection carries out the method for information search.The purpose of knowledge interconnection is one people of structure and machine The WWW being appreciated that so that network is more intelligent.Knowledge mapping is the semantic knowledge-base of structuring, for symbol Number form describes concept and its correlation in physical world, and basic composition unit is " entity-relationship-entity " ternary Group and entity and its association attributes-value pair are interconnected by relationship between entity, the webbed structure of knowledge of structure.In this hair In bright embodiment, a kind of knowledge mapping structure (or to generate) method towards Intelligent hardware field is specifically provided, wherein, The knowledge mapping be divided into database and user foreground two parts, in database part:Pass through the content platform in setting first Keyword crawl is carried out, the knowledge dictionary of incidence relation is established, data is associated and classified by labeling, are carried out at the same time Data cleansing forms the knowledge base in Intelligent hardware field with concluding.It should be noted that signified intelligence in embodiments of the present invention Energy hardware is a scientific and technological concept, refers to be combined to traditional equipment progress intellectualized reconstruction by hardware and software.Transformation pair As that can be electronic equipment, for example, wrist-watch, TV and other electric appliances;Can also be the equipment without electronization, for example, door lock, Teacup, automobile etc..For example, Intelligent hardware can extend to smart television, smart home, intelligent automobile, doctor from wearable device Treat health, intelligent toy, robot etc..
Fig. 8 is showing for the database sharing of the knowledge mapping according to the preferred embodiment of the invention towards Intelligent hardware field It is intended to, as shown in figure 8, the flow is including as follows:Crawler capturing is carried out at the same time data cleansing;It is carried out according to crawler capturing result Entry and score value are associated with, so as to generate hardware knowledge database.It is for example, specific as follows:It is carried out by the content platform in setting Keyword captures, and establishes basic dictionary, is carried out at the same time data cleansing with concluding, it should be noted that data are carried out cleaning and During conclusion, preset cleaning rule may be used or induction rule carries out, by preset cleaning rule to one A little more miscellaneous or redundancy data are cleaned so that the data after cleaning can more targetedly serve user; Inducing classification is carried out to data by induction rule so that the data after inducing classification can mutually understand, it will be apparent that distinguish, section The discrimination time of user is saved, improves user experience;With these bases, dictionary is served as theme, and obtains basis respectively on the platform of setting The same level-one keyword data of each word association in dictionary.Then to all keywords, how much sorted according to the frequency of appearance So that it is determined that the absolute temperature of a word;According to the frequency of other words that using some word as core, statistics occurs simultaneously with the word Rate, to determine the stiffness of coupling of a word and another word.The forward-backward correlation searched for and clicked according to user, carrys out dynamic corrections word Temperature and word and word between the degree of association.Each keyword is associated with the corresponding Domestic News of setting content platform, throwing behind Provide the content informations such as dynamic, correlated activation, related patents, relevant documentation.
Based on the method for above-mentioned knowledge mapping structure, can be accomplished in several ways, for example, can be following by including The system of cluster or module is realized, for example, the system includes:Reptile cluster, Hadoop distributed storages cluster, natural language Speech processing cluster, Mahout knowledge excavations module and knowledge data base;Wherein, which is used for according to seed address, grabs Web data is taken, and web data is stored in webpage HBase table;Natural language processing cluster is used to be distributed from the Hadoop It obtains the webpage HBase table in formula storage cluster, generates original knowledge information, and the original knowledge information is stored in and original is known Know in HBase table;The Mahout knowledge excavations module is used to carry out the original knowledge information knowledge excavation, and generation is unstructured Data, and the unstructured data is stored in unstructured data HBase table;The knowledge data base is used for according to through remarkable The unstructured data structure knowledge mapping of work audit.
In user's foreground partition:When user foreground key in search term scan for, system can by corresponding keyword into Row displaying, the classifying content that keyword is related to can include a variety of, it may for example comprise technical documentation, industry patent, industry Zone Information, Related service, action message, trend of investment etc..In addition, after user logs in, user inputs a search term, not only provides this The absolute temperature of a search term, moreover it is possible to provide the keyword distribution of different strengths and weaknesses of coupled degree, and combine the strong pass of the degree of coupling Keyword recommends corresponding content.By the knowledge mapping towards Intelligent hardware field, solves Intelligent hardware domain knowledge figure The blank of spectrum realizes the active demand to conformability and relevance of Intelligent hardware industry.Furthermore it is also possible to it provides to the user The more industry data of precise specifications and abundant expression help user more easily to obtain Intelligent hardware domain knowledge.
Fig. 9 is that the user foreground of the knowledge mapping according to the preferred embodiment of the invention towards Intelligent hardware field was searched for The schematic diagram of journey, as shown in figure 9, the user foreground search process is including as follows:Receive search term input by user;By knowing Know database and filter information is carried out to the search term of input;Filter information is sent to displaying end later:It is number in the selection result It is data presence and complete feelings in the selection result according to data of the database there is no search there is no in the case of, are prompted Under condition, the sub-category complete information data of displaying.To further improve user experience, can further be closed in knowledge data base Connection keyword (with basic word association) is associated with content information, for example, for example, user (corresponds to above-mentioned in input tennis Signified search term), the corresponding content information of corresponding with tennis movement (keyword) is shown, to further improve user's body It tests, can also be associated with and be associated with content information with motion association, for example, the other ball information (for example, football) of association, other The action message (for example, information of football match) of movement, other sports equipment information are (for example, the brand that football needs The information of football shirt) etc..
By including above-mentioned database and the two-part knowledge mapping in user foreground, solve in the relevant technologies, it is impossible to carry For the knowledge mapping in Intelligent hardware field, it is impossible to provide the user with the knowledge in the Intelligent hardware field of the profession of system, it is impossible to full The technical issues of user demand in sufficient Intelligent hardware field.By the structure of the above-mentioned knowledge mapping towards Intelligent hardware field, Intelligent hardware field content multi-resources Heterogeneous is efficiently solved, the problem of institutional framework is loose, while realize intelligentized search With personalized recommendation, meet the stratification in Intelligent hardware field, the user demand of structuring.By knowledge mapping technology, more The answer that the feedback user of intelligence needs.
The embodiments of the present invention are for illustration only, do not represent the quality of embodiment.
In the above embodiment of the present invention, all emphasize particularly on different fields to the description of each embodiment, do not have in some embodiment The part of detailed description may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed technology contents can pass through others Mode is realized.Wherein, the apparatus embodiments described above are merely exemplary, such as the division of the unit, Ke Yiwei A kind of division of logic function, can there is an other dividing mode in actual implementation, for example, multiple units or component can combine or Person is desirably integrated into another system or some features can be ignored or does not perform.Another point, shown or discussed is mutual Between coupling, direct-coupling or communication connection can be INDIRECT COUPLING or communication link by some interfaces, unit or module It connects, can be electrical or other forms.
The unit illustrated as separating component may or may not be physically separate, be shown as unit The component shown may or may not be physical unit, you can be located at a place or can also be distributed to multiple On unit.Some or all of unit therein can be selected according to the actual needs to realize the purpose of this embodiment scheme.
In addition, each functional unit in each embodiment of the present invention can be integrated in a processing unit, it can also That each unit is individually physically present, can also two or more units integrate in a unit.Above-mentioned integrated list The form that hardware had both may be used in member is realized, can also be realized in the form of SFU software functional unit.
If the integrated unit is realized in the form of SFU software functional unit and is independent product sale or uses When, it can be stored in a computer read/write memory medium.Based on such understanding, technical scheme of the present invention is substantially The part to contribute in other words to the prior art or all or part of the technical solution can be in the form of software products It embodies, which is stored in a storage medium, is used including some instructions so that a computer Equipment (can be personal computer, server or network equipment etc.) perform each embodiment the method for the present invention whole or Part steps.And aforementioned storage medium includes:USB flash disk, read-only memory (ROM, Read-OnlyMemory), arbitrary access are deposited Reservoir (RAM, Random Access Memory), mobile hard disk, magnetic disc or CD etc. are various can to store program code Medium.
The above is only the preferred embodiment of the present invention, it is noted that for the ordinary skill people of the art For member, various improvements and modifications may be made without departing from the principle of the present invention, these improvements and modifications also should It is considered as protection scope of the present invention.

Claims (13)

1. a kind of knowledge mapping generation method, which is characterized in that including:
Basic word is captured in predetermined content platform;
The keyword with the basic word association is obtained on the predetermined content platform;
According to the frequency that the basic word occurs simultaneously with the keyword, determine between the basic word and the keyword The degree of association;
Knowledge mapping is generated according to the determining degree of association, wherein, the knowledge mapping include multiple basic words with With the keyword of the multiple basic word association, the knowledge mapping feeds back for the search of content information.
2. it according to the method described in claim 1, it is characterized in that, is obtained on the predetermined content platform and the basic word The associated keyword includes:
Determine the correlation dimension with the keyword of the basic word association and the basic word;
All keywords in the correlation dimension with the basic word are obtained on the predetermined content platform.
3. according to the method described in claim 1, it is characterized in that, the knowledge mapping is generated according to the determining degree of association Including:
It obtains frequency that the basic word occurs in the predetermined content platform and the keyword is put down in the predetermined content The frequency that platform occurs;
The temperature of the basic word is determined and according to institute in the frequency that the predetermined content platform occurs according to the basic word State the temperature that keyword determines the keyword in the frequency that the predetermined content platform occurs;
The knowledge mapping is generated according to the determining degree of association, wherein, the basic word is included in the knowledge mapping With the degree of association of the keyword, the temperature of the basis word, the temperature of the keyword.
4. according to the method described in claim 1, it is characterized in that, generating the knowledge graph according to the determining degree of association After spectrum, further include:
Receive the search operation and/or clicking operation of input;
According to the operation of the described search of reception and/or clicking operation, the knowledge mapping of generation is modified.
5. method according to any one of claim 1 to 4, which is characterized in that according to determining degree of association life Into after the knowledge mapping, further include:
Receive the search term for search of input;
Judge in the knowledge mapping to whether there is and the matched basic word of described search word;
In the case where the judgment result is yes, the content information of the basic word and the pass with the basic word association are obtained The content information of keyword;
Show the basic content information of word and the content information of the keyword.
6. according to the method described in claim 5, it is characterized in that, the content information and the keyword of the display basic word Content information include:
With the keyword of the matched basic word association there are in the case of multiple, to multiple keywords according to the degree of association Size is ranked up acquisition ranking results;
While the basic word is shown, the content information of the keyword is shown according to ranking results.
7. method according to claim 5 or 6, which is characterized in that the content information of the basis word and/or the key The content information of word includes:Domestic News, trend of investment, action message, industry patent, service information, technical documentation, industry Information.
8. a kind of knowledge mapping generating means, which is characterized in that including:
Handling module, for capturing basic word in predetermined content platform;
First acquisition module, for obtaining the keyword with the basic word association on the predetermined content platform;
Determining module for the frequency occurred simultaneously with the keyword according to the basic word, determines the basic word and institute State the degree of association between keyword;
Generation module, for generating knowledge mapping according to the determining degree of association, wherein, the knowledge mapping includes multiple The basis word and the keyword with the multiple basic word association, the knowledge mapping feed back for the search of content information.
9. device according to claim 8, which is characterized in that first acquisition module includes:
First determination unit, for determining the correlation dimension with the keyword of the basic word association and the basic word;
First acquisition unit, for obtaining the institute with the basic word in the correlation dimension on the predetermined content platform There is keyword.
10. device according to claim 8, which is characterized in that the generation module includes:
Second acquisition unit, the frequency and the key occurred for obtaining the basic word in the predetermined content platform The frequency that word occurs in the predetermined content platform;
Second determination unit, for determining the basic word in the frequency that the predetermined content platform occurs according to the basic word Temperature and the temperature of the keyword is determined in the frequency that the predetermined content platform occurs according to the keyword;
Generation unit, for generating the knowledge mapping according to the determining degree of association, wherein, it is included in the knowledge mapping There are the degree of association of the basic word and the keyword, the temperature of the basis word, the temperature of the keyword.
11. device according to claim 8, which is characterized in that further include:
First receiving module, for receiving the search operation of input and/or clicking operation;
Correcting module, for according to the operation of the described search of reception and/or clicking operation, being carried out to the knowledge mapping of generation It corrects.
12. the device according to any one of claim 8 to 11, which is characterized in that further include:
Second receiving module, for receiving the search term for search of input;
Judgment module, for judging to whether there is in the knowledge mapping and the matched basic word of described search word;
Second acquisition module, in the case where the judgment result is yes, obtain the basic word content information and with institute State the content information of the keyword of basic word association;
Display module, for showing the basic content information of word and the content information of the keyword.
13. device according to claim 12, which is characterized in that the display module includes:
Sequencing unit, for the keyword of the matched basic word association there are in the case of multiple, to multiple keys Word is ranked up acquisition ranking results according to the size of the degree of association;
Display unit, for while the basic word is shown, the content letter of the keyword to be shown according to ranking results Breath.
CN201711217693.XA 2017-11-28 2017-11-28 Knowledge mapping generation method and device Pending CN108132966A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711217693.XA CN108132966A (en) 2017-11-28 2017-11-28 Knowledge mapping generation method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711217693.XA CN108132966A (en) 2017-11-28 2017-11-28 Knowledge mapping generation method and device

Publications (1)

Publication Number Publication Date
CN108132966A true CN108132966A (en) 2018-06-08

Family

ID=62389806

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711217693.XA Pending CN108132966A (en) 2017-11-28 2017-11-28 Knowledge mapping generation method and device

Country Status (1)

Country Link
CN (1) CN108132966A (en)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109829059A (en) * 2019-01-18 2019-05-31 平安科技(深圳)有限公司 Recommend method, apparatus, equipment and the storage medium of knowledge point
CN110188241A (en) * 2019-06-04 2019-08-30 成都索贝数码科技股份有限公司 A kind of race intelligence manufacturing system and production method
CN110442765A (en) * 2019-07-04 2019-11-12 卓尔智联(武汉)研究院有限公司 Information processing method, device, terminal and storage medium
CN110851610A (en) * 2018-07-25 2020-02-28 百度在线网络技术(北京)有限公司 Knowledge graph generation method and device, computer equipment and storage medium
CN110929019A (en) * 2018-08-30 2020-03-27 深圳市蓝灯鱼智能科技有限公司 Information display method and device, storage medium and electronic device
CN110990584A (en) * 2019-11-26 2020-04-10 口口相传(北京)网络技术有限公司 Knowledge graph generation method and device
CN111737477A (en) * 2020-08-07 2020-10-02 杭州六棱镜知识产权科技有限公司 Intellectual property big data-based intelligence investigation method, system and storage medium
CN112911073A (en) * 2019-04-30 2021-06-04 五竹科技(北京)有限公司 Intelligent knowledge graph construction method and device for outbound process conversation content
CN113987374A (en) * 2021-10-27 2022-01-28 北京达佳互联信息技术有限公司 Word cloud display method and device, electronic equipment, medium and product
CN115658929A (en) * 2022-12-14 2023-01-31 天津理工大学 Asset management knowledge graph generation method, device and system
CN117633253A (en) * 2024-01-25 2024-03-01 南京大学 Scientific-technical association detection method based on knowledge network multidimensional coupling

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090327279A1 (en) * 2008-06-25 2009-12-31 International Business Machines Corporation Apparatus and method for supporting document data search
CN102591862A (en) * 2011-01-05 2012-07-18 华东师范大学 Control method and device of Chinese entity relationship extraction based on word co-occurrence
CN104408102A (en) * 2014-11-19 2015-03-11 北京国双科技有限公司 Data processing method and device for association degree of network hot words and object
CN104462507A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for establishing knowledge graph based on movie songs
CN104462501A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Knowledge graph construction method and device based on structural data
CN105159912A (en) * 2015-07-06 2015-12-16 无锡天脉聚源传媒科技有限公司 Method and apparatus for processing degree of correlation among different words
CN106528616A (en) * 2016-09-30 2017-03-22 厦门快商通科技股份有限公司 Language error correcting method and system for use in human-computer interaction process

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090327279A1 (en) * 2008-06-25 2009-12-31 International Business Machines Corporation Apparatus and method for supporting document data search
CN102591862A (en) * 2011-01-05 2012-07-18 华东师范大学 Control method and device of Chinese entity relationship extraction based on word co-occurrence
CN104408102A (en) * 2014-11-19 2015-03-11 北京国双科技有限公司 Data processing method and device for association degree of network hot words and object
CN104462507A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Method and device for establishing knowledge graph based on movie songs
CN104462501A (en) * 2014-12-19 2015-03-25 北京奇虎科技有限公司 Knowledge graph construction method and device based on structural data
CN105159912A (en) * 2015-07-06 2015-12-16 无锡天脉聚源传媒科技有限公司 Method and apparatus for processing degree of correlation among different words
CN106528616A (en) * 2016-09-30 2017-03-22 厦门快商通科技股份有限公司 Language error correcting method and system for use in human-computer interaction process

Cited By (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110851610A (en) * 2018-07-25 2020-02-28 百度在线网络技术(北京)有限公司 Knowledge graph generation method and device, computer equipment and storage medium
CN110851610B (en) * 2018-07-25 2022-09-27 百度在线网络技术(北京)有限公司 Knowledge graph generation method and device, computer equipment and storage medium
CN110929019A (en) * 2018-08-30 2020-03-27 深圳市蓝灯鱼智能科技有限公司 Information display method and device, storage medium and electronic device
CN110929019B (en) * 2018-08-30 2022-06-10 北京蓝灯鱼智能科技有限公司 Information display method and device, storage medium and electronic device
CN109829059A (en) * 2019-01-18 2019-05-31 平安科技(深圳)有限公司 Recommend method, apparatus, equipment and the storage medium of knowledge point
CN112911073A (en) * 2019-04-30 2021-06-04 五竹科技(北京)有限公司 Intelligent knowledge graph construction method and device for outbound process conversation content
CN110188241A (en) * 2019-06-04 2019-08-30 成都索贝数码科技股份有限公司 A kind of race intelligence manufacturing system and production method
CN110188241B (en) * 2019-06-04 2023-07-25 成都索贝数码科技股份有限公司 Intelligent manufacturing system and manufacturing method for events
CN110442765A (en) * 2019-07-04 2019-11-12 卓尔智联(武汉)研究院有限公司 Information processing method, device, terminal and storage medium
CN110442765B (en) * 2019-07-04 2022-03-11 卓尔智联(武汉)研究院有限公司 Information processing method, device, terminal and storage medium
CN110990584A (en) * 2019-11-26 2020-04-10 口口相传(北京)网络技术有限公司 Knowledge graph generation method and device
CN110990584B (en) * 2019-11-26 2021-02-09 口口相传(北京)网络技术有限公司 Knowledge graph generation method and device
CN111737477A (en) * 2020-08-07 2020-10-02 杭州六棱镜知识产权科技有限公司 Intellectual property big data-based intelligence investigation method, system and storage medium
CN113987374A (en) * 2021-10-27 2022-01-28 北京达佳互联信息技术有限公司 Word cloud display method and device, electronic equipment, medium and product
CN115658929A (en) * 2022-12-14 2023-01-31 天津理工大学 Asset management knowledge graph generation method, device and system
CN115658929B (en) * 2022-12-14 2023-03-28 天津理工大学 Asset management knowledge graph generation method, device and system
CN117633253A (en) * 2024-01-25 2024-03-01 南京大学 Scientific-technical association detection method based on knowledge network multidimensional coupling
CN117633253B (en) * 2024-01-25 2024-04-30 南京大学 Scientific-technical association detection method based on knowledge network multidimensional coupling

Similar Documents

Publication Publication Date Title
CN108132966A (en) Knowledge mapping generation method and device
CN106446195A (en) News recommending method and device based on artificial intelligence
US9135370B2 (en) Method and apparatus of generating update parameters and displaying correlated keywords
CN104809243B (en) It is a kind of that method is recommended based on the mixing excavated to user behavior composite factor
CN103744928B (en) A kind of network video classification method based on history access record
CN105095219B (en) Micro-blog recommendation method and terminal
CN103699700B (en) A kind of generation method of search index, system and associated server
CN107918616A (en) Search system, page display method and client
CN103984740B (en) Based on the method and system that the retrieved page of combination tag shows
CN102163228B (en) Method, apparatus and device for determining sorting result of resource candidates
CN110380954A (en) Data sharing method and device, storage medium and electronic device
CN107633021A (en) A kind of dispensing of graph text information, generation method and device
CN105243087A (en) IT (Information Technology) information aggregation reading personalized recommendation method
CN106682145A (en) Enterprise information processing method, server and client
CN107563867A (en) A kind of commending system cold start-up method based on multi-arm fruit machine confidence upper limit
CN109711867A (en) Shopper based on rating big data, which draws a portrait, constructs marketing method and system
CN106777282B (en) The sort method and device of relevant search
CN103577405A (en) Interest analysis based micro-blogger community classification method
CN108733791A (en) network event detection method
CN103365904A (en) Advertising information searching method and system
CN113590928A (en) Content recommendation method and device and computer-readable storage medium
CN107369058A (en) A kind of correlation recommendation method and server
CN107220745A (en) A kind of recognition methods, system and equipment for being intended to behavioral data
CN104077337A (en) Searching method and device
CN102214227B (en) Automatic public opinion monitoring method based on internet hierarchical structure storage

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20180608