CN112883165A - Intelligent full-text retrieval method and system based on semantic understanding - Google Patents

Intelligent full-text retrieval method and system based on semantic understanding Download PDF

Info

Publication number
CN112883165A
CN112883165A CN202110281426.9A CN202110281426A CN112883165A CN 112883165 A CN112883165 A CN 112883165A CN 202110281426 A CN202110281426 A CN 202110281426A CN 112883165 A CN112883165 A CN 112883165A
Authority
CN
China
Prior art keywords
text
word
short
short text
semantic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110281426.9A
Other languages
Chinese (zh)
Other versions
CN112883165B (en
Inventor
吴士伟
杨春
李慧娟
孙露
孙浩
辛国茂
胡传会
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shandong Ecloud Information Technology Co ltd
Original Assignee
Shandong Ecloud Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shandong Ecloud Information Technology Co ltd filed Critical Shandong Ecloud Information Technology Co ltd
Priority to CN202110281426.9A priority Critical patent/CN112883165B/en
Publication of CN112883165A publication Critical patent/CN112883165A/en
Application granted granted Critical
Publication of CN112883165B publication Critical patent/CN112883165B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation
    • G06F16/3334Selection or weighting of terms from queries, including natural language queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3347Query execution using vector based model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/211Syntactic parsing, e.g. based on context-free grammar [CFG] or unification grammars
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/289Phrasal analysis, e.g. finite state techniques or chunking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention discloses an intelligent full-text retrieval method and system based on semantic understanding, which comprises the following steps: cutting the received search sentence into short texts, and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts; constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text; and based on the semantic information vector and the dependency relationship vector of the short text, performing similarity calculation on the short text information and related information in the intelligent index library to further obtain a search result set. According to the method, the original data are divided into a plurality of short texts to form the search text vector, and the similarity between the search text and the index library text is calculated by calling the semantic understanding interface of the artificial intelligence platform, so that the accuracy of full-text retrieval can be improved.

Description

Intelligent full-text retrieval method and system based on semantic understanding
Technical Field
The invention relates to the technical field of natural language processing, in particular to an intelligent full-text retrieval method and system based on semantic understanding.
Background
The statements in this section merely provide background information related to the present disclosure and may not necessarily constitute prior art.
The full text search takes various data, such as characters, voice, images and the like as processing objects, provides a means for realizing information search according to the content of data materials rather than external characteristics, and comprises two functions: data management and data query help users to quickly manage and retrieve large amounts of document data.
Lucene is currently an open source item of Apache and is also currently the most popular Java-based open source full-network search toolkit. Lucene realizes some general word segmentation algorithms, reserves a plurality of lexical analyzer interfaces for users, and can be conveniently embedded into various applications to realize the full-text retrieval function of the applications. The retrieval essence of the Lucene still belongs to index retrieval, full-text indexing is carried out on files and characters needing to be retrieved, the indexing is quickly retrieved during retrieval to obtain a retrieval position, the position is associated with a document path where a retrieval word appears, and the Lucene returns a retrieval result to a user.
The data volume of the big data era is increased sharply, and with the development of microblogs, forums, jitters and other media and society, the retrieval effect is to be improved with the increase of a plurality of new words and data volumes. The reason is that the traditional full-text retrieval divides original data into words by word segmentation, links keywords with all documents containing the keywords by an inverted index mode, and often only finds and returns the documents containing the search keywords quickly when a user searches, so that the documents are only mechanically matched from the character form, and much information which represents the same concept but expresses different characters is omitted, namely the keywords cannot be understood from the semantic sense. For example, in a city of spring in four seasons, a user wants to obtain the cities such as Kunming, Xiamen, and Ming Dynasty, but the conventional full-text search only matches articles with keywords such as "four seasons", "city", and the like according to the keywords, and the real requirements of the user are difficult to meet.
In addition, most of search fields of full-text retrieval are short texts, the classification mode of short text information is different from the classification process of the traditional long text due to the uniqueness of the short text information, and scholars perform a series of researches on the problems of data sparsity, overhigh latitude, insufficient semantic information and the like of the short text; the prior art applies a Deep Neural Network (DNN) method to a classification study of short texts, which has a certain effect, but still faces some challenges, such as: most short text classification models only consider the literal meaning, have poor recognition effect on ubiquitous polysemous words, and cannot solve the defect of sparsity of short texts.
Disclosure of Invention
In order to solve the problems, the invention provides an intelligent full-text retrieval method and system based on semantic understanding.
In some embodiments, the following technical scheme is adopted:
an intelligent full-text retrieval method based on semantic understanding comprises the following steps:
cutting the received search sentence into short texts, and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
and based on the semantic information vector and the dependency relationship vector of the short text, performing similarity calculation on the short text information and related information in the intelligent index library to further obtain a search result set.
As a further scheme, obtaining a word segmentation library corresponding to the short text specifically includes:
matching periods or question marks through regular expressions, and cutting the long text into short texts; and performing word segmentation operation on the short text by combining the stop word bank to form a word segmentation bank corresponding to the short text.
As a further scheme, the name of the short text includes the belonging long text flag.
As a further scheme, constructing a semantic information vector and a word dependency relationship vector of a short text specifically includes:
extracting a central word and a word sense co-occurrence word of the short text; the attributes of the core word, the word sense co-occurrence word and the word sense co-occurrence word together form a semantic information vector of the short text;
and obtaining a syntactic dependency relationship tree through syntactic dependency analysis based on the central word and the semantic co-occurrence word in the semantic information vector library to form a word dependency relationship vector of the short text.
As a further scheme, the intelligent index library comprises: the short text, the word segmentation library corresponding to the short text, the semantic information vector and the dependency relationship vector of the short text.
As a further scheme, the similarity calculation of the searched short text information and the related information in the intelligent index library specifically includes:
calculating the similarity between the searched short text and each short text central word in the intelligent index library;
calculating the number of the searched short texts and the number of the same semantic dependency relations in each short text dependency relation tree in the intelligent index library;
calculating the similarity of the core words corresponding to the same semantic dependency relationship in the short text and each short text dependency relationship tree in the intelligent index library;
and calculating the similarity of the words extracted from the words with more word sense co-occurrence words in the short texts and each short text in the intelligent index library.
And as a further scheme, adding the similarity scores obtained by calculation, sorting the similarity scores from large to small according to the total score after addition, and returning to the search result set.
In other embodiments, the following technical solutions are adopted:
an intelligent full-text retrieval system based on semantic understanding, comprising:
the data preprocessing module is used for cutting the received search sentences into short texts and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
the short text vector construction module is used for constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
and the data indexing module is used for carrying out similarity calculation on the short text information and the related information in the intelligent index library based on the semantic information vector and the dependency relationship vector of the short text so as to obtain a search result set.
In other embodiments, the following technical solutions are adopted:
a terminal device comprising a processor and a memory, the processor being arranged to implement instructions; the memory is used for storing a plurality of instructions which are suitable for being loaded by the processor and executing the intelligent full-text retrieval method based on semantic understanding.
In other embodiments, the following technical solutions are adopted:
a computer-readable storage medium having stored therein a plurality of instructions adapted to be loaded by a processor of a terminal device and to execute the above intelligent full-text retrieval method based on semantic understanding.
Compared with the prior art, the invention has the beneficial effects that:
(1) the word sense co-occurrence words refer to words with similar word senses in the short text, and the extraction of the word sense co-occurrence words can extract the semantics of the short text more quickly and accurately.
The method can analyze the syntactic dependency relationship, extract the core words through the syntactic dependency tree, calculate the similarity of the core words to help judge the similarity of the short texts, and assist in calculating the similarity of the short texts by means of the same number of the dependency relationship, thereby improving the accuracy of full-text retrieval.
Additional features and advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention.
Drawings
FIG. 1 is a flow chart of an intelligent full-text retrieval method based on semantic understanding in an embodiment of the present invention;
FIG. 2 is a diagram illustrating a structure of dependency syntax according to an embodiment of the present invention.
Detailed Description
It should be noted that the following detailed description is exemplary and is intended to provide further explanation of the disclosure. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this application belongs.
It is noted that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of example embodiments according to the present application. As used herein, the singular forms "a", "an" and "the" are intended to include the plural forms as well, and it should be understood that when the terms "comprises" and/or "comprising" are used in this specification, they specify the presence of stated features, steps, operations, devices, components, and/or combinations thereof, unless the context clearly indicates otherwise.
Example one
In one or more embodiments, an intelligent full-text retrieval method based on semantic understanding is disclosed, and with reference to fig. 1, the method comprises the following processes:
(1) cutting the received search sentence into short texts, and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
(2) constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
(3) and based on the semantic information vector and the dependency relationship vector of the short text, performing similarity calculation on the short text information and related information in the intelligent index library to further obtain a search result set.
Specifically, the process of constructing the intelligent index library in this embodiment specifically includes:
the original data is stored in the Mongo database, most of the original data is long text, and the long text is too long in sentence, the semantic is complex, and the semantic features are difficult to extract, so that the long text is processed into the short text.
The method specifically comprises the following steps:
firstly, matching periods through regular expressions, and cutting a long text into short texts; then artificially enriching a disabled word bank; and finally, performing word segmentation operation on the short text by applying an ltp _ data tool of the Hadamard-language cloud and combining with the stop word bank to form a word segmentation bank corresponding to the short text. The name of the short text comprises the corresponding long text mark, so that the provenance of the short text is marked.
Stop Words refer to that in information retrieval, in order to save storage space and improve search efficiency, some characters or Words are automatically filtered before or after processing natural language data (or text), and the characters or Words are called Stop Words. The stop words are manually input and are not automatically generated, and the generated stop words form a stop word list, namely a stop word library.
Such as: one piece of raw data in the Mongo data is as follows:
Figure BDA0002978882400000061
the above raw data is processed into short text as follows:
Figure BDA0002978882400000071
the first half of the ID is the ID of the long text, and the long text corresponding to the short text can be identified.
After the short text and the word segmentation library thereof are obtained, because only the word segmentation library can not realize semantic understanding, semantic processing needs to be carried out on the short text, including the construction of a semantic information vector and the construction of a dependency relationship vector.
The construction of the semantic information vector comprises the following steps: extracting the central word of the short text through a natural language processing interface; extracting word sense co-occurrence words of the short text through a natural language processing interface; the core word, the word sense co-occurrence word and the co-occurrence word attribute jointly form a semantic information vector of the short text. Such as:
Figure BDA0002978882400000072
and constructing a dependency relationship vector based on the central word and the semantic co-occurrence word in the semantic information vector library, calling a syntactic dependency analysis interface, and forming a word dependency relationship vector of the short text by the obtained syntactic dependency relationship tree.
Referring to FIG. 2, the dependency syntax parses a sentence into a dependency syntax tree describing the dependency relationships between words. The grammar structure with predicates as the center takes verbs as the center words of sentences, other components in the sentences are all governed by the center verbs, and all governed components depend on the governors with certain dependence relationship. Such as:
Figure BDA0002978882400000081
referring to fig. 1, the contents in the intelligent index library include: the short text, the word segmentation library corresponding to the short text, the semantic information vector and the dependency relationship vector of the short text.
After the intelligent index library is constructed, for a text to be searched, cutting a received search sentence into short texts, and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts; then respectively constructing semantic information vectors and dependency relationship vectors of the short texts; finally, a short text with complete semantics of the search text is obtained as follows:
Figure BDA0002978882400000091
and then calling an artificial intelligent text similarity calculation method model to carry out similarity calculation on the obtained search text with complete semantics and the intelligent index library.
The similarity calculation includes four aspects of calculation:
calculating the similarity between the searched short text and each short text central word in the intelligent index library;
calculating the number of the searched short texts and the number of the same semantic dependency relations in each short text dependency relation tree in the intelligent index library;
calculating the similarity of the core words corresponding to the same semantic dependency relationship in the short text and each short text dependency relationship tree in the intelligent index library;
and calculating the similarity of the words extracted from the words with more word sense co-occurrence words in the short texts and each short text in the intelligent index library.
The similarity calculation in the above four aspects all obtains a similarity score, the four scores are added, and the four scores are sorted from high to low according to the order of the scores, and a short text result set is returned, for example, the returned results are as follows:
Figure BDA0002978882400000101
and the value in the frame is the unique identifier of the corresponding long text, and the original data is taken out according to the unique identifier and returned to the user.
Example two
In one or more embodiments, disclosed is a semantic understanding-based intelligent full-text retrieval system, comprising:
the data preprocessing module is used for cutting the received search sentences into short texts and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
the short text vector construction module is used for constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
and the data indexing module is used for carrying out similarity calculation on the short text information and the related information in the intelligent index library based on the semantic information vector and the dependency relationship vector of the short text so as to obtain a search result set.
The specific implementation of each module is described in detail in the first embodiment, and is not described herein again.
EXAMPLE III
In one or more embodiments, a terminal device is disclosed, which includes a server including a memory, a processor, and a computer program stored in the memory and executable on the processor, and the processor executes the computer program to implement the intelligent full-text retrieval method based on semantic understanding in the first embodiment. For brevity, no further description is provided herein.
It should be understood that in this embodiment, the processor may be a central processing unit CPU, and the processor may also be other general purpose processors, digital signal processors DSP, application specific integrated circuits ASIC, off-the-shelf programmable gate arrays FPGA or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, and so on. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like.
The memory may include both read-only memory and random access memory, and may provide instructions and data to the processor, and a portion of the memory may also include non-volatile random access memory. For example, the memory may also store device type information.
In implementation, the steps of the above method may be performed by integrated logic circuits of hardware in a processor or instructions in the form of software.
The intelligent full-text retrieval method based on semantic understanding in the first embodiment can be directly implemented by a hardware processor, or implemented by a combination of hardware and software modules in the processor. The software modules may be located in ram, flash, rom, prom, or eprom, registers, among other storage media as is well known in the art. The storage medium is located in a memory, and a processor reads information in the memory and completes the steps of the method in combination with hardware of the processor. To avoid repetition, it is not described in detail here.
Those of ordinary skill in the art will appreciate that the various illustrative elements, i.e., algorithm steps, described in connection with the embodiments disclosed herein may be implemented as electronic hardware or combinations of computer software and electronic hardware. Whether such functionality is implemented as hardware or software depends upon the particular application and design constraints imposed on the implementation. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present application.
Example four
In one or more embodiments, a computer-readable storage medium is disclosed, in which a plurality of instructions are stored, the instructions being adapted to be loaded by a processor of a terminal device and implementing the intelligent full-text retrieval method based on semantic understanding described in the first embodiment.
Although the embodiments of the present invention have been described with reference to the accompanying drawings, it is not intended to limit the scope of the present invention, and it should be understood by those skilled in the art that various modifications and variations can be made without inventive efforts by those skilled in the art based on the technical solution of the present invention.

Claims (10)

1. An intelligent full-text retrieval method based on semantic understanding is characterized by comprising the following steps:
cutting the received search sentence into short texts, and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
and based on the semantic information vector and the dependency relationship vector of the short text, performing similarity calculation on the short text information and related information in the intelligent index library to further obtain a search result set.
2. The intelligent full-text retrieval method based on semantic understanding of claim 1, wherein obtaining the segmentation library corresponding to the short text specifically comprises:
matching periods or question marks through regular expressions, and cutting the long text into short texts; and performing word segmentation operation on the short text by combining the stop word bank to form a word segmentation bank corresponding to the short text.
3. The intelligent full-text retrieval method based on semantic understanding of claim 2, wherein the naming of the short text comprises the associated long text flag.
4. The intelligent full-text retrieval method based on semantic understanding as claimed in claim 1, wherein constructing semantic information vectors and word dependency relationship vectors of short texts specifically comprises:
extracting a central word and a word sense co-occurrence word of the short text; the attributes of the core word, the word sense co-occurrence word and the word sense co-occurrence word together form a semantic information vector of the short text;
and obtaining a syntactic dependency relationship tree through syntactic dependency analysis based on the central word and the semantic co-occurrence word in the semantic information vector library to form a word dependency relationship vector of the short text.
5. The intelligent full-text retrieval method based on semantic understanding according to claim 1, wherein the intelligent index library comprises: the short text, the word segmentation library corresponding to the short text, the semantic information vector and the dependency relationship vector of the short text.
6. The intelligent full-text retrieval method based on semantic understanding as claimed in claim 1, wherein the similarity calculation of the searched short text information and the related information in the intelligent index library specifically comprises:
calculating the similarity between the searched short text and each short text central word in the intelligent index library;
calculating the number of the searched short texts and the number of the same semantic dependency relations in each short text dependency relation tree in the intelligent index library;
calculating the similarity of the core words corresponding to the same semantic dependency relationship in the short text and each short text dependency relationship tree in the intelligent index library;
and calculating the similarity of the words extracted from the words with more word sense co-occurrence words in the short texts and each short text in the intelligent index library.
7. The intelligent full-text retrieval method based on semantic understanding of claim 6, wherein the similarity scores obtained by calculation are added, and are sorted from large to small according to the total score after addition, and a search result set is returned.
8. An intelligent full-text retrieval system based on semantic understanding, comprising:
the data preprocessing module is used for cutting the received search sentences into short texts and performing word segmentation operation on the short texts to obtain word segmentation libraries corresponding to the short texts;
the short text vector construction module is used for constructing a semantic information vector and a dependency relationship vector of the short text; the semantic information vector comprises a central word and a word sense co-occurrence word of the short text;
and the data indexing module is used for carrying out similarity calculation on the short text information and the related information in the intelligent index library based on the semantic information vector and the dependency relationship vector of the short text so as to obtain a search result set.
9. A terminal device comprising a processor and a memory, the processor being arranged to implement instructions; the memory is used for storing a plurality of instructions, wherein the instructions are suitable for being loaded by the processor and executing the intelligent full text retrieval method based on semantic understanding according to any one of claims 1-7.
10. A computer-readable storage medium having stored therein a plurality of instructions, wherein the instructions are adapted to be loaded by a processor of a terminal device and to perform the intelligent full-text retrieval method based on semantic understanding according to any one of claims 1 to 7.
CN202110281426.9A 2021-03-16 2021-03-16 Intelligent full-text retrieval method and system based on semantic understanding Active CN112883165B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110281426.9A CN112883165B (en) 2021-03-16 2021-03-16 Intelligent full-text retrieval method and system based on semantic understanding

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110281426.9A CN112883165B (en) 2021-03-16 2021-03-16 Intelligent full-text retrieval method and system based on semantic understanding

Publications (2)

Publication Number Publication Date
CN112883165A true CN112883165A (en) 2021-06-01
CN112883165B CN112883165B (en) 2022-12-02

Family

ID=76040924

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110281426.9A Active CN112883165B (en) 2021-03-16 2021-03-16 Intelligent full-text retrieval method and system based on semantic understanding

Country Status (1)

Country Link
CN (1) CN112883165B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113449063A (en) * 2021-06-25 2021-09-28 树根互联股份有限公司 Method and device for constructing document structure information retrieval library
CN113946677A (en) * 2021-09-14 2022-01-18 中北大学 Event identification and classification method based on bidirectional cyclic neural network and attention mechanism
CN114201962A (en) * 2021-12-03 2022-03-18 中国中医科学院中医药信息研究所 Thesis novelty analysis method, device, medium and equipment
CN114925692A (en) * 2022-07-21 2022-08-19 中科雨辰科技有限公司 Data processing system for acquiring target event

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101582073A (en) * 2008-12-31 2009-11-18 北京中机科海科技发展有限公司 Intelligent retrieval system and method based on domain ontology
CN102246164A (en) * 2008-12-11 2011-11-16 有限公司呢哦派豆 Information search method and information provision method based on user's intention
CN105975458A (en) * 2016-05-03 2016-09-28 安阳师范学院 Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity
CN106484664A (en) * 2016-10-21 2017-03-08 竹间智能科技(上海)有限公司 Similarity calculating method between a kind of short text
CN109543190A (en) * 2018-11-29 2019-03-29 北京羽扇智信息科技有限公司 A kind of intension recognizing method, device, equipment and storage medium
CN109815312A (en) * 2018-12-27 2019-05-28 达闼科技(北京)有限公司 A kind of method, apparatus of document query calculates equipment and computer storage medium
CN111814456A (en) * 2020-05-25 2020-10-23 国网上海市电力公司 Verb-based Chinese text similarity calculation method

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102246164A (en) * 2008-12-11 2011-11-16 有限公司呢哦派豆 Information search method and information provision method based on user's intention
CN101582073A (en) * 2008-12-31 2009-11-18 北京中机科海科技发展有限公司 Intelligent retrieval system and method based on domain ontology
CN105975458A (en) * 2016-05-03 2016-09-28 安阳师范学院 Fine-granularity dependence relationship-based method for calculating Chinese long sentence similarity
CN106484664A (en) * 2016-10-21 2017-03-08 竹间智能科技(上海)有限公司 Similarity calculating method between a kind of short text
CN109543190A (en) * 2018-11-29 2019-03-29 北京羽扇智信息科技有限公司 A kind of intension recognizing method, device, equipment and storage medium
CN109815312A (en) * 2018-12-27 2019-05-28 达闼科技(北京)有限公司 A kind of method, apparatus of document query calculates equipment and computer storage medium
CN111814456A (en) * 2020-05-25 2020-10-23 国网上海市电力公司 Verb-based Chinese text similarity calculation method

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘佳雯: "语句相似度匹配在自动问答系统中的应用与实现", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
郭炳元: "基于语义树的短文本相似度算法研究与应用", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113449063A (en) * 2021-06-25 2021-09-28 树根互联股份有限公司 Method and device for constructing document structure information retrieval library
CN113946677A (en) * 2021-09-14 2022-01-18 中北大学 Event identification and classification method based on bidirectional cyclic neural network and attention mechanism
CN114201962A (en) * 2021-12-03 2022-03-18 中国中医科学院中医药信息研究所 Thesis novelty analysis method, device, medium and equipment
CN114201962B (en) * 2021-12-03 2023-07-25 中国中医科学院中医药信息研究所 Method, device, medium and equipment for analyzing paper novelty
CN114925692A (en) * 2022-07-21 2022-08-19 中科雨辰科技有限公司 Data processing system for acquiring target event
CN114925692B (en) * 2022-07-21 2022-10-11 中科雨辰科技有限公司 Data processing system for acquiring target event

Also Published As

Publication number Publication date
CN112883165B (en) 2022-12-02

Similar Documents

Publication Publication Date Title
CN110399457B (en) Intelligent question answering method and system
CN109284363B (en) Question answering method and device, electronic equipment and storage medium
CN112883165B (en) Intelligent full-text retrieval method and system based on semantic understanding
CN107066553B (en) Short text classification method based on convolutional neural network and random forest
CN107436864B (en) Chinese question-answer semantic similarity calculation method based on Word2Vec
US10025819B2 (en) Generating a query statement based on unstructured input
US9176949B2 (en) Systems and methods for sentence comparison and sentence-based search
US20180032930A1 (en) System and method to Generate Queries for a Business Database
JP5936698B2 (en) Word semantic relation extraction device
CN110750640B (en) Text data classification method and device based on neural network model and storage medium
CN112035730B (en) Semantic retrieval method and device and electronic equipment
EP3799640A1 (en) Semantic parsing of natural language query
CN111291177A (en) Information processing method and device and computer storage medium
US20210350125A1 (en) System for searching natural language documents
JP2011118689A (en) Retrieval method and system
CN109522396B (en) Knowledge processing method and system for national defense science and technology field
CN115795061B (en) Knowledge graph construction method and system based on word vector and dependency syntax
CN110728135B (en) Text theme indexing method and device, electronic equipment and computer storage medium
CN114997288A (en) Design resource association method
CN111159381A (en) Data searching method and device
CN112528653B (en) Short text entity recognition method and system
CN110705285B (en) Government affair text subject word library construction method, device, server and readable storage medium
CN116049376B (en) Method, device and system for retrieving and replying information and creating knowledge
CN112183110A (en) Artificial intelligence data application system and application method based on data center
CN112507097B (en) Method for improving generalization capability of question-answering system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CP03 Change of name, title or address

Address after: Floor 12, Building 3, Shuntai Plaza, No. 2000 Shunhua Road, High tech Industrial Development Zone, Jinan City, Shandong Province, 250101

Patentee after: SHANDONG ECLOUD INFORMATION TECHNOLOGY CO.,LTD.

Country or region after: China

Address before: 250014 3rd floor, block B, Yinhe building, 2008 Xinluo street, high tech Zone, Jinan City, Shandong Province

Patentee before: SHANDONG ECLOUD INFORMATION TECHNOLOGY CO.,LTD.

Country or region before: China