CN107256260A - A kind of intelligent semantic recognition methods, searching method, apparatus and system - Google Patents

A kind of intelligent semantic recognition methods, searching method, apparatus and system Download PDF

Info

Publication number
CN107256260A
CN107256260A CN201710440790.9A CN201710440790A CN107256260A CN 107256260 A CN107256260 A CN 107256260A CN 201710440790 A CN201710440790 A CN 201710440790A CN 107256260 A CN107256260 A CN 107256260A
Authority
CN
China
Prior art keywords
keyword
target
regular expression
configuration file
node
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710440790.9A
Other languages
Chinese (zh)
Inventor
刘鹏
付安龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Inspur Software Co Ltd
Original Assignee
Inspur Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Inspur Software Co Ltd filed Critical Inspur Software Co Ltd
Priority to CN201710440790.9A priority Critical patent/CN107256260A/en
Publication of CN107256260A publication Critical patent/CN107256260A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/332Query formulation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/3332Query translation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides a kind of intelligent semantic recognition methods, searching method, apparatus and system, the intelligent semantic recognition methods includes:Configuration file is built, the configuration file includes at least one keyword and each described keyword distinguishes corresponding regular expression;Obtain at least one keyword of user's input;At least one target regular expression corresponding with least one keyword that the user inputs is determined from the configuration file;According at least one described target regular expression, at least one keyword that the user inputs is converted to the target keyword of at least one setting form;The target keyword of at least one setting form is sent to the search engine of outside.This programme can improve the accuracy of search result.

Description

A kind of intelligent semantic recognition methods, searching method, apparatus and system
Technical field
The present invention relates to field of computer technology, more particularly to a kind of intelligent semantic recognition methods, searching method, device and System.
Background technology
With the arrival in big data epoch, data volume is sharply increased.How fast and accurately to be obtained from the data of magnanimity Useful data, the emphasis paid close attention to as user.Search engine is as the system that can provide the user search service, as solution The first choice of problems.
Full-text search engine is the most frequently used search engine, and its operation principle is generally:According to predefined word segmentation regulation, Participle is carried out to the character in each article, corresponding index is then set up to each word after participle, and indicate that the word exists The number and position occurred in article., can be according to the index search pair pre-established when receiving the keyword of user's input The article answered, and the article found is fed back into user.
Due to the general input for being accustomed to carrying out keyword according to routine use of user, this cause the keyword that user inputs with The word segmentation regulation of search engine is not consistent, so that the keyword for leading to not input using user accurately searches corresponding text Chapter, causes search result accuracy relatively low.
The content of the invention
The embodiments of the invention provide a kind of intelligent semantic recognition methods, searching method, apparatus and system, search can be improved As a result accuracy.
In a first aspect, the embodiments of the invention provide a kind of intelligent semantic recognition methods, including:
Configuration file is built, the configuration file includes at least one keyword and each described keyword difference Corresponding regular expression;
Also include:
Obtain at least one keyword of user's input;
At least one target corresponding with least one keyword that the user inputs is determined from the configuration file Regular expression;
According at least one described target regular expression, by least one keyword that the user inputs be converted to The target keyword of few setting form;
The target keyword of at least one setting form is sent to the search engine of outside.
Preferably,
The structure configuration file, the configuration file includes at least one keyword and each described keyword The corresponding regular expression of difference, including:
Extensible markup language xml document is built, the xml document includes at least one keyword and each institute State keyword and distinguish corresponding regular expression.
Preferably,
The structure extensible markup language xml document, the xml document includes at least one keyword and each The individual keyword distinguishes corresponding regular expression, including:
Build xml original documents;
At least one node is built in the xml original documents, is stored under each described node described at least one Regular expression, forms the xml document;Wherein, each the described regular expression stored under same node is with working as prosthomere The type of corresponding keyword is identical under point.
Preferably,
It is described from the configuration file determine with the user input at least one keyword it is corresponding at least one Target regular expression, including:
Each the described keyword inputted for the user, is performed both by:
According to the form of the keyword, the corresponding type of the keyword is determined;
Node corresponding with the type of the keyword is determined from the xml document;
From at least one regular expression stored under the node determined, it is determined that corresponding with the keyword Target regular expression.
Second aspect, the embodiments of the invention provide a kind of searching method, applied to search engine, including:
Receive the target keyword of at least one setting form;
According to the target keyword of at least one setting form, scan for.
Preferably,
Further comprise:The index built in advance between the keyword and at least one document of at least one setting form is closed System;
The target keyword of at least one setting form, is scanned for described in the basis, including:
According to the index relative, it is determined that at least one target text corresponding with least one described target keyword Shelves.
The third aspect, the embodiments of the invention provide a kind of intelligent semantic identifying device based on configuration file, including:Structure Build unit, acquiring unit, processing unit and transmitting element;Wherein,
The construction unit, for building configuration file, the configuration file includes at least one keyword and every One keyword distinguishes corresponding regular expression;
The acquiring unit, at least one keyword for obtaining user's input;
The processing unit, for determining at least one keyword pair inputted with the user from the configuration file At least one the target regular expression answered;According at least one described target regular expression, by the user input to A few keyword is converted to the target keyword of at least one setting form;
The transmitting element, the search for the target keyword of at least one setting form to be sent to outside is drawn Hold up.
Preferably,
The construction unit, for building xml original documents, and builds in the xml original documents at least one section At least one described regular expression is stored under point, each described node, the xml document is formed;Wherein, same node The type of each described regular expression keyword corresponding with present node of lower storage is identical.
Fourth aspect, the embodiments of the invention provide a kind of search engine, including:Receiving unit and search unit;Wherein,
The receiving unit, the target keyword for receiving at least one setting form;
The search unit, for the target keyword according at least one setting form, is scanned for.
5th aspect, the embodiments of the invention provide a kind of search system, including:Any of the above-described embodiment of the present invention is provided Intelligent semantic identifying device, and the search engine that any of the above-described embodiment of the invention is provided;Wherein,
The intelligent semantic identifying device, at least one keyword for user to be inputted is converted at least one setting The target keyword of form, and the target keyword is sent to the search engine;
The search engine, for receiving the target keyword that the intelligent semantic identifying device is sent, and according to described Target keyword is scanned for.
The embodiments of the invention provide a kind of intelligent semantic recognition methods, searching method, apparatus and system, pass through advance structure Build the configuration file that corresponding regular expression is distinguished including at least one keyword and each keyword.When getting use During the keyword of family input, the target regular expression corresponding with this keyword is determined from configuration file, and according to determination The target regular expression gone out, the keyword got is converted to the target keyword of setting form, then by after conversion Target keyword is sent to search engine, so that search engine is scanned for according to the target keyword received.Due to passing through The keyword that the regular expression built in advance is inputted to user is changed, so that keyword and the search of user's input The word segmentation regulation of engine is consistent, and which thereby enhances the accuracy of search result.
Brief description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, below will be to embodiment or existing There is the accompanying drawing used required in technology description to be briefly described, it should be apparent that, drawings in the following description are the present invention Some embodiments, for those of ordinary skill in the art, on the premise of not paying creative work, can also basis These accompanying drawings obtain other accompanying drawings.
Fig. 1 is a kind of flow chart for intelligent semantic recognition methods that one embodiment of the invention is provided;
Fig. 2 is a kind of flow chart for searching method that one embodiment of the invention is provided;
Fig. 3 is a kind of structural representation for intelligent semantic identifying device that one embodiment of the invention is provided;
Fig. 4 is a kind of structural representation for searcher that one embodiment of the invention is provided;
Fig. 5 is a kind of structural representation for search system that one embodiment of the invention is provided;
Fig. 6 is a kind of flow chart of the application method for search system that one embodiment of the invention is provided.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is A part of embodiment of the present invention, rather than whole embodiments, based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment obtained on the premise of creative work is not made, belongs to the scope of protection of the invention.
As shown in figure 1, the embodiments of the invention provide a kind of intelligent semantic recognition methods, this method can include following step Suddenly:
Step 101:Configuration file is built, the configuration file includes at least one keyword and each described pass Keyword distinguishes corresponding regular expression;
Step 102:Obtain at least one keyword of user's input;
Step 103:Determined from the configuration file corresponding at least with the user at least one keyword inputted One target regular expression;
Step 104:According at least one described target regular expression, at least one keyword that the user is inputted Be converted to the target keyword of at least one setting form;
Step 105:The target keyword of at least one setting form is sent to the search engine of outside.
In above-described embodiment, distinguish corresponding including at least one keyword and each keyword by building in advance The configuration file of regular expression.When getting the keyword of user's input, determined and this keyword phase from configuration file Corresponding target regular expression, and according to the target regular expression determined, the keyword got is converted into setting The target keyword of form, is then sent to search engine by the target keyword after conversion, so that search engine is according to reception To target keyword scan for.Because the keyword that the regular expression by building in advance is inputted to user is turned Change, so that the keyword of user's input is consistent with the word segmentation regulation of search engine, which thereby enhance the accurate of search result Property.
In one embodiment of the invention, the embodiment of step 101 can include:
Extensible markup language xml document is built, the xml document includes at least one keyword and each institute State keyword and distinguish corresponding regular expression.
Herein, the configuration file of structure can for extensible markup language (Extensible Markup Language, Xml) document, then the keyword in configuration file and regular expression corresponding with keyword are xml forms.For example, the age The corresponding regular expression of keyword of form is:/ ^ [0-9] { 2 } [-/ Sui]/ ,/^ [0-9] { 2 } [-/ Sui] { 1 }-[0- 9] { 2 } [-/ Sui] { 1 }/.This cause each keyword and corresponding regular expression can directly by computer understanding, from And the conversion efficiency of the keyword inputted to user can be improved, and then improve search efficiency.
Specifically, it is described to build in extensible markup language xml document, the xml document in one embodiment of the invention Distinguish corresponding regular expression including at least one keyword and each described keyword, including:
Build xml original documents;
At least one node is built in the xml original documents, is stored under each described node described at least one Regular expression, forms the xml document;Wherein, each the described regular expression stored under same node is with working as prosthomere The type of corresponding keyword is identical under point.
Herein, xml original documents are built first, and the type respectively with various keywords is built in xml original documents The corresponding regular expression of same type of keyword, is then stored under same node by corresponding node.For example, keyword Type includes age class, date class and license plate number class, then three nodes are set up in xml original documents, is closed respectively with each Keyword type correspondence, then be stored in itself correspondence by the corresponding regular expression of the keyword of age class, date class and license plate number class Node under, formed xml document, consequently facilitating being managed to configuration file.
In one embodiment of the invention, the embodiment of step 103 can include:
Each the described keyword inputted for the user, is performed both by:
According to the form of the keyword, the corresponding type of the keyword is determined;
Node corresponding with the type of the keyword is determined from the xml document;
From at least one regular expression stored under the node determined, it is determined that corresponding with the keyword Target regular expression.
For example, when the keyword of user's input includes age A, date B and license plate number C, included due to the age Character is digital, and numeral and Chinese character or numeral and punctuate can be included in the date, and license plate number includes digital and letter, then can basis The different-format of each keyword, determines the corresponding type of keyword.Herein, age A, date B and license plate number C are right respectively The type answered is age class, date class and license plate number class.The section corresponding with each type can be then determined from xml document Point, then determine from the node determined target regular expression corresponding with keyword.Pass through this side determined step by step Formula, can improve the efficiency for determining target regular expression, and then improve search efficiency.
As shown in Fig. 2 the embodiments of the invention provide a kind of searching method, applied to search engine, this method can be wrapped Include following steps:
Step 201:Receive the target keyword of at least one setting form;
Step 202:According to the target keyword of at least one setting form, scan for.
In above-described embodiment, scanned for according to the target keyword of the setting form received, due to setting form Target keyword is consistent with the word segmentation regulation of search engine, so as to improve the accuracy of search result.
In one embodiment of the invention, this method may further include:The pass of at least one setting form is built in advance Index relative between keyword and at least one document;
The embodiment of step 202, can include:
According to the index relative, it is determined that at least one target text corresponding with least one described target keyword Shelves.
In full-text search engine, the rope of the keyword and at least one article of at least one setting form can be built in advance Draw relation, then after the keyword of setting form is received, can be determined and keyword according to the index relative built in advance Corresponding target article.Due to constructing the index relative between keyword and document in advance, then after keyword is received, Corresponding destination document can directly be determined according to index relative, then caused while the accuracy of search result is improved, also Improve search efficiency.
As shown in figure 3, the embodiments of the invention provide a kind of intelligent semantic identifying device based on configuration file, including: Construction unit 301, acquiring unit 302, processing unit 303 and transmitting element 304;Wherein,
The construction unit 301, for building configuration file, the configuration file include at least one keyword and Each described keyword distinguishes corresponding regular expression;
The acquiring unit 302, at least one keyword for obtaining user's input;
The processing unit 303, for determining to obtain single with described in the configuration file that builds from the construction unit 301 At least one corresponding target regular expression of at least one keyword that member 302 is got;According at least one described target Regular expression, at least one keyword that the user inputs is converted to the target keyword of at least one setting form;
The transmitting element 304, for the processing unit 303 to be changed after at least one setting form target close Keyword is sent to the search engine of outside.
In above-described embodiment, distinguish corresponding including at least one keyword and each keyword by building in advance The configuration file of regular expression.When getting the keyword of user's input, determined and this keyword phase from configuration file Corresponding target regular expression, and according to the target regular expression determined, the keyword got is converted into setting The target keyword of form, is then sent to search engine by the target keyword after conversion, so that search engine is according to reception To target keyword scan for.Because the keyword that the regular expression by building in advance is inputted to user is turned Change, so that the keyword of user's input is consistent with the word segmentation regulation of search engine, which thereby enhance the accurate of search result Property.
In one embodiment of the invention, the construction unit 301, for building xml original documents, and at the beginning of the xml At least one node is built in beginning document, at least one described regular expression is stored under each described node, forms described Xml document;Wherein, the class of each the described regular expression keyword corresponding with present node stored under same node Type is identical.
Herein, the configuration file of structure can for extensible markup language (Extensible Markup Language, Xml) document, then the keyword in configuration file and regular expression corresponding with keyword are xml forms.For example, the age The corresponding regular expression of keyword of form is:/ ^ [0-9] { 2 } [-/ Sui]/ ,/^ [0-9] { 2 } [-/ Sui] { 1 }-[0- 9] { 2 } [-/ Sui] { 1 }/.This cause each keyword and corresponding regular expression can directly by computer understanding, from And the conversion efficiency of the keyword inputted to user can be improved, and then improve search efficiency.
Build xml document when, first build xml original documents, built in xml original documents respectively with various keys The corresponding regular expression of same type of keyword, is then stored under same node by the corresponding node of the type of word.Example Such as, keyword type include age class, date class and license plate number class, then three nodes are set up in xml original documents, respectively with The corresponding regular expression of the keyword of age class, date class and license plate number class, then be stored in by each keyword type correspondence Under itself corresponding node, xml document is formed, consequently facilitating being managed to configuration file.
The contents such as the information exchange between each unit, implementation procedure in said apparatus, due to implementing with the inventive method Example is based on same design, and particular content can be found in the narration in the inventive method embodiment, and here is omitted.
As shown in figure 4, the embodiments of the invention provide a kind of search engine, including:Receiving unit 401 and search unit 402;Wherein,
The receiving unit 401, the target keyword for receiving at least one setting form;
The search unit 402, for the target keyword according at least one setting form, is scanned for.
In above-described embodiment, scanned for according to the target keyword of the setting form received, due to setting form Target keyword is consistent with the word segmentation regulation of search engine, so as to improve the accuracy of search result.
The contents such as the information exchange between each unit, implementation procedure in said apparatus, due to implementing with the inventive method Example is based on same design, and particular content can be found in the narration in the inventive method embodiment, and here is omitted.
As shown in figure 5, the embodiments of the invention provide a kind of search system, including:Any of the above-described embodiment of the present invention is carried The intelligent semantic identifying device 501 of confession, and the search engine 502 that any of the above-described embodiment of the invention is provided;Wherein,
The intelligent semantic identifying device 501, at least one keyword for user to be inputted is converted at least one The target keyword of form is set, and the target keyword is sent to the search engine;
The search engine 502, for receiving the target keyword that the intelligent semantic identifying device is sent, and according to institute Target keyword is stated to scan for.
In above-described embodiment, the target keyword that will convert into setting form is sent to search engine, so that search engine Target keyword according to receiving is scanned for.Because the keyword for setting form is consistent with the word segmentation regulation of search engine, Which thereby enhance the accuracy of search result.
As shown in fig. 6, the embodiments of the invention provide a kind of application method of search system, this method can include following Step:
Step 601:Intelligent semantic identifying device builds xml original documents.
Step 602:At least one node is built in the xml original documents, is stored at least under each described node One regular expression, forms the xml document;Wherein, each the described regular expression stored under same node The type of keyword corresponding with present node is identical.
For example, keyword type includes age class, date class and license plate number class, then three are set up in xml original documents Node, it is corresponding with each keyword type respectively, then by the corresponding canonical of the keyword of age class, date class and license plate number class Expression formula is stored under itself corresponding node, forms xml document.
Step 603:Obtain at least one keyword of user's input.
For example, the keyword of user's input includes age A, date B and license plate number C.
Step 604:According to the form of the keyword, the corresponding type of the keyword is determined, and from the xml document It is middle to determine node corresponding with the type of the keyword.
Step 605:From at least one regular expression stored under the node determined, it is determined that with the key The corresponding target regular expression of word.
For example, the character that the age includes is digital, and numeral and Chinese character or numeral and punctuate, car can be included in the date The trade mark includes numeral and letter, then can determine the corresponding type of keyword according to the different-format of each keyword.At this In, age A, date B and license plate number C distinguish corresponding type for age class, date class and license plate number class.Then can be from xml document In determine the node corresponding with each type, then determine from the node determined target canonical corresponding with keyword Expression formula.
Step 606:According at least one described target regular expression, at least one keyword that the user is inputted Be converted to the target keyword of at least one setting form.
Step 607:The target keyword of at least one setting form is sent to the search engine of outside.
According to regular expression, the keyword that user inputs is changed, the participle rule of search engine are complied with Then.For example, the age A inputted for user, is translated into date of birth A ', then search engine can be carried out according to the date of birth Retrieval.
Step 608:Search engine is built between the keyword and at least one document of at least one setting form in advance Index relative.
For example, building the index relative between different dates of birth and corresponding document in advance.
Step 609:According to the index relative, it is determined that at least one corresponding with least one described target keyword Destination document.
Herein, according to the date of birth A ' obtained after conversion and the index relative built in advance, it is determined that and year of birth The corresponding destination document of month A '.
In summary, because the keyword that user inputs is converted into the participle with search engine by intelligent semantic identifying device The keyword for the setting form that rule is consistent, so that the keyword for the setting form that search engine can be obtained according to conversion enters Row search, so that the searching accuracy improved.
Present invention also offers a kind of computer-readable recording medium, including execute instruction, when described in the computing device of storage control During execute instruction, the storage control performs the method that any of the above-described embodiment of the invention is provided.
In addition, present invention also offers a kind of storage control, including:Processor, memory and bus;The memory For storing execute instruction, the processor is connected with the memory by the bus, when storage control operation When, the execute instruction of memory storage described in the computing device, so that the storage control is performed in the present invention The method that any embodiment offer is provided.
In summary, each embodiment of the invention at least has the advantages that:
1st, in embodiments of the present invention, distinguished by building in advance including at least one keyword and each keyword The configuration file of corresponding regular expression.When getting the keyword of user's input, determined from configuration file and this pass The corresponding target regular expression of keyword, and according to the target regular expression determined, the keyword got is changed To set the target keyword of form, the target keyword after conversion is then sent to search engine, so that search engine root Scanned for according to the target keyword received.Because the keyword that the regular expression by building in advance is inputted to user enters Row conversion, so that the keyword of user's input is consistent with the word segmentation regulation of search engine, which thereby enhances search result Accuracy.
2nd, in embodiments of the present invention, the configuration file of structure can be xml document, then the keyword in configuration file and Regular expression corresponding with keyword is xml forms.This make it that each keyword and corresponding regular expression can be direct By computer understanding, so as to improve the conversion efficiency of the keyword inputted to user, and then search efficiency is improved.
3rd, in embodiments of the present invention, first build xml original documents, built in xml original documents respectively with it is various The corresponding node of the type of keyword, is then stored in same node by the corresponding regular expression of same type of keyword Under, consequently facilitating being managed to configuration file.
4th, in embodiments of the present invention, the form of the keyword inputted according to user, determines the corresponding type of keyword, and The node corresponding with this keyword type, and at least one stored under the node determined are determined from xml document In individual regular expression, it is determined that regular expression corresponding with the keyword that user inputs.It is this determine step by step by way of, The efficiency for determining target regular expression can be improved, and then improves search efficiency.
5th, in embodiments of the present invention, by building at least one keyword and at least one article for setting form in advance Index relative, then receive setting form keyword after, corresponding target can directly be determined according to index relative Document, then while the accuracy of search result is improved, to also improve search efficiency.
It should be noted that herein, such as first and second etc relational terms are used merely to an entity Or operation makes a distinction with another entity or operation, and not necessarily require or imply exist between these entities or operation Any this actual relation or order.Moreover, term " comprising ", "comprising" or its any other variant be intended to it is non- It is exclusive to include, so that process, method, article or equipment including a series of key elements not only include those key elements, But also other key elements including being not expressly set out, or also include solid by this process, method, article or equipment Some key elements.In the absence of more restrictions, the key element limited by sentence " including one ", is not arranged Except also there is other identical factor in the process including the key element, method, article or equipment.
One of ordinary skill in the art will appreciate that:Realizing all or part of step of above method embodiment can pass through Programmed instruction related hardware is completed, and foregoing program can be stored in the storage medium of embodied on computer readable, the program Upon execution, the step of including above method embodiment is performed;And foregoing storage medium includes:ROM, RAM, magnetic disc or light Disk etc. is various can be with the medium of store program codes.
It is last it should be noted that:Presently preferred embodiments of the present invention is the foregoing is only, the skill of the present invention is merely to illustrate Art scheme, is not intended to limit the scope of the present invention.Any modification for being made within the spirit and principles of the invention, Equivalent substitution, improvement etc., are all contained in protection scope of the present invention.

Claims (10)

1. a kind of intelligent semantic recognition methods, it is characterised in that including:
Configuration file is built, the configuration file includes at least one keyword and each described keyword is corresponded to respectively Regular expression;
Also include:
Obtain at least one keyword of user's input;
At least one target canonical corresponding with least one keyword that the user inputs is determined from the configuration file Expression formula;
According at least one described target regular expression, at least one keyword that the user inputs is converted at least one The target keyword of individual setting form;
The target keyword of at least one setting form is sent to the search engine of outside.
2. according to the method described in claim 1, it is characterised in that
The structure configuration file, the configuration file includes at least one keyword and each described keyword difference Corresponding regular expression, including:
Extensible markup language xml document is built, the xml document includes at least one keyword and each described pass Keyword distinguishes corresponding regular expression.
3. method according to claim 2, it is characterised in that
The structure extensible markup language xml document, the xml document includes at least one keyword and each institute State keyword and distinguish corresponding regular expression, including:
Build xml original documents;
At least one node is built in the xml original documents, at least one described canonical is stored under each described node Expression formula, forms the xml document;Wherein, under each described regular expression and present node for being stored under same node The type of corresponding keyword is identical.
4. method according to claim 3, it is characterised in that
It is described that at least one target corresponding with least one keyword that the user inputs is determined from the configuration file Regular expression, including:
Each the described keyword inputted for the user, is performed both by:
According to the form of the keyword, the corresponding type of the keyword is determined;
Node corresponding with the type of the keyword is determined from the xml document;
From at least one regular expression stored under the node determined, it is determined that target corresponding with the keyword Regular expression.
5. a kind of searching method, it is characterised in that applied to search engine, including:
Receive the target keyword of at least one setting form;
According to the target keyword of at least one setting form, scan for.
6. method according to claim 5, it is characterised in that
Further comprise:The index relative between the keyword and at least one document of at least one setting form is built in advance;
The target keyword of at least one setting form, is scanned for described in the basis, including:
According to the index relative, it is determined that at least one destination document corresponding with least one described target keyword.
7. a kind of intelligent semantic identifying device, it is characterised in that including:Construction unit, acquiring unit, processing unit and transmission are single Member;Wherein,
The construction unit, for building configuration file, the configuration file include at least one keyword and each The keyword distinguishes corresponding regular expression;
The acquiring unit, at least one keyword for obtaining user's input;
The processing unit, it is corresponding with least one keyword that the user inputs for being determined from the configuration file At least one target regular expression;According at least one described target regular expression, at least one that the user is inputted Individual keyword is converted to the target keyword of at least one setting form;
The transmitting element, the search engine for the target keyword of at least one setting form to be sent to outside.
8. device according to claim 7, it is characterised in that
The construction unit, builds at least one node, often for building xml original documents, and in the xml original documents At least one described regular expression is stored under one node, the xml document is formed;Wherein, deposited under same node The type of each described regular expression keyword corresponding with present node of storage is identical.
9. a kind of search engine, it is characterised in that including:Receiving unit and search unit;Wherein,
The receiving unit, the target keyword for receiving at least one setting form;
The search unit, for the target keyword according at least one setting form, is scanned for.
10. a kind of search system, it is characterised in that including:Intelligent semantic identifying device described in claim 7 or 8, Yi Jiquan Profit requires the search engine described in 9;Wherein,
The intelligent semantic identifying device, at least one keyword for user to be inputted is converted at least one setting form Target keyword, and the target keyword is sent to the search engine;
The search engine, for receiving the target keyword that the intelligent semantic identifying device is sent, and according to the target Keyword is scanned for.
CN201710440790.9A 2017-06-13 2017-06-13 A kind of intelligent semantic recognition methods, searching method, apparatus and system Pending CN107256260A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710440790.9A CN107256260A (en) 2017-06-13 2017-06-13 A kind of intelligent semantic recognition methods, searching method, apparatus and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710440790.9A CN107256260A (en) 2017-06-13 2017-06-13 A kind of intelligent semantic recognition methods, searching method, apparatus and system

Publications (1)

Publication Number Publication Date
CN107256260A true CN107256260A (en) 2017-10-17

Family

ID=60024574

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710440790.9A Pending CN107256260A (en) 2017-06-13 2017-06-13 A kind of intelligent semantic recognition methods, searching method, apparatus and system

Country Status (1)

Country Link
CN (1) CN107256260A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109284362A (en) * 2018-11-11 2019-01-29 广东小天才科技有限公司 A kind of content search method and system
CN113779935A (en) * 2021-09-10 2021-12-10 北京金堤科技有限公司 Text information acquisition method and system

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3711710B2 (en) * 1996-12-10 2005-11-02 セイコーエプソン株式会社 Information search and collection system and storage medium storing information search and collection program
CN101075435A (en) * 2007-04-19 2007-11-21 深圳先进技术研究院 Intelligent chatting system and its realizing method
CN103092979A (en) * 2013-01-31 2013-05-08 中国科学院对地观测与数字地球科学中心 Processing method and device for searching of natural language by remote sensing data
CN103631882A (en) * 2013-11-14 2014-03-12 北京邮电大学 Semantization service generation system and method based on graph mining technique
US20150293975A1 (en) * 2013-05-30 2015-10-15 Tencent Technology (Shenzhen) Company Limited Method and device for searching for contact object, and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3711710B2 (en) * 1996-12-10 2005-11-02 セイコーエプソン株式会社 Information search and collection system and storage medium storing information search and collection program
CN101075435A (en) * 2007-04-19 2007-11-21 深圳先进技术研究院 Intelligent chatting system and its realizing method
CN103092979A (en) * 2013-01-31 2013-05-08 中国科学院对地观测与数字地球科学中心 Processing method and device for searching of natural language by remote sensing data
US20150293975A1 (en) * 2013-05-30 2015-10-15 Tencent Technology (Shenzhen) Company Limited Method and device for searching for contact object, and storage medium
CN103631882A (en) * 2013-11-14 2014-03-12 北京邮电大学 Semantization service generation system and method based on graph mining technique

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
戎"码"一生: "XML中配置正则表达式的写法", 《HTTPS://WWW.CNBLOGS.COM/LUCKY_HU/ARCHIVE/2013/01/04/2845014.HTML》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109284362A (en) * 2018-11-11 2019-01-29 广东小天才科技有限公司 A kind of content search method and system
CN113779935A (en) * 2021-09-10 2021-12-10 北京金堤科技有限公司 Text information acquisition method and system

Similar Documents

Publication Publication Date Title
US11442932B2 (en) Mapping natural language to queries using a query grammar
US11790006B2 (en) Natural language question answering systems
US11449767B2 (en) Method of building a sorting model, and application method and apparatus based on the model
US8886648B1 (en) System and method for computation of document similarity
WO2020001373A1 (en) Method and apparatus for ontology construction
JP5116775B2 (en) Information retrieval method and apparatus, program, and computer-readable recording medium
CN104361127B (en) The multilingual quick constructive method of question and answer interface based on domain body and template logic
JP6118414B2 (en) Context Blind Data Transformation Using Indexed String Matching
US9367605B2 (en) Abstract generating search method and system
JP6014725B2 (en) Retrieval and information providing method and system for single / multi-sentence natural language queries
CN113762028A (en) Data-driven structure extraction from text documents
KR101522049B1 (en) Coreference resolution in an ambiguity-sensitive natural language processing system
CN107562919B (en) Multi-index integrated software component retrieval method and system based on information retrieval
CN103886099B (en) Semantic retrieval system and method of vague concepts
CN111428494A (en) Intelligent error correction method, device and equipment for proper nouns and storage medium
CN108875065B (en) Indonesia news webpage recommendation method based on content
US9971782B2 (en) Document tagging and retrieval using entity specifiers
CN103605781A (en) Implicit expression chapter relationship type inference method and system
CN109522396B (en) Knowledge processing method and system for national defense science and technology field
CA2853627A1 (en) Automatic creation of clinical study reports
JPWO2014002774A1 (en) Synonym extraction system, method and recording medium
US11151317B1 (en) Contextual spelling correction system
CN107256260A (en) A kind of intelligent semantic recognition methods, searching method, apparatus and system
US20060248037A1 (en) Annotation of inverted list text indexes using search queries
CN117171331B (en) Professional field information interaction method, device and equipment based on large language model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20171017