CN105183884A - Search engine system and method based on big data technique - Google Patents

Search engine system and method based on big data technique Download PDF

Info

Publication number
CN105183884A
CN105183884A CN201510616027.8A CN201510616027A CN105183884A CN 105183884 A CN105183884 A CN 105183884A CN 201510616027 A CN201510616027 A CN 201510616027A CN 105183884 A CN105183884 A CN 105183884A
Authority
CN
China
Prior art keywords
data
index
search
database
search engine
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510616027.8A
Other languages
Chinese (zh)
Inventor
刘守仓
张丹星
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xi'an Following International Information Ltd Co
Original Assignee
Xi'an Following International Information Ltd Co
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xi'an Following International Information Ltd Co filed Critical Xi'an Following International Information Ltd Co
Priority to CN201510616027.8A priority Critical patent/CN105183884A/en
Publication of CN105183884A publication Critical patent/CN105183884A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/9038Presentation of query results
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying

Abstract

The invention discloses a search engine system based on the big data technique. The system comprises a collector, an indexer, an index database, an information resource database, a searcher and a big data kernel, wherein the collector is used for collecting a data object to be searched and processing data to be of a format capable of facilitating searching, the indexer is used for indexing data in data resources collected by the collector according to the attribute information of data and storing index data in the index database, the index database is used for storing data indexing information established by the indexer and storing various data resources collected by the collector, and the searcher is used for receiving a searching request of a user and submitting the searching request to the big data kernel according to keyword query conditions submitted by the user. The invention further discloses a search engine method based on the big data technique. The performance of a search engine is improved greatly, searching accuracy is improved, requirements for the search engine in nowadays are met, and data searching in the era of big data is achieved.

Description

A kind of search engine system based on large data technique and search engine method
Technical field
The invention belongs to computer information technology field, be specifically related to a kind of search engine system based on large data technique, the invention still further relates to a kind of search engine method based on large data technique.
Background technology
Along with the development of informationization technology, the especially rapid emergence of social networks, mobile interchange, Internet of Things, large market demand and universal, the data that human social development produces present explosive growth.Nowadays the data created for global every two days are just equivalent to the sum total starting to create to the mankind in 2003 data from human civilization, but also with annual 50% speed increment.The mankind have been brought into brand-new " large data " epoch by the data of undergoes rapid expansion, and data have become the strategic resource of equal importance with natural resources, human resources and production factors.In the face of so huge data, the data how quick obtaining needs from mass data, and excavate the knowledge needed, be the challenge faced now.
Summary of the invention
The object of this invention is to provide a kind of search engine system based on large data technique, need data to realize quick obtaining from mass data.
Another object of the present invention is to provide a kind of search engine method based on large data technique.
First technical scheme of the present invention is, a kind of search engine system based on large data technique, comprises collector, index, index database, information resource database, searcher and large data core;
Collector, is responsible for the data object gathering required search, and processes data into the data layout being convenient to carry out searching for;
Index, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database.
Index database, is responsible for the data indexing information storing index foundation, for search engine.
Information resource database, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core is a treatment and analysis kernel based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
The feature of the present invention first technical scheme is also,
Collector provides data acquisition access interface, is convenient to collection and the access of carrying out structuring and unstructured data.
Index sets up index according to the attribute information such as type, title, date of data to data.
Second technical scheme of the present invention is, a kind of search engine method of the search engine system based on large data technique is specifically implemented according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after index list in step 3 and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
The feature of the second technical scheme of the present invention is also,
For dissimilar data source in step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
Beneficial effect of the present invention is as follows:
A kind of search engine system based on large data technique of the present invention and search engine method, make use of large data technique and caching technology, compare with traditional search engine, substantially increase the performance of search engine, improve the precision of search simultaneously, adapt to society to the demand of search engine, solve the problem of large data age to data search.
Accompanying drawing explanation
Fig. 1 is the structural representation of a kind of search engine system based on large data technique of the present invention;
Fig. 2 is the designed holder composition of a kind of search engine system based on large data technique of the present invention;
Fig. 3 is the search engine process schematic diagram of a kind of search engine system based on large data technique of the present invention.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearly understand, below in conjunction with the drawings and the specific embodiments, the present invention is further elaborated.
Fig. 1 shows the structural representation based on the search engine system of large data technique in the present invention, should comprise collector 1, index 2, index database 3, information resource database 4, searcher 5 and large data core 6 based on the search engine of large data technique;
Collector 1, is responsible for the data object gathering required search.Collector provides data acquisition access interface, can realize the access of data source and the collection of data by this interface; Collector provides the function of providing data formatting simultaneously, and the data collected can be carried out layout and arrangement according to the format rule pre-set by this function, are convenient to the search in later stage.Before data acquisition, carried out the setting of providing data formatting rule by interface, after starting, collector will carry out collection and the arrangement of data according to rule.As gathered demographic data, carry out gathering a format according to the rule of name, citizenship number, sex, date of birth.
Index 2, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database.
Index database 3, is responsible for the data indexing information storing index foundation, for search engine.
Information resource database 4, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher 5, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core 6 is the treatment and analysis kernels based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
In the present invention, collector provides data acquisition access interface, be convenient to carry out structuring and unstructured data collection and access;
Index sets up index according to the attribute information such as type, title, date of data to data;
Index database stores data indexing information according to KV (Key-Value) key assignments, conveniently retrieves, and improves search performance.
Based on a search engine method for the search engine system of large data technique, specifically implement according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after index list in step 3 and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
Wherein, for dissimilar data source in step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
In the present invention, the searching request accepting user is responsible for by searcher, and is responsible for searching request to submit to large data core according to the keyword query condition that user submits to.Searcher provides search interface, and user is by submitting to search key to carry out the retrieval of data to searcher.Key word can be one or one group, supports keyword search, full-text search and association search.Searcher is responsible for Search Results to return to user's request simultaneously.Searcher provides secondary filtration and the ranking function of Search Results, can improve search accuracy further.In order to improve search performance, searcher provides caching function, when having same searching request next time, can directly return searching result.
Keyword search retrieves data attribute information such as titles according to search key, is the most basic function of search engine;
Full-text search carries out full-text search according to search key to data content;
Association search is search and all data contents of keyword match, by large data core in conjunction with the search behavior of user and custom and key word etc., can recommend potential Search Results for user provides simultaneously.Belonging to the Premium Features of the search engine based on large data technique, is also a characteristic functions of the present invention.
In the present invention, the association search of searcher is the characteristic functions being different from traditional search engines, it can the search behavior of recording user and custom, for user provides search to recommend, large data technique can be utilized to carry out association analysis to Search Results simultaneously, and machine learning, excavate the potential knowledge of Search Results and result is returned.The principle of association search is by the statistics to user search behavior, and the search custom grasping user carries out record, can perform data search fast; Simultaneously by large data technique, can combine and analyze by the search key little to multiple degree of association, forming potential search key and search for.As searched for bed and bedside cupboard, by analysis, this user possible also needs mattress, and so searcher is while search bed and bedside cupboard, the result of the mattress of search together can be returned to user.
The present invention, by using large data technique, can improve the performance of search on the one hand; Another aspect is due to the characteristic of large data technique, and it has inborn advantage to the process of structuring and unstructured data, expands the category of search engine object search; 3rd aspect its machine learning is provided, and utilize the ability of its large Data Analysis Services, recommendation and the association analysis of potential data search be provided.Also be a characteristic of the present invention,
Result is carried out buffer memory by caching technology by the result that searcher searches, and when user submits same searching request again to, can directly directly be returned by the Search Results in buffer memory, request of need not resubmiting is retrieved, and improves search performance.
Illustrating how Design and implementation below should based on search engine of large data technique, Fig. 2 is the designed holder composition of the search engine system that the present invention is based on large data technique, first build the running environment of search engine, comprising: the infrastructure such as server, storage, network, safety.
Running environment upper strata is index database and the information resource database of search engine, is responsible for storing index and data resource, and sets up the incidence relation between index and data resource.
At the large data core that index database and information resource database upper strata are search engine, provide large data analysis and process ability, be responsible for the request and the execution that accept searcher, from index database and information resources library searching, and return Search Results.
Be collector, index and searcher on large data core, be responsible for acquisition and processing data respectively, set up index, and receive searching request, result is returned.
Fig. 3 is the search procedure schematic diagram of the search engine system that the present invention is based on large data technique.Should based on the search engine of large data technique in use, first collector gathers website, APP, structuring and unstructured data, by the data write information resources bank collected, index creates index by the attribute of data simultaneously, and index information is write index database.User passes through search interface, input key word, submit searching request to, searcher receives searching request, and search mission is submitted to large data core, and large data core is searched for from index database according to key word, from information resource database, the data content of coupling is extracted after searching result, search result list and data content are returned to searcher, is undertaken filtering result, sorting by searcher, finally return to user.Searcher can carry out buffer memory to Search Results simultaneously, is convenient to search next time.

Claims (5)

1. based on a search engine system for large data technique, it is characterized in that, comprise collector, index, index database, information resource database, searcher and large data core;
Collector, is responsible for the data object gathering required search, and processes data into the data layout being convenient to carry out searching for;
Index, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database;
Index database, is responsible for the data indexing information storing index foundation, for search engine;
Information resource database, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core is a treatment and analysis kernel based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
2. a kind of search engine system based on large data technique according to claim 1, it is characterized in that, described collector provides data acquisition access interface, be convenient to carry out structuring and unstructured data collection and access.
3. a kind of search engine system based on large data technique according to claim 1, is characterized in that, described index sets up index according to the type of data, title, date property information to data.
4. based on a search engine method for the search engine system of large data technique, it is characterized in that, specifically implement according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in described step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after in described step 3, index list and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
5. the search engine method of a kind of search engine system based on large data technique according to claim 4, is characterized in that, for dissimilar data source in described step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
CN201510616027.8A 2015-09-24 2015-09-24 Search engine system and method based on big data technique Pending CN105183884A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510616027.8A CN105183884A (en) 2015-09-24 2015-09-24 Search engine system and method based on big data technique

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510616027.8A CN105183884A (en) 2015-09-24 2015-09-24 Search engine system and method based on big data technique

Publications (1)

Publication Number Publication Date
CN105183884A true CN105183884A (en) 2015-12-23

Family

ID=54905965

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510616027.8A Pending CN105183884A (en) 2015-09-24 2015-09-24 Search engine system and method based on big data technique

Country Status (1)

Country Link
CN (1) CN105183884A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106294695A (en) * 2016-08-08 2017-01-04 深圳市网安计算机安全检测技术有限公司 A kind of implementation method towards the biggest data search engine
CN106844755A (en) * 2017-02-17 2017-06-13 山东浪潮商用系统有限公司 A kind of data assets method for quickly retrieving based on search engine
CN107103011A (en) * 2016-02-23 2017-08-29 阿里巴巴集团控股有限公司 The implementation method and device of terminal data search
CN107679091A (en) * 2017-09-07 2018-02-09 北京海融兴通信息安全技术有限公司 A kind of search system and method based on big data
CN108090071A (en) * 2016-11-22 2018-05-29 北大方正集团有限公司 Collection of resources method and apparatus in resources bank
CN108228743A (en) * 2017-12-18 2018-06-29 深圳供电局有限公司 A kind of real-time big data search engine system
CN109063138A (en) * 2018-08-03 2018-12-21 上海点融信息科技有限责任公司 For in block chain, that is, service platform search data method, equipment and storage medium
CN110472144A (en) * 2019-07-23 2019-11-19 创梦工场(杭州)网络科技有限公司 A kind of information brings service platform and implementation method together
CN111026951A (en) * 2019-12-05 2020-04-17 武汉国贸通大数据有限公司 Intelligent commerce information search system for international trade
CN111368166A (en) * 2020-03-05 2020-07-03 深圳中兴网信科技有限公司 Resource search method, resource search apparatus, and computer-readable storage medium
CN111429987A (en) * 2020-03-20 2020-07-17 深圳市凯沃尔电子有限公司 Searching method and device based on index file
CN112988863A (en) * 2021-02-09 2021-06-18 苏州中科蓝迪软件技术有限公司 Elasticissearch-based efficient search engine method for heterogeneous multiple data sources

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102200974A (en) * 2010-03-25 2011-09-28 北京师范大学 Unified information retrieval intelligent agent system and method for search engine
CN103294715A (en) * 2012-02-29 2013-09-11 腾讯科技(深圳)有限公司 Hidden web data search method and search engine
CN103927342A (en) * 2014-03-28 2014-07-16 苏州中炎工贸有限公司 Vertical search engine system on basis of big data
CN104778200A (en) * 2015-01-13 2015-07-15 东莞中山大学研究院 Heterogeneous processing big data retrieval method combining historical data

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102200974A (en) * 2010-03-25 2011-09-28 北京师范大学 Unified information retrieval intelligent agent system and method for search engine
CN103294715A (en) * 2012-02-29 2013-09-11 腾讯科技(深圳)有限公司 Hidden web data search method and search engine
CN103927342A (en) * 2014-03-28 2014-07-16 苏州中炎工贸有限公司 Vertical search engine system on basis of big data
CN104778200A (en) * 2015-01-13 2015-07-15 东莞中山大学研究院 Heterogeneous processing big data retrieval method combining historical data

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107103011A (en) * 2016-02-23 2017-08-29 阿里巴巴集团控股有限公司 The implementation method and device of terminal data search
CN106294695A (en) * 2016-08-08 2017-01-04 深圳市网安计算机安全检测技术有限公司 A kind of implementation method towards the biggest data search engine
CN108090071A (en) * 2016-11-22 2018-05-29 北大方正集团有限公司 Collection of resources method and apparatus in resources bank
CN106844755A (en) * 2017-02-17 2017-06-13 山东浪潮商用系统有限公司 A kind of data assets method for quickly retrieving based on search engine
CN107679091A (en) * 2017-09-07 2018-02-09 北京海融兴通信息安全技术有限公司 A kind of search system and method based on big data
CN108228743A (en) * 2017-12-18 2018-06-29 深圳供电局有限公司 A kind of real-time big data search engine system
CN109063138A (en) * 2018-08-03 2018-12-21 上海点融信息科技有限责任公司 For in block chain, that is, service platform search data method, equipment and storage medium
WO2020024908A1 (en) * 2018-08-03 2020-02-06 上海点融信息科技有限责任公司 Method and device for data searching on blockchain-as-a-service platform, and storage medium
CN110472144A (en) * 2019-07-23 2019-11-19 创梦工场(杭州)网络科技有限公司 A kind of information brings service platform and implementation method together
CN111026951A (en) * 2019-12-05 2020-04-17 武汉国贸通大数据有限公司 Intelligent commerce information search system for international trade
CN111368166A (en) * 2020-03-05 2020-07-03 深圳中兴网信科技有限公司 Resource search method, resource search apparatus, and computer-readable storage medium
CN111429987A (en) * 2020-03-20 2020-07-17 深圳市凯沃尔电子有限公司 Searching method and device based on index file
CN111429987B (en) * 2020-03-20 2023-05-09 深圳市凯沃尔电子有限公司 Index file-based searching method and device
CN112988863A (en) * 2021-02-09 2021-06-18 苏州中科蓝迪软件技术有限公司 Elasticissearch-based efficient search engine method for heterogeneous multiple data sources

Similar Documents

Publication Publication Date Title
CN105183884A (en) Search engine system and method based on big data technique
US10289700B2 (en) Method for dynamically matching images with content items based on keywords in response to search queries
US9946753B2 (en) Method and system for document indexing and data querying
JP5494454B2 (en) Search result generation method, search result generation program, and search system
US20130013616A1 (en) Systems and Methods for Natural Language Searching of Structured Data
JP2017220203A (en) Method and system for evaluating matching between content item and image based on similarity scores
JP6165955B1 (en) Method and system for matching images and content using whitelist and blacklist in response to search query
US10235387B2 (en) Method for selecting images for matching with content based on metadata of images and content in real-time in response to search queries
CN111522905A (en) Document searching method and device based on database
US20170255653A1 (en) Method for categorizing images to be associated with content items based on keywords of search queries
Davison et al. Finding Relevant Website Queries.
Khodaei et al. Temporal-textual retrieval: Time and keyword search in web documents
Cheng et al. MISDA: web services discovery approach based on mining interface semantics
RU2473119C1 (en) Method and system for semantic search of electronic documents
Zhao et al. A new keywords method to improve web search
Shekhar et al. A WEBIR crawling framework for retrieving highly relevant web documents: evaluation based on rank aggregation and result merging algorithms
Qiu et al. Detection and optimized disposal of near-duplicate pages
Khurana et al. Survey of techniques for deep web source selection and surfacing the hidden web content
TWI423053B (en) Domain Interpretation Data Retrieval Method and Its System
Wardekar et al. SmartCrawler: A Personalized Web Search for Relevant Web Pages
Mourad et al. In-Depth Métan-Search Engine
Xiao-Shu et al. Cloud computing oriented retrieval technology based on big data
Naamha et al. Web Page Ranking Based on Text Content and Link Information Using Data Mining Techniques
Shi et al. VSEC: A Vertical Search Engine for E-commerce
CN113127596A (en) Full-text retrieval method, system, electronic equipment and storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20151223

RJ01 Rejection of invention patent application after publication