CN105183884A - Search engine system and method based on big data technique - Google Patents
Search engine system and method based on big data technique Download PDFInfo
- Publication number
- CN105183884A CN105183884A CN201510616027.8A CN201510616027A CN105183884A CN 105183884 A CN105183884 A CN 105183884A CN 201510616027 A CN201510616027 A CN 201510616027A CN 105183884 A CN105183884 A CN 105183884A
- Authority
- CN
- China
- Prior art keywords
- data
- index
- search
- database
- search engine
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
- G06F16/9038—Presentation of query results
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/903—Querying
Abstract
The invention discloses a search engine system based on the big data technique. The system comprises a collector, an indexer, an index database, an information resource database, a searcher and a big data kernel, wherein the collector is used for collecting a data object to be searched and processing data to be of a format capable of facilitating searching, the indexer is used for indexing data in data resources collected by the collector according to the attribute information of data and storing index data in the index database, the index database is used for storing data indexing information established by the indexer and storing various data resources collected by the collector, and the searcher is used for receiving a searching request of a user and submitting the searching request to the big data kernel according to keyword query conditions submitted by the user. The invention further discloses a search engine method based on the big data technique. The performance of a search engine is improved greatly, searching accuracy is improved, requirements for the search engine in nowadays are met, and data searching in the era of big data is achieved.
Description
Technical field
The invention belongs to computer information technology field, be specifically related to a kind of search engine system based on large data technique, the invention still further relates to a kind of search engine method based on large data technique.
Background technology
Along with the development of informationization technology, the especially rapid emergence of social networks, mobile interchange, Internet of Things, large market demand and universal, the data that human social development produces present explosive growth.Nowadays the data created for global every two days are just equivalent to the sum total starting to create to the mankind in 2003 data from human civilization, but also with annual 50% speed increment.The mankind have been brought into brand-new " large data " epoch by the data of undergoes rapid expansion, and data have become the strategic resource of equal importance with natural resources, human resources and production factors.In the face of so huge data, the data how quick obtaining needs from mass data, and excavate the knowledge needed, be the challenge faced now.
Summary of the invention
The object of this invention is to provide a kind of search engine system based on large data technique, need data to realize quick obtaining from mass data.
Another object of the present invention is to provide a kind of search engine method based on large data technique.
First technical scheme of the present invention is, a kind of search engine system based on large data technique, comprises collector, index, index database, information resource database, searcher and large data core;
Collector, is responsible for the data object gathering required search, and processes data into the data layout being convenient to carry out searching for;
Index, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database.
Index database, is responsible for the data indexing information storing index foundation, for search engine.
Information resource database, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core is a treatment and analysis kernel based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
The feature of the present invention first technical scheme is also,
Collector provides data acquisition access interface, is convenient to collection and the access of carrying out structuring and unstructured data.
Index sets up index according to the attribute information such as type, title, date of data to data.
Second technical scheme of the present invention is, a kind of search engine method of the search engine system based on large data technique is specifically implemented according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after index list in step 3 and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
The feature of the second technical scheme of the present invention is also,
For dissimilar data source in step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
Beneficial effect of the present invention is as follows:
A kind of search engine system based on large data technique of the present invention and search engine method, make use of large data technique and caching technology, compare with traditional search engine, substantially increase the performance of search engine, improve the precision of search simultaneously, adapt to society to the demand of search engine, solve the problem of large data age to data search.
Accompanying drawing explanation
Fig. 1 is the structural representation of a kind of search engine system based on large data technique of the present invention;
Fig. 2 is the designed holder composition of a kind of search engine system based on large data technique of the present invention;
Fig. 3 is the search engine process schematic diagram of a kind of search engine system based on large data technique of the present invention.
Embodiment
In order to make object of the present invention, technical scheme and advantage clearly understand, below in conjunction with the drawings and the specific embodiments, the present invention is further elaborated.
Fig. 1 shows the structural representation based on the search engine system of large data technique in the present invention, should comprise collector 1, index 2, index database 3, information resource database 4, searcher 5 and large data core 6 based on the search engine of large data technique;
Collector 1, is responsible for the data object gathering required search.Collector provides data acquisition access interface, can realize the access of data source and the collection of data by this interface; Collector provides the function of providing data formatting simultaneously, and the data collected can be carried out layout and arrangement according to the format rule pre-set by this function, are convenient to the search in later stage.Before data acquisition, carried out the setting of providing data formatting rule by interface, after starting, collector will carry out collection and the arrangement of data according to rule.As gathered demographic data, carry out gathering a format according to the rule of name, citizenship number, sex, date of birth.
Index 2, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database.
Index database 3, is responsible for the data indexing information storing index foundation, for search engine.
Information resource database 4, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher 5, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core 6 is the treatment and analysis kernels based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
In the present invention, collector provides data acquisition access interface, be convenient to carry out structuring and unstructured data collection and access;
Index sets up index according to the attribute information such as type, title, date of data to data;
Index database stores data indexing information according to KV (Key-Value) key assignments, conveniently retrieves, and improves search performance.
Based on a search engine method for the search engine system of large data technique, specifically implement according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after index list in step 3 and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
Wherein, for dissimilar data source in step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
In the present invention, the searching request accepting user is responsible for by searcher, and is responsible for searching request to submit to large data core according to the keyword query condition that user submits to.Searcher provides search interface, and user is by submitting to search key to carry out the retrieval of data to searcher.Key word can be one or one group, supports keyword search, full-text search and association search.Searcher is responsible for Search Results to return to user's request simultaneously.Searcher provides secondary filtration and the ranking function of Search Results, can improve search accuracy further.In order to improve search performance, searcher provides caching function, when having same searching request next time, can directly return searching result.
Keyword search retrieves data attribute information such as titles according to search key, is the most basic function of search engine;
Full-text search carries out full-text search according to search key to data content;
Association search is search and all data contents of keyword match, by large data core in conjunction with the search behavior of user and custom and key word etc., can recommend potential Search Results for user provides simultaneously.Belonging to the Premium Features of the search engine based on large data technique, is also a characteristic functions of the present invention.
In the present invention, the association search of searcher is the characteristic functions being different from traditional search engines, it can the search behavior of recording user and custom, for user provides search to recommend, large data technique can be utilized to carry out association analysis to Search Results simultaneously, and machine learning, excavate the potential knowledge of Search Results and result is returned.The principle of association search is by the statistics to user search behavior, and the search custom grasping user carries out record, can perform data search fast; Simultaneously by large data technique, can combine and analyze by the search key little to multiple degree of association, forming potential search key and search for.As searched for bed and bedside cupboard, by analysis, this user possible also needs mattress, and so searcher is while search bed and bedside cupboard, the result of the mattress of search together can be returned to user.
The present invention, by using large data technique, can improve the performance of search on the one hand; Another aspect is due to the characteristic of large data technique, and it has inborn advantage to the process of structuring and unstructured data, expands the category of search engine object search; 3rd aspect its machine learning is provided, and utilize the ability of its large Data Analysis Services, recommendation and the association analysis of potential data search be provided.Also be a characteristic of the present invention,
Result is carried out buffer memory by caching technology by the result that searcher searches, and when user submits same searching request again to, can directly directly be returned by the Search Results in buffer memory, request of need not resubmiting is retrieved, and improves search performance.
Illustrating how Design and implementation below should based on search engine of large data technique, Fig. 2 is the designed holder composition of the search engine system that the present invention is based on large data technique, first build the running environment of search engine, comprising: the infrastructure such as server, storage, network, safety.
Running environment upper strata is index database and the information resource database of search engine, is responsible for storing index and data resource, and sets up the incidence relation between index and data resource.
At the large data core that index database and information resource database upper strata are search engine, provide large data analysis and process ability, be responsible for the request and the execution that accept searcher, from index database and information resources library searching, and return Search Results.
Be collector, index and searcher on large data core, be responsible for acquisition and processing data respectively, set up index, and receive searching request, result is returned.
Fig. 3 is the search procedure schematic diagram of the search engine system that the present invention is based on large data technique.Should based on the search engine of large data technique in use, first collector gathers website, APP, structuring and unstructured data, by the data write information resources bank collected, index creates index by the attribute of data simultaneously, and index information is write index database.User passes through search interface, input key word, submit searching request to, searcher receives searching request, and search mission is submitted to large data core, and large data core is searched for from index database according to key word, from information resource database, the data content of coupling is extracted after searching result, search result list and data content are returned to searcher, is undertaken filtering result, sorting by searcher, finally return to user.Searcher can carry out buffer memory to Search Results simultaneously, is convenient to search next time.
Claims (5)
1. based on a search engine system for large data technique, it is characterized in that, comprise collector, index, index database, information resource database, searcher and large data core;
Collector, is responsible for the data object gathering required search, and processes data into the data layout being convenient to carry out searching for;
Index, is responsible for the data resource gathered collector, sets up index according to the attribute information of data to data, and by index datastore to index database;
Index database, is responsible for the data indexing information storing index foundation, for search engine;
Information resource database, is responsible for the Various types of data resource that storage of collected device gathers, and supports the storage of structuring data and unstructured data;
Searcher, is responsible for the searching request accepting user, and according to the keyword query condition that user submits to, searching request is submitted to large data core;
Large data core is a treatment and analysis kernel based on large data technique, and it is responsible for the request receiving searcher, searches for from index database, from information resource database, the result of coupling is returned to searcher simultaneously.
2. a kind of search engine system based on large data technique according to claim 1, it is characterized in that, described collector provides data acquisition access interface, be convenient to carry out structuring and unstructured data collection and access.
3. a kind of search engine system based on large data technique according to claim 1, is characterized in that, described index sets up index according to the type of data, title, date property information to data.
4. based on a search engine method for the search engine system of large data technique, it is characterized in that, specifically implement according to following steps:
First step 1, collector provide data acquisition interface, this interface can supported web page, APP, structuring and unstructured data sources access;
Step 2, data acquisition interface carry out data acquisition and crawl according to input parameter information to target data source, while gathering, collector carries out format process to the raw data collected, information resource database is stored in the mode of K-V key assignments, the collection to search data source is realized as data source to be searched, for dissimilar data source, then the data collected are processed into the data layout being convenient to carry out searching for by data acquisition interface input different parameters;
Step 3, while the data collected are stored into information resource database by collector in described step 2, index is started working, index is according to the input parameter key word of collector, and the raw data collected is analyzed, index list is set up to data, index list is stored to index database, and index list and raw data are carried out one_to_one corresponding;
Step 4, after in described step 3, index list and raw data realize one_to_one corresponding, by search interface inputted search key word and the search-type of searcher, generate searching request, by searcher, searching request is sent to large data core;
After step 5, large data core receive searching request, Rapid matching being carried out according to search key and search-type in index database, when there is no the index mated, returning empty search queue to searcher; If find the index of coupling from index database, large data core meeting visit information resources bank, the data of coupling are gone out according to indexed search, and Search Results is sorted according to similarity, fed back by searcher in the mode of list, terminate search, wherein, when some search keys frequently occur, the Search Results matched can be carried out buffer memory by large data core, can respond searching request fast next time.
5. the search engine method of a kind of search engine system based on large data technique according to claim 4, is characterized in that, for dissimilar data source in described step 1, data acquisition interface input different parameters specific practice is:
When the data source types collected is webpage: data acquisition interface input web site name, website URL reference address, collection key word, authorization message;
When the data source types collected is APP: the address of the data access interface that data acquisition interface input APP title, APP provide, authorization message;
When the data source types collected is structural data: data acquisition interface input IP address, type of database, port, database-name, user name, password;
When the data source types collected is unstructured data: data acquisition interface input data file store path, data file class, title.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510616027.8A CN105183884A (en) | 2015-09-24 | 2015-09-24 | Search engine system and method based on big data technique |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510616027.8A CN105183884A (en) | 2015-09-24 | 2015-09-24 | Search engine system and method based on big data technique |
Publications (1)
Publication Number | Publication Date |
---|---|
CN105183884A true CN105183884A (en) | 2015-12-23 |
Family
ID=54905965
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201510616027.8A Pending CN105183884A (en) | 2015-09-24 | 2015-09-24 | Search engine system and method based on big data technique |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN105183884A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106294695A (en) * | 2016-08-08 | 2017-01-04 | 深圳市网安计算机安全检测技术有限公司 | A kind of implementation method towards the biggest data search engine |
CN106844755A (en) * | 2017-02-17 | 2017-06-13 | 山东浪潮商用系统有限公司 | A kind of data assets method for quickly retrieving based on search engine |
CN107103011A (en) * | 2016-02-23 | 2017-08-29 | 阿里巴巴集团控股有限公司 | The implementation method and device of terminal data search |
CN107679091A (en) * | 2017-09-07 | 2018-02-09 | 北京海融兴通信息安全技术有限公司 | A kind of search system and method based on big data |
CN108090071A (en) * | 2016-11-22 | 2018-05-29 | 北大方正集团有限公司 | Collection of resources method and apparatus in resources bank |
CN108228743A (en) * | 2017-12-18 | 2018-06-29 | 深圳供电局有限公司 | A kind of real-time big data search engine system |
CN109063138A (en) * | 2018-08-03 | 2018-12-21 | 上海点融信息科技有限责任公司 | For in block chain, that is, service platform search data method, equipment and storage medium |
CN110472144A (en) * | 2019-07-23 | 2019-11-19 | 创梦工场(杭州)网络科技有限公司 | A kind of information brings service platform and implementation method together |
CN111026951A (en) * | 2019-12-05 | 2020-04-17 | 武汉国贸通大数据有限公司 | Intelligent commerce information search system for international trade |
CN111368166A (en) * | 2020-03-05 | 2020-07-03 | 深圳中兴网信科技有限公司 | Resource search method, resource search apparatus, and computer-readable storage medium |
CN111429987A (en) * | 2020-03-20 | 2020-07-17 | 深圳市凯沃尔电子有限公司 | Searching method and device based on index file |
CN112988863A (en) * | 2021-02-09 | 2021-06-18 | 苏州中科蓝迪软件技术有限公司 | Elasticissearch-based efficient search engine method for heterogeneous multiple data sources |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102200974A (en) * | 2010-03-25 | 2011-09-28 | 北京师范大学 | Unified information retrieval intelligent agent system and method for search engine |
CN103294715A (en) * | 2012-02-29 | 2013-09-11 | 腾讯科技(深圳)有限公司 | Hidden web data search method and search engine |
CN103927342A (en) * | 2014-03-28 | 2014-07-16 | 苏州中炎工贸有限公司 | Vertical search engine system on basis of big data |
CN104778200A (en) * | 2015-01-13 | 2015-07-15 | 东莞中山大学研究院 | Heterogeneous processing big data retrieval method combining historical data |
-
2015
- 2015-09-24 CN CN201510616027.8A patent/CN105183884A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102200974A (en) * | 2010-03-25 | 2011-09-28 | 北京师范大学 | Unified information retrieval intelligent agent system and method for search engine |
CN103294715A (en) * | 2012-02-29 | 2013-09-11 | 腾讯科技(深圳)有限公司 | Hidden web data search method and search engine |
CN103927342A (en) * | 2014-03-28 | 2014-07-16 | 苏州中炎工贸有限公司 | Vertical search engine system on basis of big data |
CN104778200A (en) * | 2015-01-13 | 2015-07-15 | 东莞中山大学研究院 | Heterogeneous processing big data retrieval method combining historical data |
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107103011A (en) * | 2016-02-23 | 2017-08-29 | 阿里巴巴集团控股有限公司 | The implementation method and device of terminal data search |
CN106294695A (en) * | 2016-08-08 | 2017-01-04 | 深圳市网安计算机安全检测技术有限公司 | A kind of implementation method towards the biggest data search engine |
CN108090071A (en) * | 2016-11-22 | 2018-05-29 | 北大方正集团有限公司 | Collection of resources method and apparatus in resources bank |
CN106844755A (en) * | 2017-02-17 | 2017-06-13 | 山东浪潮商用系统有限公司 | A kind of data assets method for quickly retrieving based on search engine |
CN107679091A (en) * | 2017-09-07 | 2018-02-09 | 北京海融兴通信息安全技术有限公司 | A kind of search system and method based on big data |
CN108228743A (en) * | 2017-12-18 | 2018-06-29 | 深圳供电局有限公司 | A kind of real-time big data search engine system |
CN109063138A (en) * | 2018-08-03 | 2018-12-21 | 上海点融信息科技有限责任公司 | For in block chain, that is, service platform search data method, equipment and storage medium |
WO2020024908A1 (en) * | 2018-08-03 | 2020-02-06 | 上海点融信息科技有限责任公司 | Method and device for data searching on blockchain-as-a-service platform, and storage medium |
CN110472144A (en) * | 2019-07-23 | 2019-11-19 | 创梦工场(杭州)网络科技有限公司 | A kind of information brings service platform and implementation method together |
CN111026951A (en) * | 2019-12-05 | 2020-04-17 | 武汉国贸通大数据有限公司 | Intelligent commerce information search system for international trade |
CN111368166A (en) * | 2020-03-05 | 2020-07-03 | 深圳中兴网信科技有限公司 | Resource search method, resource search apparatus, and computer-readable storage medium |
CN111429987A (en) * | 2020-03-20 | 2020-07-17 | 深圳市凯沃尔电子有限公司 | Searching method and device based on index file |
CN111429987B (en) * | 2020-03-20 | 2023-05-09 | 深圳市凯沃尔电子有限公司 | Index file-based searching method and device |
CN112988863A (en) * | 2021-02-09 | 2021-06-18 | 苏州中科蓝迪软件技术有限公司 | Elasticissearch-based efficient search engine method for heterogeneous multiple data sources |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN105183884A (en) | Search engine system and method based on big data technique | |
US10289700B2 (en) | Method for dynamically matching images with content items based on keywords in response to search queries | |
US9946753B2 (en) | Method and system for document indexing and data querying | |
JP5494454B2 (en) | Search result generation method, search result generation program, and search system | |
US20130013616A1 (en) | Systems and Methods for Natural Language Searching of Structured Data | |
JP2017220203A (en) | Method and system for evaluating matching between content item and image based on similarity scores | |
JP6165955B1 (en) | Method and system for matching images and content using whitelist and blacklist in response to search query | |
US10235387B2 (en) | Method for selecting images for matching with content based on metadata of images and content in real-time in response to search queries | |
CN111522905A (en) | Document searching method and device based on database | |
US20170255653A1 (en) | Method for categorizing images to be associated with content items based on keywords of search queries | |
Davison et al. | Finding Relevant Website Queries. | |
Khodaei et al. | Temporal-textual retrieval: Time and keyword search in web documents | |
Cheng et al. | MISDA: web services discovery approach based on mining interface semantics | |
RU2473119C1 (en) | Method and system for semantic search of electronic documents | |
Zhao et al. | A new keywords method to improve web search | |
Shekhar et al. | A WEBIR crawling framework for retrieving highly relevant web documents: evaluation based on rank aggregation and result merging algorithms | |
Qiu et al. | Detection and optimized disposal of near-duplicate pages | |
Khurana et al. | Survey of techniques for deep web source selection and surfacing the hidden web content | |
TWI423053B (en) | Domain Interpretation Data Retrieval Method and Its System | |
Wardekar et al. | SmartCrawler: A Personalized Web Search for Relevant Web Pages | |
Mourad et al. | In-Depth Métan-Search Engine | |
Xiao-Shu et al. | Cloud computing oriented retrieval technology based on big data | |
Naamha et al. | Web Page Ranking Based on Text Content and Link Information Using Data Mining Techniques | |
Shi et al. | VSEC: A Vertical Search Engine for E-commerce | |
CN113127596A (en) | Full-text retrieval method, system, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20151223 |
|
RJ01 | Rejection of invention patent application after publication |