CN108228884B - Reading difficulty oriented search result preview system and preview method

Info

Publication number
CN108228884B
Authority
CN
China
Prior art keywords
user
concepts
search result
read
preview
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201810090592.9A
Other languages
Chinese (zh)
Other versions
CN108228884A (en)
Inventor
张引
赵玉丽
张斌
高克宁
李鹏飞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Northeastern University China
Original Assignee
Northeastern University China
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Northeastern University China
Priority to CN201810090592.9A
Publication of CN108228884A
Application granted
Publication of CN108228884B
Legal status: Expired - Fee Related
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/957 Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574 Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation

Abstract

A reading difficulty oriented search result preview system and preview method, belonging to the field of search engines. The system comprises a search result list page generation module, a preview result generation module and a preview result display module. The search result list page generation module generates the search result list page for a query request; the preview result generation module identifies the concepts used in the content of each search result and, for a given user, which of those concepts the user has used or learned; and the preview result display module displays the preview result generated by the preview result generation module. In the search result list page, the invention lets the user preview the concepts in a search result, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in the content of the search result that the user can read and may be able to read, thereby helping the user gauge the reading difficulty of a search result in advance and avoid clicking search results that are difficult to read.

Description

Reading difficulty oriented search result preview system and preview method
Technical Field
The invention belongs to the field of search engines, and particularly relates to a reading difficulty-oriented search result preview system and method.
Background
Search engines have become an important tool for users to obtain information and solve problems. The basic process by which a user uses a search engine is as follows: (1) submitting a query request to the search engine; (2) reading the search result list page and clicking in it to open a group of search results; (3) reading the opened search results. In step (2), the user needs to read the search result previews in the search result list page and judge both the value and the reading difficulty of each search result in order to decide which search results to click.
Current search engines present only the title and content summary of each search result when providing a search result preview. Because the information conveyed by the title and content summary is very limited, the user usually cannot judge the reading difficulty of a search result effectively from them. As a result, the user may click on a large number of search results that are difficult to read. In step (3), the user then has to read these difficult search results and cannot effectively obtain information from them, which reduces the effectiveness of using a search engine to obtain information and solve problems.
Disclosure of Invention
In order to overcome the defects of the prior art, the invention provides a search result preview system and a search result preview method facing reading difficulty.
The hardware used by the invention consists of PCs connected to the Internet. One PC serves as the search engine server, and the user accesses the server from a PC equipped with a browser.
The technical scheme of the invention is as follows:
a reading difficulty oriented search result preview system comprises three modules: a search result list page generation module running on a search engine server, a preview result generation module running on the search engine server, and a preview result display module running on a browser;
the search result list page generation module is used for generating the search result list page for a query request;
the preview result generation module is used for identifying the concepts used in the content of a search result and, for a given user, the concepts among them that the user has used and the concepts that the user has learned, and for calculating the percentages of the number of concepts used by the user and the number of concepts learned by the user relative to the number of concepts used in the content of the search result;
a global knowledge base and a user personalized knowledge base are stored in the preview result generating module;
the global knowledge base stores concepts to be identified in the process of previewing the search results;
the user personalized knowledge base stores concepts used by a given user and concepts learned by the user;
the concept used by the user refers to the concept used by the given user in the process of programming and writing the document;
the concept learned by the user refers to the concept that the given user has learned in the classroom learning process but has not used in the process of programming and writing documents;
the preview result display module is used for displaying the preview result generated by the preview result generation module.
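The two knowledge bases described above can be pictured with a small data model. The following Python sketch is illustrative only and is not taken from the patent: the names `UserKnowledgeBase`, `build_user_kb` and `GLOBAL_KB`, the sample concepts, and the choice to lowercase concept names are all assumptions. The one rule it takes from the text is that a learned concept is one the user met in class but has not used in programming or document writing.

```python
from dataclasses import dataclass, field


@dataclass
class UserKnowledgeBase:
    """Per-user store: concepts the user has used and concepts only learned."""
    used: set[str] = field(default_factory=set)
    learned: set[str] = field(default_factory=set)


def build_user_kb(used_in_work, taught_in_class):
    """Build a user knowledge base from two hypothetical input lists.

    used_in_work: concepts found in the user's programs and documents.
    taught_in_class: concepts covered in the user's classroom learning.
    """
    # Lowercasing is an assumption made here to simplify matching.
    used = {c.lower() for c in used_in_work}
    # Per the definition above, "learned" excludes anything already used.
    learned = {c.lower() for c in taught_in_class} - used
    return UserKnowledgeBase(used=used, learned=learned)


# The global knowledge base is modeled here as a plain set of concept names.
GLOBAL_KB = {"recursion", "hash table", "binary tree", "dynamic programming"}

kb = build_user_kb(["Recursion", "Hash table"], ["recursion", "binary tree"])
```

In this example "recursion" appears in both inputs, so it ends up only in `kb.used`, while "binary tree" is the single learned concept.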
A reading difficulty oriented search result preview method comprises the following steps:
step 1: a global knowledge base is stored in the preview result generation module in advance, wherein the global knowledge base contains the concepts that the preview result generation module needs to identify in the content of search results;
step 2: a user personalized knowledge base is stored in the preview result generation module in advance, wherein the user personalized knowledge base contains the concepts used by the user and the concepts learned by the user that the preview result generation module needs to identify in the content of search results;
step 3: the user initiates a query request to the search engine server using a browser;
step 4: the search engine server generates a search result list page based on the query request;
step 5: for each search result on the search result list page, the preview result generation module identifies the concepts used in the content of the search result based on the global knowledge base; for the user who initiated the query request, it identifies, among those concepts, the concepts the user can read and the concepts the user may be able to read, based on that user's personalized knowledge base; and it calculates the percentages of the number of concepts the user can read and the number of concepts the user may be able to read relative to the number of concepts used in the content of the search result;
step 5.1: for a given search result, the preview result generation module identifies the concepts used in the content of the search result based on the concepts contained in the global knowledge base;
step 5.2: for the user who initiated the query request, the preview result generation module identifies, among the concepts used in the content of the search result, the concepts the user can read and the concepts the user may be able to read, based on the user's personalized knowledge base;
step 5.3: for the search result and the user who initiated the query request, the preview result generation module calculates the percentage of concepts the user can read and the percentage of concepts the user may be able to read, based on the number of concepts used in the content of the search result, the number of those concepts the user can read, and the number of those concepts the user may be able to read;
step 6: the search engine returns to the user's browser a search result list page containing, for each search result, the concepts used in its content, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in its content that the user can read and may be able to read;
step 7: the user's browser displays the search result list page and, using the preview result display module, displays for each search result on the page the concepts used in the content of the search result, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in the content of the search result that the user can read and may be able to read;
step 7.1: taking the number of concepts used in the content of a search result as 100%, the percentage of concepts the user can read and the percentage of concepts the user may be able to read are displayed as a horizontal percentage stacked bar below the title of the search result;
step 7.2: the concepts used in the content of the search result are displayed below the search result title;
step 7.3: among the concepts displayed below the search result title, the concepts the user can read are marked with a background color;
step 7.4: among the concepts displayed below the search result title, the concepts the user may be able to read are marked with a background color different from the one used for the concepts the user can read.
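Steps 5.1 to 5.3 above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: the text does not specify how concepts are matched, so whole-phrase, case-insensitive matching is assumed, and it is also assumed that concepts the user has used count as readable while concepts the user has only learned count as possibly readable. All function names and sample data are hypothetical.

```python
import re


def identify_concepts(text, global_kb):
    """Step 5.1: return the global-KB concepts that occur in the text."""
    lowered = text.lower()
    found = set()
    for concept in global_kb:
        # Whole-phrase match (an assumption) so "tree" does not match "street".
        pattern = r"(?<!\w)" + re.escape(concept.lower()) + r"(?!\w)"
        if re.search(pattern, lowered):
            found.add(concept.lower())
    return found


def classify_concepts(concepts, used, learned):
    """Step 5.2: split concepts into readable and possibly readable.

    Assumption: used concepts are readable, learned ones possibly readable.
    """
    readable = concepts & used
    possibly_readable = (concepts & learned) - readable
    return readable, possibly_readable


def percentages(concepts, readable, possibly_readable):
    """Step 5.3: share of each group among all concepts in the result."""
    total = len(concepts)
    if total == 0:
        # No known concepts in the result: report zero for both groups.
        return 0.0, 0.0
    return 100.0 * len(readable) / total, 100.0 * len(possibly_readable) / total


global_kb = {"recursion", "hash table", "binary tree", "dynamic programming"}
used = {"recursion", "hash table"}
learned = {"binary tree"}
page = "Dynamic programming versus recursion on a binary tree, with a hash table cache."

concepts = identify_concepts(page, global_kb)
readable, possibly = classify_concepts(concepts, used, learned)
pct_read, pct_maybe = percentages(concepts, readable, possibly)
```

For the sample page all four concepts are found; two of them are readable and one is possibly readable, so the two percentages are 50 and 25, leaving 25 percent of the concepts unknown to the user.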
The beneficial effects of the invention are as follows. Building on existing search engine technology, the invention uses a global knowledge base to identify the concepts used in the content of each search result and, for the user who initiated the query request, uses that user's personalized knowledge base to mark which of those concepts the user can read and which the user may be able to read. It thereby provides a personalized search result preview: in the search result list page, the user can preview the concepts in a search result, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in the content of the search result that fall into each of these two groups. This helps the user gauge the reading difficulty of a search result in advance and avoid clicking search results that are difficult to read.
Drawings
Fig. 1 is a block diagram of a search result preview system for reading difficulty according to an embodiment of the present invention.
Fig. 2 is a block diagram of an interface of a search result preview system for reading difficulty according to an embodiment of the present invention.
Detailed Description
The following further describes a specific embodiment of the present invention with reference to the drawings and technical solutions.
A reading difficulty oriented search result preview system is shown in fig. 1. The hardware used by the invention consists of PCs connected to the Internet. One PC serves as the search engine server, and the user accesses the server from a PC equipped with a browser.
In this embodiment, the operating environment of the search engine server is: an Intel Core i7-4770 processor, 32 GB of DDR3 SDRAM and the Windows 10 operating system, connected to the Internet through the education network CERNET; the server is written using JSP technology and runs on a Tomcat application server.
In this embodiment, the operating environment of the user PC is: an Intel Core i7-4770 processor, 32 GB of DDR3 SDRAM and the Windows 10 operating system; the browser is Firefox 57.0.4.
A reading difficulty oriented search result preview system comprises three modules: a search result list page generation module running on the search engine server, a preview result generation module running on the search engine server, and a preview result display module running on the browser.
The search result list page generating module is used for generating a search result list page of the query request.
The preview result generation module is configured to identify the concepts used in the content of a search result and, for a given user, the concepts among them that the user has used and the concepts that the user has learned, and to calculate the percentages of the number of concepts used by the user and the number of concepts learned by the user relative to the number of concepts used in the content of the search result.
The preview result generating module is internally stored with a global knowledge base and a user personalized knowledge base.
The global knowledge base stores concepts to be identified in the process of previewing the search results.
The user personalized knowledge base stores concepts used by a given user and concepts learned by the user.
The concept used by the user refers to the concept used by the given user in the process of programming and composing the document.
The concept learned by the user refers to the concept that the given user has learned in the classroom learning process but has not used in the process of programming and writing documents.
The preview result display module is used for displaying the preview result generated by the preview result generation module.
When a user accesses the reading difficulty-oriented search result preview system of the invention by using a browser, the interaction with the invention comprises the following steps:
step 1: a global knowledge base is stored in the preview result generation module in advance, wherein the global knowledge base contains the concepts that the preview result generation module needs to identify in the content of search results;
step 2: a user personalized knowledge base is stored in the preview result generation module in advance, wherein the user personalized knowledge base contains the concepts used by the user and the concepts learned by the user that the preview result generation module needs to identify in the content of search results;
step 3: the user initiates a query request to the search engine server using a browser;
step 4: the search engine server generates a search result list page based on the query request;
step 5: for each search result on the search result list page, the preview result generation module identifies the concepts used in the content of the search result based on the global knowledge base; for the user who initiated the query request, it identifies, among those concepts, the concepts the user can read and the concepts the user may be able to read, based on that user's personalized knowledge base; and it calculates the percentages of the number of concepts the user can read and the number of concepts the user may be able to read relative to the number of concepts used in the content of the search result;
step 5.1: for a given search result, the preview result generation module identifies the concepts used in the content of the search result based on the concepts contained in the global knowledge base;
step 5.2: for the user who initiated the query request, the preview result generation module identifies, among the concepts used in the content of the search result, the concepts the user can read and the concepts the user may be able to read, based on the user's personalized knowledge base;
step 5.3: for the search result and the user who initiated the query request, the preview result generation module calculates the percentage of concepts the user can read and the percentage of concepts the user may be able to read, based on the number of concepts used in the content of the search result, the number of those concepts the user can read, and the number of those concepts the user may be able to read;
step 6: the search engine returns to the user's browser a search result list page containing, for each search result, the concepts used in its content, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in its content that the user can read and may be able to read;
step 7: the user's browser displays the search result list page and, using the preview result display module, displays for each search result on the page the concepts used in the content of the search result, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in the content of the search result that the user can read and may be able to read, as shown in fig. 2;
step 7.1: taking the number of concepts used in the content of a search result as 100%, the percentage of concepts the user can read and the percentage of concepts the user may be able to read are displayed as a horizontal percentage stacked bar below the title of the search result;
step 7.2: the concepts used in the content of the search result are displayed below the search result title;
step 7.3: among the concepts displayed below the search result title, the concepts the user can read are marked with a background color;
step 7.4: among the concepts displayed below the search result title, the concepts the user may be able to read are marked with a background color different from the one used for the concepts the user can read.
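The display steps 7.1 to 7.4 can be illustrated with a short rendering sketch. The embodiment describes a JSP server and a browser-side display module; Python is used here only to show the shape of the markup, and every name, color and HTML structure below is a hypothetical choice. The only constraints taken from the text are that the bar and concept list sit below the title, that the bar is a horizontal stacked bar with the concept count as 100%, and that the two concept groups use different background colors.

```python
from html import escape

# Hypothetical colors; the text only requires that the two differ.
READABLE_COLOR = "#b7e1a1"
POSSIBLY_COLOR = "#ffe08a"


def render_bar(pct_read, pct_maybe):
    """Step 7.1: horizontal 100% stacked bar with two colored segments."""
    return (
        '<div style="display:flex;width:100%;height:8px;background:#ddd">'
        f'<span style="width:{pct_read:.1f}%;background:{READABLE_COLOR}"></span>'
        f'<span style="width:{pct_maybe:.1f}%;background:{POSSIBLY_COLOR}"></span>'
        "</div>"
    )


def render_concepts(concepts, readable, possibly_readable):
    """Steps 7.2 to 7.4: list concepts, coloring the two known groups."""
    parts = []
    for concept in sorted(concepts):
        if concept in readable:
            style = f' style="background:{READABLE_COLOR}"'
        elif concept in possibly_readable:
            style = f' style="background:{POSSIBLY_COLOR}"'
        else:
            # Concepts the user neither used nor learned stay unmarked.
            style = ""
        parts.append(f"<span{style}>{escape(concept)}</span>")
    return " ".join(parts)


def render_preview(title, concepts, readable, possibly_readable, pct_read, pct_maybe):
    """Title first, then the bar and the concept list below it."""
    return (
        f"<h3>{escape(title)}</h3>"
        + render_bar(pct_read, pct_maybe)
        + f"<p>{render_concepts(concepts, readable, possibly_readable)}</p>"
    )


html_out = render_preview(
    "Memoization explained",
    {"recursion", "binary tree", "dynamic programming"},
    {"recursion"},
    {"binary tree"},
    100 / 3,
    100 / 3,
)
```

The unfilled remainder of the bar corresponds to concepts outside the user's personalized knowledge base, which gives a quick visual cue for how hard a result may be to read.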

Claims (1)

1. A reading difficulty oriented search result preview system, characterized by comprising three modules: a search result list page generation module running on a search engine server, a preview result generation module running on the search engine server, and a preview result display module running on a browser;
the search result list page generation module is used for generating the search result list page for a query request;
the preview result generation module is used for identifying the concepts used in the content of a search result and, for a given user, the concepts among them that the user has used and the concepts that the user has learned, and for calculating the percentages of the number of concepts used by the user and the number of concepts learned by the user relative to the number of concepts used in the content of the search result;
a global knowledge base and a user personalized knowledge base are stored in the preview result generating module;
the global knowledge base stores concepts to be identified in the process of previewing the search results;
the user personalized knowledge base stores concepts used by a given user and concepts learned by the user;
the concept used by the user refers to the concept used by the given user in the process of programming and writing the document;
the concept learned by the user refers to the concept that the given user has learned in the classroom learning process but has not used in the process of programming and writing documents;
the preview result display module is used for displaying the preview result generated by the preview result generating module;
the process for generating the search result preview specifically comprises the following steps:
step 1: a global knowledge base is stored in the preview result generation module in advance, wherein the global knowledge base contains the concepts that the preview result generation module needs to identify in the content of search results;
step 2: a user personalized knowledge base is stored in the preview result generation module in advance, wherein the user personalized knowledge base contains the concepts used by the user and the concepts learned by the user that the preview result generation module needs to identify in the content of search results;
step 3: the user initiates a query request to the search engine server using a browser;
step 4: the search engine server generates a search result list page based on the query request;
step 5: for each search result on the search result list page, the preview result generation module identifies the concepts used in the content of the search result based on the global knowledge base; for the user who initiated the query request, it identifies, among those concepts, the concepts the user can read and the concepts the user may be able to read, based on that user's personalized knowledge base; and it calculates the percentages of the number of concepts the user can read and the number of concepts the user may be able to read relative to the number of concepts used in the content of the search result;
step 5.1: for a given search result, the preview result generation module identifies the concepts used in the content of the search result based on the concepts contained in the global knowledge base;
step 5.2: for the user who initiated the query request, the preview result generation module identifies, among the concepts used in the content of the search result, the concepts the user can read and the concepts the user may be able to read, based on the user's personalized knowledge base;
step 5.3: for the search result and the user who initiated the query request, the preview result generation module calculates the percentage of concepts the user can read and the percentage of concepts the user may be able to read, based on the number of concepts used in the content of the search result, the number of those concepts the user can read, and the number of those concepts the user may be able to read;
step 6: the search engine returns to the user's browser a search result list page containing, for each search result, the concepts used in its content, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in its content that the user can read and may be able to read;
step 7: the user's browser displays the search result list page and, using the preview result display module, displays for each search result on the page the concepts used in the content of the search result, the concepts the user can read, the concepts the user may be able to read, and the percentages of the concepts used in the content of the search result that the user can read and may be able to read;
step 7.1: taking the number of concepts used in the content of a search result as 100%, the percentage of concepts the user can read and the percentage of concepts the user may be able to read are displayed as a horizontal percentage stacked bar below the title of the search result;
step 7.2: the concepts used in the content of the search result are displayed below the search result title;
step 7.3: among the concepts displayed below the search result title, the concepts the user can read are marked with a background color;
step 7.4: among the concepts displayed below the search result title, the concepts the user may be able to read are marked with a background color different from the one used for the concepts the user can read.
CN201810090592.9A 2018-01-30 2018-01-30 Reading difficulty oriented search result preview system and preview method Expired - Fee Related CN108228884B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810090592.9A CN108228884B (en) 2018-01-30 2018-01-30 Reading difficulty oriented search result preview system and preview method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810090592.9A CN108228884B (en) 2018-01-30 2018-01-30 Reading difficulty oriented search result preview system and preview method

Publications (2)

Publication Number Publication Date
CN108228884A (en) 2018-06-29
CN108228884B (en) 2022-04-05

Family

ID=62669818

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810090592.9A Expired - Fee Related CN108228884B (en) 2018-01-30 2018-01-30 Reading difficulty oriented search result preview system and preview method

Country Status (1)

Country Link
CN (1) CN108228884B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109740062B (en) * 2019-01-04 2020-10-16 东北大学 Search task clustering method based on learning output

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1296587A (en) * 1998-02-02 2001-05-23 蓝道尔·C·沃克 Text processor
CN104881403A (en) * 2015-06-04 2015-09-02 百度在线网络技术(北京)有限公司 Word segmentation method and device
CN105955975A (en) * 2016-04-15 2016-09-21 北京大学 Knowledge recommendation method for academic literature
CN106126621A (en) * 2016-06-22 2016-11-16 腾讯科技(深圳)有限公司 Method and apparatus recommended in article
CN106776705A (en) * 2016-11-16 2017-05-31 东北大学 A kind of interactive browser plug-in system towards search procedure

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6314420B1 (en) * 1996-04-04 2001-11-06 Lycos, Inc. Collaborative/adaptive search engine

Also Published As

Publication number Publication date
CN108228884A (en) 2018-06-29

Similar Documents

Publication Publication Date Title
US20200042560A1 (en) Automatically generating a website specific to an industry
Garg et al. Personalized, interactive tag recommendation for flickr
WO2019200783A1 (en) Method for data crawling in page containing dynamic image or table, device, terminal, and storage medium
US7383505B2 (en) Information sharing device and information sharing method
CN101542486B (en) Rank graph
CN109074383B (en) Document search with visualization within the context of a document
US20170011034A1 (en) Computerized system and method for automatically associating metadata with media objects
US20110191317A1 (en) Method for Human Editing of Information in Search Results
US8458584B1 (en) Extraction and analysis of user-generated content
CN111159494B (en) Data labeling method for multi-user concurrent processing
CN106021392A (en) News key information extraction method and system
CN111310693A (en) Intelligent labeling method and device for text in image and storage medium
CN109522490B (en) Map visualization method for internet information
CN108228884B (en) Reading difficulty oriented search result preview system and preview method
US20200073925A1 (en) Method and system for generating a website from collected content
Lommatzsch et al. News Images in MediaEval 2023
US8266140B2 (en) Tagging system using internet search engine
CN112418875A (en) Cross-platform tax intelligent customer service corpus migration method and device
Zhou et al. Assessing and predicting vertical intent for web queries
CN117095419A (en) PDF document data processing and information extracting device and method
KR20020028044A (en) Database link keyword portal service method
CN108268488A (en) The recognition methods of webpage master map and device
CN112149391B (en) Information processing method, information processing apparatus, terminal device, and storage medium
CN111209488B (en) Information sharing method and device
Lehfeldt et al. Metadata in coastal information systems

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220405