CN109710829A - Acquisition and pushing method for an Internet medical information inquiry system - Google Patents

Acquisition and pushing method for an Internet medical information inquiry system

Info

Publication number
CN109710829A
Authority
CN
China
Prior art keywords
page
information
layer
url
business logic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811610965.7A
Other languages
Chinese (zh)
Inventor
彭德中
任晓累
刘杰
陈鹏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan Dark Horse Digital Technology Co Ltd
Original Assignee
Sichuan Dark Horse Digital Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2018-12-27
Filing date
2018-12-27
Publication date
2019-05-03
Application filed by Sichuan Dark Horse Digital Technology Co Ltd
Priority to CN201811610965.7A
Publication of CN109710829A

Landscapes

  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses an acquisition and pushing method for an Internet medical information inquiry system, comprising the following steps: 1) creating a business logic layer, which takes the request object as a parameter, obtains the user's request from the request object, and returns notice information to the view layer; 2) creating a data crawling layer, which takes a page URL as a parameter, obtains the notice information, and returns it to the business logic layer; 3) creating a view layer, which consists mainly of one static HTML page and two dynamic JSP pages, the static HTML page being the system's home page and responsible for submitting the keyword entered by the user, and the dynamic JSP pages displaying the latest notice list and the content of a specific notice. The invention can crawl relevant information or notice information from medical portal websites and local hospital websites respectively, can display the information or notices of both on the same page, and lets the user click a hyperlink on the page to view the content of a specific item.

Description

Acquisition and pushing method for an Internet medical information inquiry system
Technical field
The present invention relates to the field of search engines, and in particular to a method for automatically acquiring and pushing Internet information, which crawls the information or notices on medical portal websites and local hospital websites for the user and presents the collected data to the user in a synchronized manner.
Background art
Faced with the vast resources of the Internet, a search engine provides a portal through which, without exaggeration, any user can search from anywhere for whatever they want to find. It has therefore become the most widely used online service after e-mail. What is a search engine, and how does it work? The term is widely used on the Internet, but its meaning differs from region to region. In the United States, a search engine usually means an Internet-based search engine that gathers tens of millions to hundreds of millions of web pages through web robot programs and indexes every word in them, which is what we call full-text search; Google is a well-known example. In China, a search engine often refers to a Web-based directory search service or to the search service of a specific website, the latter being built on that website's own search technology.
In the broad sense, a search engine is any tool or system that provides information retrieval services on the Internet, that is, a system that responds to users' search requests on or through the Internet and returns the corresponding results. In the narrow sense, a search engine is a system that automatically or manually collects resources on the Web, organizes them into a database, and offers search in the form of a website, providing collection, analysis and indexing services so as to deliver information services to users over the network.
Briefly, a WWW search engine is an information retrieval system for the network environment, and it usually works in one of two modes. The first is classified-directory search: Internet resources are collected and arranged, different resources are placed under different directory types, and the directories are organized into levels of categories; users find the category level they want, follow it to the destination, and locate the information they need. The second is keyword-based retrieval: users enter keywords, possibly combined with logical operators, and the search engine finds the resources matching those keywords according to certain rules, then returns the information containing the keywords together with links to the websites concerned. In fact, a search engine is simply a website that specializes in providing search services; it uses its own methods to collect information from across the Internet and helps people find what they need in the ocean of information.
Doctors or patients who want to follow particular medical information or notices often cannot browse the medical portal websites or local hospital websites every day, so they may well miss important information or notices; moreover, viewing this information means opening multiple web pages and moving back and forth among them. If this medical information could be browsed on a single web page, with timely e-mail or SMS notification, it would be far more convenient and important information would not be missed.
Summary of the invention
In view of the above problems, the purpose of the present invention is to provide an acquisition and pushing method for an Internet medical information inquiry system that crawls the information or notices on medical portal websites and local hospital websites for the user and presents the collected data to the user in a synchronized manner.
To solve the above technical problem, the present invention is implemented as follows:
An acquisition and pushing method for an Internet medical information inquiry system, wherein the medical information inquiry system comprises a client browser and a server, the server comprises a business logic layer, a data crawling layer and a view layer, and the method comprises the following steps:
1) Create a business logic layer. The business logic layer takes the request object as a parameter, obtains the client browser's request from the request object, and returns notice information to the view layer. It processes the parameters submitted by the browser, generates the URL of the medical portal website or the URL of the local hospital website, and passes that URL as a parameter to the data crawling layer;
2) Create a data crawling layer. The data crawling layer takes the URL submitted by the business logic layer as a parameter, parses the page corresponding to that URL to obtain all notice lists on the page, parses every notice in each list one by one, and finally encapsulates the crawled notice information in an object array that is returned to the business logic layer;
3) Create a view layer. The view layer consists mainly of one static HTML page and two dynamic JSP pages. The static HTML page is the system's home page and is responsible for submitting the keyword entered by the user; the dynamic JSP pages display the latest notice list and the content of a specific notice;
The user selects a medical portal website or a local hospital website in the client browser. The server obtains the selected keyword and submits it to that medical portal website or local hospital website, crawls the information or notice information from the website's search results page, processes the crawled data, and dynamically generates a web page that is presented to the user.
Step 1) is implemented as follows: ① Class: GetNotice, created as the functional implementation class of the business logic layer; ② Method: GetNotice(HttpServletRequest request), the constructor, which is created with the request object as its parameter and obtains the user's request parameters from the request object; ③ The method public NoticeInfoData[] GetInfo() obtains the notice information requested by the user, that is, the news or notices the user cares about, and returns it as the return value.
Step 2) is implemented as follows: ① Class: NoticeInfoGet, the functional implementation class of the data crawling layer; ② Method: NoticeInfoGet(String url), the constructor, which takes the page URL as a parameter when the object is created; ③ Method: public NoticeInfoData[] GetInfo(), which crawls the notice information of the page corresponding to the URL and returns it as the return value.
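The two classes described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: a plain Map stands in for HttpServletRequest so the sketch compiles without a servlet container, the fields of NoticeInfoData are inferred from the items the description lists (link, title, release date, content), the target URL scheme on example.com is hypothetical, and the crawler returns a placeholder notice instead of downloading a page.

```java
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** One crawled notice; the field set is an assumption based on the description. */
class NoticeInfoData {
    final String link;
    final String title;
    final String date;
    final String content;

    NoticeInfoData(String link, String title, String date, String content) {
        this.link = link;
        this.title = title;
        this.date = date;
        this.content = content;
    }
}

/** Data crawling layer: constructed with a page URL, returns the notices of that page. */
class NoticeInfoGet {
    private final String url;

    NoticeInfoGet(String url) {
        this.url = url;
    }

    /** Placeholder: a real implementation would download and parse the page at url. */
    NoticeInfoData[] GetInfo() {
        List<NoticeInfoData> notices = new ArrayList<>();
        notices.add(new NoticeInfoData(url, "sample notice", "2018-12-27", "sample content"));
        return notices.toArray(new NoticeInfoData[0]);
    }
}

/** Business logic layer: turns request parameters into a URL and delegates to the crawler. */
class GetNotice {
    private final Map<String, String> request; // stand-in for HttpServletRequest (assumption)

    GetNotice(Map<String, String> request) {
        this.request = request;
    }

    NoticeInfoData[] GetInfo() {
        String key = request.getOrDefault("key", "");
        // Hypothetical URL scheme; the source does not give the target sites' URL formats.
        String url = "http://example.com/search?site="
                + URLEncoder.encode(key, StandardCharsets.UTF_8);
        return new NoticeInfoGet(url).GetInfo();
    }
}
```

Keeping URL generation in GetNotice and page parsing in NoticeInfoGet mirrors the layering in the description: the view layer only ever sees the NoticeInfoData array.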
Compared with the prior art, the beneficial effects of the present invention are as follows:
The present invention can crawl relevant information or notice information from medical portal websites and local hospital websites respectively; this information includes the home page link, title, release date and specific content. The information or notices of the medical portal website and the local hospital website can be displayed together on the same page, and the user can click a hyperlink on the page to view the content of a specific item. At the same time, because the system crawls in real time, the accuracy of the crawled information is ensured, and the system copes well with updates to the medical portal websites or local hospital websites.
Brief description of the drawings
Fig. 1 is a flow chart of the Internet medical information inquiry system of the present invention;
Fig. 2 is a data flow diagram of the present invention.
Detailed description of the embodiments
The present invention is further described below with reference to embodiments. The described embodiments are only some of the embodiments of the present invention, not all of them; all other embodiments obtained by those skilled in the art on the basis of these embodiments without creative effort fall within the protection scope of the present invention.
As shown in Figs. 1 and 2, an acquisition and pushing method for an Internet medical information inquiry system, wherein the medical information inquiry system comprises a client browser and a server, the server comprises a business logic layer, a data crawling layer and a view layer, and the method comprises the following steps:
1) Create a business logic layer. The business logic layer takes the request object as a parameter, obtains the client browser's request from the request object, and returns notice information to the view layer. It processes the parameters submitted by the browser, generates the URL of the medical portal website or the URL of the local hospital website, and passes that URL as a parameter to the data crawling layer;
2) Create a data crawling layer. The data crawling layer takes the URL submitted by the business logic layer as a parameter, parses the page corresponding to that URL to obtain all notice lists on the page, parses every notice in each list one by one, and finally encapsulates the crawled notice information in an object array that is returned to the business logic layer;
3) Create a view layer. The view layer consists mainly of one static HTML page and two dynamic JSP pages. The static HTML page is the system's home page and is responsible for submitting the keyword entered by the user; the dynamic JSP pages display the latest notice list and the content of a specific notice;
The user selects a medical portal website or a local hospital website in the client browser. The server obtains the selected keyword and submits it to that medical portal website or local hospital website, crawls the information or notice information from the website's search results page, processes the crawled data, and dynamically generates a web page that is presented to the user.
I. Business logic layer
The core task of the business logic layer is to process the information submitted by the user, generate the URL of the corresponding website, and submit that URL to the data crawling layer for crawling. The class of the business logic layer is named GetNotice; its members and member functions are listed in Table 1.
Table 1
Member or function | Description
HttpServletRequest request | Stores a reference to the request object of the JSP page
PageInfoData pageData | Stores the page-related status information of the user's request
public GetNotice(HttpServletRequest request) | Constructor
private NoticeInfoData[] Search() | Called when the user submits a search request
private NoticeInfoData[] Neighbor() | Called when the user requests the notices of an adjacent page
public NoticeInfoData[] GetInfo() | Obtains the notice data of the website requested by the user
GetInfo() is the core method of class GetNotice. It responds to the user's request and must determine whether the request is a search for notices or a request to view the notices of an adjacent page. It is implemented as follows:
The URL with which the user accesses the system has one of two formats:
http://121.48.204.6:8080/NotificationsQuery/search.jsp?key=<website name>
or
http://121.48.204.6:8080/NotificationsQuery/search.jsp?key=<website name>&page=<current page number>, together with the total number of notice pages.
The first format is the search request submitted to the system from the drop-down list; the second format is the adjacent-page request issued when the user clicks "Previous page" or "Next page". In the second format there is always a page parameter greater than 1. In this case the business logic layer instantiates an object pageData of type PageInfoData, assigns the value of the page parameter to the object's member page, and assigns total to the object's member total. When generating the URL, the business logic layer first compares the members page and total of pageData; if page is greater than total, no URL is generated. If the user submitted a search request, the system calls the Search method; otherwise it calls the Neighbor method. These two methods generate the URL according to the specific URL structure of the search page and of the adjacent page respectively, and then send it to the data crawling layer.
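The decision between the two request formats can be sketched as below. This is only an illustration under assumptions: the request parameters are passed as a plain Map rather than an HttpServletRequest, the name of the parameter carrying the total page count ("total") is assumed because the translated URL format does not state it clearly, and RequestRouter with its string results "search" and "neighbor" is a hypothetical stand-in for calling the Search and Neighbor methods.

```java
import java.util.Map;
import java.util.Optional;

/** Page state of an adjacent-page request, as described for pageData. */
class PageInfoData {
    final int page;
    final int total;

    PageInfoData(int page, int total) {
        this.page = page;
        this.total = total;
    }
}

class RequestRouter {
    /**
     * Returns "search" for a first-format request (no page parameter),
     * "neighbor" for a valid second-format request, and an empty result
     * when page exceeds total or the numbers are missing or malformed,
     * in which case no URL should be generated.
     */
    static Optional<String> route(Map<String, String> params) {
        if (!params.containsKey("page")) {
            return Optional.of("search");
        }
        try {
            PageInfoData pageData = new PageInfoData(
                    Integer.parseInt(params.get("page")),
                    Integer.parseInt(params.get("total"))); // "total" is an assumed parameter name
            if (pageData.page > pageData.total) {
                return Optional.empty();
            }
            return Optional.of("neighbor");
        } catch (NumberFormatException e) {
            // Missing or non-numeric values are treated like an out-of-range page.
            return Optional.empty();
        }
    }
}
```

Returning an empty Optional rather than throwing keeps the "no URL is generated" case explicit for the caller, which can then skip the data crawling layer altogether.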
II. Data crawling layer
The data crawling layer obtains the notice information of a page according to the URL submitted by the layer above it (the business logic layer), encapsulates the notice information in an object array, and returns it to that layer. The algorithm for crawling the information lists of a medical portal website is taken as an example.
Table 2
Member | Description
PageInfoData pageData | Stores the page requested by the user
pageUrl | The URL that currently needs to be parsed
public RegistryInfo Get(PageInfoData pageDatal, String url) | Constructor
private List GetNoticeInfoList() | Obtains the notices of the current page
The andrology information list and the other information lists are crawled in the same way: all hyperlinks are selected with a Jsoup selector, and the required information list is then filtered out by regular-expression matching; only the regular-expression pattern differs between lists. Analysis of their HTML text shows that the pattern for screening the andrology information list, the pattern for the gynaecology information list and the pattern for the paediatrics information list are the same, namely "<a\sclass="news"\shref="shownews.*>.*</a>".
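The filtering step can be illustrated with the Java standard library alone. The sketch below is an assumption-laden variant rather than the method of the description: it applies a regular expression directly to the raw HTML instead of first selecting hyperlinks with Jsoup, it tightens the pattern with lazy quantifiers and capture groups so that each match covers exactly one anchor, and the class name NewsLinkFilter and the sample file name shownews.asp in the usage are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NewsLinkFilter {
    // Tightened variant of the pattern in the description: group 1 is the href
    // starting with "shownews", group 2 is the link text of the same anchor.
    private static final Pattern NEWS_LINK = Pattern.compile(
            "<a\\s+class=\"news\"\\s+href=\"(shownews[^\"]*)\"[^>]*>(.*?)</a>",
            Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    /** Returns {href, title} pairs for every matching anchor in the HTML. */
    static List<String[]> extract(String html) {
        List<String[]> links = new ArrayList<>();
        Matcher m = NEWS_LINK.matcher(html);
        while (m.find()) {
            links.add(new String[] { m.group(1), m.group(2).trim() });
        }
        return links;
    }
}
```

For example, calling NewsLinkFilter.extract on a page fragment containing two anchors with class "news" and one navigation anchor with another class yields only the two news entries. The original greedy ".*" would be more likely to span several anchors on one line, which is why the sketch uses "[^\"]*" and ".*?" instead.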
III. System view layer
The system view layer consists mainly of three web pages: one static home page and two notice result pages generated dynamically with JSP. The static home page is written in static HTML and contains the system logo, an introduction to the system, and some pictures.
The above are only embodiments of the present invention. It should be reiterated that those skilled in the art can make several improvements to the present invention without departing from its principle, and these improvements also fall within the protection scope of the claims of the present invention.
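What a notice result page has to produce can be sketched in plain Java. The description generates these pages with JSP; the helper below only illustrates the same rendering step outside a JSP container, and the class name NoticeListView, the {link, title, date} array layout and the exact markup are assumptions made for this example.

```java
import java.util.List;

class NoticeListView {
    /** Escapes the characters that are unsafe inside HTML text and attribute values. */
    static String escape(String s) {
        return s.replace("&", "&amp;")
                .replace("<", "&lt;")
                .replace(">", "&gt;")
                .replace("\"", "&quot;");
    }

    /** Renders notices as an HTML list; each item is {link, title, date}. */
    static String render(List<String[]> notices) {
        StringBuilder html = new StringBuilder("<ul>\n");
        for (String[] n : notices) {
            html.append("  <li><a href=\"").append(escape(n[0])).append("\">")
                .append(escape(n[1])).append("</a> (")
                .append(escape(n[2])).append(")</li>\n");
        }
        return html.append("</ul>").toString();
    }
}
```

Escaping matters here because titles are crawled from third-party pages and would otherwise be inserted into the generated page verbatim.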

Claims (3)

1. An acquisition and pushing method for an Internet medical information inquiry system, characterized in that the medical information inquiry system comprises a client browser and a server, the server comprises a business logic layer, a data crawling layer and a view layer, and the method comprises the following steps:
1) creating a business logic layer, wherein the business logic layer takes the request object as a parameter, obtains the client browser's request from the request object, and returns notice information to the view layer; the business logic layer processes the parameters submitted by the browser, generates the URL of the medical portal website or the URL of the local hospital website, and passes that URL as a parameter to the data crawling layer;
2) creating a data crawling layer, wherein the data crawling layer takes the URL submitted by the business logic layer as a parameter, parses the page corresponding to that URL to obtain all notice lists on the page, parses every notice in each list one by one, and finally encapsulates the crawled notice information in an object array that is returned to the business logic layer;
3) creating a view layer, wherein the view layer consists mainly of one static HTML page and two dynamic JSP pages, the static HTML page is the system's home page and is responsible for submitting the keyword entered by the user, and the dynamic JSP pages display the latest notice list and the content of a specific notice;
the user selects a medical portal website or a local hospital website in the client browser; the server obtains the selected keyword and submits it to that medical portal website or local hospital website, crawls the information or notice information from the website's search results page, processes the crawled data, and dynamically generates a web page that is presented to the user.
2. The acquisition and pushing method for an Internet medical information inquiry system according to claim 1, characterized in that:
step 1) is implemented as follows: ① Class: GetNotice, created as the functional implementation class of the business logic layer; ② Method: GetNotice(HttpServletRequest request), the constructor, which is created with the request object as its parameter and obtains the user's request parameters from the request object; ③ the method public NoticeInfoData[] GetInfo() obtains the notice information requested by the user and returns it as the return value.
3. The acquisition and pushing method for an Internet medical information inquiry system according to claim 1, characterized in that:
step 2) is implemented as follows: ① Class: NoticeInfoGet, the functional implementation class of the data crawling layer; ② Method: NoticeInfoGet(String url), the constructor, which takes the page URL as a parameter when the object is created; ③ Method: public NoticeInfoData[] GetInfo(), which crawls the notice information of the page corresponding to the URL and returns it as the return value.
CN201811610965.7A 2018-12-27 2018-12-27 A kind of acquisition of internet medical information inquiry system and method for pushing Pending CN109710829A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811610965.7A CN109710829A (en) 2018-12-27 2018-12-27 A kind of acquisition of internet medical information inquiry system and method for pushing

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811610965.7A CN109710829A (en) 2018-12-27 2018-12-27 A kind of acquisition of internet medical information inquiry system and method for pushing

Publications (1)

Publication Number Publication Date
CN109710829A true CN109710829A (en) 2019-05-03

Family

ID=66258694

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811610965.7A Pending CN109710829A (en) 2018-12-27 2018-12-27 A kind of acquisition of internet medical information inquiry system and method for pushing

Country Status (1)

Country Link
CN (1) CN109710829A (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102999549A (en) * 2012-09-25 2013-03-27 金博 Method for realizing web crawler tasks
CN104881501A (en) * 2015-06-19 2015-09-02 四川大学 Automatic Internet information obtaining and pushing method

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190503