CN109710829A - A kind of acquisition of internet medical information inquiry system and method for pushing - Google Patents
A kind of acquisition of internet medical information inquiry system and method for pushing Download PDFInfo
- Publication number
- CN109710829A CN109710829A CN201811610965.7A CN201811610965A CN109710829A CN 109710829 A CN109710829 A CN 109710829A CN 201811610965 A CN201811610965 A CN 201811610965A CN 109710829 A CN109710829 A CN 109710829A
- Authority
- CN
- China
- Prior art keywords
- page
- information
- layer
- url
- business logic
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Information Transfer Between Computers (AREA)
Abstract
The invention discloses a kind of acquisition of internet medical information inquiry system and method for pushing, it includes the following steps: 1) to create Business Logic, Business Logic according to the request of request object acquisition user and returns to notification information to view layer using request object as parameter;2) creation data crawl layer, and data are crawled layer and obtained notification information using page URL as parameter, and the notification information that will acquire returns to Business Logic;3) view layer is created, view layer is mainly made of a static HTML page and two DYNAMIC J SP pages, and static HTML page is the homepage of system, is responsible for submitting the keyword of user's input, the DYNAMIC J SP page shows newest notice list and specific content of announcement.The present invention can crawl relevent information or notification information from medical portal website or civilian hospital website respectively, medical portal website or civilian hospital website information or notice can be shown simultaneously in the same page, user can click the particular content that page hyperlink checks information or notice.
Description
Technical field
The present invention relates to searching engine fields, and in particular to one kind can crawl each medical portal website and place for user
Information or notice on Hospital Website, and these data are synchronized into the automatic acquisition for the internet information being presented to the user and are pushed
Method.
Background technique
In face of huge Internet resources, search engine provides a portal website, and that does not exaggerate says, browses this door
All users of family website surf the Internet in any place can search any place for oneself thinking search, therefore it has become and removes
The at most online service that uses of people outside Email.What is search engine? how does it work? this word is extensive
It is different using in internet area, but in different region significance of which.In the U.S., search engine is typically referred to based on mutual
The search engine of networking, they arrive several hundred million network robot programs by the tens million of of collection network, each word is searched to be drawn
It holds up and includes, that is, the full-text search described in us, famous internet search engine has Google etc..In China, search is drawn
The search service for being often referred to the service of the directory search based on Web or specific website is held up, which searched based on a specific website
The specific markers research of rope technology.The search engine of broad sense, which refers to, provides the tool or system of information retrieval service on internet,
The searching request of user, the system for returning to corresponding search result are responded on the internet or by internet;Narrow sense is searched
Index holds up the resource referred mainly on automatic or manual search web and is organized into a database, and search is provided in the form of website
Collection, analysis and the index service of service provide the system of information service using network for user.Briefly: WWW search is drawn
Hold up be one group of network environment information retrieval system, it is usually there are two types of different modes: one is the search of classified catalogue type, mutually
Networked resources carry out collection arrangement, provide different resource inputs to different Directory Types, figure layer of then classifying, and user can be with
Their desired classification information layers are found, are then arrived at the destination, oneself desired information is found;Another kind is based on keyword
Retrieval, user can input the keyword of various logic composite type, and search engine is found according to certain rules solves these
Then resource required for keyword feeds back to user again and includes the information of keyword, and is linked to these websites.It is true
On, search engine is exactly a website, it is the website of special offer search service, it uses unique scheme and collects interconnection
Online all information help people to search the website of the information of oneself needs in information ocean.
The doctor or patient for wanting to know about some specific medical information or notice, do not ensure that many times and daily can
Medical portal website or civilian hospital website are browsed, can very likely miss some important information or notice in this way, and want
It checks that these information need point to open multiple webpages, shuttles in these webpages.If these medical informations can be in a net
It is browsed on page, and timely mail notification or short massage notice, then can be conveniently many for us, one will not be missed
A little important informations.
Summary of the invention
In view of the above-mentioned problems, the purpose of the present invention is to provide one kind can crawl medical portal website and place doctor for user
Information or notice on institute website, and these data are synchronized to the acquisition for the internet medical information inquiry system being presented to the user
With method for pushing.
In order to solve the above-mentioned technical problem, the present invention is accomplished by the following way:
A kind of acquisition of internet medical information inquiry system and method for pushing, medical information inquiry system include client
Browser and server is held, the server includes that Business Logic, data crawl layer and view layer, is included the following steps:
1) Business Logic is created, Business Logic is using request object as parameter, according to request object acquisition visitor
The request of family end browser simultaneously returns to notification information to view layer, and Business Logic handles the parameter that browser is submitted,
The medical treatment URL of the portal website or URL of civilian hospital website is generated, and URL is submitted into data as parameter and crawls layer;
2) creation data crawl layer, and data crawl URL that layer is submitted using Business Logic as parameter, corresponding to the URL
The page is parsed, and is obtained all notice lists of the page, is parsed one by one for every notice in list, finally crawling
Notification information is encapsulated in an object array, returns to Business Logic;
3) view layer is created, view layer is mainly made of a static html page and two dynamic JSP page, quiet
The html page of state is the homepage of system, is responsible for submitting the keyword of user's input, the DYNAMIC J SP page shows newest notice list
With specific content of announcement;
User clicks medical portal website or civilian hospital website using client browser, and server obtains the pass clicked
Key word, and medical portal website or civilian hospital website are submitted, then by the information or notice of the search results pages of these websites
Information crawler goes out, and by the data crawled out, dynamic generation webpage is presented to the user again after treatment.
The step 1) is implemented as follows: 1. class: GetNotice, and the function of establishing a Business Logic is realized
Class;2. method: GeNotice (HttpServletRequest request), this method is building method, constructs a construction
Function GeNotice is created using request object as parameter, according to the parameter request of request object acquisition user;3. adopting
The notification information namely user that user's request is obtained with method public NoticeInfoData [] GetInfo () are closed
The news or notification information of the heart, and returned as return value.
The step 2) the specific implementation process is as follows: 1. class: NoticeInfoGet, such crawls the function of layer for data
It is able to achieve class;2. method: NoticeInfoGet (String url), the method is building method, with the page when creating object
URL is as parameter;3. method: public NoticeInfoData [] GetInfo (), this method crawl URL corresponding page
Notification information, and returned as return value.
Compared with prior art, the beneficial effects of the present invention are embodied in:
The present invention can crawl relevent information or notification information, information from medical portal website or civilian hospital website respectively
Or notification information includes homepage link, title, issue date, particular content, can show medical treatment simultaneously in the same page
Portal website or civilian hospital website information or notice, user click the particular content that page hyperlink checks information or notice;
The accuracy for crawling information or notification information that the system of ensure that crawls in real time simultaneously, can cope with medical portal website very well
Or the update of civilian hospital website.
Detailed description of the invention
Fig. 1 is the flow chart of medical information inquiry system in internet of the present invention;
Fig. 2 is data flow figure of the invention.
Specific embodiment
The present invention will be further described with reference to the examples below, and described embodiment is only present invention a part
Embodiment is not whole embodiment, and based on the embodiments of the present invention, those skilled in the art are not making
Other embodiments used obtained, belong to protection scope of the present invention under the premise of creative work.
As shown in Fig. 1~2, a kind of acquisition of internet medical information inquiry system and method for pushing, medical information are looked into
Inquiry system includes client browser and server, and the server includes that Business Logic, data crawl layer and view layer, is wrapped
Include following steps:
1) Business Logic is created, Business Logic is using request object as parameter, according to request object acquisition visitor
The request of family end browser simultaneously returns to notification information to view layer, and Business Logic handles the parameter that browser is submitted,
The medical treatment URL of the portal website or URL of civilian hospital website is generated, and URL is submitted into data as parameter and crawls layer;
2) creation data crawl layer, and data crawl URL that layer is submitted using Business Logic as parameter, corresponding to the URL
The page is parsed, and is obtained all notice lists of the page, is parsed one by one for every notice in list, finally crawling
Notification information is encapsulated in an object array, returns to Business Logic;
3) view layer is created, view layer is mainly made of a static html page and two dynamic JSP page, quiet
The html page of state is the homepage of system, is responsible for submitting the keyword of user's input, the DYNAMIC J SP page shows newest notice list
With specific content of announcement;
User clicks medical portal website or civilian hospital website using client browser, and server obtains the pass clicked
Key word, and medical portal website or civilian hospital website are submitted, then by the information or notice of the search results pages of these websites
Information crawler goes out, and by the data crawled out, dynamic generation webpage is presented to the user again after treatment.
One, Business Logic
The core of Business Logic is by information processing that user submits and to generate the URL of corresponding website, and URL is submitted to
Data crawl layer progress data and crawl.The class of Business Logic is called GetNotice, its member and member function see the table below
One.
Table one
Member and function | Brief introduction |
HttpServletRequest request | Store the request object reference of JSP page |
PageInfoData pageData | Store the page related status information of user's request |
Public GetNotice(HttpServletRequest request) | Constructed fuction |
private NoticeInfoData[]Search() | User calls the functional based method when submitting searching request |
private NoticeInfoData[]Neighbor() | User's request calls the functional based method when checking adjacent page notice |
public NoticeInfoData[]GetInfo() | Obtain the notification data that user requests website |
Function GetInfo () is the core methed of class GetNotice, and this method is in response to user's request, it needs to sentence
Search notice is also to look at adjacent page notice when the request of disconnected user, is implemented as follows:
The URL format of user's access system are as follows:
Http: // 121.48.204.6:8080/NotificationsQuery/search.jsp? the website key=name or
Person
Http: // 121.48.204.6:8080/NotificationsQuery/search.jsp? the website key=name &
The current page number of page=notifies total page number.
It is the searching request for pulling down list and submitting to system for the first format;Second of format be user click " on
The request of the adjacent page of one page " or " lower one page ".In second of format, centainly there is page parameter to be greater than 1, in this case, industry
Business logical layer can instantiate the object pageData of a PageInfoData type, by the value of parameter page be assigned to object at
Total, is assigned to the member total of object by member page.Business Logic judges object during generating URL
The size relation of the member page and total of pageData, then generate URL, if the member page of object pageData is big
In total, URL is not generated.If what user submitted is searching request, system can call Search method, otherwise call
Neighbor method.The two methods can search for respectively and the specific URL structure of adjacent page generates URL, be then forwarded to number
According to crawling layer.
Two, data crawl layer
The function that data crawl layer obtains the notification information of the page according to the URL that upper one layer (Business Logic) is submitted,
And notification information is encapsulated in object array and returns to upper one layer;As the information list of medical portal website crawls calculation
Method.
Table two
Member | Brief introduction |
PageInfoData pageData | Store the page of user's request |
pageUrl | What current needs parsed |
Public RegistryInfo Get(PageInfoData pageDatal,String url) | Constructed fuction |
Private List GetNoticeInfoList() | Obtain the notice of current page |
The method for crawling andrology's information list and information list are as follows: all hyperlink are selected using Jsoup selector, then
Information list required for going matching to filter out according to regular expressions, only regular expression matching mode is different here.It is logical
Cross the matching of match pattern and screening gynaecology's information list that screening andrology's information list is found to the analysis of their html texts
Mode as the mode of paediatrics information list, be "<a sclass=" news " shref=" shownews.*>.*
</a>"。
Three, system view layer
System view layer is mainly made of three web pages, and a static homepage, two are generated by JSP dynamic technique
Notice result page, static home page is mainly to be generated using static HTML, and the inside includes system logo, system introduction and some
Picture.
The above is only embodiments of the present invention, is stated again, and those skilled in the art are come
It says, without departing from the principle of the present invention, several improvement can also be carried out to the present invention, these improvement are also included in the present invention
In scope of protection of the claims.
Claims (3)
1. acquisition and the method for pushing of a kind of internet medical information inquiry system, it is characterised in that: medical information inquiry system
Including client browser and server, the server includes that Business Logic, data crawl layer and view layer, including as follows
Step:
1) Business Logic is created, Business Logic is using request object as parameter, according to request object acquisition client
The request of browser simultaneously returns to notification information to view layer, and Business Logic handles the parameter that browser is submitted, and generates
The URL of the medical portal website or URL of civilian hospital website, and URL is submitted into data as parameter and crawls layer;
2) creation data crawl layer, and data crawl URL that layer is submitted using Business Logic as parameter, the page corresponding to the URL
It is parsed, obtains all notice lists of the page, parsed one by one for every notice in list, finally the notice crawled
Information encapsulation returns to Business Logic in an object array;
3) view layer is created, view layer is mainly made of a static html page and two dynamic JSP page, static
Html page is the homepage of system, is responsible for submitting the keyword of user's input, the DYNAMIC J SP page shows newest notice list and tool
Body content of announcement;
User clicks medical portal website or civilian hospital website using client browser, and server obtains the key clicked
Word, and medical portal website or civilian hospital website are submitted, then the information of the search results pages of these websites or notice are believed
Breath crawls out, and by the data crawled out, dynamic generation webpage is presented to the user again after treatment.
2. acquisition and the method for pushing of a kind of internet medical information inquiry system as described in claim 1, it is characterised in that:
The step 1) is implemented as follows: 1. class: GetNotice, and the function of establishing a Business Logic realizes class;②
Method: GetNotice (HttpServletRequest request), this method is building method, constructs a constructed fuction
GeNotice is created using request object as parameter, according to the parameter request of request object acquisition user;3. using side
Method public NoticeInfoData [] GetInfo () obtains the notification information of user's request, and as return value
It returns.
3. acquisition and the method for pushing of a kind of internet medical information inquiry system as described in claim 1, it is characterised in that:
The step 2) the specific implementation process is as follows: 1. class: NoticeInfoGet, such is real for the function that data crawl layer
Existing class;2. method: NoticeInfoGet (String url), the method is building method, with the URL of the page when creating object
As parameter;3. method: public NoticeInfoData [] GetInfo (), this method crawl the notice of URL corresponding page
Information, and returned as return value.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811610965.7A CN109710829A (en) | 2018-12-27 | 2018-12-27 | A kind of acquisition of internet medical information inquiry system and method for pushing |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811610965.7A CN109710829A (en) | 2018-12-27 | 2018-12-27 | A kind of acquisition of internet medical information inquiry system and method for pushing |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109710829A true CN109710829A (en) | 2019-05-03 |
Family
ID=66258694
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811610965.7A Pending CN109710829A (en) | 2018-12-27 | 2018-12-27 | A kind of acquisition of internet medical information inquiry system and method for pushing |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109710829A (en) |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102999549A (en) * | 2012-09-25 | 2013-03-27 | 金博 | Method for realizing web crawler tasks |
CN104881501A (en) * | 2015-06-19 | 2015-09-02 | 四川大学 | Automatic Internet information obtaining and pushing method |
-
2018
- 2018-12-27 CN CN201811610965.7A patent/CN109710829A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102999549A (en) * | 2012-09-25 | 2013-03-27 | 金博 | Method for realizing web crawler tasks |
CN104881501A (en) * | 2015-06-19 | 2015-09-02 | 四川大学 | Automatic Internet information obtaining and pushing method |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7499965B1 (en) | Software agent for locating and analyzing virtual communities on the world wide web | |
US20210303529A1 (en) | Hierarchical structured data organization system | |
Zaıane | Resource and knowledge discovery from the internet and multimedia repositories | |
US10042898B2 (en) | Weighted metalabels for enhanced search in hierarchical abstract data organization systems | |
Matsuo et al. | Average-clicks: A new measure of distance on the World Wide Web | |
Devi et al. | An efficient approach for web indexing of big data through hyperlinks in web crawling | |
Kumar et al. | Web structure mining: Exploring hyperlinks and algorithms for information retrieval | |
Kumar et al. | Design of a mobile Web crawler for hidden Web | |
US20140164342A1 (en) | Human threading search engine | |
CN104881501A (en) | Automatic Internet information obtaining and pushing method | |
CN109710829A (en) | A kind of acquisition of internet medical information inquiry system and method for pushing | |
Khan et al. | Self-adaptive ontology-based focused crawling: a literature survey | |
Liu et al. | Constructing a reliable Web graph with information on browsing behavior | |
Malik et al. | Ontology and Web Usage Mining towards an Intelligent Web focusing web logs | |
Xu et al. | Method of deep web collection for mobile application store based on category keyword searching | |
Yang et al. | Study on food safety semantic retrieval system based on domain ontology | |
Pirnau | Considerations on the functions and importance of a web crawler | |
Sharma et al. | A review study of server log formats for efficient web mining | |
Wang et al. | Research on Internet Corpus Collection Method | |
Chunyu et al. | Network data acquisition method based on crop pest control knowledge | |
Xu et al. | A Preliminary Study of Sort Algorithm for Internet Personal Information Search System | |
PANI | MACHINE LEARNING TECHNIQUES IN DATA MINING APPLICATIONS | |
Kum | word quer | |
Mishra et al. | Web Mining Using Topic Sensitive Weighted PageRank | |
Chaudhry et al. | Information extraction from heterogeneous sources using domain ontologies |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190503 |