CN105718508A - Aquaculture information collecting and processing system - Google Patents

Aquaculture information collecting and processing system

Info

Publication number
CN105718508A
Authority
CN
China
Prior art keywords
information
aquaculture
data
module
url
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610009741.5A
Other languages
Chinese (zh)
Inventor
刘延忠
阮怀军
孙传仁
王利民
封文杰
郑纪业
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute Of S&t Information Shandong Academy Of Agricultural Sciences
Original Assignee
Institute Of S&t Information Shandong Academy Of Agricultural Sciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute Of S&t Information Shandong Academy Of Agricultural Sciences
Priority to CN201610009741.5A
Publication of CN105718508A
Current legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90 Details of database functions independent of the retrieved data types
    • G06F16/95 Retrieval from the web
    • G06F16/953 Querying, e.g. by the use of web search engines
    • G06F16/9535 Search customisation based on user profiles and personalisation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00 Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/02 Agriculture; Fishing; Mining

Abstract

The invention discloses an aquaculture information collecting and processing system comprising a Nutch web page grabber, a topic filtering module, an information extraction module, an aquaculture information database, a data processing module, a Web server and a browser. Based on topic filtering and information extraction technologies for aquaculture service information, the system grabs, classifies, filters, extracts, stores and updates network resources, and classifies the extracted information reasonably and accurately by building an aquaculture dictionary. Using cloud computing technology and artificial intelligence algorithms, an integrated aquaculture information service system based on networked services and Web services is developed, providing a high-quality aquaculture information service.

Description

An aquaculture information collecting and processing system
Technical field
The invention belongs to the technical field of aquaculture and relates to an aquaculture information collecting and processing system.
Background
Aquaculture is the production activity of breeding, cultivating and harvesting aquatic animals and plants under human control. With the rapid development of the aquaculture industry, more and more modern farming methods, such as cage culture and flowing-water culture, have come into wide use, along with technologies for the three-dimensional use of water bodies, combined land-and-water ecological fishery production, and the sustainable use of fishery resources. The development of Internet of Things technology has created favorable conditions for the aquaculture industry. However, current vertical search engines for aquaculture information search few web pages, extract information incompletely, and filter and classify topics inaccurately. Massive aquaculture data are also difficult to store, and existing data mining methods are poorly targeted. Aquaculture big data therefore faces four key problems: data preparation, data storage, data platforms and data processing. As a result, the great demand of aquaculture practitioners for visualized resources cannot be met.
Summary of the invention
The object of the invention is to overcome the defects of the above technology by providing an aquaculture information collecting and processing system. Based on topic filtering and information extraction technologies for aquaculture service information, the system grabs, classifies, filters, extracts, stores and updates network resources, and classifies the extracted information reasonably and accurately by building an aquaculture dictionary. Using cloud computing technology and artificial intelligence algorithms, an integrated aquaculture information service system based on networked services and Web services is developed, providing a high-quality aquaculture information service.
The specific technical solution is as follows:
An aquaculture information collecting and processing system comprises a Nutch web page grabber, a topic filtering module, an information extraction module, an aquaculture information database, a data processing module, a Web server and a browser.
The Nutch web page grabber starts grabbing from the URLs of current domestic aquaculture information service platforms, visits all web pages according to a link-grabbing strategy, parses out the web page content and new URLs, and sends the parsed web page information and new URL information to the topic filtering module for filtering.
The topic filtering module uses a topic filtering algorithm to filter out web pages and URLs unrelated to the aquaculture topic; the remaining URLs then enter the URL library, and the remaining web pages are sent to the information extraction module.
The information extraction module uses information extraction techniques to extract the aquaculture web page data obtained after filtering by the topic filtering module into structured information data; files in other formats are extracted through the Ultraseek file connector and corresponding information extraction techniques. The extracted data are stored in the aquaculture information database.
The data processing module builds a reasonable and comprehensive aquaculture dictionary, obtains the characteristic information of network resources according to the dictionary, and performs statistical analysis and classification. Using currently available artificial intelligence algorithms, it mines and comparatively analyzes the data and establishes a data analysis model; through model analysis it obtains intelligent, in-depth and valuable information.
Web server and browser: the browser communicates with the Web server over the Internet, so that aquaculture practitioners can obtain relevant information according to their own needs.
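By way of illustration only, the page parsing and topic filtering steps described above might be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the Nutch-based implementation: the term list `AQUACULTURE_TERMS`, the scoring rule in `topic_score`, the threshold value and the function `process_page` are all hypothetical, and the source does not disclose the actual topic filtering algorithm. The sketch also simplifies by discarding the links of an off-topic page together with the page itself.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical mini-dictionary; the source does not list its dictionary terms.
AQUACULTURE_TERMS = {"aquaculture", "fish", "shrimp", "pond", "cage culture", "fishery"}


class PageParser(HTMLParser):
    """Collects text fragments and absolute outgoing links from one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self.text_parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                # Resolve relative links against the page URL.
                self.links.append(urljoin(self.base_url, href))

    def handle_data(self, data):
        if data.strip():
            self.text_parts.append(data.strip())


def topic_score(text, terms=AQUACULTURE_TERMS):
    """Fraction of dictionary terms occurring in the text (assumed scoring rule)."""
    lowered = text.lower()
    return sum(1 for term in terms if term in lowered) / len(terms)


def process_page(url, html, url_store, threshold=0.3):
    """Parse an already-fetched page; keep its text and links only if on-topic.

    Returns the page text for the extraction step, or None if filtered out.
    """
    parser = PageParser(url)
    parser.feed(html)
    text = " ".join(parser.text_parts)
    if topic_score(text) < threshold:
        return None
    url_store.update(parser.links)  # new URLs enter the URL library
    return text
```

Here `url_store` plays the role of the URL library as a plain Python set; fetching pages over the network is deliberately left out so that the sketch stays self-contained.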
Further, the characteristic information of the network resources includes word frequency, word position, word length and web page.
Further, the currently available artificial intelligence algorithms include neural network algorithms, genetic algorithms, decision tree methods, rough set methods, statistical analysis methods and fuzzy set methods.
Compared with the prior art, the beneficial effects of the invention are as follows:
Based on topic filtering and information extraction technologies for aquaculture service information, the invention grabs, classifies, filters, extracts, stores and updates network resources, and classifies the extracted information reasonably and accurately by building an aquaculture dictionary. Using cloud computing technology and artificial intelligence algorithms, an integrated aquaculture information service system based on networked services and Web services is developed, providing a high-quality aquaculture information service.
Description of the drawings
Fig. 1 is a schematic diagram of the aquaculture information collecting and processing system of the invention.
Detailed description
The technical solution of the invention is described in more detail below with reference to the drawing and a specific embodiment.
With reference to Fig. 1, an aquaculture information collecting and processing system comprises a Nutch web page grabber, a topic filtering module, an information extraction module, an aquaculture information database, a data processing module, a Web server and a browser.
The Nutch web page grabber starts grabbing from the URLs of current domestic aquaculture information service platforms, visits all web pages according to a link-grabbing strategy, parses out the web page content and new URLs, and sends the parsed web page information and new URL information to the topic filtering module for filtering.
The topic filtering module uses a topic filtering algorithm to filter out web pages and URLs unrelated to the aquaculture topic; the remaining URLs then enter the URL library, and the remaining web pages are sent to the information extraction module.
The information extraction module uses information extraction techniques to extract the aquaculture web page data obtained after filtering by the topic filtering module into structured information data; files in other formats are extracted through the Ultraseek file connector and corresponding information extraction techniques. The extracted data are stored in the aquaculture information database.
The data processing module builds a reasonable and comprehensive aquaculture dictionary, obtains the characteristic information of network resources according to the dictionary, such as word frequency, word position, word length and web page grade, and performs statistical analysis and classification. Using currently available artificial intelligence algorithms (neural network algorithms, genetic algorithms, decision tree methods, rough set methods, statistical analysis methods and fuzzy set methods), it mines and comparatively analyzes the data and establishes a data analysis model; through model analysis it obtains intelligent, in-depth and valuable information.
Web server and browser: the browser communicates with the Web server over the Internet, so that aquaculture practitioners can obtain relevant information according to their own needs.
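For illustration only, the storage of extracted records and their later retrieval on behalf of a practitioner might look like the following Python sketch, which uses the standard-library `sqlite3` module as a stand-in for the aquaculture information database. The table layout, the field names (`url`, `title`, `category`, `body`) and the functions `open_database`, `store_record` and `search` are assumptions of this example; the source does not specify a database product or schema.

```python
import sqlite3


def open_database(path=":memory:"):
    """Open (or create) the record store; the schema is a hypothetical example."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS records ("
        "url TEXT PRIMARY KEY, title TEXT, category TEXT, body TEXT)"
    )
    return conn


def store_record(conn, record):
    """Insert a structured record, replacing any earlier version of the same URL."""
    conn.execute(
        "INSERT OR REPLACE INTO records (url, title, category, body) "
        "VALUES (:url, :title, :category, :body)",
        record,
    )
    conn.commit()


def search(conn, keyword):
    """Return (url, title) pairs whose title or body contains the keyword."""
    pattern = f"%{keyword}%"
    rows = conn.execute(
        "SELECT url, title FROM records "
        "WHERE title LIKE ? OR body LIKE ? ORDER BY url",
        (pattern, pattern),
    )
    return rows.fetchall()
```

In such a design, a Web server handler would call `search` with the practitioner's query and render the returned rows in the browser. Keying the table on the URL is one simple way to support the update function, since re-grabbing a page overwrites its earlier record.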
The above is only a preferred embodiment of the invention, and the protection scope of the invention is not limited to it. Any simple change or equivalent replacement of the technical solution that a person skilled in the art can readily arrive at within the technical scope disclosed herein falls within the protection scope of the invention.

Claims (3)

1. An aquaculture information collecting and processing system, characterized in that it comprises a Nutch web page grabber, a topic filtering module, an information extraction module, an aquaculture information database, a data processing module, a Web server and a browser;
the Nutch web page grabber starts grabbing from the URLs of current domestic aquaculture information service platforms, visits all web pages according to a link-grabbing strategy, parses out the web page content and new URLs, and sends the parsed web page information and new URL information to the topic filtering module for filtering;
the topic filtering module uses a topic filtering algorithm to filter out web pages and URLs unrelated to the aquaculture topic, after which the remaining URLs enter the URL library and the remaining web pages are sent to the information extraction module;
the information extraction module uses information extraction techniques to extract the aquaculture web page data obtained after filtering by the topic filtering module into structured information data, extracts files in other formats through the Ultraseek file connector and corresponding information extraction techniques, and stores the extracted data in the aquaculture information database;
the data processing module builds a reasonable and comprehensive aquaculture dictionary, obtains the characteristic information of network resources according to the dictionary, performs statistical analysis and classification, uses currently available artificial intelligence algorithms to mine and comparatively analyze the data, establishes a data analysis model, and through model analysis obtains intelligent, in-depth and valuable information;
the browser communicates with the Web server over the Internet, so that aquaculture practitioners can obtain relevant information according to their own needs.
2. The aquaculture information collecting and processing system according to claim 1, characterized in that the characteristic information of the network resources includes word frequency, word position, word length and web page.
3. The aquaculture information collecting and processing system according to claim 1, characterized in that the currently available artificial intelligence algorithms include neural network algorithms, genetic algorithms, decision tree methods, rough set methods, statistical analysis methods and fuzzy set methods.
CN201610009741.5A 2016-01-08 2016-01-08 Aquaculture information collecting and processing system Pending CN105718508A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610009741.5A CN105718508A (en) 2016-01-08 2016-01-08 Aquaculture information collecting and processing system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610009741.5A CN105718508A (en) 2016-01-08 2016-01-08 Aquaculture information collecting and processing system

Publications (1)

Publication Number Publication Date
CN105718508A CN105718508A (en) 2016-06-29

Family

ID=56147581

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610009741.5A Pending CN105718508A (en) 2016-01-08 2016-01-08 Aquaculture information collecting and processing system

Country Status (1)

Country Link
CN (1) CN105718508A (en)

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104516982A (en) * 2015-01-06 2015-04-15 南通大学 Method and system for extracting Web information based on Nutch

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
周鹏: "农业搜索引擎系统的关键技术研究" *
高亮亮 等: "基于Nutch框架的农业信息垂直搜索引擎研究与设计" *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111062511A (en) * 2019-11-14 2020-04-24 佛山科学技术学院 Aquaculture disease prediction method and system based on decision tree and neural network
CN111062511B (en) * 2019-11-14 2023-04-25 佛山科学技术学院 Aquaculture disease prediction method and system based on decision tree and neural network

Similar Documents

Publication Publication Date Title
CN109189901A (en) Automatically a kind of method of the new classification of discovery and corresponding corpus in intelligent customer service system
CN106650725A (en) Full convolutional neural network-based candidate text box generation and text detection method
CN107705066A (en) Information input method and electronic equipment during a kind of commodity storage
CN101650715A (en) Method and device for screening links on web pages
CN108470032A (en) Overseas garden trade and investment promotion service system based on digital earth frame
CN107808375B (en) Merge the rice disease image detecting method of a variety of context deep learning models
CN110134849A (en) A kind of network public-opinion monitoring method and system
CN105160038A (en) Data analysis method and system based on audit database
CN107437038A (en) A kind of detection method and device of webpage tamper
CN103823824A (en) Method and system for automatically constructing text classification corpus by aid of internet
CN101894351A (en) Multi-agent based tour multimedia information personalized service system
CN107885793A (en) A kind of hot microblog topic analyzing and predicting method and system
CN106529564A (en) Food image automatic classification method based on convolutional neural networks
CN103577581B (en) Agricultural product price trend forecasting method
CN106251234A (en) A kind of agricultural product production and marketing integrated service platform based on the Internet and big data
CN103927400A (en) Web site product detailed information classification crawling and product information base establishing method
CN108628994A (en) A kind of public sentiment data processing system
CN104077295A (en) Data label mining method and data label mining system
CN104615701B (en) The embedded big data visualization engine cluster in smart city based on video cloud platform
CN109784408A (en) A kind of embedded time series Decision-Tree Method and system of marginal end
CN106202467A (en) A kind of definable towards peer-to-peer network searches for the web crawlers method of emphasis
CN111582219B (en) Intelligent pet management system
CN103976468A (en) Tobacco leaf grading method
CN103714120B (en) A kind of system that user interest topic is extracted in the access record from user url
CN102306182A (en) Method for excavating user interest based on conceptual semantic background image

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20160629