CN106503263A - A kind of E-Government news is gathered and edited method automatically - Google Patents
A kind of E-Government news is gathered and edited method automatically Download PDFInfo
- Publication number
- CN106503263A CN106503263A CN201611051209.6A CN201611051209A CN106503263A CN 106503263 A CN106503263 A CN 106503263A CN 201611051209 A CN201611051209 A CN 201611051209A CN 106503263 A CN106503263 A CN 106503263A
- Authority
- CN
- China
- Prior art keywords
- news
- data
- unit
- government
- preference
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
- G06F16/972—Access to data in other repository systems, e.g. legacy data or dynamic Web page generation
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
A kind of method the invention discloses E-Government news is gathered and edited automatically, the method include crawl step S1, extraction step S2 and preference monitoring step S3.The present invention is captured to news by crawl step S1, extraction step S2 and preference monitoring step S3, analysis is sorted out, preference is pushed, the repeated labor of editor can be down to minimum, improve editorial efficiency, and the news gathering and editing of all kinds of government unit can be managed automatically, to realize half/full-automation of news gathering and editing.
Description
Technical field
The present invention relates to news gathering and editing field, more particularly to a kind of E-Government news is gathered and edited method automatically.
Background technology
E-Government news gathering and editing are parts for present E-Government management, E-Government news gathering and editing ageing,
Verity, specific aim, accuracy directly affect the effect of associated electrical government affairs.Existing news briefing is mainly by manually clear
Look at, artificial screening, imagineering, human-edited, artificial issue, artificial news briefing is required for through these loaded down with trivial details flow processs,
Cause work not only uninteresting, and workload is greatly increased, news editor efficiency for issuing significantly cannot be carried always
High.
Content of the invention
A kind of method it is an object of the present invention to provide E-Government news is gathered and edited automatically, can be by the repeated labor of editor
Move and be down to minimum, raising editorial efficiency.
For achieving the above object, there is provided a kind of E-Government news is gathered and edited method automatically, the method includes crawl step
S1, extraction step S2 and preference monitoring step S3, each step process are as follows:
Crawl step S1:Grasping system news according to needed for the rules for grasping for setting from the Internet crawl, and will be grabbed
The news for taking sends to large-scale distributed calculating platform and carries out counting, analyzes classification, then preserves to whole station data base;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification according to actual needs from whole station data base
Presented by system front end;
Preference monitoring step S3:Row is browsed by the navigation patterns of user behavior monitoring system registers user and according to described
It is the preference with the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user
The news information of preference.
Preferably, in preference monitoring step S3, data-storage system server is provided with two and data above storage is single
There are label field in unit, each data in each data storage cell, and user can be by data-storage system server
The label that news increases is set to the label field of the news, processes and preference news push so as to carry out preference.
Preferably, in crawl step S1, grasping system passes through according to the rules for grasping of rules for grasping configuration of described dispensing unit
The network address that collects is put into network address library unit by reptile unit, then, by central scheduler unit according to scheduling rule from net
The network address of location library unit extraction respective amount is put into queue unit to be captured carries out news crawl, and the content of crawl is sent to
Large-scale distributed calculating platform.
Preferably, in crawl step S1, large-scale distributed calculating platform passes through government affairs analysing word library unit, picture
The process of BASE64 transcoding units, typesetting transcoder unit, story label extraction unit and data compression unit, to crawl system
The information that system sends carries out data analysiss, transcoding, process, extraction, classification, and is sent to whole station data base storage for before system
Extract at end.
Preferably, the large-scale distributed calculating platform is by processed offline and grasping system and data storage system service
Device carries out data transmission, and large-scale distributed calculating platform is sent to data storage system by data compression unit after data are compressed
System classification server, wherein, large-scale distributed calculating platform after the data-interface for calling data-storage system server, root
According to the classification of each different government unit, the news content data corresponding with unit content are obtained.
Preferably, in extraction step S2, system front end is by online engine and data-storage system server and whole station
Database communication connects.
Compared with prior art, its advantage is the present invention:
The present invention is captured to news by crawl step S1, extraction step S2 and preference monitoring step S3, and analysis is returned
Class, preference are pushed, and the repeated labor of editor can be down to minimum, raising editorial efficiency.The present invention simultaneously can be to all kinds of governments
The news gathering and editing of unit are managed automatically, to realize half/full-automation of news gathering and editing, meanwhile, number is provided for various systems
According to interface, other systems can get news data by data-interface.
Description of the drawings
Fig. 1 is the operation principle block diagram of the present invention;
Fig. 2 is the structural principle block diagram of the present invention.
Specific embodiment
With reference to embodiment, the invention will be further described, but does not constitute any limitation of the invention, any
The modification of the limited number of time made in scope of the invention as claimed, still in scope of the presently claimed invention.
As shown in Figure 1 to Figure 2, the invention provides a kind of heavy goods vehicles intelligent diagnosing method, the method includes crawl step
S1, extraction step S2 and preference monitoring step S3, each step process are as follows:
Crawl step S1:Grasping system 2 captures required news according to the rules for grasping for setting from the Internet 1, and by institute
The news of crawl sends to large-scale distributed calculating platform 7 and carries out counting, analyzes classification, then preserves to whole station data base 8;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification according to actual needs from whole station data base 8
Presented by system front end 6;
Preference monitoring step S3:Row is browsed by the navigation patterns of user behavior monitoring system registers user and according to described
It is the preference with the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user
The news information of preference.
In preference monitoring step S3, data-storage system server 4 is provided with three data storage cells, each data storage
Each data in unit has label field, and the mark that user can be increased by data-storage system server 4 for news
The label field for being set to the news is signed, is processed and preference news push so as to carry out preference.
In the present embodiment, user is interacted with grasping system by B/S patterns, i.e. Browser/Server Mode, respectively
Data storage cell is respectively the news storage of national government affairs news, provinces and regions government affairs news, the several plates of POLICY.
Additionally, data-storage system server 4 may also be configured to two or five or ten or 20 data storage lists
Unit classifies to news.
In crawl step S1, rules for grasping of the grasping system 2 according to rules for grasping configuration of described dispensing unit, by reptile list
The network address that collects is put into network address library unit by unit, then, by central scheduler unit according to scheduling rule from URL library list
The network address of unit's extraction respective amount is put into queue unit to be captured carries out news crawl, and the content of crawl is sent to large-scale point
Cloth calculating platform.Wherein, rules for grasping and scheduling rule are that system has been configured and completed.
In crawl step S1, large-scale distributed calculating platform 7 passes through government affairs analysing word library unit, picture BASE64 transcodings
The process of unit, typesetting transcoder unit, story label extraction unit and data compression unit, the letter sent by grasping system
Breath carries out data analysiss, transcoding, process, extraction, classification, and is sent to the storage of whole station data base 8 and extracts for system front end 6.
Large-scale distributed calculating platform 7 is carried out with grasping system 2 and data storage system service device 4 by processed offline 3
Data transfer, large-scale distributed calculating platform 7 are sent to data-storage system clothes by data compression unit after data are compressed
Business device 4 classify, wherein, large-scale distributed calculating platform 7 after the data-interface for calling data-storage system server 4, root
According to the classification of each different government unit, the news content data corresponding with unit content are obtained.
In the present embodiment, after the acquisition of large-scale distributed calculating platform 7 grasping system 2 transmits news content, read original
Data are simultaneously extracted by government affairs analysing word library unit, picture BASE64 transcoding units, typesetting transcoder unit, story label single
Unit carries out various analytical calculations, the label of acquisition news content, classification, time, source etc..
In extraction step S2, system front end 6 is by online engine 5 and data-storage system server 4 and whole station data
Storehouse 8 communicates to connect.
In the present embodiment, Systems Operator will need to take passages news website URL, and the configuration rule typing according to system is grabbed
Take system;Then, the data-interface of 6 calling system of Electronic Government Affairs Website system front end, browses e-government Intranet when user is daily
During system of standing, the data for getting are presented to user by front-end technology and are browsed by Electronic Government Affairs Website, so as to realize electronics political affairs
Automatically the issue of gathering and editing of business news.
In the present embodiment, whole station data base 8 is may be disposed in data-storage system server 4.
The above is only the preferred embodiment of the present invention, it should be pointed out that for a person skilled in the art, do not taking off
On the premise of present configuration, some deformations and improvement can also be made, these are all without the effect for affecting the present invention to implement
And practical applicability.
Claims (6)
1. a kind of E-Government news is gathered and edited method automatically, it is characterised in that:The method includes crawl step S1, extraction step S2
With preference monitoring step S3, each step process is as follows:
Crawl step S1:Grasping system news according to needed for the rules for grasping for setting from the Internet crawl, and will be captured
News sends to large-scale distributed calculating platform and carries out counting, analyzes classification, then preserves to whole station data base;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification by being from whole station data base according to actual needs
System front end is presented;
Preference monitoring step S3:By the navigation patterns of user behavior monitoring system registers user and according to the navigation patterns with
Preference of the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user preference
News information.
2. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In preference monitoring step
In S3, data-storage system server is provided with two and data above memory element, each number in each data storage cell
According to there is a label field, and user can be mark that label that news increases be set to the news by data-storage system server
Signature section, is processed and preference news push so as to carry out preference.
3. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In crawl step S1
In, the network address that collects is put into by rules for grasping of the grasping system according to rules for grasping configuration of described dispensing unit by reptile unit
Network address library unit, then, is put from the network address that network address library unit extracts respective amount according to scheduling rule by central scheduler unit
Entering queue unit to be captured carries out news crawl, and the content of crawl is sent to large-scale distributed calculating platform.
4. a kind of E-Government news as described in claim 1 or 3 is gathered and edited method automatically, it is characterised in that:In crawl step
In S1, large-scale distributed calculating platform passes through government affairs analysing word library unit, picture BASE64 transcoding units, typesetting code conversion list
The process of unit, story label extraction unit and data compression unit, carries out data analysiss, turns to the information that grasping system sends
Code, process, extract, sort out, and be sent to whole station data base storage for system front end extraction.
5. a kind of E-Government news is gathered and edited method automatically as claimed in claim 4, it is characterised in that:Described large-scale distributed
Calculating platform is carried out data transmission with grasping system and data storage system service device by processed offline, large-scale distributed calculating
Platform is sent to data-storage system classification server by data compression unit after data are compressed, wherein, large-scale distributed
Calculating platform according to the classification of each different government unit, is obtained after the data-interface for calling data-storage system server
Take the news content data corresponding with unit content.
6. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In extraction step S2
In, system front end is connected with data-storage system server and whole station database communication by online engine.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611051209.6A CN106503263A (en) | 2016-11-25 | 2016-11-25 | A kind of E-Government news is gathered and edited method automatically |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201611051209.6A CN106503263A (en) | 2016-11-25 | 2016-11-25 | A kind of E-Government news is gathered and edited method automatically |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106503263A true CN106503263A (en) | 2017-03-15 |
Family
ID=58328510
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201611051209.6A Pending CN106503263A (en) | 2016-11-25 | 2016-11-25 | A kind of E-Government news is gathered and edited method automatically |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106503263A (en) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113486279A (en) * | 2021-06-29 | 2021-10-08 | 平安信托有限责任公司 | Automatic news generation method, device, equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102495872A (en) * | 2011-11-30 | 2012-06-13 | 中国科学技术大学 | Method and device for conducting personalized news recommendation to mobile device users |
US8521763B1 (en) * | 2005-09-09 | 2013-08-27 | Minnesota Public Radio | Computer-based system and method for processing data for a journalism organization |
CN104166668A (en) * | 2014-06-09 | 2014-11-26 | 南京邮电大学 | News recommendation system and method based on FOLFM model |
CN105653512A (en) * | 2015-12-30 | 2016-06-08 | 梅国平 | News gathering and editing terminal, server, method and system |
-
2016
- 2016-11-25 CN CN201611051209.6A patent/CN106503263A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8521763B1 (en) * | 2005-09-09 | 2013-08-27 | Minnesota Public Radio | Computer-based system and method for processing data for a journalism organization |
CN102495872A (en) * | 2011-11-30 | 2012-06-13 | 中国科学技术大学 | Method and device for conducting personalized news recommendation to mobile device users |
CN104166668A (en) * | 2014-06-09 | 2014-11-26 | 南京邮电大学 | News recommendation system and method based on FOLFM model |
CN105653512A (en) * | 2015-12-30 | 2016-06-08 | 梅国平 | News gathering and editing terminal, server, method and system |
Non-Patent Citations (2)
Title |
---|
屈济荣: ""大数据背景下新闻采编新趋势"", 《报刊纵横》 * |
樊兆欣: ""个性化新闻推荐系统关键技术研究与实现"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 * |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113486279A (en) * | 2021-06-29 | 2021-10-08 | 平安信托有限责任公司 | Automatic news generation method, device, equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109361617B (en) | Convolutional neural network traffic classification method and system based on network packet load | |
CN106778259B (en) | Abnormal behavior discovery method and system based on big data machine learning | |
CN107992490B (en) | Data processing method and data processing equipment | |
CN104298679B (en) | Applied business recommended method and device | |
CN109218223B (en) | Robust network traffic classification method and system based on active learning | |
WO2022134794A1 (en) | Method and apparatus for processing public opinions about news event, storage medium, and computer device | |
CN111914159B (en) | Information recommendation method and terminal | |
CN101605126A (en) | A kind of method and system of multi-protocol data Classification and Identification | |
CN110061931B (en) | Industrial control protocol clustering method, device and system and computer storage medium | |
CN102542061A (en) | Intelligent product classification method | |
CN109698798B (en) | Application identification method and device, server and storage medium | |
CN112667750A (en) | Method and device for determining and identifying message category | |
CN112367273A (en) | Knowledge distillation-based flow classification method and device for deep neural network model | |
CN109660656A (en) | A kind of intelligent terminal method for identifying application program | |
CN106920070A (en) | A kind of resume collection method, apparatus and system | |
CN109062951A (en) | Based on conversation process abstracting method, equipment and the storage medium for being intended to analysis and dialogue cluster | |
CN112036166A (en) | Data labeling method and device, storage medium and computer equipment | |
CN115269438A (en) | Automatic testing method and device for image processing algorithm | |
CN113868509A (en) | Science and technology policy data information consultation service system based on cloud computing | |
CN110929032B (en) | User demand processing system and method for software system | |
CN109871302B (en) | Cloud computing application identification device and method based on resource overhead statistics | |
CN106503263A (en) | A kind of E-Government news is gathered and edited method automatically | |
CN113408630A (en) | Transformer substation indicator lamp state identification method | |
CN117130870A (en) | Transparent request tracking and sampling method and device for Java architecture micro-service system | |
CN112434049A (en) | Table data storage method and device, storage medium and electronic device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170315 |
|
RJ01 | Rejection of invention patent application after publication |