CN106503263A - A kind of E-Government news is gathered and edited method automatically - Google Patents

A kind of E-Government news is gathered and edited method automatically Download PDF

Info

Publication number
CN106503263A
CN106503263A CN201611051209.6A CN201611051209A CN106503263A CN 106503263 A CN106503263 A CN 106503263A CN 201611051209 A CN201611051209 A CN 201611051209A CN 106503263 A CN106503263 A CN 106503263A
Authority
CN
China
Prior art keywords
news
data
unit
government
preference
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611051209.6A
Other languages
Chinese (zh)
Inventor
梁辉
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Wuzhou Winner Technology Co Ltd
Original Assignee
Wuzhou Winner Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Wuzhou Winner Technology Co Ltd filed Critical Wuzhou Winner Technology Co Ltd
Priority to CN201611051209.6A priority Critical patent/CN106503263A/en
Publication of CN106503263A publication Critical patent/CN106503263A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • G06F16/972Access to data in other repository systems, e.g. legacy data or dynamic Web page generation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

A kind of method the invention discloses E-Government news is gathered and edited automatically, the method include crawl step S1, extraction step S2 and preference monitoring step S3.The present invention is captured to news by crawl step S1, extraction step S2 and preference monitoring step S3, analysis is sorted out, preference is pushed, the repeated labor of editor can be down to minimum, improve editorial efficiency, and the news gathering and editing of all kinds of government unit can be managed automatically, to realize half/full-automation of news gathering and editing.

Description

A kind of E-Government news is gathered and edited method automatically
Technical field
The present invention relates to news gathering and editing field, more particularly to a kind of E-Government news is gathered and edited method automatically.
Background technology
E-Government news gathering and editing are parts for present E-Government management, E-Government news gathering and editing ageing, Verity, specific aim, accuracy directly affect the effect of associated electrical government affairs.Existing news briefing is mainly by manually clear Look at, artificial screening, imagineering, human-edited, artificial issue, artificial news briefing is required for through these loaded down with trivial details flow processs, Cause work not only uninteresting, and workload is greatly increased, news editor efficiency for issuing significantly cannot be carried always High.
Content of the invention
A kind of method it is an object of the present invention to provide E-Government news is gathered and edited automatically, can be by the repeated labor of editor Move and be down to minimum, raising editorial efficiency.
For achieving the above object, there is provided a kind of E-Government news is gathered and edited method automatically, the method includes crawl step S1, extraction step S2 and preference monitoring step S3, each step process are as follows:
Crawl step S1:Grasping system news according to needed for the rules for grasping for setting from the Internet crawl, and will be grabbed The news for taking sends to large-scale distributed calculating platform and carries out counting, analyzes classification, then preserves to whole station data base;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification according to actual needs from whole station data base Presented by system front end;
Preference monitoring step S3:Row is browsed by the navigation patterns of user behavior monitoring system registers user and according to described It is the preference with the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user The news information of preference.
Preferably, in preference monitoring step S3, data-storage system server is provided with two and data above storage is single There are label field in unit, each data in each data storage cell, and user can be by data-storage system server The label that news increases is set to the label field of the news, processes and preference news push so as to carry out preference.
Preferably, in crawl step S1, grasping system passes through according to the rules for grasping of rules for grasping configuration of described dispensing unit The network address that collects is put into network address library unit by reptile unit, then, by central scheduler unit according to scheduling rule from net The network address of location library unit extraction respective amount is put into queue unit to be captured carries out news crawl, and the content of crawl is sent to Large-scale distributed calculating platform.
Preferably, in crawl step S1, large-scale distributed calculating platform passes through government affairs analysing word library unit, picture The process of BASE64 transcoding units, typesetting transcoder unit, story label extraction unit and data compression unit, to crawl system The information that system sends carries out data analysiss, transcoding, process, extraction, classification, and is sent to whole station data base storage for before system Extract at end.
Preferably, the large-scale distributed calculating platform is by processed offline and grasping system and data storage system service Device carries out data transmission, and large-scale distributed calculating platform is sent to data storage system by data compression unit after data are compressed System classification server, wherein, large-scale distributed calculating platform after the data-interface for calling data-storage system server, root According to the classification of each different government unit, the news content data corresponding with unit content are obtained.
Preferably, in extraction step S2, system front end is by online engine and data-storage system server and whole station Database communication connects.
Compared with prior art, its advantage is the present invention:
The present invention is captured to news by crawl step S1, extraction step S2 and preference monitoring step S3, and analysis is returned Class, preference are pushed, and the repeated labor of editor can be down to minimum, raising editorial efficiency.The present invention simultaneously can be to all kinds of governments The news gathering and editing of unit are managed automatically, to realize half/full-automation of news gathering and editing, meanwhile, number is provided for various systems According to interface, other systems can get news data by data-interface.
Description of the drawings
Fig. 1 is the operation principle block diagram of the present invention;
Fig. 2 is the structural principle block diagram of the present invention.
Specific embodiment
With reference to embodiment, the invention will be further described, but does not constitute any limitation of the invention, any The modification of the limited number of time made in scope of the invention as claimed, still in scope of the presently claimed invention.
As shown in Figure 1 to Figure 2, the invention provides a kind of heavy goods vehicles intelligent diagnosing method, the method includes crawl step S1, extraction step S2 and preference monitoring step S3, each step process are as follows:
Crawl step S1:Grasping system 2 captures required news according to the rules for grasping for setting from the Internet 1, and by institute The news of crawl sends to large-scale distributed calculating platform 7 and carries out counting, analyzes classification, then preserves to whole station data base 8;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification according to actual needs from whole station data base 8 Presented by system front end 6;
Preference monitoring step S3:Row is browsed by the navigation patterns of user behavior monitoring system registers user and according to described It is the preference with the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user The news information of preference.
In preference monitoring step S3, data-storage system server 4 is provided with three data storage cells, each data storage Each data in unit has label field, and the mark that user can be increased by data-storage system server 4 for news The label field for being set to the news is signed, is processed and preference news push so as to carry out preference.
In the present embodiment, user is interacted with grasping system by B/S patterns, i.e. Browser/Server Mode, respectively Data storage cell is respectively the news storage of national government affairs news, provinces and regions government affairs news, the several plates of POLICY.
Additionally, data-storage system server 4 may also be configured to two or five or ten or 20 data storage lists Unit classifies to news.
In crawl step S1, rules for grasping of the grasping system 2 according to rules for grasping configuration of described dispensing unit, by reptile list The network address that collects is put into network address library unit by unit, then, by central scheduler unit according to scheduling rule from URL library list The network address of unit's extraction respective amount is put into queue unit to be captured carries out news crawl, and the content of crawl is sent to large-scale point Cloth calculating platform.Wherein, rules for grasping and scheduling rule are that system has been configured and completed.
In crawl step S1, large-scale distributed calculating platform 7 passes through government affairs analysing word library unit, picture BASE64 transcodings The process of unit, typesetting transcoder unit, story label extraction unit and data compression unit, the letter sent by grasping system Breath carries out data analysiss, transcoding, process, extraction, classification, and is sent to the storage of whole station data base 8 and extracts for system front end 6.
Large-scale distributed calculating platform 7 is carried out with grasping system 2 and data storage system service device 4 by processed offline 3 Data transfer, large-scale distributed calculating platform 7 are sent to data-storage system clothes by data compression unit after data are compressed Business device 4 classify, wherein, large-scale distributed calculating platform 7 after the data-interface for calling data-storage system server 4, root According to the classification of each different government unit, the news content data corresponding with unit content are obtained.
In the present embodiment, after the acquisition of large-scale distributed calculating platform 7 grasping system 2 transmits news content, read original Data are simultaneously extracted by government affairs analysing word library unit, picture BASE64 transcoding units, typesetting transcoder unit, story label single Unit carries out various analytical calculations, the label of acquisition news content, classification, time, source etc..
In extraction step S2, system front end 6 is by online engine 5 and data-storage system server 4 and whole station data Storehouse 8 communicates to connect.
In the present embodiment, Systems Operator will need to take passages news website URL, and the configuration rule typing according to system is grabbed Take system;Then, the data-interface of 6 calling system of Electronic Government Affairs Website system front end, browses e-government Intranet when user is daily During system of standing, the data for getting are presented to user by front-end technology and are browsed by Electronic Government Affairs Website, so as to realize electronics political affairs Automatically the issue of gathering and editing of business news.
In the present embodiment, whole station data base 8 is may be disposed in data-storage system server 4.
The above is only the preferred embodiment of the present invention, it should be pointed out that for a person skilled in the art, do not taking off On the premise of present configuration, some deformations and improvement can also be made, these are all without the effect for affecting the present invention to implement And practical applicability.

Claims (6)

1. a kind of E-Government news is gathered and edited method automatically, it is characterised in that:The method includes crawl step S1, extraction step S2 With preference monitoring step S3, each step process is as follows:
Crawl step S1:Grasping system news according to needed for the rules for grasping for setting from the Internet crawl, and will be captured News sends to large-scale distributed calculating platform and carries out counting, analyzes classification, then preserves to whole station data base;
Extraction step S2:Electronic Government Affairs Website group extracts the news of required classification by being from whole station data base according to actual needs System front end is presented;
Preference monitoring step S3:By the navigation patterns of user behavior monitoring system registers user and according to the navigation patterns with Preference of the standard determination for setting as user, then preserves to data-storage system server, and periodically pushes user preference News information.
2. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In preference monitoring step In S3, data-storage system server is provided with two and data above memory element, each number in each data storage cell According to there is a label field, and user can be mark that label that news increases be set to the news by data-storage system server Signature section, is processed and preference news push so as to carry out preference.
3. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In crawl step S1 In, the network address that collects is put into by rules for grasping of the grasping system according to rules for grasping configuration of described dispensing unit by reptile unit Network address library unit, then, is put from the network address that network address library unit extracts respective amount according to scheduling rule by central scheduler unit Entering queue unit to be captured carries out news crawl, and the content of crawl is sent to large-scale distributed calculating platform.
4. a kind of E-Government news as described in claim 1 or 3 is gathered and edited method automatically, it is characterised in that:In crawl step In S1, large-scale distributed calculating platform passes through government affairs analysing word library unit, picture BASE64 transcoding units, typesetting code conversion list The process of unit, story label extraction unit and data compression unit, carries out data analysiss, turns to the information that grasping system sends Code, process, extract, sort out, and be sent to whole station data base storage for system front end extraction.
5. a kind of E-Government news is gathered and edited method automatically as claimed in claim 4, it is characterised in that:Described large-scale distributed Calculating platform is carried out data transmission with grasping system and data storage system service device by processed offline, large-scale distributed calculating Platform is sent to data-storage system classification server by data compression unit after data are compressed, wherein, large-scale distributed Calculating platform according to the classification of each different government unit, is obtained after the data-interface for calling data-storage system server Take the news content data corresponding with unit content.
6. a kind of E-Government news is gathered and edited method automatically as claimed in claim 1, it is characterised in that:In extraction step S2 In, system front end is connected with data-storage system server and whole station database communication by online engine.
CN201611051209.6A 2016-11-25 2016-11-25 A kind of E-Government news is gathered and edited method automatically Pending CN106503263A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611051209.6A CN106503263A (en) 2016-11-25 2016-11-25 A kind of E-Government news is gathered and edited method automatically

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611051209.6A CN106503263A (en) 2016-11-25 2016-11-25 A kind of E-Government news is gathered and edited method automatically

Publications (1)

Publication Number Publication Date
CN106503263A true CN106503263A (en) 2017-03-15

Family

ID=58328510

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611051209.6A Pending CN106503263A (en) 2016-11-25 2016-11-25 A kind of E-Government news is gathered and edited method automatically

Country Status (1)

Country Link
CN (1) CN106503263A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113486279A (en) * 2021-06-29 2021-10-08 平安信托有限责任公司 Automatic news generation method, device, equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102495872A (en) * 2011-11-30 2012-06-13 中国科学技术大学 Method and device for conducting personalized news recommendation to mobile device users
US8521763B1 (en) * 2005-09-09 2013-08-27 Minnesota Public Radio Computer-based system and method for processing data for a journalism organization
CN104166668A (en) * 2014-06-09 2014-11-26 南京邮电大学 News recommendation system and method based on FOLFM model
CN105653512A (en) * 2015-12-30 2016-06-08 梅国平 News gathering and editing terminal, server, method and system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8521763B1 (en) * 2005-09-09 2013-08-27 Minnesota Public Radio Computer-based system and method for processing data for a journalism organization
CN102495872A (en) * 2011-11-30 2012-06-13 中国科学技术大学 Method and device for conducting personalized news recommendation to mobile device users
CN104166668A (en) * 2014-06-09 2014-11-26 南京邮电大学 News recommendation system and method based on FOLFM model
CN105653512A (en) * 2015-12-30 2016-06-08 梅国平 News gathering and editing terminal, server, method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
屈济荣: ""大数据背景下新闻采编新趋势"", 《报刊纵横》 *
樊兆欣: ""个性化新闻推荐系统关键技术研究与实现"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113486279A (en) * 2021-06-29 2021-10-08 平安信托有限责任公司 Automatic news generation method, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
CN109361617B (en) Convolutional neural network traffic classification method and system based on network packet load
CN106778259B (en) Abnormal behavior discovery method and system based on big data machine learning
CN107992490B (en) Data processing method and data processing equipment
CN104298679B (en) Applied business recommended method and device
CN109218223B (en) Robust network traffic classification method and system based on active learning
WO2022134794A1 (en) Method and apparatus for processing public opinions about news event, storage medium, and computer device
CN111914159B (en) Information recommendation method and terminal
CN101605126A (en) A kind of method and system of multi-protocol data Classification and Identification
CN110061931B (en) Industrial control protocol clustering method, device and system and computer storage medium
CN102542061A (en) Intelligent product classification method
CN109698798B (en) Application identification method and device, server and storage medium
CN112667750A (en) Method and device for determining and identifying message category
CN112367273A (en) Knowledge distillation-based flow classification method and device for deep neural network model
CN109660656A (en) A kind of intelligent terminal method for identifying application program
CN106920070A (en) A kind of resume collection method, apparatus and system
CN109062951A (en) Based on conversation process abstracting method, equipment and the storage medium for being intended to analysis and dialogue cluster
CN112036166A (en) Data labeling method and device, storage medium and computer equipment
CN115269438A (en) Automatic testing method and device for image processing algorithm
CN113868509A (en) Science and technology policy data information consultation service system based on cloud computing
CN110929032B (en) User demand processing system and method for software system
CN109871302B (en) Cloud computing application identification device and method based on resource overhead statistics
CN106503263A (en) A kind of E-Government news is gathered and edited method automatically
CN113408630A (en) Transformer substation indicator lamp state identification method
CN117130870A (en) Transparent request tracking and sampling method and device for Java architecture micro-service system
CN112434049A (en) Table data storage method and device, storage medium and electronic device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170315

RJ01 Rejection of invention patent application after publication