CN110334258A - A kind of network text Content Management method based on customized label - Google Patents
A kind of network text Content Management method based on customized label Download PDFInfo
- Publication number
- CN110334258A CN110334258A CN201810165925.XA CN201810165925A CN110334258A CN 110334258 A CN110334258 A CN 110334258A CN 201810165925 A CN201810165925 A CN 201810165925A CN 110334258 A CN110334258 A CN 110334258A
- Authority
- CN
- China
- Prior art keywords
- content
- user
- module
- text
- management method
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/955—Retrieval from the web using information identifiers, e.g. uniform resource locators [URL]
- G06F16/9566—URL specific, e.g. using aliases, detecting broken or misspelled links
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Transfer Between Computers (AREA)
Abstract
Cybertimes development speed is exceedingly fast, Web content is flooded with the chip time of national 300,000,000 netizens, so how to realize effective management to network fragment content, and combines self-demand to choose effective content and establishes content library, it is the following important research direction to form knowledge base.Web content is by media attributes point, it is broadly divided into content of text, image content, video content etc., the present invention proposes a kind of network text Content Management method based on customized label, it is supplied to user one optional network text content crawl and management tool, by third party's information source, such as by text file, html web page, Web service, the content of relational database etc. automatically grabs, and in the content library for after analysis is handled being put into itself, and pass through metadata management, form knowledge base, an extremely convenient ground content of text management method is provided to user.
Description
Technical field
The present invention relates to information data administrative skill field, particularly relate in a kind of network text based on customized label
Hold management method.
Background technique
Cybertimes development speed is exceedingly fast, and Web content is flooded with the chip time of national 300,000,000 netizens, so how real
It now to effective management of network fragment content, and combines self-demand to choose effective content and establishes content library, know to be formed
Know library, is the following important research direction.
Web content is broadly divided into content of text, image content, video content etc. by media attributes point, and the present invention proposes
A kind of network text Content Management method based on customized label is supplied to user one optional network text content and grabs
It takes and management tool, by third party's information source, such as by text file, html web page, Web service, relational database etc.
Content automatically grabs, and in the content library for being put into itself after analysis is handled, and by metadata management, forms knowledge base.It gives
User provides an extremely convenient ground content of text management method.
Summary of the invention
The present invention proposes a kind of network text Content Management method based on customized label, and being supplied to user one can
The network text content of choosing grabs and management tool, takes by third party's information source, such as by text file, html web page, Web
The content of business, relational database etc. automatically grabs, and in the content library for being put into itself after analysis is handled, and passes through metadata pipe
Reason forms knowledge base, provides an extremely convenient ground content of text management method to user.The present invention includes such as a result,
Lower module:
User interactive module: the user interactive module that the interactive component based on windows system is formed, to receive user's input
Order, request and feedback related content be shown to the module of user;
Data memory module: it is based on relevant database, to store various information, including content text information, metadata object
Manage data format information, user role authority information, buffer area information etc.;
Content library (metadatabase) management module: the content connection point manager (CP manager) realized based on metadata, metadata definition are as follows:
Data K{
Vchar URL;The address //URL
Int kind;// type
};
User role module: to the user realized based on RBAC model and authority management module, user and permission pair are realized
It answers, permission open rights management mode corresponding with content, the content that can be supplied to the customized role of user consults permission,
And maintain easily, it is not easy to form mathematical logic mistake;
Data capture module;The crawl for realizing Web content is requested according to the crawl that user keys in, by grabbing seed URL, shape
At URL queue to be grabbed, parsing DNS, the cycle step for downloading webpage;
Data buffer area module: the Preliminary Content information come to store data capture module crawl, Preliminary Content information are passed through
After user arranges, according in corresponding classification deposit content library (metadatabase), become formal content information.
Specific embodiment
To keep the technical problem to be solved in the present invention, technical solution and advantage clearer, below in conjunction with specific implementation
Example is described in detail.
Embodiment
The present invention proposes a kind of network text Content Management method based on customized label, is supplied to user one optionally
Network text content crawl and management tool, by third party's information source, for example, by text file, html web page, Web service,
The content of relational database etc. automatically grabs, and in the content library for being put into itself after analysis is handled, and passes through metadata management,
Knowledge base is formed, provides an extremely convenient ground content of text management method to user.
The present embodiment includes following component part:
User interactive module: the user interactive module that the interactive component based on windows system is formed, to receive user's input
Order, request and feedback related content be shown to the module of user, the present embodiment is using J2EE realization;
Data memory module: it is based on relevant database, the present embodiment uses MySQL, to store various information, including content
Text information, metadata physical data format information, user role authority information, buffer area information etc.;
Content library (metadatabase) management module: the content connection point manager (CP manager) realized based on metadata, metadata definition are as follows:
Data K{
Vchar URL;The address //URL
Int kind;// type
};
User role module: to the user realized based on RBAC model and authority management module, user and permission pair are realized
It answers, permission open rights management mode corresponding with content, the content that can be supplied to the customized role of user consults permission,
And maintain easily, it is not easy to form mathematical logic mistake;
Data capture module;The crawl for realizing Web content is requested according to the crawl that user keys in, by grabbing seed URL, shape
At URL queue to be grabbed, parsing DNS, the cycle step for downloading webpage;
Data buffer area module: the Preliminary Content information come to store data capture module crawl, Preliminary Content information are passed through
After user arranges, according in corresponding classification deposit content library (metadatabase), become formal content information.
The above is a preferred embodiment of the present invention, it is noted that for those skilled in the art
For, without departing from the principles of the present invention, several improvements and modifications can also be made, these improvements and modifications
It should be regarded as protection scope of the present invention.
Claims (5)
1. the present invention proposes a kind of network text Content Management method based on customized label, be supplied to user one it is optional
Network text content crawl and management tool, by third party's information source, for example, by text file, html web page, Web take
The content of business, relational database etc. automatically grabs, and in the content library for being put into itself after analysis is handled, and passes through metadata pipe
Reason forms knowledge base, provides an extremely convenient ground content of text management method to user, and the present invention includes such as a result,
Lower module:
User interactive module: order, request and feedback related content to receive user's input are shown to the module of user;
Data memory module: it is based on relevant database, to store various information, including content text information, metadata object
Manage data format information, user role authority information, buffer area information etc.;
Content library (metadatabase) management module: the content connection point manager (CP manager) realized based on metadata;
User role module: to the user realized based on RBAC model and authority management module, user and permission pair are realized
It answers, permission open rights management mode corresponding with content, the content that can be supplied to the customized role of user consults permission,
And maintain easily, it is not easy to form mathematical logic mistake;
Data capture module;The crawl for realizing Web content is requested according to the crawl that user keys in, by grabbing seed URL, shape
At URL queue to be grabbed, parsing DNS, the cycle step for downloading webpage;
Data buffer area module: the Preliminary Content information come to store data capture module crawl, Preliminary Content information are passed through
After user arranges, according in corresponding classification deposit content library (metadatabase), become formal content information.
2. a kind of network text Content Management method based on customized label according to claim 1, which is characterized in that
The user interactive module, the interactive component based on windows system are formed.
3. a kind of network text Content Management method based on customized label according to claim 2, which is characterized in that
The data memory module, is formed based on relevant database.
4. a kind of network text Content Management method based on customized label according to claim 3, which is characterized in that
Content library (metadatabase) management module, metadata definition are as follows:
Data K{
Vchar URL;The address //URL
Int kind;// type
}。
5. a kind of network text Content Management method based on customized label according to claim 3, which is characterized in that
The user role module is realized based on RBAC model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810165925.XA CN110334258A (en) | 2018-02-28 | 2018-02-28 | A kind of network text Content Management method based on customized label |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810165925.XA CN110334258A (en) | 2018-02-28 | 2018-02-28 | A kind of network text Content Management method based on customized label |
Publications (1)
Publication Number | Publication Date |
---|---|
CN110334258A true CN110334258A (en) | 2019-10-15 |
Family
ID=68138809
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810165925.XA Pending CN110334258A (en) | 2018-02-28 | 2018-02-28 | A kind of network text Content Management method based on customized label |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110334258A (en) |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103744981A (en) * | 2014-01-14 | 2014-04-23 | 南京汇吉递特网络科技有限公司 | System for automatic classification analysis for website based on website content |
CN106294442A (en) * | 2015-05-28 | 2017-01-04 | 上海池乐信息科技有限公司 | A kind of internet information classifying identification method based on URL and system |
CN106557590A (en) * | 2016-12-01 | 2017-04-05 | 同方知网(北京)技术有限公司 | A kind of intelligent Answer System |
CN107704601A (en) * | 2017-10-13 | 2018-02-16 | 中国人民解放军第三军医大学第附属医院 | Big data search method and system, computer-readable storage medium and electronic equipment |
-
2018
- 2018-02-28 CN CN201810165925.XA patent/CN110334258A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103744981A (en) * | 2014-01-14 | 2014-04-23 | 南京汇吉递特网络科技有限公司 | System for automatic classification analysis for website based on website content |
CN106294442A (en) * | 2015-05-28 | 2017-01-04 | 上海池乐信息科技有限公司 | A kind of internet information classifying identification method based on URL and system |
CN106557590A (en) * | 2016-12-01 | 2017-04-05 | 同方知网(北京)技术有限公司 | A kind of intelligent Answer System |
CN107704601A (en) * | 2017-10-13 | 2018-02-16 | 中国人民解放军第三军医大学第附属医院 | Big data search method and system, computer-readable storage medium and electronic equipment |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US11050690B2 (en) | Method for providing recording and verification service for data received and transmitted by messenger service, and server using method | |
US20220075900A1 (en) | Tracing objects across different parties | |
JP2022529967A (en) | Extracting data from the blockchain network | |
US10929406B2 (en) | Systems and methods for a self-services data file configuration with various data sources | |
WO2017036372A1 (en) | Innovative and creative data processing method, terminal device and display interface | |
Tsai et al. | Intellectual-property blockchain-based protection model for microfilms | |
CN104732331B (en) | grouping management method, device and system | |
US10248801B2 (en) | Systems and methods for role-based file access control | |
US20140280172A1 (en) | System and method for distributed categorization | |
US20150188890A1 (en) | Client side encryption in on-demand applications | |
US20160125070A1 (en) | Unified system for real-time coordination of content-object action items across devices | |
CN107294955B (en) | Electronic file encryption middleware control system and method | |
CN110175316B (en) | Media number interaction method, system and storage medium based on blockchain | |
TW201409273A (en) | Method and Apparatus of Responding to Webpage Access Request | |
CN109344137A (en) | A kind of log storing method and system | |
US10165022B1 (en) | Screen sharing management | |
US10015050B2 (en) | Distributed computing system | |
CN109688123A (en) | The method and system of one-way data transfer between inter-network system based on GM two dimensional code | |
Pei et al. | Bank customer loyalty under the background of internet finance and multimedia technology | |
CN110334258A (en) | A kind of network text Content Management method based on customized label | |
US11586724B1 (en) | System and methods for authenticating content | |
CA2997636A1 (en) | Network-based electronic negotiable instrument system and method and device for realizing same | |
CN111611523A (en) | Resource management system, resource management method, device, and storage medium | |
US20170061379A1 (en) | Systems and methods for master-client virtual workspace communication and management | |
KR20160132854A (en) | Asset collection service through capture of content |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20191015 |