CN103440267A - System for extracting structuralized information by adopting template mode - Google Patents

System for extracting structuralized information by adopting template mode Download PDF

Info

Publication number
CN103440267A
CN103440267A CN2013103324436A CN201310332443A CN103440267A CN 103440267 A CN103440267 A CN 103440267A CN 2013103324436 A CN2013103324436 A CN 2013103324436A CN 201310332443 A CN201310332443 A CN 201310332443A CN 103440267 A CN103440267 A CN 103440267A
Authority
CN
China
Prior art keywords
template
unit
information
extraction
extracting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2013103324436A
Other languages
Chinese (zh)
Inventor
徐方林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN2013103324436A priority Critical patent/CN103440267A/en
Publication of CN103440267A publication Critical patent/CN103440267A/en
Pending legal-status Critical Current

Links

Images

Abstract

The invention discloses a system for extracting structuralized information by adopting a template mode. The system comprises a target selection unit, a template configuration unit, an object import unit and an information extraction unit; the target selection unit is used for selecting a structuralized information extraction object by utilizing an intelligent selecting module; the template configuration unit is used for configuring a related extraction template according to the selected structuralized information extraction object; the object import unit is used for importing the extraction object and the extraction template into the system; according to the extraction template, the information extraction unit is used for extracting from the extraction object according to the pre-set information to obtain needed structuralized information. The system is simple in structure and smart in design, and can overcome the deficiencies of the prior art by adopting a functionalized structure design, and fill a related market blank, thus realizing the purpose.

Description

A kind of system that adopts template way drawing-out structure information
Technical field
The present invention relates to the messaging software field, specifically, specially refer to a kind of system that adopts template way drawing-out structure information.
Background technology
The magnanimity information occurred on Internet, probably be divided into three kinds of structuring, semi-structured and destructurings.Structured message is as electronic commerce information, and the position of the character of information and the appearance of value is fixed; Semi-structured information is as the segmentation channel on professional website, the suitable standard of the grammer of its title and text, and the scope of keyword is quite limited to; Non-structured information is as BLOG and BBS, and all the elements are all unpredictable.
Structured message and unstructured information are the two worlds of IT application, and they have separately different application evolution characteristics and rule.But, also lack interconnective bridge between this two worlds, and this disappearance makes inevitably to exist in enterprise separating of " activity ", " information and knowledge ", its consequence is exactly: although they are all in the effort of carrying out " more educated ", but the IT application pattern that two worlds separates, be doomed to make it to be difficult to really realize their original intention---" in the most suitable time, send most suitable information to most suitable people.
In sum, for the defect of prior art, need especially a kind of system that adopts template way drawing-out structure information, to solve above-mentioned problem.
Summary of the invention
The object of the present invention is to provide a kind of system that adopts template way drawing-out structure information, by adopting the structural design of functionalization, overcome the deficiency in the conventional art, thereby realized purpose of the present invention.
Technical matters solved by the invention can realize by the following technical solutions:
A kind of system that adopts template way drawing-out structure information, it comprises:
Target is selected unit, adopts Intelligent Selection energy module, for the extracting object of selected structured message;
The template configuration unit, be connected with the selected unit of described target, according to the extracting object of selected structured message, configures relevant extraction template;
Object imports unit, with the selected unit of described target, with the template configuration unit, is connected respectively, for by extracting object and extraction template import system;
The information extraction unit, import unit with described object and be connected, and according to extraction template, according to the information set in advance, extracting object carried out to extraction operation, obtains the structured message needed.
In one embodiment of the invention, the structured message that described extraction template extracts comprises operation content, department's content, web content and content of multimedia.
In one embodiment of the invention, after described structured message extracts, add size, classification, the conversion date of information, be convenient to subsequent treatment.
Beneficial effect of the present invention is: simple in structure, design ingeniously, and by adopting the structural design of functionalization, overcome the deficiency in the conventional art, fill up the blank of relevant market, thereby realized purpose of the present invention.
The accompanying drawing explanation
The structured flowchart of the method that Fig. 1 is employing template way drawing-out structure information of the present invention.
Embodiment
For technological means, creation characteristic that the present invention is realized, reach purpose and effect is easy to understand, below in conjunction with embodiment, further set forth the present invention.
As shown in Figure 1, a kind of system that adopts template way drawing-out structure information of the present invention, it comprises that target selectes unit 100, template configuration unit 200, object and import unit 300 and information extraction unit 400.
The selected unit of described target adopts Intelligent Selection energy module, for the extracting object of selected structured message;
Described template configuration unit is connected with the selected unit of described target, according to the extracting object of selected structured message, configures relevant extraction template;
Described object imports unit and is connected with the template configuration unit with the selected unit of described target respectively, for by extracting object and extraction template import system;
Described information extraction unit imports unit with described object and is connected, and according to extraction template, according to the information set in advance, extracting object is carried out to extraction operation, obtains the structured message needed.
In one embodiment of the invention, in order to increase the wide usage of described method, the structured message that described extraction template extracts comprises operation content, department's content, web content and content of multimedia.
Especially after it is pointed out that described structured message extracts, add size, classification, the conversion date of information, be convenient to subsequent treatment.
The present invention is simple in structure, designs ingeniously, by adopting the structural design of functionalization, has overcome the deficiency in the conventional art, has filled up the blank of relevant market, thereby has realized purpose of the present invention.
Above demonstration and described ultimate principle of the present invention and principal character and advantage of the present invention.The technician of the industry should understand; the present invention is not restricted to the described embodiments; that in above-described embodiment and instructions, describes just illustrates principle of the present invention; without departing from the spirit and scope of the present invention; the present invention also has various changes and modifications, and these changes and improvements all fall in the claimed scope of the invention.The claimed scope of the present invention is defined by appending claims and equivalent thereof.

Claims (3)

1. a system that adopts template way drawing-out structure information, is characterized in that, it comprises:
Target is selected unit, adopts Intelligent Selection energy module, for the extracting object of selected structured message;
The template configuration unit, be connected with the selected unit of described target, according to the extracting object of selected structured message, configures relevant extraction template;
Object imports unit, with the selected unit of described target, with the template configuration unit, is connected respectively, for by extracting object and extraction template import system;
The information extraction unit, import unit with described object and be connected, and according to extraction template, according to the information set in advance, extracting object carried out to extraction operation, obtains the structured message needed.
2. a kind of system that adopts template way drawing-out structure information according to claim 1, is characterized in that, the structured message that described extraction template extracts comprises operation content, department's content, web content and content of multimedia.
3. a kind of system that adopts template way drawing-out structure information according to claim 1, is characterized in that, after described structured message extracts, adds size, classification, the conversion date of information, is convenient to subsequent treatment.
CN2013103324436A 2013-08-02 2013-08-02 System for extracting structuralized information by adopting template mode Pending CN103440267A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2013103324436A CN103440267A (en) 2013-08-02 2013-08-02 System for extracting structuralized information by adopting template mode

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2013103324436A CN103440267A (en) 2013-08-02 2013-08-02 System for extracting structuralized information by adopting template mode

Publications (1)

Publication Number Publication Date
CN103440267A true CN103440267A (en) 2013-12-11

Family

ID=49693959

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2013103324436A Pending CN103440267A (en) 2013-08-02 2013-08-02 System for extracting structuralized information by adopting template mode

Country Status (1)

Country Link
CN (1) CN103440267A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112560460A (en) * 2020-12-08 2021-03-26 北京百度网讯科技有限公司 Method and device for extracting structured information, electronic equipment and readable storage medium

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112560460A (en) * 2020-12-08 2021-03-26 北京百度网讯科技有限公司 Method and device for extracting structured information, electronic equipment and readable storage medium
CN112560460B (en) * 2020-12-08 2022-02-25 北京百度网讯科技有限公司 Method and device for extracting structured information, electronic equipment and readable storage medium

Similar Documents

Publication Publication Date Title
CN104461484B (en) The implementation method and device of front-end template
CN104933130A (en) Comment information marking method and comment information marking device
CN102122280B (en) Method and system for intelligently extracting content object
WO2010047794A3 (en) Environmental data collection
CN104135498A (en) Cross-platform information push system and push method thereof
CN103544219A (en) Question-answering system with intelligent recommendation
CN104504138A (en) Human-based information fusion method and device
CN106202283A (en) A kind of custom field derives the data method to Excel
CN104699473B (en) Generation method, device and the RTL emulators of temporal constraint file
CN104123376B (en) A kind of intelligent text collecting method and system based on row template
CN103440267A (en) System for extracting structuralized information by adopting template mode
CN108255895A (en) A kind of web data acquisition methods using context environmental rule
CN103425760A (en) Webpage-base-level structured information extraction method
CN103425759A (en) Webpage-base-level structured information extraction system
CN102750392B (en) Web topic information extraction method and system
CN103455553A (en) Method for extracting structured information in template mode
CN103116448A (en) Extract method for visualizing information
CN204360290U (en) A kind of computer equipment being applicable to large data processing
CN103020202B (en) A kind of complicated dynamic data relation solution method based on character string
CN204425777U (en) Wiring board contact pin fixed structure
CN103731393A (en) Method for compressing Web resource data
Hakemi et al. Factors influencing information systems adoption: A review of the literature
CN104461492A (en) System and method for generating mobile application client side
CN202362761U (en) Handwriting board with camera
CN204382918U (en) There is the pen of clocking capability

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20131211

WD01 Invention patent application deemed withdrawn after publication