CN107004208A - Media generation system and its execution method - Google Patents

Info

Publication number
CN107004208A
Authority
CN
China
Prior art keywords
media
information
analysis unit
template
management system
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201580052286.0A
Other languages
Chinese (zh)
Inventor
D·帕特森
J·孙达拉姆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Matthews International Corp
Original Assignee
Matthews Resources Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matthews Resources Inc
Priority claimed from PCT/US2015/047205 (WO2016033335A1)
Publication of CN107004208A
Legal status: Pending

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00 Commerce
    • G06Q30/02 Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241 Advertisements
    • G06Q30/0276 Advertisement creation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/12 Use of codes for handling textual entities
    • G06F40/151 Transformation
    • G06F40/16 Automatic learning of transformation rules, e.g. from examples
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G06F40/186 Templates

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Strategic Management (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A media management system includes: a content analysis unit that analyzes information in a media output to identify the data structure of the information and compares that structure with a known information structure; a template analysis unit that reformats the information into a similar file structure if the structure of the information is substantially similar to the known information structure, and that creates a new information structure based on the structure of the file information if it is not; and a media production unit that produces a media product based on the structured information.

Description

Media generation system and its execution method
Background
In modern media campaigns, a large amount of information about the look and feel of particular media elements is used to produce customized marketing and mailing materials. This information may or may not be delivered in a structured format. Moreover, even when it is delivered in a structured format, there are no uniform rules that standardize the structure of the information used in media development.
A need therefore exists for a system that standardizes the information used to produce customized media products.
Summary of the invention
One embodiment of the disclosure includes a media management system comprising: a content analysis unit that analyzes information in a media output to identify the data structure of the information and compares that structure with a known information structure; a template analysis unit that reformats the information into a similar file structure if the structure of the information is substantially similar to the known information structure, and that creates a new information structure based on the structure of the file information if it is not; and a media production unit that produces a media product based on the structured information.
In another embodiment, the content analysis unit can analyze a header in the file.
In another embodiment, the content analysis unit can identify at least one data element in the media output.
In another embodiment, the content analysis unit can compare the at least one identified data element with at least one known data element.
In another embodiment, the template analysis unit can identify an information template based on the comparison of the identified data element with the known data element.
In another embodiment, the template analysis unit can identify at least one rule associated with the template.
In another embodiment, the template analysis unit can apply the at least one rule to the media output.
In another embodiment, the data structure of the media output can be an extensible markup language (XML) structure.
In another embodiment, the data structure of the media output can be a comma-separated variable (CSV) structure.
In another embodiment, the media output is unstructured.
Another embodiment of the disclosure includes a method of structuring media, the method comprising the steps of: collecting information in a media output; analyzing the information in the media output to identify the data structure of the information; comparing the information structure with a known information structure; if the structure of the information is substantially similar to the known information structure, reformatting the information into a similar file structure; if the structure of the information is not substantially similar to the known information structure, creating a new information structure based on the structure of the file information; and producing a media product based on the structured information.
In another embodiment, the step of analyzing the information in the media output includes analyzing a header in the file.
In another embodiment, the step of analyzing the information in the media output includes the step of identifying at least one data element in the media output.
In another embodiment, the method includes the step of comparing the at least one identified data element with at least one known data element.
In another embodiment, the method includes the step of identifying an information template based on the comparison of the identified data element with the known data element.
In another embodiment, the method includes the step of identifying at least one rule associated with the template.
In another embodiment, the method includes the step of applying the at least one rule to the media output.
In another embodiment, the data structure of the media output can be an extensible markup language (XML) structure.
In another embodiment, the data structure of the media output can be a comma-separated variable (CSV) structure.
In another embodiment, the media output can be unstructured.
Brief description of the drawings
The details of the invention, including its non-limiting benefits and advantages, will become more readily apparent to those of ordinary skill in the relevant art after reviewing the following detailed description and accompanying drawings, in which:
Fig. 1 depicts a block diagram of a media management system suitable for use with methods and systems consistent with the present invention;
Fig. 2 shows a more detailed depiction of the computer of Fig. 1;
Fig. 3 shows a more detailed depiction of the additional computers of Fig. 1;
Fig. 4 depicts an illustrative embodiment of the operation of the MMS of Fig. 1;
Fig. 5 depicts a schematic diagram of a method of identifying the file structure of a file; and
Fig. 6 depicts a schematic diagram of a method of identifying elements in the file structure of a file.
Detailed description
Although various embodiments of the present invention are described herein, it will be apparent to those skilled in the art that many more embodiments and implementations are possible within the scope of the invention. Accordingly, the present invention is not to be restricted except in light of the appended claims and their equivalents.
Described herein is a system that reads media information from a media file, identifies the media information, and produces a media output based on that information. The system also standardizes the media information, either by reformatting it into a predetermined template or by producing a new template based on the new media information.
Fig. 1 depicts a block diagram of a media management system ("MMS") 100 suitable for use with methods and systems consistent with the present invention. The MMS 100 includes a plurality of computers 102, 104, 106 and 108 connected via a network 110. The network 110 is of a type suitable for connecting the computers for communication, such as a circuit-switched network or a packet-switched network. In addition, the network 110 may include several different networks, such as a local area network, a wide area network (such as the Internet), a telephone network (including a telephone network with dedicated communication links), a connectionless network and a wireless network. In the illustrative embodiment shown in Fig. 1, the network 110 is the Internet. Each of the computers 102, 104, 106 and 108 shown in Fig. 1 is connected to the network 110 via a suitable communication link, such as a dedicated communication line or a wireless communication link.
In the illustrative embodiment, the computer 102 serves as a media generation unit ("MGU"), which includes an information collection unit 112, a content analysis unit 114, a template analysis unit 116 and a media production unit 118. The number of computers and the network configuration shown in Fig. 1 are merely illustrative. Those skilled in the art will appreciate that the MMS 100 may include a different number of computers and networks. For example, the computer 102 may include the information collection unit 112 and the template analysis unit 116, while the content analysis unit 114 and the media production unit 118 reside on different computers.
Fig. 2 shows a more detailed depiction of the computer 102. The computer 102 includes a central processing unit (CPU) 202, an input/output (I/O) unit 204, a display device 206 communicatively coupled to the I/O unit 204, a secondary storage device 208 and a memory 210. The computer 102 may further include standard input devices such as a keyboard, a mouse, a digitizer or a speech processing device (each not shown).
The memory 210 of the computer 102 includes a graphical user interface ("GUI") 212, which is used to collect information from a user via the display device 206 and the I/O unit 204 as described herein. The GUI 212 includes any user interface capable of being displayed on the display device 206, including but not limited to a web page, a display panel in an executable program, or any other interface capable of being displayed on a computer screen. The GUI 212 is also stored in the secondary storage device 208. In one embodiment consistent with the present invention, the GUI 212 is displayed using commercially available hypertext markup language ("HTML") browsing software such as, but not limited to, Microsoft Internet Explorer, Google Chrome or any other commercially available HTML browsing software. The secondary storage device 208 may include an information storage unit 214. The information storage unit 214 may be a relational database such as, but not limited to, Microsoft SQL, Oracle or any other database.
Fig. 3 shows a more detailed depiction of the computers 104, 106 and 108. Each of the computers 104, 106 and 108 includes a central processing unit (CPU) 302, an input/output (I/O) unit 304, a display device 306 communicatively coupled to the I/O unit 304, a secondary storage device 308 and a memory 310. Each of the computers 104, 106 and 108 may further include standard input devices such as a keyboard, a mouse, a digitizer or a speech processing device (each not shown).
The memory 310 of each of the computers 104, 106 and 108 includes a GUI 312, which is used to collect information from a user via the display device 306 and the I/O unit 304 as described herein. The GUI 312 includes any user interface capable of being displayed on the display device 306, including but not limited to a web page, a display panel in an executable program, or any other interface capable of being displayed on a computer screen. The GUI 312 is also stored in the secondary storage device 308. In one embodiment consistent with the present invention, the GUI 312 is displayed using commercially available HTML browsing software such as, but not limited to, Microsoft Internet Explorer, Google Chrome or any other commercially available HTML browsing software.
Fig. 4 depicts an illustrative embodiment of the operation of the MMS 100. In step 402, a file containing information about the media to be generated is received at the information collection unit 112. In step 404, the content analysis unit 114 determines the format of the information in the file. The format may be a structured or unstructured file format, including but not limited to a PDF file, an XML file, an XLS file or any other structured or unstructured file format. In step 406, the content analysis unit 114 opens the file and compares the structure of the information in the file with known information structures. In comparing the structure of the file, the content analysis unit 114 identifies known indicators in the file, such as headers, tag information, word or character arrangements or any other indicators, and compares those indicators with the indicators in known data structures. As an illustrative example, the content analysis unit 114 may identify the header portion of an XML file and compare it with known header portions stored in the information storage unit 214.
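As a rough illustration of steps 404 and 406, the following Python sketch guesses a file's format and extracts the indicators (XML tag names or comma-separated header fields) that would be compared with a known structure. The function names `detect_format` and `extract_indicators`, the three format labels and the sample label data are assumptions made for this example; the disclosure does not specify an implementation.

```python
import csv
import io
import xml.etree.ElementTree as ET


def detect_format(text: str) -> str:
    """Classify a payload as 'xml', 'csv' or 'unstructured' (hypothetical labels)."""
    stripped = text.strip()
    if stripped.startswith("<"):
        try:
            ET.fromstring(stripped)
            return "xml"
        except ET.ParseError:
            pass  # looks like markup but is not well-formed XML
    rows = [row for row in csv.reader(io.StringIO(stripped)) if row]
    # Treat the text as comma-separated only when there are at least two
    # rows and every row has the same number (more than one) of fields.
    if len(rows) >= 2 and len(rows[0]) > 1 and all(len(r) == len(rows[0]) for r in rows):
        return "csv"
    return "unstructured"


def extract_indicators(text: str) -> set[str]:
    """Return the structural indicators used for comparison with known structures."""
    fmt = detect_format(text)
    stripped = text.strip()
    if fmt == "xml":
        # Every tag name in the document serves as an indicator.
        return {element.tag for element in ET.fromstring(stripped).iter()}
    if fmt == "csv":
        # The first row is assumed to be the header row.
        header_row = next(csv.reader(io.StringIO(stripped)))
        return {field.strip().lower() for field in header_row}
    return set()  # unstructured input yields no indicators in this sketch


xml_doc = "<label><position>top</position><color>red</color></label>"
print(detect_format(xml_doc), sorted(extract_indicators(xml_doc)))
```

Unstructured input returns an empty indicator set here; the description suggests such input would instead be handled with OCR, object recognition or user input, which this sketch does not attempt.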
In step 408, if the identified data structure matches a known data structure, the information for the known data structure is retrieved from the information storage unit 214. In step 410, the template analysis unit 116 modifies the information in the file to conform to the known data structure. As an illustrative example, if the file is identified as an XML file, the structure of the file (including its tags and headers) is modified so that it follows the tags and headers of the known data structure. In step 412, if no match exists for the identified data structure, the template analysis unit 116 produces a template based on the structure of the information in the file. In producing the template, the template analysis unit 116 may use conventional OCR and object recognition algorithms to identify the indicators that separate different words and phrases. The template analysis unit 116 may also gather external information (such as user input) to determine the categories of the different keywords or elements in the file. After all keywords and elements have been identified, the new file structure is stored in the information storage unit 214 as a known file structure.
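The branch between steps 408 to 410 (match found) and step 412 (no match, new template stored) can be sketched as follows. This is a minimal illustration, not the claimed method: the disclosure does not define "substantially similar", so the Jaccard similarity measure, the 0.8 threshold, the `match_or_create` name, the `template_N` naming scheme and the in-memory dictionary standing in for the information storage unit 214 are all assumptions.

```python
def jaccard(a: set[str], b: set[str]) -> float:
    """Overlap between two indicator sets, from 0.0 (disjoint) to 1.0 (identical)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def match_or_create(
    indicators: set[str],
    store: dict[str, set[str]],
    threshold: float = 0.8,  # assumed cut-off for "substantially similar"
) -> tuple[str, bool]:
    """Return (template_name, created).

    If a stored structure is similar enough, its name is returned and the
    caller would reformat the file to it. Otherwise the incoming structure
    is saved under a new name so that it becomes a known structure.
    """
    best_name, best_score = None, 0.0
    for name, known in store.items():
        score = jaccard(indicators, known)
        if score > best_score:
            best_name, best_score = name, score
    if best_name is not None and best_score >= threshold:
        return best_name, False
    new_name = f"template_{len(store) + 1}"  # hypothetical naming scheme
    store[new_name] = set(indicators)
    return new_name, True


store = {"label_v1": {"label", "position", "color"}}
print(match_or_create({"label", "position", "color"}, store))
print(match_or_create({"invoice", "total"}, store))
```

Storing the new structure inside the same call mirrors the last sentence of the paragraph above: once a new structure has been identified, later files with the same layout match it directly.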
In step 414, the template analysis unit 116 restructures the information in the file to conform to the new file structure in the newly produced template. In step 416, the template analysis unit 116 creates a new file using the information in the file and the identified file structure. In step 418, the media production unit 118 produces media based on the information in the file and the file structure. As an illustrative example, the information in the file may describe the position, arrangement and color of content to be printed on a label. The file information may be arranged in XML format using unknown tags and sub-tags. The content analysis unit 114 can determine whether the tags and sub-tags are the same as or similar to another tag layout. If the tags and sub-tags are the same as or similar to known tags and sub-tags stored in the information storage unit 214, the file is reformatted using those known tags and sub-tags. If the tags are neither the same as nor similar to the known tags, a new XML format can be created based on the tags and sub-tags in the file. Once the file structure has been determined, media is produced based on the information in the file. By comparing the information in each file with known data structures, all media can be converted to a standard format for faster and more accurate processing.
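The XML label example above can be illustrated with a short Python sketch that renames incoming tags to the closest known tag. The use of `difflib.get_close_matches`, the 0.8 cutoff, the `conform_tags` name and the sample tags are assumptions chosen for the example; the disclosure does not say how tag similarity is measured.

```python
import difflib
import xml.etree.ElementTree as ET


def conform_tags(xml_text: str, known_tags: list[str], cutoff: float = 0.8) -> str:
    """Rename each tag to its closest known tag, leaving unmatched tags as they are."""
    root = ET.fromstring(xml_text)
    for element in root.iter():
        # Compare case-insensitively; keep only the single best candidate.
        match = difflib.get_close_matches(
            element.tag.lower(), known_tags, n=1, cutoff=cutoff
        )
        if match:
            element.tag = match[0]
    return ET.tostring(root, encoding="unicode")


incoming = "<Label><Position>top</Position><colour>red</colour></Label>"
print(conform_tags(incoming, ["label", "position", "color"]))
# <label><position>top</position><color>red</color></label>
```

Tags with no sufficiently close known counterpart are left unchanged, which corresponds to the case where a new XML format would be created from the file's own tags and sub-tags.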
Fig. 5 depicts a schematic diagram of a method of identifying the file structure of a file. In step 502, the information collection unit 112 opens a file containing information about the media. In step 504, the content analysis unit 114 identifies the headers in the file. In step 506, the content analysis unit 114 compares the headers identified in the file with the headers from known file structures. In step 508, if the identified headers do not match the known headers, the content analysis unit 114 identifies the unmatched headers. In step 510, the content analysis unit 114 creates a new template incorporating the new headers. In step 512, if the headers match known headers, the content analysis unit 114 associates each header in the file with a known header. In step 514, the content analysis unit 114 identifies the data elements in the file. Data elements may include information marked with tags or sub-tags in an XML file, or information separated by an indicator (such as a comma). In step 516, the content analysis unit 114 categorizes the data elements based on the header, the information associated with the data element, or the data element itself. In step 518, the template analysis unit 116 produces a new template incorporating the categorized elements.
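A minimal Python sketch of the header comparison and categorization in Fig. 5, assuming comma-separated input. The header-to-category mapping, the "unclassified" label and the `classify_by_header` name are hypothetical; in the described system the known headers would come from the information storage unit 214.

```python
import csv
import io

# Hypothetical known headers and the category each one maps to.
KNOWN_HEADERS = {
    "position": "layout",
    "arrangement": "layout",
    "color": "appearance",
}


def classify_by_header(
    csv_text: str, known: dict[str, str]
) -> tuple[list[tuple[str, str, str]], list[str]]:
    """Return (classified_elements, unmatched_headers).

    Each classified element is a (header, value, category) tuple. Headers
    missing from `known` are reported so a new template can include them.
    """
    rows = [row for row in csv.reader(io.StringIO(csv_text)) if row]
    if not rows:
        return [], []
    headers = [field.strip().lower() for field in rows[0]]
    unmatched = [header for header in headers if header not in known]
    classified = []
    for row in rows[1:]:
        for header, value in zip(headers, row):
            category = known.get(header, "unclassified")
            classified.append((header, value.strip(), category))
    return classified, unmatched


elements, unmatched = classify_by_header(
    "Position,Color,Barcode\ntop,red,12345\n", KNOWN_HEADERS
)
print(elements)
print(unmatched)
```

Here categorization relies on the header alone; the description also allows categorizing by associated information or by the data element itself, which would need additional logic.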
Fig. 6 depicts a schematic diagram of a method of identifying the elements in the file structure of a file. In step 602, the information collection unit 112 opens the file. In step 604, the content analysis unit 114 identifies the data elements in the file. Data elements may include information marked with tags or sub-tags in an XML file, or information separated by an indicator (such as a comma). In step 606, the content analysis unit 114 compares the identified elements with known element types. In step 608, if an identified element is the same as or similar to a known element type from a file template, the template analysis unit 116 creates a file using the template that incorporates that known element type. In step 612, the template analysis unit 116 validates the information in the newly created file against the rules associated with the template in the information storage unit 214. The rules may include information about the arrangement, color, wording or any other aspect of the output media. In step 614, the media production unit 118 produces media based on the new file. In step 610, if the identified elements do not match the known elements, a new template incorporating the new elements is produced.
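The rule check in step 612 can be sketched as a list of per-field predicates attached to a template. The `Rule` class, the `validate` function, the field names and the specific color and wording rules are assumptions made for illustration; the disclosure only says that rules may concern arrangement, color, wording or other aspects.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class Rule:
    field: str                     # element the rule applies to
    description: str               # human-readable statement of the rule
    check: Callable[[str], bool]   # returns True when the value is acceptable


def validate(record: dict[str, str], rules: list[Rule]) -> list[str]:
    """Return a message for every rule the record fails (empty list means valid)."""
    failures = []
    for rule in rules:
        value = record.get(rule.field)
        # A missing field counts as a failure of the rule that needs it.
        if value is None or not rule.check(value):
            failures.append(f"{rule.field}: {rule.description}")
    return failures


# Hypothetical rules for a label template.
label_rules = [
    Rule("color", "must be an allowed color", lambda v: v in {"red", "black"}),
    Rule("wording", "must be at most 20 characters", lambda v: len(v) <= 20),
]

print(validate({"color": "red", "wording": "Handle with care"}, label_rules))
print(validate({"color": "green", "wording": "Handle with care"}, label_rules))
```

Returning the list of failed rules rather than a single boolean lets a caller decide whether to block media production in step 614 or to report the problems back to the user.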
In the present disclosure, the words "a" or "an" are to be taken to include both the singular and the plural. Conversely, any reference to plural items shall, where appropriate, include the singular.
It should be understood that various changes and modifications to the presently preferred embodiments disclosed herein will be apparent to those skilled in the art. Such changes and modifications can be made without departing from the spirit and scope of the present disclosure and without diminishing its intended advantages. It is therefore intended that such changes and modifications be covered by the appended claims.

Claims (20)

1. A media management system, comprising:
a content analysis unit that
analyzes information in a media output to identify a data structure of the information; and
compares the information structure with a known information structure;
a template analysis unit that,
if the structure of the information is substantially similar to the known information structure, reformats the information into a similar file structure, and,
if the structure of the information is not substantially similar to the known information structure, creates a new information structure based on the structure of the file information; and
a media production unit that produces a media product based on the structured information.
2. The media management system of claim 1, wherein the content analysis unit analyzes a header in the file.
3. The media management system of claim 1, wherein the content analysis unit identifies at least one data element in the media output.
4. The media management system of claim 3, wherein the content analysis unit compares the at least one identified data element with at least one known data element.
5. The media management system of claim 4, wherein the template analysis unit identifies an information template based on the comparison of the identified data element with the known data element.
6. The media management system of claim 5, wherein the template analysis unit identifies at least one rule associated with the template.
7. The media management system of claim 6, wherein the template analysis unit applies the at least one rule to the media output.
8. The media management system of claim 1, wherein the data structure of the media output is an extensible markup language structure.
9. The media management system of claim 1, wherein the data structure of the media output is a comma-separated variable structure.
10. The media management system of claim 1, wherein the media output is unstructured.
11. A method of structuring media, the method comprising the steps of:
collecting information in a media output;
analyzing the information in the media output to identify a data structure of the information;
comparing the information structure with a known information structure;
if the structure of the information is substantially similar to the known information structure, reformatting the information into a similar file structure;
if the structure of the information is not substantially similar to the known information structure, creating a new information structure based on the structure of the file information; and
producing a media product based on the structured information.
12. The method of claim 11, wherein the step of analyzing the information in the media output includes analyzing a header in the file.
13. The method of claim 11, wherein the step of analyzing the information in the media output includes the step of identifying at least one data element in the media output.
14. The method of claim 13, including the step of comparing the at least one identified data element with at least one known data element.
15. The method of claim 14, including the step of identifying an information template based on the comparison of the identified data element with the known data element.
16. The method of claim 15, including the step of identifying at least one rule associated with the template.
17. The method of claim 16, including the step of applying the at least one rule to the media output.
18. The method of claim 11, wherein the data structure of the media output is an extensible markup language structure.
19. The method of claim 11, wherein the data structure of the media output is a comma-separated variable structure.
20. The method of claim 11, wherein the media output is unstructured.
CN201580052286.0A 2014-08-27 2015-08-27 Media generation system and its execution method Pending CN107004208A (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US201462042471P 2014-08-27 2014-08-27
US62/042,471 2014-08-27
PCT/US2015/047205 WO2016033335A1 (en) 2014-08-27 2015-08-27 Media generation system and methods of performing the same

Publications (1)

Publication Number Publication Date
CN107004208A true CN107004208A (en) 2017-08-01

Family

ID=59093316

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201580052286.0A Pending CN107004208A (en) 2014-08-27 2015-08-27 Media generation system and its execution method

Country Status (2)

Country Link
EP (1) EP3195144A4 (en)
CN (1) CN107004208A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020099735A1 (en) * 2001-01-19 2002-07-25 Schroeder Jonathan E. System and method for conducting electronic commerce
US20060101332A1 (en) * 1999-12-30 2006-05-11 Tomasz Imielinski Virtual tags and the process of virtual tagging
US20070214695A1 (en) * 2006-03-20 2007-09-20 Lomont Molding, Inc., D.B.A. Paragon Products Lock out tag
CN101236609A (en) * 2007-02-02 2008-08-06 富士通株式会社 Apparatus and method for analyzing and determining correlation of information in a document
CN101427243A (en) * 2006-04-21 2009-05-06 微软公司 Localising unstructured resources
CN101615268A (en) * 2009-07-31 2009-12-30 北京华思维泰克科技有限公司 Method of electronic drawings and archives being collected, managing by digital label and the system that realizes this method
CN101661512A (en) * 2009-09-25 2010-03-03 万斌 System and method for identifying traditional form information and establishing corresponding Web form
CN103150584A (en) * 2013-01-30 2013-06-12 广东电网公司电力调度控制中心 Communication resource motion processing method and system

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130339843A1 (en) * 2012-06-13 2013-12-19 Motorola Mobility, Inc. Methods and Systems for Styling Web Elements

Also Published As

Publication number Publication date
EP3195144A4 (en) 2018-02-28
EP3195144A1 (en) 2017-07-26

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20210519

Address after: Pennsylvania, USA

Applicant after: MATTHEWS INTERNATIONAL Corp.

Address before: Delaware, USA

Applicant before: MATTHEWS RESOURCES, Inc.