CN102402541A - File analysis system and method - Google Patents

File analysis system and method Download PDF

Info

Publication number
CN102402541A
CN102402541A CN2010102823819A CN201010282381A CN102402541A CN 102402541 A CN102402541 A CN 102402541A CN 2010102823819 A CN2010102823819 A CN 2010102823819A CN 201010282381 A CN201010282381 A CN 201010282381A CN 102402541 A CN102402541 A CN 102402541A
Authority
CN
China
Prior art keywords
file
document
type
analysis
mdb
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2010102823819A
Other languages
Chinese (zh)
Other versions
CN102402541B (en
Inventor
王台弘
黄玉玺
刘柏廷
甘淑慧
简吉廷
梁文广
姚进
罗伟
何宝儒
林晟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Crown Education Polytron Technologies Inc
Original Assignee
Jetta Software (shenzhen) Co Ltd
Hon Hai Precision Industry Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jetta Software (shenzhen) Co Ltd, Hon Hai Precision Industry Co Ltd filed Critical Jetta Software (shenzhen) Co Ltd
Priority to CN201010282381.9A priority Critical patent/CN102402541B/en
Publication of CN102402541A publication Critical patent/CN102402541A/en
Application granted granted Critical
Publication of CN102402541B publication Critical patent/CN102402541B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Abstract

The invention relates to a file analysis method, which comprises the following steps of: reading a file and a file type parameter of the file from a file transfer protocol (FTP) server; judging whether the type of the file is one of textfile (TXT), comma separated values (CSV), meta database (MDB), database file (DBF) and Microsoft Excel (XLS) formats or not according to the file type parameter; when the type of the file is TXT or CSV, calling a JAVA algorithm to analyze TXT or CSV so as to generate an extensible markup language (XML) file; when the type of the file is MDB, calling the JAVA algorithm to analyze MDB so as to generate an XML file; when the type of the file is DBF, calling the JAVA algorithm to analyze DBF so as to generate an XML file; and when the type of the file is XLS, calling the JAVA algorithm to analyze XLS so as to generate an XML file. The invention also provides a file analysis system. By the method and the system, a binary file can be analyzed into a file which can be processed by a file sender adapter and is in an XML format.

Description

The document analysis system and method
Technical field
The present invention relates to a kind of document analysis system and method.
Background technology
Along with the continuous propelling of IT application process, increasing application system has appearred in enterprises, for example purchasing system, accounting system, bonded system, logistics system and marketing system.Each application system has different data layouts and adopts different data storage methods, for example according to FTP (File Transfer Protocol, FTP) storage data.When SAP (Systems Applications and Products in Data Processing) system is introduced original application system; The difficulty because the difference of data layout and data storage method, the data interaction between SAP system and the original application system become.For this reason; Server provides some adapters to SAPPI (Process Integration, flow process is integrated), but these type of adapters are limited; And function singleness; And the IT system of enterprise the inside is various, for using SAP PI to integrate the enterprise of each IT system, uses being limited in scope of SAP PI integral application system.
Sometimes need SAP PI reading of data and handling from ftp file; The file transmission adapter of the SAP PI of standard (File Sender Adapter) can only read and resolve the flat file (Flat File) that meets SAP PI adapter regulation; XML file for example; And powerless for the binary file of form relative complex, for example dbf, xls, and file such as mdb.
Summary of the invention
In view of above content, be necessary to provide a kind of document analysis system, can resolve to the binary file that needs are handled the file of the manageable XML form of file transmission adapter.
In view of above content, also be necessary to provide a kind of document analysis method, can resolve to the binary file that needs are handled the file of the manageable XML form of file transmission adapter.
Said document analysis system; Run in the file transmission adapter that flow process integrated service device comprised; This flow process integrated service device is connected in ftp server through network, and this system comprises: read module is used for reading from ftp server the file type parameters of file and this document; Judge module is used for judging according to the file type parameters of this document whether the type of this document is a kind of of TXT form, CSV form, MDB form, DBF form or XLS form; Parsing module is used for when the type of this document is TXT or CSV, calling the file of this TXT of JAVA arithmetic analysis or CSV form, generates the XML file; Said parsing module also is used for when the type of this document is MDB, calling the file of this MDB form of JAVA arithmetic analysis, generates the XML file; Said parsing module also is used for when the type of this document is DBF, calling the file of this DBF form of JAVA arithmetic analysis, generates the XML file; Said parsing module also is used for when the type of this document is XLS, calling the file of this XLS form of JAVA arithmetic analysis, generates the XML file.
Said document analysis method; Be applied in the file transmission adapter that flow process integrated service device comprised; This flow process integrated service device is connected in ftp server through network, and the method comprising the steps of: the file type parameters that from ftp server, reads file and this document; Judge that according to this document type parameter whether the type of this document is a kind of in TXT form, CSV form, MDB form, DBF form or the XLS form; When the type of this document is TXT or CSV, call the file of this TXT of JAVA arithmetic analysis or CSV form, generate the XML file; When the type of this document is MDB, call the file of this MDB form of JAVA arithmetic analysis, generate the XML file; When the type of this document is DBF, call the file of this DBF form of JAVA arithmetic analysis, generate the XML file; When the type of this document is XLS, call the file of this XLS form of JAVA arithmetic analysis, generate the XML file.
Compared to prior art, described document analysis system and method, the comparatively complicated binary file of the form that can not handle the file transmission adapter resolve to the file of XML form, is convenient to the file transmission adapter data are handled.
Description of drawings
Fig. 1 is the Organization Chart of document analysis of the present invention system preferred embodiment.
Fig. 2 is the process flow diagram of document analysis method of the present invention preferred embodiment.
The main element symbol description
Flow process integrated service device 1
Ftp server 2
The SAP system 3
The file transmission adapter 4
The remote function calls adapter 5
Network 6
The document analysis system 10
Read module 100
Judge module 300
Parsing module 400
Reminding module 500
Embodiment
As shown in Figure 1, be the Organization Chart of document analysis of the present invention system preferred embodiment.Said document analysis system 10 runs on flow process and integrates that (Process Integration is in the file transmission adapter (File Sender Adapter) 4 that PI) server 1 is comprised.Said flow process integrated service device 1 also comprises remote function calls adapter (remote function calladapter, RFC adapter) 5.Said flow process integrated service device 1 is connected in FTP (File Transfer Protocol through network 6; FTP) server 2, are used for the data transmission between ftp server 2 and SAP (the Systems Applications and Products in DataProcessing) system 3.This network 6 can be the Internet (Internet) or Intranet (Intranet).
Said ftp server 2 stored a plurality of ftp files, the type of this ftp file is respectively MDB, DBF, XLS, TXT, CSV etc.In the present embodiment, this ftp server 2 can be set up in purchasing system, accounting system, bonded system, logistics system and the marketing system of enterprise.
Said document analysis system 10 is used for reading file from ftp server 2, and resolves to XML (Extensible MarkupLanguage, the extend markup language) file layout that file transmission adapter 4 can be handled.Said file transmission adapter 4 is used to shine upon the data of this XML file layout, and the data-switching that this mapping obtains is become to meet the data of remote function calls adapter 5 access stencils.Said remote function calls adapter 5 is used for these data that meet its access stencil are sent to SAP system 3.
10 systems of said document analysis system comprise read module 100, judge module 300, parsing module 400 and reminding module 500.
Said read module 100 is used for reading file and reading in the parameter of this document from ftp server 2, and said parameter comprises Namespace (NameSpace), Root (root node), Type (file type) of this document etc.
Said judge module 300 is used for judging according to the Type parameter whether the type of this document is a kind of of TXT form, CSV form, MDB form, DBF form or XLS form.
Said parsing module 400 is used for when the type of this document is TXT or CSV, calls the JAVA algorithm of resolving TXT or csv file, and the data-switching of TXT or csv file is become the data of XML form, generates the XML file.
Said parsing module 400 also is used for when the type of this document is MDB, calls the JAVA algorithm of resolving the MDB file, and the data-switching of MDB file is become the data of XML form, generates the XML file.
Said parsing module 400 also is used for when the type of this document is DBF, calls the JAVA algorithm of resolving the DBF file, and the data-switching of DBF file is become the data of XML form, generates the XML file.
Said parsing module 400 also is used for when the type of this document is XLS, calls the JAVA algorithm of resolving the XLS file, and the data-switching of XLS file is become the data of XML form, generates the XML file.
The data of the XML file layout of above-mentioned generation are by said file transmission adapter 4 mappings and convert the data that meet remote function calls adapter 5 access stencils to, by function call adapter 5 these data that meet its access stencil sent to SAP system 3 again.
Said reminding module 500 is used for type when this document when not being any of above-mentioned TXT, CSV, MDB, DBF or XLS form, and prompting can't be resolved this document type, the document analysis failure.
As shown in Figure 2, be the process flow diagram of document analysis method of the present invention preferred embodiment.
Step S10, said read module 100 reads file from ftp server 2.
Step S12, said read module 100 read in Namespace (NameSpace), Root (root node), the Type parameters such as (file types) of this document.
Step S14, said judge module 300 judges according to the Type parameter whether the type of this document is TXT or CSV.When the type of this document is TXT or CSV, execution in step S16; When the type of this document is not TXT or CSV, execution in step S18.
Step S16, said parsing module 400 call the JAVA algorithm of resolving TXT or csv file, and the data-switching of TXT or csv file is become the data of XML form, generate the XML file.
Step S18, said judge module 300 judges according to the Type parameter whether the type of this document is MDB.When the type of this document is MDB, execution in step S20; When the type of this document is not MDB, execution in step S22.
Step S20, said parsing module 400 call the JAVA algorithm of resolving the MDB file, and the data-switching of MDB file is become the data of XML form, generate the XML file.
Step S22, said judge module 300 judges according to the Type parameter whether the type of this document is DBF.When the type of this document is DBF, execution in step S24; When the type of this document is not DBF, execution in step S26.
Step S24, said parsing module 400 call the JAVA algorithm of resolving the DBF file, and the data-switching of DBF file is become the data of XML form, generate the XML file.
Step S26, said judge module 300 judges according to the Type parameter whether the type of this document is XLS.When the type of this document is XLS, execution in step S28; When the type of this document is not XLS, execution in step S30.
Step S28, said parsing module 400 call the JAVA algorithm of resolving the XLS file, and the data-switching of XLS file is become the data of XML form, generate the XML file.
Step S30, said reminding module 500 promptings can't be resolved this document type, the document analysis failure.
Above embodiment is only unrestricted in order to technical scheme of the present invention to be described; Although the present invention is specified with reference to preferred embodiment; Those of ordinary skill in the art is to be understood that; Can make amendment or be equal to replacement technical scheme of the present invention, and not break away from the spirit and the scope of technical scheme of the present invention.

Claims (6)

1. document analysis system runs in the file transmission adapter that flow process integrated service device comprised, and this flow process integrated service device is connected in ftp server through network, it is characterized in that this system comprises:
Read module is used for reading from ftp server the file type parameters of file and this document;
Judge module is used for judging according to the file type parameters of this document whether the type of this document is a kind of of TXT form, CSV form, MDB form, DBF form or XLS form;
Parsing module is used for when the type of this document is TXT or CSV, calling the file of this TXT of JAVA arithmetic analysis or CSV form, generates the XML file;
Said parsing module also is used for when the type of this document is MDB, calling the file of this MDB form of JAVA arithmetic analysis, generates the XML file;
Said parsing module also is used for when the type of this document is DBF, calling the file of this DBF form of JAVA arithmetic analysis, generates the XML file;
Said parsing module also is used for when the type of this document is XLS, calling the file of this XLS form of JAVA arithmetic analysis, generates the XML file.
2. document analysis as claimed in claim 1 system is characterized in that this system also comprises:
Reminding module, when being used for type when this document and not being any of TXT form, CSV form, MDB form, DBF form or XLS form, prompting can't be resolved this document type, the document analysis failure.
3. document analysis as claimed in claim 1 system is characterized in that, the data of the XML file layout of said this generation of file transmission adapter mapping, and the data-switching that mapping obtains become to meet the data of remote function calls adapter access stencil;
Said remote function calls adapter sends to the SAP system with these data that meet its access stencil.
4. document analysis method is applied in the file transmission adapter that flow process integrated service device comprised, and this flow process integrated service device is connected in ftp server through network, it is characterized in that the method comprising the steps of:
From ftp server, read the file type parameters of file and this document;
Judge that according to this document type parameter whether the type of this document is a kind of in TXT form, CSV form, MDB form, DBF form or the XLS form;
When the type of this document is TXT or CSV, call the file of this TXT of JAVA arithmetic analysis or CSV form, generate the XML file;
When the type of this document is MDB, call the file of this MDB form of JAVA arithmetic analysis, generate the XML file;
When the type of this document is DBF, call the file of this DBF form of JAVA arithmetic analysis, generate the XML file;
When the type of this document is XLS, call the file of this XLS form of JAVA arithmetic analysis, generate the XML file.
5. document analysis method as claimed in claim 4 is characterized in that, this method also comprises step:
When the type of this document was not in TXT form, CSV form, MDB form, DBF form or the XLS form any, prompting can't be resolved this document type, the document analysis failure.
6. document analysis method as claimed in claim 4; It is characterized in that; This method also comprises step: said file transmission adapter shines upon the data of the XML file layout of this generation, and the data-switching that mapping obtains is become to meet the data of remote function calls adapter access stencil;
Said remote function calls adapter sends to the SAP system with these data that meet its access stencil.
CN201010282381.9A 2010-09-14 2010-09-14 File analysis system and method Active CN102402541B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201010282381.9A CN102402541B (en) 2010-09-14 2010-09-14 File analysis system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201010282381.9A CN102402541B (en) 2010-09-14 2010-09-14 File analysis system and method

Publications (2)

Publication Number Publication Date
CN102402541A true CN102402541A (en) 2012-04-04
CN102402541B CN102402541B (en) 2015-02-11

Family

ID=45884755

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201010282381.9A Active CN102402541B (en) 2010-09-14 2010-09-14 File analysis system and method

Country Status (1)

Country Link
CN (1) CN102402541B (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103500203A (en) * 2013-09-27 2014-01-08 金蝶软件(中国)有限公司 Method for checking accounts online and method and device for data storage
CN103559185A (en) * 2013-08-13 2014-02-05 西安航天动力试验技术研究所 Method for parsing and storing test data documents
CN104360890A (en) * 2014-10-17 2015-02-18 蓝盾信息安全技术有限公司 Method for generating XML file based on Java
CN106843864A (en) * 2017-01-09 2017-06-13 武汉开目信息技术股份有限公司 A kind of and SAP integrated callback method
CN107038329A (en) * 2016-10-19 2017-08-11 北京全域医疗技术有限公司 The on-line processing method and device of medical image file
CN107046555A (en) * 2016-10-19 2017-08-15 北京全域医疗技术有限公司 The transmission method and device of medical image file
CN110990079A (en) * 2019-12-02 2020-04-10 北京大学 Method and device for loading remote csv file
CN113220656A (en) * 2020-12-10 2021-08-06 格创东智(深圳)科技有限公司 Method and device for analyzing liquid crystal panel glass production data file
CN113220656B (en) * 2020-12-10 2024-04-16 格创东智(深圳)科技有限公司 Analysis method and device for liquid crystal panel glass production data file

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002039353A1 (en) * 2000-11-09 2002-05-16 Accenture Llp System and method for interfacing a data processing system to a business-to-business integration system
CN1369832A (en) * 2001-02-12 2002-09-18 宏碁电脑股份有限公司 Method and system for converting content format of file archives
US20050114405A1 (en) * 2003-11-25 2005-05-26 Microsoft Corporation Flat file processing method and system
CN1627288A (en) * 2003-12-10 2005-06-15 鸿富锦精密工业(深圳)有限公司 Files conversion system and method

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002039353A1 (en) * 2000-11-09 2002-05-16 Accenture Llp System and method for interfacing a data processing system to a business-to-business integration system
CN1369832A (en) * 2001-02-12 2002-09-18 宏碁电脑股份有限公司 Method and system for converting content format of file archives
US20050114405A1 (en) * 2003-11-25 2005-05-26 Microsoft Corporation Flat file processing method and system
CN1627288A (en) * 2003-12-10 2005-06-15 鸿富锦精密工业(深圳)有限公司 Files conversion system and method

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈振中: "Excel与XML相互转化的Java实现", 《福建电脑》 *

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103559185A (en) * 2013-08-13 2014-02-05 西安航天动力试验技术研究所 Method for parsing and storing test data documents
CN103559185B (en) * 2013-08-13 2016-12-28 西安航天动力试验技术研究所 Test data document resolves storage method
CN103500203A (en) * 2013-09-27 2014-01-08 金蝶软件(中国)有限公司 Method for checking accounts online and method and device for data storage
CN104360890A (en) * 2014-10-17 2015-02-18 蓝盾信息安全技术有限公司 Method for generating XML file based on Java
CN107038329A (en) * 2016-10-19 2017-08-11 北京全域医疗技术有限公司 The on-line processing method and device of medical image file
CN107046555A (en) * 2016-10-19 2017-08-15 北京全域医疗技术有限公司 The transmission method and device of medical image file
CN107038329B (en) * 2016-10-19 2020-04-21 北京全域医疗技术集团有限公司 Online processing method and device for medical image file
CN107046555B (en) * 2016-10-19 2020-07-31 北京全域医疗技术集团有限公司 Medical image file transmission method and device
CN106843864A (en) * 2017-01-09 2017-06-13 武汉开目信息技术股份有限公司 A kind of and SAP integrated callback method
CN110990079A (en) * 2019-12-02 2020-04-10 北京大学 Method and device for loading remote csv file
CN113220656A (en) * 2020-12-10 2021-08-06 格创东智(深圳)科技有限公司 Method and device for analyzing liquid crystal panel glass production data file
CN113220656B (en) * 2020-12-10 2024-04-16 格创东智(深圳)科技有限公司 Analysis method and device for liquid crystal panel glass production data file

Also Published As

Publication number Publication date
CN102402541B (en) 2015-02-11

Similar Documents

Publication Publication Date Title
CN102402541B (en) File analysis system and method
WO2004090684A3 (en) Method and apparatus for multi-realm system modeling
CN104537015A (en) Log analysis computer implementation method, computer and system
CN105786913A (en) Cloud manufacturing platform oriented ERP integrated database service interface encapsulation system and method
CN103345386A (en) Software production method, device and operation system
EP2521043A1 (en) Method for establishing a relationship between semantic data and the running of a widget
WO2002065277A2 (en) Method and system for incorporating legacy applications into a distributed data processing environment
CN110795315A (en) Method and device for monitoring service
CN102007495A (en) A method, apparatus and software for transforming a natural language request for modifying a set of subscriptions for a publish/subscribe topic string
US20120303642A1 (en) Automated file-conversion system and process for a media-generation system
WO2007048702A3 (en) Automated process for identifying and delivering domain specific unstructured content for advanced business analysis
US20080154861A1 (en) System and method for retrieving data from different types of data sources
WO2006042314A3 (en) Methods and apparatus for message oriented invocation
EP2366157A1 (en) Data publication and subscription system
CN102404356B (en) Long-distance function call transmission adapter and data reading method thereof
CN108595480B (en) Big data ETL tool system based on cloud computing and application method
CN111104556A (en) Service processing method and device
CN102377738A (en) Process integration server and method for realizing system integration by utilizing process integration server
CN112016285B (en) Logistics information processing method and processing system
US20210136118A1 (en) Comparing network security specifications for a network
US20210136119A1 (en) Comparing network security specifications across equivalent networks
US7668930B2 (en) Web service distribution system over the World Wide Web using web services description language (WSDL) standard including implementation for uniformly generating all fault conditions in WSDL message format
US20070106689A1 (en) XML data reduction engine (XRE)
WO2012050584A1 (en) System and method for providing a service
CN102546447B (en) Intelligent mail message processing method and intelligent mail message processing device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
ASS Succession or assignment of patent right

Owner name: SCIENBIZIP CONSULTING (SHENZHEN) CO., LTD.

Free format text: FORMER OWNER: GDS SOFTWARE(SHENZHEN)CO.,LTD.

Effective date: 20150104

Free format text: FORMER OWNER: HONGFUJIN PRECISE INDUSTRY CO., LTD.

Effective date: 20150104

C41 Transfer of patent application or patent right or utility model
TA01 Transfer of patent application right

Effective date of registration: 20150104

Address after: 518109 Guangdong province Shenzhen city Longhua District Dragon Road No. 83 wing group building 11 floor

Applicant after: SCIENBIZIP CONSULTING (SHEN ZHEN) CO., LTD.

Address before: 518109, Guangdong, Baoan District, Shenzhen, Longhua Road, road, east side of Foxconn science and Technology Park, D1 district workshop, stamping workshop, third layers, distinguish the body

Applicant before: Jetta software (Shenzhen) Co., Ltd.

Applicant before: Hon Hai Precision Industry Co., Ltd.

C14 Grant of patent or utility model
GR01 Patent grant
C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20150928

Address after: 528437 Guangdong province Zhongshan Torch Development Zone, Cheung Hing Road 6 No. 222 north wing trade building room

Patentee after: Yun Chuan intellectual property Services Co., Ltd of Zhongshan city

Address before: 518109 Guangdong province Shenzhen city Longhua District Dragon Road No. 83 wing group building 11 floor

Patentee before: SCIENBIZIP CONSULTING (SHEN ZHEN) CO., LTD.

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20160601

Address after: 518000 Guangdong Province, Shenzhen New District of Longhua City, Dalang street, Hua Sheng Lu Yong Jingxuan commercial building 1608

Patentee after: Jinyang Shenzhen sea Network Intelligent Technology Co., Ltd.

Address before: 528437 Guangdong province Zhongshan Torch Development Zone, Cheung Hing Road 6 No. 222 north wing trade building room

Patentee before: Yun Chuan intellectual property Services Co., Ltd of Zhongshan city

CB03 Change of inventor or designer information

Inventor after: Xiao Hanlong

Inventor before: Wang Taihong

Inventor before: Lin Cheng

Inventor before: Huang Yuxi

Inventor before: Liu Baiting

Inventor before: Gan Shuhui

Inventor before: Jian Jiting

Inventor before: Liang Wenguang

Inventor before: Yao Jin

Inventor before: Luo Wei

Inventor before: He Baoru

TR01 Transfer of patent right

Effective date of registration: 20170630

Address after: 100089 Beijing city Haidian District wanquanzhuang Road No. 15 304

Patentee after: Beijing crown education Polytron Technologies Inc

Address before: 518000 Guangdong Province, Shenzhen New District of Longhua City, Dalang street, Hua Sheng Lu Yong Jingxuan commercial building 1608

Patentee before: Jinyang Shenzhen sea Network Intelligent Technology Co., Ltd.

CP03 Change of name, title or address
CP03 Change of name, title or address

Address after: Room 266, 2 / F, building 17, Tianzhu Jiayuan, Tianzhu town, Shunyi District, Beijing

Patentee after: Beijing Shouguan Education Technology Group Co., Ltd

Address before: No. 15, Wanquan Zhuang Road, Haidian District, Beijing, Beijing, 304

Patentee before: Beijing crown education Polytron Technologies Inc.