CN112861490B - Engineering quantity list directory comparison system and method based on openpyxl

Publication number
CN112861490B
Authority
CN
China
Prior art keywords
keyword
file
excel file
folder
excel
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110270362.2A
Other languages
Chinese (zh)
Other versions
CN112861490A (en)
Inventor
钱仲文
李雪维
裘华东
范江东
赵欣
金日强
张志仁
韩欣之
吕晓青
卢孔实
吴越人
郭燕玲
潘丐多
叶凡
林春
张睿
李媛媛
朱力
郑思佳
吴波
徐天天
袁奕文
何佳
杨文颖
喻琤
刘挺
杨钦
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Materials Branch of State Grid Zhejiang Electric Power Co Ltd
Original Assignee
Materials Branch of State Grid Zhejiang Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Materials Branch of State Grid Zhejiang Electric Power Co Ltd
Priority to CN202110270362.2A
Publication of CN112861490A
Application granted
Publication of CN112861490B
Legal status: Active

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/166 Editing, e.g. inserting or deleting
    • G06F40/177 Editing, e.g. inserting or deleting of tables; using ruled lines
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10 File systems; File servers
    • G06F16/16 File or folder operations, e.g. details of user interfaces specifically adapted to file systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G06F16/355 Class or cluster creation or modification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/10 Text processing
    • G06F40/194 Calculation of difference between files
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06Q INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00 Administration; Management
    • G06Q10/10 Office automation; Time management
    • G06Q10/103 Workflow collaboration or project management

Abstract

The invention relates to the technical field of technical-language document processing, and in particular to an engineering quantity list directory comparison system and method based on openpyxl. The system comprises: an excel file acquisition module, used for uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list"; a summary document creation module, used for creating a summary document; an excel file processing module, used for acquiring keyword information from the excel files; a keyword information recording module, used for recording that keyword information in the summary document; and a summary document export module, used for exporting the summary document. Because the system and method gather the keyword information in a single summary document that the user can check, they improve the quality and efficiency of processing engineering quantity list directories in the field of material project management and reduce the workload of material project management personnel.

Description

Engineering quantity list directory comparison system and method based on openpyxl
Technical Field
The invention relates to the technical field of technical-language document processing, and in particular to an engineering quantity list directory comparison system and method based on openpyxl.
Background
Technical-language document processing aims to identify useful technical terms in complicated tables and text according to specific logical relations. As the volume of table and text data grows, how to quickly extract the relevant information from it, and then apply and manage that information sensibly, has become an urgent problem.
At present, in the field of material project management, tables and documents such as engineering quantity lists are mostly processed manually. Sorting engineering quantity lists by hand is slow and prone to errors and omissions, so both the efficiency and the quality of processing are unsatisfactory.
Disclosure of Invention
To address the problems in the prior art, the invention provides an engineering quantity list directory comparison system and method based on openpyxl, which quickly and effectively collect the key information of engineering quantity list directories in a summary document, thereby reducing the workload of material project management personnel.
The technical solution adopted by the invention is as follows: an engineering quantity list directory comparison system based on openpyxl, comprising
an excel file acquisition module, used for uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list";
a summary document creation module, used for creating a summary document;
an excel file processing module, used for acquiring keyword information from the excel files;
a keyword information recording module, used for recording the keyword information of the excel files in the summary document;
and a summary document export module, used for exporting the summary document.
With this system, the excel file acquisition module classifies the catalog documents and reads the sheet names to find the excel files that contain a sheet named "engineering quantity list". The excel file processing module reads the contents of that sheet and matches them against the keywords of a secondary purchasing directory in the database to obtain the keyword information of the engineering quantity list. The summary document gathers the keyword information in one place for the user to check, which improves the quality and efficiency of processing engineering quantity list directories in the field of material project management and reduces the workload of material project management personnel.
Preferably, the excel file acquisition module comprises
a total folder establishing unit, used for establishing a batch file total folder that stores the catalog data, wherein the catalog data comprises compressed packages, folders, word files and excel files;
a compressed package decompression unit, used for decompressing each compressed package into a folder in the batch file total folder;
a secondary folder establishing unit, used for establishing a batch file secondary folder in the batch file total folder and moving the word files and excel files found in the batch file total folder and in its folders into the batch file secondary folder, to form the documents to be processed;
a document classification unit, used for classifying the documents to be processed into excel files and word files through the openpyxl functional module;
and an excel file determining unit, used for reading the sheet names of each excel file through the openpyxl functional module to determine the excel files that contain a sheet named "engineering quantity list".
Preferably, the excel file processing module matches the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquires keyword information; the keyword information comprises the keyword body, the keyword type, the row and column coordinates of the keyword, the page number within the excel file of the sheet where the keyword is located, and the name of the excel file where the keyword is located.
Preferably, the excel file processing module reads the contents of the sheet named "engineering quantity list" through an xlrd functional module.
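The keyword information described above can be pictured as one small record per match. The following is only a sketch: the class name `KeywordInfo`, the field names and the choice of 1-based numbering are assumptions made for illustration, not names taken from the patent.

```python
from dataclasses import astuple, dataclass


@dataclass(frozen=True)
class KeywordInfo:
    """One matched keyword (hypothetical record layout)."""

    body: str        # the keyword text itself
    kind: str        # keyword type from the secondary purchasing directory
    row: int         # row coordinate of the matching cell (assumed 1-based)
    column: int      # column coordinate of the matching cell (assumed 1-based)
    sheet_page: int  # position of the sheet inside the excel file (assumed 1-based)
    file_name: str   # name of the excel file that contains the sheet


def as_summary_row(info: KeywordInfo) -> tuple:
    """Flatten a record into the order used for one summary-document row."""
    return astuple(info)
```

Keeping the record immutable makes it safe to collect matches from many files before anything is written to the summary document.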
Preferably, the keyword information recording module comprises
a first recording unit, used for recording the keyword body of the keyword information in the summary document;
a second recording unit, used for recording the type corresponding to the keyword body in the summary document;
a third recording unit, used for recording in the summary document the row and column coordinates of the keyword body within its sheet;
a fourth recording unit, used for recording in the summary document the page number, within the excel file, of the sheet containing the keyword body;
and a fifth recording unit, used for recording in the summary document the name of the excel file corresponding to the keyword body.
An engineering quantity list directory comparison method based on openpyxl comprises the following steps:
L1 uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list";
L2 creating a summary document;
L3 acquiring keyword information from the excel files through an excel file processing module, and recording the keyword information in the summary document through a keyword information recording module;
L4 exporting the summary document.
The method classifies the catalog documents and reads the sheet names to find the excel files that contain a sheet named "engineering quantity list". The excel file processing module then reads the contents of that sheet and matches them against the keywords of a secondary purchasing directory in the database to obtain the keyword information of the engineering quantity list. The summary document gathers the keyword information in one place for the user to check, which improves the quality and efficiency of processing engineering quantity list directories in the field of material project management and reduces the workload of material project management personnel.
Preferably, step L1 specifically comprises
L11 establishing a batch file total folder and uploading the catalog data to it, wherein the catalog data comprises compressed packages, folders, word files and excel files;
L12 decompressing each compressed package into a folder in the batch file total folder;
L13 establishing a batch file secondary folder in the batch file total folder, and moving the word files and excel files found in the batch file total folder and in its folders into the batch file secondary folder to form the documents to be processed;
L14 classifying the documents to be processed into excel files and word files through the openpyxl functional module;
L15 reading the sheet names of each excel file through the openpyxl functional module to determine the excel files that contain a sheet named "engineering quantity list".
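Step L13 can be sketched with the Python standard library alone. This is a minimal illustration under assumptions: the function name `gather_documents`, the default sub-folder name `to_process`, the suffix set and the rule for renaming files with clashing names are all hypothetical, since the text does not specify them.

```python
from __future__ import annotations

import shutil
from pathlib import Path

# Assumed suffixes; the text only says "word files and excel files".
DOCUMENT_SUFFIXES = {".xlsx", ".xlsm", ".xls", ".docx", ".doc"}


def gather_documents(total_folder: Path, secondary_name: str = "to_process") -> list[Path]:
    """Move every word/excel file under total_folder into one sub-folder."""
    secondary = total_folder / secondary_name
    secondary.mkdir(exist_ok=True)
    # Collect the candidates first so that moved files are not visited again.
    candidates = [
        path
        for path in sorted(total_folder.rglob("*"))
        if path.is_file()
        and path.suffix.lower() in DOCUMENT_SUFFIXES
        and secondary not in path.parents
    ]
    moved = []
    for source in candidates:
        target = secondary / source.name
        counter = 1
        # Two folders may hold files with the same name; keep both copies.
        while target.exists():
            target = secondary / f"{source.stem}_{counter}{source.suffix}"
            counter += 1
        shutil.move(str(source), str(target))
        moved.append(target)
    return moved
```

Flattening everything into one folder means the later classification step only has to iterate over a single directory.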
Preferably, step L3 specifically comprises
L31 matching, through the excel file processing module, the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquiring keyword information;
L32 recording the keyword information of the excel files in the summary document through the keyword information recording module.
Preferably, in step L31, the excel file processing module reads the contents of the sheet named "engineering quantity list" through an xlrd functional module.
Preferably, step L32 specifically comprises
L321 recording the keyword body of the keyword information in the summary document;
L322 recording the type corresponding to the keyword body in the summary document;
L323 recording in the summary document the row and column coordinates of the keyword body within its sheet;
L324 recording in the summary document the page number, within the excel file, of the sheet containing the keyword body;
L325 recording in the summary document the name of the excel file corresponding to the keyword body.
Advantageous effects
With the system and method, the openpyxl functional module classifies the catalog documents and reads the sheet names to find the excel files that contain a sheet named "engineering quantity list". The excel file processing module reads the contents of that sheet and matches them against the keywords of a secondary purchasing directory in the database to obtain the keyword information of the engineering quantity list. The summary document gathers the keyword information in one place for the user to check, which improves the quality and efficiency of processing engineering quantity list directories in the field of material project management and reduces the workload of material project management personnel.
Drawings
FIG. 1 is a diagram showing the composition of directory data according to the present invention.
Detailed Description
The technical scheme of the invention is further described below by the specific embodiments with reference to the accompanying drawings.
An engineering quantity list directory comparison system based on openpyxl comprises an excel file acquisition module, a summary document creation module, an excel file processing module, a keyword information recording module and a summary document export module.
The excel file acquisition module is used for uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list". It comprises a total folder establishing unit, a compressed package decompression unit, a secondary folder establishing unit, a document classification unit and an excel file determining unit. The total folder establishing unit establishes a batch file total folder that stores the catalog data; as shown in fig. 1, the catalog data comprises compressed packages, folders, word files and excel files. The compressed package decompression unit decompresses each compressed package into a folder in the batch file total folder: it first obtains the file name of the compressed package and determines the file format from the file-name suffix (common formats include zip, tar, rar and 7z), then applies the decompression method appropriate to that format. The secondary folder establishing unit establishes a batch file secondary folder in the batch file total folder and moves the word files and excel files found in the batch file total folder and in its folders into it, to form the documents to be processed. The document classification unit classifies the documents to be processed into excel files and word files through the openpyxl functional module.
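The suffix-based dispatch performed by the decompression unit might look like the sketch below. The function names `detect_archive_format` and `decompress` are hypothetical. Only zip and tar are unpacked here, because the Python standard library supports them directly; rar and 7z need a third-party tool that the text does not name, so the sketch reports them instead of guessing.

```python
import tarfile
import zipfile
from pathlib import Path


def detect_archive_format(archive: Path) -> str:
    """Return the archive format implied by the file-name suffix."""
    name = archive.name.lower()
    if name.endswith(".zip"):
        return "zip"
    if name.endswith((".tar", ".tar.gz", ".tgz", ".tar.bz2", ".tar.xz")):
        return "tar"
    if name.endswith(".rar"):
        return "rar"
    if name.endswith(".7z"):
        return "7z"
    raise ValueError(f"not a recognised compressed package: {archive.name}")


def decompress(archive: Path, total_folder: Path) -> Path:
    """Unpack archive into a folder named after it inside total_folder."""
    fmt = detect_archive_format(archive)
    # Folder name = archive name up to the first dot (an assumed convention).
    target = total_folder / archive.name.split(".")[0]
    target.mkdir(parents=True, exist_ok=True)
    if fmt == "zip":
        with zipfile.ZipFile(archive) as package:
            package.extractall(target)
    elif fmt == "tar":
        with tarfile.open(archive) as package:
            package.extractall(target)
    else:
        # rar and 7z are not handled by the standard library.
        raise NotImplementedError(f"no extractor configured for .{fmt}")
    return target
```

Separating format detection from extraction keeps the "different method per format" rule in one place, so another extractor can be added without touching the callers.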
The openpyxl functional module is a Python library for reading and writing Excel 2010 documents. It is a fairly comprehensive tool: it can both read and modify an excel file, set cell details such as cell styles, and also insert charts and configure print settings. It reads and writes xltm, xltx, xlsm and xlsx files and can handle excel files with large data volumes. The system uses the openpyxl functional module to judge whether a document to be processed is an excel file; a document that is not an excel file is necessarily a word file. The excel file determining unit reads the sheet names of each excel file through the filestream function of the openpyxl functional module to determine the excel files that contain a sheet named "engineering quantity list".
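A minimal sketch of the classification and sheet-name check follows. It assumes that "judging whether a document is an excel file" means trying to open it with openpyxl's `load_workbook` and treating a failure as "not excel"; the text does not say which openpyxl call it uses. The English sheet name in `TARGET_SHEET` and the function names are also assumptions.

```python
from pathlib import Path
from zipfile import BadZipFile

from openpyxl import load_workbook
from openpyxl.utils.exceptions import InvalidFileException

TARGET_SHEET = "engineering quantity list"  # assumed English sheet name


def sheet_names(path: Path):
    """Return the sheet names if openpyxl can open the file, else None."""
    try:
        workbook = load_workbook(path, read_only=True)
    except (InvalidFileException, BadZipFile, OSError):
        return None
    try:
        return list(workbook.sheetnames)
    finally:
        workbook.close()


def find_quantity_lists(documents):
    """Split documents into (excel files with the target sheet, word files)."""
    excel_with_list, word_files = [], []
    for path in documents:
        names = sheet_names(path)
        if names is None:
            # Following the text: whatever is not an excel file is a word file.
            word_files.append(path)
        elif TARGET_SHEET in names:
            excel_with_list.append(path)
    return excel_with_list, word_files
```

Opening with `read_only=True` avoids loading cell data when only the sheet names are needed.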
The summary document creation module is used for creating a summary document. One summary document collects the key information of the engineering quantity lists in the catalog data of one batch. The summary document may be named "report-" followed by the name of the batch file total folder.
The excel file processing module is used for acquiring keyword information from the excel files. It reads the contents of the sheet named "engineering quantity list" through the xlrd functional module. The xlrd functional module is an extension tool for reading excel files; it can read a specified sheet and a specified cell, and using it only requires that a Python environment is installed. The excel file processing module matches the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquires keyword information; the keyword information comprises the keyword body, the keyword type, the row and column coordinates of the keyword, the page number within the excel file of the sheet where the keyword is located, and the name of the excel file where the keyword is located.
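The matching step can be illustrated independently of any excel library, on rows that have already been read. The sketch assumes that a "match" means the keyword occurs as a substring of the cell text and that the secondary purchasing directory is available as a plain dict from keyword to type; the text specifies neither, and the function name `match_keywords` is hypothetical.

```python
def match_keywords(rows, catalog):
    """Yield (keyword, type, row, column) for each cell containing a keyword.

    rows    -- iterable of rows, each an iterable of cell values
    catalog -- dict mapping keyword text to keyword type (a stand-in for
               the secondary purchasing directory held in the database)
    """
    for row_index, row in enumerate(rows, start=1):
        for column_index, value in enumerate(row, start=1):
            if value is None:
                continue  # empty cell
            text = str(value)
            for keyword, kind in catalog.items():
                if keyword in text:  # assumed rule: substring containment
                    yield keyword, kind, row_index, column_index
```

For example, with `catalog = {"power cable": "material"}` a cell holding "10kV power cable" in row 2, column 2 yields `("power cable", "material", 2, 2)`.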
The keyword information recording module is used for recording the keyword information of the excel files in the summary document. It comprises a first recording unit for recording the keyword body of the keyword information, a second recording unit for recording the type corresponding to the keyword body, a third recording unit for recording the row and column coordinates of the keyword body within its sheet, a fourth recording unit for recording the page number, within the excel file, of the sheet containing the keyword body, and a fifth recording unit for recording the name of the excel file corresponding to the keyword body; all five units record into the summary document.
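Recording and exporting can be sketched with openpyxl's `Workbook`, `append` and `save`. The column headings, the sheet title and the function name `export_summary` are assumptions; only the "report-" naming follows the text.

```python
from pathlib import Path

from openpyxl import Workbook

# Assumed column headings, one per recorded item.
HEADER = ["keyword", "type", "row", "column", "sheet page", "excel file"]


def export_summary(records, total_folder_name, out_dir):
    """Write one row per record and save as report-<total_folder_name>.xlsx."""
    workbook = Workbook()
    sheet = workbook.active
    sheet.title = "summary"
    sheet.append(HEADER)
    for record in records:
        # Each record is a 6-item sequence in the same order as HEADER.
        sheet.append(list(record))
    target = Path(out_dir) / f"report-{total_folder_name}.xlsx"
    workbook.save(target)
    return target
```

Writing one row per match keeps the summary easy to filter by file name or keyword type in a spreadsheet program.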
The summary document export module is used for exporting the summary document, so that the user can conveniently view the summary document.
With this system, the openpyxl functional module classifies the catalog documents and reads the sheet names to find the excel files that contain a sheet named "engineering quantity list". The excel file processing module reads the contents of that sheet and matches them against the keywords of a secondary purchasing directory in the database to obtain the keyword information of the engineering quantity list. The summary document gathers the keyword information in one place for the user to check, which improves the quality and efficiency of processing engineering quantity list directories in the field of material project management.
An engineering quantity list directory comparison method based on openpyxl comprises the following steps.
L1 uploads the catalog data of one batch and obtains the excel files that contain a sheet named "engineering quantity list". Specifically, L11 establishes a batch file total folder and uploads the catalog data to it; as shown in fig. 1, the catalog data comprises compressed packages, folders, word files and excel files. L12 decompresses each compressed package into a folder in the batch file total folder: it first obtains the file name of the compressed package and determines the file format from the file-name suffix (common formats include zip, tar, rar and 7z), then applies the decompression method appropriate to that format. L13 establishes a batch file secondary folder in the batch file total folder and moves the word files and excel files found in the batch file total folder and in its folders into it, to form the documents to be processed. L14 classifies the documents to be processed into excel files and word files through the openpyxl functional module. The openpyxl functional module is a Python library for reading and writing Excel 2010 documents. It is a fairly comprehensive tool: it can both read and modify an excel file, set cell details such as cell styles, and also insert charts and configure print settings. It reads and writes xltm, xltx, xlsm and xlsx files and can handle excel files with large data volumes. The openpyxl functional module is used to judge whether a document to be processed is an excel file; a document that is not an excel file is necessarily a word file.
L15 reads the sheet names of each excel file through the filestream function of the openpyxl functional module to determine the excel files that contain a sheet named "engineering quantity list".
L2 creates a summary document. One summary document collects the key information of the engineering quantity lists in the catalog data of one batch. The summary document may be named "report-" followed by the name of the batch file total folder.
L3 acquires keyword information from the excel files through the excel file processing module and records it in the summary document through the keyword information recording module. Specifically, L31 matches, through the excel file processing module, the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquires keyword information. The excel file processing module reads the contents of the sheet named "engineering quantity list" through the xlrd functional module. The xlrd functional module is an extension tool for reading excel files; it can read a specified sheet and a specified cell, and using it only requires that a Python environment is installed.
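Reading the sheet contents for step L31 could be sketched as below. Note the swap: the text names xlrd for this step, but xlrd releases from 2.0 onward read only the legacy .xls format, so this sketch uses openpyxl's `load_workbook` and `iter_rows(values_only=True)` instead. The English sheet name and the function name `read_quantity_list` are assumptions.

```python
from openpyxl import load_workbook

TARGET_SHEET = "engineering quantity list"  # assumed English sheet name


def read_quantity_list(path):
    """Return (sheet_page, rows) for the target sheet, or None if absent.

    sheet_page is the 1-based position of the sheet inside the workbook;
    rows is a list of lists holding the cell values.
    """
    workbook = load_workbook(path, read_only=True, data_only=True)
    try:
        if TARGET_SHEET not in workbook.sheetnames:
            return None
        sheet_page = workbook.sheetnames.index(TARGET_SHEET) + 1
        sheet = workbook[TARGET_SHEET]
        rows = [list(row) for row in sheet.iter_rows(values_only=True)]
        return sheet_page, rows
    finally:
        workbook.close()
```

The returned rows can be passed directly to a keyword-matching routine, and `sheet_page` supplies the page number that is recorded in the summary document.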
L32 records the keyword information of the excel files in the summary document through the keyword information recording module. Specifically, L321 records the keyword body of the keyword information in the summary document. L322 records the type corresponding to the keyword body. L323 records the row and column coordinates of the keyword body within its sheet. L324 records the page number, within the excel file, of the sheet containing the keyword body. L325 records the name of the excel file corresponding to the keyword body.
L4 exports the summary document so that the user can view it conveniently.
With this method, the openpyxl functional module classifies the catalog documents and reads the sheet names to find the excel files that contain a sheet named "engineering quantity list". The excel file processing module reads the contents of that sheet and matches them against the keywords of a secondary purchasing directory in the database to obtain the keyword information of the engineering quantity list. The summary document gathers the keyword information in one place for the user to check, which improves the quality and efficiency of processing engineering quantity list directories in the field of material project management.
The above examples are only illustrative of the preferred embodiments of the present invention and do not limit the spirit and scope of the present invention. Various modifications and improvements of the technical scheme of the present invention will fall within the protection scope of the present invention without departing from the design concept of the present invention, and the technical content of the present invention is fully described in the claims.

Claims (6)

1. An engineering quantity list directory comparison system based on openpyxl, characterized by comprising
an excel file acquisition module, used for uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list";
a summary document creation module, used for creating a summary document;
an excel file processing module, used for acquiring keyword information from the excel files;
a keyword information recording module, used for recording the keyword information of the excel files in the summary document;
and a summary document export module, used for exporting the summary document;
wherein the excel file processing module matches the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquires keyword information; the keyword information comprises the keyword body, the keyword type, the row and column coordinates of the keyword, the page number within the excel file of the sheet where the keyword is located, and the name of the excel file where the keyword is located;
and the keyword information recording module comprises
a first recording unit, used for recording the keyword body of the keyword information in the summary document;
a second recording unit, used for recording the type corresponding to the keyword body in the summary document;
a third recording unit, used for recording in the summary document the row and column coordinates of the keyword body within its sheet;
a fourth recording unit, used for recording in the summary document the page number, within the excel file, of the sheet containing the keyword body;
and a fifth recording unit, used for recording in the summary document the name of the excel file corresponding to the keyword body.
2. The engineering quantity list directory comparison system based on openpyxl according to claim 1, characterized in that the excel file acquisition module comprises
a total folder establishing unit, used for establishing a batch file total folder that stores the catalog data, wherein the catalog data comprises compressed packages, folders, word files and excel files;
a compressed package decompression unit, used for decompressing each compressed package into a folder in the batch file total folder;
a secondary folder establishing unit, used for establishing a batch file secondary folder in the batch file total folder and moving the word files and excel files found in the batch file total folder and in its folders into the batch file secondary folder, to form the documents to be processed;
a document classification unit, used for classifying the documents to be processed into excel files and word files through the openpyxl functional module;
and an excel file determining unit, used for reading the sheet names of each excel file through the openpyxl functional module to determine the excel files that contain a sheet named "engineering quantity list".
3. The engineering quantity list directory comparison system based on openpyxl according to claim 1, characterized in that the excel file processing module reads the contents of the sheet named "engineering quantity list" through an xlrd functional module.
4. An engineering quantity list directory comparison method based on openpyxl, characterized by comprising the following steps:
L1 uploading the catalog data of one batch and obtaining the excel files that contain a sheet named "engineering quantity list";
L2 creating a summary document;
L3 acquiring keyword information from the excel files through an excel file processing module, and recording the keyword information in the summary document through a keyword information recording module;
L4 exporting the summary document;
wherein step L3 specifically comprises
L31 matching, through the excel file processing module, the contents of the sheet named "engineering quantity list" against the keywords of a secondary purchasing directory in the database and, where a match is found, acquiring keyword information;
L32 recording the keyword information of the excel files in the summary document through the keyword information recording module;
and step L32 specifically comprises
L321 recording the keyword body of the keyword information in the summary document;
L322 recording the type corresponding to the keyword body in the summary document;
L323 recording in the summary document the row and column coordinates of the keyword body within its sheet;
L324 recording in the summary document the page number, within the excel file, of the sheet containing the keyword body;
L325 recording in the summary document the name of the excel file corresponding to the keyword body.
5. The method for comparing engineering quantity list catalogues based on openpyxl according to claim 4, wherein L1 specifically comprises:
L11: establishing a batch file total folder and uploading directory data to the batch file total folder, wherein the directory data comprises compression packages, folders, word files and excel files;
L12: decompressing each compression package into a folder within the batch file total folder;
L13: establishing a batch file sub-folder in the batch file total folder, and moving the word files and excel files in the batch file total folder and in its folders into the batch file sub-folder to form the documents to be processed;
L14: classifying the documents to be processed into excel files and word files through an openpyxl functional module;
L15: reading the sheet table names of each excel file through the openpyxl functional module to determine the excel files that contain a sheet table named "engineering quantity list".
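Steps L12 to L15 can be sketched in Python as follows. This is only an illustration under stated assumptions: compression packages are taken to be `.zip` files at the top level of the total folder, files are classified by extension using the standard library rather than openpyxl, file names are assumed to be unique, and the names `prepare_documents`, `has_quantity_sheet` and `to_process` are hypothetical.

```python
import shutil
import zipfile
from pathlib import Path

EXCEL_SUFFIXES = {".xlsx", ".xlsm", ".xls"}
WORD_SUFFIXES = {".docx", ".doc"}
OFFICE_SUFFIXES = EXCEL_SUFFIXES | WORD_SUFFIXES


def prepare_documents(total_dir):
    """L12 to L14: unzip archives, gather office files, classify them by extension."""
    total = Path(total_dir)

    # L12: decompress each top-level .zip into a folder named after the archive.
    for archive in list(total.glob("*.zip")):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(total / archive.stem)

    # L13: create the sub-folder and move word and excel files into it.
    sub = total / "to_process"  # hypothetical sub-folder name
    sub.mkdir(exist_ok=True)

    excel_files, word_files = [], []
    for path in sorted(total.rglob("*")):
        if not path.is_file() or sub in path.parents:
            continue  # skip directories and files already in the sub-folder
        suffix = path.suffix.lower()
        if suffix not in OFFICE_SUFFIXES:
            continue
        target = sub / path.name  # assumption: file names do not collide
        shutil.move(str(path), str(target))
        # L14: classify the moved file as excel or word.
        (excel_files if suffix in EXCEL_SUFFIXES else word_files).append(target)
    return excel_files, word_files


def has_quantity_sheet(xlsx_path, sheet_name="engineering quantity list"):
    """L15: report whether the workbook has a sheet with the given name."""
    from openpyxl import load_workbook  # lazy import; requires openpyxl

    wb = load_workbook(xlsx_path, read_only=True)
    try:
        return sheet_name in wb.sheetnames
    finally:
        wb.close()
```

Note that openpyxl opens `.xlsx` and `.xlsm` workbooks but not legacy `.xls` files, so `has_quantity_sheet` applies only to the former.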
6. The method for comparing engineering quantity list catalogues based on openpyxl according to claim 4, wherein in L31 the excel file processing module reads the content of the sheet table named "engineering quantity list" through an xlrd functional module.
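Reading the sheet content with xlrd, as claim 6 describes, might look like the sketch below. The function names and the default sheet name are assumptions for illustration; `xlrd.open_workbook`, `sheet_by_name`, `nrows` and `row_values` are existing xlrd calls. Recent xlrd releases (2.x) read only legacy `.xls` files.

```python
def sheet_rows(sheet):
    """Return the cell values of an xlrd-style sheet as a list of rows."""
    return [sheet.row_values(r) for r in range(sheet.nrows)]


def read_quantity_sheet(xls_path, sheet_name="engineering quantity list"):
    """Open a workbook with xlrd and return the rows of the named sheet."""
    import xlrd  # lazy import; requires xlrd

    book = xlrd.open_workbook(xls_path)
    return sheet_rows(book.sheet_by_name(sheet_name))
```

The returned list of rows is in a form that a keyword matching routine can iterate over cell by cell.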
CN202110270362.2A 2021-03-12 2021-03-12 Engineering quantity list directory comparison system and method based on openpyl Active CN112861490B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110270362.2A CN112861490B (en) 2021-03-12 2021-03-12 Engineering quantity list directory comparison system and method based on openpyl

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110270362.2A CN112861490B (en) 2021-03-12 2021-03-12 Engineering quantity list directory comparison system and method based on openpyl

Publications (2)

Publication Number Publication Date
CN112861490A (en) 2021-05-28
CN112861490B (en) 2024-02-20

Family

ID=75994321

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110270362.2A Active CN112861490B (en) 2021-03-12 2021-03-12 Engineering quantity list directory comparison system and method based on openpyl

Country Status (1)

Country Link
CN (1) CN112861490B (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000231560A (en) * 1999-02-10 2000-08-22 Ricoh Co Ltd Automatic document classification system
KR20020067160A (en) * 2001-02-15 2002-08-22 전석진 Method and system for indexing document
JP2007058804A (en) * 2005-08-26 2007-03-08 Hitachi Ltd Content delivery system, content delivery method and content delivery program
KR20170016657A (en) * 2015-08-04 2017-02-14 서울시립대학교 산학협력단 An apparatus for managing document using table of contents, a method thereof, and a computer recordable medium storing the method
CN110889310A (en) * 2018-09-07 2020-03-17 上海怀若智能科技有限公司 Financial document information intelligent extraction system and method
CN111796800A (en) * 2020-06-28 2020-10-20 上海建科造价咨询有限公司 Python-based engineering quantity list accuracy verification method

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
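BIM-based automatic compliance review system for architectural design and its key tech…; 邢雪娇; 土木工程与管理学报; 129-136 *
Design and implementation of a Python-based Excel document merging system; 张孟研; 福建电脑 (No. 06); 123-124 *
Design and implementation of a Python-based Excel document processing program; 周延熙; 信息与电脑(理论版) (No. 23); 85-87 *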

Also Published As

Publication number Publication date
CN112861490A (en) 2021-05-28

Similar Documents

Publication Publication Date Title
US20210342404A1 (en) System and method for indexing electronic discovery data
US8315997B1 (en) Automatic identification of document versions
CN100361493C (en) Document processing device, document processing method, and storage medium recording program therefor
US7372993B2 (en) Gesture recognition
US9068920B2 (en) System and method for scanning and processing printed media
CN111597150A (en) Automatic change and file arrangement information system
US20100097662A1 (en) System and method for scanning and processing printed media
CN112463726A (en) Automatic mobile financial bill filing method
MX2009000589A (en) Data processing over very large databases.
CN109284273B (en) Massive small file query method and system adopting suffix array index
Sankar et al. Digitizing a million books: Challenges for document analysis
CN101408882B (en) Method and system for searching authorization document
CN105701091A (en) Semantic-based PDF document processing method and processing device
CN1588352A (en) Recording method for extendable mark language file repairing trace
CN112861490B (en) Engineering quantity list directory comparison system and method based on openpyl
Boenig et al. Labelling OCR Ground Truth for Usage in Repositories
CN102306175A (en) Personal knowledge management method and device
CN112861473B (en) Directory examination result summarizing system and method based on openpyl
Niu Original order in the digital world
TW420777B (en) A query method of dynamitic attribute database management
CN114218347A (en) Method for quickly searching index of multiple file contents
Holler Toward a reference theory
Estill Shakespearean Extracts, Manuscript Cataloguing, and the Misrepresentation of the Archive
US8630984B1 (en) System and method for data extraction from email files
CN113298914B (en) Knowledge chunk extraction method and device, electronic equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant