CN110675289A - Method for compiling electronic file catalogue with case criminal review - Google Patents

Method for compiling electronic file catalogue with case criminal review Download PDF

Info

Publication number
CN110675289A
CN110675289A CN201910936642.5A CN201910936642A CN110675289A CN 110675289 A CN110675289 A CN 110675289A CN 201910936642 A CN201910936642 A CN 201910936642A CN 110675289 A CN110675289 A CN 110675289A
Authority
CN
China
Prior art keywords
file
files
criminal
directory
catalog
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201910936642.5A
Other languages
Chinese (zh)
Other versions
CN110675289B (en
Inventor
何坤
董晶
周鑫
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sichuan University
Original Assignee
Sichuan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sichuan University filed Critical Sichuan University
Priority to CN201910936642.5A priority Critical patent/CN110675289B/en
Publication of CN110675289A publication Critical patent/CN110675289A/en
Application granted granted Critical
Publication of CN110675289B publication Critical patent/CN110675289B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/18Legal services
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/10File systems; File servers
    • G06F16/11File system administration, e.g. details of archiving or snapshots
    • G06F16/113Details of archiving
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02PCLIMATE CHANGE MITIGATION TECHNOLOGIES IN THE PRODUCTION OR PROCESSING OF GOODS
    • Y02P90/00Enabling technologies with a potential contribution to greenhouse gas [GHG] emissions mitigation
    • Y02P90/30Computing systems specially adapted for manufacturing

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Tourism & Hospitality (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • Primary Health Care (AREA)
  • General Business, Economics & Management (AREA)
  • Human Resources & Organizations (AREA)
  • General Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Technology Law (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention belongs to the technical field of electronic volume catalogue authoring, and discloses a method for authoring an electronic volume catalogue with criminal first-aid, which comprises the following steps: analyzing the criminal examination files, extracting the characteristics of the files and constructing a criminal file characteristic library; classifying and identifying file files of criminal cases, extracting file information according to characteristics, and constructing criminal case file management libraries; and compiling a reading file and an archiving file catalogue by combining the management library. The invention is helpful to know the source and the rough case situation of the specific file files from the catalogue, and integrates and records the file files of each department. Makes up the defects of independent editions of the traditional public security (examination paper), inspection yard (examination paper), court (litigation paper) and administrative and judicial authorities (execution paper). The invention is beneficial to the recording of novel materials and is convenient for the expansion of recording technology.

Description

Method for compiling electronic file catalogue with case criminal review
Technical Field
The invention belongs to the technical field of electronic volume catalogue authoring, and particularly relates to a method for authoring an electronic volume catalogue according to criminal review.
Background
Currently, the closest prior art: with the deepening of judicial informatization construction, the criminal case files stored by the current judicial departments at all levels (court, inspection institute and judicial administrative institution) are increased by tens of millions of levels every year. In order to facilitate criminal case handling and save file management cost, electronic criminal case files and respective online case handling service systems are initially built by all levels of judicial departments: the high and middle-level court develops the business system in the unit independently, such as a trial and judgment management system, an electronic file system and an execution system; the people's court constructs the industry standard of the electronic file catalogue specification; private networks are constructed in national courts and people's courts, one-network office case handling is realized, and trace leaving and supervision in the whole process are realized; the national inspection and inspection institution builds a unified service application system, integrates functions of case handling, management, supervision, statistics and the like into a whole, and realizes online record of case handling information, online management of case handling processes, online supervision of case handling activities and data generation of four-level inspection and inspection institutions in the country. The administrative department of justice is a general term for a plurality of functional institutions: the system comprises administration, notarization, legal assistance, basic legal service, civil mediation, judicial appraisal, community correction and help and education arrangement, prisons, detention houses, detoxification places and the like. Some functional organizations have established business systems for this organization: such as a "judicial administration work information management system", "notary administration and industry management system", "judicial assessment auction management system", "judicial community correction management system", and "prison management information system", etc. Although electronic file processing research has been widely conducted in China, each level of judicial department basically establishes a respective electronic file management system. However, the data flow, sharing and exchange of criminal case files among all levels of judicial departments are not completely realized.
Because the electronic files are started late in China, the current criminal case file catalogue is manually compiled only according to the handling process in each department, and the automatic extraction of document numbers from the files, content-based classification cataloguing and hanging are not realized. The criminal examination volume catalog has the following defects in content: 1) the directory is too simple. The traditional file directory is mainly composed of two levels, one level is the class name inside the department, and the second level is the file name of the file. Conventional volume directories do not contain critical information about the volume files, such as "detention" (secondary directory) for mandatory documents (primary directory) and "audiovisual material record" (secondary directory) for evidentiary documents (primary directory) in the reconnaissance volume directory. The "detention" (secondary directory) does not write information on who and when to perform the detention; the "audiovisual material record" (secondary directory) does not annotate audiovisual material about what. Criminal cases have different numbers of files according to different cases, and the number of files is hundreds of cases, so evidence files are messy and mixed. It is difficult for the reader to roughly understand the basic situation and evidence structure of the case from such a catalog, and the reader does not sufficiently exert the function that the catalog should have. 2) The unified criminal case file bibliographic specifications are lacked among all levels of judicial departments, and the current public inspection method has one set of criminal case file bibliographic specifications, which may cause that one file is inconsistent in names in file catalogues of different departments, such as evidence files. The criminal first-examination papers can be divided into reading papers and filing papers from the use angle, wherein the reading papers refer to readable papers which are distributed among all departments and are formed by partial criminal first-examination papers, the types of papers are different due to different departments or different personal authorities, and the catalogues of the papers are different from person to person. The archive file refers to all file sets of one examination file formed in the criminal case handling process and mainly comprises documents and evidences formed by policemen, all levels of judicial departments and litigation participants. The archived file catalog should include public security, inspection yards, court houses, judicial authorities and litigation participants readable file names, submitted undelivered file names and classified file names.
The current deficiency in reading the file catalog is mainly expressed as follows: 1) the degree of automation of the writing is not high, and automatic file screening and catalog compilation of the existing criminal first-pass papers according to the authority of the reader is not realized. At present, criminal examination papers are circulated among departments, and corresponding catalogs need to be manually screened and compiled by professional personnel. For example, when a lawsuit participant (lawyer) wants to read a file to be detected and transferred to a detection department, the lawsuit participant (lawyer) generally obtains a file which is required to be read or is associated with a filing staff or a case management center at a contact time, and the filing staff or the case management center manually screens the file according to the reading authority and the application and compiles a corresponding directory. 2) The writing is not timely, and the traditional catalogue compilation is generally compiled according to case handling nodes or predetermined time. 3) The integration of the bibliographic is poor, for example, if a litigant participant (lawyer) wants to read the relevant inspection hall and legal portfolio at the same time, he must apply for the inspection hall and the legal hall and make different appointment times. 4) The current catalogue is too simple, and is inconvenient for a reader to briefly know the case from the catalogue. The defects of the archive catalogue are as follows: 1) because each department is independently compiled, the integration is poor; 2) traditional criminal first-pass archive files are provided by courts, and catalogues contain legal documents and evidence materials from the public inspection and some jurisdictions. The contents of all jurisdictions such as prisons, detention houses and detoxification facilities are not included.
Aiming at the problems of low intellectualization degree, poor integration and the like of the electronic file records currently reviewed by criminal offenders, an automatic file record and reading catalog authoring technology covering public security, courts, inspection yards, judicial administrative organs and litigant participants is urgently needed, the functions of file catalogs in different applications are fully exerted, and the high efficiency and the activeness of judicial are promoted. The method aims to solve the problem of data knowledge of a large number of criminal examination files stored in a distributed mode.
In summary, the problems of the prior art are as follows: the existing electronic file book with case criminal review has low intellectualization degree and poor integration.
The difficulty of solving the technical problems is as follows:
(1) constructing a file feature library: the number of documents of criminal cases is more or less, and more documents are hundreds of copies according to the case situation, and meanwhile, the evidence documents are messy, and the contents of the case situation are different in different documents and evidence descriptions. In order to extract key information from the documents and evidences, the invention analyzes the commonalities and differences of the same file samples of different criminal cases to form a file feature library. The accuracy of the file characteristics of a file depends on the number of file samples, and also determines the accuracy of the file profile and the bibliographic.
(2) Constructing a management library of files with files: the documents in the form of criminal papers are in a large number and various forms, and are mainly expressed in the forms of texts, images, audio-visual media, copies, forms and the like, and the information expression modes of the documents in different forms are different. In order to provide a file management library constructed along with file information, the invention integrates a character recognition technology, an image processing technology and fuzzy recognition.
The significance of solving the technical problems is as follows:
(1) the main purpose of constructing a criminal case portfolio feature library is:
1) so that the compiled catalog meets the marking habits of the public inspection law supervision and the corresponding bibliographic specifications;
2) providing guidance information for brief description of files;
3) provides necessary characteristic information for file classification of file files.
(2) The main purpose of constructing a management library of the files with the files is as follows:
1) providing necessary data support for generating marking catalogues and filing catalogues;
2) and providing data support for adding the abstract of the file in the marking directory.
3) The file bibliographic sequence is convenient to arrange.
Disclosure of Invention
Aiming at the problems in the prior art, the invention provides an electronic volume catalogue authoring method along with criminal review.
The invention is realized in such a way that a consummate first-pass electronic volume catalog authoring method comprises the following steps:
analyzing criminal examination files, extracting the characteristics of the files and constructing a criminal file characteristic library;
secondly, classifying and identifying the files of the criminal case to extract file information according to characteristics, and constructing a criminal case management library;
and thirdly, compiling a reading file catalog and an archiving file catalog by combining the management library.
Further, the first step builds a criminal volume feature library: the file management system comprises a file making organization, a file name, a file attribute, a file type, a file category, a directory code and key information;
the file classification is a concrete classification of document files;
the directory code is a file directory number and is specified according to a file directory sequence of a public inspection prison;
the key information records the summary information of the file, and the key information of the file is constructed according to the reading key points of legal workers on the file.
Further, the constructing of the accompanying criminal file management library of the second step includes:
(1) volume file structure: the documents of criminal case volume include the text material of the documents written by the official survey, the self-complainers and the defendees; the template of the document comprises a head part, a body part and a tail part; header writing manufacturing organization, file name, primary and secondary volumes, and others; the text part explains the reason and the offending clauses; the tail part of the system is used for writing a undertaking unit, a contractor and a date;
(2) extracting file information: expressed in text, image, audiovisual media, copy and tabular form;
(3) criminal case volume file management repository:
MYSQL8.1 is used for establishing a file management library which mainly comprises a file manufacturing organization, file names, file attributes, file types, authorities, file ID numbers, file types and brief descriptions;
the manufacturing organization is filled in according to the release department;
the file name is extracted and filled from the file by utilizing a character recognition technology;
the file attribute is extracted from the file by utilizing a character recognition technology and is filled in;
the file type is inquired and filled in from the file feature library according to the file name;
the authority, which records the reading authority of the file, is filled in by the file publisher according to the case;
the file ID number not only represents the sequence of the file files in the directory, but also represents the number of the file files in the file warehouse;
the file type is inquired and filled in from the file feature library according to the file name;
briefly, the summary of the file is recorded, the key information of the file is inquired from the file feature library according to the file name, the related content is retrieved from the file by using the key information, and finally the corresponding item is filled.
Further, the headers of the official documents are classified into four types: the first type has only the name of the document; the second category comprises a manufacturing organization, a file name and a letter number; the third category is composed of manufacturing organs, file names, letter numbers and the like; the fourth type is that the primary volume and the secondary volume are added on the basis of the third type;
the image and audio-visual material mainly consists of two parts: the system comprises a description body and related media data, wherein the description body writes the source, time, place, acquisition personnel and related content description of the media; the copy refers to a valid certificate issued by a relevant unit; the table is composed of table names and table contents; the table names all appear in the top page as separate lines.
Further, the extracting the file information of the file further comprises:
1) document name and information extraction:
analyzing a PDF text structure; secondly, extracting each line of text of the home page by using a character recognition technology; finally, fuzzy matching is carried out on the texts in each row and the file name items in the feature library, and the file names are identified;
and (3) extracting other information of the document: according to the name of the document, firstly, searching corresponding content by combining key information of the document in a feature library to form a brief description of the document; secondly, generating the ID number of the file according to the directory code of the file in the feature library; finally, analyzing the file attribute and category;
2) information extraction of images and audiovisual media:
the description part of the image and the audio-visual material is expressed in PDF format, and the PDF text structure of the description part is firstly analyzed; secondly, the character recognition technology is combined with the time, the place and the collected personnel content in the file feature library, and meanwhile, related content is searched from the description part according to the key information in the feature library to form brief descriptions of images and audio-visual media; finally, the ID number of the file is generated according to the directory code in the feature library;
3) information extraction of the copy:
firstly, detecting the edge of a certificate by using an edge detection algorithm, detecting upper, lower, left and right parallel lines of the boundary of the certificate by carrying out Hough change on the edge, analyzing an angle when the certificate is collected according to the slopes of the upper, lower, left and right parallel lines, and carrying out rotation processing on the certificate according to the angle; secondly, extracting the certificate type, the name of the certificate holder and the issuing time information of the rotated copy by applying an OCR technology; finally, the ID number of the file is generated according to the directory code in the feature library;
4) information extraction of the table:
firstly, analyzing a PDF text structure; secondly, extracting each line of text of the home page, carrying out fuzzy matching on each line of text and the file name items in the feature library, and identifying the table name; and finally, extracting key information according to the table name and the characteristics of the table name to form a brief description of the table name and generate an ID number.
Further, the third step of compiling a reading file and an archiving file catalog in combination with the management library specifically includes:
(1) a directory framework:
the criminal case file catalogue frame is designed as follows: police materials, inspection yard materials, court materials, executive materials, prosecutor materials, defendant materials, third party agency materials, audio and video materials, other litigation related materials, and others as first-class catalogs; the legal documents and evidences are secondary catalogues; the file category is a three-level directory; the specific file files are four-level directories, and the directories are compiled according to related items of a file management library;
(2) file catalog authoring:
the case files are divided into archive files and reading files from the use angle, and an archive file catalogue and a reading file catalogue are correspondingly generated; the directory order is determined by the file ID number in the portfolio management library.
Further, the primary catalog indicates the source of the file, and the manufacturing organization generates the file according to the manufacturing organization item of the file management library;
the secondary directory indicates the file type, legal documents and evidences; generating according to the file type items of the file management library;
the third-level directory indicates the file type and is generated according to the type item of the file management library;
the four-level directory is composed of names and abstracts of specific file files, and the file names are generated according to file names in the file management library.
Further, the volume catalog authoring includes:
1) archive catalog authoring:
the archive files refer to all file sets formed in the criminal case handling process, and all file files of the case are summarized without any constraint in a directory; the directory is used for archiving processing, the directory does not contain an abstract part of a four-level directory, and the directory comprises items such as file names of a first-level directory, a second-level directory, a third-level directory and the four-level directory;
2) reading a file catalog compilation:
the reading file is a readable file of related personnel or departments according to the authority, and the catalogue of the readable file only can comprise file files in the reading authority; the catalog mainly comprises a first-level catalog, a second-level catalog, a third-level catalog and a fourth-level catalog meeting reading authority.
Furthermore, the method for cataloguing and authoring the electronic file according to criminal review further comprises the steps of hooking the file, searching the corresponding file according to the file ID number in the file management library and displaying the file.
In summary, the advantages and positive effects of the invention are: with the increasing social impact of the internet, changes need to be made to existing electronic file catalogs. The criminal case first-pass treatment department is mainly composed of court, inspection yard, court, judicial administration and litigation participants (lawyers). Criminal first-pass electronic volume is a collective term for all legal documents and evidences formed by various departments during the handling process, and each legal document or evidence is called a volume file. The number of files is more or less according to the different cases, the number of files is less than two, the number of files is more than one, and the evidence documents are messy. The criminal first-review electronic files can be divided into reading files and filing files from the use angle, wherein the filing files refer to all file sets formed in the criminal case handling process, and the main function of the catalogues is to summarize all file files of the criminal first-review; the reading files are the readable files of related personnel or departments according to the authority, the catalogue of the readable files is mainly used for helping the reading people to know the case in the authority, and the reading people can see the rough case and evidence from the abstract in the reading catalogue.
The invention combines the file and corresponding recording standard provided by public security, inspection institute, court, administrative and judicial organ and litigation participator, and divides the file of the criminal case into public security material, inspection institute material, court material, executive material, self-complainer material, defendant material, third party institution material (arbitration notary), audio and video material, other litigation related material and others, and uses them as first-class catalog. The method is beneficial to knowing the manufacturing organization or source of the specific file from the catalogue, and integrating and recording the file of each department. Makes up the defects of independent editions of the traditional public security (examination paper), inspection yard (examination paper), court (litigation paper) and administrative and judicial authorities (execution paper). Other materials are beneficial to recording of novel materials and are convenient for extending recording technology.
The system integrates and records criminal cases, and is beneficial for readers to look up file documents of different departments. The time and cost for applying for reading the paper from the reader to each department are reduced.
The invention constructs the management library of the files with files, can automatically generate file catalogues with different purposes at any time according to the management library of the files with files without manual intervention, makes up the defects of the traditional recording technology and saves the labor and the cost for generating the file catalogues.
The invention adds file abstract on the traditional catalogue, which is convenient for the reader to quickly know the basic situation and evidence composition of the case from the catalogue, and improves the quality and efficiency of the examination.
The invention combines the traditional file catalogue of the public inspection prison and the corresponding standard to establish a criminal case first-examination file feature library. And a certain foundation is laid for constructing file feature libraries of other types of cases.
The invention promotes the synchronous deep application of the electronic files on case and reduces the burden of legal workers. Supporting the investigation, supervision, court trial and execution flow of criminal cases and improving the case handling quality and efficiency. Paperless case handling is realized, and the full text of the file is displayed in each link as required in time; and the leader and manager of each department are supported to synchronously consult the file files. The invention fully plays the role of criminal examination and parcel catalogs and makes up the defects of the traditional catalogs.
Drawings
Fig. 1 is a flowchart of a method for cataloging electronic files under investigation for criminal review according to an embodiment of the present invention.
Fig. 2 is a flowchart of an implementation of a method for cataloging electronic files under criminal review according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail with reference to the following embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
In view of the problems of the prior art, the present invention provides a method for cataloguing electronic files according to crime trial, which is described in detail below with reference to the accompanying drawings.
As shown in fig. 1, the method for cataloguing electronic files under criminal review provided by the embodiment of the present invention includes the following steps:
s101: analyzing the criminal examination files, extracting the characteristics of the files and constructing a criminal file characteristic library;
s102: classifying and identifying file files of criminal cases, extracting file information according to characteristics, and constructing criminal case file management libraries;
s103: and compiling a reading file and an archiving file catalogue by combining the management library.
The technical solution of the present invention is further described below with reference to the accompanying drawings.
As shown in fig. 2, the method for cataloguing electronic files under criminal review according to the embodiment of the present invention specifically includes the following steps:
firstly, constructing a criminal volume feature library:
criminal cases differ in the number of files in volume according to cases, but they have the following commonalities: i) the file making organization mainly comprises a public security, a monitoring house, a court, an executive material self-complaint person, an defendant, a third-party organization and a prison; ii) the file types are classified into a document class and an evidence class; iii) the same file name is uniform, for example, the arrest of all criminal cases is named as 'arrest'; iv) the portfolio files are divided into primary and secondary volumes; v) the location of the portfolio files in the portfolio directories of different departments must meet the relevant specifications. By integrating the above related specifications of the commonalities of the electronic file files of the criminal first-aid cases and the catalog editions, the invention establishes a file feature library. The feature library mainly comprises the contents of a file making organization, file names, file attributes (primary and secondary volumes), file types (documents and evidences), file categories, directory codes, key information and the like.
The document category is the specific classification of documents in different departments for facilitating paper reading, for example, legal documents of public security are divided into 3 categories: filing jurisdictional documents, investigation documents, enforcement documents, and the like.
The directory code is a file directory number. The item is designed into a digital sequence AABBB according to the file catalogue order specification of a public inspection prison, wherein AA represents file sources (public security materials, inspection institute materials, court materials, self-prosecutor materials, defendee materials, third-party institution materials (arbitration notarization), audio and video materials and other litigation related materials); the BBB indicates the serial numbers of the different file files in the directory.
The key information records the summary information of the file files, the contents of different file files are different, and the same file is named the same but has different contents. And constructing key information of the file according to the reading key points of legal workers on the file. The key information of the retention syndrome includes the detainee, the performer, the date, etc.
Secondly, constructing a case-following criminal file management library:
criminal first-pass volume is a collection of documents formed by all levels of judicial departments and can be broadly divided into police material, inspection yard material, court material, executive material, prosecutor material, defendant material, third party agency material (arbitration notary), audio-visual material and other litigation-related material (identification of litigation participants and commission procedures), and others. These materials are presented in forms that include mainly PDF formatted text, images, audiovisual media, copies, and forms (trial information forms or criminal entry-monitoring forms).
(1) Volume file structure:
the files of the criminal file are large in quantity and various in forms, but the files in the same form have similar writing formats.
The documents of criminal case volume mainly include official documents written by official inspection, self-complainers and text materials of defendees. Documents written by the official survey (legal documents, notes, arbitration notary documents, and attorney) typically have a unified template that includes a header, a body, and a trailer. Header writing manufacturing authority, file name, primary and secondary volumes, and others. The text part states the reason and the "criminal law" clause of offence). The tail part shows the unit, person and date. The headers of documents can be roughly divided into four categories: the first category is the only document names, such as a presentation for retrieval of an evidence report and a commitment application. The second category contains the manufacturing authority, the file name and the letter number, wherein the letter number has a uniform format: such as: x official criminal arrest word (xxx) No. x found (xxx), x method criminal final word No. x. The third category is composed of the manufacturing organization, the file name, the letter number and others. The header of the first criminal investigation decision includes: manufacturing institutions, file names (xxx people's court criminal judgment books), case numbers and others (origin, judging organization, judging mode and judging pass of the terms of the public institutions, the items of the advertisees, the items of the disputes and the cases). The fourth type is that the positive and the secondary volumes are added on the basis of the third type. The text files of the self-appeal person and the advisee are generally composed of a document name and a body text. In a criminal case paperwork volume, the file name is not uniformly specified in a specific position in the paperwork, but all appears in the first page as a separate line.
Images and audiovisual material generally include a description body and associated media material, where the description body writes a description of the source, time, location, acquisition person, and associated content of the media. Copies are typically valid documents issued by the relevant entity, such as identification cards, marriage certificates, driver licenses. The table mainly comprises table names and table contents. The table names all appear in the top page as separate lines.
(2) Extracting file information:
the content of the file is generally expressed by means of texts, images, audio-visual media, copies, tables and the like, and the information of the file in different forms is presented in different ways. In order to extract file information, the invention mainly comprises the following contents:
1) document name and information extraction:
in criminal first examination volume, text is presented in PDF format. In order to extract file information in a text form, firstly analyzing a PDF text structure; secondly, extracting each line of text of the home page by using a character recognition technology; and finally, carrying out fuzzy matching on the texts in each row and the file name items in the feature library to identify the document names.
And (3) extracting other information of the document: according to the name of the document, firstly, searching corresponding content by combining key information of the document in a feature library to form a brief description of the document; secondly, generating the ID number of the file according to the directory code of the file in the feature library; and finally analyzing the file attributes and categories.
2) Information extraction of images and audiovisual media:
the description part of the image and the audio-visual data is expressed in PDF format, the invention firstly analyzes the PDF text structure of the description part; secondly, the text recognition technology is combined with the time, the place, the collection personnel and other contents in the file feature library, and meanwhile, related contents are searched from the description part according to the key information in the feature library to form a brief description of an image and an audio-visual medium; and finally, generating the ID number of the file according to the directory code in the feature library.
3) Information extraction of the copy:
copies are represented in the form of images in a file, which are captured by a copier. In order to make up for the influence of the placement position and angle of the certificate on the information extraction of the copy. The invention firstly uses an edge detection algorithm to detect the edge of the certificate, carries out Hough change on the edge to detect the upper, lower, left and right parallel lines of the certificate boundary, analyzes the angle of the certificate during acquisition according to the slopes of the upper, lower, left and right parallel lines, and carries out rotation processing on the certificate according to the angle; secondly, extracting information such as certificate types, certificate holders' names, issuing time and the like from the rotated copy by applying an OCR technology; and finally, generating the ID number of the file according to the directory code in the feature library.
4) Information extraction of the table:
the table mainly comprises a table name, a table making time and contents, and is expressed in a text format of PDF. In order to extract the table information, firstly analyzing a PDF text structure; secondly, extracting each line of text of the home page, carrying out fuzzy matching on each line of text and the file name items in the feature library, and identifying the table name; and finally, extracting key information according to the table name and the characteristics of the table name to form a brief description of the table name and generate an ID number.
(3) Criminal case volume file management repository:
the criminal case file is mainly used for reading and archiving, and for convenience of management and timely generation of file catalogues with different purposes, the invention establishes a file management library by using MYSQL 8.1. The management library mainly comprises a file making organization, a file name, a file attribute, a file type, a right, a file ID number, a file type and a brief description.
And the manufacturing organization is filled in according to the release department.
And the file name is extracted from the file by utilizing a character recognition technology and is filled in.
And the file attributes are extracted from the file files and filled in by utilizing a character recognition technology. If not, the positive volume is filled out.
And the file category is inquired and filled in from the file feature library according to the file name.
And the authority records the reading authority of the file, and is filled in by a file publisher according to the case situation.
The file ID number not only represents the sequence of the file files in the directory, but also represents the number of the file files in the file warehouse, such as a numerical sequence AABBBCCC, and AABBB inquires in a file feature library according to the file names; CCC denotes the sequence number of the subfile under the same file (such as the detention of different people in a criminal case), and is automatically generated in time sequence.
And the file type is inquired and filled in from the file feature library according to the file name.
Briefly, a summary of a volume file is recorded. And inquiring key information of the file from the file feature library according to the file name, retrieving related contents from the file by using the key information, and finally filling in a corresponding item.
Thirdly, recording with criminal file:
in order to improve the situation that public security, judicial departments at all levels and litigation participants can quickly know the basic situation and evidence constitution of criminal cases from a large number of files at any time. The invention combines the file files provided by the public security, the inspection yard, the court and the litigation participants and the corresponding bibliographic standard to divide the file files of the criminal case into public security materials, inspection yard materials, court materials, executive materials, self-complaining people materials, defendant materials, third party institution materials (arbitration notary), audio and video materials, other litigation related materials and the like. The public security material is formed by the public security organization in the investigation process and is used for making legal documents generally used by the public security organization and evidence formed in the investigation process. The materials for the hospital are manufactured by the hospital and are classified into a main roll (legal documents and external procedures of the hospital) and a sub-roll (report formed by the hospital and internal procedures). Court materials are the legal documents and evidence that the court formed during the trial. The enforcement material includes legal documents and evidence formed by the law administration in the course of the enforcement of court documents. The self-prosecutor material, the advertisee material is the documentation and evidentiary material submitted by the self-prosecutor or the advertisee.
(1) A directory framework:
in order to embody criminal case transaction nodes and file sources formed with cases in a directory, the invention designs a criminal case file directory frame as follows: police material, inspection yard material, court material, executive material, prosecutor material, defendant material, third party agency material (arbitration notary), audio-video material, other litigation-related material, and others as first-class catalogs; the legal documents and evidences are secondary catalogues; the file category is a three-level directory; the specific file is a four-level directory. And compiling the catalog according to the related items of the file management library.
The primary catalog primarily identifies the source of the file, i.e., the production authority, and is generated from the production authority entries of the file repository.
The secondary catalog primarily specifies the type of file, i.e., legal documents and evidence. And generating according to the file type item of the file management library.
The third-level directory mainly indicates the file type of the file and is generated according to the type items of the file management library.
The four-level directory is mainly composed of names and abstracts of specific file files. The file name is generated according to the file name in the file management library. The abstract is to facilitate the reader to quickly know the basic situation and evidence composition of the case from the catalog. The abstract is different from file information, if the file brief description of the effective certificate only comprises name and certificate handling date, some files are more brief descriptions, such as a spot survey record (time for finding or reporting a case, name and unit of a spot protector, arrival time of the spot protector, survey time, survey place, names, jobs and units of commanders and surveyors for spot survey, name, unit and address of the witness, and spot condition). A brief description of the present document is given in abstract form. The content is written only according to the portfolio management library profile.
(2) File catalog authoring:
the case files can be divided into the archiving files and the reading files from the use angle, and the archiving file catalogue and the reading file catalogue are generated correspondingly. The directory order is determined by the file ID number in the volume management library. The specific sequence numbers are: increasing in AA order. In case of the same AA, the BBB sequence is increased. In the same case of AABBB, the order is increased by CCC.
1) Archive catalog authoring:
an archive file refers to the set of all files formed during the criminal case's handling process, the catalog of which summarizes, without any constraint, all file files of the case. The directory is mainly used for archiving processing, does not contain a brief description part of a third-level directory, and mainly comprises items such as file names of a first-level directory, a second-level directory, a third-level directory and a fourth-level directory.
2) Reading a file catalog compilation:
the reading file is a readable file according to the authority of related personnel or departments, and the catalogue of the readable file only can comprise file files in the reading authority. The purpose of the method is mainly to help a reader to see the general case and evidence from a paper reading directory, wherein the directory mainly comprises a primary directory, a secondary directory, a tertiary directory and a quaternary directory (file name and abstract) meeting reading authority.
Fourthly, hanging the file files:
after the reader knows the basic situation and evidence composition of the case according to the directory, he or she generally holds the status of the question to analyze the case and review the file in a targeted manner, and at this time, he or she needs to read the full text of the file. In contrast, the invention retrieves and displays the corresponding file according to the file ID number in the file management library.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents and improvements made within the spirit and principle of the present invention are intended to be included within the scope of the present invention.

Claims (9)

1. A method of authoring a consummate electronic volume catalog, said method comprising the steps of:
analyzing criminal examination files, extracting the characteristics of the files and constructing a criminal file characteristic library;
secondly, classifying and identifying the files of the criminal case to extract file information according to characteristics, and constructing a criminal case management library;
and thirdly, compiling a reading file catalog and an archiving file catalog by combining the management library.
2. The prosecution-compliant criminal review electronic volume catalog authoring method according to claim 1, wherein said first step builds a criminal volume characteristics library: the file management system comprises a file making organization, a file name, a file attribute, a file type, a file category, a directory code and key information;
the file attribute refers to the attribute of the file, namely the primary and secondary volumes;
whether the file type is a file or evidence;
the file category is a specific classification of the file category;
the directory code is a file directory number and is specified according to a file directory sequence of a public inspection prison;
the key information records the summary information of the file, and the key information of the file is constructed according to the reading key points of legal workers on the file.
3. The method for cataloging a prosecuted electronic volume according to claim 1, wherein said second step of constructing a prosecuted electronic volume management library comprises:
(1) volume file structure: the documents of criminal case volume include the text material of the documents written by the official survey, the self-complainers and the defendees; the template of the document comprises a head part, a body part and a tail part; header writing manufacturing organization, file name, primary and secondary volumes, and others; the text part explains the reason and the offending clauses; the tail part of the system is used for writing a undertaking unit, a contractor and a date;
(2) extracting file information: extracting information by means of text, image, audio-visual media, copy and table form expression and by using text, table, image and streaming media processing technology;
(3) the file management library for criminal cases is characterized by comprising the following steps:
MYSQL8.1 is used for establishing a file management library which mainly comprises a file manufacturing organization, file names, file attributes, file types, authorities, file ID numbers, file types and brief descriptions;
the manufacturing organization is filled in according to the release department;
the file name is extracted and filled from the file by utilizing a character recognition technology;
the file attribute is extracted from the file by utilizing a character recognition technology and is filled in;
the file type is inquired and filled in from the file feature library according to the file name;
the authority, which records the reading authority of the file, is filled in by the file publisher according to the case;
the file ID number not only represents the sequence of the file files in the directory, but also represents the number of the file files in the file warehouse;
the file type is inquired and filled in from the file feature library according to the file name;
briefly, the summary of the file is recorded, the key information of the file is inquired from the file feature library according to the file name, the related content is retrieved from the file by using the key information, and finally the corresponding item is filled.
4. The scripted cataloguing method of incident criminal review electronic volume according to claim 3, wherein the headers of said official documents are classified into four categories: the first type has only the name of the document; the second category comprises a manufacturing organization, a file name and a letter number; the third category is composed of manufacturing organs, file names, letter numbers and the like; the fourth type is that the primary volume and the secondary volume are added on the basis of the third type;
the image and audiovisual material consists of two parts: the system comprises a description body and related media data, wherein the description body writes the source, time, place, acquisition personnel and related content description of the media; the copy refers to a valid certificate issued by a relevant unit; the table is composed of table names and table contents; the table names all appear in the top page as separate lines.
5. The prosecution electronic volume catalog authoring method according to claim 3, wherein said volume file information extraction further comprises:
1) document name and information extraction:
analyzing a PDF text structure; secondly, extracting each line of text of the home page by using a character recognition technology; finally, fuzzy matching is carried out on the texts in each row and the file name items in the feature library, and the file names are identified;
and (3) extracting other information of the document: according to the name of the document, firstly, searching corresponding content by combining key information of the document in a feature library to form a brief description of the document; secondly, generating the ID number of the file according to the directory code of the file in the feature library; finally, analyzing the file attribute and category;
2) information extraction of images and audiovisual media:
the description part of the image and the audio-visual material is expressed in PDF format, and the PDF text structure of the description part is firstly analyzed; secondly, the character recognition technology is combined with the time, the place and the collected personnel content in the file feature library, and meanwhile, related content is searched from the description part according to the key information in the feature library to form brief descriptions of images and audio-visual media; finally, the ID number of the file is generated according to the directory code in the feature library;
3) information extraction of the copy:
firstly, detecting the edge of a certificate by using an edge detection algorithm, detecting upper, lower, left and right parallel lines of the boundary of the certificate by carrying out Hough change on the edge, analyzing an angle when the certificate is collected according to the slopes of the upper, lower, left and right parallel lines, and carrying out rotation processing on the certificate according to the angle; secondly, extracting the certificate type, the name of the certificate holder and the issuing time information of the rotated copy by applying an OCR technology; finally, the ID number of the file is generated according to the directory code in the feature library;
4) information extraction of the table:
firstly, analyzing a PDF text structure; secondly, extracting each line of text of the home page, carrying out fuzzy matching on each line of text and the file name items in the feature library, and identifying the table name; and finally, extracting key information according to the table name and the characteristics of the table name to form a brief description of the table name and generate an ID number.
6. The incident criminal review electronic volume catalog authoring method of claim 1, wherein said third step of compiling a reading volume and archiving volume catalog in conjunction with a management library specifically comprises:
(1) a directory framework:
the criminal case file catalogue frame is designed as follows: police materials, inspection yard materials, court materials, executive materials, prosecutor materials, defendant materials, third party agency materials, audio and video materials, other litigation related materials, and others as first-class catalogs; the legal documents and evidences are secondary catalogues; the file category is a three-level directory; the specific file files are four-level directories, and the directories are compiled according to related items of a file management library;
(2) file catalog authoring:
the case files are divided into archive files and reading files from the use angle, and an archive file catalogue and a reading file catalogue are correspondingly generated; the directory order is determined by the file ID number in the portfolio management library.
7. The method for authoring a scripted criminal review electronic volume catalog according to claim 6, wherein said primary catalog identifies the source of the volume file, the production authority, and the production authority terms of the volume file repository;
the secondary directory indicates the file type, legal documents and evidences; generating according to the file type items of the file management library;
the third-level directory indicates the file type and is generated according to the type item of the file management library;
the four-level directory is composed of names and abstracts of specific file files, and the file names are generated according to file names in the file management library.
8. The docket electronic volume catalog authoring method of claim 6, wherein said docket catalog authoring comprises:
1) archive catalog authoring:
the archive files refer to all file sets formed in the criminal case handling process, and all file files of the case are summarized without any constraint in a directory; the directory is used for archiving processing, the directory does not contain an abstract part of a four-level directory, and the directory comprises items such as file names of a first-level directory, a second-level directory, a third-level directory and the four-level directory;
2) reading a file catalog compilation:
the reading file is a readable file of related personnel or departments according to the authority, and the catalogue of the readable file only can comprise file files in the reading authority; the catalog mainly comprises a first-level catalog, a second-level catalog, a third-level catalog and a fourth-level catalog meeting reading authority.
9. The prosecution electronic volume catalog authoring method according to claim 6, further comprising a volume file mount, retrieving and displaying a corresponding volume file according to a file ID number in a volume management library.
CN201910936642.5A 2019-09-29 2019-09-29 Method for cataloging electronic file along with criminal investigation Active CN110675289B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910936642.5A CN110675289B (en) 2019-09-29 2019-09-29 Method for cataloging electronic file along with criminal investigation

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910936642.5A CN110675289B (en) 2019-09-29 2019-09-29 Method for cataloging electronic file along with criminal investigation

Publications (2)

Publication Number Publication Date
CN110675289A true CN110675289A (en) 2020-01-10
CN110675289B CN110675289B (en) 2023-05-05

Family

ID=69080176

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910936642.5A Active CN110675289B (en) 2019-09-29 2019-09-29 Method for cataloging electronic file along with criminal investigation

Country Status (1)

Country Link
CN (1) CN110675289B (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612893A (en) * 2020-12-29 2021-04-06 广西安怡臣信息技术有限公司 Electronic file case generation system
CN113157642A (en) * 2021-03-19 2021-07-23 浪潮云信息技术股份公司 Method for realizing electronic material digital process automation
CN113222788A (en) * 2021-05-17 2021-08-06 广西安怡臣信息技术有限公司 Intelligent marking method
CN113222417A (en) * 2021-05-17 2021-08-06 广西安怡臣信息技术有限公司 Electronic file data factory full-process intelligent application management system
CN113254396A (en) * 2021-06-23 2021-08-13 昌和云科技有限公司 Case collaborative management system for multiple departments
CN113609856A (en) * 2021-07-21 2021-11-05 浙江建达科技股份有限公司 Electronic file reading system based on artificial intelligence and marking tool thereof
CN115391577A (en) * 2022-09-29 2022-11-25 浙江星汉信息技术股份有限公司 Electronic archive management method and system based on machine learning algorithm

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2738368A1 (en) * 1995-09-01 1997-03-07 Finance Christian Design and production of personalised multi-media electronic catalogue
CN101853311A (en) * 2010-06-18 2010-10-06 上海百事通信息技术有限公司 Legal service method and system
CN102955822A (en) * 2011-08-31 2013-03-06 河南新创元信息网络有限公司 Classification-type secretarial document management system and method
CN104636835A (en) * 2013-11-06 2015-05-20 北京航天长峰科技工业集团有限公司 Trans-department case coordination processing system
CN105159968A (en) * 2015-08-25 2015-12-16 浪潮(北京)电子信息产业有限公司 Directory management method for file system and client
CN107085584A (en) * 2016-11-09 2017-08-22 中国长城科技集团股份有限公司 A kind of cloud document management method, system and service end based on content
CN109977073A (en) * 2019-03-11 2019-07-05 厦门纵横集团科技股份有限公司 A kind of law court's electronics folder automation filing system and its method
CN110135715A (en) * 2019-05-06 2019-08-16 江苏新视云科技股份有限公司 A kind of intelligence court management method
CN110209632A (en) * 2019-05-27 2019-09-06 武汉市润普网络科技有限公司 A kind of electronics folder with case production, turn shelves system

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2738368A1 (en) * 1995-09-01 1997-03-07 Finance Christian Design and production of personalised multi-media electronic catalogue
CN101853311A (en) * 2010-06-18 2010-10-06 上海百事通信息技术有限公司 Legal service method and system
CN102955822A (en) * 2011-08-31 2013-03-06 河南新创元信息网络有限公司 Classification-type secretarial document management system and method
CN104636835A (en) * 2013-11-06 2015-05-20 北京航天长峰科技工业集团有限公司 Trans-department case coordination processing system
CN105159968A (en) * 2015-08-25 2015-12-16 浪潮(北京)电子信息产业有限公司 Directory management method for file system and client
CN107085584A (en) * 2016-11-09 2017-08-22 中国长城科技集团股份有限公司 A kind of cloud document management method, system and service end based on content
CN109977073A (en) * 2019-03-11 2019-07-05 厦门纵横集团科技股份有限公司 A kind of law court's electronics folder automation filing system and its method
CN110135715A (en) * 2019-05-06 2019-08-16 江苏新视云科技股份有限公司 A kind of intelligence court management method
CN110209632A (en) * 2019-05-27 2019-09-06 武汉市润普网络科技有限公司 A kind of electronics folder with case production, turn shelves system

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
沈蕾等: "论归档文件整理工作的简化", 《档案学通讯》 *

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112612893A (en) * 2020-12-29 2021-04-06 广西安怡臣信息技术有限公司 Electronic file case generation system
CN113157642A (en) * 2021-03-19 2021-07-23 浪潮云信息技术股份公司 Method for realizing electronic material digital process automation
CN113222788A (en) * 2021-05-17 2021-08-06 广西安怡臣信息技术有限公司 Intelligent marking method
CN113222417A (en) * 2021-05-17 2021-08-06 广西安怡臣信息技术有限公司 Electronic file data factory full-process intelligent application management system
CN113254396A (en) * 2021-06-23 2021-08-13 昌和云科技有限公司 Case collaborative management system for multiple departments
CN113254396B (en) * 2021-06-23 2021-09-24 昌和云科技有限公司 Case collaborative management system for multiple departments
CN113609856A (en) * 2021-07-21 2021-11-05 浙江建达科技股份有限公司 Electronic file reading system based on artificial intelligence and marking tool thereof
CN115391577A (en) * 2022-09-29 2022-11-25 浙江星汉信息技术股份有限公司 Electronic archive management method and system based on machine learning algorithm

Also Published As

Publication number Publication date
CN110675289B (en) 2023-05-05

Similar Documents

Publication Publication Date Title
CN110675289A (en) Method for compiling electronic file catalogue with case criminal review
US8935265B2 (en) Document journaling
An An integrated approach to records management
CN114202319B (en) Archive management system based on mixed metadata scheme
WO2006002179A2 (en) Evaluating the relevance of documents and systems and methods therefor
Prom Making digital curation a systematic institutional function
Hamill Archival arrangement and description: Analog to digital
Davenport et al. How AI Is Improving Data Management
Saman et al. E-court: Information and communication technologies for civil court management
Hampshire et al. The digital world and the future of historical research
US8838543B2 (en) Archiving system that facilitates systematic cataloguing of archived documents for searching and management
Bicknese Institutional Repositories and the Institution's Repository: What Is the Role of University Archives with an Institution's On-line Digital Repository?
Forstrom Managing electronic records in manuscript collections: A case study from the Beinecke Rare Book and Manuscript Library
CN112597763A (en) Method and device for extracting and displaying judicial literature information in association manner and storage medium
King Personal digital archiving for journalists: a “private” solution to a public problem
Bhardwaj et al. Metadata framework for online legal information system in indian environment
Samsa Fabrication in a study about honesty: A lost episode of columbo illustrating how forensic statistics is performed
Dimisyqiyani et al. Using Archival Information System for Effective Retrieval of Document
Lambert et al. Grey Literature, institutional repositories, and the organisational context
Kitchen Data Processing Explained: What Case Teams Should Know
von Mering et al. DiSSCo Prepare Milestone report for MS1. 1 “Corpus of Life science user stories and use cases compiled, gaps identified and new surveys, if any, initiated” and MS1. 2 “Corpus of Earth science user stories and use cases compiled, gaps identified and new surveys, if any, initiated
Emery Document and records management: Understanding the differences and embracing integration
Ellis et al. The State of Artists' Files in Canadian GLAMs and ARCs: Report
Manu et al. Research Data Management Lifecycle: An Overview
Mokhsin et al. Design Requirements on Web-Based Ancestry Platform for Islamic Family Inheritance in Malaysia

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant