CN115858474A - AIGC-based file arrangement system - Google Patents

AIGC-based file arrangement system Download PDF

Info

Publication number
CN115858474A
CN115858474A CN202310165385.6A CN202310165385A CN115858474A CN 115858474 A CN115858474 A CN 115858474A CN 202310165385 A CN202310165385 A CN 202310165385A CN 115858474 A CN115858474 A CN 115858474A
Authority
CN
China
Prior art keywords
classification
module
file
hyperplane
processing unit
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202310165385.6A
Other languages
Chinese (zh)
Other versions
CN115858474B (en
Inventor
张卫平
丁烨
张伟
李显阔
米小武
刘顿
王丹
郑小龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Global Digital Group Co Ltd
Original Assignee
Global Digital Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Global Digital Group Co Ltd filed Critical Global Digital Group Co Ltd
Priority to CN202310165385.6A priority Critical patent/CN115858474B/en
Publication of CN115858474A publication Critical patent/CN115858474A/en
Application granted granted Critical
Publication of CN115858474B publication Critical patent/CN115858474B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention provides an AIGC-based file arrangement system which is characterized by comprising a storage module, an analysis module, a classification module, an application module and an output module, wherein the storage module is used for storing files of users, the analysis module analyzes the files based on AIGC technology, the classification module classifies the files by utilizing information generated by the analysis module, the application module provides application functions for the users, and the output module is used for generating reports for decision making; the system analyzes and formulates classification rules through files for training, and rapidly classifies new files based on the formulated classification rules, so that the effect of rapidly arranging a large number of files can be achieved.

Description

AIGC-based file arrangement system
Technical Field
The invention relates to the field of file system structures, in particular to an AIGC-based file arrangement system.
Background
In daily work and life, a large number of text files can be generated, in order to improve working efficiency, the text files need to be sorted and classified, in a traditional file sorting system, a user needs to manually label each file, sort and classify the files and the like, the workload is large, mistakes are easy to make, an intelligent file sorting system is needed to assist in sorting the files, the time spent on sorting the files is shortened, and the effective utilization rate of working time is improved.
The foregoing discussion of the background art is intended only to facilitate an understanding of the present invention. This discussion is not an acknowledgement or admission that any of the material referred to is part of the common general knowledge.
A number of document collating systems have now been developed and, after a number of searches and references, it has been found that existing collating systems are known as those disclosed in CN110245119B, and these systems generally include: the method comprises the steps that a storage system acquires first information of a first file, wherein the first file is stored in a storage medium in a data fragment mode, the first information comprises N data fragments of the first file and the number of unit storage areas occupied by each data fragment in the storage medium, the unit storage areas occupied by two of the N data fragments are discontinuous, and N is an integer greater than or equal to 2; the storage system determines a first parameter of the first file according to the first information, wherein the first parameter is used for representing the aggregation degree of the position of the first file stored in the storage medium. However, the system only arranges files in storage, and cannot automatically classify a large number of files, so that the working efficiency cannot be improved.
Disclosure of Invention
The invention aims to provide a file arrangement system based on AIGC (advanced information consumer computer) aiming at the defects.
The invention adopts the following technical scheme:
a file arrangement system based on AIGC comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code;
furthermore, the analysis module comprises a vocabulary statistics unit, a word frequency matrix processing unit, a characteristic processing unit and a classification analysis unit, wherein the vocabulary statistics unit creates a vocabulary table according to input text files and performs vocabulary statistics on each text file based on the vocabulary table, the word frequency matrix processing unit generates an m × n matrix according to a statistical result, m represents the number of the files, n represents the number of vocabularies in the vocabulary table, the characteristic processing unit obtains n-dimensional characteristic numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the characteristic numbers and classification codes of the text files;
further, the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
Figure SMS_1
wherein ,
Figure SMS_2
the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
by
Figure SMS_3
The characteristic number of the x-th file is formed;
the hyperplane A set by the classification analysis unit is as follows:
Figure SMS_4
wherein ,
Figure SMS_5
is out of plane coefficient>
Figure SMS_6
Is a hyperplane element, b is a threshold;
the hyperplane A satisfies:
Figure SMS_7
wherein ,
Figure SMS_8
represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>
Figure SMS_9
Representing the file serial numbers corresponding to the second class of characteristic numbers;
furthermore, the classification module comprises a text processing unit, a calculation processing unit and an execution unit, wherein the text processing unit is used for counting the occurrence frequency of specific words in a text file, the calculation processing unit is used for executing calculation tasks, and the execution unit classifies and stores texts into corresponding storage areas according to the calculation results of the calculation processing unit;
further, the calculation processing unit comprises a characteristic number calculation processor and a hyperplane calculation processor, and the characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula
Figure SMS_10
Figure SMS_11
wherein ,
Figure SMS_12
the y-th vocabulary frequency obtained by statistics of the text processing unit;
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
Figure SMS_13
the execution unit is based on
Figure SMS_14
The positive and negative of the text file are determined, the value of the corresponding digit on the classification code is obtained, and the execution unit stores the text file to the corresponding classification area according to the classification code.
The beneficial effects obtained by the invention are as follows:
the system identifies the main body, emotion, key words and other information of the document by analyzing the vocabulary, grammar and semantic information in the text, then calculates the importance of each word, converts the document into the feature number of an n-dimensional vector, and creates the hyperplane based on the feature number, the number of the hyperplane determines the upper limit of the classification number capable of being classified, the system can automatically label and classify the document by utilizing the hyperplane, and the efficiency and the accuracy are greatly improved.
For a better understanding of the features and technical content of the present invention, reference should be made to the following detailed description of the invention and accompanying drawings, which are provided for purposes of illustration and description only and are not intended to limit the invention.
Drawings
FIG. 1 is a schematic view of the overall structural framework of the present invention;
FIG. 2 is a schematic diagram of the analysis module according to the present invention;
FIG. 3 is a schematic diagram of the classification module according to the present invention;
FIG. 4 is a schematic diagram of a computing unit according to the present invention;
FIG. 5 is a schematic flow chart of the present invention for setting a hyperplane.
Detailed Description
The following embodiments are provided to illustrate the present invention by specific examples, and those skilled in the art will be able to understand the advantages and effects of the present invention from the disclosure of the present specification. The invention is capable of other and different embodiments and its several details are capable of modification in various other respects, all without departing from the spirit and scope of the present invention. The drawings of the present invention are for illustrative purposes only and are not intended to be drawn to scale. The following embodiments will further explain the related art of the present invention in detail, but the disclosure is not intended to limit the scope of the present invention.
The first embodiment is as follows:
the embodiment provides an AIGC-based file sorting system, which is combined with a figure 1 and comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code;
the analysis module comprises a vocabulary statistic unit, a word frequency matrix processing unit, a feature processing unit and a classification analysis unit, wherein the vocabulary statistic unit creates a vocabulary list according to input text files and performs vocabulary statistics on each text file based on the vocabulary list, the word frequency matrix processing unit generates an m-n matrix according to a statistical result, m represents the number of the files, n represents the number of vocabularies in the vocabulary list, the feature processing unit obtains n-dimensional feature numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the feature numbers and classification codes of the text files;
the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
Figure SMS_15
wherein ,
Figure SMS_16
the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
by
Figure SMS_17
The characteristic number of the x-th file is formed;
the hyperplane A set by the classification analysis unit is as follows:
Figure SMS_18
wherein ,
Figure SMS_19
is out of plane coefficient>
Figure SMS_20
Is a hyperplane element, b is a threshold;
the hyperplane A satisfies:
Figure SMS_21
wherein ,
Figure SMS_22
represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>
Figure SMS_23
Representing the file sequence numbers corresponding to the second class of characteristic numbers;
the classification module comprises a text processing unit, a calculation processing unit and an execution unit, wherein the text processing unit is used for counting the occurrence frequency of specific words in a text file, the calculation processing unit is used for executing calculation tasks, and the execution unit classifies and stores texts into corresponding storage areas according to the calculation results of the calculation processing unit;
the calculation processing unit comprises a characteristic number calculation processor and a hyperplane calculation processor, and the characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula
Figure SMS_24
Figure SMS_25
wherein ,
Figure SMS_26
the y-th vocabulary frequency obtained by statistics of the text processing unit;
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
Figure SMS_27
the execution unit is based on
Figure SMS_28
The positive and negative of the code determines the value of the corresponding digit on the classification code, and obtains the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code.
The second embodiment:
the embodiment includes all contents in the first embodiment, and provides an AIGC-based file arrangement system, which comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology to identify information such as topics and keywords of the file contents, the classification module classifies the files by using the information generated by the analysis module, the application module provides multiple application functions for the user, and the output module is used for generating a report for decision making and presenting the report to the user in a proper form;
with reference to fig. 2, the analysis module includes a vocabulary statistics unit, a word frequency matrix processing unit, a feature processing unit, and a classification analysis unit, where the vocabulary statistics unit is capable of processing each input text file, the processing process includes decomposition, word removal, extraction, and statistics, the decomposition process is to decompose the whole text into a plurality of phrases, the word removal process is to delete connective words, the extraction process is to extract stems from the phrases, the statistics process is to count the occurrence frequency of each stem, and the vocabulary statistics unit creates a vocabulary table for recording the occurring vocabularies according to the extracted stems;
the word frequency matrix processing unit generates an m x n matrix according to the statistical result, wherein m represents the number of files, n represents the number of words in a vocabulary table, each row represents one file, each column represents one word, and each element represents the frequency of the word in the file;
the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
Figure SMS_29
wherein ,
Figure SMS_30
the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
by
Figure SMS_31
The characteristic number of the x-th file is formed;
the classification analysis unit is set up
Figure SMS_32
A super plane to realize the pair->
Figure SMS_33
Classification analysis of documents of types, the type of which is &>
Figure SMS_34
Binary number representation of bits, wherein the binary number is called a classification code, each hyperplane corresponds to one bit of the classification code, the classification code is sent to the analysis module along with the text file, and after the text file is processed to obtain a feature number, the classification code and the corresponding feature number are matched and sent to the classification analysis unit;
referring to fig. 5, the process of setting a hyperplane by the classification analysis unit includes the following steps:
s1, determining the number of classification code bits corresponding to a hyperplane, namely target bits;
s2, classifying the classified codes into two classes according to the value of the target bit, wherein the classified code with the value of 1 of the target bit is a first class code, and the classified code with the value of 0 of the target bit is a second class code;
s3, classifying the feature numbers according to the classification result of the classification codes in the step S2, namely, the feature numbers corresponding to the first class codes are first class feature numbers, and the feature numbers corresponding to the second class codes are second class feature numbers;
s4, setting a hyperplane A:
Figure SMS_35
wherein ,
Figure SMS_36
for super-plane coefficients>
Figure SMS_37
Is a hyperplane element, b is a threshold;
the hyperplane A satisfies:
Figure SMS_38
wherein ,
Figure SMS_39
represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>
Figure SMS_40
Representing the file serial numbers corresponding to the second class of characteristic numbers, and determining the corresponding relation between the positive and negative properties and the values of 0 and 1 in the form of an inequality satisfied;
s5, adjusting the numerical value of the threshold b to ensure that the shortest distance from the first-class characteristic number to the hyperplane is equal to the shortest distance from the second-class characteristic number to the hyperplane, and recording the threshold at the moment as
Figure SMS_41
And finally obtaining the determined hyperplane:
Figure SMS_42
number of features
Figure SMS_43
The distance to the hyperplane is formulated as:
Figure SMS_44
the classification analysis unit is based onThe process is set to obtain
Figure SMS_45
The hyperplane information is sent to the classification module, and the hyperplane information comprises the corresponding relation between the positive and negative characters and 0 and 1;
with reference to fig. 3, the classification module includes a text processing unit, a calculation processing unit, and an execution unit, where the text processing unit is configured to count occurrence frequency of a specific vocabulary in a text file, the calculation processing unit is configured to execute a calculation task, and the execution unit classifies and stores a text in a corresponding storage area according to a calculation result of the calculation processing unit;
the text processing unit stores the vocabulary table created by the vocabulary counting unit and counts based on the vocabulary recorded in the vocabulary table;
with reference to fig. 4, the calculation processing unit includes an eigenvalue calculation processor and a hyperplane calculation processor, and the eigenvalue calculation processor calculates the classification eigenvalue of the document to be classified according to the following formula
Figure SMS_46
Figure SMS_47
wherein ,
Figure SMS_48
the y-th vocabulary frequency obtained by statistics of the text processing unit is obtained;
the hyperplane computation processor is based on
Figure SMS_49
The classification characteristic number is processed by each hyperplane, and the calculation result of each hyperplane is as follows:
Figure SMS_50
the execution unit is based on Ns
Figure SMS_51
The positive and negative of the text file are used for determining the value of the corresponding digit on the classification code and obtaining the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code;
the storage module comprises a storage unit and a logic unit, wherein the storage unit is divided into Ns +1 storage areas, one storage area is used for storing the text files to be classified and is called a temporary area, the other Ns storage areas are used for storing the classified text files and are called classification areas, and the logic unit is used for recording the logic relation among the Ns classification areas;
the application module is used for providing an application interface, the application interface comprises a classification interface and a report interface, the classification interface is connected with the temporary area, the input text file is stored to the temporary area through the classification interface, the classification module is started to store the input text file in a classification mode after the temporary area receives a new file, the report interface is used for receiving time information and counting the newly added text file in the time information, and the proportion of various classification files increased in a corresponding time period is obtained.
The above disclosure is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, so that all the modifications and equivalents of the technical changes and equivalents made by the disclosure and drawings are included in the scope of the present invention, and the elements thereof may be updated as the technology develops.

Claims (5)

1. A file arrangement system based on AIGC is characterized by comprising a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code.
2. The AIGC-based filing system of claim 1, wherein the analysis module comprises a vocabulary statistics unit, a word frequency matrix processing unit, a feature processing unit and a classification analysis unit, the vocabulary statistics unit creates a vocabulary table from the inputted text files and performs vocabulary statistics on each text file based on the vocabulary table, the word frequency matrix processing unit generates an m x n matrix according to the statistics result, wherein m represents the number of files, n represents the number of vocabularies in the vocabulary table, the feature processing unit obtains n-dimensional feature numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the feature numbers and the classification codes of the text files.
3. The AIGC-based filing system of claim 2, wherein the feature processing unit performs calculation processing based on element values in the matrix according to the following formula:
Figure QLYQS_1
wherein ,
Figure QLYQS_2
the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
by
Figure QLYQS_3
The characteristic number of the x-th file is formed;
the hyperplane A set by the classification analysis unit is as follows:
Figure QLYQS_4
wherein ,
Figure QLYQS_5
is out of plane coefficient>
Figure QLYQS_6
Is a hyperplane element, b is a threshold;
the hyperplane A satisfies:
Figure QLYQS_7
wherein ,
Figure QLYQS_8
represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>
Figure QLYQS_9
And the file sequence numbers corresponding to the two types of characteristic numbers are shown.
4. The AIGC-based file collating system according to claim 3, wherein the classifying module includes a text processing unit, a calculation processing unit and an executing unit, the text processing unit is configured to count the occurrence frequency of a specific word in a text file, the calculation processing unit is configured to execute a calculation task, and the executing unit classifies and stores the text in a corresponding storage area according to the calculation result of the calculation processing unit.
5. The AIGC-based filing system of claim 4, wherein the calculation processing unit comprises an eigenvalue calculation processor and a hyperplane calculation processorThe characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula
Figure QLYQS_10
Figure QLYQS_11
wherein ,
Figure QLYQS_12
the y-th vocabulary frequency obtained by statistics of the text processing unit;
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
Figure QLYQS_13
the execution unit is based on
Figure QLYQS_14
The positive and negative of the code determines the value of the corresponding digit on the classification code, and obtains the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code. />
CN202310165385.6A 2023-02-27 2023-02-27 File arrangement system based on AIGC Active CN115858474B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310165385.6A CN115858474B (en) 2023-02-27 2023-02-27 File arrangement system based on AIGC

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310165385.6A CN115858474B (en) 2023-02-27 2023-02-27 File arrangement system based on AIGC

Publications (2)

Publication Number Publication Date
CN115858474A true CN115858474A (en) 2023-03-28
CN115858474B CN115858474B (en) 2023-05-09

Family

ID=85658908

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310165385.6A Active CN115858474B (en) 2023-02-27 2023-02-27 File arrangement system based on AIGC

Country Status (1)

Country Link
CN (1) CN115858474B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116306317A (en) * 2023-05-12 2023-06-23 环球数科集团有限公司 Automatic AIGC modeling system based on artificial intelligence
CN116821319A (en) * 2023-08-30 2023-09-29 环球数科集团有限公司 Quick screening type processing system based on AIGC

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120166366A1 (en) * 2010-12-22 2012-06-28 Microsoft Corporation Hierarchical classification system
CN106407406A (en) * 2016-09-22 2017-02-15 国信优易数据有限公司 A text processing method and system
US20170075974A1 (en) * 2015-09-11 2017-03-16 Adobe Systems Incorporated Categorization of forms to aid in form search
CN107577738A (en) * 2017-08-28 2018-01-12 电子科技大学 A kind of FMECA method by SVM text mining processing datas
US20200026798A1 (en) * 2018-07-20 2020-01-23 EMC IP Holding Company LLC Identification and curation of application programming interface data from different sources
CN111125359A (en) * 2019-12-17 2020-05-08 东软集团股份有限公司 Text information classification method, device and equipment
CN114265937A (en) * 2021-12-24 2022-04-01 中国电力科学研究院有限公司 Intelligent classification analysis method and system of scientific and technological information, storage medium and server
CN114610884A (en) * 2022-03-07 2022-06-10 中国人民解放军63893部队 Classification method based on PCA combined feature extraction and approximate support vector machine

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120166366A1 (en) * 2010-12-22 2012-06-28 Microsoft Corporation Hierarchical classification system
US20170075974A1 (en) * 2015-09-11 2017-03-16 Adobe Systems Incorporated Categorization of forms to aid in form search
CN106407406A (en) * 2016-09-22 2017-02-15 国信优易数据有限公司 A text processing method and system
CN107577738A (en) * 2017-08-28 2018-01-12 电子科技大学 A kind of FMECA method by SVM text mining processing datas
US20200026798A1 (en) * 2018-07-20 2020-01-23 EMC IP Holding Company LLC Identification and curation of application programming interface data from different sources
CN111125359A (en) * 2019-12-17 2020-05-08 东软集团股份有限公司 Text information classification method, device and equipment
CN114265937A (en) * 2021-12-24 2022-04-01 中国电力科学研究院有限公司 Intelligent classification analysis method and system of scientific and technological information, storage medium and server
CN114610884A (en) * 2022-03-07 2022-06-10 中国人民解放军63893部队 Classification method based on PCA combined feature extraction and approximate support vector machine

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116306317A (en) * 2023-05-12 2023-06-23 环球数科集团有限公司 Automatic AIGC modeling system based on artificial intelligence
CN116821319A (en) * 2023-08-30 2023-09-29 环球数科集团有限公司 Quick screening type processing system based on AIGC
CN116821319B (en) * 2023-08-30 2023-10-27 环球数科集团有限公司 Quick screening type processing system based on AIGC

Also Published As

Publication number Publication date
CN115858474B (en) 2023-05-09

Similar Documents

Publication Publication Date Title
CN115858474B (en) File arrangement system based on AIGC
CN110825877A (en) Semantic similarity analysis method based on text clustering
CN106407406B (en) text processing method and system
CN108573045A (en) A kind of alignment matrix similarity retrieval method based on multistage fingerprint
US9501557B2 (en) Information generating computer product, apparatus, and method; and information search computer product, apparatus, and method
CN103995876A (en) Text classification method based on chi square statistics and SMO algorithm
CN107168956B (en) Chinese chapter structure analysis method and system based on pipeline
CN109993216B (en) Text classification method and device based on K nearest neighbor KNN
WO2020063071A1 (en) Sentence vector calculation method based on chi-square test, and text classification method and system
Farhoodi et al. Applying machine learning algorithms for automatic Persian text classification
CN115098690B (en) Multi-data document classification method and system based on cluster analysis
WO2018213783A1 (en) Computerized methods of data compression and analysis
CN112632969B (en) Incremental industry dictionary updating method and system
CN114266256A (en) Method and system for extracting new words in field
Elhassan et al. Arabic text classification on full word
CN111930936A (en) Method and system for excavating platform message text
CN113051462A (en) Multi-classification model training method, system and device
CN117291192B (en) Government affair text semantic understanding analysis method and system
Hamdi et al. Machine learning vs deterministic rule-based system for document stream segmentation
Elhassan et al. The impact of feature selection methods for classifying Arabic texts
Sudha Semi supervised multi text classifications for telugu documents
CN116186067A (en) Industrial data table storage query method and equipment
CN109815475B (en) Text matching method and device, computing equipment and system
CN111341404B (en) Electronic medical record data set analysis method and system based on ernie model
Packiam et al. A Novel Integrated Framework Based on Modular Optimization for Efficient Analytics on Twitter Big Data

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant