CN115858474A - AIGC-based file arrangement system - Google Patents
AIGC-based file arrangement system Download PDFInfo
- Publication number
- CN115858474A CN115858474A CN202310165385.6A CN202310165385A CN115858474A CN 115858474 A CN115858474 A CN 115858474A CN 202310165385 A CN202310165385 A CN 202310165385A CN 115858474 A CN115858474 A CN 115858474A
- Authority
- CN
- China
- Prior art keywords
- classification
- module
- file
- hyperplane
- processing unit
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000004458 analytical method Methods 0.000 claims abstract description 24
- 238000005516 engineering process Methods 0.000 claims abstract description 6
- 238000004364 calculation method Methods 0.000 claims description 47
- 239000011159 matrix material Substances 0.000 claims description 23
- 238000010224 classification analysis Methods 0.000 claims description 15
- 238000000034 method Methods 0.000 claims description 13
- 230000008569 process Effects 0.000 claims description 12
- 230000000694 effects Effects 0.000 abstract description 2
- 239000012634 fragment Substances 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000000354 decomposition reaction Methods 0.000 description 2
- 238000000605 extraction Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000002776 aggregation Effects 0.000 description 1
- 238000004220 aggregation Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000008451 emotion Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
Images
Classifications
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02D—CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
- Y02D10/00—Energy efficient computing, e.g. low power processors, power management or thermal management
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention provides an AIGC-based file arrangement system which is characterized by comprising a storage module, an analysis module, a classification module, an application module and an output module, wherein the storage module is used for storing files of users, the analysis module analyzes the files based on AIGC technology, the classification module classifies the files by utilizing information generated by the analysis module, the application module provides application functions for the users, and the output module is used for generating reports for decision making; the system analyzes and formulates classification rules through files for training, and rapidly classifies new files based on the formulated classification rules, so that the effect of rapidly arranging a large number of files can be achieved.
Description
Technical Field
The invention relates to the field of file system structures, in particular to an AIGC-based file arrangement system.
Background
In daily work and life, a large number of text files can be generated, in order to improve working efficiency, the text files need to be sorted and classified, in a traditional file sorting system, a user needs to manually label each file, sort and classify the files and the like, the workload is large, mistakes are easy to make, an intelligent file sorting system is needed to assist in sorting the files, the time spent on sorting the files is shortened, and the effective utilization rate of working time is improved.
The foregoing discussion of the background art is intended only to facilitate an understanding of the present invention. This discussion is not an acknowledgement or admission that any of the material referred to is part of the common general knowledge.
A number of document collating systems have now been developed and, after a number of searches and references, it has been found that existing collating systems are known as those disclosed in CN110245119B, and these systems generally include: the method comprises the steps that a storage system acquires first information of a first file, wherein the first file is stored in a storage medium in a data fragment mode, the first information comprises N data fragments of the first file and the number of unit storage areas occupied by each data fragment in the storage medium, the unit storage areas occupied by two of the N data fragments are discontinuous, and N is an integer greater than or equal to 2; the storage system determines a first parameter of the first file according to the first information, wherein the first parameter is used for representing the aggregation degree of the position of the first file stored in the storage medium. However, the system only arranges files in storage, and cannot automatically classify a large number of files, so that the working efficiency cannot be improved.
Disclosure of Invention
The invention aims to provide a file arrangement system based on AIGC (advanced information consumer computer) aiming at the defects.
The invention adopts the following technical scheme:
a file arrangement system based on AIGC comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code;
furthermore, the analysis module comprises a vocabulary statistics unit, a word frequency matrix processing unit, a characteristic processing unit and a classification analysis unit, wherein the vocabulary statistics unit creates a vocabulary table according to input text files and performs vocabulary statistics on each text file based on the vocabulary table, the word frequency matrix processing unit generates an m × n matrix according to a statistical result, m represents the number of the files, n represents the number of vocabularies in the vocabulary table, the characteristic processing unit obtains n-dimensional characteristic numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the characteristic numbers and classification codes of the text files;
further, the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
wherein ,the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
the hyperplane A set by the classification analysis unit is as follows:
the hyperplane A satisfies:
wherein ,represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>Representing the file serial numbers corresponding to the second class of characteristic numbers;
furthermore, the classification module comprises a text processing unit, a calculation processing unit and an execution unit, wherein the text processing unit is used for counting the occurrence frequency of specific words in a text file, the calculation processing unit is used for executing calculation tasks, and the execution unit classifies and stores texts into corresponding storage areas according to the calculation results of the calculation processing unit;
further, the calculation processing unit comprises a characteristic number calculation processor and a hyperplane calculation processor, and the characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula:
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
the execution unit is based onThe positive and negative of the text file are determined, the value of the corresponding digit on the classification code is obtained, and the execution unit stores the text file to the corresponding classification area according to the classification code.
The beneficial effects obtained by the invention are as follows:
the system identifies the main body, emotion, key words and other information of the document by analyzing the vocabulary, grammar and semantic information in the text, then calculates the importance of each word, converts the document into the feature number of an n-dimensional vector, and creates the hyperplane based on the feature number, the number of the hyperplane determines the upper limit of the classification number capable of being classified, the system can automatically label and classify the document by utilizing the hyperplane, and the efficiency and the accuracy are greatly improved.
For a better understanding of the features and technical content of the present invention, reference should be made to the following detailed description of the invention and accompanying drawings, which are provided for purposes of illustration and description only and are not intended to limit the invention.
Drawings
FIG. 1 is a schematic view of the overall structural framework of the present invention;
FIG. 2 is a schematic diagram of the analysis module according to the present invention;
FIG. 3 is a schematic diagram of the classification module according to the present invention;
FIG. 4 is a schematic diagram of a computing unit according to the present invention;
FIG. 5 is a schematic flow chart of the present invention for setting a hyperplane.
Detailed Description
The following embodiments are provided to illustrate the present invention by specific examples, and those skilled in the art will be able to understand the advantages and effects of the present invention from the disclosure of the present specification. The invention is capable of other and different embodiments and its several details are capable of modification in various other respects, all without departing from the spirit and scope of the present invention. The drawings of the present invention are for illustrative purposes only and are not intended to be drawn to scale. The following embodiments will further explain the related art of the present invention in detail, but the disclosure is not intended to limit the scope of the present invention.
The first embodiment is as follows:
the embodiment provides an AIGC-based file sorting system, which is combined with a figure 1 and comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code;
the analysis module comprises a vocabulary statistic unit, a word frequency matrix processing unit, a feature processing unit and a classification analysis unit, wherein the vocabulary statistic unit creates a vocabulary list according to input text files and performs vocabulary statistics on each text file based on the vocabulary list, the word frequency matrix processing unit generates an m-n matrix according to a statistical result, m represents the number of the files, n represents the number of vocabularies in the vocabulary list, the feature processing unit obtains n-dimensional feature numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the feature numbers and classification codes of the text files;
the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
wherein ,the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
the hyperplane A set by the classification analysis unit is as follows:
the hyperplane A satisfies:
wherein ,represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>Representing the file sequence numbers corresponding to the second class of characteristic numbers;
the classification module comprises a text processing unit, a calculation processing unit and an execution unit, wherein the text processing unit is used for counting the occurrence frequency of specific words in a text file, the calculation processing unit is used for executing calculation tasks, and the execution unit classifies and stores texts into corresponding storage areas according to the calculation results of the calculation processing unit;
the calculation processing unit comprises a characteristic number calculation processor and a hyperplane calculation processor, and the characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula:
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
the execution unit is based onThe positive and negative of the code determines the value of the corresponding digit on the classification code, and obtains the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code.
The second embodiment:
the embodiment includes all contents in the first embodiment, and provides an AIGC-based file arrangement system, which comprises a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology to identify information such as topics and keywords of the file contents, the classification module classifies the files by using the information generated by the analysis module, the application module provides multiple application functions for the user, and the output module is used for generating a report for decision making and presenting the report to the user in a proper form;
with reference to fig. 2, the analysis module includes a vocabulary statistics unit, a word frequency matrix processing unit, a feature processing unit, and a classification analysis unit, where the vocabulary statistics unit is capable of processing each input text file, the processing process includes decomposition, word removal, extraction, and statistics, the decomposition process is to decompose the whole text into a plurality of phrases, the word removal process is to delete connective words, the extraction process is to extract stems from the phrases, the statistics process is to count the occurrence frequency of each stem, and the vocabulary statistics unit creates a vocabulary table for recording the occurring vocabularies according to the extracted stems;
the word frequency matrix processing unit generates an m x n matrix according to the statistical result, wherein m represents the number of files, n represents the number of words in a vocabulary table, each row represents one file, each column represents one word, and each element represents the frequency of the word in the file;
the feature processing unit performs calculation processing according to the following equation based on the element values in the matrix:
wherein ,the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
the classification analysis unit is set upA super plane to realize the pair->Classification analysis of documents of types, the type of which is &>Binary number representation of bits, wherein the binary number is called a classification code, each hyperplane corresponds to one bit of the classification code, the classification code is sent to the analysis module along with the text file, and after the text file is processed to obtain a feature number, the classification code and the corresponding feature number are matched and sent to the classification analysis unit;
referring to fig. 5, the process of setting a hyperplane by the classification analysis unit includes the following steps:
s1, determining the number of classification code bits corresponding to a hyperplane, namely target bits;
s2, classifying the classified codes into two classes according to the value of the target bit, wherein the classified code with the value of 1 of the target bit is a first class code, and the classified code with the value of 0 of the target bit is a second class code;
s3, classifying the feature numbers according to the classification result of the classification codes in the step S2, namely, the feature numbers corresponding to the first class codes are first class feature numbers, and the feature numbers corresponding to the second class codes are second class feature numbers;
s4, setting a hyperplane A:
the hyperplane A satisfies:
wherein ,represents the file serial number corresponding to one type of characteristic number, and/or the file serial number corresponding to one type of characteristic number>Representing the file serial numbers corresponding to the second class of characteristic numbers, and determining the corresponding relation between the positive and negative properties and the values of 0 and 1 in the form of an inequality satisfied;
s5, adjusting the numerical value of the threshold b to ensure that the shortest distance from the first-class characteristic number to the hyperplane is equal to the shortest distance from the second-class characteristic number to the hyperplane, and recording the threshold at the moment asAnd finally obtaining the determined hyperplane:
the classification analysis unit is based onThe process is set to obtainThe hyperplane information is sent to the classification module, and the hyperplane information comprises the corresponding relation between the positive and negative characters and 0 and 1;
with reference to fig. 3, the classification module includes a text processing unit, a calculation processing unit, and an execution unit, where the text processing unit is configured to count occurrence frequency of a specific vocabulary in a text file, the calculation processing unit is configured to execute a calculation task, and the execution unit classifies and stores a text in a corresponding storage area according to a calculation result of the calculation processing unit;
the text processing unit stores the vocabulary table created by the vocabulary counting unit and counts based on the vocabulary recorded in the vocabulary table;
with reference to fig. 4, the calculation processing unit includes an eigenvalue calculation processor and a hyperplane calculation processor, and the eigenvalue calculation processor calculates the classification eigenvalue of the document to be classified according to the following formula:
wherein ,the y-th vocabulary frequency obtained by statistics of the text processing unit is obtained;
the hyperplane computation processor is based onThe classification characteristic number is processed by each hyperplane, and the calculation result of each hyperplane is as follows:
the execution unit is based on NsThe positive and negative of the text file are used for determining the value of the corresponding digit on the classification code and obtaining the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code;
the storage module comprises a storage unit and a logic unit, wherein the storage unit is divided into Ns +1 storage areas, one storage area is used for storing the text files to be classified and is called a temporary area, the other Ns storage areas are used for storing the classified text files and are called classification areas, and the logic unit is used for recording the logic relation among the Ns classification areas;
the application module is used for providing an application interface, the application interface comprises a classification interface and a report interface, the classification interface is connected with the temporary area, the input text file is stored to the temporary area through the classification interface, the classification module is started to store the input text file in a classification mode after the temporary area receives a new file, the report interface is used for receiving time information and counting the newly added text file in the time information, and the proportion of various classification files increased in a corresponding time period is obtained.
The above disclosure is only a preferred embodiment of the present invention, and is not intended to limit the scope of the present invention, so that all the modifications and equivalents of the technical changes and equivalents made by the disclosure and drawings are included in the scope of the present invention, and the elements thereof may be updated as the technology develops.
Claims (5)
1. A file arrangement system based on AIGC is characterized by comprising a storage module, an analysis module, a classification module, an application module and an output module;
the storage module is used for storing files of a user, the analysis module analyzes the files based on an AIGC technology, the classification module classifies the files by using information generated by the analysis module, the application module provides application functions for the user, and the output module is used for generating a report for decision making;
the analysis module receives a file for training, converts the file into characteristic information, creates a hyperplane based on the characteristic information, the hyperplane information is sent to the classification module, the classification module calculates and processes the file to be classified based on the hyperplane to obtain a classification code, each bit of the classification code corresponds to one hyperplane, and the classification module stores the file to be classified to a corresponding area of the storage module according to the classification code.
2. The AIGC-based filing system of claim 1, wherein the analysis module comprises a vocabulary statistics unit, a word frequency matrix processing unit, a feature processing unit and a classification analysis unit, the vocabulary statistics unit creates a vocabulary table from the inputted text files and performs vocabulary statistics on each text file based on the vocabulary table, the word frequency matrix processing unit generates an m x n matrix according to the statistics result, wherein m represents the number of files, n represents the number of vocabularies in the vocabulary table, the feature processing unit obtains n-dimensional feature numbers of each file according to the word frequency matrix processing, and the classification analysis unit creates a hyperplane based on the feature numbers and the classification codes of the text files.
3. The AIGC-based filing system of claim 2, wherein the feature processing unit performs calculation processing based on element values in the matrix according to the following formula:
wherein ,the element values of the x-th row and the y-th column in the matrix are represented, i represents a variable of the row number, and j represents a variable of the column number;
the hyperplane A set by the classification analysis unit is as follows:
the hyperplane A satisfies:
4. The AIGC-based file collating system according to claim 3, wherein the classifying module includes a text processing unit, a calculation processing unit and an executing unit, the text processing unit is configured to count the occurrence frequency of a specific word in a text file, the calculation processing unit is configured to execute a calculation task, and the executing unit classifies and stores the text in a corresponding storage area according to the calculation result of the calculation processing unit.
5. The AIGC-based filing system of claim 4, wherein the calculation processing unit comprises an eigenvalue calculation processor and a hyperplane calculation processorThe characteristic number calculation processor calculates the classification characteristic number of the file to be classified according to the following formula:
the hyperplane calculation processor processes the classification characteristic number according to the hyperplane, and the calculation result of each hyperplane is as follows:
the execution unit is based onThe positive and negative of the code determines the value of the corresponding digit on the classification code, and obtains the classification code, and the execution unit stores the text file to the corresponding classification area according to the classification code. />
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310165385.6A CN115858474B (en) | 2023-02-27 | 2023-02-27 | File arrangement system based on AIGC |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202310165385.6A CN115858474B (en) | 2023-02-27 | 2023-02-27 | File arrangement system based on AIGC |
Publications (2)
Publication Number | Publication Date |
---|---|
CN115858474A true CN115858474A (en) | 2023-03-28 |
CN115858474B CN115858474B (en) | 2023-05-09 |
Family
ID=85658908
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202310165385.6A Active CN115858474B (en) | 2023-02-27 | 2023-02-27 | File arrangement system based on AIGC |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN115858474B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116306317A (en) * | 2023-05-12 | 2023-06-23 | 环球数科集团有限公司 | Automatic AIGC modeling system based on artificial intelligence |
CN116821319A (en) * | 2023-08-30 | 2023-09-29 | 环球数科集团有限公司 | Quick screening type processing system based on AIGC |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120166366A1 (en) * | 2010-12-22 | 2012-06-28 | Microsoft Corporation | Hierarchical classification system |
CN106407406A (en) * | 2016-09-22 | 2017-02-15 | 国信优易数据有限公司 | A text processing method and system |
US20170075974A1 (en) * | 2015-09-11 | 2017-03-16 | Adobe Systems Incorporated | Categorization of forms to aid in form search |
CN107577738A (en) * | 2017-08-28 | 2018-01-12 | 电子科技大学 | A kind of FMECA method by SVM text mining processing datas |
US20200026798A1 (en) * | 2018-07-20 | 2020-01-23 | EMC IP Holding Company LLC | Identification and curation of application programming interface data from different sources |
CN111125359A (en) * | 2019-12-17 | 2020-05-08 | 东软集团股份有限公司 | Text information classification method, device and equipment |
CN114265937A (en) * | 2021-12-24 | 2022-04-01 | 中国电力科学研究院有限公司 | Intelligent classification analysis method and system of scientific and technological information, storage medium and server |
CN114610884A (en) * | 2022-03-07 | 2022-06-10 | 中国人民解放军63893部队 | Classification method based on PCA combined feature extraction and approximate support vector machine |
-
2023
- 2023-02-27 CN CN202310165385.6A patent/CN115858474B/en active Active
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120166366A1 (en) * | 2010-12-22 | 2012-06-28 | Microsoft Corporation | Hierarchical classification system |
US20170075974A1 (en) * | 2015-09-11 | 2017-03-16 | Adobe Systems Incorporated | Categorization of forms to aid in form search |
CN106407406A (en) * | 2016-09-22 | 2017-02-15 | 国信优易数据有限公司 | A text processing method and system |
CN107577738A (en) * | 2017-08-28 | 2018-01-12 | 电子科技大学 | A kind of FMECA method by SVM text mining processing datas |
US20200026798A1 (en) * | 2018-07-20 | 2020-01-23 | EMC IP Holding Company LLC | Identification and curation of application programming interface data from different sources |
CN111125359A (en) * | 2019-12-17 | 2020-05-08 | 东软集团股份有限公司 | Text information classification method, device and equipment |
CN114265937A (en) * | 2021-12-24 | 2022-04-01 | 中国电力科学研究院有限公司 | Intelligent classification analysis method and system of scientific and technological information, storage medium and server |
CN114610884A (en) * | 2022-03-07 | 2022-06-10 | 中国人民解放军63893部队 | Classification method based on PCA combined feature extraction and approximate support vector machine |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116306317A (en) * | 2023-05-12 | 2023-06-23 | 环球数科集团有限公司 | Automatic AIGC modeling system based on artificial intelligence |
CN116821319A (en) * | 2023-08-30 | 2023-09-29 | 环球数科集团有限公司 | Quick screening type processing system based on AIGC |
CN116821319B (en) * | 2023-08-30 | 2023-10-27 | 环球数科集团有限公司 | Quick screening type processing system based on AIGC |
Also Published As
Publication number | Publication date |
---|---|
CN115858474B (en) | 2023-05-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN115858474B (en) | File arrangement system based on AIGC | |
CN110825877A (en) | Semantic similarity analysis method based on text clustering | |
CN106407406B (en) | text processing method and system | |
CN108573045A (en) | A kind of alignment matrix similarity retrieval method based on multistage fingerprint | |
US9501557B2 (en) | Information generating computer product, apparatus, and method; and information search computer product, apparatus, and method | |
CN103995876A (en) | Text classification method based on chi square statistics and SMO algorithm | |
CN107168956B (en) | Chinese chapter structure analysis method and system based on pipeline | |
CN109993216B (en) | Text classification method and device based on K nearest neighbor KNN | |
WO2020063071A1 (en) | Sentence vector calculation method based on chi-square test, and text classification method and system | |
Farhoodi et al. | Applying machine learning algorithms for automatic Persian text classification | |
CN115098690B (en) | Multi-data document classification method and system based on cluster analysis | |
WO2018213783A1 (en) | Computerized methods of data compression and analysis | |
CN112632969B (en) | Incremental industry dictionary updating method and system | |
CN114266256A (en) | Method and system for extracting new words in field | |
Elhassan et al. | Arabic text classification on full word | |
CN111930936A (en) | Method and system for excavating platform message text | |
CN113051462A (en) | Multi-classification model training method, system and device | |
CN117291192B (en) | Government affair text semantic understanding analysis method and system | |
Hamdi et al. | Machine learning vs deterministic rule-based system for document stream segmentation | |
Elhassan et al. | The impact of feature selection methods for classifying Arabic texts | |
Sudha | Semi supervised multi text classifications for telugu documents | |
CN116186067A (en) | Industrial data table storage query method and equipment | |
CN109815475B (en) | Text matching method and device, computing equipment and system | |
CN111341404B (en) | Electronic medical record data set analysis method and system based on ernie model | |
Packiam et al. | A Novel Integrated Framework Based on Modular Optimization for Efficient Analytics on Twitter Big Data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |