CN115048925B - Data processing system for determining abnormal text - Google Patents

Data processing system for determining abnormal text Download PDF

Info

Publication number
CN115048925B
CN115048925B CN202210976335.1A CN202210976335A CN115048925B CN 115048925 B CN115048925 B CN 115048925B CN 202210976335 A CN202210976335 A CN 202210976335A CN 115048925 B CN115048925 B CN 115048925B
Authority
CN
China
Prior art keywords
initial
text
target
entity relationship
determining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202210976335.1A
Other languages
Chinese (zh)
Other versions
CN115048925A (en
Inventor
张正义
林方
傅晓航
常鸿宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhongke Yuchen Technology Co Ltd
Original Assignee
Zhongke Yuchen Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhongke Yuchen Technology Co Ltd filed Critical Zhongke Yuchen Technology Co Ltd
Priority to CN202210976335.1A priority Critical patent/CN115048925B/en
Publication of CN115048925A publication Critical patent/CN115048925A/en
Application granted granted Critical
Publication of CN115048925B publication Critical patent/CN115048925B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/205Parsing
    • G06F40/216Parsing using statistical methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/288Entity relationship models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • G06F16/355Class or cluster creation or modification

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Probability & Statistics with Applications (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • General Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

The invention provides a data processing system for determining abnormal texts, which comprises: a database, a processor and a memory storing a computer program which, when executed by the processor, performs the steps of: acquiring an initial statement list and initial characters according to the initial text; acquiring an entity relationship probability list and a target entity relationship probability list; acquiring the maximum probability value of the target entity relationship; acquiring the priority corresponding to the initial text; when the priority is greater than or equal to a preset priority threshold, determining the initial text as a target text; otherwise, determining the initial text as an abnormal text. On one hand, in the process of processing the text, the entity is extracted by only using one preset model, so that the used text data is less, the workload of marking personnel is reduced, and on the other hand, in the process of extracting the entity relationship, the entity relationship is extracted by using a plurality of methods, so that the accuracy of the model for predicting the entity relationship is improved.

Description

Data processing system for determining abnormal text
Technical Field
The invention relates to the technical field of text processing, in particular to a data processing system for determining abnormal texts.
Background
Most of existing abnormal text determination methods determine whether an entity in a text meets a preset condition, input the text to be recognized into an entity recognition model, extract various entities from the text, process the extracted entities, and determine that the text is an abnormal text when the entity does not meet the preset condition.
The existing entity relationship extraction method comprises the following steps: determining a sentence distributed vector of each sentence in the target sentence sub-packet and a relation distributed vector of the query relation; each sentence in the target sentence sub-packet comprises a first entity and a second entity; determining a strategy function according to the sentence distributed vectors and the relation distributed vectors, and dividing all sentences in the target sentence sub-packet into positive examples and unmarked examples by utilizing the strategy function; and training a relation extraction model by using the normal example and the unmarked example to obtain the entity relation between the first entity and the second entity.
However, the above method also has the following technical problems:
firstly, in the process of processing the text, a plurality of preset models are required to be used for extracting the entity, a large amount of text data and storage space are required to be used, and the workload of a marking person is overlarge.
Secondly, in the extraction process of the entity relationship, the processing process is single, the entity relationship can be extracted only in one mode, and the prediction accuracy of the model on the entity relationship is low.
Disclosure of Invention
Aiming at the technical problems, the technical scheme adopted by the invention is as follows:
a data processing system for determining anomalous text, said system comprising: a database, a processor, and a memory storing a computer program, wherein the database comprises: initial text list H = { H = 1 ,……,H i ,……,H m },H i For the ith initial text, i =1 \ 8230; \8230m, m is the number of initial texts, which when executed by a processor, performs the following steps:
s100 according to H i Obtaining H i Corresponding initial sentence list D i ={D i1 ,……,D ij ,……,D ini },D ij =(D 1 ij ,……,D r ij ,……,D sj ij ),D r ij Is H i J =1 \ 8230; ni, ni is H for the r initial character of the jth initial sentence i R =1 \8230 \ 8230;, sj, sj is the number of initial characters in the jth initial sentence.
S200, according to D r ij Obtaining D r ij Corresponding initial entity relationship list G r ij ={G r1 ij ,……,G rx ij ,……,G rq ij },G rx ij Is D r ij The probability value of the corresponding x-th class initial entity relationship, x =1 \82308230q, q is the number of types of initial entity relationships.
S300, traverse G r ij And when G is rx ij When the type of the corresponding initial entity relationship is the type of the non-target relationship, the secondary G r ij Deletion in G rx ij Construction of D r ij Corresponding target entity relationship list U r ij ={U r1 ij ,……,U ry ij ,……,U rp ij },U ry ij Is D r ij The probability value of the corresponding y-th category target entity relationship, y =1 \ 8230 \ 8230, p, p is the type number of the target entity relationship.
S400, traversing U r ij And when U is turned ry ij ≥U 0 While from U r ij In obtaining the maximum probability value, U, of the target entity relationship 0 Is a preset confidence threshold.
S500, determining H according to the maximum probability value of the target entity relationship i Is an exception text.
The invention has at least the following beneficial effects:
the invention provides a data processing system for determining abnormal texts, which comprises: a database, a processor, and a memory storing a computer program, wherein the database comprises: an initial text set, which when executed by a processor, performs the steps of: acquiring an initial sentence list and initial characters corresponding to the initial text according to the initial text; acquiring a corresponding entity relationship probability list according to the initial character, and performing traversal processing on the entity relationship probability list to acquire a target entity relationship probability list corresponding to the initial character; traversing the target entity relationship probability list to obtain the maximum probability value of the target entity relationship; acquiring the priority corresponding to the initial text according to the maximum probability value of the target entity relationship; when the priority is greater than or equal to a preset priority threshold, determining the initial text as a target text; when the priority is smaller than a preset priority threshold, determining the initial text as an abnormal text; therefore, on one hand, in the process of processing the text, the entity can be extracted by using only one preset model, so that the used text data is less, and the workload of marking personnel is reduced; on the other hand, in the extraction process of the entity relationship, the entity relationship can be extracted by using various methods, so that the prediction accuracy of the model on the entity relationship is improved.
Drawings
In order to more clearly illustrate the technical solutions in the embodiments of the present invention, the drawings needed to be used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings without creative efforts.
Fig. 1 is a flowchart of a computer program executed by a data processing system for determining an abnormal text according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or server that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
The embodiment provides a data processing system for determining an abnormal text, which includes: a database, a processor, and a memory storing a computer program, wherein the database comprises: initial text list H = { H = 1 ,……,H i ,……,H m },H i For the ith initial text, i =1 \ 8230 \8230, m, m is the number of initial texts, which when executed by a processor, implements the following steps, as shown in fig. 1:
s100 according to H i Obtaining H i Corresponding initial sentence list D i ={D i1 ,……,D ij ,……,D ini },D ij =(D 1 ij ,……,D r ij ,……,D sj ij ),D r ij Is H i J =1 \ 8230; ni, ni is H for the r initial character of the jth initial sentence i The number of the initial sentences in (1) \8230 \ 8230 \ 8230, sj, sj is the number of the initial characters in the j initial sentence.
Specifically, the initial sentence is a sentence obtained by performing sentence splitting processing on an initial text, where a person skilled in the art knows that any sentence splitting processing method in the prior art belongs to the protection scope of this embodiment, and is not described herein again.
Further, the initial character refers to any character in the initial sentence.
S200, according to D r ij Obtaining D r ij Corresponding initial entity relationship list G r ij ={G r1 ij ,……,G rx ij ,……,G rq ij },G rx ij Is D r ij The probability value of the corresponding x-th type initial entity relation, x =1 \8230, wherein \8230, q and q are the type number of the initial entity relation.
Specifically, the initial entity relationship is an association relationship between entities in a triple corresponding to the initial sentence, wherein a probability value of the entity relationship and the triple are obtained through a preset model.
Further, the preset model is an active learning model, and those skilled in the art know that any active learning model in the prior art belongs to the protection scope of the present embodiment, and will not be described herein again.
In particular, Σ q x=1 (G rx ij )=1。
S300, go through G r ij And when G is rx ij When the type of the corresponding initial entity relationship is the type of the non-target relationship, the secondary G r ij Deletion in G rx ij Construction of D r ij Corresponding target entity relation list U r ij ={U r1 ij ,……,U ry ij ,……,U rp ij },U ry ij Is D r ij And the probability value of the corresponding y-th target entity relationship, y =1 \8230, wherein \8230p, p is the type number of the target entity relationship.
Specifically, the non-target relationship is a relationship without an association state between entities, for example, the non-target relationship.
S400, traversing U r ij And when U is turned ry ij ≥U 0 Then, from U r ij In order to obtain the maximum probability value, U, of the target entity relationship 0 Is a preset probability value threshold.
Specifically, U 0 The value range of (A) is 0.5 to 0.6.
Preferably, U 0 The value is 0.5, and the situation that the threshold value is set too high, so that some relation probabilities are deleted, data are omitted, and further, the abnormal text is judged inaccurately can be avoided.
S500, determining H according to the maximum probability value of the target entity relationship i Is an abnormal text.
Specifically, step S500 includes the steps of:
s501, based on U r ij The maximum probability value of the medium target entity relationship is constructed into H i Corresponding first intermediate data list V i ={V i1 ,……,V ij ,……,V ini },V ij =(V 1 ij ,……,V t ij ,……,V kj ij ),V t ij Is H i The probability value of the t target character in the j initial sentence, t =1 \ 8230 \ 8230: \ 8230j, kj is the number of the target characters in the j initial sentence.
S503 according to V t ij Obtaining F 0 i ,F 0 i The following conditions are met:
Figure 367947DEST_PATH_IMAGE002
s505, when F 0 i ≥F 0 Then, H is determined i Is a target text, wherein F 0 Is a preset priority threshold.
S507, when F 0 i <F 0 When H is determined i Is an abnormal text.
In the process of processing the text, the entity and the data in the text are extracted by using only one preset model instead of extracting the entity and the data in the text by using multiple preset models, so that the used text data are less, and the workload of marking personnel is reduced.
In a specific embodiment, step S500 further includes the following steps:
s501, obtaining H according to the maximum probability value of the target entity relationship i Corresponding priority list F i ={F i1 ,……,F iy ,……,F ip },F iy Is H i The corresponding priority of the y-th category target entity relationship.
Wherein, F is obtained in the step S501 through the following steps iy
S5011 based on U r ij Middle goal practiceThe maximum probability value of the body relationship is constructed as H i Corresponding second intermediate data list C i ={C i1 ,……,C iy ,……,C ip },C iy ={C 1 iy ,……,C g iy ,……,C zy iy },C g iy =(C g1 iy ,……,C ge iy ,……,C gwg iy ),C ge iy Is at H i In the corresponding y category target entity relation, the probability value of the e target character in the g initial sentence is g =1 \ 8230, wherein \ 8230zy is the number of the g initial sentences in the y category target entity relation, and e =1 \ 8230, wherein \ 8230wg and wg are the number of the target characters in the g initial sentence.
S5013 according to C ge iy ,F iy The following conditions are met:
Figure 369270DEST_PATH_IMAGE004
s503, when F ip ≥F 0 When H is determined i Is a target text, wherein F 0 Is a preset priority threshold.
S505, when F ip <F 0 When H is determined i Is an abnormal text.
Compared with the embodiment, the classification method is adopted when the abnormal text is determined, the initial text can be determined to be the abnormal text only when any kind of relation does not exist between the entities, and the prediction accuracy of the model on the entity relation is improved.
In another embodiment, step S500 further includes the steps of:
s501, based on U r ij The maximum probability value of the medium target entity relationship is constructed into H i Corresponding first intermediate data list V i ={V i1 ,……,V ij ,……,V ini },V ij =(V 1 ij ,……,V t ij ,……,V kj ij ),V t ij Is H i The probability value of the t target character in the j initial sentence, t =1 \ 8230 \ 8230: \ 8230j, kj is the number of the target characters in the j initial sentence.
S503 according to V t ij Obtaining F 0 i ,F 0 i The following conditions are met:
Figure 129415DEST_PATH_IMAGE006
s505, when F 0 i ≥F 0 Then, H is determined i Is a target text, wherein F 0 Is a preset priority threshold.
S507, when F 0 i <F 0 Then, H is determined i Is a text to be processed and performs the step S509.
S509, according to H i Obtaining the maximum probability value of the target entity relation to obtain H i Corresponding priority list F i ={F i1 ,……,F iy ,……,F ip },F iy Is H i The corresponding priority of the y-th category target entity relationship.
Further, F is also acquired in step S509 by the following steps iy
S5091 based on U r ij The maximum probability value of the medium target entity relationship is constructed into H i Corresponding second intermediate data list C i ={C i1 ,……,C iy ,……,C ip },C iy ={C 1 iy ,……,C g iy ,……,C zy iy },C g iy =(C g1 iy ,……,C ge iy ,……,C gwg iy ),C ge iy Is at H i In the corresponding y category target entity relation, the probability value of the e target character in the g initial sentence is g = 1\8230, wherein \ 8230is the number of the g initial sentences in the y category target entity relation, and e =1 \82308230wg, wg is the number of target characters in the g-th initial sentence.
S5093 according to C ge iy ,F iy The following conditions are met:
Figure 878672DEST_PATH_IMAGE008
s511, when F ip ≥F 0 Then, H is determined i Is a target text, wherein F 0 Is a preset priority threshold.
S513, when F ip <F 0 Then, H is determined i Is an exception text.
Compared with the first embodiment, the method and the device for predicting the entity relationship have the advantages that the probability of judging the target text into the abnormal text can be reduced, the accuracy of the model for predicting the entity relationship is further improved, compared with the second embodiment, the efficiency is improved, and the workload of annotating personnel is reduced.
Specifically, the target character is when U ry ij ≥U 0 While, U ry ij The corresponding initial character.
Specifically, F 0 The value range of (A) is 0.8-1.
Preferably, F 0 The value is 0.8, and the situation that the judgment on the abnormal text is not accurate enough due to too low threshold setting can be avoided.
More preferably, F 0 The value is 0.9, so that the abnormal text can be judged more accurately.
Most preferably, F 0 The value is 1, and the initial text can be completely determined to be the abnormal text.
Specifically, the target text is text in which the priority of the entity relationship in the annotation text is not less than the priority threshold.
Specifically, the abnormal text is a text in which the priority of the entity relationship in the annotation text is smaller than a priority threshold, where the abnormal text may be understood as an initial text of an entity relationship type corresponding to the initial text in a preset entity relationship type, or the abnormal text may also be understood as an initial text in which an error is annotated to the entity relationship.
The embodiment provides a data processing system for determining an abnormal text, which comprises: a database, a processor, and a memory storing a computer program, wherein the database comprises: an initial text set, which when executed by a processor, performs the steps of: acquiring an initial sentence list and initial characters corresponding to the initial text according to the initial text; acquiring a corresponding entity relationship probability list according to the initial character, and performing traversal processing on the entity relationship probability list to acquire a target entity relationship probability list corresponding to the initial character; traversing the target entity relationship probability list to obtain the maximum probability value of the target entity relationship; acquiring the priority corresponding to the initial text according to the maximum probability value of the target entity relationship; when the priority is greater than or equal to a preset priority threshold, determining the initial text as a target text; when the priority is smaller than a preset priority threshold, determining the initial text as an abnormal text; therefore, on one hand, in the process of processing the text, the entity can be extracted by using only one preset model, so that the used text data is less, and the workload of marking personnel is reduced; on the other hand, in the extraction process of the entity relationship, the entity relationship can be extracted by using a plurality of methods, so that the prediction accuracy of the model on the entity relationship is improved.
Although some specific embodiments of the present invention have been described in detail by way of illustration, it should be understood by those skilled in the art that the above illustration is only for the purpose of illustration and is not intended to limit the scope of the invention. It will also be appreciated by those skilled in the art that various modifications may be made to the embodiments without departing from the scope and spirit of the invention. The scope of the invention is defined by the appended claims.

Claims (10)

1. A data processing system for determining anomalous text, said system comprising: a database, a processor, and a memory storing a computer program, wherein the database comprises: initial text list H = { H = { (H) 1 ,……,H i ,……,H m },H i For the ith initial text, i =1 \ 8230; \8230m, m being the number of initial texts, the computer program, when being executed by a processor, realizes the following steps:
s100 according to H i Obtaining H i Corresponding initial sentence list D i ={D i1 ,……,D ij ,……,D ini },D ij =(D 1 ij ,……,D r ij ,……,D sj ij ),D r ij Is H i J =1 \8230, the r initial character of the j initial sentence, l 8230, ni, ni is H i The number of the initial sentences in (1) r = 8230, the number of initial characters in (8230); sj, sj is the number of initial characters in the jth initial sentence;
s200, according to D r ij Obtaining D r ij Corresponding initial entity relationship list G r ij ={G r1 ij ,……,G rx ij ,……,G rq ij },G rx ij Is D r ij The probability value of the corresponding x-th class initial entity relationship, x =1 \8230 \ 8230, q, q is the type number of the initial entity relationship;
s300, go through G r ij And when G is rx ij When the type of the corresponding initial entity relationship is the type of the non-target relationship, the secondary G r ij Deletion in G rx ij Construction of D r ij Corresponding target entity relation list U r ij ={U r1 ij ,……,U ry ij ,……,U rp ij },U ry ij Is D r ij The probability value of the corresponding y-th category target entity relationship, y =1 \8230, wherein \8230p, p and p are the type number of the target entity relationship;
s400, traversing U r ij And when U is turned ry ij ≥U 0 Then, from U r ij In order to obtain the maximum probability value, U, of the target entity relationship 0 Is a preset probability value threshold;
s500, determining H according to the maximum probability value of the target entity relationship i Is an abnormal text.
2. The data processing system for determining abnormal texts according to claim 1, wherein in step S200, the initial entity relationship is an association relationship between entities in a triple corresponding to an initial sentence.
3. The data processing system for determining an abnormal text according to claim 1, wherein in step S200, Σ q x=1 (G rx ij )=1。
4. The data processing system for determining abnormal text according to claim 1, wherein in the step S300, the non-target relationship is an unassociated relationship between entities.
5. The data processing system for determining an abnormal text according to claim 1, wherein in the step S400, U is 0 The value range of (2) is 0.5 to 0.6.
6. The data processing system for determining an abnormal text according to claim 1, wherein the step S500 further comprises the steps of:
s501, based on U r ij The maximum probability value of the medium target entity relationship is constructed into H i Corresponding first intermediate data list V i ={V i1 ,……,V ij ,……,V ini },V ij =(V 1 ij ,……,V t ij ,……,V kj ij ),V t ij Is H i The probability value of the t target character in the jth initial statement, t =1 \ 8230 \8230: \ 8230j, kj is the number of the target characters in the jth initial statement;
s503 according to V t ij Obtaining F 0 i ,F 0 i The following conditions are met:
Figure 559994DEST_PATH_IMAGE002
s505, when F 0 i ≥F 0 Then, H is determined i Is a target text, wherein F 0 Is a preset priority threshold;
s507, when F 0 i <F 0 When H is determined i Is an abnormal text.
7. The data processing system for determining an abnormal text according to claim 1, wherein the step S500 is performed by:
s501, obtaining H according to the maximum probability value of the target entity relationship i Corresponding priority list F i ={F i1 ,……,F iy ,……,F ip },F iy Is H i The corresponding priority of the y-th type target entity relation;
wherein F is further acquired in the step S501 through the following steps iy
S5011, based on U r ij The maximum probability value of the relation of the middle target entity is constructed into H i Corresponding second intermediate data list C i ={C i1 ,……,C iy ,……,C ip },C iy ={C 1 iy ,……,C g iy ,……,C zy iy },C g iy =(C g1 iy ,……,C ge iy ,……,C gwg iy ),C ge iy Is at H i In the corresponding y category target entity relationship, the probability value of the e target character in the g initial sentence is g =1 \8230, wherein \8230, zy is the number of the g initial sentences in the y category target entity relationship, e =1 \8230, \8230, wg and wg are the number of the target characters in the g initial sentences;
s5013 according to C ge iy ,F iy The following conditions are met:
Figure DEST_PATH_IMAGE004
s503, when F ip ≥F 0 Then, H is determined i Is a target text, wherein F 0 Is a preset priority threshold;
s505, when F ip <F 0 When H is determined i Is an abnormal text.
8. Data processing system for determining anomalous text according to claim 6 or 7, characterised in that F 0 The value range of (A) is 0.8-1.
9. The data processing system for determining abnormal texts according to the claim 6 or 7, wherein the target texts are texts with entity relationships in the annotation texts with priorities not less than a preset priority threshold.
10. The data processing system for determining abnormal text according to claim 6 or 7, wherein the abnormal text is a text in which the priority of the entity relationship in the annotation text is less than a preset priority threshold.
CN202210976335.1A 2022-08-15 2022-08-15 Data processing system for determining abnormal text Active CN115048925B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210976335.1A CN115048925B (en) 2022-08-15 2022-08-15 Data processing system for determining abnormal text

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210976335.1A CN115048925B (en) 2022-08-15 2022-08-15 Data processing system for determining abnormal text

Publications (2)

Publication Number Publication Date
CN115048925A CN115048925A (en) 2022-09-13
CN115048925B true CN115048925B (en) 2022-11-04

Family

ID=83167812

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210976335.1A Active CN115048925B (en) 2022-08-15 2022-08-15 Data processing system for determining abnormal text

Country Status (1)

Country Link
CN (1) CN115048925B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115408160B (en) * 2022-09-16 2023-05-05 湖南胜云光电科技有限公司 Data processing system for process monitoring
CN115410104B (en) * 2022-09-16 2023-06-16 湖南胜云光电科技有限公司 Data processing system for acquiring image acquisition points of aircraft
CN116167339B (en) * 2022-12-02 2023-08-15 中科雨辰科技有限公司 Data processing system for acquiring link entity based on entity relationship

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109344396A (en) * 2018-08-31 2019-02-15 阿里巴巴集团控股有限公司 Text recognition method, device and computer equipment
CN109582833B (en) * 2018-11-06 2023-09-22 创新先进技术有限公司 Abnormal text detection method and device
US11068656B2 (en) * 2019-04-10 2021-07-20 International Business Machines Corporation Displaying text classification anomalies predicted by a text classification model
CN111832287B (en) * 2020-07-22 2024-04-19 广东工业大学 Entity relationship joint extraction method and device
CN112632996A (en) * 2020-12-08 2021-04-09 浙江大学 Entity relation triple extraction method based on comparative learning
CN113011189A (en) * 2021-03-26 2021-06-22 深圳壹账通智能科技有限公司 Method, device and equipment for extracting open entity relationship and storage medium

Also Published As

Publication number Publication date
CN115048925A (en) 2022-09-13

Similar Documents

Publication Publication Date Title
CN115048925B (en) Data processing system for determining abnormal text
CN108446769B (en) Knowledge graph relation inference method, knowledge graph relation inference device, computer equipment and storage medium
WO2020259280A1 (en) Log management method and apparatus, network device and readable storage medium
CN109739975B (en) Hot event extraction method and device, readable storage medium and electronic equipment
CN109918498B (en) Problem warehousing method and device
CN110135681A (en) Risk subscribers recognition methods, device, readable storage medium storing program for executing and terminal device
CN109271957B (en) Face gender identification method and device
CN112069801A (en) Sentence backbone extraction method, equipment and readable storage medium based on dependency syntax
WO2022001517A1 (en) Information sending method and apparatus based on rumor prediction model, and computer device
WO2020024444A1 (en) Group performance grade recognition method and apparatus, and storage medium and computer device
AU2020441876A1 (en) Method for determining human phenotype ontology, and related devices
CN105306252A (en) Method for automatically judging server failures
CN113963205A (en) Classification model training method, device, equipment and medium based on feature fusion
CN115062137B (en) Data processing system for determining abnormal text based on active learning
CN111161730B (en) Voice instruction matching method, device, equipment and storage medium
CN110502620B (en) Method, system and computer equipment for generating guide diagnosis similar problem pairs
CN116186658A (en) User identity verification data processing system
CN114021572B (en) Natural language processing method, device, equipment and readable storage medium
CN113095073B (en) Corpus tag generation method and device, computer equipment and storage medium
CN115017269A (en) Data processing system for determining similar texts
CN113806565A (en) Data processing system for text labeling
CN110533158B (en) Model construction method, system and non-volatile computer readable recording medium
CN110543634A (en) corpus data set processing method and device, electronic equipment and storage medium
CN113077015A (en) Sample selection method and device, computer equipment and storage medium
CN112101019A (en) Requirement template conformance checking optimization method based on part-of-speech tagging and chunk analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant