CN112164403A - Natural language processing system based on artificial intelligence - Google Patents

Natural language processing system based on artificial intelligence

Info

Publication number
CN112164403A
Authority
CN
China
Prior art keywords
unit
pinyin
target
natural language
voice
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202011030529.XA
Other languages
Chinese (zh)
Inventor
韩岭
蔡进
胥传龙
李明
吴开凯
杨昆
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Sixiang Software Co ltd
Original Assignee
Jiangsu Sixiang Software Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Sixiang Software Co ltd
Priority to CN202011030529.XA
Publication of CN112164403A
Legal status: Pending

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/126Character encoding
    • G06F40/129Handling non-Latin characters, e.g. kana-to-kanji conversion
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/151Transformation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/232Orthographic correction, e.g. spell checking or vowelisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/253Grammatical analysis; Style critique
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/02Feature extraction for speech recognition; Selection of recognition unit
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1822Parsing for meaning understanding
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • G10L2015/0635Training updating or merging of old and new templates; Mean values; Weighting

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Document Processing Apparatus (AREA)

Abstract

The invention belongs to the technical field of natural language processing and discloses a natural language processing system based on artificial intelligence, which comprises: a voice recognition module for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters; and a voice processing module for detecting and correcting errors in the target text and outputting the selected target correction scheme as the output result. The voice processing module comprises: a word segmentation unit for performing pinyin word segmentation on the target text to obtain a pinyin sequence; a text analysis unit for identifying errors in the pinyin sequence and analyzing their types; a correction unit for giving at least one correction scheme according to the analysis result of the text analysis unit; and an optimization unit for screening the at least one correction scheme and outputting the selected target correction scheme as the output result.

Description

Natural language processing system based on artificial intelligence
Technical Field
The invention belongs to the technical field of natural language processing, and particularly relates to a natural language processing system based on artificial intelligence.
Background
Natural Language Processing (NLP) is a branch of artificial intelligence and linguistics that studies how to process and use natural language; it broadly covers recognition, understanding, and generation. In natural language recognition and understanding, the computer converts the acquired natural language into meaningful symbols, establishes the relations among them, and then processes the symbols according to the intended use. Natural language processing has recently become a popular research direction, with machine translation being its earliest research effort.
At present, most natural language processing methods are rule-based. Such methods analyze only one isolated sentence at a time and have difficulty matching the association between contexts and the conversational context, so they suffer from limited applicability, low accuracy, and a low degree of intelligence.
Disclosure of Invention
In view of the above, to solve the deficiencies in the prior art, the present invention provides a natural language processing system based on artificial intelligence.
To achieve this purpose, the invention provides the following technical solution: an artificial intelligence based natural language processing system comprising:
a voice recognition module for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters;
a voice processing module for detecting and correcting errors in the target text and outputting the selected target correction scheme as the output result;
wherein the voice processing module comprises:
a word segmentation unit for performing pinyin word segmentation on the target text to obtain a pinyin sequence;
a text analysis unit for identifying errors in the pinyin sequence and analyzing their types, wherein the error types comprise grammatical errors, semantic errors and pragmatic errors;
a correction unit for giving at least one correction scheme according to the analysis result of the text analysis unit;
and an optimization unit for screening the at least one correction scheme and outputting the selected target correction scheme as the output result.
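The four units of the voice processing module form a chain in which each unit consumes the output of the previous one. The following is a minimal Python sketch of that chain, not the patented implementation: the names `Scheme` and `process`, and the toy stand-in functions passed to it, are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List

@dataclass
class Scheme:
    text: str           # corrected text proposed by the correction unit
    errors_fixed: int   # how many detected errors this scheme resolves

def process(target_text: str,
            segment: Callable[[str], List[str]],
            analyze: Callable[[List[str]], List[str]],
            correct: Callable[[List[str], List[str]], List[Scheme]],
            select: Callable[[List[Scheme]], Scheme]) -> str:
    """Chain the four units: segmentation -> analysis -> correction -> selection."""
    pinyin_seq = segment(target_text)
    errors = analyze(pinyin_seq)
    schemes = correct(pinyin_seq, errors)
    return select(schemes).text

# Toy stand-ins for the four units (assumptions for illustration only).
result = process(
    "shenme shihou hui xiayu",
    segment=lambda text: text.split(),
    analyze=lambda seq: ["semantic"] if "xiayu" in seq else [],
    correct=lambda seq, errs: [Scheme("when will it rain", len(errs))],
    select=lambda schemes: max(schemes, key=lambda s: s.errors_fixed),
)
print(result)
```

Passing the units in as functions keeps the sketch independent of how any single unit is actually realized.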
Preferably, the voice recognition module includes:
a language acquisition unit for acquiring voice data online and converting the voice data into TXT-format text for storage;
a preprocessing unit for segmenting the TXT-format text into words with a preset word segmentation program and filtering out stop words or phrases from the TXT-format text to obtain a target feature set;
a feature extraction unit for extracting features from the target feature set to obtain a voice feature sequence and converting the voice feature sequence into a pinyin feature sequence;
and a recognition unit for matching and comparing the pinyin feature sequence against an acoustic model to obtain a target recognition result, wherein the target recognition result is a target text consisting of pinyin characters.
Preferably, the word segmentation program adopted in the preprocessing unit is a Chinese character word segmentation program.
Preferably, when the preprocessing unit performs word segmentation processing, adjacent words or phrases are separated by a space.
Preferably, in the feature extraction unit, converting the voice feature sequence into a pinyin feature sequence includes:
converting the voice feature sequence into a pinyin feature sequence according to the ASCII code of the Chinese character; or
converting the voice feature sequence into a pinyin feature sequence according to the Unicode value of the Chinese character.
Preferably, the processing system further comprises a knowledge base comprising a static library and a dynamic library, and the correction unit gives at least one correction scheme by combining the error type with the application texts stored in the static library and the dynamic library.
Preferably, the static library stores standard texts and previously recognized voice texts and can be updated automatically; the dynamic library is constructed automatically from the currently collected voice data by screening from the static library the standard texts or voice texts that share the context of the currently collected voice data.
Preferably, in the recognition unit, the acoustic model is pre-trained on the standard texts in the static library, and the acoustic model is pre-trained again each time the static library is updated.
Preferably, the optimization unit selects the target correction scheme according to the number of errors corrected by each correction scheme, the target correction scheme being the one that corrects the largest number of errors.
Preferably, the knowledge base further includes a stop-word library, and the preprocessing unit filters out the stop words or phrases according to the stop-word library.
Compared with the prior art, the invention has the following beneficial effects:
In the invention, grammatical analysis, semantic analysis and pragmatic analysis are combined to further correct errors in the conventionally recognized natural language, so that the final output is better adapted to the actual conversational context and the speech recognition accuracy of the whole processing system is effectively improved.
In addition, the system is provided with a knowledge base comprising a static library and a dynamic library: the static library can be updated automatically from historical recognition records, and the dynamic library automatically screens text information according to the currently collected voice data, which further improves the adaptability and processing efficiency of the whole system for specific problems.
Drawings
FIG. 1 is a block diagram of a natural language processing system provided by the present invention;
FIG. 2 is a flow chart of the natural language processing system of the present invention when executing processing.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
An embodiment of the present invention discloses a natural language processing system based on artificial intelligence. Referring to FIG. 1, the processing system has the following structure:
The voice recognition module 10 is used for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters.
Specifically, the voice recognition module 10 includes the following:
the language acquisition unit 11 acquires voice data online and converts the voice data into TXT-format text for storage;
the preprocessing unit 12 segments the TXT-format text into words with a preset word segmentation program and filters out stop words or phrases from the TXT-format text to obtain a target feature set; the word segmentation program is a Chinese character word segmentation program, and during word segmentation adjacent words or phrases are separated by a space;
the feature extraction unit 13 extracts features from the target feature set to obtain a voice feature sequence and converts the voice feature sequence into a pinyin feature sequence; specifically, the target feature set is divided into a plurality of discrete segments according to how its voice waveform changes over time, and the segments are labeled in sequence, each segment containing at least one word or phrase, so that a voice feature sequence can be formed from the labels;
and the recognition unit 14 matches and compares the pinyin feature sequence against the acoustic model to obtain a target recognition result, wherein the target recognition result is a target text consisting of pinyin characters.
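The text does not say how the feature extraction unit 13 decides where one segment ends and the next begins. One simple possibility, shown here purely as an assumption, is to cut the waveform wherever the amplitude drops below a threshold and to number the resulting segments in order. The function name `split_by_silence` and the threshold value are hypothetical.

```python
from typing import List, Tuple

def split_by_silence(samples: List[float], threshold: float = 0.1) -> List[Tuple[int, int, int]]:
    """Return (label, start, end) for each run of samples whose magnitude exceeds threshold."""
    segments = []
    start = None
    for i, value in enumerate(samples):
        loud = abs(value) > threshold
        if loud and start is None:
            start = i                                        # a new segment begins
        elif not loud and start is not None:
            segments.append((len(segments) + 1, start, i))   # end index is exclusive
            start = None
    if start is not None:                                    # signal ended inside a segment
        segments.append((len(segments) + 1, start, len(samples)))
    return segments

wave = [0.0, 0.5, 0.6, 0.0, 0.0, 0.4, 0.3, 0.2, 0.0, 0.7]
print(split_by_silence(wave))
```

The sequential labels are what would then be used to assemble the voice feature sequence; a production system would more likely work on framed energy or spectral features than on raw samples.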
The voice processing module 20 is used for detecting and correcting errors in the target text and outputting the selected target correction scheme as the output result.
Specifically, the voice processing module 20 includes the following:
the word segmentation unit 21 performs pinyin word segmentation on the target text to obtain a pinyin sequence;
the text analysis unit 22 identifies errors in the pinyin sequence and analyzes their types, wherein the error types comprise grammatical errors, semantic errors and pragmatic errors;
the correction unit 23 gives at least one correction scheme based on the analysis result of the text analysis unit 22;
and the optimization unit 24 screens the at least one correction scheme and outputs the selected target correction scheme as the output result.
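The pinyin word segmentation performed by the word segmentation unit 21 is not specified further. As one illustrative assumption, an unspaced pinyin string can be split into the fewest words found in a lexicon by dynamic programming. The function `segment_pinyin` and the toy `LEXICON` below are hypothetical.

```python
from typing import List, Optional, Set

def segment_pinyin(text: str, lexicon: Set[str]) -> Optional[List[str]]:
    """Split an unspaced pinyin string into the fewest lexicon words, or None if impossible."""
    best: List[Optional[List[str]]] = [None] * (len(text) + 1)
    best[0] = []                                   # the empty prefix needs no words
    for end in range(1, len(text) + 1):
        for start in range(end):
            prefix = best[start]
            word = text[start:end]
            if prefix is not None and word in lexicon:
                candidate = prefix + [word]
                if best[end] is None or len(candidate) < len(best[end]):
                    best[end] = candidate          # keep the shortest split so far
    return best[len(text)]

# Hypothetical toy lexicon; it contains both short syllables and longer words.
LEXICON = {"shenme", "shihou", "hui", "xiayu", "xia", "yu", "shi", "hou"}
print(segment_pinyin("shenmeshihouhuixiayu", LEXICON))
```

Preferring the fewest words makes the sketch choose "shihou" and "xiayu" over their single-syllable parts, which matches the pinyin sequence used in the worked example of this description.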
The knowledge base 30 includes a static library 31, a dynamic library 32, and a stop-word library 33. The static library 31 stores standard texts and previously recognized voice texts and can be updated automatically; the dynamic library 32 is constructed automatically from the currently collected voice data by screening from the static library 31 the standard texts or voice texts that share the context of the currently collected voice data.
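The relationship between the static and dynamic libraries can be sketched as a store of context-tagged texts from which a per-query view is filtered. This is only an illustration under stated assumptions: the class `KnowledgeBase`, its methods, and the idea of tagging each text with a context string are hypothetical and not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import List, Set, Tuple

@dataclass
class KnowledgeBase:
    # Static library: (context, text) pairs of standard and previously recognized texts.
    static: List[Tuple[str, str]] = field(default_factory=list)
    stop_words: Set[str] = field(default_factory=set)

    def add_recognized(self, context: str, text: str) -> None:
        """Automatic update: store a newly recognized text in the static library."""
        self.static.append((context, text))

    def dynamic_library(self, context: str) -> List[str]:
        """Select the static entries that share the context of the current voice data."""
        return [text for ctx, text in self.static if ctx == context]

kb = KnowledgeBase(static=[
    ("weather", "when will it rain this week"),
    ("travel", "when does the train leave"),
])
kb.add_recognized("weather", "will it snow tomorrow")
print(kb.dynamic_library("weather"))
```

Building the dynamic library as a filtered view means it always reflects the latest state of the static library without a separate synchronization step.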
In addition:
when performing correction, the correction unit 23 gives at least one correction scheme by combining the error type with the application texts stored in the static library 31 and the dynamic library 32.
In the recognition unit 14, the acoustic model is pre-trained on the standard texts in the static library 31, and the acoustic model is pre-trained again each time the static library 31 is updated.
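The rule "one pre-training run per update of the static library" amounts to a callback fired on every update. The sketch below illustrates only that triggering logic; `StaticLibrary`, `fake_pretrain`, and `training_runs` are hypothetical names, and the actual acoustic-model training is replaced by a placeholder.

```python
from typing import Callable, List

class StaticLibrary:
    """Toy static library that notifies a training callback whenever it changes."""

    def __init__(self, retrain: Callable[[List[str]], None]) -> None:
        self._texts: List[str] = []
        self._retrain = retrain

    def update(self, new_texts: List[str]) -> None:
        self._texts.extend(new_texts)
        self._retrain(list(self._texts))   # one pre-training run per update

training_runs: List[int] = []              # records the corpus size seen by each run

def fake_pretrain(corpus: List[str]) -> None:
    # Placeholder for real acoustic-model training, which is out of scope here.
    training_runs.append(len(corpus))

library = StaticLibrary(fake_pretrain)
library.update(["standard text A"])
library.update(["standard text B", "standard text C"])
print(training_runs)
```

Two updates produce two training runs, the second one seeing the full accumulated corpus.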
Specifically, the standard text information stored in the static library 31 may be collected from data sources such as books, news, and web pages (e.g., online encyclopedias such as Wikipedia); for example, text recognition may be performed on the words in a book, particularly a standard reference book (such as a dictionary), and all the resulting correspondences may be stored in the static library 31 as standard texts.
In summary, when performing voice recognition processing, the artificial intelligence based natural language processing system disclosed above carries out the following steps:
S1, collecting voice data online, and converting the voice data into TXT-format text for storage;
for example, the voice data collected by the language acquisition unit 11 is transcribed as "when will fish fall" (a homophone error for "when will it rain"), and the corresponding application context is set to weather query;
S2, segmenting the TXT-format text into words with a preset Chinese character word segmentation program, and filtering out stop words or phrases from the TXT-format text to obtain a target feature set;
specifically, "when will fish fall" is segmented into the word and phrase sequence "what" - "time" - "will" - "fish fall"; according to the stop-word library 33 in the knowledge base 30, this sequence contains no stop words or phrases.
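Step S2 can be sketched with greedy forward maximum matching followed by stop-word filtering and joining with spaces. This is an assumption for illustration: the patent only says that a preset Chinese word segmentation program is used. The Chinese sentence below is reconstructed from the pinyin "shenme shihou hui xiayu" given in this description, and the lexicon and stop-word set are hypothetical toy data.

```python
from typing import List, Set

def segment(text: str, lexicon: Set[str]) -> List[str]:
    """Forward maximum matching; unknown characters become single-character tokens."""
    longest = max(len(word) for word in lexicon)
    tokens, pos = [], 0
    while pos < len(text):
        for size in range(min(longest, len(text) - pos), 0, -1):
            piece = text[pos:pos + size]
            if piece in lexicon or size == 1:
                tokens.append(piece)
                pos += size
                break
    return tokens

def preprocess(text: str, lexicon: Set[str], stop_words: Set[str]) -> str:
    """Segment, drop stop words, and join the remaining tokens with spaces."""
    return " ".join(t for t in segment(text, lexicon) if t not in stop_words)

LEXICON = {"什么", "时候", "会", "下鱼", "下雨"}   # hypothetical toy lexicon
STOP_WORDS = {"的", "了"}                          # hypothetical stop-word library

print(preprocess("什么时候会下鱼", LEXICON, STOP_WORDS))
print(preprocess("下雨了", LEXICON, STOP_WORDS))
```

The first call leaves the example sentence intact apart from the inserted spaces, since it contains no stop words; the second shows a sentence-final particle being filtered out.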
S3, extracting features from the target feature set to obtain a voice feature sequence, and converting the voice feature sequence into a pinyin feature sequence;
specifically, the voice feature sequence is converted into a pinyin feature sequence according to the ASCII code of the Chinese character; because Chinese characters are represented by ASCII codes in the computer system, a sentence can be converted into a pinyin sequence simply by using the correspondence between each pinyin and each ASCII code, which either already exists in the computer system or is established by the user. If the sentence contains a polyphonic character, its reading is determined from the pronunciation of the collected voice data. For example, the character for "will" can be read "hui" or "kuai"; taking the pronunciation of the voice data as the standard yields the corresponding pinyin feature sequence: "shenme" - "shihou" - "hui" - "xiayu".
Alternatively, the voice feature sequence can be converted into a pinyin feature sequence according to the Unicode value of the Chinese character.
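The Unicode variant of this conversion can be pictured as a lookup table keyed by code point, with the heard syllable used to resolve polyphonic characters. The sketch below is an assumption, not the patent's table: `READINGS` and `to_pinyin` are hypothetical, the Chinese characters are reconstructed from the pinyin in the example, and tone marks are omitted.

```python
from typing import Dict, List, Optional

# Hypothetical lookup table keyed by Unicode code point; each entry lists the
# possible readings. A real table would cover thousands of characters.
READINGS: Dict[int, List[str]] = {
    ord("什"): ["shen", "shi"],
    ord("么"): ["me"],
    ord("时"): ["shi"],
    ord("候"): ["hou"],
    ord("会"): ["hui", "kuai"],   # polyphonic character from the example
    ord("下"): ["xia"],
    ord("鱼"): ["yu"],
}

def to_pinyin(text: str, heard: Optional[List[str]] = None) -> List[str]:
    """Map each character to a reading via its Unicode code point.

    `heard` optionally carries one syllable per character taken from the audio;
    when it matches one of the listed readings it decides, otherwise the first
    listed reading is used.
    """
    result = []
    for index, char in enumerate(text):
        options = READINGS[ord(char)]          # raises KeyError for unknown characters
        hint = heard[index] if heard else None
        result.append(hint if hint in options else options[0])
    return result

print(to_pinyin("什么时候会下鱼"))
print(to_pinyin("会", heard=["kuai"]))
```

Without a hint the sketch falls back to the first listed reading, so the order of readings in the table acts as a default preference.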
S4, matching and comparing the pinyin feature sequence against the acoustic model to obtain a target recognition result, wherein the target recognition result is a target text consisting of pinyin characters;
S5, performing pinyin word segmentation on the target text to obtain a pinyin sequence;
S6, identifying errors in the pinyin sequence and analyzing their types;
the sequence "shenme" - "shihou" - "hui" - "xiayu" contains: a semantic error, since "xiayu" has homophones with different meanings, such as raining / fish dropping / summer Yu / prison; and a grammatical error, since the sentence lacks a qualifier;
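The two error types found in step S6 can be illustrated with a pair of simple checks: a pinyin word with several candidate meanings is flagged as a semantic error, and a sequence with no time word is flagged as a grammatical error. This is a deliberately naive sketch; `HOMOPHONES`, `TIME_WORDS`, and `find_errors` are hypothetical, and the patent does not describe its analysis at this level of detail.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical homophone table: one pinyin word, several possible meanings.
HOMOPHONES: Dict[str, List[str]] = {
    "xiayu": ["rain falling", "fish falling"],
}
# Hypothetical time qualifiers in pinyin: today, tomorrow, this week.
TIME_WORDS: Set[str] = {"jintian", "mingtian", "benzhou"}

def find_errors(pinyin_seq: List[str]) -> List[Tuple[str, str]]:
    """Return (error_type, detail) pairs for a segmented pinyin sequence."""
    errors = []
    for word in pinyin_seq:
        if len(HOMOPHONES.get(word, [])) > 1:
            errors.append(("semantic", word))               # ambiguous homophone
    if not TIME_WORDS.intersection(pinyin_seq):
        errors.append(("grammar", "missing time qualifier"))
    return errors

print(find_errors(["shenme", "shihou", "hui", "xiayu"]))
```

Adding a time word such as "benzhou" to the sequence removes the grammatical finding while the homophone ambiguity remains.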
S7, giving at least one correction scheme by combining the error types with the application texts stored in the static library 31 and the dynamic library 32;
for the semantic error described above, scheme 1 is given: "when will it rain"; and scheme 2 is given: "when to return to summer" day;
for the grammatical error, under scheme 1 the application texts stored in the static library 31 and the dynamic library 32 show that the sentence lacks a time qualifier, so the dynamic library 32 established for the weather query context gives the correction scheme "when will it rain this week"; under scheme 2, the application texts stored in the static library 31 and the dynamic library 32 show that the sentence lacks both a time qualifier and a purpose qualifier, and the dynamic library 32 established for the weather query context gives the correction scheme "when to return to summer" day;
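Step S7 can be approximated as follows: each candidate reading of the ambiguous word yields one scheme, and a scheme whose reading fits the vocabulary of the current context also receives the missing time qualifier. The counting rule, the vocabulary, and the names `Scheme`, `propose`, `WEATHER_VOCABULARY`, and `READINGS` are all assumptions made for this sketch; the second candidate sentence is a stand-in, not the patent's scheme 2.

```python
from dataclasses import dataclass
from typing import List, Set, Tuple

@dataclass
class Scheme:
    text: str           # proposed corrected sentence
    errors_fixed: int   # number of detected errors this scheme resolves

def propose(readings: List[Tuple[str, str]], vocabulary: Set[str], period: str) -> List[Scheme]:
    """Build one scheme per candidate reading and count the errors it resolves."""
    schemes = []
    for keyword, sentence in readings:
        fixed = 1                              # choosing a reading resolves the homophone
        if keyword in vocabulary:              # reading fits the current context
            sentence = f"{sentence} {period}"  # supply the missing time qualifier
            fixed += 1
        schemes.append(Scheme(sentence, fixed))
    return schemes

# Hypothetical dynamic-library content for the weather query context.
WEATHER_VOCABULARY = {"rain", "snow", "wind"}
READINGS = [("rain", "when will it rain"), ("fish", "when will fish fall")]

schemes = propose(READINGS, WEATHER_VOCABULARY, period="this week")
for scheme in schemes:
    print(scheme)
```

Only the reading that belongs to the weather vocabulary can be completed with a time qualifier, which is why it ends up with the higher count.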
S8, screening the at least one correction scheme and outputting the selected target correction scheme as the output result;
of the two correction schemes given above, "when will it rain this week" corrects the largest number of errors, so "when will it rain this week" is output as the output result.
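The selection rule of step S8, picking the scheme that corrects the most errors, reduces to a maximum over the error counts. In the sketch below the function name `select_target`, the tie-breaking behavior, and the counts 2 and 1 are assumptions for illustration.

```python
from typing import List, Tuple

def select_target(schemes: List[Tuple[str, int]]) -> str:
    """Pick the scheme that corrects the most errors; the earliest one wins ties."""
    if not schemes:
        raise ValueError("at least one correction scheme is required")
    best_text, _ = max(schemes, key=lambda item: item[1])
    return best_text

# (scheme text, number of errors corrected); the counts are illustrative.
schemes = [("when will it rain this week", 2), ("when will fish fall", 1)]
print(select_target(schemes))
```

Python's `max` returns the first maximal element, so when two schemes correct the same number of errors the one generated first is kept.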
Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims (10)

1. An artificial intelligence based natural language processing system, comprising:
a voice recognition module for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters;
a voice processing module for detecting and correcting errors in the target text and outputting the selected target correction scheme as the output result;
wherein the voice processing module comprises:
a word segmentation unit for performing pinyin word segmentation on the target text to obtain a pinyin sequence;
a text analysis unit for identifying errors in the pinyin sequence and analyzing their types, wherein the error types comprise grammatical errors, semantic errors and pragmatic errors;
a correction unit for giving at least one correction scheme according to the analysis result of the text analysis unit;
and an optimization unit for screening the at least one correction scheme and outputting the selected target correction scheme as the output result.
2. An artificial intelligence based natural language processing system as claimed in claim 1, wherein said voice recognition module comprises:
a language acquisition unit for acquiring voice data online and converting the voice data into TXT-format text for storage;
a preprocessing unit for segmenting the TXT-format text into words with a preset word segmentation program and filtering out stop words or phrases from the TXT-format text to obtain a target feature set;
a feature extraction unit for extracting features from the target feature set to obtain a voice feature sequence and converting the voice feature sequence into a pinyin feature sequence;
and a recognition unit for matching and comparing the pinyin feature sequence against an acoustic model to obtain a target recognition result, wherein the target recognition result is a target text consisting of pinyin characters.
3. An artificial intelligence based natural language processing system according to claim 2, wherein: the word segmentation program adopted in the preprocessing unit is a Chinese character word segmentation program.
4. An artificial intelligence based natural language processing system according to claim 3, wherein: when the preprocessing unit carries out word segmentation processing, adjacent words or phrases are separated through a blank space.
5. The artificial intelligence based natural language processing system of claim 2, wherein in the feature extraction unit, converting the voice feature sequence into a pinyin feature sequence comprises:
converting the voice feature sequence into a pinyin feature sequence according to the ASCII code of the Chinese character; or
converting the voice feature sequence into a pinyin feature sequence according to the Unicode value of the Chinese character.
6. An artificial intelligence based natural language processing system according to any one of claims 2 to 5, wherein: the processing system further comprises a knowledge base comprising a static library and a dynamic library, and the correction unit gives at least one correction scheme by combining the error type with the application texts stored in the static library and the dynamic library.
7. An artificial intelligence based natural language processing system according to claim 6, wherein: the static library stores standard texts and previously recognized voice texts and can be updated automatically; the dynamic library is constructed automatically from the currently collected voice data by screening from the static library the standard texts or voice texts that share the context of the currently collected voice data.
8. An artificial intelligence based natural language processing system according to claim 7, wherein: in the recognition unit, the acoustic model is pre-trained on the standard texts in the static library;
and the acoustic model is pre-trained again each time the static library is updated.
9. The artificial intelligence based natural language processing system of claim 7, wherein the optimization unit selects the target correction scheme according to the number of errors corrected by each correction scheme, the target correction scheme being the one that corrects the largest number of errors.
10. An artificial intelligence based natural language processing system according to claim 6, wherein: the knowledge base further comprises a stop-word library, and the preprocessing unit filters out stop words or phrases according to the stop-word library.

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011030529.XA CN112164403A (en) 2020-09-27 2020-09-27 Natural language processing system based on artificial intelligence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011030529.XA CN112164403A (en) 2020-09-27 2020-09-27 Natural language processing system based on artificial intelligence

Publications (1)

Publication Number Publication Date
CN112164403A true CN112164403A (en) 2021-01-01

Family

ID=73864227

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011030529.XA Pending CN112164403A (en) 2020-09-27 2020-09-27 Natural language processing system based on artificial intelligence

Country Status (1)

Country Link
CN (1) CN112164403A (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102662933A (en) * 2012-03-28 2012-09-12 成都优译信息技术有限公司 Distributive intelligent translation method
CN108229137A (en) * 2017-12-29 2018-06-29 北京长御科技有限公司 A kind of method and device for distributing document permission
WO2018121757A1 (en) * 2016-12-31 2018-07-05 深圳市优必选科技有限公司 Method and system for speech broadcast of text
CN109255113A (en) * 2018-09-04 2019-01-22 郑州信大壹密科技有限公司 Intelligent critique system
CN109635298A (en) * 2018-12-11 2019-04-16 平安科技(深圳)有限公司 Group's state identification method, device, computer equipment and storage medium
CN110110041A (en) * 2019-03-15 2019-08-09 平安科技(深圳)有限公司 Wrong word correcting method, device, computer installation and storage medium
CN110717021A (en) * 2019-09-17 2020-01-21 平安科技(深圳)有限公司 Input text and related device for obtaining artificial intelligence interview
CN110909550A (en) * 2019-11-13 2020-03-24 北京环境特性研究所 Text processing method and device, electronic equipment and readable storage medium

Similar Documents

Publication Publication Date Title
CN107315737B (en) Semantic logic processing method and system
CN109255113B (en) Intelligent proofreading system
CN109635288A (en) A kind of resume abstracting method based on deep neural network
KR20110038474A (en) Apparatus and method for detecting sentence boundaries
CN109949799B (en) Semantic parsing method and system
CN110942767B (en) Recognition labeling and optimization method and device for ASR language model
CN110008473B (en) Medical text named entity identification and labeling method based on iteration method
CN113761903A (en) Text screening method for high-volume high-noise spoken short text
CN111460175A (en) SNOMED-CT-based medical noun dictionary construction and expansion method
CN114927177B (en) Medical entity identification method and system integrating Chinese medical field characteristics
KR20230009564A (en) Learning data correction method and apparatus thereof using ensemble score
CN112818680A (en) Corpus processing method and device, electronic equipment and computer-readable storage medium
CN111444704A (en) Network security keyword extraction method based on deep neural network
CN113255331A (en) Text error correction method, device and storage medium
CN109918677B (en) English word semantic analysis method and system
CN115240655A (en) Chinese voice recognition system and method based on deep learning
CN116842168B (en) Cross-domain problem processing method and device, electronic equipment and storage medium
JP2018066800A (en) Japanese speech recognition model learning device and program
CN106682642A (en) Multi-language-oriented behavior identification method and multi-language-oriented behavior identification system
CN111046663A (en) Intelligent correction method for Chinese form
Mekki et al. COTA 2.0: An automatic corrector of Tunisian Arabic social media texts
CN115577712A (en) Text error correction method and device
CN112164403A (en) Natural language processing system based on artificial intelligence
CN114996387A (en) Index data-based natural language processing method and system
CN111814433B (en) Uygur language entity identification method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20210101