CN112164403A

CN112164403A - Natural language processing system based on artificial intelligence

Info

Publication number: CN112164403A
Application number: CN202011030529.XA
Authority: CN
Inventors: 韩岭; 蔡进; 胥传龙; 李明; 吴开凯; 杨昆
Original assignee: Jiangsu Sixiang Software Co ltd
Current assignee: Jiangsu Sixiang Software Co ltd
Priority date: 2020-09-27
Filing date: 2020-09-27
Publication date: 2021-01-01

Abstract

The invention belongs to the technical field of natural language processing, and discloses a natural language processing system based on artificial intelligence, which comprises: the voice recognition module is used for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters; the voice processing module is used for carrying out error correction and correction processing on the target text and outputting the processed target correction scheme as an output result; wherein the voice processing module comprises: the word segmentation unit is used for carrying out pinyin word segmentation on the target text to obtain a pinyin sequence; the text analysis unit is used for identifying the errors of the pinyin sequence and analyzing the types of the errors; the correction unit gives out at least one correction scheme according to the analysis result of the text analysis unit; and the optimization unit is used for optimizing and screening at least one correction scheme and outputting the target correction scheme after optimization and screening as an output result.

Description

Natural language processing system based on artificial intelligence

Technical Field

The invention belongs to the technical field of natural language processing, and particularly relates to a natural language processing system based on artificial intelligence.

Background

Natural Language Processing (NLP) is a branching subject of the fields of artificial intelligence and linguistics. This field discusses how to handle and use natural language; natural language processing basically includes parts of recognition, understanding, generation, and the like. The natural language cognition and understanding is that the computer changes the acquired natural language into meaningful symbols and establishes corresponding relations, and then the symbols are processed according to the purpose of use. Natural language processing has recently become a popular research direction, with machine translation being the earliest research effort in natural language processing.

At present, most of natural language processing is rule-based methods, and in the method, only one isolated sentence is analyzed, and the association between contexts and the conversation context are difficult to match, so that the problems of large language processing limitation, low accuracy and low intelligence degree exist.

Disclosure of Invention

In view of the above, to solve the deficiencies in the prior art, the present invention provides a natural language processing system based on artificial intelligence.

In order to achieve the purpose, the invention provides the following technical scheme: an artificial intelligence based natural language processing system comprising:

the voice recognition module is used for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters;

the voice processing module is used for carrying out error correction and correction processing on the target text and outputting the processed target correction scheme as an output result;

wherein the voice processing module comprises:

the word segmentation unit is used for carrying out pinyin word segmentation on the target text to obtain a pinyin sequence;

the text analysis unit is used for identifying errors of the pinyin sequence and analyzing the types of the errors, wherein the types of the errors comprise grammar errors, semantic errors and pragmatic errors;

the correction unit gives out at least one correction scheme according to the analysis result of the text analysis unit;

and the optimization unit is used for optimizing and screening at least one correction scheme and outputting the target correction scheme after optimization and screening as an output result.

Preferably, the speech recognition module includes:

the language acquisition unit is used for acquiring voice data on line and converting the voice data into TXT format text for storage;

the preprocessing unit is used for segmenting words of the format text according to a preset word segmentation program, filtering out words or phrases which are stopped in the format text and obtaining a target characteristic set;

the characteristic extraction unit is used for extracting the characteristics of the target characteristic set to obtain a voice characteristic sequence and converting the voice characteristic sequence into a pinyin characteristic sequence;

and the identification unit is used for identifying, matching and comparing the pinyin characteristic sequence with the acoustic model to obtain a target identification result, wherein the target identification result is a target text consisting of pinyin characters.

Preferably, the word segmentation program adopted in the preprocessing unit is a Chinese character word segmentation program.

Preferably, when the preprocessing unit performs word segmentation processing, adjacent words or phrases are separated by a space.

Preferably, in the feature extraction unit, converting the speech feature sequence into a pinyin feature sequence includes:

converting the voice characteristic sequence into a pinyin characteristic sequence according to the ASCII code of the Chinese character; or

And converting the voice characteristic sequence into a pinyin characteristic sequence according to the Unicode value of the Chinese character.

Preferably, the processing system further comprises a knowledge base, and the knowledge base comprises a static base and a dynamic base, and the correction unit gives at least one correction scheme in combination with the error type and the application text stored in the static base and the dynamic base.

Preferably, the static library is used for storing standard texts and historical recognized voice texts and can be automatically updated; the dynamic library is automatically constructed based on the currently collected voice data, and standard texts or voice texts with the same context as the currently collected voice data are screened from the static library.

Preferably, in the recognition unit, the acoustic model is an acoustic model pre-trained by a standard text in a static library; and performing pre-training of the acoustic model correspondingly once every time the static library is updated.

Preferably, the optimization unit performs optimization of the target correction scheme according to the number of errors corrected by each correction scheme, and the number of errors corrected by the target correction scheme is the largest.

Preferably, the knowledge base further includes a disabled word base, and the preprocessing unit filters the disabled words or phrases according to the disabled word base.

Compared with the prior art, the invention has the following beneficial effects:

in the invention, the grammar analysis, the semantic analysis and the pragmatic analysis are combined to further correct the errors of the traditionally recognized natural language, so that the final output result can be more accurately adapted to the actual conversation environment, and the accuracy of the whole processing system for speech recognition is effectively improved.

In addition, a knowledge base comprising a static base and a dynamic base is arranged in the whole system, wherein the static base can be automatically updated based on historical recognition records, and the dynamic base automatically screens text information based on currently collected voice data, so that the adaptability and the processing efficiency of the whole system to specific problems are further improved.

Drawings

FIG. 1 is a block diagram of a natural language processing system provided by the present invention;

FIG. 2 is a flow chart of the natural language processing system of the present invention when executing processing.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

In the embodiment of the present invention, a natural language processing system based on artificial intelligence is disclosed, and specifically, referring to fig. 1, the processing system includes the following structure:

the voice recognition module 10 is used for collecting and recognizing voice data and converting the voice data into a target text consisting of pinyin characters;

specifically, the speech recognition module 10 includes the following:

the language acquisition unit 11 is used for acquiring voice data on line and converting the voice data into TXT format text for storage;

the preprocessing unit 12 is used for segmenting words of the text in the format of (TXT) according to a preset word segmentation program, filtering out deactivated words or phrases in the text in the format of (TXT) and obtaining a target feature set; the word segmentation program adopts a Chinese character word segmentation program, and when the word segmentation processing is carried out, adjacent words or phrases are separated by a blank space;

the feature extraction unit 13 is used for extracting features of the target feature set to obtain a voice feature sequence and converting the voice feature sequence into a pinyin feature sequence; dividing a target feature set into a plurality of discrete segments according to the change of a voice waveform of the target feature set along with time, and labeling the plurality of segments in sequence, wherein each segment correspondingly comprises at least one word or phrase, so that a voice feature sequence can be formed according to the labels;

and the identification unit 14 is used for identifying, matching and comparing the pinyin characteristic sequence with the acoustic model to obtain a target identification result, wherein the target identification result is a target text consisting of pinyin characters.

The voice processing module 20 is configured to perform error correction and correction processing on the target text, and output the processed target correction scheme as an output result;

specifically, the speech processing module 20 includes the following:

a word segmentation unit 21, configured to perform pinyin word segmentation on the target text to obtain a pinyin sequence;

the text analysis unit 22 is used for identifying errors of the pinyin sequence and analyzing the types of the errors, wherein the types of the errors comprise grammar errors, semantic errors and pragmatic errors;

a correction unit 23 giving at least one correction scheme based on the analysis result of the text analysis unit 22;

and the optimization unit 24 is used for performing optimization screening on at least one correction scheme and outputting the target correction scheme after optimization screening as an output result.

The knowledge base 30 includes a static base 31, a dynamic base 32, and a deactivation word base 33. The static library 31 is used for storing standard texts and historical recognized voice texts, and can be automatically updated; the dynamic library 32 is automatically constructed based on the currently collected voice data, and the standard text or voice text having the same context as the currently collected voice data is screened from the static library 31.

In addition:

the correction unit 23 provides at least one correction scheme in association with the type of error and the application text stored in the static library 31 and the dynamic library 32 when correcting the process.

In the recognition unit 14, the acoustic model is an acoustic model pre-trained by standard text in the static library 31; and each time the static library 31 is updated, pre-training of the acoustic model is performed correspondingly.

Specifically, the standard text information stored in the static library 31 may be collected from data sources such as books, news, web pages (e.g., encyclopedia, wikipedia, etc.); for example, word recognition may be performed on words in a book, particularly a standard book (such as a dictionary), and all correspondences may be stored in the static library 31 as standard texts.

In summary, the artificial intelligence based natural language processing system disclosed above specifically includes the following steps when performing the speech recognition processing:

s1, collecting voice data on line, and converting the voice data into a TXT format text for storage;

for example, the speech data collected by the language collecting unit 11 is "when fish will be dropped", and the corresponding application context is set as weather query;

s2, segmenting words of the format text according to a preset Chinese character segmentation program, and filtering out words or phrases which are stopped in the format text to obtain a target characteristic set;

specifically, the words of 'when rains will fall' are divided into words of 'when' - ', when' - 'fish will' and word group sequences; and the sequence does not contain the deactivated word or phrase, depending on the deactivated lexicon 33 in the knowledge base 30.

S3, extracting the characteristics of the target characteristic set to obtain a voice characteristic sequence, and converting the voice characteristic sequence into a pinyin characteristic sequence;

specifically, the voice characteristic sequence is converted into a pinyin characteristic sequence according to the ASCII code of the Chinese character; because the Chinese characters are represented by ASCII codes in the computer system, the sentences can be converted into pinyin sequences only by utilizing the corresponding relation between each pinyin and each ASCII code which is already in the computer system or established by a user. If the sentence contains polyphone, the determination can be made according to the collected pronunciation of the speech data. For example, "meeting" includes "hui" and "kuai", the pronunciation of the speech data is used as the standard to obtain the corresponding speech feature sequence: "shenme" - "shihou" - "hui" - "xiayu".

In addition, the phonetic characteristic sequence can be converted into phonetic characteristic sequence according to the Unicode value of Chinese characters.

S4, identifying, matching and comparing the pinyin characteristic sequence with the acoustic model to obtain a target identification result, wherein the target identification result is a target text consisting of pinyin characters;

s5, performing pinyin word segmentation on the target text to obtain a pinyin sequence;

s6, identifying errors of the pinyin sequence and analyzing the types of the errors;

in "shenme" - "shihou" - "hui" - "xiayu", there are included: the semantic error 'xiayu', such as raining/fish dropping/summer Yu/prison, has different semantics; syntax errors, lack of definition;

s7, at least one correction scheme is given by combining the error types and the application texts stored in the static library 31 and the dynamic library 32;

regarding the semantic errors described above, scheme 1 is given: "when it rains"; scheme 2 is given: "when to return to summer" day;

for syntax errors, in combination with scheme 1 above, the syntax lacks time limitation according to the application texts stored in the static library 31 and the dynamic library 32, and then the dynamic library 32 established according to the weather query context gives a correction scheme that: "when it rains on this week"; with reference to the above scheme 2, according to the application texts stored in the static library 31 and the dynamic library 32, the grammar lacks time limitation and purpose limitation, and further, the dynamic library 32 established according to the weather query context gives a modification scheme that: "when to return to summer" day;

s8, carrying out optimization screening on at least one correction scheme, and outputting by taking the target correction scheme subjected to optimization screening as an output result;

as can be seen from the two modifications given above, the number of errors corrected by "when it rains on the week" is the largest, and therefore "when it rains on the week" is output as the output result.

Although embodiments of the present invention have been shown and described, it will be appreciated by those skilled in the art that changes, modifications, substitutions and alterations can be made in these embodiments without departing from the principles and spirit of the invention, the scope of which is defined in the appended claims and their equivalents.

Claims

1. An artificial intelligence based natural language processing system, comprising:

wherein the voice processing module comprises:

2. An artificial intelligence based natural language processing system as claimed in claim 1, wherein said speech recognition module comprises:

3. An artificial intelligence based natural language processing system according to claim 2, wherein: the word segmentation program adopted in the preprocessing unit is a Chinese character word segmentation program.

4. An artificial intelligence based natural language processing system according to claim 3, wherein: when the preprocessing unit carries out word segmentation processing, adjacent words or phrases are separated through a blank space.

5. The artificial intelligence based natural language processing system of claim 2, wherein in the feature extraction unit, converting the speech feature sequence into a pinyin feature sequence comprises:

6. An artificial intelligence based natural language processing system according to any one of claims 2 to 5 wherein: the processing system further comprises a knowledge base, the knowledge base comprises a static base and a dynamic base, and the correcting unit is used for giving out at least one correcting scheme by combining the error type and the application texts stored in the static base and the dynamic base.

7. An artificial intelligence based natural language processing system according to claim 6, wherein: the static library is used for storing standard texts and historical recognized voice texts and can be automatically updated; the dynamic library is automatically constructed based on the currently collected voice data, and standard texts or voice texts with the same context as the currently collected voice data are screened from the static library.

8. An artificial intelligence based natural language processing system according to claim 7, wherein: in the identification unit, the acoustic model is an acoustic model pre-trained by standard texts in a static library;

and performing pre-training of the acoustic model correspondingly once every time the static library is updated.

9. The artificial intelligence based natural language processing system of claim 7, wherein the preference unit performs the preference of the target correction scheme according to the number of errors corrected by each correction scheme, and the number of errors corrected by the target correction scheme is the largest.

10. An artificial intelligence based natural language processing system according to claim 6, wherein: the knowledge base also comprises a disabled word base, and the preprocessing unit filters out disabled words or phrases according to the disabled word base.