CN112069805A - Text labeling method, device, equipment and storage medium combining RPA and AI

Info

Publication number: CN112069805A
Application number: CN202011062863.3A
Authority: CN
Inventors: 刘崴; 张海雷; 胡一川; 汪冠春
Original assignee: Beijing Benying Network Technology Co Ltd; Beijing Laiye Network Technology Co Ltd
Current assignee: Beijing Benying Network Technology Co Ltd; Beijing Laiye Network Technology Co Ltd
Priority date: 2019-12-20
Filing date: 2020-09-30
Publication date: 2020-12-11

Abstract

The application provides a text labeling method and device combining RPA and AI, relating to the RPA and AI speech synthesis technology, wherein the method comprises the following steps: acquiring a Chinese text to be marked, and generating a pinyin text corresponding to the Chinese text; determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; the Chinese text to be marked, the Chinese text with pause information and the pinyin text after marking processing are provided for a user to be corrected, and the user can obtain the pinyin marking result corresponding to the Chinese text to be marked with high accuracy rate only by correcting the pinyin text after marking processing, so that the user can be assisted in marking the text, and the marking efficiency and the accuracy of the text are improved.

Description

Text labeling method, device, equipment and storage medium combining RPA and AI

Technical Field

The present application relates to the field of computer technologies, in particular to RPA (Robotic Process Automation) and AI (Artificial Intelligence) speech synthesis technology, and more particularly to a text labeling method, apparatus, device, and storage medium combining RPA and AI.

Background

Robotic Process Automation (RPA) uses dedicated robot software to simulate human operation of a computer and automatically executes process tasks according to rules.

Artificial Intelligence (AI) is a technical science that researches and develops theories, methods, techniques, and application systems for simulating, extending, and expanding human intelligence. Research in the field of artificial intelligence includes robotics, language recognition, image recognition, natural language processing, and expert systems, among others.

Speech synthesis is a technique for generating speech from text. Building a speech synthesis system requires a large amount of audio together with the pinyin annotation results corresponding to that audio, and the quality of the pinyin annotation results determines the sound quality of the system. At present, the Chinese text corresponding to the audio is annotated with pinyin mainly by hand, which is time-consuming and error-prone.

Disclosure of Invention

The object of the present application is to solve at least to some extent one of the above mentioned technical problems.

Therefore, a first objective of the present application is to provide a text labeling method combining RPA and AI. In this method, a Chinese text to be labeled is preprocessed to generate a pinyin text to be corrected, and the pinyin meeting the condition to be corrected in the pinyin text is marked and provided to a user for correction. The user only needs to correct the marked pinyin text to obtain a highly accurate pinyin labeling result for the Chinese text to be labeled, which assists the user in labeling the text and improves both the efficiency and the accuracy of text labeling.

A second objective of the present application is to provide a text labeling apparatus combining RPA and AI.

A third objective of the present application is to provide a text labeling device combining RPA and AI.

A fourth objective of the present application is to provide a non-transitory computer-readable storage medium.

To achieve the above object, an embodiment of a first aspect of the present application provides a text annotation method combining an RPA and an AI, including: acquiring a Chinese text to be marked, and generating a pinyin text corresponding to the Chinese text; determining pause information corresponding to the Chinese text by adopting a Natural Language Processing (NLP) technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and providing the Chinese text to be marked, the Chinese text with the pause information and the pinyin text after marking processing for a user for correction.

According to the text marking method combining the RPA and the AI, the Chinese text to be marked is obtained, and the pinyin text corresponding to the Chinese text is generated; determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and providing the Chinese text to be marked, the Chinese text with the pause information and the pinyin text after marking processing for a user for correction. The method preprocesses the Chinese text to be marked to generate the pinyin text to be corrected, marks the pinyin meeting the condition to be corrected in the pinyin text and provides the pinyin text to a user for correction, and the user only needs to correct the pinyin text after marking processing to obtain the pinyin marking result corresponding to the Chinese text to be marked with high accuracy, so that the user can be assisted in marking the text, and the marking efficiency and the accuracy of the text are improved.

In order to achieve the above object, an embodiment of a second aspect of the present application provides a text labeling apparatus combining RPA and AI. The apparatus includes: an acquisition module, configured to acquire a Chinese text to be labeled and generate a pinyin text corresponding to the Chinese text; a determining module, configured to determine pause information corresponding to the Chinese text by adopting an NLP technology and insert the pause information into the Chinese text to obtain the Chinese text with the pause information; a marking module, configured to mark the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and a providing module, configured to provide the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking to a user for correction.

The text marking device combining the RPA and the AI in the embodiment of the application generates a pinyin text corresponding to a Chinese text by acquiring the Chinese text to be marked; determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and providing the Chinese text to be marked, the Chinese text with the pause information and the pinyin text after marking processing for a user for correction. The device can realize preprocessing the Chinese text to be labeled, generate the pinyin text to be corrected, mark the pinyin meeting the conditions to be corrected in the pinyin text, provide the pinyin text to the user for correction, and the user only needs to correct the pinyin text after marking processing, so that the pinyin labeling result corresponding to the Chinese text to be labeled with high accuracy can be obtained, the user can be assisted in labeling the text, and the labeling efficiency and the accuracy of the text are improved.

In order to achieve the above object, a third aspect of the present application provides a text annotation device combining RPA and AI, including: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the text annotation method combining RPA and AI as described above when executing the program.

In order to achieve the above object, a fourth aspect of the present application provides a non-transitory computer-readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the text annotation method combining RPA and AI as described above.

Additional aspects and advantages of the present application will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the present application.

Drawings

The foregoing and/or additional aspects and advantages of the present application will become apparent and readily appreciated from the following description of the embodiments, taken in conjunction with the accompanying drawings of which:

FIG. 1 is a schematic flow chart of a text annotation method combining RPA and AI according to an embodiment of the present application;

FIG. 2 is a flowchart illustrating a text annotation method according to another embodiment of the present application;

FIG. 3 is a flowchart illustrating a text annotation method in combination with RPA and AI according to another embodiment of the present application;

FIG. 4 is a schematic structural diagram of a text annotation device combining an RPA and an AI according to an embodiment of the present application;

FIG. 5 is a schematic structural diagram of a text annotation device combining an RPA and an AI according to another embodiment of the present application;

FIG. 6 is a schematic structural diagram of a text annotation device combining an RPA and an AI according to an embodiment of the present application.

Detailed Description

Reference will now be made in detail to embodiments of the present application, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are exemplary and intended to be used for explaining the present application and should not be construed as limiting the present application.

The following describes a text labeling method and apparatus combining RPA and AI according to an embodiment of the present application with reference to the drawings. The text labeling method combining RPA and AI is executed by a text labeling apparatus combining RPA and AI.

Fig. 1 is a schematic flowchart of a text annotation method combining an RPA and an AI according to an embodiment of the present disclosure. As shown in fig. 1, the text annotation method combining RPA and AI includes the following steps:

Step 101, acquiring a Chinese text to be labeled, and generating a pinyin text corresponding to the Chinese text.

In the embodiment of the present application, the Chinese text to be labeled may be a Chinese text corresponding to related audio, part of the Chinese text in a document, a Chinese text on a network, or the like. Correspondingly, the Chinese text to be labeled may be acquired by, but not limited to, transcribing the related audio into the corresponding Chinese text, extracting the Chinese text from the document, or downloading the Chinese text from the network.

Then, a pinyin text that is composed of pinyin and corresponds to the Chinese text is generated from the acquired Chinese text to be labeled.

As an example, the pinyin corresponding to each character in the Chinese text may be obtained by querying a pinyin library, and the pinyin of the individual characters is combined to obtain the pinyin text corresponding to the Chinese text. For example, if the Chinese text to be labeled is "语音合成" (speech synthesis), the corresponding pinyin text is "yu3 yin1 he2 cheng2", where the digits represent the pinyin tones.
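The library lookup described above can be sketched as follows. This is only an illustration under stated assumptions: `PINYIN_LIBRARY` and `generate_pinyin_text` are hypothetical names, the four-entry table stands in for a full pinyin library, and the example string is assumed to be the Chinese original of "speech synthesis". The sketch takes the first-listed reading of each character, which is exactly why later steps flag polyphonic characters for review.

```python
# Hypothetical miniature pinyin library; a real one would cover every character
# and list several readings for polyphonic characters.
PINYIN_LIBRARY = {
    "语": ["yu3"],
    "音": ["yin1"],
    "合": ["he2"],
    "成": ["cheng2"],
}


def generate_pinyin_text(chinese_text: str) -> str:
    """Look up each character and join the first-listed readings with spaces."""
    syllables = []
    for char in chinese_text:
        readings = PINYIN_LIBRARY.get(char)
        if readings is None:
            raise KeyError(f"no pinyin entry for character {char!r}")
        syllables.append(readings[0])
    return " ".join(syllables)


print(generate_pinyin_text("语音合成"))  # yu3 yin1 he2 cheng2
```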

Step 102, determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information.

In the embodiment of the application, the pause information corresponding to the Chinese text can be determined from the Chinese text to be labeled or from the audio corresponding to the Chinese text to be labeled. For example, semantic recognition can be performed on the Chinese text to be labeled by the NLP technology, word segmentation information is determined according to the semantics of the text, and the word segmentation information is used as the pause information of the Chinese text to be labeled. As another example, the audio corresponding to the Chinese text to be labeled may be analyzed by the NLP technology to determine the pause information in the audio, and that pause information is used as the pause information of the Chinese text corresponding to the audio. It should be noted that the pause information may be, but is not limited to, word segmentation information; in general, the pause information is word segmentation information based on "prosodic words" or "prosodic phrases".

As an example, the Chinese text to be labeled is input into a pre-trained text pause recognition model, and the text pause recognition model outputs the pause information corresponding to the Chinese text.

As another example, the audio corresponding to the Chinese text to be labeled is acquired, the audio is recognized, and the pause information in the audio is determined. For example, the audio corresponding to the Chinese text may be input into a pre-trained audio pause recognition model, the audio pause recognition model outputs the pause information corresponding to the audio, and this pause information is determined as the pause information corresponding to the Chinese text. It should be noted that the audio pause recognition model may analyze the audio based on the NLP technology to determine the pause information corresponding to the audio.

Then, the determined pause information is inserted into the Chinese text to obtain the Chinese text with the pause information. For example, if the Chinese text to be labeled is "展览馆" (exhibition hall), then after the corresponding pause information, for example word segmentation information, is inserted, the Chinese text with pause information is: 展览|馆.
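The insertion step can be sketched as below, assuming the pause information arrives as a list of character offsets (an assumption; the source does not fix a representation). `insert_pauses` is a hypothetical helper, and the example string is assumed to be the Chinese original of "exhibition hall".

```python
def insert_pauses(chinese_text: str, boundaries: list[int], mark: str = "|") -> str:
    """Insert `mark` before each character offset listed in `boundaries`."""
    pieces = []
    previous = 0
    for offset in sorted(set(boundaries)):
        if not 0 < offset < len(chinese_text):
            raise ValueError(f"boundary {offset} is outside the text")
        pieces.append(chinese_text[previous:offset])
        previous = offset
    pieces.append(chinese_text[previous:])
    return mark.join(pieces)


# A pause after the second character, as a word segmenter might report it.
print(insert_pauses("展览馆", [2]))  # 展览|馆
```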

Step 103, marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking.

In the embodiment of the present application, the pinyin meeting the condition to be corrected may include, but is not limited to, any one or more of the following: the pinyin corresponding to a polyphonic character, the pinyin corresponding to a tone-changed character, and the pinyin corresponding to a word composed of consecutive third-tone Chinese characters. Preferably, the words are obtained by segmenting the Chinese text according to the pause information.

The pinyin corresponding to polyphonic characters, the pinyin corresponding to tone-changed characters, and the pinyin corresponding to words composed of consecutive third-tone Chinese characters may be determined based on the pinyin text generated in step 101 and a preset determination mechanism. As one example, the pinyin text generated in step 101 is matched against a preset polyphonic character recognition library and a preset tone-changed character recognition library to determine the pinyin corresponding to polyphonic characters and tone-changed characters. As another example, consecutive third-tone pinyin in the pinyin text generated in step 101 may be detected to determine the pinyin corresponding to words composed of consecutive third-tone Chinese characters.

For example, in "一种语言" (a language), "种" is a polyphonic character (zhong3, zhong4), so the pinyin corresponding to "种" is marked, and the pinyin text after marking is: yi1 {zhong3} yu3 yan2. As another example, in the same phrase "一" is not a polyphonic character but must change tone in this context and is therefore a tone-changed character, read here as yi4; after the pinyin corresponding to "一" is changed and marked, the pinyin text after marking is: {yi4} {zhong3} yu3 yan2.
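A minimal sketch of the library-matching step follows. `POLYPHONES`, `TONE_CHANGED`, and `mark_for_review` are hypothetical names, the one-entry libraries are placeholders, and the example characters are assumed to be the Chinese originals behind the yi/zhong pinyin above. The sketch only marks syllables with braces; it does not decide which reading or tone is correct, since that is left to the user.

```python
# Hypothetical recognition libraries; real ones would be far larger.
POLYPHONES = {"种": ("zhong3", "zhong4")}
TONE_CHANGED = {"一"}


def mark_for_review(chars: str, syllables: list[str]) -> str:
    """Wrap in braces the pinyin of polyphonic or tone-changed characters."""
    if len(chars) != len(syllables):
        raise ValueError("expected exactly one syllable per character")
    marked = []
    for char, syllable in zip(chars, syllables):
        if char in POLYPHONES or char in TONE_CHANGED:
            marked.append("{" + syllable + "}")
        else:
            marked.append(syllable)
    return " ".join(marked)


print(mark_for_review("一种", ["yi4", "zhong3"]))  # {yi4} {zhong3}
```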

In addition, in the embodiment of the present application, as shown in fig. 2, when the condition to be corrected is the presence of pinyin corresponding to a word composed of consecutive third-tone Chinese characters, tone-change processing is performed on the pinyin corresponding to the word to obtain a pinyin text after tone-change processing, and the tone-changed pinyin is marked to obtain the pinyin text after marking. The specific implementation process is as follows:

Step 201, obtaining the pinyin corresponding to words composed of consecutive third-tone Chinese characters in the pinyin text.

In the embodiment of the application, the pinyin text corresponding to the Chinese text to be labeled can be scanned to extract the pinyin corresponding to the words composed of consecutive third-tone Chinese characters.

Step 202, performing tone-change processing on the pinyin corresponding to the words composed of consecutive third-tone Chinese characters to obtain a pinyin text after the tone-change processing.

Optionally, the character sequence corresponding to the consecutive third tones in the word is obtained; the pinyin of every character before the last character in the character sequence is determined as the pinyin to be tone-changed; and the pinyin to be tone-changed is changed to the second tone.

For example, a word composed of consecutive third-tone Chinese characters is a word containing two or more adjacent third-tone Chinese characters. When two consecutive third-tone characters are read, the first third tone becomes a second tone. For example, for "展(zhan3)览(lan3)" (exhibition), the pinyin text after the tone-change processing is "zhan2 lan3". As another example, for three or more consecutive third-tone Chinese characters, every third tone before the last one becomes a second tone. For example, for "展(zhan3)览(lan3)馆(guan3)" (exhibition hall), the pinyin text after the tone-change processing is "zhan2 lan2 guan3".

Step 203, marking the pinyin subjected to the tone-change processing in the pinyin text to obtain the pinyin text after marking.

For example, in "zhan2 lan2 guan3", "zhan2" and "lan2" are the pinyin subjected to the tone-change processing. After these are marked, the pinyin text after marking is: <zhan2> <lan2> guan3.
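Steps 201 to 203 can be sketched together as follows. The function name `apply_third_tone_sandhi` is hypothetical, and the sketch assumes the input is the syllable list of a single word (obtained by segmenting on the pause information) with the tone written as a trailing digit.

```python
def apply_third_tone_sandhi(syllables: list[str]) -> list[str]:
    """In each run of consecutive third-tone syllables, change every syllable
    before the last one to second tone and wrap it in angle brackets."""
    result = list(syllables)
    i = 0
    while i < len(result):
        if not result[i].endswith("3"):
            i += 1
            continue
        # Find the end of the run of third-tone syllables starting at i.
        j = i
        while j < len(result) and result[j].endswith("3"):
            j += 1
        # result[i:j] is the run; every syllable except the last changes tone.
        for k in range(i, j - 1):
            result[k] = "<" + result[k][:-1] + "2>"
        i = j
    return result


print(" ".join(apply_third_tone_sandhi(["zhan3", "lan3", "guan3"])))
# <zhan2> <lan2> guan3
```

A lone third-tone syllable forms a run of length one and is left unchanged, which matches the rule that only syllables before the last one in a run are altered.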

Step 104, providing the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking to a user for correction.

Optionally, as shown in fig. 3, the Chinese text to be labeled, the Chinese text with pause information, and the pinyin text after marking are provided to the user for correction, and a pinyin labeling result corresponding to the Chinese text to be labeled is obtained. The specific implementation process is as follows:

Step 301, providing the Chinese text to be labeled, the Chinese text with pause information, and the pinyin text after marking to the user.

Step 302, receiving a pinyin correction request of a user, wherein the correction request comprises: the pinyin to be corrected and the corresponding corrected pinyin.

Step 303, the pinyin text after the marking process is corrected according to the pinyin correction request, so as to obtain a corrected pinyin text.

In the embodiment of the application, the Chinese text to be labeled, the Chinese text with pause information, and the pinyin text after marking can be sent to a user (such as an annotator). When the user finds an error in the marked pinyin text, the user can send a pinyin correction request to the text labeling apparatus. The text labeling apparatus then corrects the marked pinyin text according to the pinyin correction request to obtain the corrected pinyin text. The correction request may include, but is not limited to, the pinyin to be corrected and the corresponding corrected pinyin.

For example, take "一座展览馆" (an exhibition hall). The Chinese text to be labeled is: 一座展览馆; the Chinese text with pause information is: 一座 (1) 展览馆; and the pinyin text after marking is: {yi2} zuo4 <zhan2> <lan2> guan3. During text labeling, the text to be labeled is displayed graphically, and the parts of the marked pinyin text enclosed in parentheses ( ), angle brackets < >, or braces { }, that is, the parts that require manual inspection, are highlighted. The annotator only needs to check whether the highlighted parts are correct.

Step 304, deleting the marks in the corrected pinyin text to obtain the pinyin labeling result corresponding to the Chinese text to be labeled.

Finally, the marks in the corrected pinyin text are deleted to obtain the pinyin labeling result corresponding to the Chinese text to be labeled. For example, if the Chinese text to be labeled is "一座展览馆" (an exhibition hall) and the corrected pinyin text is {yi2} zuo4 <zhan2> <lan2> guan3, deleting the marks gives the pinyin labeling result: yi2 zuo4 zhan2 lan2 guan3.
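Steps 302 to 304 can be sketched as below. The source states only that a correction request carries the pinyin to be corrected and the corrected pinyin; the `position` field, the `CorrectionRequest` class, and the three helper functions are assumptions introduced here for illustration, as is the draft with a deliberately wrong first syllable.

```python
import re
from dataclasses import dataclass

MARK_CHARS = "{}<>"


@dataclass
class CorrectionRequest:
    position: int  # hypothetical field: index of the syllable in the pinyin text
    old: str       # the pinyin to be corrected
    new: str       # the corresponding corrected pinyin


def items_to_review(marked_text: str) -> list[str]:
    """Return the marked syllables, i.e. the ones an annotator should check."""
    return re.findall(r"\{[^{}]+\}|<[^<>]+>", marked_text)


def apply_correction(marked_text: str, request: CorrectionRequest) -> str:
    """Replace one syllable, keeping whatever mark surrounds it."""
    tokens = marked_text.split()
    token = tokens[request.position]
    bare = token.strip(MARK_CHARS)
    if bare != request.old:
        raise ValueError(
            f"expected {request.old!r} at position {request.position}, found {bare!r}"
        )
    tokens[request.position] = token.replace(bare, request.new)
    return " ".join(tokens)


def strip_marks(marked_text: str) -> str:
    """Delete every mark character, leaving plain pinyin."""
    return re.sub(r"[{}<>]", "", marked_text)


# Hypothetical draft in which the first syllable was generated incorrectly.
marked = "{yi1} zuo4 <zhan2> <lan2> guan3"
corrected = apply_correction(marked, CorrectionRequest(position=0, old="yi1", new="yi2"))
print(items_to_review(corrected))  # ['{yi2}', '<zhan2>', '<lan2>']
print(strip_marks(corrected))      # yi2 zuo4 zhan2 lan2 guan3
```

Checking `old` against the text before replacing guards against a stale request being applied to the wrong syllable.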

Optionally, after the Chinese text to be labeled, the Chinese text with pause information, and the pinyin text after marking are provided to the user for correction and the pinyin labeling result corresponding to the Chinese text to be labeled is obtained, the pause information in the Chinese text with pause information can be inserted into the pinyin labeling result to obtain a pinyin labeling result with pause information; a preset speech synthesis system is then trained according to the audio corresponding to the Chinese text and the pinyin labeling result with pause information to obtain the trained speech synthesis system.

That is, the pinyin labeling result with pause information can be paired with the corresponding audio data to build a speech synthesis system. For example, a neural-network-based speech synthesis system has the ability to "learn". The pinyin labeling result with pause information is used as the input of the system, the audio corresponding to the text is used as the output, and a large number of pinyin text-audio pairs are used as training data to train the system. Through training, the neural-network-based speech synthesis system can "learn" the pronunciation, tone, and so on corresponding to Chinese sentences. After training, arbitrary Chinese text only needs to be input at the front end of the speech synthesis system, and the system can directly output speech similar to that of a real person.
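Carrying the pause information over to the pinyin labeling result can be sketched as follows, assuming pauses are written as "|" in the Chinese text and that every character maps to exactly one syllable (both assumptions; `insert_pauses_into_pinyin` is a hypothetical helper). The resulting string is the kind of input that would be paired with the corresponding audio as one training example.

```python
def insert_pauses_into_pinyin(text_with_pauses: str, pinyin_result: str,
                              mark: str = "|") -> str:
    """Copy the pause marks of the Chinese text into the pinyin labeling result."""
    syllables = pinyin_result.split()
    words = text_with_pauses.split(mark)
    if sum(len(word) for word in words) != len(syllables):
        raise ValueError("expected exactly one syllable per Chinese character")
    groups = []
    start = 0
    for word in words:
        # Each character of the word consumes one syllable.
        groups.append(" ".join(syllables[start:start + len(word)]))
        start += len(word)
    return f" {mark} ".join(groups)


print(insert_pauses_into_pinyin("展览|馆", "zhan2 lan2 guan3"))
# zhan2 lan2 | guan3
```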

According to the text marking method combining the RPA and the AI, the Chinese text to be marked is obtained, and the pinyin text corresponding to the Chinese text is generated; determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and providing the Chinese text to be marked, the Chinese text with pause information and the pinyin text marked and processed for a user to correct. The method preprocesses the Chinese text to be marked to generate the pinyin text to be corrected, marks the pinyin meeting the condition to be corrected in the pinyin text and provides the pinyin text to a user for correction, and the user only needs to correct the pinyin text after marking processing to obtain the pinyin marking result corresponding to the Chinese text to be marked with high accuracy, so that the user can be assisted in marking the text, and the marking efficiency and the accuracy of the text are improved.

Corresponding to the text labeling methods provided in the foregoing embodiments, an embodiment of the present application further provides a text labeling apparatus combining RPA and AI. Because this apparatus corresponds to the text labeling methods combining RPA and AI provided in the foregoing embodiments, the foregoing method embodiments also apply to the apparatus and are not described again in detail here. Fig. 4 is a schematic structural diagram of a text annotation device combining an RPA and an AI according to an embodiment of the present application. As shown in fig. 4, the text labeling apparatus combining RPA and AI includes: an acquisition module 410, a determining module 420, a marking module 430, and a providing module 440.

The acquisition module 410 is configured to acquire a Chinese text to be labeled and generate a pinyin text corresponding to the Chinese text; the determining module 420 is configured to determine pause information corresponding to the Chinese text by using an NLP technique and insert the pause information into the Chinese text to obtain the Chinese text with the pause information; the marking module 430 is configured to mark the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and the providing module 440 is configured to provide the Chinese text to be labeled, the Chinese text with pause information, and the pinyin text after marking to the user for correction.
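One way to picture how the four modules fit together is the sketch below. It is not the structure prescribed by the application: `ReviewBundle`, `TextLabelingApparatus`, and the callable fields are hypothetical, and the lambdas are stubs standing in for real module implementations.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class ReviewBundle:
    """The three texts handed to the user for correction."""
    chinese_text: str
    text_with_pauses: str
    marked_pinyin: str


@dataclass
class TextLabelingApparatus:
    generate_pinyin: Callable[[str], str]  # role of module 410
    add_pauses: Callable[[str], str]       # role of module 420
    mark_pinyin: Callable[[str], str]      # role of module 430

    def provide(self, chinese_text: str) -> ReviewBundle:
        """Role of module 440: run the other modules and bundle the results."""
        pinyin_text = self.generate_pinyin(chinese_text)
        return ReviewBundle(
            chinese_text=chinese_text,
            text_with_pauses=self.add_pauses(chinese_text),
            marked_pinyin=self.mark_pinyin(pinyin_text),
        )


# Stub callables with fixed outputs, used only to show the data flow.
apparatus = TextLabelingApparatus(
    generate_pinyin=lambda text: "zhan3 lan3 guan3",
    add_pauses=lambda text: "展览|馆",
    mark_pinyin=lambda pinyin: "<zhan2> <lan2> guan3",
)
bundle = apparatus.provide("展览馆")
print(bundle.marked_pinyin)  # <zhan2> <lan2> guan3
```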

As a possible implementation manner of the embodiment of the present application, the determining module 420 is specifically configured to obtain an audio corresponding to a to-be-labeled chinese text; identifying the audio, determining pause information in the audio, and determining the pause information in the audio as pause information corresponding to the Chinese text; and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information.

As a possible implementation manner of the embodiment of the present application, the condition to be corrected includes any one or more of the following: the pinyin corresponding to a polyphonic character, the pinyin corresponding to a tone-changed character, and the pinyin corresponding to a word composed of consecutive third-tone Chinese characters; the words are obtained by segmenting the Chinese text according to the pause information.

As a possible implementation manner of the embodiment of the present application, when the condition to be corrected is the presence of pinyin corresponding to a word composed of consecutive third-tone Chinese characters, the marking module 430 is specifically configured to obtain the pinyin corresponding to the words composed of consecutive third-tone Chinese characters in the pinyin text; perform tone-change processing on that pinyin to obtain a pinyin text after the tone-change processing; and mark the pinyin subjected to the tone-change processing in the pinyin text to obtain the pinyin text after marking.

As a possible implementation manner of the embodiment of the application, the tone-change processing of the pinyin corresponding to a word composed of consecutive third-tone Chinese characters includes: obtaining the character sequence corresponding to the consecutive third tones in the word; determining the pinyin of every character before the last character in the character sequence as the pinyin to be tone-changed; and changing the pinyin to be tone-changed to the second tone.

As a possible implementation manner of the embodiment of the present application, the providing module 440 is specifically configured to provide the chinese text to be labeled, the chinese text with pause information, and the pinyin text after the marking process to the user; receiving a pinyin correction request of a user, wherein the correction request comprises: the pinyin to be corrected and the corresponding corrected pinyin; correcting the pinyin text subjected to marking processing according to the pinyin correction request to obtain a corrected pinyin text; and deleting the mark in the corrected pinyin text to obtain a pinyin marking result corresponding to the Chinese text to be marked.

As a possible implementation manner of the embodiment of the present application, as shown in fig. 5, on the basis of fig. 4, the text annotation device combining the RPA and the AI further includes: an insertion module 450 and a training module 460.

The inserting module 450 is configured to insert the pause information in the Chinese text with pause information into the pinyin annotation result to obtain the pinyin annotation result with pause information; the training module 460 is configured to train a preset speech synthesis system according to the audio corresponding to the chinese text and the pinyin annotation result with the pause information, so as to obtain a trained speech synthesis system.

The text marking device combining the RPA and the AI in the embodiment of the application generates a pinyin text corresponding to a Chinese text by acquiring the Chinese text to be marked; determining pause information corresponding to the Chinese text by adopting an NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information; marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking; and providing the Chinese text to be marked, the Chinese text with pause information and the pinyin text marked and processed for a user to correct. The device can realize preprocessing the Chinese text to be labeled, generate the pinyin text to be corrected, mark the pinyin meeting the conditions to be corrected in the pinyin text, provide the pinyin text to the user for correction, and the user only needs to correct the pinyin text after marking processing, so that the pinyin labeling result corresponding to the Chinese text to be labeled with high accuracy can be obtained, the user can be assisted in labeling the text, and the labeling efficiency and the accuracy of the text are improved.

In order to implement the foregoing embodiments, the present application further provides a text labeling device combining RPA and AI. FIG. 6 is a schematic structural diagram of a text labeling device combining RPA and AI according to an embodiment of the present application. The text labeling device combining RPA and AI comprises:

a memory 1001, a processor 1002, and a computer program stored on the memory 1001 and executable on the processor 1002.

The processor 1002, when executing the program, implements the text annotation method combining the RPA and the AI provided in the above embodiments.

Further, the text labeling device combining the RPA and the AI further comprises:

a communication interface 1003 for communicating between the memory 1001 and the processor 1002.

A memory 1001 for storing computer programs that may be run on the processor 1002.

Memory 1001 may include high-speed RAM memory and may also include non-volatile memory (e.g., at least one disk memory).

The processor 1002 is configured to implement the text labeling method combining RPA and AI according to the foregoing embodiment when executing the program.

If the memory 1001, the processor 1002, and the communication interface 1003 are implemented independently, the communication interface 1003, the memory 1001, and the processor 1002 may be connected to each other through a bus and perform communication with each other. The bus may be an Industry Standard Architecture (ISA) bus, a Peripheral Component Interconnect (PCI) bus, an Extended ISA (EISA) bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 6, but this is not intended to represent only one bus or type of bus.

Optionally, in a specific implementation, if the memory 1001, the processor 1002, and the communication interface 1003 are integrated on one chip, the memory 1001, the processor 1002, and the communication interface 1003 may complete communication with each other through an internal interface.

The processor 1002 may be a Central Processing Unit (CPU), an Application Specific Integrated Circuit (ASIC), or one or more integrated circuits configured to implement the embodiments of the present application.

In order to implement the foregoing embodiments, the present application further proposes a non-transitory computer-readable storage medium, on which a computer program is stored, and the computer program, when executed by a processor, implements the text annotation method combining RPA and AI as described in the foregoing embodiments.

In the description herein, reference to the terms "one embodiment," "some embodiments," "an example," "a specific example," or "some examples" means that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the application. In this specification, the schematic representations of the terms used above are not necessarily intended to refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. In addition, various embodiments or examples and features of different embodiments or examples described in this specification can be combined by one skilled in the art without contradiction.

Furthermore, the terms "first" and "second" are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implicitly indicating the number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include at least one such feature. In the description of the present application, "plurality" means at least two, e.g., two, three, etc., unless specifically limited otherwise.

Any process or method descriptions in flow charts or otherwise described herein may be understood as representing modules, segments, or portions of code which include one or more executable instructions for implementing steps of a custom logic function or process, and alternate implementations are included within the scope of the preferred embodiment of the present application in which functions may be executed out of order from that shown or discussed, including substantially concurrently or in reverse order, depending on the functionality involved, as would be understood by those reasonably skilled in the art of the present application.

The logic and/or steps represented in the flowcharts or otherwise described herein, e.g., an ordered listing of executable instructions that can be considered to implement logical functions, can be embodied in any computer-readable medium for use by or in connection with an instruction execution system, apparatus, or device, such as a computer-based system, processor-containing system, or other system that can fetch the instructions from the instruction execution system, apparatus, or device and execute the instructions. For the purposes of this description, a "computer-readable medium" can be any means that can contain, store, communicate, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device. More specific examples (a non-exhaustive list) of the computer-readable medium would include the following: an electrical connection (electronic device) having one or more wires, a portable computer diskette (magnetic device), a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), an optical fiber device, and a portable compact disc read-only memory (CDROM). Additionally, the computer-readable medium could even be paper or another suitable medium upon which the program is printed, as the program can be electronically captured, via for instance optical scanning of the paper or other medium, then compiled, interpreted or otherwise processed in a suitable manner if necessary, and then stored in a computer memory.

It should be understood that portions of the present application may be implemented in hardware, software, firmware, or a combination thereof. In the above embodiments, the various steps or methods may be implemented in software or firmware stored in memory and executed by a suitable instruction execution system. If implemented in hardware, as in another embodiment, any one or combination of the following techniques, which are known in the art, may be used: a discrete logic circuit having a logic gate circuit for implementing a logic function on a data signal, an application specific integrated circuit having an appropriate combinational logic gate circuit, a Programmable Gate Array (PGA), a Field Programmable Gate Array (FPGA), or the like.

It will be understood by those skilled in the art that all or part of the steps of the methods in the above embodiments may be carried out by a program instructing the related hardware. The program may be stored in a computer-readable storage medium and, when executed, performs one or a combination of the steps of the method embodiments.

In addition, functional units in the embodiments of the present application may be integrated into one processing module, or each unit may exist alone physically, or two or more units are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium.

The storage medium mentioned above may be a read-only memory, a magnetic or optical disk, etc. Although embodiments of the present application have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present application, and that variations, modifications, substitutions and alterations may be made to the above embodiments by those of ordinary skill in the art within the scope of the present application.

Claims

1. A text labeling method combining RPA and AI is characterized by comprising the following steps:

S1, acquiring a Chinese text to be labeled, and generating a pinyin text corresponding to the Chinese text;

S2, determining pause information corresponding to the Chinese text by adopting a Natural Language Processing (NLP) technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information;

S3, marking the pinyin meeting the condition to be corrected in the pinyin text to obtain a pinyin text after marking processing;

S4, providing the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking processing to a user for correction.

2. The method according to claim 1, wherein the determining the pause information corresponding to the Chinese text by adopting the NLP technology, and inserting the pause information into the Chinese text to obtain the Chinese text with the pause information comprises:

S21, acquiring the audio corresponding to the Chinese text to be labeled;

S22, recognizing the audio, determining pause information in the audio, and determining the pause information in the audio as the pause information corresponding to the Chinese text;

S23, inserting the pause information into the Chinese text to obtain the Chinese text with the pause information.
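
Steps S21 to S23 can be illustrated with a short sketch. It assumes that recognizing the audio has already produced a per-character time alignment, and it treats any silent gap of at least `min_gap` seconds between adjacent characters as a pause; the alignment format, the 0.2 s threshold, the `#` symbol, and both function names are hypothetical choices rather than details from the application.

```python
def pauses_from_alignment(alignment, min_gap=0.2):
    """Return the indices of characters that are followed by a pause.

    alignment: list of (char, start_sec, end_sec) tuples, assumed to come
    from a speech recognizer or forced aligner.
    """
    pause_after = []
    for i in range(len(alignment) - 1):
        # Silence between the end of this character and the start of the next.
        gap = alignment[i + 1][1] - alignment[i][2]
        if gap >= min_gap:
            pause_after.append(i)
    return pause_after


def insert_pauses(text, pause_after, pause_symbol="#"):
    """Insert a pause symbol after each character index in pause_after."""
    marks = set(pause_after)
    return "".join(
        ch + (pause_symbol if i in marks else "") for i, ch in enumerate(text)
    )


alignment = [("今", 0.0, 0.2), ("天", 0.2, 0.45), ("天", 0.8, 1.0), ("气", 1.0, 1.2)]
pause_after = pauses_from_alignment(alignment)
text_with_pauses = insert_pauses("今天天气", pause_after)
```

Here the only gap that reaches the threshold lies between the second and third characters, so a single pause symbol is inserted at that position.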

3. The method according to claim 1, wherein the condition to be corrected comprises any one or more of the following: the pinyin corresponding to a polyphone, the pinyin corresponding to a tone-changing character, and the pinyin corresponding to a word composed of consecutive third-tone Chinese characters;

wherein the words are obtained by segmenting the Chinese text according to the pause information.
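
A minimal sketch of how words might be obtained from the pause information and checked for consecutive third tones is given here. It assumes pauses are written as `#`, that pinyin syllables end in a tone digit, and that a word qualifies when it contains at least two adjacent third-tone syllables; the function names and these conventions are hypothetical.

```python
def segment_by_pauses(chinese_with_pauses, pause_symbol="#"):
    """Split the pause-annotated text into words at each pause symbol."""
    return [w for w in chinese_with_pauses.split(pause_symbol) if w]


def flag_third_tone_words(words, pinyin):
    """Return indices of words with at least two adjacent third-tone syllables.

    pinyin: flat list with one syllable per character, in text order.
    """
    flagged = []
    pos = 0
    for idx, word in enumerate(words):
        # Tone digits of the syllables that belong to this word.
        tones = [s[-1] for s in pinyin[pos:pos + len(word)]]
        pos += len(word)
        if any(a == "3" and b == "3" for a, b in zip(tones, tones[1:])):
            flagged.append(idx)
    return flagged


words = segment_by_pauses("你好#世界")
flagged = flag_third_tone_words(words, ["ni3", "hao3", "shi4", "jie4"])
```

Restricting the check to syllables inside one word means that two third tones separated by a pause are not flagged.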

4. The method as claimed in claim 3, wherein when the condition to be corrected is the pinyin corresponding to a word composed of consecutive third-tone Chinese characters, the marking of the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking processing comprises:

S31, obtaining, from the pinyin text, the pinyin corresponding to the words composed of consecutive third-tone Chinese characters;

S32, performing tone-change processing on the pinyin corresponding to the words composed of consecutive third-tone Chinese characters to obtain a pinyin text after the tone-change processing;

S33, marking the pinyin subjected to the tone-change processing in the pinyin text to obtain the pinyin text after marking processing.

5. The method of claim 4, wherein the performing of the tone-change processing on the pinyin corresponding to the words composed of consecutive third-tone Chinese characters comprises:

S321, acquiring the character sequence corresponding to the consecutive third tones in the words composed of consecutive third-tone Chinese characters;

S322, determining the pinyin of all characters before the last character in the character sequence as the pinyin to be tone-changed;

S323, changing the pinyin to be tone-changed, within the pinyin corresponding to the words composed of consecutive third-tone Chinese characters, to the second tone.
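
The tone-change rule recited above (every syllable in a run of consecutive third tones except the last becomes second tone) and the subsequent marking can be sketched as follows. The sketch assumes syllables carry a trailing tone digit and that marks are square brackets; the function names are hypothetical, and the rule is applied exactly as recited rather than according to any finer prosodic analysis.

```python
import re


def tone_of(syllable):
    """Return the trailing tone digit as an int, or None if absent."""
    m = re.search(r"([1-5])$", syllable)
    return int(m.group(1)) if m else None


def apply_third_tone_change(word_pinyin):
    """Change all but the last syllable of each third-tone run to tone 2.

    Returns (new_syllables, changed_indices) for one word.
    """
    result = list(word_pinyin)
    changed = []
    i, n = 0, len(word_pinyin)
    while i < n:
        if tone_of(word_pinyin[i]) == 3:
            # Extend j to the end of the run of consecutive third tones.
            j = i
            while j + 1 < n and tone_of(word_pinyin[j + 1]) == 3:
                j += 1
            for k in range(i, j):  # every syllable before the last one
                result[k] = word_pinyin[k][:-1] + "2"
                changed.append(k)
            i = j + 1
        else:
            i += 1
    return result, changed


def mark_changed(syllables, changed, left="[", right="]"):
    """Wrap the tone-changed syllables in marks for user review."""
    return [
        f"{left}{s}{right}" if k in changed else s
        for k, s in enumerate(syllables)
    ]


new_pinyin, changed = apply_third_tone_change(["zhan3", "lan3", "guan3"])
marked = mark_changed(new_pinyin, changed)
```

Marking the changed syllables rather than silently rewriting them leaves the final decision with the user, who can revert a change that does not match the recording.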

6. The method according to claim 1, wherein the providing the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking processing to a user for correction to obtain a pinyin labeling result corresponding to the Chinese text to be labeled comprises:

S41, providing the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking processing to the user;

S42, receiving a pinyin correction request of the user, wherein the correction request comprises: the pinyin to be corrected and the corresponding corrected pinyin;

S43, correcting the pinyin text after marking processing according to the pinyin correction request to obtain a corrected pinyin text;

S44, deleting the marks in the corrected pinyin text to obtain the pinyin labeling result corresponding to the Chinese text to be labeled.
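
Steps S42 to S44 can be illustrated with a small sketch. The claim only states that a correction request carries the pinyin to be corrected and the corrected pinyin; the `position` field, the `CorrectionRequest` class, the square-bracket marks, and the function names below are hypothetical additions made so that the example is unambiguous.

```python
from dataclasses import dataclass


@dataclass
class CorrectionRequest:
    position: int    # assumption: index of the syllable in the pinyin text
    old_pinyin: str  # pinyin to be corrected
    new_pinyin: str  # corresponding corrected pinyin


def strip_mark(syllable):
    """Remove surrounding square-bracket marks, if any."""
    return syllable.strip("[]")


def apply_corrections(marked_pinyin, requests):
    """Apply each correction request to the marked pinyin text."""
    out = list(marked_pinyin)
    for req in requests:
        current = out[req.position]
        if strip_mark(current) != req.old_pinyin:
            raise ValueError(f"unexpected pinyin at position {req.position}")
        # Preserve the mark so that it is removed in one final pass.
        was_marked = current.startswith("[")
        out[req.position] = f"[{req.new_pinyin}]" if was_marked else req.new_pinyin
    return out


def remove_marks(marked_pinyin):
    """Delete all marks to obtain the final pinyin labeling result."""
    return [strip_mark(s) for s in marked_pinyin]


corrected = apply_corrections(
    ["yin2", "[xing2]"], [CorrectionRequest(1, "xing2", "hang2")]
)
final = remove_marks(corrected)
```

Checking the old pinyin before overwriting guards against a request that was issued against a stale copy of the text.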

7. The method according to claim 1, wherein after providing the Chinese text to be labeled, the Chinese text with the pause information, and the pinyin text after marking processing to the user for correction and obtaining a pinyin labeling result corresponding to the Chinese text to be labeled, the method further comprises:

S5, inserting the pause information in the Chinese text with the pause information into the pinyin labeling result to obtain the pinyin labeling result with the pause information;

S6, training a preset speech synthesis system according to the audio corresponding to the Chinese text and the pinyin labeling result with the pause information to obtain a trained speech synthesis system.

8. A text labeling device combining RPA and AI, comprising:

an acquisition module, used for acquiring a Chinese text to be labeled and generating a pinyin text corresponding to the Chinese text;

the determining module is used for determining the pause information corresponding to the Chinese text by adopting an NLP technology, inserting the pause information into the Chinese text and obtaining the Chinese text with the pause information;

the marking module is used for marking the pinyin meeting the condition to be corrected in the pinyin text to obtain the pinyin text after marking;

and the providing module is used for providing the Chinese text to be marked, the Chinese text with the pause information and the pinyin text after marking processing for a user for correction.

9. A text annotation device that combines RPA and AI, comprising:

a memory, a processor, and a computer program stored on the memory and executable on the processor, characterized in that the processor, when executing the program, implements the text labeling method combining RPA and AI according to any one of claims 1 to 7.

10. A non-transitory computer-readable storage medium, on which a computer program is stored, which, when executed by a processor, implements the method for text annotation combining RPA and AI according to any one of claims 1 to 7.