WO2021182984A1

WO2021182984A1 - Method and system for applying artificial intelligence in software development

Info

Publication number: WO2021182984A1
Application number: PCT/RU2020/000134
Authority: WO
Inventors: Сергей Станиславович ЧАЙКОВСКИЙ
Original assignee: Сергей Станиславович ЧАЙКОВСКИЙ
Priority date: 2020-03-13
Filing date: 2020-03-13
Publication date: 2021-09-16
Also published as: RU2020110541A3; RU2020110541A

Abstract

The present technical solution relates to systems for coding programs with the aid of artificial intelligence and to a method therefor. More specifically, the present invention relates to a system and a method for recognizing the voice of a user and generating program code corresponding to converted natural language. A method for applying artificial intelligence in software development, which can be carried out by at least one computer device, in which at least one machine learning model is generated by studying the mapping relationships between phrases in a natural language and program code; at least one program code fragment is obtained; the speech of at least one user is recognized by extracting phrases in the natural language; the extracted phrases in the natural language and the at least one previously obtained program code fragment are sent to the generated learning model; and the program code fragment is evaluated through application of the learning model.

Description

METHOD AND SYSTEM FOR APPLYING ARTIFICIAL INTELLIGENCE IN SOFTWARE DEVELOPMENT

TECHNICAL FIELD

[001] The present technical solution relates to systems for coding programs using artificial intelligence and to a method therefor. More specifically, the present invention relates to a system and method for recognizing a user's voice and generating program code corresponding to the converted natural language. The present invention relates to technical solutions in which program code is generated automatically and is enhanced and optimized from the user's voice using artificial intelligence.

BACKGROUND ART

[002] As digital technology advances and computing performance improves, the successful application of artificial intelligence algorithms based on complex deep learning is increasing. In recent years, deep learning has been used in real life and its scope has expanded.

[003] Since artificial intelligence of this kind is implemented through programming, specialists with software expertise must be trained. As computing power and the complexity of computations grow, so does the need for various types of programming training, and users are increasingly interested in creating software for applications, big data analysis, data mining, and statistical analysis.

[004] Although the need for education in various types of software has increased significantly, training is often not freely accessible because users must be taught in a certain place and at a certain time. In addition, there is a shortage not only of tools but also of educational space, so the cost of training is too high.

[005] To solve these problems, online education is being actively promoted. Its advantages include mass, high-quality education, hands-on learning, freedom from time and space constraints, and the use of mobile user terminals such as smartphones and tablets. Intelligent educational services that users can use freely on the move are attracting more and more attention.

[006] However, when the educational content relates to programming (e.g., in the programming languages R, Python, Java, C, etc.), where practice or coding is important, entering data for coding practice through a user terminal during online learning is often inconvenient. As a result, the training program cannot be carried out well in a mobile environment. The program listing must be typed manually, which is very inconvenient and complicates the process.

[007] In addition, most of the software currently distributed online is intended for the general public. Therefore, even if a user with disabilities, for example, is interested in the software, that user is limited in the ability to learn it.

[008] Next, a brief description will be given of the prior art in the art of the present invention, and technical problems that the present invention solves in comparison with the prior art will be described.

[009] The prior art includes application KR20020030156A, “Control method of computer program used voice recognition technology” (rights holder: PARK KEE BUM, published: 04.24.2002). This technical solution discloses a system and method for controlling the operating system of a computer and various application programs using only the user's voice commands, by means of voice recognition technology and without a manual input device such as a keyboard or mouse. That invention relates to a method for controlling a computer program using speech recognition, which allows not only the general public but also people with various disabilities to easily use various programs as well as the computer system.

[0010] However, in that solution it is the computer program that is controlled, not the program code itself or the writing of it.

[0011] The present invention recognizes the user's voice to perform natural language processing and generates the corresponding program code based on deep learning, thereby not only automatically solving the programming problem, but also technically improving the image quality configuration. Therefore, the prior art is clearly different from the method presented in the present invention.

[0012] Also known from the prior art is patent No. US6604110B1, "Automated software code generation from a metadata-based repository", patentee: International Business Machines Corp, publication date: 31.10.2000. This technical solution discloses a method for providing executable code for use in an enterprise data model (EDM) management application that transfers data from one or more data sources to an EDM application database for use therein in response to user commands.

[0013] That solution operates only in interactive mode in response to user commands, which is labor intensive and reduces the speed of operation. Moreover, it provides only executable code, which significantly narrows the scope of the technical solution, and it does not provide scripts or metadata that would determine the execution scenario at runtime.

SUMMARY OF THE TECHNICAL SOLUTION

[0014] This technical solution is aimed at eliminating the disadvantages known from the prior art.

[0015] The technical problem solved by this technical solution is the implementation of a system and method for recognizing a user's voice and generating program code corresponding to the converted natural language.

[0016] artificial intelligence applications.

[0017] The specified technical result is achieved by a speech-recognition-based method for coding programs using artificial intelligence according to an embodiment of the present invention, which includes the steps of generating a learning model by learning the mapping relationship between natural language phrases and program code, extracting a natural language phrase by recognizing the user's speech, and evaluating the program code and outputting it to the user's display.

BRIEF DESCRIPTION OF DRAWINGS

[0018] The features and advantages of the present technical solution will become apparent from the following detailed description and the accompanying drawings, in which:

[0019] FIG. 1 shows an example of an implementation of an artificial intelligence program coding system and its method in accordance with the present invention.

[0020] FIG. 2 is a block diagram of an artificial intelligence speech recognition based program coding apparatus according to an embodiment of the present invention.

[0021] FIG. 3 is a flowchart illustrating a process for preprocessing input data that is input to a CNN when performing machine learning via a CNN in a speech recognition-based artificial intelligence program encoder according to an embodiment of the present invention.

[0022] FIG. 4 is a flowchart illustrating the preprocessing of input data to be entered into an RNN when performing machine learning via the RNN in a device for coding artificial intelligence programs based on speech recognition in accordance with an embodiment of the present invention.

[0023] FIG. 5 is a block diagram illustrating a CNN based learning structure and learning process in accordance with an embodiment of the present invention.

[0024] FIG. 6 is a block diagram for explaining a machine learning structure and learning process via an RNN in accordance with an embodiment of the present invention.

[0025] FIG. 7 is a flowchart illustrating a process of performing a method for coding a program using artificial intelligence based on speech recognition in accordance with an embodiment of the present invention.

DETAILED DESCRIPTION OF THE INVENTION

[0026] The terms used in the description of this technical solution and their definitions are discussed in detail below.

[0027] In this invention, a system means a computer system, a computer (electronic computing machine), CNC (computer numerical control), PLC (programmable logic controller), computerized control systems, and any other devices capable of performing a given, well-defined sequence of operations (actions, instructions), as well as centralized and distributed databases and smart contracts.

[0028] A command processor means an electronic unit or an integrated circuit (microprocessor) that executes machine instructions (programs), or the like. A command processor reads and executes machine instructions (programs) from one or more storage devices. Data storage devices may include, but are not limited to, hard disk drives (HDD), flash memory, ROM (read-only memory), solid-state drives (SSD), and optical drives.

[0029] A program is a sequence of instructions intended for execution by a computer control device or a command processing device.

[0030] A server is an electronic device that performs service functions at the request of a client, providing the client with access to certain resources. For the purposes of this description, a server is contemplated that has a persistent internetwork connection over which data is exchanged with a client device. The server can process this data and transmit the processing result back to the client device.

[0031] A data exchange unit is a server unit that can act as a receiver of incoming signals, a converter for their subsequent processing, and a transmitter for sending them onward.

[0032] The computing unit is a server unit that is a microprocessor specially adapted for complex signal processing.

[0033] As shown in FIG. 1, an artificial intelligence program coding system using speech recognition recognizes the user's voice and converts the recognized speech into program code using machine learning through deep learning. The system includes a training database (200) (e.g., Sen2Cod), in which mapping tables between natural language and program code labels are stored as data for machine learning, and one or more user devices (400).

[0034] Initially, the training database 200 stores programming elements, including instructions, functions, variables, constants, or combinations thereof, for various programming languages such as, but not limited to, R, C++, Java, and Python. The database includes information that matches code labels to the words that users say. For example, all possible natural language expressions are generated for specific program codes, and the natural language expressions and the specific program codes are stored in a 1:1 ratio.

[0035] The mapping table between natural language and program code labels is used as training data for deep learning in the speech-recognition-based AI program coding apparatus 100, as shown in FIG. 1. The training data is used as input to machine learning; as a result, a learning model is created and then stored in the learning model database 300 for use in software coding. Here, machine learning can be performed with, but is not limited to, a recurrent neural network (RNN), a convolutional neural network (CNN), or other suitable types of neural networks.
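
The mapping table of paragraph [0034] can be pictured with a minimal Python sketch. The phrases, label numbers, and code lines below are invented for illustration and are not taken from the Sen2Cod database; the two-dictionary layout is likewise an assumption.

```python
# Hypothetical rows of a Sen2Cod-style mapping table. Every phrase maps to
# exactly one program code label, and every label indexes one line of code.
PHRASE_TO_LABEL = {
    "print the total": 0,
    "show the total": 0,
    "assign b to a": 1,
}
LABEL_TO_CODE = {
    0: "print(total)",
    1: "a = b",
}


def lookup(phrase: str) -> str:
    """Return the program code stored for a phrase (case-insensitive)."""
    label = PHRASE_TO_LABEL[phrase.strip().lower()]
    return LABEL_TO_CODE[label]


def training_pairs():
    """Yield (phrase, label) pairs, the form in which the table feeds training."""
    for phrase, label in sorted(PHRASE_TO_LABEL.items()):
        yield phrase, label
```

In this sketch several phrasings may share one label, while each individual phrase still resolves to a single code line, which is one way to read the 1:1 storage described above.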

[0036] Meanwhile, the speech-recognition-based artificial intelligence program coding apparatus 100 preprocesses the training data to suit the machine learning method listed above (i.e., RNN or CNN). The apparatus 100 then learns from the preprocessed training data and generates a learning model for evaluating the program code for a specific natural language phrase of the user. The preprocessing for CNN and RNN will be described in detail below with reference to FIGS. 3 and 4, respectively.

[0037] The basic data preprocessing unit may contain a text preprocessing unit, which reduces the variety of possible message texts in order to simplify the operation of the subsequent modules of the system 100. This unit may be designed to split the incoming user input into sentences and words and to perform morphological analysis, syntactic parsing, and semantic typing of tokens.

[0038] The tokenization stage involves selecting basic text elements (tokens) delimited on both sides by separator characters, spaces, or punctuation marks. The elements here are words, numbers, dates, abbreviations, acronyms, compound prepositions, etc. Tokenization makes it possible to select discrete units of text, which are the basis for further work at the stages of morphological analysis and syntactic parsing. As a result of tokenization, each element is assigned the appropriate type: word, number, date, address, etc.
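
The tokenization stage of paragraph [0038] could look roughly like the following Python sketch. The three token types and their regular expressions are assumptions chosen for brevity; the description does not specify patterns, and a real tokenizer would also cover abbreviations, addresses, and compound prepositions.

```python
import re

# Assumed token types and patterns, tried in this order. Dates come before
# numbers so that "2020-03-13" is not split into three separate numbers.
TOKEN_PATTERNS = [
    ("date", r"\d{4}-\d{2}-\d{2}"),
    ("number", r"\d+(?:\.\d+)?"),
    ("word", r"[^\W\d_]+"),
]
TOKEN_RE = re.compile(
    "|".join(f"(?P<{name}>{pattern})" for name, pattern in TOKEN_PATTERNS)
)


def tokenize(text):
    """Return (token, type) pairs; separators and punctuation are skipped."""
    return [(m.group(), m.lastgroup) for m in TOKEN_RE.finditer(text)]
```

For example, `tokenize("Add 2 to x")` yields `[("Add", "word"), ("2", "number"), ("to", "word"), ("x", "word")]`.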

[0039] The purpose of text preprocessing is to prepare the text for high-quality classification and subsequent labeling with program code.

[0040] To generate program code, the user uses the learning model from the learning model database 300 through the speech-recognition-based artificial intelligence program coding apparatus 100. The user device 400 converts the voice recognition result into program code through the apparatus 100 or through a program coding application downloaded from the apparatus 100. In this process, the user can input mapping data between natural language and program code if the program produced from the voice input is not optimal. That is, a user or manager with certain authority checks the output program, determines through a verification process whether it is the optimal program, and updates the training database 200 storing the mapping table. Here, the user or manager with certain authority or skills can be an expert in a particular programming language. The user performing the verification process is therefore not an ordinary user but a person with an advanced level of proficiency in a particular programming language.

[0041] The user device 400 is a mobile communication device such as a smartphone, tablet computer, or the like, and serves as the user's terminal. The user device 400 includes a microphone for receiving the user's voice. The mobile communication device downloads a program coding application from the speech-recognition-based artificial intelligence program coding apparatus 100 and executes the application, thereby becoming the user device 400 for coding programs.

[0042] When the training database is updated, the speech recognition coding server (6) updates the learning model and stores the result in the learning model database 300.

[0043] As users use the speech recognition coding system in accordance with the present invention, the learning model becomes more advanced through reinforcement learning.

[0044] In addition, the device 100 for encoding programs using artificial intelligence through speech recognition may be implemented by a cloud server, a local server, or a combination thereof. Speech recognition can include receiving user speech from user device 400, but it can be performed on another user device 400 having a speech recognition engine.

[0045] FIG. 2 shows a block diagram of an artificial intelligence speech recognition based program coding apparatus according to an embodiment of the present invention.

[0046] A speech-recognition-based artificial intelligence program coding apparatus 100 according to an embodiment of the present invention includes: a speech recognition unit 110 for recognizing a user's speech received from a user device 400; a preprocessing unit 120 for preprocessing the user's voice and the training data so that the training data suits the machine learning method; a learning model generating unit 130 for generating a learning model by learning from the preprocessed training data; a program code evaluator 140 (shown as the program code evaluator in FIG. 2) for evaluating program code by applying the preprocessed user's voice to the generated learning model; a verification unit 150 for checking the evaluated program code; a training data update unit 160 for updating the training data; and a controller 170 for controlling the parts of the apparatus 100 as a whole.

[0047] The speech recognition unit 110 recognizes the user speech received in real time from the user terminal, extracts the natural language phrase or phrases (hereinafter referred to as natural language) from the recognized speech, and provides the extracted natural language to the preprocessing unit 120.

[0048] The preprocessing unit 120 then preprocesses the natural language and feeds it into the learning model generated by the learning model generating unit 130, so that the program code for the user's voice can be evaluated through the learning model.

[0049] Thus, the speech recognition unit 110 recognizes the user's speech through the speech recognition engine and performs natural language processing, extracts the natural language from the user's speech, and evaluates the specific program code for the extracted natural language through the learning model.
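
The flow from recognition through preprocessing to code evaluation can be sketched in Python as a chain of three callables. This is only an illustration under stated assumptions: the class name `CodingPipeline`, the stand-in functions, and the dictionary `TOY_MODEL` are hypothetical, and the "recognizer" merely decodes bytes instead of processing audio.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass
class CodingPipeline:
    """Hypothetical chain mirroring units 110 -> 120 -> 140 of the apparatus."""
    recognize: Callable[[bytes], str]            # speech recognition unit 110
    preprocess: Callable[[str], Sequence[str]]   # preprocessing unit 120
    evaluate: Callable[[Sequence[str]], str]     # program code evaluator 140

    def code_for(self, audio: bytes) -> str:
        phrase = self.recognize(audio)
        features = self.preprocess(phrase)
        return self.evaluate(features)


# Stand-ins only: a lookup table plays the role of the trained learning model.
TOY_MODEL = {("print", "total"): "print(total)"}

pipeline = CodingPipeline(
    recognize=lambda audio: audio.decode("utf-8"),
    preprocess=lambda phrase: tuple(phrase.lower().split()),
    evaluate=lambda features: TOY_MODEL.get(features, "# no code found"),
)
```

Keeping the three stages as separate callables reflects the point made in the description that recognition may run either on the apparatus or on the user device without changing the later stages.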

[0050] Meanwhile, the speech recognition unit 110 performs natural language processing on the speech of a user who is performing programming exercises or coding tasks, through morphological, syntactic, semantic, and discourse analysis, and the like. For example, Siri or various machine learning methods for speech recognition can be used to perform natural language processing on user speech. In the present invention, there is no limitation on the method of processing the user's natural speech.

[0051] Natural language processing of the user's speech may also be performed on a user device 400 equipped with a speech recognition engine. In that case, the speech recognition unit 110 receives the natural language from the user device 400 and obtains data that can be input into the learning model.

[0052] The preprocessing unit 120 takes the training data stored in the training database 200 and preprocesses it in accordance with the machine learning method performed by the learning model generation unit 130.

[0053] The training data is stored in the training database 200 in the form of a mapping table in which each program code label, configured in the coding scheme, is indexed to specific program code and to natural language in a 1:1 ratio.

[0054] In addition, the learning model generating unit 130 can generate a learning model by learning from the preprocessed training data through at least one machine learning method, such as CNN or RNN.

[0055] Convolutional neural networks (CNNs) provide partial robustness to changes in scale, displacement, rotation, change of perspective, and other distortions. Convolutional neural networks combine three architectural ideas to provide invariance to scaling, rotation, shift, and spatial distortion:

• local receptive fields (provide local two-dimensional connectivity of neurons);

• shared synaptic weights (provide detection of a given feature anywhere in the image and reduce the total number of weights);

• hierarchical organization with spatial subsampling.

[0056] Currently, the convolutional neural network and its modifications are considered the best algorithms for finding objects in terms of accuracy and speed.
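
The three architectural ideas listed above can be illustrated with a minimal pure-Python sketch: a single kernel (shared weights) slides over local patches (receptive fields), and the result is subsampled by 2x2 max pooling. The sample image and the edge-detecting kernel are invented for illustration; practical CNNs use learned kernels and a numerical library.

```python
def conv2d(image, kernel):
    """Valid 2-D cross-correlation: one shared kernel visits every local patch."""
    kh, kw = len(kernel), len(kernel[0])
    out_h, out_w = len(image) - kh + 1, len(image[0]) - kw + 1
    return [
        [
            sum(image[i + di][j + dj] * kernel[di][dj]
                for di in range(kh) for dj in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def max_pool2x2(fmap):
    """Spatial subsampling: keep the maximum of each non-overlapping 2x2 block."""
    return [
        [max(fmap[i][j], fmap[i][j + 1], fmap[i + 1][j], fmap[i + 1][j + 1])
         for j in range(0, len(fmap[0]) - 1, 2)]
        for i in range(0, len(fmap) - 1, 2)
    ]


# Illustrative input: a 5x5 image with a vertical edge between columns 1 and 2.
EDGE_IMAGE = [[0, 0, 1, 1, 1] for _ in range(5)]
VERTICAL_EDGE_KERNEL = [[-1, 1], [-1, 1]]

feature_map = conv2d(EDGE_IMAGE, VERTICAL_EDGE_KERNEL)   # 4x4, rows are [0, 2, 0, 0]
pooled = max_pool2x2(feature_map)                        # [[2, 0], [2, 0]]
```

The same four kernel weights respond to the edge wherever it appears in a row, which is the weight-sharing property, and pooling halves each dimension while preserving the response.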

[0057] Recurrent neural networks (RNNs) are a class of machine learning models that use previous network states to compute the current one. Each character of the source text, individual words, punctuation marks, and even entire phrases can serve as an atomic element of the input sequence for the neural network. In some implementations, gated recurrent units (GRUs) may be used. The update gate determines how much information is kept from the previous state and how much is taken from the previous layer. The reset gate works much like a forget gate.

[0058] For training neural networks, corpora, namely treebanks, are used as the training sample. In linguistics, a corpus is a collection of texts selected and processed according to certain rules and used as a base for the study of a language. Corpora are used for statistical analysis, statistical hypothesis testing, and confirmation of linguistic rules in a given language. A treebank is a collection of parsed sentences (that is, parse graphs) prepared in advance, manually or automatically. Treebanks are classified into phrase-structure treebanks and dependency treebanks. In this technical solution, the following treebanks or corpora for the Russian language can be used, among others: SynTagRus (1,107 thousand tokens), PUD (19 thousand tokens), GSD (99 thousand tokens), Taiga (20 thousand tokens), Dependency Treebanks, etc.
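
The role of the update and reset gates of the gated recurrent unit mentioned in paragraph [0057] can be shown with a scalar Python sketch. The parameter names are hypothetical, and the state-mixing convention used here (the update gate weights the candidate state) is only one of the conventions found in the literature; some formulations swap the roles of `z` and `1 - z`.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def gru_step(x, h_prev, p):
    """One step of a scalar GRU; p holds assumed weights and biases."""
    z = sigmoid(p["wz"] * x + p["uz"] * h_prev + p["bz"])   # update gate
    r = sigmoid(p["wr"] * x + p["ur"] * h_prev + p["br"])   # reset gate
    # The reset gate scales how much of the previous state enters the candidate.
    h_cand = math.tanh(p["wh"] * x + p["uh"] * (r * h_prev) + p["bh"])
    # The update gate mixes the previous state with the candidate state.
    return (1.0 - z) * h_prev + z * h_cand


def run_sequence(xs, p, h0=0.0):
    """Feed a sequence through the cell, returning the final hidden state."""
    h = h0
    for x in xs:
        h = gru_step(x, h, p)
    return h
```

With all parameters set to zero both gates equal 0.5 and the candidate is 0, so each step simply halves the previous state; a strongly negative `bz` closes the update gate and the previous state is carried over almost unchanged.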

[0059] Next, the transition-based dependency parsing approach, widely known in the art, is applied. This approach attempts to predict a sequence of actions (transitions) leading from an initial configuration of a phrase or user request to a final one, which yields the desired parse tree. It provides sufficiently high accuracy and fairly high text-processing speed.

[0060] The arc-standard system is one of the most popular approaches to implementing a transition-based system. The system is described by a configuration consisting of three parts: $c = (s, b, A)$,

[0061] where: $s$ is the data stack;

[0062] $b$ is the data buffer;

[0063] $A$ is the set of dependencies.

[0064] Before processing, the initial configuration for the sequence of symbols $w_1, \ldots, w_n$ is as follows:

[0065] $s = [\mathrm{ROOT}]$: one service symbol on the stack;

[0066] $b = [w_1, \ldots, w_n]$: the entire sequence of symbols is in the buffer;

[0067] $A = \emptyset$: the set of dependencies is empty.

[0068] The final configuration after processing is as follows:

[0069] $s = [\mathrm{ROOT}]$: one service symbol on the stack;

[0070] $b$ is empty;

[0071] $A$ contains the desired parse tree.

[0072] Let $s_i$, $i = 1, 2, \ldots$, denote the $i$-th element of the stack, and $b_i$, $i = 1, 2, \ldots$, the $i$-th element of the data buffer.

[0073] The arc-standard system approach has 4 types of operations:

[0074] SHIFT: removes $b_1$ from the buffer and adds it to the stack;

[0075] LEFT_ARC: adds to $A$ a link from $s_1$ to $s_2$ with a specific link-type label and removes $s_2$ from the stack;

[0076] RIGHT_ARC: similar to LEFT_ARC, but with $s_1$ and $s_2$ interchanged;

[0077] SWAP: returns the second item of the stack to the buffer.

[0078] Thus, there are a total of $|T| = 2N_l + 2$ possible actions (SHIFT, SWAP, and one LEFT_ARC and one RIGHT_ARC per label), where $N_l$ is the number of dependency label types. The purpose of the text parser 210 is to select the most appropriate action for a given configuration.
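
The configuration and the four operations can be sketched in Python as follows. The sketch only applies a given transition sequence; choosing the transitions is the job of the trained classifier and is not shown. The names `Configuration`, `apply_transition`, and `run` are hypothetical, words are used directly as node identifiers (so a phrase with repeated words would need indices), and the sample phrase with its labels is invented.

```python
from dataclasses import dataclass, field


@dataclass
class Configuration:
    stack: list                                  # s, the top is the last element
    buffer: list                                 # b, the front is the first element
    arcs: set = field(default_factory=set)       # A, triples (head, label, dependent)


def initial_configuration(words):
    return Configuration(stack=["ROOT"], buffer=list(words))


def apply_transition(c, action, label=None):
    if action == "SHIFT":
        c.stack.append(c.buffer.pop(0))
    elif action == "LEFT_ARC":
        s1, s2 = c.stack[-1], c.stack[-2]
        c.arcs.add((s1, label, s2))              # link from s1 to s2
        del c.stack[-2]                          # remove s2
    elif action == "RIGHT_ARC":
        s1, s2 = c.stack[-1], c.stack[-2]
        c.arcs.add((s2, label, s1))              # link from s2 to s1
        c.stack.pop()                            # remove s1
    elif action == "SWAP":
        c.buffer.insert(0, c.stack.pop(-2))      # second stack item back to buffer
    else:
        raise ValueError(f"unknown action: {action}")
    return c


def is_final(c):
    return c.stack == ["ROOT"] and not c.buffer


def run(words, transitions):
    c = initial_configuration(words)
    for action, label in transitions:
        apply_transition(c, action, label)
    return c


parsed = run(
    ["print", "the", "total"],
    [("SHIFT", None), ("SHIFT", None), ("SHIFT", None),
     ("LEFT_ARC", "det"), ("RIGHT_ARC", "obj"), ("RIGHT_ARC", "root")],
)
```

After these six transitions the stack holds only ROOT, the buffer is empty, and the arc set contains the three dependencies of the phrase, matching the final configuration described above.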

[0079] To train the artificial neural network, the most appropriate sequence of actions must be generated from the available data. At each step, the configuration serves as the input data and the action as the expected response.

[0080] Accordingly, when the learning model generation unit 130 performs machine learning via the CNN, the preprocessing unit 120 quantifies the mapping relationship between the natural language of the mapping table and the program code and converts it into an image so that the training data is suitable for the CNN, and stores the resulting image data in the training database 200.

[0081] Next, the learning model generation unit 130 receives the image-converted training data, generates a learning model for evaluating the program code for a specific natural language, and stores the generated learning model in the learning model database 300.

[0082] Each image obtained by quantifying the mapping relationship is set to be unique, so that the mapping relationship between a specific natural language and a program code label is not duplicated.

[0083] In addition, the CNN may be composed of a plurality of networks of the same size, or the size of the network may gradually increase as the training data is updated or changed. That is, the CNN is adapted to the growing volume of all the training data to be learned. Accordingly, the depth of the CNN can vary depending on the size of the network.

[0084] The image also contains at least one image fragment, and the image fragment contains a binary image, a grayscale image, a color image, or a combination thereof, depending on the degree of quantification of the natural language, the program code, and their mapping relationship. For example, when the natural language, the program code, and their mapping relationship are expressed numerically as 0 or 1, the training data can be converted to a binary image. When the numeric values are in the range 0 to 255, the training data is converted to a grayscale image. The image resolution is configured so that it can be increased continuously according to the CNN size. The CNN preprocessing process will be described in detail with reference to FIG. 3.

[0085] In addition, when the learning model generation unit 130 performs machine learning via the RNN, the preprocessing unit 120 can quantify the mapping relationship between the natural language of the mapping table and the program code label as a sequence of numbers, so that the training data is suitable for the RNN.

[0086] Thereafter, the learning model generation unit 130 receives a plurality of sequences obtained by digitizing the mapping relationship between the natural language and the program code labels, generates a learning model for evaluating the specific program code from the user's speech, and stores it in the learning model database 300.

[0087] Each sequence of numeric values is set to be unique, so that the mapping relationship between a specific natural language and a program code label is not duplicated.

[0088] In addition, the RNN may be composed of a plurality of networks of the same size, or the size of the network may gradually increase in accordance with the update or change of the training data. That is, the RNN is adapted to the capacity of all the training data to be learned.

[0089] In addition, the sequence includes at least one sequence fragment, and the sequence fragment includes a combination of at least one bit sequence. The size of the bit sequence can be increased continuously according to the size of the RNN. The RNN preprocessing process will be described in detail with reference to FIG. 4.
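
One possible reading of the sequence encoding for the RNN is sketched below in Python. The word-level vocabulary, the zero padding, and the fixed bit width are all assumptions made for illustration; the description does not fix the digitization scheme.

```python
def build_vocab(phrases):
    """Assign each distinct word a positive integer id; 0 is reserved for padding."""
    vocab = {}
    for phrase in phrases:
        for word in phrase.lower().split():
            vocab.setdefault(word, len(vocab) + 1)
    return vocab


def encode(phrase, vocab, length):
    """Turn a phrase into a fixed-length sequence of word ids."""
    ids = [vocab[word] for word in phrase.lower().split()]
    if len(ids) > length:
        raise ValueError("phrase is longer than the sequence length")
    return ids + [0] * (length - len(ids))


def to_bits(ids, width):
    """Render each id as a bit string of the given width (a 'bit sequence')."""
    return [format(i, f"0{width}b") for i in ids]


PHRASES = ["print the total", "assign b to a"]
VOCAB = build_vocab(PHRASES)
SEQUENCES = [encode(p, VOCAB, length=4) for p in PHRASES]
# SEQUENCES == [[1, 2, 3, 0], [4, 5, 6, 7]]
```

Because distinct phrases produce distinct id sequences, each phrase can be paired with its program code label without duplication; when the vocabulary outgrows the bit width, the width can be increased, which parallels the growth of the bit sequence size described above.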

[0090] In addition, the learning model generation unit 130 loads the training data preprocessed by the preprocessing unit 120 from the training database 200, learns from it, and generates a learning model for evaluating the program code corresponding to the user's voice received from the user device 400. The learning model generating unit 130 may perform machine learning using at least one of CNN and RNN, and the training data is prepared in accordance with the machine learning method it uses.

[0091] The learning model generated by the learning model generation unit 130 is used to evaluate the program code based on the user's voice recognized by the user device 400 or by the speech recognition unit 110. The input is the data obtained by preprocessing the natural language of the recognized user's voice, and the output is the program code for that natural language.

[0092] In addition, since the learning model generation unit 130 knows the program code for a specific natural language, the backpropagation method is applied in the learning process to adjust the weights of the machine learning network (i.e., RNN or CNN), thereby improving the accuracy of the network used to generate the learning model.
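
The weight adjustment by backpropagation can be illustrated on the smallest possible network: one tanh hidden unit followed by one linear output, trained on a squared error. This Python sketch is a toy under stated assumptions (scalar weights `w1` and `w2`, a single training example, a fixed learning rate) and is not the network of the apparatus.

```python
import math


def forward(w1, w2, x):
    """Forward pass: hidden activation h and output y."""
    h = math.tanh(w1 * x)
    return h, w2 * h


def loss(w1, w2, x, target):
    _, y = forward(w1, w2, x)
    return 0.5 * (y - target) ** 2


def backward(w1, w2, x, target):
    """Backward pass: the chain rule applied layer by layer, output to input."""
    h, y = forward(w1, w2, x)
    dy = y - target                 # dL/dy
    dw2 = dy * h                    # dL/dw2
    dh = dy * w2                    # dL/dh
    dw1 = dh * (1.0 - h * h) * x    # dL/dw1, using tanh'(a) = 1 - tanh(a)^2
    return dw1, dw2


def train_step(w1, w2, x, target, lr=0.1):
    """One gradient-descent update of both weights."""
    dw1, dw2 = backward(w1, w2, x, target)
    return w1 - lr * dw1, w2 - lr * dw2
```

Because the correct output is known for every training phrase, the error at the output can be propagated back to each weight in this way; repeating `train_step` over the training data is what gradually improves accuracy.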

[0093] When natural language from the user's voice is input after the learning model has been generated by the learning model generating unit 130, the program code evaluator 140 loads the learning model from the learning model database 300 and applies the natural language to the loaded learning model, thus evaluating the program code for the natural language and providing it to the user device 400.

[0094] The natural language of the user's voice is preprocessed through the same process as that used by the preprocessing unit 120 to preprocess the training data.

[0095] That is, when the learning model generation unit 130 learns the training data via the CNN to generate the learning model, the natural language of the user's voice is converted into an image, and when the learning model generation unit 130 learns the training data via the RNN, the natural language of the user's voice is preprocessed into a series of sequence fragments.

[0096] In addition, the checking unit 150 may provide the program code evaluated by the program code evaluator 140, together with the natural language of the user's voice on which the evaluation was based, to a user or administrator having certain privileges. For example, the artificial intelligence program coding apparatus 100 can provide this information in real time or periodically to a specialist (e.g., a language expert who has specific authority) so that the generated program code can be verified.

[0097] The user or administrator having certain authority checks the natural language and the program code obtained from the checking unit 150, determines whether the program code generated for the natural language is the optimal program code, and can update the training database accordingly.

[0098] In other words, a user or manager with certain authority can create and add anew, or change, the mapping relationship between the natural language and the program code label stored in the training database 200 according to the check result. For example, if the program code for the natural language obtained through the checking unit 150 is not optimal, the user or administrator may update the program code label for the natural language and the program code for the natural language. The checking unit 150 may provide the received update data to the training data update unit 160 to update the training database 200.

[0099] When update data corresponding to the verification result is received from the checking unit 150, the training data update unit 160 may update the training database 200 by reflecting the received update data in it. The update is performed in real time or periodically in accordance with predetermined rules of the checking unit 150 (i.e., the checking process is performed in real time or periodically).

[00100] When the training database 200 is updated, the learning model generation unit 130 regenerates the learning model, which reflects the updated training data and therefore has higher accuracy and reliability than the existing one, and stores it in the learning model database 300. When this process is repeated, the learning model generated by the learning model generation unit 130 improves progressively, so that optimal program code can be generated from the user's voice.
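
The verify, update, and regenerate cycle can be sketched in Python as follows. Everything here is a stand-in: the training database is a plain dictionary, the "learning model" is just a copy of it, and the function names are hypothetical.

```python
def regenerate_model(mapping):
    """Stand-in for unit 130: the 'model' is a frozen copy of the mapping table."""
    return dict(mapping)


def verify_and_update(mapping, phrase, predicted, expert_code):
    """Stand-in for units 150 and 160.

    If the expert supplies code that differs from the prediction, the mapping
    is updated and True is returned to signal that the model should be rebuilt.
    """
    if expert_code is None or expert_code == predicted:
        return False
    mapping[phrase] = expert_code
    return True


training_db = {"print total": "print total"}        # deliberately suboptimal entry
model = regenerate_model(training_db)

predicted = model["print total"]
if verify_and_update(training_db, "print total", predicted, "print(total)"):
    model = regenerate_model(training_db)           # reflects the corrected data
```

Whether the rebuild runs immediately, as here, or on a schedule corresponds to the real-time or periodic update mentioned above.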

[00101] Fig. 3 illustrates a diagram showing a preprocessing process for input data that is input to a CNN when machine learning is performed via the CNN in a speech recognition-based artificial intelligence program encoder according to an embodiment of the present invention.

[00102] As shown in FIG. 3, when machine learning is performed via the CNN in the speech recognition-based artificial intelligence program coding apparatus 100 in accordance with an embodiment of the present invention, the input data preprocessing process first loads the training data from the training database 200.

[00103] In the training data, each natural language is mapped uniquely to the program code label corresponding to the program code for that natural language.

[00104] The speech recognition artificial intelligence-based program coding apparatus 100 digitizes each natural language according to a predetermined process, assigns it to one image slice, and labels each slice.

[00105] The program code label is configured in a one-hot (direct) encoding format, with each program code label being indexed to specific program code, where program code means one complete line of a program. That is, a program code can represent a complete function call such as sum() or printf(), as well as a single statement (a = b). Accordingly, the user can input lines sequentially by voice, thereby allowing multiple program codes to be connected to create a complete program.
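
The label scheme of paragraph [00105] can be sketched as follows. This is a minimal illustration only: it assumes that the "direct coding scheme" is one-hot encoding, and the three-entry label vocabulary `CODE_LABELS` and the helper names are hypothetical, not taken from the application.

```python
# Hypothetical label vocabulary: each index stands for one complete line of program code.
CODE_LABELS = ["a = b", "sum()", "printf()"]


def one_hot(index, size):
    """Return a one-hot vector of length `size` with a 1 at position `index`."""
    if not 0 <= index < size:
        raise ValueError("label index out of range")
    return [1 if i == index else 0 for i in range(size)]


def decode(vector):
    """Map a one-hot vector back to the program code line it indexes."""
    return CODE_LABELS[vector.index(1)]


def assemble(vectors):
    """Connect the code lines for a sequence of labels into one program text."""
    return "\n".join(decode(v) for v in vectors)
```

For instance, `assemble([one_hot(0, 3), one_hot(1, 3)])` joins the lines `a = b` and `sum()` in the order in which the labels were produced.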

[00106] Then, the speech recognition AI-based program coding apparatus 100 generates image data by rendering the plurality of labeled image slices into an image having a specific resolution (e.g., 32 × 32).

[00107] Meanwhile, the resolution is configured to be expanded continuously in accordance with the total volume of the training data to be learned. Thus, the speech recognition AI-based program coding apparatus 100 can adaptively learn the training data by expanding the resolution.
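
A minimal sketch of the image preprocessing described in paragraphs [00104] to [00107] is given below. The application does not specify the "predetermined process" for digitization, so UTF-8 byte values are used here purely as an assumption, and `digitize` and `render_image` are hypothetical names. As a simplification, the resolution grows when a single phrase does not fit, rather than with the total volume of training data.

```python
def digitize(phrase):
    """Turn a phrase into integers in 0..255 (UTF-8 bytes; an assumed digitization)."""
    return list(phrase.encode("utf-8"))


def render_image(phrase, resolution=32):
    """Lay the digitized phrase row by row into a square grid.

    The grid starts at resolution x resolution and doubles in side length
    until all values fit; unused cells are padded with zeros.
    """
    values = digitize(phrase)
    while resolution * resolution < len(values):
        resolution *= 2
    padded = values + [0] * (resolution * resolution - len(values))
    return [padded[r * resolution:(r + 1) * resolution] for r in range(resolution)]
```

A short phrase yields a 32 × 32 grid, while a phrase longer than 1024 bytes yields a 64 × 64 grid.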

[00108] Then, the speech recognition artificial intelligence-based program coding apparatus 100 generates a training model by inputting the image data into the CNN, thereby learning the mapping relationship between the natural language and the program code label and outputting the program code for a specific natural language.

[00109] As described above, a preprocessing process is performed to apply training data to the CNN when the AI speech recognition program encoder 100 performs training via the CNN.

[00110] Figure 4 is a diagram illustrating a preprocessing process for input data to be input to an RNN when machine learning is performed via the RNN in a speech recognition-based artificial intelligence encoder in accordance with an embodiment of the present invention.

[00111] As shown in FIG. 4, in the case where machine learning is performed via the RNN in the speech recognition artificial intelligence-based program coding apparatus 100 according to an embodiment of the present invention, the preprocessing of the data to be input to the RNN begins with the apparatus 100 loading, from the training database 200, the training data in which natural language is mapped to a program code label.

[00112] Then, the speech recognition AI-based program coding apparatus 100 digitizes each natural-language word through a predetermined process, generates the words as one sequence, and assigns the sequence for the natural language to one sequence chunk. The sequence chunk includes a combination of at least one bit string, and the bit string size can be expanded continuously according to the RNN size. That is, the speech recognition-based artificial intelligence coding apparatus 100 allows the bit string size to be expanded so that the training data can be learned adaptively.

[00113] Labeling is then performed on each of the sequence chunks, assigning a program code label to each chunk, to form the final source sequence. The program code label is configured in a one-hot (direct) encoding format and serves as the target sequence for the program code of the natural language.
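
The sequence preprocessing of paragraphs [00112] and [00113] might look like the sketch below. The word-to-integer vocabulary, the initial 4-bit width, and the names `build_vocab`, `to_bit_strings`, and `make_example` are all assumptions made for illustration; the application states only that words are digitized into bit strings whose size can grow.

```python
def build_vocab(phrases):
    """Assign each distinct word an integer id in order of first appearance."""
    vocab = {}
    for phrase in phrases:
        for word in phrase.lower().split():
            vocab.setdefault(word, len(vocab))
    return vocab


def to_bit_strings(phrase, vocab, width=4):
    """Encode each word id as a fixed-width bit string, doubling the width if ids do not fit."""
    ids = [vocab[w] for w in phrase.lower().split()]
    while ids and max(ids) >= 2 ** width:
        width *= 2
    return [format(i, "0{}b".format(width)) for i in ids]


def make_example(phrase, label_index, vocab):
    """Pair a sequence chunk (source) with the index of its program code label (target)."""
    return {"source": to_bit_strings(phrase, vocab), "target": label_index}
```

With a vocabulary built from "assign b to a" and "print a", the phrase "print a" becomes the chunk `["0100", "0011"]`; once a word id reaches 16, the bit strings widen to 8 bits.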

[00114] Then, the speech recognition artificial intelligence-based program coding apparatus 100 inputs the generated source sequence into the RNN, thereby learning the mapping relationship between the natural language and the program code label and outputting the program code for the specific natural language.

[00115] As described above with reference to FIG. 4, the preprocessing process is performed so that the speech recognition AI-based program coding apparatus 100 can apply training data to the RNN while performing training on the training data via the RNN.

[00116] Fig. 5 is a diagram illustrating a CNN-based learning structure and a learning process in accordance with an embodiment of the present invention.

[00117] As shown in FIG. 5, when the learning model generating unit 130 of the speech recognition-based artificial intelligence program coding apparatus 100 learns the training data via the CNN, the CNN comprises an input layer that receives the training data produced by the preprocessing process, a convolution layer that convolves a specific portion of the image with a kernel of a certain size, and a pooling layer that reduces each convolved portion to its maximum or average value.

[00118] The input layer receives the training data mapped through the preprocessing process. The convolution layer moves a kernel having specific weights across the image in accordance with a predetermined stride (the unit by which the kernel moves), and generates and outputs a feature map for each portion of the image.

[00119] The pooling layer then subsamples the feature maps according to the kernel size, using the max pooling or average pooling method, to select at least one image slice.
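
The convolution and pooling steps of paragraphs [00118] and [00119] can be illustrated with the plain-Python sketch below. It assumes a square single-channel image and a square kernel with no padding; the function names are hypothetical, and a real system would normally rely on a numerical library rather than nested loops.

```python
def convolve(image, kernel, stride=1):
    """Slide `kernel` over a square `image` with the given stride and return the feature map."""
    k = len(kernel)
    size = (len(image) - k) // stride + 1
    feature_map = []
    for r in range(size):
        row = []
        for c in range(size):
            total = 0
            for i in range(k):
                for j in range(k):
                    total += image[r * stride + i][c * stride + j] * kernel[i][j]
            row.append(total)
        feature_map.append(row)
    return feature_map


def max_pool(feature_map, window=2):
    """Keep the maximum of each non-overlapping window x window block."""
    size = len(feature_map) // window
    return [
        [
            max(
                feature_map[r * window + i][c * window + j]
                for i in range(window)
                for j in range(window)
            )
            for c in range(size)
        ]
        for r in range(size)
    ]
```

Replacing `max` with an average over the same block gives the average pooling variant mentioned in the text.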

[00120] In addition, the CNN may repeatedly perform convolution and subsampling to generate a feature map for a program code label for a particular natural language.

[00121] The fully connected layer then connects the feature maps to a program code label in order to output program code for a specific natural language.

[00122] Through this process, the speech recognition AI-based program coding apparatus 100 can learn, from the training data, the mapping relationship between natural language and program code. In addition, the speech recognition AI-based program coding apparatus 100 can evaluate the optimized program code for the user's voice and provide it to the user by generating an improved training model that reflects updates to the training data.

[00123] In addition, since the speech recognition AI-based program coding apparatus 100 knows a specific natural language and the corresponding program code in advance, the CNN weights can be adjusted using a backpropagation technique in the learning process, so that the accuracy of the CNN can be improved.
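
As a rough illustration of the weight adjustment in paragraph [00123], the sketch below performs one gradient-descent step on a single linear layer with a softmax cross-entropy loss, which is the form the update takes at a final fully connected layer. It is not a full CNN backpropagation; the learning rate, the layer shape, and the function names are assumptions.

```python
import math


def softmax(scores):
    """Convert raw scores into probabilities (shifted by the maximum for stability)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def loss(weights, features, target):
    """Cross-entropy of the known target label under the current weights."""
    scores = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return -math.log(softmax(scores)[target])


def train_step(weights, features, target, lr=0.1):
    """Return weights after one update: w[k][j] -= lr * (p[k] - y[k]) * features[j]."""
    scores = [sum(w * f for w, f in zip(row, features)) for row in weights]
    probs = softmax(scores)
    return [
        [
            w - lr * (probs[k] - (1.0 if k == target else 0.0)) * f
            for w, f in zip(row, features)
        ]
        for k, row in enumerate(weights)
    ]
```

Because the correct label is known in advance, each step moves the weights so that the loss for that label decreases.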

[00124] Fig. 6 is a diagram for explaining a machine learning structure and a learning process via an RNN in accordance with an embodiment of the present invention.

[00125] As shown in FIG. 6, when the training data is learned via the RNN in the learning model generating unit 130 of the speech recognition-based artificial intelligence program coding apparatus 100 according to an embodiment of the present invention, the RNN includes an encoder and a decoder.

[00126] The encoder sequentially receives the numerical values of the source sequence generated by the preprocessing process, accumulates the values as the state values of its internal nodes according to the weights of the internal nodes, and delivers the final state of each internal node to the decoder.

[00127] In addition, the decoder receives the final state of the source sequence transmitted from the encoder and outputs, one element at a time, the target sequence (i.e., the program code label) for the source sequence according to the weight of each internal node constituting the decoder.

[00128] That is, each of the nodes constituting the encoder and the decoder has different weights, and the encoder sequentially takes the source sequence as input, performs forward propagation step by step in time, and transmits the information of the last encoder node to the decoder. The information passed to the decoder can be viewed as a representation of the sentence that contains all the information of the input source sequence. The decoder, having received the source sequence information, outputs the numerical values of the target sequence one after another in accordance with the weights of its nodes, so that the program code for a specific natural language can be estimated.
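
The encoder and decoder forward pass of paragraphs [00126] to [00128] can be sketched as below. This is a minimal, untrained illustration that assumes a plain tanh recurrent cell, shared encoder and decoder weights, and greedy selection of the highest-scoring label at each step; none of these choices is specified in the application, and all names are hypothetical.

```python
import math


def rnn_step(state, x, w_state, w_input):
    """One recurrent step: new_state[i] = tanh(sum_j w_state[i][j] * state[j] + w_input[i] * x)."""
    return [
        math.tanh(
            sum(w_state[i][j] * state[j] for j in range(len(state))) + w_input[i] * x
        )
        for i in range(len(state))
    ]


def encode(sequence, w_state, w_input):
    """Run the encoder over the source sequence and return its final state."""
    state = [0.0] * len(w_input)
    for x in sequence:
        state = rnn_step(state, x, w_state, w_input)
    return state


def decode_sequence(state, w_state, w_input, w_out, steps):
    """Emit `steps` label indices, feeding each emitted index back as the next input."""
    outputs, prev = [], 0.0
    for _ in range(steps):
        state = rnn_step(state, prev, w_state, w_input)
        scores = [sum(w * s for w, s in zip(row, state)) for row in w_out]
        prev = float(scores.index(max(scores)))
        outputs.append(int(prev))
    return outputs
```

The final encoder state plays the role of the sentence representation handed to the decoder; with trained weights, the emitted indices would correspond to program code labels.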

[00129] FIG. 7 is a flowchart illustrating a process of a method for encoding an artificial intelligence-based program based on speech recognition in accordance with an embodiment of the present invention.

[00130] As shown in FIG. 7, the speech recognition-based artificial intelligence program coding method according to an embodiment of the present invention begins with the speech recognition-based artificial intelligence program coding apparatus 100 preprocessing the training data and storing it in the training database 200 so that the training data can be applied to the CNN or RNN (step S110).

[00131] As described with reference to FIG. 3 and FIG. 4, the speech recognition-based artificial intelligence program coding apparatus 100 performs this preprocessing to generate the machine learning input data used to generate a learning model. When the training data is to be learned via the CNN, the preprocessing converts the training data into image data, and when the training data is to be learned via the RNN, the preprocessing converts the training data into a source sequence.

[00132] Then, the speech recognition AI-based program coding apparatus 100 loads the preprocessed training data from the training database 200 (step S120), learns the loaded training data (step S130) to generate a training model that outputs specific program codes, and stores the training model in the training model database 300.

[00133] The training can be performed via the RNN or the CNN, as described above.

[00134] When the speech recognition AI-based program coding apparatus 100 receives, from the user device 400, user speech for coding a program (step S210), the apparatus 100 performs natural language processing to recognize the received user speech, and the natural language generated by the natural language processing is preprocessed (step S220).

[00135] It is preferable that the natural language for the user's voice be generated and processed by the speech recognition AI-based program coding apparatus 100, but it can also be generated and processed in the user device 400 when that device is equipped with a speech recognition engine. In that case, the user device 400 transmits the generated natural language to the speech recognition-based artificial intelligence program coding apparatus 100.

[00136] The preprocessing process for the user's speech is performed in the same manner as the preprocessing process performed in step S110, according to the machine learning method (i.e., RNN or CNN) of the speech recognition-based artificial intelligence program coding apparatus 100.

[00137] Next, the speech recognition-based artificial intelligence program coding apparatus 100 loads a learning model from the learning model database 300 and inputs the preprocessed natural language for the user's speech into the loaded learning model. In step S230, the program code corresponding to the user's voice is evaluated by generating the program code for the natural language.

[00138] Thus, the input of the learning model is natural language and the output of the learning model is the program code label for that natural language; the program code is evaluated by generating the program code that corresponds to the program code label.

[00139] Then, the speech recognition AI-based program coding apparatus 100 transmits the estimated program code to the user device 400, thereby providing the evaluated program code to the user.
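
The flow of steps S220 and S230 in paragraphs [00134] to [00139] can be outlined as below. The sketch assumes that speech has already been recognized into text. The keyword rule `toy_model` merely stands in for a trained CNN or RNN, and every name here is hypothetical.

```python
def evaluate_program_code(text, preprocess, model, code_labels):
    """Preprocess recognized text, apply the model, and map the output label to program code."""
    features = preprocess(text)
    label_index = model(features)
    if not 0 <= label_index < len(code_labels):
        raise ValueError("model returned an unknown label")
    return code_labels[label_index]


# Hypothetical stand-ins so that the sketch runs without a trained network.
CODE_LABELS = ["a = b", "print(a)"]


def toy_preprocess(text):
    """Lower-case and split the recognized text into words."""
    return text.lower().split()


def toy_model(words):
    """A keyword rule used in place of the trained learning model."""
    return 1 if "print" in words else 0
```

Calling `evaluate_program_code("Print A", toy_preprocess, toy_model, CODE_LABELS)` follows the same path that recognized speech would take: preprocessing, model application, and lookup of the code for the returned label.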

[00140] Meanwhile, the speech recognition AI based program coding apparatus 100 improves the accuracy and reliability of the estimated program code through reinforcement learning.

[00141] That is, the speech recognition-based artificial intelligence coding apparatus 100 provides the estimated program code to a user or administrator with certain skills, such as a programming language expert, in real time or periodically (S240).

[00142] Then, the speech recognition AI-based program coding apparatus 100 may update the training data according to the result of the checking process (S260).

[00143] That is, a user or administrator with certain authority receives, in real time or periodically, the natural language of the user's voice and the program code estimated from it from the speech recognition-based artificial intelligence program coding apparatus 100. If the program code is not optimal, update data including the optimal program code for the natural language is provided to the apparatus 100.

[00144] At this time, the speech recognition-based artificial intelligence coding apparatus 100 updates the training database 200 by reflecting the provided update data in the training data.

[00145] Thereafter, when the training data is updated, the speech recognition AI-based program coding apparatus 100 performs a reinforcement learning process that reflects the updated training data to generate a new training model. By repeating this process, the speech recognition-based artificial intelligence program coding apparatus 100 can greatly improve the accuracy and reliability of the learning model for coding programs.
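
The verification and update cycle of paragraphs [00141] to [00145] can be sketched as follows. The `TrainingDatabase` class, the `reviewer` callback (standing in for the authorized user or administrator), and the `retrain` callback are hypothetical; the application does not prescribe how the update data is represented.

```python
class TrainingDatabase:
    """Stores a one-to-one mapping from natural language phrases to program code."""

    def __init__(self):
        self.mapping = {}
        self.version = 0

    def update(self, phrase, code):
        """Add or overwrite a mapping; report whether anything actually changed."""
        if self.mapping.get(phrase) != code:
            self.mapping[phrase] = code
            self.version += 1
            return True
        return False


def review(phrase, estimated_code, reviewer):
    """Return update data (phrase, optimal code), or None if the estimate is already optimal."""
    optimal = reviewer(phrase, estimated_code)
    if optimal == estimated_code:
        return None
    return (phrase, optimal)


def feedback_cycle(db, estimates, reviewer, retrain):
    """Apply reviewer corrections, then retrain once if the database changed."""
    changed = False
    for phrase, code in estimates:
        correction = review(phrase, code, reviewer)
        if correction is not None:
            changed = db.update(*correction) or changed
    return retrain(db.mapping) if changed else None
```

Running `feedback_cycle` in real time or on a schedule corresponds to the two update modes described in the text; a new model is produced only when a correction has changed the training data.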

[00146] As described above, in accordance with the present invention, in order to generate program code based on the user's voice, training data in which natural language is mapped to program code is learned via the RNN or CNN to generate a learning model. As a result, program code can be generated automatically simply by transforming the user's voice into natural language and entering it into the learning model.

[00147] In addition, according to the present invention, the training data is updated through the validation of the generated program code, so that reinforcement learning is performed on the training data and the accuracy and reliability of the program code generated from the user's voice are improved.

[00148] Although the present invention has been specifically shown and described with reference to exemplary embodiments thereof, it should be understood that the invention is not limited to the disclosed exemplary embodiments.

[00149] All components of this technical solution can be connected by a data bus. In some examples, the data bus includes one or more data buses. The data bus can be implemented in accordance with the Controller Area Network (CAN) bus protocol defined by International Organization for Standardization (ISO) 11898-1, the Media Oriented Systems Transport (MOST) bus protocol, the CAN flexible data (CAN-FD) bus protocol (ISO 11898-7), and/or the K-line bus protocol (ISO 9141 and ISO 14230-1), and/or the Ethernet™ bus protocol IEEE 802.3 (2002 onwards), etc.

[00150] The elements of the proposed technical solution are in a functional relationship, and their joint use leads to the creation of a new and unique technical solution. Thus, all blocks are functionally linked.

[00151] All blocks used in the system can be implemented using electronic components used to create digital integrated circuits, as is obvious to a person skilled in the art. Without limitation, microcircuits whose logic is determined during manufacture can be used, or programmable logic integrated circuits (FPGAs), whose logic is set through programming. For programming, programmers and debugging environments are used that allow the desired structure of a digital device to be specified as a circuit diagram or as a program in a hardware description language such as Verilog, VHDL, or AHDL. Alternatives to FPGAs include programmable logic controllers (PLCs); gate arrays (BMK), which require a factory production process for programming; and ASICs, specialized custom large-scale integrated circuits (LSI), which are significantly more expensive in small-batch and single-piece production.

[00152] Typically, the FPGA itself consists of the following components:

• configurable logic blocks that implement the required logic function;

• programmable electronic links between configurable logic blocks;

• programmable input / output blocks providing connection of the external output of the microcircuit with the internal logic.

[00153] Blocks can also be implemented using read-only memory devices.

[00154] Thus, all the blocks used can be implemented by standard means based on the classical principles of computer engineering.

[00155] As one skilled in the art will appreciate, aspects of the present technical solution may be embodied in a system, method, or computer program product. Accordingly, various aspects of the present technical solution may be implemented solely as hardware, as software (including application software, and so on), or as an embodiment combining software and hardware aspects, which may generally be referred to as a "block", "system", or "architecture". In addition, aspects of the present technical solution can take the form of a computer program product implemented on one or more computer-readable media having computer-readable program code embodied on them.

[00156] Any combination of one or more computer-readable media can also be used. The computer-readable storage medium can be, without limitation, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, device, or any suitable combination thereof. More specifically, examples (a non-exhaustive list) of a computer-readable storage medium include: an electrical connection using one or more wires, a portable computer diskette, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or Flash memory), a fiber optic connection, compact disk read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any combination of the above. As used herein, a computer-readable storage medium can be any tangible medium that can contain or store a program for use by, or in connection with, a system, apparatus, or device.

[00157] The program code embedded in a computer-readable medium can be transmitted using any medium, including, without limitation, wireless, wired, fiber optic, infrared, and any other suitable network or a combination of the above.

[00158] Computer program code for performing operations for the steps of the present technical solution may be written in any programming language or combination of programming languages, including an object-oriented programming language such as Java, Smalltalk, C++, and so on, and conventional procedural programming languages such as the "C" programming language or similar programming languages. The program code can be executed entirely on the user's computer, partly on the user's computer, as a separate software package, partly on the user's computer and partly on a remote computer, or entirely on a remote computer. In the latter case, the remote computer can be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection can be made to an external computer (for example, via the Internet using Internet service providers).

[00159] Aspects of the present technical solution have been described in detail with reference to block diagrams, schematic diagrams, and/or diagrams of methods, devices (systems), and computer program products in accordance with embodiments of the present technical solution. It should be appreciated that each block of the block diagrams and/or diagrams, as well as combinations of blocks of the block diagrams and/or diagrams, may be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device create means for implementing the functions/actions specified in the block or blocks of the flowchart and/or diagram.

[00160] These computer program instructions may also be stored on a computer-readable medium that can direct a computer, other programmable data processing device, or other devices to function in a particular way, such that the instructions stored on the computer-readable medium produce an article including instructions that implement the functions/actions specified in the block diagram and/or diagram.

Claims

FORMULA

1. A method of using artificial intelligence in software development, performed by at least one computing device and comprising the following steps:

• forming at least one machine learning model by learning the mapping relationship between natural language phrases and program code;

• obtaining at least one piece of program code;

• performing speech recognition of at least one user by extracting phrases in natural language;

• sending the extracted natural language phrases and the at least one previously obtained piece of program code to the generated learning model;

• evaluating the piece of program code by applying the learning model.

2. A method according to claim 1, characterized in that speech recognition of at least one user is performed through a speech recognition engine provided in a cloud server and / or a local server and / or user equipment.

3. The method of claim 1, wherein the step of generating the learning model includes performing machine learning through a convolutional neural network (CNN).

4. A method according to claim 1, characterized in that the CNN consists of a plurality of networks having the same size, or is capable of gradually increasing in size to adaptively correspond to the total capacity to be studied.

5. The method according to claim 3, characterized in that the CNN performs training by receiving an image obtained by digitizing a mapping relationship between a natural language phrase and a program code, in which the natural language phrase is assigned to a label for a specific program code, and each image obtained by digitizing the mapping relationship is set in such a way that the mapping relationship between a specific natural language and program code is unique and not duplicated.

6. The method according to claim 5, characterized in that the image resolution is expanded continuously depending on the size of the network.

7. The method according to claim 1, characterized in that the step of generating a learning model includes performing machine learning through a recurrent neural network (RNN).

8. The method according to claim 7, characterized in that the RNN is composed of a plurality of networks having the same size, or is designed so that its dimensions are gradually increased to adaptively correspond to the total capacity to be studied.

9. The method of claim 7, wherein the RNN performs training by inputting a series of numbers obtained by digitizing the mapping relationship between program codes for a particular natural language phrase.

10. The method according to claim 9, characterized in that each series of numerical values establishes the mapping relationship between a specific natural language and a program code so as not to be duplicated, where the sequence contains at least one sequence section, and the sequence section contains a combination of at least one bit sequence.

11. A system for using artificial intelligence in software development, containing:

• at least one unit for generating a learning model, which learns the mapping relationship between a natural language phrase and program code to generate the learning model; and

• at least one unit for evaluating program code for the recognized speech of the user by extracting a natural language phrase, inputting the natural language phrase into the learning model, and evaluating and outputting the program code.

12. The system according to claim 11, further comprising a speech recognition unit for recognizing the user's speech and extracting a natural language phrase.