WO2022188742A1 - Information processing method and apparatus, medium and electronic device - Google Patents

Information processing method and apparatus, medium and electronic device Download PDF

Info

Publication number
WO2022188742A1
WO2022188742A1 PCT/CN2022/079530 CN2022079530W WO2022188742A1 WO 2022188742 A1 WO2022188742 A1 WO 2022188742A1 CN 2022079530 W CN2022079530 W CN 2022079530W WO 2022188742 A1 WO2022188742 A1 WO 2022188742A1
Authority
WO
WIPO (PCT)
Prior art keywords
sub
information
question
calculations
calculation
Prior art date
Application number
PCT/CN2022/079530
Other languages
French (fr)
Chinese (zh)
Inventor
金霄然
罗小飞
Original Assignee
北京有竹居网络技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京有竹居网络技术有限公司 filed Critical 北京有竹居网络技术有限公司
Publication of WO2022188742A1 publication Critical patent/WO2022188742A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance

Definitions

  • the embodiments of the present disclosure relate to the field of computer applications, and in particular, to an information processing method, apparatus, medium, and electronic device.
  • Embodiments of the present disclosure provide an information processing method, apparatus, medium, and electronic device.
  • an embodiment of the present disclosure provides an information processing method, including:
  • a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved are generated, wherein the plurality of sub-calculations
  • the calculation formula conforms to the operation logic of the comprehensive calculation formula
  • analysis information For each of the sub-expressions, generate analysis information corresponding to the sub-expressions according to the question stem information of the application question to be solved, the comprehensive expression and the sub-expressions, wherein the analysis information includes the sub-expressions The meaning of the result of the calculation and the unit of the result of the sub-calculation;
  • an information processing apparatus including:
  • the acquisition module is used to acquire the question stem information of the application question to be solved
  • the first generation module is configured to generate a comprehensive formula for completely solving the to-be-solved applied problem and a comprehensive formula for reflecting the to-be-solved applied problem according to the question stem information of the to-be-solved applied problem obtained by the acquisition module A plurality of sub-calculations of the solution process, wherein, the plurality of sub-calculations conform to the operation logic of the comprehensive formula;
  • the second generating module is configured to, for each of the sub-expressions generated by the first generating module, obtain the problem stem information of the applied question to be solved obtained by the obtaining module, the data generated by the first generating module.
  • the integrated calculation formula and the sub-calculation generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation;
  • a third generation module configured to generate explanations corresponding to the multiple sub-expressions according to the multiple sub-expressions generated by the first generation module and the analysis information corresponding to each of the sub-expressions generated by the second generation module information.
  • an embodiment of the present disclosure provides a non-volatile computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the first aspect of the present disclosure.
  • an embodiment of the present disclosure provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device to implement the first aspect of the present disclosure Provided the steps of the method.
  • the present disclosure provides a computer program, comprising: instructions that, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
  • the present disclosure provides a computer program product comprising instructions that, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
  • FIG. 1 is a flowchart of an information processing method according to an exemplary embodiment of the present disclosure.
  • Fig. 2 is a schematic structural diagram of a binary tree generated according to a target expression and identification information according to an exemplary embodiment of the present disclosure.
  • FIG. 3 is a flowchart of an information processing method according to another exemplary embodiment of the present disclosure.
  • FIG. 4 is a block diagram of an information processing apparatus according to an exemplary embodiment of the present disclosure.
  • FIG. 5 is a block diagram of an electronic device according to an exemplary embodiment of the present disclosure.
  • the term “including” and variations thereof are open-ended inclusions, ie, "including but not limited to”.
  • the term “based on” is “based at least in part on.”
  • the term “one embodiment” means “at least one embodiment”; the term “another embodiment” means “at least one additional embodiment”; the term “some embodiments” means “at least some embodiments”. Relevant definitions of other terms will be given in the description below.
  • embodiments of the present disclosure provide an information processing method, apparatus, medium, and electronic device.
  • FIG. 1 is a flowchart of an information processing method according to an exemplary embodiment of the present disclosure. As shown in FIG. 1 , the method includes S101 to S104.
  • the application questions describe relevant facts in language or words, reflect a certain mathematical relationship (eg, quantitative relationship, positional relationship, etc.), and solve an unknown number of questions, wherein the application questions to be solved can be in the form of
  • the application questions to be solved can be in the form of
  • the above-mentioned information processing method can be applied to electronic equipment, for example, a terminal equipment or a server communicatively connected to the terminal equipment.
  • the terminal device may be, for example, a smart phone, a tablet computer, a personal computer, or the like.
  • the user can input the question stem information of the application question to be solved to the terminal device in the form of image, text, audio, etc., and then the terminal device receives the question stem information and automatically solves it according to the question stem information, or the terminal device will
  • the question stem information is sent to the server, so that the server can automatically solve the question according to the question stem information, and feedback the solution result to the user through the terminal device.
  • the analysis information includes the meaning of the result of the sub-expression and the unit of the result of the sub-expression.
  • the explanation information may include at least one of explanation text, explanation audio, and explanation video.
  • explanation information corresponding to the multiple sub-calculations can be generated through a rule template according to the multiple sub-calculations and the analysis information corresponding to each sub-calculation.
  • the stem information of the application question to be solved is as follows:
  • a garment factory makes an average of 1,850 sets of clothes every day in the first 12 days of June, and an average of 2,100 sets of clothes every day in the next 18 days. How many sets of clothes are made every day this month?
  • the above-mentioned explanation information can be an explanation text.
  • the explanation text corresponding to the four sub-calculations generated by the rule template is as follows:
  • the algebraic expression in the comprehensive calculation formula is composed of each element in the target expression.
  • the solution model may be, for example, a sequence-to-sequence model, in particular, the solution model may include a first encoding network and a first decoding network.
  • the first encoding network is used for encoding the question stem information of the application question to be solved
  • the first decoding network is used for generating the target expression and representing the target expression according to the encoded question stem information output by the first encoding network Whether each operator in the formula is used as the identification information of the unique operator in a sub-calculation.
  • the first encoding network may be a Recurrent Neural Network (RNN), Convolutional Neural Networks (CNN), a pre-trained language model RoBERTa (a Robustly Optimized BERT Pretraining Approach, a more robust The best optimized BERT pre-training method), etc., among which, the full English name of BERT is Bidirectional Encoder Representation from Transformers, that is, a bidirectional language representation model.
  • the decoding network can be, for example, an RNN, a CNN, a multi-layer Transformer, or the like.
  • the above-mentioned target expression may be a prefix expression, an infix expression, a postfix expression, or the like.
  • the first decoding network For example, for the application question to be solved for calculating the daily output of a garment factory in the above example, the first decoding network generates the target expression according to the encoded question stem information as "/+ ⁇ 1850 12 ⁇ 2100 18+12 18( Prefix expression)", “1850 ⁇ 12+2100 ⁇ 18/12+18(infix expression)”, “1850 12 ⁇ 2100 18 ⁇ 12 18+/(postfix expression)”.
  • the above-mentioned target expression can be a prefix expression, which is more in line with human thinking habits, and does not need to use parentheses to indicate calculation priorities, making the solution model easier to train.
  • the above-mentioned solving model can be obtained by training in the following manner: obtaining first reference information of the first reference application question, wherein the first reference information includes the first reference question stem information, and the information used to completely solve the above-mentioned first reference application question.
  • the first reference comprehensive calculation formula and a plurality of first reference sub-calculations used to embody the solution process of the first reference applied problem can be the applied problem in the question bank; then, according to the first reference comprehensive formula, obtain The reference expression and the reference identification information used to characterize whether each operator in the reference expression is used as a unique operator in a sub-expression; finally, by using the first reference question stem information as the input of the solution model, the reference expression The formula and reference identification information are used as the target output of the solution model to train the model, and the solution model is obtained.
  • the target expression can be directly converted into an algebraic expression according to the form of the target expression (the form can be one of a prefix expression, an infix expression, and a postfix expression); then, the converted result is calculated.
  • the result of the algebraic formula; finally, the algebraic formula obtained after conversion, the equal sign, and the result of the algebraic formula obtained after conversion can be connected in turn to obtain a comprehensive formula.
  • the target expression obtained by the above step (2) is an infix expression
  • the infix expression is 1850 ⁇ 12+2100 ⁇ 18/12+18
  • a binary tree can be generated first according to the target expression and identification information; then, a plurality of sub-expressions can be generated according to the binary tree.
  • the nodes of the above binary tree are composed of numbers and operators in the target expression.
  • the attribute information of the number node in the binary tree includes an attribute value, and the attribute value is the number itself;
  • the attribute information of the operator node in the binary tree includes a flag used to characterize whether the operator node is the only operator in a sub-calculation operator (that is, used to characterize whether the operator node is split) attribute value.
  • the left child of the operator node can be
  • the attribute value of the operator node, the operator represented by the operator node, and the attribute value of the right child node of the operator node constitute an element of the algebraic expression in the sub-expression.
  • the attribute value of the operator node is the attribute value of its left child node.
  • the attribute value of the operator node is the left parenthesis, the attribute value of its left child node, the operator represented by the operator node, and the attribute of its right child node.
  • the above identifier is 0 or 1, wherein 0 is used to indicate that the operator node is not divided, and 1 is used to indicate that the operator node is divided.
  • the generated binary tree is shown in FIG. 2 .
  • the binary tree includes five operator nodes and six numeric nodes.
  • the identifier corresponding to the first operator node “ ⁇ " on the left side of the third layer of the binary tree shown in Figure 2 is 1, and its attribute value is 22200 (ie 1850 ⁇ 12), then the operator node “ ⁇ ” is divided, at this time, the attribute value 1850 of the left child node “1850” of the operator node "x", the operator “x” represented by the operator node “x”, the right side of the operator node “x”
  • the attribute value 12 of the child node "12” constitutes an element of an algebraic expression in a sub-expression, that is, 1850, ⁇ , and 12 constitute an element of an algebraic expression in a sub-expression.
  • the identifier corresponding to the second operator node " ⁇ " on the left side of the third layer of the binary tree shown in Figure 2 is 1, and its attribute value is 37800 (ie 2100 ⁇ 18), then the operator node " ⁇ " is divided , at this time, the attribute value 2100 of the left child node "2100" of the operator node " ⁇ ", the operator “ ⁇ ” represented by the operator node “ ⁇ ”, and the right child node of the operator node “ ⁇ ”
  • the attribute value 18 of "18” constitutes an element of an algebraic expression in another sub-expression, that is, 2100, ⁇ , and 18 constitute an element of an algebraic expression in another sub-expression.
  • the identifier corresponding to the second operator node "+" on the left side of the second layer of the binary tree shown in Figure 2 is 1, and its attribute value is 30 (ie 12+18), then the operator node "+” is divided , at this time, the attribute value 12 of the left child node “12" of the operator node “+”, the operator "+” represented by the operator node “+”, and the right child node of the operator node "+”
  • the attribute value 18 of "18” constitutes the elements of the algebraic part of a sub-expression, that is, 12, +, 18 constitute the elements of the algebraic expression in a sub-expression.
  • the identifier corresponding to the operator node "/" (ie the root node) of the first layer of the binary tree shown in Figure 2 is 0, and its attribute value is 2000 (ie (22200+37800)/30), then the operator node "/" is not divided.
  • the attribute value (22200+37800) of the left child node "+” of the operator node "/” the operator "/” represented by the operator node "/”
  • the attribute value 30 of the right child node "+” of the operator node “/” constitutes the elements of the algebraic part of another sub-expression, namely (22200+37800), /, 30 constitute the elements of the algebraic expression in the other sub-expression.
  • multiple sub-calculations can be generated in the following ways:
  • a binary tree is generated according to the encoded question stem information as shown in FIG. 2 .
  • the binary tree includes five operator nodes and six numeric nodes.
  • the identifier corresponding to the operator node "/" (ie the root node) of the first layer of the binary tree shown in Figure 2 is 0, and its attribute value is 2000 (ie (22200+37800)/30), then the operation The operator node "/" is not divided.
  • the question stem information, the comprehensive formula and the sub-calculation can be sequentially spliced and input into the multi-task sequence-to-sequence model to obtain the analytical information corresponding to the sub-calculation.
  • the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation can be obtained through a model, thereby reducing the complexity of the auxiliary learning system for applied questions, and the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation are related to each other. , which is conducive to mutual learning, thereby improving the accuracy of the analytical information generated by the multi-task sequence-to-sequence model.
  • the multi-task sequence-to-sequence model may include a second encoding network, a second decoding network, and a first classifier.
  • the second encoding network is used to encode the splicing information to obtain the coded information
  • the splicing information is the information obtained by splicing the question stem information, the comprehensive formula and the sub-calculation of the application question to be solved in turn
  • the second decoding network which is used to generate the meaning of the result of the sub-calculation according to the coding information
  • the first classifier is used to generate the unit of the result of the sub-calculation according to the coding information.
  • the second encoding network may be RNN, CNN, a pre-trained language model RoBERTa, etc.
  • the second decoding network may be, for example, RNN, CNN, a multi-layer Transformer, and the like.
  • the above multi-task sequence-to-sequence model can be obtained by training in the following manner: first, acquiring second reference information of the second reference application question, wherein the second reference information includes the stem information of the second reference question, which is used to completely solve the above-mentioned first question.
  • the reference question stem information, the second reference comprehensive formula, and the second reference sub-formula spliced in sequence are used as the input of the multi-task sequence-to-sequence model, and the annotation parsing information corresponding to the second reference sub-formula is used as the multi-task.
  • the model is trained in the way of the target output of the sequence-to-sequence model, and a multi-task sequence-to-sequence model is obtained.
  • the analysis information of the sub-calculation generated in the above S103 also includes the knowledge points involved in the sub-calculation.
  • the analysis information may include the meaning of the result of the sub-expression, the unit of the result of the sub-expression, and the knowledge points involved in the sub-expression.
  • the above-mentioned explanation information can be an explanation text.
  • the explanation text corresponding to the four sub-calculations generated by the rule template is as follows:
  • the above S103 can generate the corresponding sub-calculation in the following manner according to the question stem information, the comprehensive calculation formula and the sub-calculation.
  • Parsing information The question stem information, the comprehensive formula and the sub-calculation are sequentially spliced and input into the multi-task sequence-to-sequence model to obtain the analytical information corresponding to the sub-calculation.
  • three kinds of information including the meaning of the result of the sub-calculation, the unit of the result of the sub-calculation, and the knowledge points involved in the sub-calculation, can be obtained through one model, thereby reducing the complexity of the auxiliary learning system for application questions.
  • the meaning of the result, the unit of the result of the sub-calculation, and the knowledge points involved in the sub-calculation are related to each other, which is conducive to mutual learning, thereby improving the accuracy of the analytical information generated by the multi-task sequence-to-sequence model.
  • the multi-task sequence-to-sequence model includes, in addition to the second encoding network, the second decoding network, and the first classifier, a second classifier, wherein the second classifier is used for encoding information according to the above-mentioned encoding information. , and generate the knowledge points involved in the sub-calculation.
  • the annotation parsing information corresponding to each second reference sub-formula includes the meaning of the result of each second reference sub-formula, the meaning of each second reference sub-formula In addition to the unit of the result, it also includes the knowledge points involved in each second reference sub-calculation.
  • the above explanation information can also include the stem information of the applied problems to be solved, and the algebraic expressions in the comprehensive formulas. The meaning of each number that appears.
  • the above-mentioned method further includes the following step S105:
  • the above S104 can generate explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations, the analysis information corresponding to each sub-calculation, and the meaning of each number.
  • the above-mentioned explanation information can be an explanation text, according to the above-mentioned four sub-calculations, the corresponding analysis information of each sub-calculation shown in Table 2, and the meaning of each number shown in Table 3, this generated by the rule template.
  • the explanation texts corresponding to the four sub-calculations are as follows:
  • a sequence-to-sequence generation model can be used to generate a coding vector corresponding to each digit according to the replaced question stem information, and for each digit, according to the coding vector corresponding to the digit, the meaning of the digit is generated.
  • the generation model may include an encoding network and a decoding network, wherein the encoding network is used to generate an encoding vector corresponding to each digit according to the replaced stem information, and the decoding network is used for each digit, according to the digit The corresponding encoded vector, generating the meaning of the number.
  • the above-mentioned generative model may be, for example, a BERT model.
  • the above-mentioned generative model can be obtained by training in the following manner: first, obtain third reference information of the third reference application question, wherein the third reference information includes the stem information of the third reference question and the information used to completely solve the above-mentioned third reference application
  • the third reference comprehensive formula of the question, the third reference applied problem can be the applied problem in the question bank; then, in the third reference question stem information, each reference number that appears in the algebraic expression in the third reference comprehensive formula is replaced by Preset identification, and mark the meaning of each reference number; finally, for each reference number, by using the third reference stem information obtained after replacement as the input of the generation model, the meaning of the marked reference number is used as the generation model
  • the model is trained in the way of the target output of the model, and the generative model is obtained.
  • the above-mentioned third reference application questions may be the same as or different from the above first reference application questions, that is, the training samples for solving the model may be the same as or different from the training samples for generating the model.
  • the application questions may be the same as or different from the second reference application questions above, that is, the training samples of the generated model may be the same as or different from the training samples of the above-mentioned multi-task sequence-to-sequence model, which are not specifically limited in this document.
  • FIG. 4 is a block diagram of an information processing apparatus according to another exemplary embodiment of the present disclosure.
  • the device 400 includes: an acquisition module 401 for acquiring the question stem information of the application questions to be solved; a first generation module 402 for obtaining the application questions to be solved according to the acquisition module 401 the question stem information, and generate a comprehensive formula for completely solving the applied problem to be solved and multiple sub-calculations for reflecting the solution process of the applied problem to be solved, wherein, the multiple sub-calculations meet the requirements of the comprehensive formula Operation logic;
  • the second generation module 403 is configured to, for each of the sub-calculations generated by the first generation module 402, obtain the question stem information of the application question to be solved obtained by the obtaining module 401, the The comprehensive calculation formula and the sub-calculation generated by the first generation module 402 generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the result of the sub-calculation.
  • the third generation module 404 is configured to generate the multiple sub-expressions according to the plurality of sub-expressions generated by the first generation module 402 and the analysis information corresponding to each of the sub-expressions generated by the second generation module 403 The explanation information corresponding to the sub-calculation.
  • the first generation module 402 includes: a first input sub-module for inputting the question stem information of the application question to be solved into the first coding network of the solving model to obtain the encoded question stem information,
  • the solution model includes the first encoding network and the first decoding network
  • the second input sub-module is used to input the encoded question stem information into the first decoding network to obtain the target expression and Identification information for characterizing whether each operator in the target expression is used as a unique operator in a sub-calculation
  • the first generation sub-module is used to generate the comprehensive calculation formula according to the target expression, and according to the The target expression and the identification information are used to generate the plurality of sub-expressions, wherein the algebraic expression in the comprehensive expression is composed of elements in the target expression.
  • the target expression is a prefix expression.
  • the analysis information further includes knowledge points involved in the sub-calculation.
  • the second generation module 403 is used to input the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation into the multi-task sequence-to-sequence model in turn to obtain the result.
  • the analytical information corresponding to the prescriptive expression.
  • the multi-task sequence-to-sequence model includes a second encoding network, a second decoding network, a first classifier, and a second classifier; wherein the second encoding network is used to encode the splicing information , obtain coding information, and the splicing information is the information obtained by splicing the question stem information of the applied question to be solved, the comprehensive formula and the sub-calculation in sequence; the second decoding network is used for according to the coding information, to generate the meaning of the result of the sub-calculation; the first classifier is used to generate the unit of the result of the sub-calculation according to the coding information; the second classifier is used to generate the unit of the result of the sub-calculation according to the Encoding information to generate knowledge points involved in the sub-calculation.
  • the second encoding network is used to encode the splicing information , obtain coding information
  • the splicing information is the information obtained by splicing the question stem information of the applied question to be solved, the comprehensive
  • the apparatus 400 further includes: a fourth generation module, configured to generate, according to the question stem information of the applied question to be solved, the question stem information of the applied question to be solved and in the comprehensive formula.
  • a fourth generation module configured to generate, according to the question stem information of the applied question to be solved, the question stem information of the applied question to be solved and in the comprehensive formula.
  • the meaning of each number appearing in the algebraic formula the third generation module 404 is configured to generate the plurality of sub-calculations according to the plurality of sub-calculations, the analytical information corresponding to each of the sub-calculations, and the meaning of each of the numbers
  • the explanation information corresponding to the formula is configured to generate, according to the question stem information of the applied question to be solved, the question stem information of the applied question to be solved and in the comprehensive formula.
  • the meaning of each number appearing in the algebraic formula the third generation module 404 is configured to generate the plurality of sub-calculations according to the plurality of sub-calculations, the analytical information corresponding to each of
  • the fourth generation module includes: a replacement submodule for replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; the second generation submodule for According to the question stem information obtained after the replacement, a coding vector corresponding to each of the numbers is generated; the third generation sub-module is used to generate the meaning of the numbers according to the coding vector corresponding to the numbers for each of the numbers .
  • Embodiments of the present disclosure further provide a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the above-mentioned information processing method provided by the embodiments of the present disclosure.
  • FIG. 5 it shows a schematic structural diagram of an electronic device (eg, a terminal device or a server) 500 suitable for implementing an embodiment of the present disclosure.
  • Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like.
  • the electronic device shown in FIG. 5 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
  • an electronic device 500 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 501 that may be loaded into random access according to a program stored in a read only memory (ROM) 502 or from a storage device 508 Various appropriate actions and processes are executed by the programs in the memory (RAM) 503 . In the RAM 503, various programs and data required for the operation of the electronic device 500 are also stored.
  • the processing device 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504.
  • An input/output (I/O) interface 505 is also connected to bus 504 .
  • I/O interface 505 input devices 506 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration
  • An output device 507 such as a computer
  • a storage device 508 including, for example, a magnetic tape, a hard disk, etc.
  • Communication means 509 may allow electronic device 500 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 5 shows electronic device 500 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
  • embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart.
  • the computer program may be downloaded and installed from the network via the communication device 509, or from the storage device 508, or from the ROM 502.
  • the processing apparatus 501 When the computer program is executed by the processing apparatus 501, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
  • the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two.
  • the computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing.
  • a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device.
  • a computer-readable signal medium may include a data signal in baseband or propagated as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • a computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device .
  • Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
  • terminal devices and servers can use any currently known or future developed network protocols such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects.
  • network protocols such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects.
  • Examples of communication networks include local area networks (“LAN”), wide area networks (“WAN”), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
  • the above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
  • the above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: obtains the question stem information of the application question to be solved; Question stem information to generate a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved, wherein, the plurality of sub-calculations conform to the operation of the comprehensive formula Logic; for each of the sub-expressions, generate analysis information corresponding to the sub-expressions according to the question stem information of the applied question to be solved, the comprehensive expression and the sub-expressions, wherein the analysis information includes all Describe the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation; according to the multiple sub-calculations and the analysis information corresponding to each of the sub-calculations, generate explanation information corresponding to the multiple sub-calculations.
  • Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages.
  • the program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server.
  • the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider to via Internet connection).
  • LAN local area network
  • WAN wide area network
  • each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions.
  • the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
  • the modules involved in the embodiments of the present disclosure may be implemented in software or hardware.
  • the name of the module does not constitute a limitation of the module itself under certain circumstances, for example, the acquisition module can also be described as "a module for acquiring the question stem information of the applied question to be solved".
  • exemplary types of hardware logic components include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs) and more.
  • FPGAs Field Programmable Gate Arrays
  • ASICs Application Specific Integrated Circuits
  • ASSPs Application Specific Standard Products
  • SOCs Systems on Chips
  • CPLDs Complex Programmable Logical Devices
  • a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device.
  • the machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium.
  • Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing.
  • machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
  • RAM random access memory
  • ROM read only memory
  • EPROM or flash memory erasable programmable read only memory
  • CD-ROM compact disk read only memory
  • magnetic storage or any suitable combination of the foregoing.
  • Example 1 provides an information processing method, including: acquiring question stem information of an applied question to be solved; The comprehensive formula of the applied problem to be solved and a plurality of sub-calculations used to embody the solution process of the applied problem to be solved, wherein the plurality of sub-calculations conform to the operation logic of the comprehensive formula; for each of the sub-calculations , according to the question stem information of the application question to be solved, the comprehensive calculation formula and the sub-calculation, generate the analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and all The unit of the result of the sub-calculation is described; according to the plurality of sub-calculations and the analysis information corresponding to each of the sub-calculations, explanation information corresponding to the plurality of sub-calculations is generated.
  • Example 2 provides the method of Example 1, wherein according to the question stem information of the to-be-solved applied question, a comprehensive formula for completely solving the to-be-solved applied question and the method of The multiple sub-calculations that embody the solution process of the application question to be solved include: inputting the question stem information of the application question to be solved into the first coding network of the solving model, and obtaining the encoded question stem information, wherein the The solution model includes the first encoding network and the first decoding network; the encoded question stem information is input into the first decoding network to obtain the target expression and the operations used to characterize the target expression.
  • the operator is used as the identification information of the unique operator in a sub-calculation; according to the target expression, generate the comprehensive calculation formula, and according to the target expression and the identification information, generate the multiple sub-calculations, wherein,
  • the algebraic expression in the comprehensive expression is composed of the elements in the target expression.
  • Example 3 provides the method of Example 2, the target expression being a prefix expression.
  • Example 4 provides the method of Example 1, and the parsing information further includes knowledge points involved in the sub-calculation.
  • Example 5 provides the method of Example 4, wherein according to the question stem information of the applied question to be solved, the comprehensive formula and the sub-calculation, the corresponding sub-calculation is generated
  • the analytical information including: the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation are sequentially spliced and input into the multi-task sequence-to-sequence model, and the analytical information corresponding to the sub-calculation is obtained. .
  • Example 6 provides the method of Example 5, the multi-task sequence-to-sequence model comprising a second encoding network, a second decoding network, a first classifier, and a second classifier; Wherein, the second encoding network is used to encode the splicing information to obtain the coded information, and the splicing information is obtained by splicing the stem information of the application question to be solved, the comprehensive formula and the sub-calculation in sequence.
  • the second decoding network is used for generating the meaning of the result of the sub-calculation according to the coding information;
  • the first classifier is used for generating the sub-calculation according to the coding information The unit of the result; the second classifier is configured to generate the knowledge points involved in the sub-calculation according to the encoded information.
  • Example 7 provides the method of any one of Examples 1-6, the method further comprising: generating the application to be solved according to the question stem information of the application to be solved The meaning of each number that appears in the algebraic formula in the question stem information of the question; the multiple sub-calculations are generated according to the multiple sub-calculations and the analytical information corresponding to each of the sub-calculations
  • the corresponding explanation information includes: generating explanation information corresponding to the plurality of sub-calculations according to the plurality of sub-calculations, the analysis information corresponding to each of the sub-calculations, and the meaning of each of the numbers.
  • Example 8 provides the method of Example 7, wherein generating the question stem information of the applied question to be solved according to the question stem information of the applied question to be solved, in the The meaning of each number that appears in the algebraic formula in the comprehensive formula includes: replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; according to the question stem information obtained after the replacement, generating A coding vector corresponding to each of the numbers; for each of the numbers, the meaning of the number is generated according to the coding vector corresponding to the number.
  • Example 9 provides an information processing apparatus, including: an acquisition module for acquiring the question stem information of an application question to be solved; a first generation module for obtaining the information according to the acquisition module The obtained question stem information of the applied problem to be solved generates a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved, wherein the A plurality of sub-calculations conform to the operation logic of the comprehensive calculation formula; a second generation module is used for each of the sub-calculations generated by the first generation module, according to the obtained application problem to be solved by the acquisition module Question stem information, the comprehensive formula and the sub-calculation generated by the first generating module, generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the The unit of the result of the sub-calculation; the third generation module is configured to generate all the sub-calculations according to the plurality of sub-calculations
  • Example 10 provides a non-transitory computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements any of Examples 1-8. steps of the method described.
  • Example 11 provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device to The steps of implementing the following method: obtaining the question stem information of the application question to be solved; generating a comprehensive formula for completely solving the application question to be solved and a comprehensive formula used to represent the application question to be solved according to the question stem information of the application question to be solved.
  • a plurality of sub-calculations of the solving process of an applied problem wherein the plurality of sub-calculations conform to the operation logic of the comprehensive formula; for each of the sub-calculations, according to the question stem information of the applied problem to be solved, the comprehensive formula and the sub-calculation, generating analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation; according to the multiple sub-calculations and each The analysis information corresponding to the sub-expressions is used to generate explanation information corresponding to the plurality of sub-expressions.
  • Example 12 provides the electronic device of Example 11, the comprehensive formula and The multiple sub-calculations used to embody the solution process of the applied problem to be solved include: inputting the problem stem information of the applied problem to be solved into the first coding network of the solving model, and obtaining the encoded problem stem information, wherein, The solution model includes the first encoding network and the first decoding network; the encoded question stem information is input into the first decoding network to obtain a target expression and a target expression for characterizing each of the target expressions.
  • the operator is used as the identification information of the unique operator in a sub-calculation; according to the target expression, generate the comprehensive calculation formula, and according to the target expression and the identification information, generate the multiple sub-calculations, wherein , the algebraic expression in the comprehensive calculation formula is composed of each element in the target expression.
  • Example 13 provides the electronic device of Example 12, the target expression being a prefix expression.
  • Example 14 provides the electronic device of Example 11, and the analysis information further includes knowledge points involved in the sub-calculation.
  • Example 15 provides the electronic device of Example 14, the sub-equation is generated according to the stem information of the application question to be solved, the comprehensive equation and the sub-equation Corresponding analysis information includes: splicing the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation in sequence, and then inputting them into the multi-task sequence-to-sequence model to obtain the analysis corresponding to the sub-calculation information.
  • Example 16 provides the electronic device of Example 15, the multitasking sequence-to-sequence model including a second encoding network, a second decoding network, a first classifier, and a second classifier
  • the second coding network is used to encode the splicing information to obtain coding information
  • the splicing information is to sequentially splicing the question stem information, the comprehensive formula and the sub-calculation of the application question to be solved.
  • the second decoding network is used to generate the meaning of the result of the sub-calculation according to the coding information; the first classifier is used to generate the sub-calculation according to the coding information The unit of the result; the second classifier is configured to generate the knowledge points involved in the sub-calculation according to the encoding information.
  • Example 17 provides the electronic device of any one of Examples 11-16, and the method further includes the step of: generating the The meaning of each number that appears in the algebraic formula in the comprehensive formula in the question stem information of the application question to be solved; the generation of the The explanation information corresponding to the multiple sub-calculations includes: generating explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations, the analysis information corresponding to each of the sub-calculations, and the meaning of each of the numbers.
  • Example 18 provides the electronic device of Example 17, wherein the question stem information of the applied question to be solved is generated according to the question stem information of the applied question to be solved, in all
  • the meaning of each number that appears in the algebraic formula in the comprehensive formula includes: replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; according to the question stem information obtained after the replacement, A coding vector corresponding to each of the numbers is generated; for each of the numbers, the meaning of the number is generated according to the coding vector corresponding to the number.
  • a computer program comprising: instructions which, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
  • a computer program product comprising instructions which, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.

Abstract

Provided are an information processing method and apparatus, a medium and an electronic device. The method comprises: acquiring problem stem information of a word problem to be solved (S101); according to the problem stem information, generating a comprehensive equation for completely solving the word problem to be solved and a plurality of sub-equations for embodying a solving process (S102); for each sub-equation, generating analysis information, corresponding to the sub-equation, according to the problem stem information, the comprehensive equation and the sub-equation (S103); and generating, according to the plurality of sub-equations and the analysis information corresponding to each sub-equation, explanation information corresponding to the plurality of sub-equations (S104).

Description

信息处理方法、装置、介质及电子设备Information processing method, apparatus, medium and electronic equipment
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请是以CN申请号为202110251093.5,申请日为2021年3月8日的申请为基础,并主张其优先权,该CN申请的公开内容在此作为整体引入本申请中。This application is based on the CN application number 202110251093.5 and the filing date is March 8, 2021, and claims its priority. The disclosure of the CN application is hereby incorporated into this application as a whole.
技术领域technical field
本公开实施例涉及计算机应用领域,具体地,涉及一种信息处理方法、装置、介质及电子设备。The embodiments of the present disclosure relate to the field of computer applications, and in particular, to an information processing method, apparatus, medium, and electronic device.
背景技术Background technique
数学应用题对于培养学生的数学抽象能力极为重要,是数学教学中的重点和难点。相关技术中,应用题辅助学习系统大多是根据用户输入的应用题题干,在数据库中进行搜索,以找出与用户输入的题干的文本相似度最高的目标题干,之后,将与目标题干对应的答案和解析反馈给用户。Mathematical application problems are extremely important for cultivating students' mathematical abstraction ability, and are the key and difficult points in mathematics teaching. In the related art, most of the application question assisted learning systems search in the database according to the application question stem input by the user to find the target question stem with the highest similarity with the text of the question stem input by the user, and then compare it with the target question stem. The title stem corresponds to the answer and parsing feedback to the user.
发明内容SUMMARY OF THE INVENTION
提供该发明内容部分以便以简要的形式介绍构思,这些构思将在后面的具体实施方式部分被详细描述。该发明内容部分并不旨在标识要求保护的技术方案的关键特征或必要特征,也不旨在用于限制所要求的保护的技术方案的范围。This Summary is provided to introduce concepts in a simplified form that are described in detail in the Detailed Description section that follows. This summary section is not intended to identify key features or essential features of the claimed technical solution, nor is it intended to be used to limit the scope of the claimed technical solution.
本公开实施例提供了一种信息处理方法、装置、介质及电子设备。Embodiments of the present disclosure provide an information processing method, apparatus, medium, and electronic device.
第一方面,本公开实施例提供一种信息处理方法,包括:In a first aspect, an embodiment of the present disclosure provides an information processing method, including:
获取待求解应用题的题干信息;Obtain the stem information of the application questions to be solved;
根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;According to the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved are generated, wherein the plurality of sub-calculations The calculation formula conforms to the operation logic of the comprehensive calculation formula;
针对每一所述子算式,根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;For each of the sub-expressions, generate analysis information corresponding to the sub-expressions according to the question stem information of the application question to be solved, the comprehensive expression and the sub-expressions, wherein the analysis information includes the sub-expressions The meaning of the result of the calculation and the unit of the result of the sub-calculation;
根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对 应的讲解信息。According to the plurality of sub-calculations and the analysis information corresponding to each of the sub-calculations, explanation information corresponding to the plurality of sub-calculations is generated.
第二方面,本公开实施例提供一种信息处理装置,包括:In a second aspect, an embodiment of the present disclosure provides an information processing apparatus, including:
获取模块,用于获取待求解应用题的题干信息;The acquisition module is used to acquire the question stem information of the application question to be solved;
第一生成模块,用于根据所述获取模块获取到的所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;The first generation module is configured to generate a comprehensive formula for completely solving the to-be-solved applied problem and a comprehensive formula for reflecting the to-be-solved applied problem according to the question stem information of the to-be-solved applied problem obtained by the acquisition module A plurality of sub-calculations of the solution process, wherein, the plurality of sub-calculations conform to the operation logic of the comprehensive formula;
第二生成模块,用于针对所述第一生成模块生成的每一所述子算式,根据所述获取模块获取到的所述待求解应用题的题干信息、所述第一生成模块生成的所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;The second generating module is configured to, for each of the sub-expressions generated by the first generating module, obtain the problem stem information of the applied question to be solved obtained by the obtaining module, the data generated by the first generating module The integrated calculation formula and the sub-calculation generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation;
第三生成模块,用于根据所述第一生成模块生成的所述多个子算式和所述第二生成模块生成的每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。a third generation module, configured to generate explanations corresponding to the multiple sub-expressions according to the multiple sub-expressions generated by the first generation module and the analysis information corresponding to each of the sub-expressions generated by the second generation module information.
第三方面,本公开实施例提供一种非易失性计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现本公开第一方面提供的所述方法的步骤。In a third aspect, an embodiment of the present disclosure provides a non-volatile computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the method provided in the first aspect of the present disclosure.
第四方面,本公开实施例提供一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现本公开第一方面提供的所述方法的步骤。In a fourth aspect, an embodiment of the present disclosure provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device to implement the first aspect of the present disclosure Provided the steps of the method.
第五方面,本公开提供一种计算机程序,包括:指令,所述指令当由处理器执行时使所述处理器执行本公开第一方面提供的方法。In a fifth aspect, the present disclosure provides a computer program, comprising: instructions that, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
第六方面,本公开提供一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行本公开第一方面提供的方法。In a sixth aspect, the present disclosure provides a computer program product comprising instructions that, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
本公开的其他特征和优点将在随后的具体实施方式部分予以详细说明。Other features and advantages of the present disclosure will be described in detail in the detailed description that follows.
附图说明Description of drawings
结合附图并参考以下具体实施方式,本公开各实施例的上述和其他特征、优点及方面将变得更加明显。贯穿附图中,相同或相似的附图标记表示相同或相似的元素。应当理解附图是示意性的,原件和元素不一定按照比例绘制。在附图中:The above and other features, advantages and aspects of various embodiments of the present disclosure will become more apparent when taken in conjunction with the accompanying drawings and with reference to the following detailed description. Throughout the drawings, the same or similar reference numbers refer to the same or similar elements. It should be understood that the drawings are schematic and that the originals and elements are not necessarily drawn to scale. In the attached image:
图1是根据本公开一示例性实施例示出的一种信息处理方法的流程图。FIG. 1 is a flowchart of an information processing method according to an exemplary embodiment of the present disclosure.
图2是根据本公开一示例性实施例示出的一种根据目标表达式和标识信息,生成 的二叉树的结构示意图。Fig. 2 is a schematic structural diagram of a binary tree generated according to a target expression and identification information according to an exemplary embodiment of the present disclosure.
图3是根据本公开另一示例性实施例示出的一种信息处理方法的流程图。FIG. 3 is a flowchart of an information processing method according to another exemplary embodiment of the present disclosure.
图4是根据本公开一示例性实施例示出的一种信息处理装置的框图。FIG. 4 is a block diagram of an information processing apparatus according to an exemplary embodiment of the present disclosure.
图5是根据本公开一示例性实施例示出的一种电子设备的框图。FIG. 5 is a block diagram of an electronic device according to an exemplary embodiment of the present disclosure.
具体实施方式Detailed ways
下面将参照附图更详细地描述本公开的实施例。虽然附图中显示了本公开的某些实施例,然而应当理解的是,本公开可以通过各种形式来实现,而且不应该被解释为限于这里阐述的实施例,相反提供这些实施例是为了更加透彻和完整地理解本公开。应当理解的是,本公开的附图及实施例仅用于示例性作用,并非用于限制本公开的保护范围。Embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While certain embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be construed as limited to the embodiments set forth herein, but rather are provided for the purpose of A more thorough and complete understanding of the present disclosure. It should be understood that the drawings and embodiments of the present disclosure are only for exemplary purposes, and are not intended to limit the protection scope of the present disclosure.
应当理解,本公开的方法实施方式中记载的各个步骤可以按照不同的顺序执行,和/或并行执行。此外,方法实施方式可以包括附加的步骤和/或省略执行示出的步骤。本公开的范围在此方面不受限制。It should be understood that the various steps described in the method embodiments of the present disclosure may be performed in different orders and/or in parallel. Furthermore, method embodiments may include additional steps and/or omit performing the illustrated steps. The scope of the present disclosure is not limited in this regard.
本文使用的术语“包括”及其变形是开放性包括,即“包括但不限于”。术语“基于”是“至少部分地基于”。术语“一个实施例”表示“至少一个实施例”;术语“另一实施例”表示“至少一个另外的实施例”;术语“一些实施例”表示“至少一些实施例”。其他术语的相关定义将在下文描述中给出。As used herein, the term "including" and variations thereof are open-ended inclusions, ie, "including but not limited to". The term "based on" is "based at least in part on." The term "one embodiment" means "at least one embodiment"; the term "another embodiment" means "at least one additional embodiment"; the term "some embodiments" means "at least some embodiments". Relevant definitions of other terms will be given in the description below.
需要注意,本公开中提及的“第一”、“第二”等概念仅用于对不同的装置、模块或单元进行区分,并非用于限定这些装置、模块或单元所执行的功能的顺序或者相互依存关系。It should be noted that concepts such as "first" and "second" mentioned in the present disclosure are only used to distinguish different devices, modules or units, and are not used to limit the order of functions performed by these devices, modules or units or interdependence.
需要注意,本公开中提及的“一个”、“多个”的修饰是示意性而非限制性的,本领域技术人员应当理解,除非在上下文另有明确指出,否则应该理解为“一个或多个”。It should be noted that the modifications of "a" and "a plurality" mentioned in the present disclosure are illustrative rather than restrictive, and those skilled in the art should understand that unless the context clearly indicates otherwise, they should be understood as "one or a plurality of". multiple".
本公开实施方式中的多个装置之间所交互的消息或者信息的名称仅用于说明性的目的,而并不是用于对这些消息或信息的范围进行限制。The names of messages or information exchanged between multiple devices in the embodiments of the present disclosure are only for illustrative purposes, and are not intended to limit the scope of these messages or information.
由于应用题是无穷无尽、千变万化的,数据库中不可能涵盖所有的应用题。这样,当用户查询的应用题不在数据库中时,只能给出最相似应用题的解答,而无法给出原题解答,无法完全满足用户的需求。Since the application questions are endless and ever-changing, it is impossible to cover all the application questions in the database. In this way, when the application question queried by the user is not in the database, only the answer to the most similar application question can be given, but the answer to the original question cannot be given, and the user's needs cannot be fully met.
针对上述技术问题,本公开实施例提供了一种信息处理方法、装置、介质及电子 设备。In view of the above technical problems, embodiments of the present disclosure provide an information processing method, apparatus, medium, and electronic device.
图1是根据本公开一示例性实施例示出的一种信息处理方法的流程图。如图1所示,该方法包括S101~S104。FIG. 1 is a flowchart of an information processing method according to an exemplary embodiment of the present disclosure. As shown in FIG. 1 , the method includes S101 to S104.
在S101中,获取待求解应用题的题干信息。In S101, the question stem information of the application question to be solved is obtained.
在本公开实施例中,应用题是用语言或文字叙述有关事实,反映某种数学关系(例如:数量关系、位置关系等),并求解未知数量的题目,其中,待求解应用题的形式可以为需要用户给出求解过程的常规应用题,还可以以选择题或者填空题的形式呈现。In the embodiments of the present disclosure, the application questions describe relevant facts in language or words, reflect a certain mathematical relationship (eg, quantitative relationship, positional relationship, etc.), and solve an unknown number of questions, wherein the application questions to be solved can be in the form of For general application questions that require users to give the solution process, they can also be presented in the form of multiple-choice questions or fill-in-the-blank questions.
上述信息处理方法可以应用于电子设备,例如,终端设备或者与终端设备通信连接的服务器。其中,终端设备可以例如是智能手机、平板电脑、个人计算机等。其中,用户可以以图像、文本、音频等形式向终端设备输入待求解应用题的题干信息,之后,终端设备接收该题干信息,并根据该题干信息进行自动求解,或者,终端设备将该题干信息发送给服务器,以由服务器根据该题干信息进行自动求解,并将求解结果通过终端设备反馈给用户。The above-mentioned information processing method can be applied to electronic equipment, for example, a terminal equipment or a server communicatively connected to the terminal equipment. The terminal device may be, for example, a smart phone, a tablet computer, a personal computer, or the like. Among them, the user can input the question stem information of the application question to be solved to the terminal device in the form of image, text, audio, etc., and then the terminal device receives the question stem information and automatically solves it according to the question stem information, or the terminal device will The question stem information is sent to the server, so that the server can automatically solve the question according to the question stem information, and feedback the solution result to the user through the terminal device.
在S102中,根据待求解应用题的题干信息,生成用于完整求解待求解应用题的综合算式和用于体现待求解应用题的求解过程的多个子算式。In S102, according to the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved are generated.
在本公开实施例中,上述多个子算式符合综合算式的运算逻辑。In the embodiment of the present disclosure, the above-mentioned multiple sub-calculations conform to the operation logic of the comprehensive calculation formula.
在S103中,针对每一子算式,根据待求解应用题的题干信息、综合算式以及该子算式,生成该子算式对应的解析信息。In S103, for each sub-expression, according to the question stem information of the application question to be solved, the comprehensive formula, and the sub-expression, the analysis information corresponding to the sub-expression is generated.
在本公开实施例中,解析信息包括子算式的结果的含义和子算式的结果的单位。In the embodiment of the present disclosure, the analysis information includes the meaning of the result of the sub-expression and the unit of the result of the sub-expression.
在S104中,根据多个子算式和每一子算式对应的解析信息,生成多个子算式对应的讲解信息。In S104, according to the plurality of sub-expressions and the analysis information corresponding to each sub-expression, explanation information corresponding to the plurality of sub-expressions is generated.
在本公开实施例中,讲解信息可以包括讲解文本、讲解音频、讲解视频中的至少一者。并且,可以根据多个子算式和每一子算式对应的解析信息,通过规则模板生成多个子算式对应的讲解信息。In an embodiment of the present disclosure, the explanation information may include at least one of explanation text, explanation audio, and explanation video. In addition, explanation information corresponding to the multiple sub-calculations can be generated through a rule template according to the multiple sub-calculations and the analysis information corresponding to each sub-calculation.
在上述技术方案中,在获取到待求解应用题的题干信息后,根据其生成用于完整求解上述待求解应用题的综合算式和用于体现该待求解应用题的求解过程的多个子算式。这样,不依赖于数据库,能够对用户输入的待求解应用题进行针对性的解答,提升了解答的准确性。另外,不但能够提供用于完整求解上述待求解应用题的综合算式,还能够给出用于体现该待求解应用题的求解过程的多个子算式以及多个子算式对应的讲解信息,并且,讲解信息中包括各子算式的结果的含义和单位,便于用户快速、 彻底理解待求解应用题的解题思想,提升学习效果和效率。In the above technical solution, after obtaining the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the above applied problem to be solved and a plurality of sub-calculations used to reflect the solution process of the applied problem to be solved are generated according to it. . In this way, without relying on the database, the application problem to be solved input by the user can be answered in a targeted manner, which improves the accuracy of the answer. In addition, it can not only provide a comprehensive formula for completely solving the above-mentioned applied problem to be solved, but also can provide a plurality of sub-calculations used to reflect the solving process of the applied problem to be solved and the explanation information corresponding to the multiple sub-calculations, and the explanation information It includes the meaning and unit of the result of each sub-expression, which is convenient for users to quickly and thoroughly understand the problem-solving idea of the application problem to be solved, and improve the learning effect and efficiency.
示例地,待求解应用题的题干信息如下:For example, the stem information of the application question to be solved is as follows:
一个服装厂,6月份前12天平均每天做衣服1850套,后18天平均每天做衣服2100套,这个月平均每天做衣服多少套?A garment factory makes an average of 1,850 sets of clothes every day in the first 12 days of June, and an average of 2,100 sets of clothes every day in the next 18 days. How many sets of clothes are made every day this month?
通过上述S102,可以得到用于完整求解上述待求解应用题的综合算式(1850×12+2100×18)/(12+18)=2000以及用于体现待求解应用题的求解过程的四个子算式:1850×12=22200、2100×18=37800、12+18=30、(22200+37800)/30=2000。Through the above S102, a comprehensive formula (1850×12+2100×18)/(12+18)=2000 for completely solving the above-mentioned applied problem to be solved and four sub-calculations for reflecting the solving process of the applied problem to be solved can be obtained : 1850×12=22200, 2100×18=37800, 12+18=30, (22200+37800)/30=2000.
针对上述四个子算式中的每一子算式,通过上述步骤103生成的解析信息如下表1中所示:For each of the above four sub-calculations, the analytical information generated by the above step 103 is shown in Table 1 below:
表1 子算式对应的解析信息Table 1 Analytical information corresponding to sub-expressions
子算式sub-expression 子算式的结果的含义Meaning of the result of the subexpression 子算式的结果的单位the unit of the result of the subexpression
1850×12=222001850×12=22200 前12天做的衣服Clothes made in the previous 12 days set
2100×18=378002100×18=37800 后18天做的衣服Clothes made after 18 days set
12+18=3012+18=30 总的天数total days sky
(22200+37800)/30=2000(22200+37800)/30=2000 每天做的衣服clothes made every day set
示例地,上述讲解信息可以为讲解文本,根据上述四个子算式和表1中所示的每一子算式对应的解析信息,通过规则模板生成的该四个子算式对应的讲解文本如下:Exemplarily, the above-mentioned explanation information can be an explanation text. According to the above-mentioned four sub-calculations and the analysis information corresponding to each sub-calculation shown in Table 1, the explanation text corresponding to the four sub-calculations generated by the rule template is as follows:
亲爱的同学,我们一起来看下这道题吧!Dear students, let's take a look at this question together!
为了求得每天做的衣服,我们首先需要求前12天做的衣服,那么前12天做的衣服为1850×12=22200套。In order to obtain the clothes made every day, we first need to ask for the clothes made in the previous 12 days, then the clothes made in the first 12 days are 1850×12=22200 sets.
接下来,我们求后18天做的衣服,那么后18天做的衣服为2100×18=37800套。Next, we ask for the clothes made in the last 18 days, then the clothes made in the last 18 days are 2100×18=37800 sets.
然后,我们求一下总的天数,那么总的天数为12+18=30天。Then, we find the total number of days, then the total number of days is 12+18=30 days.
最后,我们来求每天做的衣服。用前12天做的衣服22200套,后18天做的衣服37800套和总的天数30天进行计算,得出每天做的衣服为(22200+37800)/30=2000套。Finally, let's ask for the clothes we make every day. Using 22,200 sets of clothes made in the first 12 days, 37,800 sets of clothes made in the next 18 days and a total of 30 days to calculate, the clothes made every day are (22,200+37,800)/30=2,000 sets.
这样,这道题就全部解出来啦。不要忘了写答句哦!In this way, the problem is all solved. Don't forget to write your answer!
在上述技术方案中,在获取到待求解应用题的题干信息后,根据其生成用于完整求解上述待求解应用题的综合算式和用于体现该待求解应用题的求解过程的多个子算式。这样,不依赖于数据库,能够对用户输入的待求解应用题进行针对性的解答, 提升了解答的准确性。另外,不但能够提供用于完整求解上述待求解应用题的综合算式,还能够给出用于体现该待求解应用题的求解过程的多个子算式以及多个子算式对应的讲解信息,并且,讲解信息中包括各子算式的结果的含义和单位,便于用户快速、彻底理解待求解应用题的解题思想,提升学习效果和效率。In the above technical solution, after obtaining the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the above applied problem to be solved and a plurality of sub-calculations used to reflect the solution process of the applied problem to be solved are generated according to it. . In this way, without relying on the database, the application problem to be solved input by the user can be answered in a targeted manner, which improves the accuracy of the answer. In addition, it can not only provide a comprehensive formula for completely solving the above-mentioned applied problem to be solved, but also can provide a plurality of sub-calculations used to reflect the solving process of the applied problem to be solved and the explanation information corresponding to the multiple sub-calculations, and the explanation information It includes the meaning and unit of the results of each sub-calculation, which is convenient for users to quickly and thoroughly understand the problem-solving idea of the application problem to be solved, and improve the learning effect and efficiency.
下面针对上述S102中的根据待求解应用题的题干信息,生成用于完整求解待求解应用题的综合算式和用于体现待求解应用题的求解过程的多个子算式的具体实施方式进行详细说明。具体来说,可以通过以下步骤(1)~步骤(3)来实现。The specific implementation of generating a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved according to the question stem information of the applied problem to be solved in the above S102 will be described in detail below. . Specifically, it can be realized by the following steps (1) to (3).
(1)将待求解应用题的题干信息输入求解模型的第一编码网络中,得到编码后的题干信息。(1) Input the question stem information of the application question to be solved into the first coding network of the solving model to obtain the encoded question stem information.
(2)将编码后的题干信息输入求解模型的第一解码网络中,得到目标表达式和用于表征目标表达式中的各运算符是否作为一子算式中的唯一运算符的标识信息。(2) Input the encoded question stem information into the first decoding network of the solving model, and obtain the target expression and identification information used to characterize whether each operator in the target expression is used as a unique operator in a sub-expression.
(3)根据目标表达式,生成综合算式,并根据目标表达式和标识信息,生成多个子算式。(3) According to the target expression, a comprehensive formula is generated, and according to the target expression and the identification information, a plurality of sub-calculations are generated.
在本公开实施例中,综合算式中的代数式由目标表达式中的各元素构成。求解模型可以例如是序列到序列模型,具体来说,该求解模型可以包括第一编码网络和第一解码网络。其中,第一编码网络用于对待求解应用题的题干信息进行编码,第一解码网络用于根据第一编码网络输出的、编码后的题干信息,生成目标表达式和用于表征目标表达式中的各运算符是否作为一子算式中的唯一运算符的标识信息。In the embodiment of the present disclosure, the algebraic expression in the comprehensive calculation formula is composed of each element in the target expression. The solution model may be, for example, a sequence-to-sequence model, in particular, the solution model may include a first encoding network and a first decoding network. Wherein, the first encoding network is used for encoding the question stem information of the application question to be solved, and the first decoding network is used for generating the target expression and representing the target expression according to the encoded question stem information output by the first encoding network Whether each operator in the formula is used as the identification information of the unique operator in a sub-calculation.
示例地,第一编码网络可以为采用循环神经网络(Recurrent Neural Network,RNN)、卷积神经网络(Convolutional Neural Networks,CNN)、预训练语言模型RoBERTa(a Robustly Optimized BERT Pretraining Approach,一种更鲁棒的优化BERT预训练方法)等,其中,BERT的英文全称为Bidirectional Encoder Representation from Transformers,即双向语言表征模型。解码网络可以例如是RNN、CNN、多层的Transformer等。Exemplarily, the first encoding network may be a Recurrent Neural Network (RNN), Convolutional Neural Networks (CNN), a pre-trained language model RoBERTa (a Robustly Optimized BERT Pretraining Approach, a more robust The best optimized BERT pre-training method), etc., among which, the full English name of BERT is Bidirectional Encoder Representation from Transformers, that is, a bidirectional language representation model. The decoding network can be, for example, an RNN, a CNN, a multi-layer Transformer, or the like.
另外,上述目标表达式可以为前缀表达式、中缀表达式、后缀表达式等。In addition, the above-mentioned target expression may be a prefix expression, an infix expression, a postfix expression, or the like.
示例地,针对上述示例中的计算服装厂日产量的待求解应用题,第一解码网络根据编码后的题干信息,生成的目标表达式为“/+×1850 12×2100 18+12 18(前缀表达式)”、“1850×12+2100×18/12+18(中缀表达式)”、“1850 12×2100 18×12 18+/(后缀表达式)”中一者。For example, for the application question to be solved for calculating the daily output of a garment factory in the above example, the first decoding network generates the target expression according to the encoded question stem information as "/+×1850 12×2100 18+12 18( Prefix expression)", "1850×12+2100×18/12+18(infix expression)", "1850 12×2100 18×12 18+/(postfix expression)".
优选地,上述目标表达式可以为前缀表达式,这样,更加符合人类思考习惯,并 且无需使用括号来表示计算优先级,使得求解模型更容易训练。Preferably, the above-mentioned target expression can be a prefix expression, which is more in line with human thinking habits, and does not need to use parentheses to indicate calculation priorities, making the solution model easier to train.
此外,上述求解模型可以通过以下方式训练得到:获取第一参考应用题的第一参考信息,其中,该第一参考信息包括第一参考题干信息、用于完整求解上述第一参考应用题的第一参考综合算式以及用于体现该第一参考应用题的求解过程的多个第一参考子算式,第一参考应用题可以为题库中的应用题;然后,根据第一参考综合算式,得到参考表达式和用于表征该参考表达式中的各运算符是否作为一子算式中的唯一运算符的参考标识信息;最后,通过将第一参考题干信息作为求解模型的输入,将参考表达式和参考标识信息作为求解模型的目标输出的方式进行模型训练,得到求解模型。In addition, the above-mentioned solving model can be obtained by training in the following manner: obtaining first reference information of the first reference application question, wherein the first reference information includes the first reference question stem information, and the information used to completely solve the above-mentioned first reference application question. The first reference comprehensive calculation formula and a plurality of first reference sub-calculations used to embody the solution process of the first reference applied problem, the first reference applied problem can be the applied problem in the question bank; then, according to the first reference comprehensive formula, obtain The reference expression and the reference identification information used to characterize whether each operator in the reference expression is used as a unique operator in a sub-expression; finally, by using the first reference question stem information as the input of the solution model, the reference expression The formula and reference identification information are used as the target output of the solution model to train the model, and the solution model is obtained.
下面针对上述步骤(3)中的根据目标表达式,生成综合算式的具体实施方式进行详细说明。The specific implementation of generating a comprehensive formula according to the target expression in the above step (3) will be described in detail below.
具体来说,可以根据目标表达式的形式(形式可以为前缀表达式、中缀表达式以及后缀表达式中的一者),直接将该目标表达式转换为代数式;然后,计算转换后所得的代数式的结果;最后,可以将转换后所得的代数式、等号、转换后所得的代数式的结果依次连接,得到综合算式。Specifically, the target expression can be directly converted into an algebraic expression according to the form of the target expression (the form can be one of a prefix expression, an infix expression, and a postfix expression); then, the converted result is calculated. The result of the algebraic formula; finally, the algebraic formula obtained after conversion, the equal sign, and the result of the algebraic formula obtained after conversion can be connected in turn to obtain a comprehensive formula.
示例地,通过上述步骤(2)得到的目标表达式为中缀表达式,该中缀表达式为1850×12+2100×18/12+18,将该中缀表达式1850×12+2100×18/12+18转换为代数式(1850×12+2100×18)/(12+18);然后,计算出的代数式(1850×12+2100×18)/(12+18)结果为2000;最后,可以将转换后所得的代数式(1850×12+2100×18)/(12+18)、=、相应的结果2000依次连接,得到综合算式(1850×12+2100×18)/(12+18)=2000。For example, the target expression obtained by the above step (2) is an infix expression, the infix expression is 1850×12+2100×18/12+18, and the infix expression 1850×12+2100× 18/12+18 is converted into the algebraic formula (1850×12+2100×18)/(12+18); then, the calculated algebraic formula (1850×12+2100×18)/(12+18) results in 2000; finally , the converted algebraic formula (1850×12+2100×18)/(12+18), =, and the corresponding result 2000 can be connected in turn to obtain the comprehensive formula (1850×12+2100×18)/(12+18 )=2000.
下面针对上述步骤(3)中的根据目标表达式和标识信息,生成多个子算式的具体实施方式进行详细说明。The specific implementation of generating multiple sub-expressions according to the target expression and the identification information in the above step (3) will be described in detail below.
具体来说,可以先根据目标表达式和标识信息,生成二叉树;然后,根据该二叉树,生成多个子算式。Specifically, a binary tree can be generated first according to the target expression and identification information; then, a plurality of sub-expressions can be generated according to the binary tree.
在本公开实施例中,上述二叉树的节点由目标表达式中的数字和运算符构成。其中,二叉树中数字节点的属性信息包括属性值,并且,该属性值为数字本身;二叉树中运算符节点的属性信息包括用于表征该运算符节点是否作为一子算式中的唯一运算符的标志符(即用于表征该运算符节点是否被分割)属性值。也就是说,当标志符表征该运算符节点被分割时,或者,当标志符表征该运算符节点不被分割、且该运算 符节点是根节点时,可以将该运算符节点的左子节点的属性值、该运算符节点所表征的运算符、该运算符节点的右子节点的属性值构成一子算式中代数式的元素,此时,运算符节点的属性值为其左子节点的属性值、右子节点的属性值经过该运算符节点所表征的运算符运算后所得的结果;当标志符表征该运算符节点不被分割、且该运算符节点并非是根节点时,该运算符节点至少需要与其父节点一起构成子算式的代数式,此时,运算符节点的属性值为左括号、其左子节点的属性值、该运算符节点所表征的运算符、其右子节点的属性值、右括号依次连接后所得的代数式。In the embodiment of the present disclosure, the nodes of the above binary tree are composed of numbers and operators in the target expression. Wherein, the attribute information of the number node in the binary tree includes an attribute value, and the attribute value is the number itself; the attribute information of the operator node in the binary tree includes a flag used to characterize whether the operator node is the only operator in a sub-calculation operator (that is, used to characterize whether the operator node is split) attribute value. That is, when the identifier indicates that the operator node is split, or, when the identifier indicates that the operator node is not split, and the operator node is the root node, the left child of the operator node can be The attribute value of the operator node, the operator represented by the operator node, and the attribute value of the right child node of the operator node constitute an element of the algebraic expression in the sub-expression. At this time, the attribute value of the operator node is the attribute value of its left child node. The result obtained after the value and the attribute value of the right child node are operated by the operator represented by the operator node; when the identifier indicates that the operator node is not divided, and the operator node is not the root node, the operator A node at least needs to form an algebraic expression of a sub-expression together with its parent node. At this time, the attribute value of the operator node is the left parenthesis, the attribute value of its left child node, the operator represented by the operator node, and the attribute of its right child node. The algebraic expression obtained by connecting the value and the closing parenthesis in sequence.
示例地,上述标志符为0或1,其中,0用于表征该运算符节点不被分割,1用于表征该运算符节点被分割。For example, the above identifier is 0 or 1, wherein 0 is used to indicate that the operator node is not divided, and 1 is used to indicate that the operator node is divided.
示例地,针对上述示例中的计算服装厂日产量的待求解应用题,根据目标表达式和标识信息,生成的二叉树如图2所示。由图2可知,该二叉树包括五个运算符节点和六个数字节点。Illustratively, for the application problem to be solved for calculating the daily output of a garment factory in the above example, according to the target expression and identification information, the generated binary tree is shown in FIG. 2 . As can be seen from Figure 2, the binary tree includes five operator nodes and six numeric nodes.
其中,图2中所示的二叉树第三层左侧第一个运算符节点“×”对应的标识符为1,其属性值为22200(即1850×12),则该运算符节点“×”被分割,此时,该运算符节点“×”的左子节点“1850”的属性值1850、该运算符节点“×”所表征的运算符“×”、该运算符节点“×”的右子节点“12”的属性值12构成一子算式中代数式的元素,即1850、×、12构成一子算式中代数式的元素。Among them, the identifier corresponding to the first operator node "×" on the left side of the third layer of the binary tree shown in Figure 2 is 1, and its attribute value is 22200 (ie 1850×12), then the operator node "×" is divided, at this time, the attribute value 1850 of the left child node "1850" of the operator node "x", the operator "x" represented by the operator node "x", the right side of the operator node "x" The attribute value 12 of the child node "12" constitutes an element of an algebraic expression in a sub-expression, that is, 1850, ×, and 12 constitute an element of an algebraic expression in a sub-expression.
图2中所示的二叉树第三层左侧第二个运算符节点“×”对应的标识符为1,其属性值为37800(即2100×18),则该运算符节点“×”被分割,此时,该运算符节点“×”的左子节点“2100”的属性值2100、该运算符节点“×”所表征的运算符“×”、该运算符节点“×”的右子节点“18”的属性值18构成另一子算式中代数式的元素,即2100、×、18构成另一子算式中代数式的元素。The identifier corresponding to the second operator node "×" on the left side of the third layer of the binary tree shown in Figure 2 is 1, and its attribute value is 37800 (ie 2100×18), then the operator node "×" is divided , at this time, the attribute value 2100 of the left child node "2100" of the operator node "×", the operator "×" represented by the operator node "×", and the right child node of the operator node "×" The attribute value 18 of "18" constitutes an element of an algebraic expression in another sub-expression, that is, 2100, ×, and 18 constitute an element of an algebraic expression in another sub-expression.
图2中所示的二叉树第二层左侧第一个运算符节点“+”对应的标识符为0,其属性值为(22200+37800),则该运算符节点“+”不被分割,由于该运算符节点“+”并非是根节点,此时,该运算符节点“+”至少需要与其父节点“/”一起构成子算式的代数式。The identifier corresponding to the first operator node "+" on the left side of the second layer of the binary tree shown in Figure 2 is 0, and its attribute value is (22200+37800), then the operator node "+" is not divided, Since the operator node "+" is not the root node, at this time, the operator node "+" needs to at least form the algebraic expression of the sub-expression together with its parent node "/".
图2中所示的二叉树第二层左侧第二个运算符节点“+”对应的标识符为1,其属性值为30(即12+18),则该运算符节点“+”被分割,此时,该运算符节点“+”的左子节点“12”的属性值12、该运算符节点“+”所表征的运算符“+”、该运算符节点“+”的右子节点“18”的属性值18构成一子算式的代数式部分的元素,即12、+、 18构成一子算式中代数式的元素。The identifier corresponding to the second operator node "+" on the left side of the second layer of the binary tree shown in Figure 2 is 1, and its attribute value is 30 (ie 12+18), then the operator node "+" is divided , at this time, the attribute value 12 of the left child node "12" of the operator node "+", the operator "+" represented by the operator node "+", and the right child node of the operator node "+" The attribute value 18 of "18" constitutes the elements of the algebraic part of a sub-expression, that is, 12, +, 18 constitute the elements of the algebraic expression in a sub-expression.
图2中所示的二叉树第一层的运算符节点“/”(即根节点)对应的标识符为0,其属性值为2000(即(22200+37800)/30),则该运算符节点“/”不被分割,此时,该运算符节点“/”的左子节点“+”的属性值(22200+37800)、该运算符节点“/”所表征的运算符“/”、该运算符节点“/”的右子节点“+”的属性值30构成另一子算式的代数式部分的元素,即(22200+37800)、/、30构成另一子算式中代数式的元素。The identifier corresponding to the operator node "/" (ie the root node) of the first layer of the binary tree shown in Figure 2 is 0, and its attribute value is 2000 (ie (22200+37800)/30), then the operator node "/" is not divided. At this time, the attribute value (22200+37800) of the left child node "+" of the operator node "/", the operator "/" represented by the operator node "/", the The attribute value 30 of the right child node "+" of the operator node "/" constitutes the elements of the algebraic part of another sub-expression, namely (22200+37800), /, 30 constitute the elements of the algebraic expression in the other sub-expression.
具体来说,可以根据上述二叉树,通过以下方式来生成多个子算式:Specifically, according to the above binary tree, multiple sub-calculations can be generated in the following ways:
首先,针对二叉树中的每一运算符:根据该运算符节点的标志符,来判定其是否被分割;若该运算符节点被分割,或者该运算符节点不被分割、且该运算符节点为根节点,则将该运算符节点的左子节点的属性值、该运算符节点所表征的运算符、该运算符节点的右子节点的属性值依次连接,得到一子算式的代数式,之后,将该代数式、等号、该运算符节点的属性值依次连接,得到一子算式;若该运算符节点不被分割、且该运算符节点并非是根节点,则不生成子算式,即不执行任何操作。First, for each operator in the binary tree: according to the identifier of the operator node, determine whether it is divided; if the operator node is divided, or the operator node is not divided, and the operator node is the root node, then connect the attribute value of the left child node of the operator node, the operator represented by the operator node, and the attribute value of the right child node of the operator node in order to obtain an algebraic expression of a sub-expression, and then, Connect the algebraic expression, the equal sign, and the attribute value of the operator node in turn to obtain a sub-expression; if the operator node is not divided and the operator node is not the root node, the sub-expression will not be generated, that is, it will not be executed. any action.
示例地,针对上述示例中的计算服装厂日产量的待求解应用题,根据编码后的题干信息,生成的二叉树如图2所示。由图2可知,该二叉树包括五个运算符节点和六个数字节点。Illustratively, for the application question to be solved for calculating the daily output of a garment factory in the above example, a binary tree is generated according to the encoded question stem information as shown in FIG. 2 . As can be seen from Figure 2, the binary tree includes five operator nodes and six numeric nodes.
针对图2中所示的二叉树的第三层左侧第一个运算符节点“×”,其对应的标识符为1,其属性值为22200(即1850×12),则该运算符节点“×”被分割,此时,将该运算符节点的左子节点“1850”的属性值1850、该运算符节点“×”所表征的运算符“×”、该运算符节点的右子节点“12”的属性值12依次连接,得到一子算式的代数式1850×12,之后,将可以将该代数式1850×12、=、该运算符节点“×”的属性值22200依次连接,得到一子算式1850×12=22200。For the first operator node "×" on the left side of the third layer of the binary tree shown in Figure 2, its corresponding identifier is 1, and its attribute value is 22200 (ie 1850×12), then the operator node " ×” is divided, at this time, the attribute value 1850 of the left child node “1850” of the operator node, the operator “×” represented by the operator node “×”, and the right child node of the operator node “ The attribute value 12 of 12" is connected in turn to obtain the algebraic formula 1850×12 of a sub-calculation. After that, the algebraic formula 1850×12, =, and the attribute value 22200 of the operator node "×" can be connected in sequence to obtain a sub-calculation. 1850×12=22200.
针对图2中所示的二叉树的第三层左侧第二个运算符节点“×”,其对应的标识符为1,其属性值为37800(即2100×18),则该运算符节点“×”被分割,此时,将该运算符节点“×”的左子节点“2100”的属性值2100、该运算符节点“×”所表征的运算符“×”、该运算符节点“×”的右子节点“18”的属性值18依次连接,得到另一子算式的代数式2100×18,之后,将可以将该代数式2100×18、=、该运算符节点“×”的属性值37800依次连接,得到另一子算式2100×18=37800。For the second operator node "×" on the left side of the third layer of the binary tree shown in Figure 2, its corresponding identifier is 1, and its attribute value is 37800 (ie 2100×18), then the operator node " ×” is divided, at this time, the attribute value 2100 of the left child node “2100” of the operator node “×”, the operator “×” represented by the operator node “×”, and the operator node “×” The attribute value 18 of the right child node "18" of "" is connected in turn to obtain the algebraic formula 2100×18 of another sub-expression. After that, the algebraic formula 2100×18, =, the attribute value of the operator node "×" can be 37800 Connect in sequence to obtain another sub-calculation 2100×18=37800.
针对图2中所示的二叉树的第二层左侧第一个运算符节点“+”,其对应的标识符为0,其属性值为(22200+37800),则该运算符节点“+”不被分割,由于该运算符 节点“+”并非是根节点,此时,则不生成子算式,即不执行任何操作。For the first operator node "+" on the left side of the second level of the binary tree shown in Figure 2, its corresponding identifier is 0, and its attribute value is (22200+37800), then the operator node "+" It is not divided, because the operator node "+" is not the root node, in this case, no sub-expression is generated, that is, no operation is performed.
针对图2中所示的二叉树的第二层左侧第二个运算符节点“+”,其对应的标识符为1,其属性值为30(即12+18),则该运算符节点“+”被分割,此时,将该运算符节点“+”的左子节点“12”的属性值12、该运算符节点“+”所表征的运算符“+”、该运算符节点“+”的右子节点“18”的属性值18依次连接,得到另一子算式的代数式12+18,之后,将可以将该代数式12+18、=、该运算符节点“+”的属性值30依次连接,得到另一子算式12+18=30。For the second operator node "+" on the left side of the second level of the binary tree shown in Figure 2, its corresponding identifier is 1, and its attribute value is 30 (ie 12+18), then the operator node " +" is divided, at this time, the attribute value 12 of the left child node "12" of the operator node "+", the operator "+" represented by the operator node "+", the operator node "+" The attribute value 18 of the right child node "18" of "" is connected in turn to obtain the algebraic formula 12+18 of another sub-expression. After that, the algebraic formula 12+18, =, the attribute value 30 of the operator node "+" Connect in sequence to obtain another sub-calculation 12+18=30.
针对图2中所示的二叉树的第一层的运算符节点“/”(即根节点)对应的标识符为0,其属性值为2000(即(22200+37800)/30),则该运算符节点“/”不被分割,由于该运算符节点“+”为根节点,此时,将该运算符节点“/”的左子节点“+”的属性值(22200+37800)、该运算符节点“/”所表征的运算符“/”、该运算符节点“/”的右子节点“+”的属性值30依次连接,得到另一子算式的代数式(22200+37800)/30,之后,将可以将该代数式(22200+37800)/30、=、该运算符节点“/”的属性值2000依次连接,得到另一子算式(22200+37800)/30=2000。The identifier corresponding to the operator node "/" (ie the root node) of the first layer of the binary tree shown in Figure 2 is 0, and its attribute value is 2000 (ie (22200+37800)/30), then the operation The operator node "/" is not divided. Since the operator node "+" is the root node, at this time, the attribute value (22200+37800) of the left child node "+" of the operator node "/", the operator The operator "/" represented by the symbol node "/" and the attribute value 30 of the right child node "+" of the operator node "/" are connected in turn to obtain the algebraic formula (22200+37800)/30 of another sub-expression, Afterwards, the algebraic formula (22200+37800)/30, =, and the attribute value 2000 of the operator node "/" can be connected in sequence to obtain another sub-calculation (22200+37800)/30=2000.
由此,可以得到以下四个子算式:1850×12=22200、2100×18=37800、12+18=30、(22200+37800)/30=2000。From this, the following four sub-calculations can be obtained: 1850×12=22200, 2100×18=37800, 12+18=30, (22200+37800)/30=2000.
下面针对上述S103中的根据待求解应用题的题干信息、综合算式以及该子算式,生成该子算式对应的解析信息的具体实施方式进行详细说明。The specific implementation of generating the analytical information corresponding to the sub-calculation according to the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation in the above S103 will be described in detail below.
具体来说,可以将题干信息、综合算式以及该子算式依次拼接后输入到多任务的序列到序列模型中,得到该子算式对应的解析信息。这样,通过一个模型即可获取到子算式的结果的含义和子算式的结果的单位,从而降低了应用题辅助学习系统的复杂度,并且,子算式的结果的含义和子算式的结果的单位相互关联,有利于互相学习,从而提升多任务的序列到序列模型生成的解析信息的准确性。Specifically, the question stem information, the comprehensive formula and the sub-calculation can be sequentially spliced and input into the multi-task sequence-to-sequence model to obtain the analytical information corresponding to the sub-calculation. In this way, the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation can be obtained through a model, thereby reducing the complexity of the auxiliary learning system for applied questions, and the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation are related to each other. , which is conducive to mutual learning, thereby improving the accuracy of the analytical information generated by the multi-task sequence-to-sequence model.
在本公开实施例中,多任务的序列到序列模型可以包括第二编码网络、第二解码网络以及第一分类器。其中,所述第二编码网络用于对拼接信息进行编码,得到编码信息,拼接信息为将待求解应用题的题干信息、综合算式以及该子算式依次拼接后所得的信息;第二解码网络,用于根据编码信息,生成该子算式的结果的含义;第一分类器,用于根据编码信息,生成该子算式的结果的单位。In an embodiment of the present disclosure, the multi-task sequence-to-sequence model may include a second encoding network, a second decoding network, and a first classifier. Wherein, the second encoding network is used to encode the splicing information to obtain the coded information, and the splicing information is the information obtained by splicing the question stem information, the comprehensive formula and the sub-calculation of the application question to be solved in turn; the second decoding network , which is used to generate the meaning of the result of the sub-calculation according to the coding information; the first classifier is used to generate the unit of the result of the sub-calculation according to the coding information.
示例地,第二编码网络可以为RNN、CNN、预训练语言模型RoBERTa等,第二解码网络可以例如是RNN、CNN、多层的Transformer等。For example, the second encoding network may be RNN, CNN, a pre-trained language model RoBERTa, etc., and the second decoding network may be, for example, RNN, CNN, a multi-layer Transformer, and the like.
上述多任务的序列到序列模型可以通过以下方式训练得到:首先,获取第二参考应用题的第二参考信息,其中,该第二参考信息包括第二参考题干信息、用于完整求解上述第二参考应用题的第二参考综合算式、用于体现第二参考应用题的求解过程的多个第二参考子算式、以及每一第二参考子算式对应的标注解析信息,第二参考应用题可以为题库中的应用题,标注解析信息包括每一第二参考子算式的结果的含义和每一第二参考子算式的结果的单位;然后,针对每一第二参考算式,通过将第二参考题干信息、第二参考综合算式以及该第二参考子算式依次拼接后的字符串作为多任务的序列到序列模型的输入,将该第二参考子算式对应的标注解析信息作为多任务的序列到序列模型的目标输出的方式进行模型训练,得到多任务的序列到序列模型。The above multi-task sequence-to-sequence model can be obtained by training in the following manner: first, acquiring second reference information of the second reference application question, wherein the second reference information includes the stem information of the second reference question, which is used to completely solve the above-mentioned first question. The second reference comprehensive formula of the second reference application question, a plurality of second reference sub-calculations used to reflect the solution process of the second reference application problem, and the annotation analysis information corresponding to each second reference sub-calculation, the second reference application problem It can be an applied question in the question bank, and the annotation analysis information includes the meaning of the result of each second reference sub-equation and the unit of the result of each second reference sub-equation; The reference question stem information, the second reference comprehensive formula, and the second reference sub-formula spliced in sequence are used as the input of the multi-task sequence-to-sequence model, and the annotation parsing information corresponding to the second reference sub-formula is used as the multi-task. The model is trained in the way of the target output of the sequence-to-sequence model, and a multi-task sequence-to-sequence model is obtained.
需要说明的是,上述第二参考应用题可以与上第一参考应用题相同,也可以不同,在本公中不作具体限定。It should be noted that the above-mentioned second reference application questions may be the same as or different from the above first reference application questions, which are not specifically limited in this disclosure.
另外,为了便于用户更加快速、彻底地理解待求解应用题解题思想,以进一步提升学习效果和效率,上述S103中生成的子算式的解析信息还包括该子算式涉及的知识点。这样,解析信息可以包括子算式的结果的含义、子算式的结果的单位、子算式涉及的知识点。In addition, in order to facilitate the user to more quickly and thoroughly understand the problem-solving idea of the application question to be solved, so as to further improve the learning effect and efficiency, the analysis information of the sub-calculation generated in the above S103 also includes the knowledge points involved in the sub-calculation. In this way, the analysis information may include the meaning of the result of the sub-expression, the unit of the result of the sub-expression, and the knowledge points involved in the sub-expression.
示例地,针对上述示例中的计算服装厂日产量的待求解应用题,上述四个子算式(1850×12=22200、2100×18=37800、12+18=30、(22200+37800)/30=2000)中的每一子算式,通过上述S103生成的解析信息如下表2中所示:Illustratively, for the application problem to be solved for calculating the daily output of a garment factory in the above example, the above four sub-formulas (1850×12=22200, 2100×18=37800, 12+18=30, (22200+37800)/30= 2000), the analytical information generated by above-mentioned S103 is shown in the following table 2:
表2 子算式对应的解析信息Table 2 Analytical information corresponding to sub-expressions
Figure PCTCN2022079530-appb-000001
Figure PCTCN2022079530-appb-000001
示例地,上述讲解信息可以为讲解文本,根据上述四个子算式和表2中所示的每一子算式对应的解析信息,通过规则模板生成的该四个子算式对应的讲解文本如下:Exemplarily, the above-mentioned explanation information can be an explanation text. According to the above-mentioned four sub-calculations and the analysis information corresponding to each sub-calculation shown in Table 2, the explanation text corresponding to the four sub-calculations generated by the rule template is as follows:
亲爱的同学,我们一起来看下这道题吧!Dear students, let's take a look at this question together!
为了求得每天做的衣服,我们首先需要求前12天做的衣服。通过公式“平均数×个数=总数(×、÷)”,可以算出前12天做的衣服为1850×12=22200套。In order to get the clothes made every day, we first need to ask for the clothes made in the previous 12 days. Through the formula "average × number = total number (×, ÷)", it can be calculated that the clothes made in the first 12 days are 1850 × 12 = 22200 sets.
接下来,我们求后18天做的衣服。采用跟上一步相同的公式,可以算出后18天做的衣服为2100×18=37800套。Next, we asked for clothes made after 18 days. Using the same formula as the previous step, it can be calculated that the clothes made in the next 18 days are 2100×18=37800 sets.
然后,我们求一下总的天数。通过公式部分量1+部分量2=总量,可以算出,总的天数为12+18=30天。Then, we ask for the total number of days. Through the formula Partial Quantity 1 + Partial Quantity 2 = Total, it can be calculated that the total number of days is 12+18=30 days.
最后,我们来求每天做的衣服。用前12天做的衣服22200套,后18天做的衣服37800套和总的天数30天进行计算,得出每天做的衣服为(22200+37800)/30=2000套。Finally, let's ask for the clothes we make every day. Using 22,200 sets of clothes made in the first 12 days, 37,800 sets of clothes made in the next 18 days and a total of 30 days to calculate, the clothes made every day are (22,200+37,800)/30=2,000 sets.
这样,这道题就全部解出来啦。不要忘了写答句哦!In this way, the problem is all solved. Don't forget to write your answer!
在解析信息包括子算式的结果的含义、子算式的结果的单位以及子算式涉及的知识点时,上述S103可以根据题干信息、综合算式以及该子算式,通过以下方式生成该子算式对应的解析信息:将题干信息、综合算式以及该子算式依次拼接后输入到多任务的序列到序列模型中,得到该子算式对应的解析信息。这样,通过一个模型即可获取到子算式的结果的含义、子算式的结果的单位、子算式涉及的知识点三种信息,从而降低了应用题辅助学习系统的复杂度,并且,子算式的结果的含义、子算式的结果的单位、子算式涉及的知识点三者相互关联,有利于互相学习,从而提升多任务的序列到序列模型生成的解析信息的准确性。When the analysis information includes the meaning of the result of the sub-calculation, the unit of the result of the sub-calculation, and the knowledge points involved in the sub-calculation, the above S103 can generate the corresponding sub-calculation in the following manner according to the question stem information, the comprehensive calculation formula and the sub-calculation. Parsing information: The question stem information, the comprehensive formula and the sub-calculation are sequentially spliced and input into the multi-task sequence-to-sequence model to obtain the analytical information corresponding to the sub-calculation. In this way, three kinds of information, including the meaning of the result of the sub-calculation, the unit of the result of the sub-calculation, and the knowledge points involved in the sub-calculation, can be obtained through one model, thereby reducing the complexity of the auxiliary learning system for application questions. The meaning of the result, the unit of the result of the sub-calculation, and the knowledge points involved in the sub-calculation are related to each other, which is conducive to mutual learning, thereby improving the accuracy of the analytical information generated by the multi-task sequence-to-sequence model.
在本公开实施例中,多任务的序列到序列模型除了包括第二编码网络、第二解码网络、第一分类器外,还包括第二分类器,其中,第二分类器用于根据上述编码信息,生成子算式涉及的知识点。In the embodiment of the present disclosure, the multi-task sequence-to-sequence model includes, in addition to the second encoding network, the second decoding network, and the first classifier, a second classifier, wherein the second classifier is used for encoding information according to the above-mentioned encoding information. , and generate the knowledge points involved in the sub-calculation.
此时,该多任务的序列到序列模型在训练过程中,上述每一第二参考子算式对应的标注解析信息除了包括每一第二参考子算式的结果的含义、每一第二参考子算式的结果的单位外,还包括每一第二参考子算式涉及的知识点。At this time, during the training process of the multi-task sequence-to-sequence model, the annotation parsing information corresponding to each second reference sub-formula includes the meaning of the result of each second reference sub-formula, the meaning of each second reference sub-formula In addition to the unit of the result, it also includes the knowledge points involved in each second reference sub-calculation.
此外,为了便于用户更加快速、彻底地理解待求解应用题解题思想,以进一步提 升学习效果和效率,上述讲解信息还可以包括待求解应用题的题干信息中、在综合算式中的代数式中出现的每一数字的含义。具体来说,如图3所示,在上述S104之前,上述方法还包括以下步骤S105:In addition, in order to facilitate the user to more quickly and thoroughly understand the problem-solving ideas of the applied problems to be solved, and to further improve the learning effect and efficiency, the above explanation information can also include the stem information of the applied problems to be solved, and the algebraic expressions in the comprehensive formulas. The meaning of each number that appears. Specifically, as shown in FIG. 3, before the above-mentioned S104, the above-mentioned method further includes the following step S105:
在S105中,根据待求解应用题的题干信息,生成待求解应用题的题干信息中、在综合算式中的代数式中出现的每一数字的含义。In S105, according to the question stem information of the application question to be solved, the meaning of each number appearing in the algebraic expression in the comprehensive calculation formula in the question stem information of the application question to be solved is generated.
示例地,针对上述示例中的计算服装厂日产量的待求解应用题,其题干信息中、在综合算式(1850×12+2100×18)/(12+18)=2000中的代数式(1850×12+2100×18)/(12+18)中出现的数字包括1850、12、2100、18。则通过上述步骤确定出的各数字的含义如下表3所示:For example, for the application question to be solved for calculating the daily output of a garment factory in the above example, in the question stem information, the algebraic formula (1850 in the comprehensive formula (1850×12+2100×18)/(12+18)=2000) The numbers appearing in ×12+2100×18)/(12+18) include 1850, 12, 2100, and 18. The meaning of each number determined through the above steps is shown in Table 3 below:
表3 数字含义表Table 3 Number meaning table
数字number 数字的含义meaning of numbers
1212 none
1818 none
18501850 6月前12天平均每天做的衣服数量Average number of clothes made per day in the first 12 days of June
21002100 6月后18天平均每天做的衣服数量Average number of clothes made per day for 18 days after June
这样,上述S104可以根据多个子算式、每一子算式对应的解析信息以及每一数字的含义,生成多个子算式对应的讲解信息。In this way, the above S104 can generate explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations, the analysis information corresponding to each sub-calculation, and the meaning of each number.
示例地,上述讲解信息可以为讲解文本,根据上述四个子算式、表2中所示的每一子算式对应的解析信息以及表3中所示的每一数字的含义,通过规则模板生成的该四个子算式对应的讲解文本如下:Exemplarily, the above-mentioned explanation information can be an explanation text, according to the above-mentioned four sub-calculations, the corresponding analysis information of each sub-calculation shown in Table 2, and the meaning of each number shown in Table 3, this generated by the rule template. The explanation texts corresponding to the four sub-calculations are as follows:
亲爱的同学,我们一起来看下这道题吧!Dear students, let's take a look at this question together!
为了求得每天做的衣服,我们首先需要求前12天做的衣服。我们知道,6月前12天平均每天做的衣服数量为1850,通过公式“平均数×个数=总数(×、÷)”,可以算出,前12天做的衣服为1850×12=22200套。In order to get the clothes made every day, we first need to ask for the clothes made in the previous 12 days. We know that the average number of clothes made every day in the first 12 days of June is 1850. Through the formula "average × number = total number (×, ÷)", we can calculate that the clothes made in the first 12 days are 1850 × 12 = 22200 sets .
接下来,我们求后18天做的衣服。采用跟上一步相同的公式,用6月后18天平均每天做的衣服数量为2100与18进行计算,得出后18天做的衣服为2100×18=37800套。Next, we asked for clothes made after 18 days. Using the same formula as the previous step, the average number of clothes made every day in the 18 days after June is 2100 and 18 for calculation, and the clothes made in the next 18 days are 2100×18=37800 sets.
然后,我们求一下总的天数。通过公式部分量1+部分量2=总量,可以算出,总的天数为12+18=30天。Then, we ask for the total number of days. Through the formula Partial Quantity 1 + Partial Quantity 2 = Total, it can be calculated that the total number of days is 12+18=30 days.
最后,我们来求每天做的衣服。用前12天做的衣服22200套,后18天做的衣服37800套和总的天数30天进行计算,得出每天做的衣服为(22200+37800)/30=2000套。Finally, let's ask for the clothes we make every day. Using 22,200 sets of clothes made in the first 12 days, 37,800 sets of clothes made in the next 18 days and a total of 30 days to calculate, the clothes made every day are (22,200+37,800)/30=2,000 sets.
这样,这道题就全部解出来啦。不要忘了写答句哦!In this way, the problem is all solved. Don't forget to write your answer!
下面针对上述S105的根据待求解应用题的题干信息,生成待求解应用题的题干信息中、在综合算式中的代数式中出现的每一数字的含义的具体实施方式进行详细说明。具体来说,可以通过以下步骤来实现:The specific implementation of generating the meaning of each number appearing in the algebraic formula in the comprehensive formula in the question stem information of the applied problem to be solved according to the question stem information of the applied question to be solved in the above S105 will be described in detail below. Specifically, it can be achieved through the following steps:
首先,将题干信息中的每一数字替换为预设标识;然后,根据替换后所得的题干信息,生成每一数字对应的编码向量;最后,针对每一数字,根据该数字对应的编码向量,生成该数字的含义。First, replace each number in the question stem information with a preset identifier; then, generate a coding vector corresponding to each number according to the question stem information obtained after the replacement; finally, for each number, according to the code corresponding to the number A vector that generates the meaning of that number.
其中,可以通过序列到序列的生成模型来根据替换后的题干信息,生成每一数字对应的编码向量,并针对每一数字,根据该数字对应的编码向量,生成该数字的含义。具体来说,该生成模型可以包括编码网络和解码网络,其中,编码网络用于根据替换后的题干信息,生成每一数字对应的编码向量,解码网络用于针对每一数字,根据该数字对应的编码向量,生成该数字的含义。示例地,上述生成模型可以例如是BERT模型。Wherein, a sequence-to-sequence generation model can be used to generate a coding vector corresponding to each digit according to the replaced question stem information, and for each digit, according to the coding vector corresponding to the digit, the meaning of the digit is generated. Specifically, the generation model may include an encoding network and a decoding network, wherein the encoding network is used to generate an encoding vector corresponding to each digit according to the replaced stem information, and the decoding network is used for each digit, according to the digit The corresponding encoded vector, generating the meaning of the number. Illustratively, the above-mentioned generative model may be, for example, a BERT model.
另外,上述生成模型可以通过以下方式训练得到:首先,获取第三参考应用题的第三参考信息,其中,该第三参考信息包括第三参考题干信息和用于完整求解上述第三参考应用题的第三参考综合算式,第三参考应用题可以为题库中的应用题;然后,将第三参考题干信息中、在第三参考综合算式中的代数式中出现的每一参考数字替换为预设标识,并标注该每一参考数字的含义;最后,针对每一参考数字,通过将替换后所得的第三参考题干信息作为生成模型的输入,将标注的该参考数字的含义作为生成模型的目标输出的方式进行模型训练,得到生成模型。In addition, the above-mentioned generative model can be obtained by training in the following manner: first, obtain third reference information of the third reference application question, wherein the third reference information includes the stem information of the third reference question and the information used to completely solve the above-mentioned third reference application The third reference comprehensive formula of the question, the third reference applied problem can be the applied problem in the question bank; then, in the third reference question stem information, each reference number that appears in the algebraic expression in the third reference comprehensive formula is replaced by Preset identification, and mark the meaning of each reference number; finally, for each reference number, by using the third reference stem information obtained after replacement as the input of the generation model, the meaning of the marked reference number is used as the generation model The model is trained in the way of the target output of the model, and the generative model is obtained.
需要说明的是,上述第三参考应用题可以与上第一参考应用题相同,也可以不同,即求解模型的训练样本可以与生成模型的训练样本相同,也可以不同,并且,上述第三参考应用题可以与上第二参考应用题相同,也可以不同,即生成模型的训练样本可以与上述多任务的序列到序列模型的训练样本相同,也可以不同,在本公中均不作具体限定。It should be noted that the above-mentioned third reference application questions may be the same as or different from the above first reference application questions, that is, the training samples for solving the model may be the same as or different from the training samples for generating the model. The application questions may be the same as or different from the second reference application questions above, that is, the training samples of the generated model may be the same as or different from the training samples of the above-mentioned multi-task sequence-to-sequence model, which are not specifically limited in this document.
图4是根据本公开另一示例性实施例示出的一种信息处理装置的框图。如图4所 示,该装置400包括:获取模块401,用于获取待求解应用题的题干信息;第一生成模块402,用于根据所述获取模块401获取到的所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;第二生成模块403,用于针对所述第一生成模块402生成的每一所述子算式,根据所述获取模块401获取到的所述待求解应用题的题干信息、所述第一生成模块402生成的所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;第三生成模块404,用于根据所述第一生成模块402生成的所述多个子算式和所述第二生成模块403生成的每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。FIG. 4 is a block diagram of an information processing apparatus according to another exemplary embodiment of the present disclosure. As shown in FIG. 4 , the device 400 includes: an acquisition module 401 for acquiring the question stem information of the application questions to be solved; a first generation module 402 for obtaining the application questions to be solved according to the acquisition module 401 the question stem information, and generate a comprehensive formula for completely solving the applied problem to be solved and multiple sub-calculations for reflecting the solution process of the applied problem to be solved, wherein, the multiple sub-calculations meet the requirements of the comprehensive formula Operation logic; the second generation module 403 is configured to, for each of the sub-calculations generated by the first generation module 402, obtain the question stem information of the application question to be solved obtained by the obtaining module 401, the The comprehensive calculation formula and the sub-calculation generated by the first generation module 402 generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the result of the sub-calculation. unit; the third generation module 404 is configured to generate the multiple sub-expressions according to the plurality of sub-expressions generated by the first generation module 402 and the analysis information corresponding to each of the sub-expressions generated by the second generation module 403 The explanation information corresponding to the sub-calculation.
在上述技术方案中,在获取到待求解应用题的题干信息后,根据其生成用于完整求解上述待求解应用题的综合算式和用于体现该待求解应用题的求解过程的多个子算式。这样,不依赖于数据库,能够对用户输入的待求解应用题进行针对性的解答,提升了解答的准确性。另外,不但能够提供用于完整求解上述待求解应用题的综合算式,还能够给出用于体现该待求解应用题的求解过程的多个子算式以及多个子算式对应的讲解信息,并且,讲解信息中包括各子算式的结果的含义和单位,便于用户快速、彻底理解待求解应用题的解题思想,提升学习效果和效率。In the above technical solution, after obtaining the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the above applied problem to be solved and a plurality of sub-calculations used to reflect the solution process of the applied problem to be solved are generated according to it. . In this way, without relying on the database, the application problem to be solved input by the user can be answered in a targeted manner, which improves the accuracy of the answer. In addition, it can not only provide a comprehensive formula for completely solving the above-mentioned applied problem to be solved, but also can provide a plurality of sub-calculations used to reflect the solving process of the applied problem to be solved and the explanation information corresponding to the multiple sub-calculations, and the explanation information It includes the meaning and unit of the results of each sub-calculation, which is convenient for users to quickly and thoroughly understand the problem-solving idea of the application problem to be solved, and improve the learning effect and efficiency.
可选地,所述第一生成模块402包括:第一输入子模块,用于将所述待求解应用题的题干信息输入求解模型的第一编码网络中,得到编码后的题干信息,其中,所述求解模型包括所述第一编码网络和第一解码网络;第二输入子模块,用于将所述编码后的题干信息输入所述第一解码网络中,得到目标表达式和用于表征所述目标表达式中各运算符是否作为一子算式中的唯一运算符的标识信息;第一生成子模块,用于根据所述目标表达式,生成所述综合算式,并根据所述目标表达式和所述标识信息,生成所述多个子算式,其中,所述综合算式中的代数式由所述目标表达式中的各元素构成。Optionally, the first generation module 402 includes: a first input sub-module for inputting the question stem information of the application question to be solved into the first coding network of the solving model to obtain the encoded question stem information, Wherein, the solution model includes the first encoding network and the first decoding network; the second input sub-module is used to input the encoded question stem information into the first decoding network to obtain the target expression and Identification information for characterizing whether each operator in the target expression is used as a unique operator in a sub-calculation; the first generation sub-module is used to generate the comprehensive calculation formula according to the target expression, and according to the The target expression and the identification information are used to generate the plurality of sub-expressions, wherein the algebraic expression in the comprehensive expression is composed of elements in the target expression.
可选地,所述目标表达式为前缀表达式。Optionally, the target expression is a prefix expression.
可选地,所述解析信息还包括所述子算式涉及的知识点。Optionally, the analysis information further includes knowledge points involved in the sub-calculation.
可选地,所述第二生成模块403用于将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后输入到多任务的序列到序列模型中,得到所述子算式对应的解析信息。Optionally, the second generation module 403 is used to input the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation into the multi-task sequence-to-sequence model in turn to obtain the result. The analytical information corresponding to the prescriptive expression.
可选地,所述多任务的序列到序列模型包括第二编码网络、第二解码网络、第一分类器以及第二分类器;其中,所述第二编码网络,用于对拼接信息进行编码,得到编码信息,所述拼接信息为将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后所得的信息;所述第二解码网络,用于根据所述编码信息,生成所述子算式的结果的含义;所述第一分类器,用于根据所述编码信息,生成所述子算式的结果的单位;所述第二分类器,用于根据所述编码信息,生成所述子算式涉及的知识点。Optionally, the multi-task sequence-to-sequence model includes a second encoding network, a second decoding network, a first classifier, and a second classifier; wherein the second encoding network is used to encode the splicing information , obtain coding information, and the splicing information is the information obtained by splicing the question stem information of the applied question to be solved, the comprehensive formula and the sub-calculation in sequence; the second decoding network is used for according to the coding information, to generate the meaning of the result of the sub-calculation; the first classifier is used to generate the unit of the result of the sub-calculation according to the coding information; the second classifier is used to generate the unit of the result of the sub-calculation according to the Encoding information to generate knowledge points involved in the sub-calculation.
可选地,所述装置400还包括:第四生成模块,用于根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中、在所述综合算式中的代数式中出现的每一数字的含义;所述第三生成模块404用于根据所述多个子算式、每一所述子算式对应的解析信息以及每一所述数字的含义,生成所述多个子算式对应的讲解信息。Optionally, the apparatus 400 further includes: a fourth generation module, configured to generate, according to the question stem information of the applied question to be solved, the question stem information of the applied question to be solved and in the comprehensive formula. The meaning of each number appearing in the algebraic formula; the third generation module 404 is configured to generate the plurality of sub-calculations according to the plurality of sub-calculations, the analytical information corresponding to each of the sub-calculations, and the meaning of each of the numbers The explanation information corresponding to the formula.
可选地,所述第四生成模块包括:替换子模块,用于将所述待求解应用题的题干信息中的每一所述数字替换为预设标识;第二生成子模块,用于根据替换后所得的题干信息,生成每一所述数字对应的编码向量;第三生成子模块,用于针对每一所述数字,根据所述数字对应的编码向量,生成所述数字的含义。Optionally, the fourth generation module includes: a replacement submodule for replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; the second generation submodule for According to the question stem information obtained after the replacement, a coding vector corresponding to each of the numbers is generated; the third generation sub-module is used to generate the meaning of the numbers according to the coding vector corresponding to the numbers for each of the numbers .
本公开实施例还提供一种计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现本公开实施例提供的上述信息处理方法的步骤。Embodiments of the present disclosure further provide a computer-readable medium on which a computer program is stored, and when the program is executed by a processing apparatus, implements the steps of the above-mentioned information processing method provided by the embodiments of the present disclosure.
下面参考图5,其示出了适于用来实现本公开实施例的电子设备(例如终端设备或服务器)500的结构示意图。本公开实施例中的终端设备可以包括但不限于诸如移动电话、笔记本电脑、数字广播接收器、PDA(个人数字助理)、PAD(平板电脑)、PMP(便携式多媒体播放器)、车载终端(例如车载导航终端)等等的移动终端以及诸如数字TV、台式计算机等等的固定终端。图5示出的电子设备仅仅是一个示例,不应对本公开实施例的功能和使用范围带来任何限制。Referring next to FIG. 5 , it shows a schematic structural diagram of an electronic device (eg, a terminal device or a server) 500 suitable for implementing an embodiment of the present disclosure. Terminal devices in the embodiments of the present disclosure may include, but are not limited to, such as mobile phones, notebook computers, digital broadcast receivers, PDAs (personal digital assistants), PADs (tablets), PMPs (portable multimedia players), vehicle-mounted terminals (eg, mobile terminals such as in-vehicle navigation terminals), etc., and stationary terminals such as digital TVs, desktop computers, and the like. The electronic device shown in FIG. 5 is only an example, and should not impose any limitation on the function and scope of use of the embodiments of the present disclosure.
如图5所示,电子设备500可以包括处理装置(例如中央处理器、图形处理器等)501,其可以根据存储在只读存储器(ROM)502中的程序或者从存储装置508加载到随机访问存储器(RAM)503中的程序而执行各种适当的动作和处理。在RAM 503中,还存储有电子设备500操作所需的各种程序和数据。处理装置501、ROM 502以及RAM 503通过总线504彼此相连。输入/输出(I/O)接口505也连接至总线504。As shown in FIG. 5 , an electronic device 500 may include a processing device (eg, a central processing unit, a graphics processor, etc.) 501 that may be loaded into random access according to a program stored in a read only memory (ROM) 502 or from a storage device 508 Various appropriate actions and processes are executed by the programs in the memory (RAM) 503 . In the RAM 503, various programs and data required for the operation of the electronic device 500 are also stored. The processing device 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504. An input/output (I/O) interface 505 is also connected to bus 504 .
通常,以下装置可以连接至I/O接口505:包括例如触摸屏、触摸板、键盘、鼠标、摄像头、麦克风、加速度计、陀螺仪等的输入装置506;包括例如液晶显示器(LCD)、 扬声器、振动器等的输出装置507;包括例如磁带、硬盘等的存储装置508;以及通信装置509。通信装置509可以允许电子设备500与其他设备进行无线或有线通信以交换数据。虽然图5示出了具有各种装置的电子设备500,但是应理解的是,并不要求实施或具备所有示出的装置。可以替代地实施或具备更多或更少的装置。Typically, the following devices can be connected to the I/O interface 505: input devices 506 including, for example, a touch screen, touchpad, keyboard, mouse, camera, microphone, accelerometer, gyroscope, etc.; including, for example, a liquid crystal display (LCD), speakers, vibration An output device 507 such as a computer; a storage device 508 including, for example, a magnetic tape, a hard disk, etc.; and a communication device 509 . Communication means 509 may allow electronic device 500 to communicate wirelessly or by wire with other devices to exchange data. While FIG. 5 shows electronic device 500 having various means, it should be understood that not all of the illustrated means are required to be implemented or provided. More or fewer devices may alternatively be implemented or provided.
特别地,根据本公开的实施例,上文参考流程图描述的过程可以被实现为计算机软件程序。例如,本公开的实施例包括一种计算机程序产品,其包括承载在非暂态计算机可读介质上的计算机程序,该计算机程序包含用于执行流程图所示的方法的程序代码。在这样的实施例中,该计算机程序可以通过通信装置509从网络上被下载和安装,或者从存储装置508被安装,或者从ROM 502被安装。在该计算机程序被处理装置501执行时,执行本公开实施例的方法中限定的上述功能。In particular, according to embodiments of the present disclosure, the processes described above with reference to the flowcharts may be implemented as computer software programs. For example, embodiments of the present disclosure include a computer program product comprising a computer program carried on a non-transitory computer readable medium, the computer program containing program code for performing the method illustrated in the flowchart. In such an embodiment, the computer program may be downloaded and installed from the network via the communication device 509, or from the storage device 508, or from the ROM 502. When the computer program is executed by the processing apparatus 501, the above-mentioned functions defined in the methods of the embodiments of the present disclosure are executed.
需要说明的是,本公开上述的计算机可读介质可以是计算机可读信号介质或者计算机可读存储介质或者是上述两者的任意组合。计算机可读存储介质例如可以是——但不限于——电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。计算机可读存储介质的更具体的例子可以包括但不限于:具有一个或多个导线的电连接、便携式计算机磁盘、硬盘、随机访问存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑磁盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。在本公开实施例中,计算机可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。而在本公开实施例中,计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了计算机可读的程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。计算机可读信号介质还可以是计算机可读存储介质以外的任何计算机可读介质,该计算机可读信号介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。计算机可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于:电线、光缆、RF(射频)等等,或者上述的任意合适的组合。It should be noted that the computer-readable medium mentioned above in the present disclosure may be a computer-readable signal medium or a computer-readable storage medium, or any combination of the above two. The computer-readable storage medium can be, for example, but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus or device, or a combination of any of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections with one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), fiber optics, portable compact disk read only memory (CD-ROM), optical storage devices, magnetic storage devices, or any suitable combination of the foregoing. In the embodiments of the present disclosure, a computer-readable storage medium may be any tangible medium that contains or stores a program that can be used by or in conjunction with an instruction execution system, apparatus, or device. Rather, in embodiments of the present disclosure, a computer-readable signal medium may include a data signal in baseband or propagated as part of a carrier wave, carrying computer-readable program code therein. Such propagated data signals may take a variety of forms, including but not limited to electromagnetic signals, optical signals, or any suitable combination of the foregoing. A computer-readable signal medium can also be any computer-readable medium other than a computer-readable storage medium that can transmit, propagate, or transport the program for use by or in connection with the instruction execution system, apparatus, or device . Program code embodied on a computer readable medium may be transmitted using any suitable medium including, but not limited to, electrical wire, optical fiber cable, RF (radio frequency), etc., or any suitable combination of the foregoing.
在一些实施方式中,终端设备、服务器可以利用诸如HTTP(HyperText Transfer Protocol,超文本传输协议)之类的任何当前已知或未来研发的网络协议进行通信,并且可以与任意形式或介质的数字数据通信(例如,通信网络)互连。通信网络的示例包括局域网(“LAN”),广域网(“WAN”),网际网(例如,互联网)以及端对 端网络(例如,ad hoc端对端网络),以及任何当前已知或未来研发的网络。In some embodiments, terminal devices and servers can use any currently known or future developed network protocols such as HTTP (HyperText Transfer Protocol) to communicate, and can communicate with digital data in any form or medium Communication (eg, a communication network) interconnects. Examples of communication networks include local area networks ("LAN"), wide area networks ("WAN"), the Internet (eg, the Internet), and peer-to-peer networks (eg, ad hoc peer-to-peer networks), as well as any currently known or future development network of.
上述计算机可读介质可以是上述电子设备中所包含的;也可以是单独存在,而未装配入该电子设备中。The above-mentioned computer-readable medium may be included in the above-mentioned electronic device; or may exist alone without being assembled into the electronic device.
上述计算机可读介质承载有一个或者多个程序,当上述一个或者多个程序被该电子设备执行时,使得该电子设备:获取待求解应用题的题干信息;根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;针对每一所述子算式,根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。The above-mentioned computer-readable medium carries one or more programs, and when the above-mentioned one or more programs are executed by the electronic device, the electronic device: obtains the question stem information of the application question to be solved; Question stem information to generate a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved, wherein, the plurality of sub-calculations conform to the operation of the comprehensive formula Logic; for each of the sub-expressions, generate analysis information corresponding to the sub-expressions according to the question stem information of the applied question to be solved, the comprehensive expression and the sub-expressions, wherein the analysis information includes all Describe the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation; according to the multiple sub-calculations and the analysis information corresponding to each of the sub-calculations, generate explanation information corresponding to the multiple sub-calculations.
可以以一种或多种程序设计语言或其组合来编写用于执行本公开的操作的计算机程序代码,上述程序设计语言包括但不限于面向对象的程序设计语言—诸如Java、Smalltalk、C++,还包括常规的过程式程序设计语言——诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算机上执行、部分地在用户计算机上执行、作为一个独立的软件包执行、部分在用户计算机上部分在远程计算机上执行、或者完全在远程计算机或服务器上执行。在涉及远程计算机的情形中,远程计算机可以通过任意种类的网络——包括局域网(LAN)或广域网(WAN)——连接到用户计算机,或者,可以连接到外部计算机(例如利用因特网服务提供商来通过因特网连接)。Computer program code for performing operations of the present disclosure may be written in one or more programming languages, including but not limited to object-oriented programming languages—such as Java, Smalltalk, C++, and This includes conventional procedural programming languages - such as the "C" language or similar programming languages. The program code may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer, or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or may be connected to an external computer (eg, using an Internet service provider to via Internet connection).
附图中的流程图和框图,图示了按照本公开各种实施例的系统、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段、或代码的一部分,该模块、程序段、或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个接连地表示的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或操作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code that contains one or more logical functions for implementing the specified functions executable instructions. It should also be noted that, in some alternative implementations, the functions noted in the blocks may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It is also noted that each block of the block diagrams and/or flowchart illustrations, and combinations of blocks in the block diagrams and/or flowchart illustrations, can be implemented in dedicated hardware-based systems that perform the specified functions or operations , or can be implemented in a combination of dedicated hardware and computer instructions.
描述于本公开实施例中所涉及到的模块可以通过软件的方式实现,也可以通过硬件的方式来实现。其中,模块的名称在某种情况下并不构成对该模块本身的限定,例 如,获取模块还可以被描述为“获取待求解应用题的题干信息的模块”。The modules involved in the embodiments of the present disclosure may be implemented in software or hardware. Wherein, the name of the module does not constitute a limitation of the module itself under certain circumstances, for example, the acquisition module can also be described as "a module for acquiring the question stem information of the applied question to be solved".
本文中以上描述的功能可以至少部分地由一个或多个硬件逻辑部件来执行。例如,非限制性地,可以使用的示范类型的硬件逻辑部件包括:现场可编程门阵列(FPGA)、专用集成电路(ASIC)、专用标准产品(ASSP)、片上系统(SOC)、复杂可编程逻辑设备(CPLD)等等。The functions described herein above may be performed, at least in part, by one or more hardware logic components. For example, without limitation, exemplary types of hardware logic components that may be used include: Field Programmable Gate Arrays (FPGAs), Application Specific Integrated Circuits (ASICs), Application Specific Standard Products (ASSPs), Systems on Chips (SOCs), Complex Programmable Logical Devices (CPLDs) and more.
在本公开的上下文中,机器可读介质可以是有形的介质,其可以包含或存储以供指令执行系统、装置或设备使用或与指令执行系统、装置或设备结合地使用的程序。机器可读介质可以是机器可读信号介质或机器可读储存介质。机器可读介质可以包括但不限于电子的、磁性的、光学的、电磁的、红外的、或半导体系统、装置或设备,或者上述内容的任何合适组合。机器可读存储介质的更具体示例会包括基于一个或多个线的电气连接、便携式计算机盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦除可编程只读存储器(EPROM或快闪存储器)、光纤、便捷式紧凑盘只读存储器(CD-ROM)、光学储存设备、磁储存设备、或上述内容的任何合适组合。In the context of the present disclosure, a machine-readable medium may be a tangible medium that may contain or store a program for use by or in connection with the instruction execution system, apparatus or device. The machine-readable medium may be a machine-readable signal medium or a machine-readable storage medium. Machine-readable media may include, but are not limited to, electronic, magnetic, optical, electromagnetic, infrared, or semiconductor systems, devices, or devices, or any suitable combination of the foregoing. More specific examples of machine-readable storage media would include one or more wire-based electrical connections, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable programmable read only memory (EPROM or flash memory), fiber optics, compact disk read only memory (CD-ROM), optical storage, magnetic storage, or any suitable combination of the foregoing.
根据本公开的一个或多个实施例,示例1提供了一种信息处理方法,包括:获取待求解应用题的题干信息;根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;针对每一所述子算式,根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。According to one or more embodiments of the present disclosure, Example 1 provides an information processing method, including: acquiring question stem information of an applied question to be solved; The comprehensive formula of the applied problem to be solved and a plurality of sub-calculations used to embody the solution process of the applied problem to be solved, wherein the plurality of sub-calculations conform to the operation logic of the comprehensive formula; for each of the sub-calculations , according to the question stem information of the application question to be solved, the comprehensive calculation formula and the sub-calculation, generate the analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and all The unit of the result of the sub-calculation is described; according to the plurality of sub-calculations and the analysis information corresponding to each of the sub-calculations, explanation information corresponding to the plurality of sub-calculations is generated.
根据本公开的一个或多个实施例,示例2提供了示例1的方法,所述根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,包括:将所述待求解应用题的题干信息输入求解模型的第一编码网络中,得到编码后的题干信息,其中,所述求解模型包括所述第一编码网络和第一解码网络;将所述编码后的题干信息输入所述第一解码网络中,得到目标表达式和用于表征所述目标表达式中各运算符是否作为一子算式中的唯一运算符的标识信息;根据所述目标表达式,生成所述综合算式,并根据所述目标表达式和所述标识信息,生成所述多个子算式,其中,所述综合算式中的代数式由所述目标表达式中的各元素构成。According to one or more embodiments of the present disclosure, Example 2 provides the method of Example 1, wherein according to the question stem information of the to-be-solved applied question, a comprehensive formula for completely solving the to-be-solved applied question and the method of The multiple sub-calculations that embody the solution process of the application question to be solved include: inputting the question stem information of the application question to be solved into the first coding network of the solving model, and obtaining the encoded question stem information, wherein the The solution model includes the first encoding network and the first decoding network; the encoded question stem information is input into the first decoding network to obtain the target expression and the operations used to characterize the target expression. Whether the operator is used as the identification information of the unique operator in a sub-calculation; according to the target expression, generate the comprehensive calculation formula, and according to the target expression and the identification information, generate the multiple sub-calculations, wherein, The algebraic expression in the comprehensive expression is composed of the elements in the target expression.
根据本公开的一个或多个实施例,示例3提供了示例2的方法,所述目标表达式为前缀表达式。According to one or more embodiments of the present disclosure, Example 3 provides the method of Example 2, the target expression being a prefix expression.
根据本公开的一个或多个实施例,示例4提供了示例1的方法,所述解析信息还包括所述子算式涉及的知识点。According to one or more embodiments of the present disclosure, Example 4 provides the method of Example 1, and the parsing information further includes knowledge points involved in the sub-calculation.
根据本公开的一个或多个实施例,示例5提供了示例4的方法,所述根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,包括:将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后输入到多任务的序列到序列模型中,得到所述子算式对应的解析信息。According to one or more embodiments of the present disclosure, Example 5 provides the method of Example 4, wherein according to the question stem information of the applied question to be solved, the comprehensive formula and the sub-calculation, the corresponding sub-calculation is generated The analytical information, including: the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation are sequentially spliced and input into the multi-task sequence-to-sequence model, and the analytical information corresponding to the sub-calculation is obtained. .
根据本公开的一个或多个实施例,示例6提供了示例5的方法,所述多任务的序列到序列模型包括第二编码网络、第二解码网络、第一分类器以及第二分类器;其中,所述第二编码网络,用于对拼接信息进行编码,得到编码信息,所述拼接信息为将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后所得的信息;所述第二解码网络,用于根据所述编码信息,生成所述子算式的结果的含义;所述第一分类器,用于根据所述编码信息,生成所述子算式的结果的单位;所述第二分类器,用于根据所述编码信息,生成所述子算式涉及的知识点。According to one or more embodiments of the present disclosure, Example 6 provides the method of Example 5, the multi-task sequence-to-sequence model comprising a second encoding network, a second decoding network, a first classifier, and a second classifier; Wherein, the second encoding network is used to encode the splicing information to obtain the coded information, and the splicing information is obtained by splicing the stem information of the application question to be solved, the comprehensive formula and the sub-calculation in sequence. the obtained information; the second decoding network is used for generating the meaning of the result of the sub-calculation according to the coding information; the first classifier is used for generating the sub-calculation according to the coding information The unit of the result; the second classifier is configured to generate the knowledge points involved in the sub-calculation according to the encoded information.
根据本公开的一个或多个实施例,示例7提供了示例1-6中任一项的方法,所述方法还包括:根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中、在所述综合算式中的代数式中出现的每一数字的含义;所述根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息,包括:根据所述多个子算式、每一所述子算式对应的解析信息以及每一所述数字的含义,生成所述多个子算式对应的讲解信息。According to one or more embodiments of the present disclosure, Example 7 provides the method of any one of Examples 1-6, the method further comprising: generating the application to be solved according to the question stem information of the application to be solved The meaning of each number that appears in the algebraic formula in the question stem information of the question; the multiple sub-calculations are generated according to the multiple sub-calculations and the analytical information corresponding to each of the sub-calculations The corresponding explanation information includes: generating explanation information corresponding to the plurality of sub-calculations according to the plurality of sub-calculations, the analysis information corresponding to each of the sub-calculations, and the meaning of each of the numbers.
根据本公开的一个或多个实施例,示例8提供了示例7的方法,所述根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中、在所述综合算式中的代数式中出现的每一数字的含义,包括:将所述待求解应用题的题干信息中的每一所述数字替换为预设标识;根据替换后所得的题干信息,生成每一所述数字对应的编码向量;针对每一所述数字,根据所述数字对应的编码向量,生成所述数字的含义。According to one or more embodiments of the present disclosure, Example 8 provides the method of Example 7, wherein generating the question stem information of the applied question to be solved according to the question stem information of the applied question to be solved, in the The meaning of each number that appears in the algebraic formula in the comprehensive formula includes: replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; according to the question stem information obtained after the replacement, generating A coding vector corresponding to each of the numbers; for each of the numbers, the meaning of the number is generated according to the coding vector corresponding to the number.
根据本公开的一个或多个实施例,示例9提供了一种信息处理装置,包括:获取模块,用于获取待求解应用题的题干信息;第一生成模块,用于根据所述获取模块获取到的所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符 合所述综合算式的运算逻辑;第二生成模块,用于针对所述第一生成模块生成的每一所述子算式,根据所述获取模块获取到的所述待求解应用题的题干信息、所述第一生成模块生成的所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;第三生成模块,用于根据所述第一生成模块生成的所述多个子算式和所述第二生成模块生成的每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。According to one or more embodiments of the present disclosure, Example 9 provides an information processing apparatus, including: an acquisition module for acquiring the question stem information of an application question to be solved; a first generation module for obtaining the information according to the acquisition module The obtained question stem information of the applied problem to be solved generates a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved, wherein the A plurality of sub-calculations conform to the operation logic of the comprehensive calculation formula; a second generation module is used for each of the sub-calculations generated by the first generation module, according to the obtained application problem to be solved by the acquisition module Question stem information, the comprehensive formula and the sub-calculation generated by the first generating module, generate analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the The unit of the result of the sub-calculation; the third generation module is configured to generate all the sub-calculations according to the plurality of sub-calculations generated by the first generation module and the analysis information corresponding to each of the sub-calculations generated by the second generation module. Describe the explanation information corresponding to the multiple sub-calculations.
根据本公开的一个或多个实施例,示例10提供了一种非易失性计算机可读介质,其上存储有计算机程序,该程序被处理装置执行时实现示例1-8中任一项所述方法的步骤。According to one or more embodiments of the present disclosure, Example 10 provides a non-transitory computer-readable medium having stored thereon a computer program that, when executed by a processing apparatus, implements any of Examples 1-8. steps of the method described.
根据本公开的一个或多个实施例,示例11提供了一种电子设备,包括:存储装置,其上存储有计算机程序;处理装置,用于执行所述存储装置中的所述计算机程序,以实现以下方法的步骤:获取待求解应用题的题干信息;根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;针对每一所述子算式,根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息。According to one or more embodiments of the present disclosure, Example 11 provides an electronic device, including: a storage device on which a computer program is stored; and a processing device for executing the computer program in the storage device to The steps of implementing the following method: obtaining the question stem information of the application question to be solved; generating a comprehensive formula for completely solving the application question to be solved and a comprehensive formula used to represent the application question to be solved according to the question stem information of the application question to be solved. A plurality of sub-calculations of the solving process of an applied problem, wherein the plurality of sub-calculations conform to the operation logic of the comprehensive formula; for each of the sub-calculations, according to the question stem information of the applied problem to be solved, the comprehensive formula and the sub-calculation, generating analysis information corresponding to the sub-calculation, wherein the analysis information includes the meaning of the result of the sub-calculation and the unit of the result of the sub-calculation; according to the multiple sub-calculations and each The analysis information corresponding to the sub-expressions is used to generate explanation information corresponding to the plurality of sub-expressions.
根据本公开的一个或多个实施例,示例12提供了示例11的电子设备,所述根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,包括:将所述待求解应用题的题干信息输入求解模型的第一编码网络中,得到编码后的题干信息,其中,所述求解模型包括所述第一编码网络和第一解码网络;将所述编码后的题干信息输入所述第一解码网络中,得到目标表达式和用于表征所述目标表达式中各运算符是否作为一子算式中的唯一运算符的标识信息;根据所述目标表达式,生成所述综合算式,并根据所述目标表达式和所述标识信息,生成所述多个子算式,其中,所述综合算式中的代数式由所述目标表达式中的各元素构成。According to one or more embodiments of the present disclosure, Example 12 provides the electronic device of Example 11, the comprehensive formula and The multiple sub-calculations used to embody the solution process of the applied problem to be solved include: inputting the problem stem information of the applied problem to be solved into the first coding network of the solving model, and obtaining the encoded problem stem information, wherein, The solution model includes the first encoding network and the first decoding network; the encoded question stem information is input into the first decoding network to obtain a target expression and a target expression for characterizing each of the target expressions. Whether the operator is used as the identification information of the unique operator in a sub-calculation; according to the target expression, generate the comprehensive calculation formula, and according to the target expression and the identification information, generate the multiple sub-calculations, wherein , the algebraic expression in the comprehensive calculation formula is composed of each element in the target expression.
根据本公开的一个或多个实施例,示例13提供了示例12的电子设备,所述目标表达式为前缀表达式。According to one or more embodiments of the present disclosure, Example 13 provides the electronic device of Example 12, the target expression being a prefix expression.
根据本公开的一个或多个实施例,示例14提供了示例11的电子设备,所述解析 信息还包括所述子算式涉及的知识点。According to one or more embodiments of the present disclosure, Example 14 provides the electronic device of Example 11, and the analysis information further includes knowledge points involved in the sub-calculation.
根据本公开的一个或多个实施例,示例15提供了示例14的电子设备,所述根据所述待求解应用题的题干信息、所述综合算式以及所述子算式,生成所述子算式对应的解析信息,包括:将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后输入到多任务的序列到序列模型中,得到所述子算式对应的解析信息。According to one or more embodiments of the present disclosure, Example 15 provides the electronic device of Example 14, the sub-equation is generated according to the stem information of the application question to be solved, the comprehensive equation and the sub-equation Corresponding analysis information includes: splicing the question stem information of the application question to be solved, the comprehensive formula and the sub-calculation in sequence, and then inputting them into the multi-task sequence-to-sequence model to obtain the analysis corresponding to the sub-calculation information.
根据本公开的一个或多个实施例,示例16提供了示例15的电子设备,所述多任务的序列到序列模型包括第二编码网络、第二解码网络、第一分类器以及第二分类器;其中,所述第二编码网络,用于对拼接信息进行编码,得到编码信息,所述拼接信息为将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后所得的信息;所述第二解码网络,用于根据所述编码信息,生成所述子算式的结果的含义;所述第一分类器,用于根据所述编码信息,生成所述子算式的结果的单位;所述第二分类器,用于根据所述编码信息,生成所述子算式涉及的知识点。According to one or more embodiments of the present disclosure, Example 16 provides the electronic device of Example 15, the multitasking sequence-to-sequence model including a second encoding network, a second decoding network, a first classifier, and a second classifier Wherein, the second coding network is used to encode the splicing information to obtain coding information, and the splicing information is to sequentially splicing the question stem information, the comprehensive formula and the sub-calculation of the application question to be solved. The second decoding network is used to generate the meaning of the result of the sub-calculation according to the coding information; the first classifier is used to generate the sub-calculation according to the coding information The unit of the result; the second classifier is configured to generate the knowledge points involved in the sub-calculation according to the encoding information.
根据本公开的一个或多个实施例,示例17提供了示例11-16中任一项的电子设备,所述方法还包括以下步骤:根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中、在所述综合算式中的代数式中出现的每一数字的含义;所述根据所述多个子算式和每一所述子算式对应的解析信息,生成所述多个子算式对应的讲解信息,包括:根据所述多个子算式、每一所述子算式对应的解析信息以及每一所述数字的含义,生成所述多个子算式对应的讲解信息。According to one or more embodiments of the present disclosure, Example 17 provides the electronic device of any one of Examples 11-16, and the method further includes the step of: generating the The meaning of each number that appears in the algebraic formula in the comprehensive formula in the question stem information of the application question to be solved; the generation of the The explanation information corresponding to the multiple sub-calculations includes: generating explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations, the analysis information corresponding to each of the sub-calculations, and the meaning of each of the numbers.
根据本公开的一个或多个实施例,示例18提供了示例17的电子设备,所述根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中、在所述综合算式中的代数式中出现的每一数字的含义,包括:将所述待求解应用题的题干信息中的每一所述数字替换为预设标识;根据替换后所得的题干信息,生成每一所述数字对应的编码向量;针对每一所述数字,根据所述数字对应的编码向量,生成所述数字的含义。According to one or more embodiments of the present disclosure, Example 18 provides the electronic device of Example 17, wherein the question stem information of the applied question to be solved is generated according to the question stem information of the applied question to be solved, in all The meaning of each number that appears in the algebraic formula in the comprehensive formula includes: replacing each of the numbers in the question stem information of the applied question to be solved with a preset identifier; according to the question stem information obtained after the replacement, A coding vector corresponding to each of the numbers is generated; for each of the numbers, the meaning of the number is generated according to the coding vector corresponding to the number.
根据本公开的一个或多个实施例,提供一种计算机程序,包括:指令,所述指令当由处理器执行时使所述处理器执行本公开第一方面提供的方法。According to one or more embodiments of the present disclosure, there is provided a computer program comprising: instructions which, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
根据本公开的一个或多个实施例,提供一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行本公开第一方面提供的方法。According to one or more embodiments of the present disclosure, there is provided a computer program product comprising instructions which, when executed by a processor, cause the processor to perform the method provided by the first aspect of the present disclosure.
以上描述仅为本公开的较佳实施例以及对所运用技术原理的说明。本领域技术人员应当理解,本公开中所涉及的公开范围,并不限于上述技术特征的特定组合而成的 技术方案,同时也应涵盖在不脱离上述公开构思的情况下,由上述技术特征或其等同特征进行任意组合而形成的其它技术方案。例如上述特征与本公开中公开的(但不限于)具有类似功能的技术特征进行互相替换而形成的技术方案。The above description is merely a preferred embodiment of the present disclosure and an illustration of the technical principles employed. Those skilled in the art should understand that the scope of the disclosure involved in the present disclosure is not limited to the technical solutions formed by the specific combination of the above-mentioned technical features, and should also cover, without departing from the above-mentioned disclosed concept, the technical solutions formed by the above-mentioned technical features or Other technical solutions formed by any combination of its equivalent features. For example, a technical solution is formed by replacing the above features with the technical features disclosed in the present disclosure (but not limited to) with similar functions.
此外,虽然采用特定次序描绘了各操作,但是这不应当理解为要求这些操作以所示出的特定次序或以顺序次序执行来执行。在一定环境下,多任务和并行处理可能是有利的。同样地,虽然在上面论述中包含了若干具体实现细节,但是这些不应当被解释为对本公开的范围的限制。在单独的实施例的上下文中描述的某些特征还可以组合地实现在单个实施例中。相反地,在单个实施例的上下文中描述的各种特征也可以单独地或以任何合适的子组合的方式实现在多个实施例中。Additionally, although operations are depicted in a particular order, this should not be construed as requiring that the operations be performed in the particular order shown or in a sequential order. Under certain circumstances, multitasking and parallel processing may be advantageous. Likewise, although the above discussion contains several implementation-specific details, these should not be construed as limitations on the scope of the present disclosure. Certain features that are described in the context of separate embodiments can also be implemented in combination in a single embodiment. Conversely, various features that are described in the context of a single embodiment can also be implemented in multiple embodiments separately or in any suitable subcombination.
尽管已经采用特定于结构特征和/或方法逻辑动作的语言描述了本主题,但是应当理解所附权利要求书中所限定的主题未必局限于上面描述的特定特征或动作。相反,上面所描述的特定特征和动作仅仅是实现权利要求书的示例形式。关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Although the subject matter has been described in language specific to structural features and/or logical acts of method, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are merely example forms of implementing the claims. Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.

Claims (14)

  1. 一种信息处理方法,包括:An information processing method, comprising:
    获取待求解应用题的题干信息;Obtain the stem information of the application questions to be solved;
    根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;According to the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the applied problem to be solved and a plurality of sub-calculations for reflecting the solution process of the applied problem to be solved are generated, wherein the plurality of sub-calculations The calculation formula conforms to the operation logic of the comprehensive calculation formula;
    针对所述多个子算式中的每一个,根据所述待求解应用题的题干信息、所述综合算式以及子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;For each of the plurality of sub-expressions, according to the question stem information of the applied question to be solved, the comprehensive formula and the sub-expressions, the analysis information corresponding to the sub-expression is generated, wherein the analysis information includes the the meaning of the result of the sub-expression and the unit of the result of said sub-expression;
    根据所述多个子算式和所述多个子算式中的每一个对应的解析信息,生成所述多个子算式对应的讲解信息。According to the plurality of sub-calculations and the analysis information corresponding to each of the plurality of sub-calculations, explanation information corresponding to the plurality of sub-calculations is generated.
  2. 根据权利要求1所述的方法,其中,所述根据所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,包括:The method according to claim 1, wherein, according to the question stem information of the applied problem to be solved, a comprehensive formula for completely solving the applied problem to be solved and a formula for reflecting the applied problem to be solved are generated. Multiple sub-calculations of the solution process, including:
    将所述待求解应用题的题干信息输入求解模型的第一编码网络中,得到编码后的题干信息,其中,所述求解模型包括所述第一编码网络和第一解码网络;Input the question stem information of the application question to be solved into the first coding network of the solving model, and obtain the encoded question stem information, wherein the solving model includes the first coding network and the first decoding network;
    将所述编码后的题干信息输入所述第一解码网络中,得到目标表达式和用于表征所述目标表达式中各运算符是否作为一个子算式中的唯一运算符的标识信息;The question stem information after described coding is input in described first decoding network, obtains target expression and is used to characterize whether each operator in described target expression is as the identification information of the unique operator in a sub-calculation;
    根据所述目标表达式,生成所述综合算式,并根据所述目标表达式和所述标识信息,生成所述多个子算式,其中,所述综合算式中的代数式由所述目标表达式中的各元素构成。According to the target expression, the comprehensive calculation formula is generated, and according to the target expression and the identification information, the plurality of sub-calculations are generated, wherein the algebraic formula in the comprehensive calculation formula is determined by the target expression. composition of each element.
  3. 根据权利要求2所述的方法,其中,所述目标表达式为前缀表达式。The method of claim 2, wherein the target expression is a prefix expression.
  4. 根据权利要求1所述的方法,其中,所述解析信息还包括所述子算式涉及的知识点。The method according to claim 1, wherein the analysis information further includes knowledge points involved in the sub-calculation.
  5. 根据权利要求4所述的方法,其中,所述根据所述待求解应用题的题干信息、 所述综合算式以及子算式,生成所述子算式对应的解析信息,包括:The method according to claim 4, wherein, generating the analytical information corresponding to the sub-calculation according to the question stem information of the applied problem to be solved, the comprehensive formula and the sub-calculation, comprising:
    将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后输入到多任务的序列到序列模型中,得到所述子算式对应的解析信息。The stem information of the application question to be solved, the comprehensive formula and the sub-calculation are sequentially spliced and input into the multi-task sequence-to-sequence model to obtain the analytical information corresponding to the sub-calculation.
  6. 根据权利要求5所述的方法,其中,所述多任务的序列到序列模型包括第二编码网络、第二解码网络、第一分类器以及第二分类器,The method of claim 5, wherein the multi-task sequence-to-sequence model includes a second encoding network, a second decoding network, a first classifier, and a second classifier,
    所述第二编码网络,用于对拼接信息进行编码,得到编码信息,所述拼接信息为将所述待求解应用题的题干信息、所述综合算式以及所述子算式依次拼接后所得的信息,The second encoding network is used to encode the splicing information to obtain the coded information, and the splicing information is obtained by sequentially splicing the stem information of the applied problem to be solved, the comprehensive formula and the sub-calculation. information,
    所述第二解码网络,用于根据所述编码信息,生成所述子算式的结果的含义,The second decoding network is used to generate the meaning of the result of the sub-calculation according to the encoding information,
    所述第一分类器,用于根据所述编码信息,生成所述子算式的结果的单位,the first classifier, configured to generate the unit of the result of the sub-calculation according to the encoding information,
    所述第二分类器,用于根据所述编码信息,生成所述子算式涉及的知识点。The second classifier is configured to generate the knowledge points involved in the sub-calculation according to the encoding information.
  7. 根据权利要求1-6中任一项所述的方法,还包括:The method of any one of claims 1-6, further comprising:
    根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中的每一数字的含义、在所述综合算式中的代数式中出现的每一数字的含义;According to the question stem information of the applied question to be solved, the meaning of each number in the question stem information of the applied question to be solved and the meaning of each number appearing in the algebraic formula in the comprehensive calculation formula are generated;
    其中,所述根据所述多个子算式和所述多个子算式中的每一个对应的解析信息,生成所述多个子算式对应的讲解信息,包括:Wherein, generating the explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations and the analysis information corresponding to each of the multiple sub-calculations includes:
    根据所述多个子算式、所述多个子算式中的每一个对应的解析信息以及所述每一数字的含义,生成所述多个子算式对应的讲解信息。According to the plurality of sub-calculations, analysis information corresponding to each of the plurality of sub-calculations, and the meaning of each number, explanation information corresponding to the plurality of sub-calculations is generated.
  8. 根据权利要求7所述的方法,其中,所述根据所述待求解应用题的题干信息,生成所述待求解应用题的题干信息中的每一数字的含义、在所述综合算式中的代数式中出现的每一数字的含义,包括:The method according to claim 7, wherein the meaning of each number in the question stem information of the applied question to be solved is generated according to the question stem information of the applied question to be solved, in the comprehensive calculation formula The meaning of each number that appears in the algebraic expression of , including:
    将所述待求解应用题的题干信息中的所述每一数字替换为预设标识;Replace each number in the question stem information of the applied question to be solved with a preset identifier;
    根据替换后所得的题干信息,生成所述每一数字对应的编码向量;According to the question stem information obtained after the replacement, the coding vector corresponding to each number is generated;
    针对所述每一数字,根据数字对应的编码向量,生成所述数字的含义。For each number, the meaning of the number is generated according to the encoding vector corresponding to the number.
  9. 根据权利要求1-6中任一项所述的方法,其中,所述题干信息包括图像信息、文本信息或音频信息,所述讲解信息包括讲解文本、讲解音频、讲解视频中的至少一 个。The method according to any one of claims 1-6, wherein the question stem information includes image information, text information or audio information, and the explanation information includes at least one of explanation text, explanation audio, and explanation video.
  10. 一种信息处理装置,包括:An information processing device, comprising:
    获取模块,用于获取待求解应用题的题干信息;The acquisition module is used to acquire the question stem information of the application question to be solved;
    第一生成模块,用于根据所述获取模块获取到的所述待求解应用题的题干信息,生成用于完整求解所述待求解应用题的综合算式和用于体现所述待求解应用题的求解过程的多个子算式,其中,所述多个子算式符合所述综合算式的运算逻辑;The first generation module is configured to generate a comprehensive formula for completely solving the to-be-solved applied problem and a comprehensive formula for reflecting the to-be-solved applied problem according to the question stem information of the to-be-solved applied problem obtained by the acquisition module A plurality of sub-calculations of the solution process, wherein, the plurality of sub-calculations conform to the operation logic of the comprehensive formula;
    第二生成模块,用于针对所述多个子算式中的每一个,根据所述待求解应用题的题干信息、所述综合算式以及子算式,生成所述子算式对应的解析信息,其中,所述解析信息包括所述子算式的结果的含义和所述子算式的结果的单位;The second generation module is configured to, for each of the plurality of sub-expressions, generate analytical information corresponding to the sub-expressions according to the question stem information of the applied question to be solved, the comprehensive expression and the sub-expressions, wherein, The analysis information includes the meaning of the result of the sub-expression and the unit of the result of the sub-expression;
    第三生成模块,用于根据所述多个子算式和所述多个子算式中的每一个对应的解析信息,生成所述多个子算式对应的讲解信息。The third generating module is configured to generate explanation information corresponding to the multiple sub-calculations according to the multiple sub-calculations and the analysis information corresponding to each of the multiple sub-calculations.
  11. 一种非易失性计算机可读介质,其上存储有计算机程序,其中,该程序被处理装置执行时实现权利要求1-9中任一项所述方法的步骤。A non-volatile computer-readable medium having stored thereon a computer program, wherein the program, when executed by a processing device, implements the steps of the method of any one of claims 1-9.
  12. 一种电子设备,包括:An electronic device comprising:
    存储装置,其上存储有计算机程序;a storage device on which a computer program is stored;
    处理装置,用于执行所述存储装置中的所述计算机程序,以实现权利要求1-9中任一项所述方法的步骤。A processing device is configured to execute the computer program in the storage device to implement the steps of the method of any one of claims 1-9.
  13. 一种计算机程序,包括:A computer program comprising:
    指令,所述指令当由处理器执行时使所述处理器执行根据权利要求1-9中任一项所述的方法。Instructions which, when executed by a processor, cause the processor to perform the method of any of claims 1-9.
  14. 一种计算机程序产品,包括指令,所述指令当由处理器执行时使所述处理器执行根据权利要求1-9中任一项所述的方法。A computer program product comprising instructions which, when executed by a processor, cause the processor to perform the method of any of claims 1-9.
PCT/CN2022/079530 2021-03-08 2022-03-07 Information processing method and apparatus, medium and electronic device WO2022188742A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110251093.5 2021-03-08
CN202110251093.5A CN112907409A (en) 2021-03-08 2021-03-08 Application problem solving method, device, medium and electronic equipment

Publications (1)

Publication Number Publication Date
WO2022188742A1 true WO2022188742A1 (en) 2022-09-15

Family

ID=76107903

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/079530 WO2022188742A1 (en) 2021-03-08 2022-03-07 Information processing method and apparatus, medium and electronic device

Country Status (2)

Country Link
CN (1) CN112907409A (en)
WO (1) WO2022188742A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112907409A (en) * 2021-03-08 2021-06-04 北京有竹居网络技术有限公司 Application problem solving method, device, medium and electronic equipment

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109685137A (en) * 2018-12-24 2019-04-26 上海仁静信息技术有限公司 A kind of topic classification method, device, electronic equipment and storage medium
CN109948473A (en) * 2019-03-04 2019-06-28 中国计量大学 A kind of method neural network based promoting student's applied problem solution topic ability
CN109992785A (en) * 2019-04-09 2019-07-09 腾讯科技(深圳)有限公司 Content calculation method, device and equipment based on machine learning
CN111259632A (en) * 2020-02-10 2020-06-09 暗物智能科技(广州)有限公司 Semantic alignment-based tree structure mathematical application problem solving method and system
US20200242956A1 (en) * 2017-03-13 2020-07-30 Vitruv Inc. Method and system for supporting learning, and non-transitory computer-readable recording medium
CN112183048A (en) * 2020-09-30 2021-01-05 北京有竹居网络技术有限公司 Automatic problem solving method and device, computer equipment and storage medium
CN112907409A (en) * 2021-03-08 2021-06-04 北京有竹居网络技术有限公司 Application problem solving method, device, medium and electronic equipment

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200242956A1 (en) * 2017-03-13 2020-07-30 Vitruv Inc. Method and system for supporting learning, and non-transitory computer-readable recording medium
CN109685137A (en) * 2018-12-24 2019-04-26 上海仁静信息技术有限公司 A kind of topic classification method, device, electronic equipment and storage medium
CN109948473A (en) * 2019-03-04 2019-06-28 中国计量大学 A kind of method neural network based promoting student's applied problem solution topic ability
CN109992785A (en) * 2019-04-09 2019-07-09 腾讯科技(深圳)有限公司 Content calculation method, device and equipment based on machine learning
CN111259632A (en) * 2020-02-10 2020-06-09 暗物智能科技(广州)有限公司 Semantic alignment-based tree structure mathematical application problem solving method and system
CN112183048A (en) * 2020-09-30 2021-01-05 北京有竹居网络技术有限公司 Automatic problem solving method and device, computer equipment and storage medium
CN112907409A (en) * 2021-03-08 2021-06-04 北京有竹居网络技术有限公司 Application problem solving method, device, medium and electronic equipment

Also Published As

Publication number Publication date
CN112907409A (en) 2021-06-04

Similar Documents

Publication Publication Date Title
JP2021108094A (en) Method and device for generating interactive models
CN111026944B (en) Information processing method, device, medium and electronic equipment
WO2022121801A1 (en) Information processing method and apparatus, and electronic device
CN112163076B (en) Knowledge question bank construction method, question and answer processing method, device, equipment and medium
WO2023274187A1 (en) Information processing method and apparatus based on natural language inference, and electronic device
CN111159220A (en) Method and apparatus for outputting structured query statement
CN114385780B (en) Program interface information recommendation method and device, electronic equipment and readable medium
CN113392197B (en) Question-answering reasoning method and device, storage medium and electronic equipment
WO2022188742A1 (en) Information processing method and apparatus, medium and electronic device
Baclawski et al. Ontology summit 2020 communiqué: Knowledge graphs
CN112287659B (en) Information generation method and device, electronic equipment and storage medium
CN111260975B (en) Method, device, medium and electronic equipment for multimedia blackboard teaching interaction
CN115270717A (en) Method, device, equipment and medium for detecting vertical position
CN117290477A (en) Generating type building knowledge question-answering method based on secondary retrieval enhancement
CN111104796B (en) Method and device for translation
CN111008213A (en) Method and apparatus for generating language conversion model
WO2023125000A1 (en) Content output method and apparatus, computer readable medium, and electronic device
CN110689285A (en) Test method, test device, electronic equipment and computer readable storage medium
CN112863277B (en) Interaction method, device, medium and electronic equipment for live broadcast teaching
CN109657046B (en) Content analysis processing method and device, electronic equipment and storage medium
CN114757206A (en) Consultation service method, system and storage medium
CN113778846A (en) Method and apparatus for generating test data
Timilsina et al. Enabling Dataspaces Using Foundation Models: Technical, Legal and Ethical Considerations and Future Trends
WO2023116306A1 (en) Information processing method and apparatus, and readable storage medium and electronic device
CN117743569A (en) Information generation method, model training method, device, equipment and readable medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22766265

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 22766265

Country of ref document: EP

Kind code of ref document: A1