WO2019196205A1 - Foreign language teaching evaluation information generating method and apparatus - Google Patents
Foreign language teaching evaluation information generating method and apparatus Download PDFInfo
- Publication number
- WO2019196205A1 WO2019196205A1 PCT/CN2018/092777 CN2018092777W WO2019196205A1 WO 2019196205 A1 WO2019196205 A1 WO 2019196205A1 CN 2018092777 W CN2018092777 W CN 2018092777W WO 2019196205 A1 WO2019196205 A1 WO 2019196205A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- information
- lip
- foreign language
- pronunciation
- user
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 65
- 238000011156 evaluation Methods 0.000 title claims abstract description 59
- 238000012360 testing method Methods 0.000 claims abstract description 63
- 230000001815 facial effect Effects 0.000 claims description 12
- 238000004590 computer program Methods 0.000 claims description 4
- 239000000284 extract Substances 0.000 claims description 4
- 238000012549 training Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 description 12
- 238000010586 diagram Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 4
- 230000006870 function Effects 0.000 description 3
- 230000003993 interaction Effects 0.000 description 3
- 230000008859 change Effects 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 230000006978 adaptation Effects 0.000 description 1
- 238000003491 array Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 230000001788 irregular Effects 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000002787 reinforcement Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B19/00—Teaching not covered by other main groups of this subclass
- G09B19/04—Speaking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/10—Services
- G06Q50/20—Education
- G06Q50/205—Education administration or guidance
-
- G—PHYSICS
- G09—EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
- G09B—EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
- G09B5/00—Electrically-operated educational appliances
- G09B5/06—Electrically-operated educational appliances with both visual and audible presentation of the material to be studied
- G09B5/065—Combinations of audio and video presentations, e.g. videotapes, videodiscs, television systems
Definitions
- the present disclosure relates to the field of computer technology, and in particular, to a foreign language teaching evaluation information generating method, apparatus, electronic device, and computer readable storage medium.
- the patent application with the application number CN201610823063.6 discloses a voice mouth type animation recognition method and device, and extracts a voice feature from the voice to be recognized; and inputs the extracted voice feature into a pre-trained voice mouth recognition model; Determining a lip type corresponding to the voice feature output by the voice port recognition model; determining a lip animation corresponding to the lip type according to the lip type output of the voice port recognition model; A lip animation that recognizes the recognized voice.
- the application processes the extracted voice features in a pre-trained voice lip recognition model to obtain a corresponding lip shape, but does not collect and analyze the user's mouth shape.
- Patent Application No. CN201611075466.3 discloses a lip language recognition method and apparatus for acquiring image information of a target human body object; acquiring a lip region image of the target human body object from the image information; and from the lip region image The lip features are extracted and the lip features are lip language recognized.
- the application first acquires image information of a human body object, and then acquires an image of a lip region therein, thereby extracting a lip feature, but the image of the lip region is obtained by positioning the feature of the nose tip, not through the contour or color of the lip.
- the patent application with the application number CN201611076381.7 discloses a lip image interaction method based on a depth image and a lip language interaction device for acquiring depth image information of a target human body object; and acquiring a lip of the target human body object from the depth image information
- the application relies on image recognition to realize the positioning of the lip and the extraction of the lip feature, the purpose of which is to identify the corresponding lip language to realize the control of the interactive device, and can not display the lip feature to the user to enable the user to reach The purpose of foreign language learning.
- An object of the present disclosure is to provide a foreign language teaching evaluation information generating method, apparatus, electronic device, and computer readable storage medium, thereby at least partially obviating one or more problems due to limitations and disadvantages of the related art.
- a method for generating foreign language teaching evaluation information includes:
- a method for generating foreign language teaching evaluation information characterized in that the method comprises:
- the pronunciation information including standard pronunciation and standard lip type features
- the method further includes:
- the teaching evaluation information is generated according to the first feature deviation value and the second feature deviation value.
- generating the teaching evaluation information according to the first feature deviation value and the second feature deviation value includes:
- the score information is used as teaching evaluation information.
- the method further includes:
- Teaching evaluation information is generated based on the first deviation information and the matched standard pronunciation and standard lip type features.
- the method further includes:
- the teaching evaluation information is generated according to the lip type feature of the increased prompt identifier, and the teaching evaluation information is controlled to be displayed on the user display device.
- positioning a user's lips in the video picture information includes:
- the lip features include a lip contour, a lip diameter, a lip opening angle, a lip height, and a lip width.
- matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model includes:
- the vectorized lip features are matched to the standard lip features.
- the method further includes:
- the foreign language test information and the pronunciation information in the foreign language practice model are periodically sent to the user for the foreign language intensive exercise.
- a device for generating foreign language teaching evaluation information includes:
- a signal detecting module configured to output a foreign language test information, detect a video signal collected by the video capture device, and respectively extract video image information and audio information in the video signal;
- a feature capture module configured to locate a user's lip in the video screen information, and capture a lip shape of the user's lip;
- An information matching module configured to match pronunciation information corresponding to the foreign language test information in a pre-established foreign language test information and a pronunciation information model, where the pronunciation information includes a standard pronunciation and a standard lip type feature;
- an information generating module configured to calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
- an electronic device comprising:
- a memory having stored thereon computer readable instructions that, when executed by the processor, implement the method of any of the above.
- a computer readable storage medium having stored thereon a computer program, the computer program being executed by a processor, implements the method of any of the above.
- a method for generating a foreign language teaching evaluation information in an exemplary embodiment of the present disclosure outputting foreign language test information, detecting a video signal collected by a video capture device, and extracting video image information and audio information in the video signal, respectively, for the video image Positioning the user's lip in the information, and grasping the lip shape of the user's lip, matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information
- the standard pronunciation and the standard lip shape feature are included, and the first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature are calculated, and the teaching evaluation information is generated according to the first feature deviation value.
- the lip shape of the user's lips is adopted.
- the user can intuitively feel the gap between the mouth shape and the standard mouth shape, and realize the mouth shape adjustment more quickly to make the calibration more accurate.
- the pronunciation greatly enhances the user experience.
- FIG. 1 illustrates a flowchart of a foreign language teaching evaluation information generating method according to an exemplary embodiment of the present disclosure
- FIG. 2 illustrates a schematic diagram including a lip feature in accordance with an exemplary embodiment of the present disclosure
- FIG. 3 illustrates a schematic diagram of a lip-shaped feature contrast display image application scenario according to an exemplary embodiment of the present disclosure
- FIG. 4 shows a schematic block diagram of a foreign language teaching evaluation information generating apparatus according to an exemplary embodiment of the present disclosure
- FIG. 5 schematically illustrates a block diagram of an electronic device in accordance with an exemplary embodiment of the present disclosure
- FIG. 6 schematically illustrates a schematic diagram of a computer readable storage medium in accordance with an exemplary embodiment of the present disclosure.
- a method for generating a foreign language teaching evaluation information is first provided, which can be applied to an electronic device such as a computer.
- the foreign language teaching evaluation information generating method may include the following steps:
- Step S110 outputting foreign language test information, detecting a video signal collected by the video capture device, and separately extracting video picture information and audio information in the video signal;
- Step S120 Positioning a user's lip in the video screen information, and grasping a lip shape of the user's lip;
- Step S130 Matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information including standard pronunciation and standard lip type features;
- Step S140 Calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
- the face recognition of the user video signal and the processing algorithm of the specific area of the face realize the rapid positioning of the lip, and the positioning speed and accuracy are improved.
- the user can intuitively feel the mouth shape and the standard mouth shape. Gap, faster implementation of lip adjustment to give a more accurate pronunciation, greatly enhance the user experience.
- step S110 the foreign language test information may be output, the video signal collected by the video capture device is detected, and the video screen information and the audio information in the video signal are respectively extracted.
- the foreign language test information is displayed on the display device of the user, and then the corresponding pronunciation audio information of the foreign language test information and the video picture information of the facial feature including the pronunciation are obtained by the video collection device.
- step S120 the user's lip in the video screen information may be located and the lip shape of the user's lips may be grasped.
- the lip of the user may be directly positioned according to the facial features of the user in the video picture information, so that the lip feature of the lip of the user may be collected, or the video image may be first
- the user's facial contour is positioned in the information to realize lip positioning.
- positioning the user's lip in the video picture information includes: identifying and locating a user's facial contour in the video picture information; using a lip color filter on the user's lips in a specified area of the facial contour
- the department performs positioning and tracking. Because the existing facial recognition algorithm based on video picture information is a mature technology, the user's facial contour is first recognized, and then the designated area is specified in the facial contour, and the lip of the user is positioned by using the lip color filter. And tracking, compared with directly positioning the user's lips, can greatly increase the recognition and enhance the robustness of the system.
- the lip features include a lip contour, a lip aperture, a lip opening angle, a lip height, and a lip width.
- FIG. 2 is a schematic diagram of a lip-shaped feature
- the lip-shaped feature parameter is extracted, and the digitized parameter identification can be implemented on the irregular lip-shaped feature.
- matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model includes: according to the lip contour, the lip diameter, the lip opening angle, and the lip height And geometric modeling of at least one of the lip widths to form a vectorized lip shape feature.
- the corresponding lip shape feature is also a change of the mouth shape feature parameter.
- Each of the mouth shape feature parameters after geometric modeling can establish each port when the mouth shape changes.
- the vectorized lip shape of the type parameter completely records the change process of each mouth shape characteristic parameter. Thereafter, the vectorized lip feature is matched to the standard lip feature, and the standard lip feature may also be a vectorized standard lip feature.
- the pronunciation information corresponding to the foreign language test information may be matched in the pre-established foreign language test information and the pronunciation information model, and the pronunciation information includes standard pronunciation and standard lip type features.
- the pronunciation information corresponding to the foreign language test information may be matched in the pre-established foreign language test information and the pronunciation information model, and the standard pronunciation and the standard lip-shaped feature corresponding to the foreign language test information are obtained. Further, the pronunciation information corresponding to the user's pronunciation or the lip-shaped feature may be matched in the pre-established foreign language test information and the pronunciation information model, and used to check the correctness of the user's foreign language test information.
- step S140 a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature may be calculated, and the teaching evaluation information is generated according to the first feature deviation value.
- the method further includes: determining whether the first feature deviation value satisfies a preset deviation condition; if the feature deviation value satisfies a preset deviation condition, generating a teaching evaluation according to the first characteristic deviation value
- the information, the teaching evaluation information may be an evaluation score of the foreign language test information, or may be an evaluation file value of the foreign language test information, such as "qualified", “unqualified” or “excellent”, "good”, “failed”, and the like.
- the method further includes: obtaining first deviation information according to the first characteristic deviation value; and generating teaching evaluation information according to the first deviation information and the matched standard pronunciation and standard lip type features.
- the method further includes: adding a prompt identifier to the lip-shaped feature region that satisfies the preset deviation condition; generating teaching evaluation information according to the lip-shaped feature of the added prompt identifier, and controlling the teaching evaluation information to be Displayed on the user display device.
- the solid line part is a lip-shaped feature of the foreign language test information “C” when the user tests in a foreign language
- the dotted line part is a standard lip-shaped feature corresponding to the foreign language test information “C”, in the solid line part and the dotted line.
- the prompt identifier is added, which is convenient for the user to further correct the mouth shape feature according to the prompt identifier, and achieve a complete fitting of the user mouth type feature and the standard mouth shape feature, thereby issuing a more standard foreign language test information "C" pronunciation.
- the method further includes: analyzing the audio information, identifying a user's pronunciation in the audio information; calculating a second feature deviation value of the user's pronunciation and the matched standard pronunciation; The first feature deviation value and the second feature deviation value are used to generate teaching evaluation information.
- the user's pronunciation should also be judged, and the user's pronunciation in the collected audio information is compared with the matched standard pronunciation to obtain the second characteristic deviation.
- the value, the second characteristic deviation value reflects the deviation information of the user's pronunciation, and combined with the first characteristic deviation value, the teaching evaluation information of the user's foreign language test information may be comprehensively generated.
- generating the teaching evaluation information according to the first feature deviation value and the second feature deviation value includes: generating the score information according to the first feature deviation value; and using the score information as the teaching evaluation information.
- the method further includes: determining whether the score information is less than a preset score; if yes, saving the score information and foreign language test information and pronunciation information corresponding to the score information; The score information and the corresponding foreign language test information and the pronunciation information are used to train the foreign language practice model; the foreign language test information and the pronunciation information in the foreign language practice model are periodically sent to the user for the foreign language intensive exercise.
- the above method is a further enhancement of foreign language learning after the evaluation of foreign language teaching. By means of the preset score line of the facility, the user's foreign language pronunciation level can be quickly checked, and the number of foreign language pronunciation tests that do not satisfy the score line is increased, and the purpose of reinforcement learning is realized, which is beneficial to the purpose. Enhance the pronunciation of foreign language pronunciation.
- the foreign language teaching evaluation information generating apparatus 400 may include: a signal detecting module 410, configured to output foreign language test information, detect a video signal collected by the video capturing device, and respectively extract video image information in the video signal and Audio information
- the feature capture module 420 is configured to locate a user's lip in the video screen information, and capture a lip shape of the user's lip;
- the information matching module 430 is configured to match the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, where the pronunciation information includes standard pronunciation and standard lip-shaped features;
- the information generating module 440 is configured to calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
- modules or units of the foreign language teaching evaluation information generating apparatus 400 are mentioned in the above detailed description, such division is not mandatory. Indeed, in accordance with embodiments of the present disclosure, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of one of the modules or units described above may be further divided into multiple modules or units.
- an electronic device capable of implementing the above method is also provided.
- aspects of the present invention can be implemented as a system, method, or program product. Accordingly, aspects of the present invention may be embodied in the form of a complete hardware embodiment, a complete software embodiment (including firmware, microcode, etc.), or a combination of hardware and software aspects, which may be collectively referred to herein. "Circuit,” “module,” or “system.”
- FIG. 5 An electronic device 500 in accordance with such an embodiment of the present invention is described below with reference to FIG. 5 is merely an example and should not impose any limitation on the function and scope of use of the embodiments of the present invention.
- electronic device 500 is embodied in the form of a general purpose computing device.
- the components of the electronic device 500 may include, but are not limited to, the at least one processing unit 510, the at least one storage unit 520, the bus 530 connecting the different system components (including the storage unit 520 and the processing unit 510), and the display unit 540.
- the storage unit stores program code, which can be executed by the processing unit 510, such that the processing unit 510 performs various exemplary embodiments according to the present invention described in the "Exemplary Method" section of the present specification.
- the processing unit 510 can perform steps S110 to S140 as shown in FIG. 1.
- the storage unit 520 can include a readable medium in the form of a volatile storage unit, such as a random access storage unit (RAM) 5201 and/or a cache storage unit 5202, and can further include a read only storage unit (ROM) 5203.
- RAM random access storage unit
- ROM read only storage unit
- the storage unit 520 can also include a program/utility 5204 having a set (at least one) of the program modules 5205, such as but not limited to: an operating system, one or more applications, other program modules, and program data, Implementations of the network environment may be included in each or some of these examples.
- a program/utility 5204 having a set (at least one) of the program modules 5205, such as but not limited to: an operating system, one or more applications, other program modules, and program data, Implementations of the network environment may be included in each or some of these examples.
- Bus 530 may be representative of one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, a graphics acceleration port, a processing unit, or a local area using any of a variety of bus structures. bus.
- the electronic device 500 can also communicate with one or more external devices 570 (eg, a keyboard, pointing device, Bluetooth device, etc.), and can also communicate with one or more devices that enable the user to interact with the electronic device 500, and/or with Any device (eg, router, modem, etc.) that enables the electronic device 500 to communicate with one or more other computing devices. This communication can take place via an input/output (I/O) interface 550. Also, electronic device 500 can communicate with one or more networks (e.g., a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet) via network adapter 560. As shown, network adapter 560 communicates with other modules of electronic device 500 via bus 530.
- network adapter 560 communicates with other modules of electronic device 500 via bus 530.
- the exemplary embodiments described herein may be implemented by software, or may be implemented by software in combination with necessary hardware. Therefore, the technical solution according to an embodiment of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a USB flash drive, a mobile hard disk, etc.) or on a network.
- a non-volatile storage medium which may be a CD-ROM, a USB flash drive, a mobile hard disk, etc.
- a number of instructions are included to cause a computing device (which may be a personal computer, server, terminal device, or network device, etc.) to perform a method in accordance with an embodiment of the present disclosure.
- a computer readable storage medium having stored thereon a program product capable of implementing the above method of the present specification.
- aspects of the present invention may also be embodied in the form of a program product comprising program code for causing said program product to run on a terminal device The terminal device performs the steps according to various exemplary embodiments of the present invention described in the "Exemplary Method" section of the present specification.
- a program product 600 for implementing the above method which may employ a portable compact disk read only memory (CD-ROM) and includes program code, and may be in a terminal device, is illustrated in accordance with an embodiment of the present invention.
- CD-ROM portable compact disk read only memory
- the program product of the present invention is not limited thereto, and in the present document, the readable storage medium may be any tangible medium containing or storing a program that can be used by or in connection with an instruction execution system, apparatus or device.
- the program product can employ any combination of one or more readable media.
- the readable medium can be a readable signal medium or a readable storage medium.
- the readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (non-exhaustive lists) of readable storage media include: electrical connections with one or more wires, portable disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
- the computer readable signal medium may include a data signal that is propagated in the baseband or as part of a carrier, carrying readable program code. Such propagated data signals can take a variety of forms including, but not limited to, electromagnetic signals, optical signals, or any suitable combination of the foregoing.
- the readable signal medium can also be any readable medium other than a readable storage medium that can transmit, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
- Program code embodied on a readable medium can be transmitted using any suitable medium, including but not limited to wireless, wireline, optical cable, RF, etc., or any suitable combination of the foregoing.
- Program code for performing the operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++, etc., including conventional procedural Programming language—such as the "C" language or a similar programming language.
- the program code can execute entirely on the user computing device, partially on the user device, as a stand-alone software package, partially on the remote computing device on the user computing device, or entirely on the remote computing device or server. Execute on.
- the remote computing device can be connected to the user computing device via any kind of network, including a local area network (LAN) or wide area network (WAN), or can be connected to an external computing device (eg, provided using an Internet service) Businesses are connected via the Internet).
- LAN local area network
- WAN wide area network
- Businesses are connected via the Internet.
- the lip shape of the user's lips is adopted.
- the user can intuitively feel the gap between the mouth shape and the standard mouth shape, and realize the mouth shape adjustment more quickly to make the calibration more accurate.
- the pronunciation greatly enhances the user experience.
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Educational Administration (AREA)
- Educational Technology (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- Tourism & Hospitality (AREA)
- General Health & Medical Sciences (AREA)
- Multimedia (AREA)
- Economics (AREA)
- Human Resources & Organizations (AREA)
- Marketing (AREA)
- Primary Health Care (AREA)
- General Business, Economics & Management (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Entrepreneurship & Innovation (AREA)
- Electrically Operated Instructional Devices (AREA)
Abstract
The present disclosure relates to a foreign language teaching evaluation information generating method and apparatus, an electronic device, and a storage medium. Said method comprises: outputting foreign language test information, detecting a video signal acquired by a video acquiring device, and extracting video image information and audio information in the video signal, respectively; positioning a user's lips in the video image information, and capturing mouth shape features of the user's lips; matching, in a pre-established foreign language test information and pronunciation information model, pronunciation information corresponding to the foreign language test information, the pronunciation information comprising a standard pronunciation and standard mouth shape features; calculating a first feature deviation value between the mouth shape features of the user's lip and the matched standard mouth shape features, and generating teaching evaluation information according to the first feature deviation value. The present disclosure can realize the evaluation of foreign language teaching by positioning and comparing mouth shape features in foreign language test information.
Description
本公开涉及计算机技术领域,具体而言,涉及一种外语教学评价信息生成方法、装置、电子设备以及计算机可读存储介质。The present disclosure relates to the field of computer technology, and in particular, to a foreign language teaching evaluation information generating method, apparatus, electronic device, and computer readable storage medium.
外语学习中发音是否准确直接关系到教学学习效果,实际教学中,准确的判定用户每次的发音与口型的正确程度,让用户知悉待更正的方向,可以让提高用户的学习效率,实现事半功倍的效果。Whether the pronunciation is accurate in foreign language learning is directly related to the teaching and learning effect. In the actual teaching, it is accurate to determine the correctness of the user's pronunciation and mouth shape each time, so that the user can know the direction to be corrected, which can improve the learning efficiency of the user and achieve more with less. Effect.
然而,在实际教学中,只能通过一对一教学的方式才能实现对用户每次的发音与口型的观察和判定,这样的方式需要耗用占用大量的教学资源,同时学习时间和场所也是很的制约因素。However, in the actual teaching, only one-to-one teaching can only achieve the observation and judgment of the user's pronunciation and mouth each time. This way requires a large amount of teaching resources, and the learning time and place are also Very restrictive factor.
在现有技术中,围绕注释内容的显示这个主题,现有技术中已经有一些专利申请进行了有益的尝试,比如:In the prior art, there have been some patent applications in the prior art that have made useful attempts around the topic of display of annotated content, such as:
申请号为CN201610823063.6的专利申请公开了一种语音口型动画的识别方法及装置,从待识别语音中提取语音特征;将提取的所述语音特征,输入预先训练的语音口型识别模型;确定所述语音口型识别模型输出的与所述语音特征对应的口型类别;根据所述语音口型识别模型输出的口型类别,确定与所述口型类别对应的口型动画,作为所述待识别语音的口型动画。该申请通过将提取的所述语音特征在预先训练的语音口型识别模型处理得到对应的口型,但是并没有对用户的口型进行采集分析。The patent application with the application number CN201610823063.6 discloses a voice mouth type animation recognition method and device, and extracts a voice feature from the voice to be recognized; and inputs the extracted voice feature into a pre-trained voice mouth recognition model; Determining a lip type corresponding to the voice feature output by the voice port recognition model; determining a lip animation corresponding to the lip type according to the lip type output of the voice port recognition model; A lip animation that recognizes the recognized voice. The application processes the extracted voice features in a pre-trained voice lip recognition model to obtain a corresponding lip shape, but does not collect and analyze the user's mouth shape.
申请号为CN201611075466.3的专利申请公开了唇语识别方法以及装置,获取目标人体对象的图像信息;从所述图像信息中获取所述目标人体对象的嘴唇区域图像;从所述嘴唇区域图像中提取唇部特征,并对所述唇部特征进行唇语识别。该申请先对获取人体对象的图像信息,然后在其中获取嘴唇区域图像,进而提取唇部特征,但其获取嘴唇区域图像是通过对鼻尖的特征定位来实现的,并不是通过嘴唇的轮廓或者颜色特征来实现的。Patent Application No. CN201611075466.3 discloses a lip language recognition method and apparatus for acquiring image information of a target human body object; acquiring a lip region image of the target human body object from the image information; and from the lip region image The lip features are extracted and the lip features are lip language recognized. The application first acquires image information of a human body object, and then acquires an image of a lip region therein, thereby extracting a lip feature, but the image of the lip region is obtained by positioning the feature of the nose tip, not through the contour or color of the lip. Features to achieve.
申请号为CN201611076381.7的专利申请公开了一种基于深度图像的唇语交互方法以及唇语交互装置,获取目标人体对象的深度图像信息;从深度图像信息中获取所述目标人体对象的唇部区域图像;从嘴唇区域图像提取唇部特征,根据唇部特征进行唇语识别;将唇语识别的结果转化成对应的操作指令,并根据所述操作指令进行交互。该申请是依靠图像识别的方式实现对唇部的定位及唇部特征的提取的,其目的是识别对应的唇语以实现对交互设备的控制,并不能将唇部特征显示给用户使用户达到外语学习的目的。The patent application with the application number CN201611076381.7 discloses a lip image interaction method based on a depth image and a lip language interaction device for acquiring depth image information of a target human body object; and acquiring a lip of the target human body object from the depth image information The region image; the lip feature is extracted from the lip region image, and the lip language recognition is performed according to the lip feature; the result of the lip language recognition is converted into a corresponding operation instruction, and the interaction is performed according to the operation instruction. The application relies on image recognition to realize the positioning of the lip and the extraction of the lip feature, the purpose of which is to identify the corresponding lip language to realize the control of the interactive device, and can not display the lip feature to the user to enable the user to reach The purpose of foreign language learning.
现有技术中,关于外语教学评价信息生成方法中口型特征的处理还存在以下问题:In the prior art, there are the following problems regarding the processing of lip-type features in the method for generating foreign language teaching evaluation information:
1、通过视频采集设备采集并分析口型设备后,并没有在用户的显示终端中显示;1. After the mouth device is collected and analyzed by the video capture device, it is not displayed in the user's display terminal;
2、不能先通过人体识别,进而通过嘴唇的轮廓或者颜色特征来实现对唇部的定位;2, can not first through the body recognition, and then through the contours or color features of the lips to achieve the positioning of the lip;
3、此外,也并未关注到用户口型与标准口型对比显示的问题。3. In addition, there is no concern about the comparison between the user's mouth type and the standard mouth type.
因此,需要提供一种或多种至少能够解决上述问题的技术方案。Therefore, it is desirable to provide one or more technical solutions that at least solve the above problems.
需要说明的是,在上述背景技术部分公开的信息仅用于加强对本公开的背景的理解,因此可以包括不构成对本领域普通技术人员已知的现有技术的信息。It should be noted that the information disclosed in the Background section above is only for enhancement of understanding of the background of the present disclosure, and thus may include information that does not constitute prior art known to those of ordinary skill in the art.
发明内容Summary of the invention
本公开的目的在于提供一种外语教学评价信息生成方法、装置、电子设备以及计算机可读存储介质,进而至少在一定程度上克服由于相关技术的限制和缺陷而导致的一个或者多个问题。An object of the present disclosure is to provide a foreign language teaching evaluation information generating method, apparatus, electronic device, and computer readable storage medium, thereby at least partially obviating one or more problems due to limitations and disadvantages of the related art.
根据本公开的一个方面,提供一种外语教学评价信息生成方法,包括:According to an aspect of the present disclosure, a method for generating foreign language teaching evaluation information includes:
一种外语教学评价信息生成方法,其特征在于,所述方法包括:A method for generating foreign language teaching evaluation information, characterized in that the method comprises:
输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所 述视频信号中的视频画面信息以及音频信息;Outputting foreign language test information, detecting a video signal collected by the video capture device, and separately extracting video picture information and audio information in the video signal;
对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征;Positioning a user's lips in the video screen information and grabbing a lip shape of the user's lips;
在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;Matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information including standard pronunciation and standard lip type features;
计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。Calculating a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generating teaching evaluation information according to the first feature deviation value.
在本公开的一种示例性实施例中,所述方法还包括:In an exemplary embodiment of the present disclosure, the method further includes:
分析所述音频信息,识别所述音频信息中的用户发音;Analyzing the audio information to identify a user's pronunciation in the audio information;
计算所述用户发音与匹配到的所述标准发音的第二特征偏差值;Calculating a second feature deviation value of the user pronunciation and the matched standard pronunciation;
根据所述第一特征偏差值以及第二特征偏差值所述生成教学评价信息。The teaching evaluation information is generated according to the first feature deviation value and the second feature deviation value.
在本公开的一种示例性实施例中,根据所述第一特征偏差值以及第二特征偏差值生成教学评价信息,包括:In an exemplary embodiment of the present disclosure, generating the teaching evaluation information according to the first feature deviation value and the second feature deviation value includes:
根据所述第一特征偏差值生成得分信息;Generating score information according to the first feature deviation value;
将所述得分信息作为教学评价信息。The score information is used as teaching evaluation information.
在本公开的一种示例性实施例中,所述方法还包括:In an exemplary embodiment of the present disclosure, the method further includes:
判断所述第一特征偏差值是否满足预设偏差条件;Determining whether the first feature deviation value satisfies a preset deviation condition;
若所述特征偏差值满足预设偏差条件,根据所述第一特征偏差值生成教学评价信息,包括:And if the characteristic deviation value satisfies the preset deviation condition, generating teaching evaluation information according to the first characteristic deviation value, including:
根据所述第一特征偏差值得到第一偏差信息;Obtaining first deviation information according to the first characteristic deviation value;
根据所述第一偏差信息以及匹配到的标准发音以及标准口型特征生成教学评价信息。Teaching evaluation information is generated based on the first deviation information and the matched standard pronunciation and standard lip type features.
在本公开的一种示例性实施例中,所述方法还包括:In an exemplary embodiment of the present disclosure, the method further includes:
对满足预设偏差条件的口型特征区域增加提示标识;Adding a prompt identifier to the lip-shaped feature area that satisfies the preset deviation condition;
根据所述增加提示标识的口型特征生成教学评价信息,并控制所述教 学评价信息在用户显示设备上显示。The teaching evaluation information is generated according to the lip type feature of the increased prompt identifier, and the teaching evaluation information is controlled to be displayed on the user display device.
在本公开的一种示例性实施例中,对所述视频画面信息中的用户唇部进行定位,包括:In an exemplary embodiment of the present disclosure, positioning a user's lips in the video picture information includes:
识别并定位所述视频画面信息中的用户面部轮廓;Identifying and locating a user's facial contour in the video picture information;
对面部轮廓中指定区域使用唇色过滤器对用户唇部进行定位并跟踪。Use a lip filter to position and track the user's lips for a specified area of the facial contour.
在本公开的一种示例性实施例中,所述口型特征包括唇部轮廓、唇部口径、唇部张角、唇高以及唇宽。In an exemplary embodiment of the present disclosure, the lip features include a lip contour, a lip diameter, a lip opening angle, a lip height, and a lip width.
在本公开的一种示例性实施例中,在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,包括:In an exemplary embodiment of the present disclosure, matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model includes:
根据所述唇部轮廓、唇部口径、唇部张角、唇高以及唇宽的至少一项进行几何建模,形成矢量化口型特征;Geometrically modeling according to at least one of the lip contour, the lip diameter, the lip opening angle, the lip height, and the lip width to form a vectorized lip shape feature;
将所述矢量化口型特征与所述标准口型特征进行匹配。The vectorized lip features are matched to the standard lip features.
在本公开的一种示例性实施例中,所述方法还包括:In an exemplary embodiment of the present disclosure, the method further includes:
判断所述得分信息是否小于预设分数;Determining whether the score information is less than a preset score;
若是,保存所述得分信息以及与所述得分信息对应的外语测试信息以及发音信息;If yes, saving the score information and foreign language test information and pronunciation information corresponding to the score information;
根据保存的所述得分信息以及对应的外语测试信息以及发音信息训练外语练习模型;Training a foreign language practice model according to the saved score information and corresponding foreign language test information and pronunciation information;
定期将所述外语练习模型中的外语测试信息以及发音信息发送至用户,以进行外语强化练习。The foreign language test information and the pronunciation information in the foreign language practice model are periodically sent to the user for the foreign language intensive exercise.
在本公开的一个方面,提供一种外语教学评价信息生成装置,包括:In an aspect of the present disclosure, a device for generating foreign language teaching evaluation information includes:
信号检测模块,用于输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息;a signal detecting module, configured to output a foreign language test information, detect a video signal collected by the video capture device, and respectively extract video image information and audio information in the video signal;
特征抓取模块,用于对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征;a feature capture module, configured to locate a user's lip in the video screen information, and capture a lip shape of the user's lip;
信息匹配模块,用于在预先建立的外语测试信息与发音信息模型中匹 配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;An information matching module, configured to match pronunciation information corresponding to the foreign language test information in a pre-established foreign language test information and a pronunciation information model, where the pronunciation information includes a standard pronunciation and a standard lip type feature;
信息生成模块,用于计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。And an information generating module, configured to calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
在本公开的一个方面,提供一种电子设备,包括:In an aspect of the disclosure, an electronic device is provided, comprising:
处理器;以及Processor;
存储器,所述存储器上存储有计算机可读指令,所述计算机可读指令被所述处理器执行时实现根据上述任意一项所述的方法。A memory having stored thereon computer readable instructions that, when executed by the processor, implement the method of any of the above.
在本公开的一个方面,提供一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现根据上述任意一项所述的方法。In an aspect of the present disclosure, a computer readable storage medium having stored thereon a computer program, the computer program being executed by a processor, implements the method of any of the above.
本公开的示例性实施例中的外语教学评价信息生成方法,输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息,对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征,在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征,计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。一方面,通过对用户视频信号的面部识别进而对面部特定区域的处理算法,实现了对唇部的快速定位,提高了定位的速度和准确性;再一方面,通过用户唇部的口型特征与匹配到的所述标准口型特征的对比,并将偏差区域标识显示的方式,可以使用户直观的感受到口型和标准口型的差距,更快速的实现口型调节,以发出更准确的读音,极大的增强了用户体验。a method for generating a foreign language teaching evaluation information in an exemplary embodiment of the present disclosure, outputting foreign language test information, detecting a video signal collected by a video capture device, and extracting video image information and audio information in the video signal, respectively, for the video image Positioning the user's lip in the information, and grasping the lip shape of the user's lip, matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information The standard pronunciation and the standard lip shape feature are included, and the first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature are calculated, and the teaching evaluation information is generated according to the first feature deviation value. On the one hand, through the face recognition of the user video signal and the processing algorithm of the specific area of the face, the rapid positioning of the lip is realized, and the speed and accuracy of the positioning are improved; on the other hand, the lip shape of the user's lips is adopted. Compared with the matching standard lip shape features and the display of the deviation area identification, the user can intuitively feel the gap between the mouth shape and the standard mouth shape, and realize the mouth shape adjustment more quickly to make the calibration more accurate. The pronunciation greatly enhances the user experience.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。The above general description and the following detailed description are intended to be illustrative and not restrictive.
通过参照附图来详细描述其示例实施例,本公开的上述和其它特征及优点将变得更加明显。The above and other features and advantages of the present disclosure will become more apparent from the detailed description.
图1示出了根据本公开一示例性实施例的外语教学评价信息生成方法的流程图;FIG. 1 illustrates a flowchart of a foreign language teaching evaluation information generating method according to an exemplary embodiment of the present disclosure;
图2示出了根据本公开一示例性实施例的包含口型特征的示意图;2 illustrates a schematic diagram including a lip feature in accordance with an exemplary embodiment of the present disclosure;
图3示出了根据本公开一示例性实施例的口型特征对比显示图像应用场景的示意图;FIG. 3 illustrates a schematic diagram of a lip-shaped feature contrast display image application scenario according to an exemplary embodiment of the present disclosure; FIG.
图4示出了根据本公开一示例性实施例的外语教学评价信息生成装置的示意框图;FIG. 4 shows a schematic block diagram of a foreign language teaching evaluation information generating apparatus according to an exemplary embodiment of the present disclosure;
图5示意性示出了根据本公开一示例性实施例的电子设备的框图;以及FIG. 5 schematically illustrates a block diagram of an electronic device in accordance with an exemplary embodiment of the present disclosure;
图6示意性示出了根据本公开一示例性实施例的计算机可读存储介质的示意图。FIG. 6 schematically illustrates a schematic diagram of a computer readable storage medium in accordance with an exemplary embodiment of the present disclosure.
现在将参考附图更全面地描述示例实施例。然而,示例实施例能够以多种形式实施,且不应被理解为限于在此阐述的实施例;相反,提供这些实施例使得本公开将全面和完整,并将示例实施例的构思全面地传达给本领域的技术人员。在图中相同的附图标记表示相同或类似的部分,因而将省略对它们的重复描述。Example embodiments will now be described more fully with reference to the accompanying drawings. However, the exemplary embodiments can be embodied in a variety of forms and should not be construed as being limited to the embodiments set forth herein. To those skilled in the art. The same reference numerals in the drawings denote the same or similar parts, and the repeated description thereof will be omitted.
此外,所描述的特征、结构或特性可以以任何合适的方式结合在一个或更多实施例中。在下面的描述中,提供许多具体细节从而给出对本公开的实施例的充分理解。然而,本领域技术人员将意识到,可以实践本公开的技术方案而没有所述特定细节中的一个或更多,或者可以采用其它的方法、组元、材料、装置、步骤等。在其它情况下,不详细示出或描述公知结构、方法、装置、实现、材料或者操作以避免模糊本公开的各方面。Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are set forth However, one skilled in the art will appreciate that the technical solution of the present disclosure may be practiced without one or more of the specific details, or other methods, components, materials, devices, steps, etc. may be employed. In other instances, well-known structures, methods, devices, implementations, materials, or operations are not shown or described in detail to avoid obscuring aspects of the present disclosure.
附图中所示的方框图仅仅是功能实体,不一定必须与物理上独立的实体相对应。即,可以采用软件形式来实现这些功能实体,或在一个或多个软件硬化的模块中实现这些功能实体或功能实体的一部分,或在不同网络和/或处理器装置和/或微控制器装置中实现这些功能实体。The block diagrams shown in the figures are merely functional entities and do not necessarily have to correspond to physically separate entities. That is, these functional entities may be implemented in software, or implemented in one or more software-hardened modules, or in different network and/or processor devices and/or microcontroller devices. Implement these functional entities.
在本示例实施例中,首先提供了一种外语教学评价信息生成方法,可以应用于计算机等电子设备;参考图1中所示,该外语教学评价信息生成方法可以包括以下步骤:In the present exemplary embodiment, a method for generating a foreign language teaching evaluation information is first provided, which can be applied to an electronic device such as a computer. Referring to FIG. 1, the foreign language teaching evaluation information generating method may include the following steps:
步骤S110.输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息;Step S110: outputting foreign language test information, detecting a video signal collected by the video capture device, and separately extracting video picture information and audio information in the video signal;
步骤S120.对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征;Step S120. Positioning a user's lip in the video screen information, and grasping a lip shape of the user's lip;
步骤S130.在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;Step S130. Matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information including standard pronunciation and standard lip type features;
步骤S140.计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。Step S140. Calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
根据本示例实施例中的外语教学评价信息生成方法,一方面,通过对用户视频信号的面部识别进而对面部特定区域的处理算法,实现了对唇部的快速定位,提高了定位的速度和准确性;再一方面,通过用户唇部的口型特征与匹配到的所述标准口型特征的对比,并将偏差区域标识显示的方式,可以使用户直观的感受到口型和标准口型的差距,更快速的实现口型调节,以发出更准确的读音,极大的增强了用户体验。According to the foreign language teaching evaluation information generating method in the exemplary embodiment, on the one hand, the face recognition of the user video signal and the processing algorithm of the specific area of the face realize the rapid positioning of the lip, and the positioning speed and accuracy are improved. On the other hand, through the comparison between the lip characteristics of the user's lips and the matching standard lip features, and the display of the deviation region identification, the user can intuitively feel the mouth shape and the standard mouth shape. Gap, faster implementation of lip adjustment to give a more accurate pronunciation, greatly enhance the user experience.
下面,将对本示例实施例中的外语教学评价信息生成方法进行进一步的说明。Next, the foreign language teaching evaluation information generating method in the present exemplary embodiment will be further described.
在步骤S110中,可以输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息。In step S110, the foreign language test information may be output, the video signal collected by the video capture device is detected, and the video screen information and the audio information in the video signal are respectively extracted.
本示例实施方式中,在用户的显示设备上显示外语测试信息,然后通过视频采集设备获取用户对所述外语测试信息的对应的发音音频信息及包 含发音时面部特征的视频画面信息。In the exemplary embodiment, the foreign language test information is displayed on the display device of the user, and then the corresponding pronunciation audio information of the foreign language test information and the video picture information of the facial feature including the pronunciation are obtained by the video collection device.
在步骤S120中,可以对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征。In step S120, the user's lip in the video screen information may be located and the lip shape of the user's lips may be grasped.
本示例实施方式中,可以根据所述视频画面信息中用户发音时的面部特征直接对用户唇部进行定位,实现对所述用户唇部的口型特征的采集,也可以先对所述视频画面信息中的用户面部轮廓定位,进而实现唇部定位,具体方法如下:In this example, the lip of the user may be directly positioned according to the facial features of the user in the video picture information, so that the lip feature of the lip of the user may be collected, or the video image may be first The user's facial contour is positioned in the information to realize lip positioning. The specific method is as follows:
本示例实施方式中,对所述视频画面信息中的用户唇部进行定位,包括:识别并定位所述视频画面信息中的用户面部轮廓;对面部轮廓中指定区域使用唇色过滤器对用户唇部进行定位并跟踪。因为现有的基于视频画面信息的面部识别算法是成熟的技术,先对用户面部轮廓识别,进而在识别的面部轮廓中指定区域,通过使用唇色过滤器的方式,实现对用户唇部的定位和跟踪,与直接对用户唇部进行定位相比,可以大幅增加识别度,增强系统的鲁棒性。In this example embodiment, positioning the user's lip in the video picture information includes: identifying and locating a user's facial contour in the video picture information; using a lip color filter on the user's lips in a specified area of the facial contour The department performs positioning and tracking. Because the existing facial recognition algorithm based on video picture information is a mature technology, the user's facial contour is first recognized, and then the designated area is specified in the facial contour, and the lip of the user is positioned by using the lip color filter. And tracking, compared with directly positioning the user's lips, can greatly increase the recognition and enhance the robustness of the system.
本示例实施方式中,所述口型特征包括唇部轮廓、唇部口径、唇部张角、唇高以及唇宽。如图2所示,为口型特征的示意图,提取所述口型特征参数,可以对不规则的口型特征实现数字化参数标识。In this exemplary embodiment, the lip features include a lip contour, a lip aperture, a lip opening angle, a lip height, and a lip width. As shown in FIG. 2, which is a schematic diagram of a lip-shaped feature, the lip-shaped feature parameter is extracted, and the digitized parameter identification can be implemented on the irregular lip-shaped feature.
本示例实施方式中,在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,包括:根据所述唇部轮廓、唇部口径、唇部张角、唇高以及唇宽的至少一项进行几何建模,形成矢量化口型特征。由于在外语测试时,往往并不是单一音节的发音,所以对应的口型特征也是变化的口型特征参数,各个通过几何建模后的口型特征参数,可以在口型变化时,建立各个口型参数的矢量化口型特征,完整的记录各个口型特征参数的变化过程。之后,将所述矢量化口型特征与所述标准口型特征进行匹配,所述标准口型特征也可以是矢量化的标准口型特征。In the example embodiment, matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model includes: according to the lip contour, the lip diameter, the lip opening angle, and the lip height And geometric modeling of at least one of the lip widths to form a vectorized lip shape feature. Because in the foreign language test, it is often not the pronunciation of a single syllable, so the corresponding lip shape feature is also a change of the mouth shape feature parameter. Each of the mouth shape feature parameters after geometric modeling can establish each port when the mouth shape changes. The vectorized lip shape of the type parameter completely records the change process of each mouth shape characteristic parameter. Thereafter, the vectorized lip feature is matched to the standard lip feature, and the standard lip feature may also be a vectorized standard lip feature.
在步骤S130中,可以在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征。In step S130, the pronunciation information corresponding to the foreign language test information may be matched in the pre-established foreign language test information and the pronunciation information model, and the pronunciation information includes standard pronunciation and standard lip type features.
本示例实施方式中,可以在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,得到所述外语测试信息对应的标准发音以及标准口型特征。进一步的,还可以在预先建立的外语测试信息与发音信息模型中匹配与用户发音或口型特征对应的发音信息,用来检验用户外语测试信息的正确程度。In the example embodiment, the pronunciation information corresponding to the foreign language test information may be matched in the pre-established foreign language test information and the pronunciation information model, and the standard pronunciation and the standard lip-shaped feature corresponding to the foreign language test information are obtained. Further, the pronunciation information corresponding to the user's pronunciation or the lip-shaped feature may be matched in the pre-established foreign language test information and the pronunciation information model, and used to check the correctness of the user's foreign language test information.
在步骤S140中,可以计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。In step S140, a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature may be calculated, and the teaching evaluation information is generated according to the first feature deviation value.
本示例实施方式中,将用户外语测试信息对应的口型特征与在外语测试信息与发音信息模型中匹配到的所述标准口型特征进行对比,分别统计各个口型特征参数的偏差,综合所述偏差后生成第一特征偏差值,所述第一特征偏差值反应了用户各个口型特征参数的偏差信息,可以将所述第一特征偏差值作为教学评价信息的生成依据。In this example embodiment, comparing the lip-shaped feature corresponding to the user's foreign language test information with the standard lip-shaped feature matched in the foreign language test information and the pronunciation information model, and separately calculating the deviation of each port type feature parameter, the comprehensive After the deviation, a first feature deviation value is generated, and the first feature deviation value reflects deviation information of each mouth shape characteristic parameter of the user, and the first feature deviation value may be used as a basis for generating teaching evaluation information.
本示例实施方式中,所述方法还包括:判断所述第一特征偏差值是否满足预设偏差条件;若所述特征偏差值满足预设偏差条件,根据所述第一特征偏差值生成教学评价信息,所述教学评价信息可以是对外语测试信息的评价分值,也可以是外语测试信息的评价档值,如“合格”“不合格”或“优秀”“良好”“不及格”等。具体的还包括:根据所述第一特征偏差值得到第一偏差信息;根据所述第一偏差信息以及匹配到的标准发音以及标准口型特征生成教学评价信息。In this example, the method further includes: determining whether the first feature deviation value satisfies a preset deviation condition; if the feature deviation value satisfies a preset deviation condition, generating a teaching evaluation according to the first characteristic deviation value The information, the teaching evaluation information may be an evaluation score of the foreign language test information, or may be an evaluation file value of the foreign language test information, such as "qualified", "unqualified" or "excellent", "good", "failed", and the like. Specifically, the method further includes: obtaining first deviation information according to the first characteristic deviation value; and generating teaching evaluation information according to the first deviation information and the matched standard pronunciation and standard lip type features.
本示例实施方式中,所述方法还包括:对满足预设偏差条件的口型特征区域增加提示标识;根据所述增加提示标识的口型特征生成教学评价信息,并控制所述教学评价信息在用户显示设备上显示。如图3所示,实线部分为用户在外语测试时,对外语测试信息“C”的口型特征,虚线部分为外语测试信息“C”对应的标准口型特征,在实线部分与虚线部分的未重合区域,增加了提示标识,方便用户根据提示标识进一步更正口型特征,实现用户口型特征与标准口型特征的完全拟合,进而发出更标准的外语测试信息“C”的读音。In this example, the method further includes: adding a prompt identifier to the lip-shaped feature region that satisfies the preset deviation condition; generating teaching evaluation information according to the lip-shaped feature of the added prompt identifier, and controlling the teaching evaluation information to be Displayed on the user display device. As shown in FIG. 3, the solid line part is a lip-shaped feature of the foreign language test information “C” when the user tests in a foreign language, and the dotted line part is a standard lip-shaped feature corresponding to the foreign language test information “C”, in the solid line part and the dotted line. Part of the uncoincident area, the prompt identifier is added, which is convenient for the user to further correct the mouth shape feature according to the prompt identifier, and achieve a complete fitting of the user mouth type feature and the standard mouth shape feature, thereby issuing a more standard foreign language test information "C" pronunciation. .
本示例实施方式中,所述方法还包括:分析所述音频信息,识别所述 音频信息中的用户发音;计算所述用户发音与匹配到的所述标准发音的第二特征偏差值;根据所述第一特征偏差值以及第二特征偏差值所述生成教学评价信息。为了使用户最终实现标准外语发音,除了对用户口型特征的比较,还应判断用户的发音,将采集到的音频信息中的用户发音与匹配到的所述标准发音对比,得到第二特征偏差值,所述第二特征偏差值反应了用户发音的偏差信息,结合第一特征偏差值,可以综合生成用户外语测试信息的教学评价信息。In this example embodiment, the method further includes: analyzing the audio information, identifying a user's pronunciation in the audio information; calculating a second feature deviation value of the user's pronunciation and the matched standard pronunciation; The first feature deviation value and the second feature deviation value are used to generate teaching evaluation information. In order to enable the user to finally achieve the standard foreign language pronunciation, in addition to comparing the user's mouth shape characteristics, the user's pronunciation should also be judged, and the user's pronunciation in the collected audio information is compared with the matched standard pronunciation to obtain the second characteristic deviation. The value, the second characteristic deviation value reflects the deviation information of the user's pronunciation, and combined with the first characteristic deviation value, the teaching evaluation information of the user's foreign language test information may be comprehensively generated.
本示例实施方式中,根据所述第一特征偏差值以及第二特征偏差值生成教学评价信息,包括:根据所述第一特征偏差值生成得分信息;将所述得分信息作为教学评价信息。如图4所示,为用户外语测试时对外语测试信息“C”对应的得分信息的教学评价信息。In the present exemplary embodiment, generating the teaching evaluation information according to the first feature deviation value and the second feature deviation value includes: generating the score information according to the first feature deviation value; and using the score information as the teaching evaluation information. As shown in FIG. 4, the teaching evaluation information of the score information corresponding to the foreign language test information "C" at the time of the user's foreign language test.
本示例实施方式中,所述方法还包括:判断所述得分信息是否小于预设分数;若是,保存所述得分信息以及与所述得分信息对应的外语测试信息以及发音信息;根据保存的所述得分信息以及对应的外语测试信息以及发音信息训练外语练习模型;定期将所述外语练习模型中的外语测试信息以及发音信息发送至用户,以进行外语强化练习。以上方法是对外语教学评价后外语学习的进一步强化,通过设施预设分数线的方式,可以快速检验用户的外语发音水平,增加不满足分数线的外语发音测试的次数,实现强化学习的目的,有利于增强外语发音学习效果。In this example embodiment, the method further includes: determining whether the score information is less than a preset score; if yes, saving the score information and foreign language test information and pronunciation information corresponding to the score information; The score information and the corresponding foreign language test information and the pronunciation information are used to train the foreign language practice model; the foreign language test information and the pronunciation information in the foreign language practice model are periodically sent to the user for the foreign language intensive exercise. The above method is a further enhancement of foreign language learning after the evaluation of foreign language teaching. By means of the preset score line of the facility, the user's foreign language pronunciation level can be quickly checked, and the number of foreign language pronunciation tests that do not satisfy the score line is increased, and the purpose of reinforcement learning is realized, which is beneficial to the purpose. Enhance the pronunciation of foreign language pronunciation.
需要说明的是,尽管在附图中以特定顺序描述了本公开中方法的各个步骤,但是,这并非要求或者暗示必须按照该特定顺序来执行这些步骤,或是必须执行全部所示的步骤才能实现期望的结果。附加的或备选的,可以省略某些步骤,将多个步骤合并为一个步骤执行,以及/或者将一个步骤分解为多个步骤执行等。It should be noted that, although the various steps of the method of the present disclosure are described in a particular order in the drawings, this does not require or imply that the steps must be performed in the specific order, or that all the steps shown must be performed. Achieve the desired results. Additionally or alternatively, certain steps may be omitted, multiple steps being combined into one step execution, and/or one step being decomposed into multiple step executions and the like.
此外,在本示例实施例中,还提供了一种外语教学评价信息生成装置。参照图4所示,该外语教学评价信息生成装置400可以包括:信号检测模块410,用于输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息;Further, in the present exemplary embodiment, a foreign language teaching evaluation information generating apparatus is also provided. Referring to FIG. 4, the foreign language teaching evaluation information generating apparatus 400 may include: a signal detecting module 410, configured to output foreign language test information, detect a video signal collected by the video capturing device, and respectively extract video image information in the video signal and Audio information
特征抓取模块420,用于对所述视频画面信息中的用户唇部进行定位, 并抓取所述用户唇部的口型特征;The feature capture module 420 is configured to locate a user's lip in the video screen information, and capture a lip shape of the user's lip;
信息匹配模块430,用于在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;The information matching module 430 is configured to match the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, where the pronunciation information includes standard pronunciation and standard lip-shaped features;
信息生成模块440,用于计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。The information generating module 440 is configured to calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
上述中各外语教学评价信息生成装置模块的具体细节已经在对应的音频段落识别方法中进行了详细的描述,因此此处不再赘述。The specific details of the above-mentioned foreign language teaching evaluation information generating device modules have been described in detail in the corresponding audio paragraph identifying method, and therefore will not be described herein.
应当注意,尽管在上文详细描述中提及了外语教学评价信息生成装置400的若干模块或者单元,但是这种划分并非强制性的。实际上,根据本公开的实施方式,上文描述的两个或更多模块或者单元的特征和功能可以在一个模块或者单元中具体化。反之,上文描述的一个模块或者单元的特征和功能可以进一步划分为由多个模块或者单元来具体化。It should be noted that although several modules or units of the foreign language teaching evaluation information generating apparatus 400 are mentioned in the above detailed description, such division is not mandatory. Indeed, in accordance with embodiments of the present disclosure, the features and functions of two or more modules or units described above may be embodied in one module or unit. Conversely, the features and functions of one of the modules or units described above may be further divided into multiple modules or units.
此外,在本公开的示例性实施例中,还提供了一种能够实现上述方法的电子设备。Further, in an exemplary embodiment of the present disclosure, an electronic device capable of implementing the above method is also provided.
所属技术领域的技术人员能够理解,本发明的各个方面可以实现为系统、方法或程序产品。因此,本发明的各个方面可以具体实现为以下形式,即:完全的硬件实施例、完全的软件实施例(包括固件、微代码等),或硬件和软件方面结合的实施例,这里可以统称为“电路”、“模块”或“系统”。Those skilled in the art will appreciate that various aspects of the present invention can be implemented as a system, method, or program product. Accordingly, aspects of the present invention may be embodied in the form of a complete hardware embodiment, a complete software embodiment (including firmware, microcode, etc.), or a combination of hardware and software aspects, which may be collectively referred to herein. "Circuit," "module," or "system."
下面参照图5来描述根据本发明的这种实施例的电子设备500。图5显示的电子设备500仅仅是一个示例,不应对本发明实施例的功能和使用范围带来任何限制。An electronic device 500 in accordance with such an embodiment of the present invention is described below with reference to FIG. The electronic device 500 shown in FIG. 5 is merely an example and should not impose any limitation on the function and scope of use of the embodiments of the present invention.
如图5所示,电子设备500以通用计算设备的形式表现。电子设备500的组件可以包括但不限于:上述至少一个处理单元510、上述至少一个存储单元520、连接不同系统组件(包括存储单元520和处理单元510)的总线530、显示单元540。As shown in FIG. 5, electronic device 500 is embodied in the form of a general purpose computing device. The components of the electronic device 500 may include, but are not limited to, the at least one processing unit 510, the at least one storage unit 520, the bus 530 connecting the different system components (including the storage unit 520 and the processing unit 510), and the display unit 540.
其中,所述存储单元存储有程序代码,所述程序代码可以被所述处理 单元510执行,使得所述处理单元510执行本说明书上述“示例性方法”部分中描述的根据本发明各种示例性实施例的步骤。例如,所述处理单元510可以执行如图1中所示的步骤S110至步骤S140。Wherein the storage unit stores program code, which can be executed by the processing unit 510, such that the processing unit 510 performs various exemplary embodiments according to the present invention described in the "Exemplary Method" section of the present specification. The steps of the examples. For example, the processing unit 510 can perform steps S110 to S140 as shown in FIG. 1.
存储单元520可以包括易失性存储单元形式的可读介质,例如随机存取存储单元(RAM)5201和/或高速缓存存储单元5202,还可以进一步包括只读存储单元(ROM)5203。The storage unit 520 can include a readable medium in the form of a volatile storage unit, such as a random access storage unit (RAM) 5201 and/or a cache storage unit 5202, and can further include a read only storage unit (ROM) 5203.
存储单元520还可以包括具有一组(至少一个)程序模块5205的程序/实用工具5204,这样的程序模块5205包括但不限于:操作系统、一个或者多个应用程序、其它程序模块以及程序数据,这些示例中的每一个或某种组合中可能包括网络环境的实现。The storage unit 520 can also include a program/utility 5204 having a set (at least one) of the program modules 5205, such as but not limited to: an operating system, one or more applications, other program modules, and program data, Implementations of the network environment may be included in each or some of these examples.
总线530可以为表示几类总线结构中的一种或多种,包括存储单元总线或者存储单元控制器、外围总线、图形加速端口、处理单元或者使用多种总线结构中的任意总线结构的局域总线。 Bus 530 may be representative of one or more of several types of bus structures, including a memory unit bus or memory unit controller, a peripheral bus, a graphics acceleration port, a processing unit, or a local area using any of a variety of bus structures. bus.
电子设备500也可以与一个或多个外部设备570(例如键盘、指向设备、蓝牙设备等)通信,还可与一个或者多个使得用户能与该电子设备500交互的设备通信,和/或与使得该电子设备500能与一个或多个其它计算设备进行通信的任何设备(例如路由器、调制解调器等等)通信。这种通信可以通过输入/输出(I/O)接口550进行。并且,电子设备500还可以通过网络适配器560与一个或者多个网络(例如局域网(LAN),广域网(WAN)和/或公共网络,例如因特网)通信。如图所示,网络适配器560通过总线530与电子设备500的其它模块通信。应当明白,尽管图中未示出,可以结合电子设备500使用其它硬件和/或软件模块,包括但不限于:微代码、设备驱动器、冗余处理单元、外部磁盘驱动阵列、RAID系统、磁带驱动器以及数据备份存储系统等。The electronic device 500 can also communicate with one or more external devices 570 (eg, a keyboard, pointing device, Bluetooth device, etc.), and can also communicate with one or more devices that enable the user to interact with the electronic device 500, and/or with Any device (eg, router, modem, etc.) that enables the electronic device 500 to communicate with one or more other computing devices. This communication can take place via an input/output (I/O) interface 550. Also, electronic device 500 can communicate with one or more networks (e.g., a local area network (LAN), a wide area network (WAN), and/or a public network, such as the Internet) via network adapter 560. As shown, network adapter 560 communicates with other modules of electronic device 500 via bus 530. It should be understood that although not shown in the figures, other hardware and/or software modules may be utilized in conjunction with electronic device 500, including but not limited to: microcode, device drivers, redundant processing units, external disk drive arrays, RAID systems, tape drives. And data backup storage systems, etc.
通过以上的实施例的描述,本领域的技术人员易于理解,这里描述的示例实施例可以通过软件实现,也可以通过软件结合必要的硬件的方式来实现。因此,根据本公开实施例的技术方案可以以软件产品的形式体现出来,该软件产品可以存储在一个非易失性存储介质(可以是CD-ROM,U盘,移动硬盘等)中或网络上,包括若干指令以使得一台计算设备(可以是个 人计算机、服务器、终端装置、或者网络设备等)执行根据本公开实施例的方法。Through the description of the above embodiments, those skilled in the art can easily understand that the exemplary embodiments described herein may be implemented by software, or may be implemented by software in combination with necessary hardware. Therefore, the technical solution according to an embodiment of the present disclosure may be embodied in the form of a software product, which may be stored in a non-volatile storage medium (which may be a CD-ROM, a USB flash drive, a mobile hard disk, etc.) or on a network. A number of instructions are included to cause a computing device (which may be a personal computer, server, terminal device, or network device, etc.) to perform a method in accordance with an embodiment of the present disclosure.
在本公开的示例性实施例中,还提供了一种计算机可读存储介质,其上存储有能够实现本说明书上述方法的程序产品。在一些可能的实施例中,本发明的各个方面还可以实现为一种程序产品的形式,其包括程序代码,当所述程序产品在终端设备上运行时,所述程序代码用于使所述终端设备执行本说明书上述“示例性方法”部分中描述的根据本发明各种示例性实施例的步骤。In an exemplary embodiment of the present disclosure, there is also provided a computer readable storage medium having stored thereon a program product capable of implementing the above method of the present specification. In some possible embodiments, aspects of the present invention may also be embodied in the form of a program product comprising program code for causing said program product to run on a terminal device The terminal device performs the steps according to various exemplary embodiments of the present invention described in the "Exemplary Method" section of the present specification.
参考图6所示,描述了根据本发明的实施例的用于实现上述方法的程序产品600,其可以采用便携式紧凑盘只读存储器(CD-ROM)并包括程序代码,并可以在终端设备,例如个人电脑上运行。然而,本发明的程序产品不限于此,在本文件中,可读存储介质可以是任何包含或存储程序的有形介质,该程序可以被指令执行系统、装置或者器件使用或者与其结合使用。Referring to FIG. 6, a program product 600 for implementing the above method, which may employ a portable compact disk read only memory (CD-ROM) and includes program code, and may be in a terminal device, is illustrated in accordance with an embodiment of the present invention. For example running on a personal computer. However, the program product of the present invention is not limited thereto, and in the present document, the readable storage medium may be any tangible medium containing or storing a program that can be used by or in connection with an instruction execution system, apparatus or device.
所述程序产品可以采用一个或多个可读介质的任意组合。可读介质可以是可读信号介质或者可读存储介质。可读存储介质例如可以为但不限于电、磁、光、电磁、红外线、或半导体的系统、装置或器件,或者任意以上的组合。可读存储介质的更具体的例子(非穷举的列表)包括:具有一个或多个导线的电连接、便携式盘、硬盘、随机存取存储器(RAM)、只读存储器(ROM)、可擦式可编程只读存储器(EPROM或闪存)、光纤、便携式紧凑盘只读存储器(CD-ROM)、光存储器件、磁存储器件、或者上述的任意合适的组合。The program product can employ any combination of one or more readable media. The readable medium can be a readable signal medium or a readable storage medium. The readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples (non-exhaustive lists) of readable storage media include: electrical connections with one or more wires, portable disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read-only memory (EPROM or flash memory), optical fiber, portable compact disk read only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
计算机可读信号介质可以包括在基带中或者作为载波一部分传播的数据信号,其中承载了可读程序代码。这种传播的数据信号可以采用多种形式,包括但不限于电磁信号、光信号或上述的任意合适的组合。可读信号介质还可以是可读存储介质以外的任何可读介质,该可读介质可以发送、传播或者传输用于由指令执行系统、装置或者器件使用或者与其结合使用的程序。The computer readable signal medium may include a data signal that is propagated in the baseband or as part of a carrier, carrying readable program code. Such propagated data signals can take a variety of forms including, but not limited to, electromagnetic signals, optical signals, or any suitable combination of the foregoing. The readable signal medium can also be any readable medium other than a readable storage medium that can transmit, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device.
可读介质上包含的程序代码可以用任何适当的介质传输,包括但不限于无线、有线、光缆、RF等等,或者上述的任意合适的组合。Program code embodied on a readable medium can be transmitted using any suitable medium, including but not limited to wireless, wireline, optical cable, RF, etc., or any suitable combination of the foregoing.
可以以一种或多种程序设计语言的任意组合来编写用于执行本发明操作的程序代码,所述程序设计语言包括面向对象的程序设计语言—诸如Java、C++等,还包括常规的过程式程序设计语言—诸如“C”语言或类似的程序设计语言。程序代码可以完全地在用户计算设备上执行、部分地在用户设备上执行、作为一个独立的软件包执行、部分在用户计算设备上部分在远程计算设备上执行、或者完全在远程计算设备或服务器上执行。在涉及远程计算设备的情形中,远程计算设备可以通过任意种类的网络,包括局域网(LAN)或广域网(WAN),连接到用户计算设备,或者,可以连接到外部计算设备(例如利用因特网服务提供商来通过因特网连接)。Program code for performing the operations of the present invention may be written in any combination of one or more programming languages, including an object oriented programming language such as Java, C++, etc., including conventional procedural Programming language—such as the "C" language or a similar programming language. The program code can execute entirely on the user computing device, partially on the user device, as a stand-alone software package, partially on the remote computing device on the user computing device, or entirely on the remote computing device or server. Execute on. In the case of a remote computing device, the remote computing device can be connected to the user computing device via any kind of network, including a local area network (LAN) or wide area network (WAN), or can be connected to an external computing device (eg, provided using an Internet service) Businesses are connected via the Internet).
此外,上述附图仅是根据本发明示例性实施例的方法所包括的处理的示意性说明,而不是限制目的。易于理解,上述附图所示的处理并不表明或限制这些处理的时间顺序。另外,也易于理解,这些处理可以是例如在多个模块中同步或异步执行的。Further, the above-described drawings are merely illustrative of the processes included in the method according to the exemplary embodiments of the present invention, and are not intended to be limiting. It is easy to understand that the processing shown in the above figures does not indicate or limit the chronological order of these processes. In addition, it is also easy to understand that these processes may be performed synchronously or asynchronously, for example, in a plurality of modules.
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其他实施例。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由权利要求指出。Other embodiments of the present disclosure will be apparent to those skilled in the <RTIgt; The present application is intended to cover any variations, uses, or adaptations of the present disclosure, which are in accordance with the general principles of the disclosure and include common general knowledge or common technical means in the art that are not disclosed in the present disclosure. . The specification and examples are to be regarded as illustrative only,
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限。It is to be understood that the invention is not limited to the details of the details and The scope of the disclosure is to be limited only by the appended claims.
一方面,通过对用户视频信号的面部识别进而对面部特定区域的处理算法,实现了对唇部的快速定位,提高了定位的速度和准确性;再一方面,通过用户唇部的口型特征与匹配到的所述标准口型特征的对比,并将偏差区域标识显示的方式,可以使用户直观的感受到口型和标准口型的差距,更快速的实现口型调节,以发出更准确的读音,极大的增强了用户体验。On the one hand, through the face recognition of the user video signal and the processing algorithm of the specific area of the face, the rapid positioning of the lip is realized, and the speed and accuracy of the positioning are improved; on the other hand, the lip shape of the user's lips is adopted. Compared with the matching standard lip shape features and the display of the deviation area identification, the user can intuitively feel the gap between the mouth shape and the standard mouth shape, and realize the mouth shape adjustment more quickly to make the calibration more accurate. The pronunciation greatly enhances the user experience.
Claims (12)
- 一种外语教学评价信息生成方法,其特征在于,所述方法包括:A method for generating foreign language teaching evaluation information, characterized in that the method comprises:输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息;Outputting foreign language test information, detecting a video signal collected by the video capture device, and separately extracting video picture information and audio information in the video signal;对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征;Positioning a user's lips in the video screen information and grabbing a lip shape of the user's lips;在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;Matching the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model, the pronunciation information including standard pronunciation and standard lip type features;计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。Calculating a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generating teaching evaluation information according to the first feature deviation value.
- 如权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 wherein the method further comprises:分析所述音频信息,识别所述音频信息中的用户发音;Analyzing the audio information to identify a user's pronunciation in the audio information;计算所述用户发音与匹配到的所述标准发音的第二特征偏差值;Calculating a second feature deviation value of the user pronunciation and the matched standard pronunciation;根据所述第一特征偏差值以及第二特征偏差值所述生成教学评价信息。The teaching evaluation information is generated according to the first feature deviation value and the second feature deviation value.
- 如权利要求2所述的方法,其特征在于,根据所述第一特征偏差值以及第二特征偏差值生成教学评价信息,包括:The method according to claim 2, wherein the generating the teaching evaluation information according to the first feature deviation value and the second feature deviation value comprises:根据所述第一特征偏差值生成得分信息;Generating score information according to the first feature deviation value;将所述得分信息作为教学评价信息。The score information is used as teaching evaluation information.
- 如权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1 wherein the method further comprises:判断所述第一特征偏差值是否满足预设偏差条件;Determining whether the first feature deviation value satisfies a preset deviation condition;若所述特征偏差值满足预设偏差条件,根据所述第一特征偏差值生成教学评价信息,包括:And if the characteristic deviation value satisfies the preset deviation condition, generating teaching evaluation information according to the first characteristic deviation value, including:根据所述第一特征偏差值得到第一偏差信息;Obtaining first deviation information according to the first characteristic deviation value;根据所述第一偏差信息以及匹配到的标准发音以及标准口型特征生成教学评价信息。Teaching evaluation information is generated based on the first deviation information and the matched standard pronunciation and standard lip type features.
- 如权利要求4所述的方法,其特征在于,所述方法还包括:The method of claim 4, wherein the method further comprises:对满足预设偏差条件的口型特征区域增加提示标识;Adding a prompt identifier to the lip-shaped feature area that satisfies the preset deviation condition;根据所述增加提示标识的口型特征生成教学评价信息,并控制所述教学评价信息在用户显示设备上显示。And generating teaching evaluation information according to the lip type feature of the increased prompt identifier, and controlling the teaching evaluation information to be displayed on the user display device.
- 如权利要求1所述的方法,其特征在于,对所述视频画面信息中的用户唇部进行定位,包括:The method of claim 1, wherein locating the user's lips in the video picture information comprises:识别并定位所述视频画面信息中的用户面部轮廓;Identifying and locating a user's facial contour in the video picture information;对面部轮廓中指定区域使用唇色过滤器对用户唇部进行定位并跟踪。Use a lip filter to position and track the user's lips for a specified area of the facial contour.
- 如权利要求1所述的方法,其特征在于,所述口型特征包括唇部轮廓、唇部口径、唇部张角、唇高以及唇宽。The method of claim 1 wherein said lip features include a lip contour, a lip diameter, a lip opening angle, a lip height, and a lip width.
- 如权利要求7所述的方法,其特征在于,在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,包括:The method according to claim 7, wherein the matching of the pronunciation information corresponding to the foreign language test information in the pre-established foreign language test information and the pronunciation information model comprises:根据所述唇部轮廓、唇部口径、唇部张角、唇高以及唇宽的至少一项进行几何建模,形成矢量化口型特征;Geometrically modeling according to at least one of the lip contour, the lip diameter, the lip opening angle, the lip height, and the lip width to form a vectorized lip shape feature;将所述矢量化口型特征与所述标准口型特征进行匹配。The vectorized lip features are matched to the standard lip features.
- 如权利要求3所述的方法,其特征在于,所述方法还包括:The method of claim 3, wherein the method further comprises:判断所述得分信息是否小于预设分数;Determining whether the score information is less than a preset score;若是,保存所述得分信息以及与所述得分信息对应的外语测试信息以及发音信息;If yes, saving the score information and foreign language test information and pronunciation information corresponding to the score information;根据保存的所述得分信息以及对应的外语测试信息以及发音信息训练外语练习模型;Training a foreign language practice model according to the saved score information and corresponding foreign language test information and pronunciation information;定期将所述外语练习模型中的外语测试信息以及发音信息发送至用户,以进行外语强化练习。The foreign language test information and the pronunciation information in the foreign language practice model are periodically sent to the user for the foreign language intensive exercise.
- 一种外语教学评价信息生成装置,其特征在于,所述装置包括:A device for generating foreign language teaching evaluation information, characterized in that the device comprises:信号检测模块,用于输出外语测试信息,检测视频采集设备采集的视频信号,分别提取所述视频信号中的视频画面信息以及音频信息;a signal detecting module, configured to output a foreign language test information, detect a video signal collected by the video capture device, and respectively extract video image information and audio information in the video signal;特征抓取模块,用于对所述视频画面信息中的用户唇部进行定位,并抓取所述用户唇部的口型特征;a feature capture module, configured to locate a user's lip in the video screen information, and capture a lip shape of the user's lip;信息匹配模块,用于在预先建立的外语测试信息与发音信息模型中匹配与所述外语测试信息对应的发音信息,所述发音信息包括标准发音以及标准口型特征;An information matching module, configured to match pronunciation information corresponding to the foreign language test information in a pre-established foreign language test information and a pronunciation information model, where the pronunciation information includes a standard pronunciation and a standard lip type feature;信息生成模块,用于计算所述用户唇部的口型特征与匹配到的所述标准口型特征的第一特征偏差值,根据所述第一特征偏差值生成教学评价信息。And an information generating module, configured to calculate a first feature deviation value of the lip shape of the user's lip and the matched standard lip shape feature, and generate teaching evaluation information according to the first feature deviation value.
- 一种电子设备,其特征在于,包括:An electronic device, comprising:处理器;以及Processor;存储器,所述存储器上存储有计算机可读指令,所述计算机可读指令被所述处理器执行时实现根据权利要求1至9中任一项所述的方法。A memory having computer readable instructions stored thereon, the computer readable instructions being executed by the processor to implement the method of any one of claims 1 to 9.
- 一种计算机可读存储介质,其上存储有计算机程序,所述计算机程序被处理器执行时实现根据权利要求1至9中任一项所述方法。A computer readable storage medium having stored thereon a computer program, the computer program being executed by a processor to implement the method of any one of claims 1 to 9.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810308933.5A CN108537702A (en) | 2018-04-09 | 2018-04-09 | Foreign language teaching evaluation information generation method and device |
CN201810308933.5 | 2018-04-09 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2019196205A1 true WO2019196205A1 (en) | 2019-10-17 |
Family
ID=63483388
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2018/092777 WO2019196205A1 (en) | 2018-04-09 | 2018-06-26 | Foreign language teaching evaluation information generating method and apparatus |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN108537702A (en) |
WO (1) | WO2019196205A1 (en) |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108537702A (en) * | 2018-04-09 | 2018-09-14 | 深圳市鹰硕技术有限公司 | Foreign language teaching evaluation information generation method and device |
CN109671316B (en) * | 2018-09-18 | 2022-05-06 | 张滕滕 | Language learning system |
CN109448463A (en) * | 2018-12-29 | 2019-03-08 | 江苏师范大学 | Foreign language pronunciation autonomous learning training system and its method based on virtual reality technology |
CN109767658B (en) * | 2019-03-25 | 2021-05-04 | 重庆医药高等专科学校 | English video example sentence sharing method and system |
CN111951629A (en) * | 2019-05-16 | 2020-11-17 | 上海流利说信息技术有限公司 | Pronunciation correction system, method, medium and computing device |
CN110930794A (en) * | 2019-09-16 | 2020-03-27 | 上海少立教育科技有限公司 | Intelligent language education system and method |
CN110689464A (en) * | 2019-10-09 | 2020-01-14 | 重庆医药高等专科学校 | Mouth shape recognition-based English pronunciation quality assessment method |
CN113051985B (en) * | 2019-12-26 | 2024-07-05 | 深圳云天励飞技术有限公司 | Information prompting method, device, electronic equipment and storage medium |
CN111652165B (en) * | 2020-06-08 | 2022-05-17 | 北京世纪好未来教育科技有限公司 | Mouth shape evaluating method, mouth shape evaluating equipment and computer storage medium |
CN112150583B (en) * | 2020-09-02 | 2024-07-23 | 广东小天才科技有限公司 | Spoken language pronunciation assessment method and terminal equipment |
CN113077819A (en) * | 2021-03-19 | 2021-07-06 | 北京有竹居网络技术有限公司 | Pronunciation evaluation method and device, storage medium and electronic equipment |
CN113297924B (en) * | 2021-04-30 | 2024-07-19 | 北京有竹居网络技术有限公司 | Pronunciation correction method and device, storage medium and electronic equipment |
CN114245194A (en) * | 2021-12-23 | 2022-03-25 | 深圳市优必选科技股份有限公司 | Video teaching interaction method and device and electronic equipment |
CN114708642B (en) * | 2022-05-24 | 2022-11-18 | 成都锦城学院 | Business English simulation training device, system, method and storage medium |
CN115440222A (en) * | 2022-08-31 | 2022-12-06 | 云知声智能科技股份有限公司 | Language exercise video processing method and device, electronic equipment and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102169642A (en) * | 2011-04-06 | 2011-08-31 | 李一波 | Interactive virtual teacher system having intelligent error correction function |
KR20160097089A (en) * | 2015-02-06 | 2016-08-17 | 손현성 | Method for learning foreign language pronunciation |
CN107945624A (en) * | 2017-10-18 | 2018-04-20 | 金碧波 | A kind of pearl bar of abacus learning machine |
CN108537702A (en) * | 2018-04-09 | 2018-09-14 | 深圳市鹰硕技术有限公司 | Foreign language teaching evaluation information generation method and device |
Family Cites Families (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101187990A (en) * | 2007-12-14 | 2008-05-28 | 华南理工大学 | A session robotic system |
CN102663928A (en) * | 2012-03-07 | 2012-09-12 | 天津大学 | Electronic teaching method for deaf people to learn speaking |
CN103745423B (en) * | 2013-12-27 | 2016-08-24 | 浙江大学 | A kind of shape of the mouth as one speaks teaching system and teaching method |
-
2018
- 2018-04-09 CN CN201810308933.5A patent/CN108537702A/en active Pending
- 2018-06-26 WO PCT/CN2018/092777 patent/WO2019196205A1/en active Application Filing
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102169642A (en) * | 2011-04-06 | 2011-08-31 | 李一波 | Interactive virtual teacher system having intelligent error correction function |
KR20160097089A (en) * | 2015-02-06 | 2016-08-17 | 손현성 | Method for learning foreign language pronunciation |
CN107945624A (en) * | 2017-10-18 | 2018-04-20 | 金碧波 | A kind of pearl bar of abacus learning machine |
CN108537702A (en) * | 2018-04-09 | 2018-09-14 | 深圳市鹰硕技术有限公司 | Foreign language teaching evaluation information generation method and device |
Also Published As
Publication number | Publication date |
---|---|
CN108537702A (en) | 2018-09-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2019196205A1 (en) | Foreign language teaching evaluation information generating method and apparatus | |
CN108962282B (en) | Voice detection analysis method and device, computer equipment and storage medium | |
US10438077B2 (en) | Face liveness detection method, terminal, server and storage medium | |
CN108875833B (en) | Neural network training method, face recognition method and device | |
CN107818798B (en) | Customer service quality evaluation method, device, equipment and storage medium | |
US20180261236A1 (en) | Speaker recognition method and apparatus, computer device and computer-readable medium | |
CN109614934B (en) | Online teaching quality assessment parameter generation method and device | |
US10522136B2 (en) | Method and device for training acoustic model, computer device and storage medium | |
US10056096B2 (en) | Electronic device and method capable of voice recognition | |
US20220375225A1 (en) | Video Segmentation Method and Apparatus, Device, and Medium | |
WO2019218427A1 (en) | Method and apparatus for detecting degree of attention based on comparison of behavior characteristics | |
CN109087670B (en) | Emotion analysis method, system, server and storage medium | |
WO2019080639A1 (en) | Object identifying method, computer device and computer readable storage medium | |
CN109063587B (en) | Data processing method, storage medium and electronic device | |
WO2020215722A1 (en) | Method and device for video processing, electronic device, and computer-readable storage medium | |
WO2020019591A1 (en) | Method and device used for generating information | |
US11322172B2 (en) | Computer-generated feedback of user speech traits meeting subjective criteria | |
CN109785846B (en) | Role recognition method and device for mono voice data | |
US11734954B2 (en) | Face recognition method, device and electronic equipment, and computer non-volatile readable storage medium | |
WO2020029608A1 (en) | Method and apparatus for detecting burr of electrode sheet | |
WO2019223056A1 (en) | Gesture recognition-based teaching and learning method and apparatus | |
WO2019051814A1 (en) | Target recognition method and apparatus, and intelligent terminal | |
WO2020056995A1 (en) | Method and device for determining speech fluency degree, computer apparatus, and readable storage medium | |
CN111127699A (en) | Method, system, equipment and medium for automatically recording automobile defect data | |
WO2022199360A1 (en) | Moving object positioning method and apparatus, electronic device, and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 18914004 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 18914004 Country of ref document: EP Kind code of ref document: A1 |