CN108509903A - Lip-reading translation device and lip-reading translation method based on 3D imaging technology

Info

Publication number: CN108509903A
Application number: CN201810276020.XA
Authority: CN
Inventors: 戴佑俊
Original assignee: Angrui Shanghai Information Technology Co Ltd
Current assignee: Shanghai Qingyan Heshi Technology Co Ltd
Priority date: 2018-03-30
Filing date: 2018-03-30
Publication date: 2018-09-07
Anticipated expiration: 2038-03-30
Also published as: CN108509903B

Abstract

The invention discloses a lip-reading translation device and a lip-reading translation method based on 3D imaging technology. The lip-reading translation device comprises at least one 3D camera, an identification module, a processing module and an output module. The 3D camera is used to acquire a 3D image of a mouth; the identification module is used to identify feature points on the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points; the processing module is used to identify the mouth shape that matches the feature point combination; and the output module is used to output the information corresponding to the mouth shape matched by the 3D image. The device and method can recognize lip language and are convenient for users with speech impairments. They acquire images of lip movements more accurately, so that a clearer and more accurate model can be built and the lip language can be translated accurately.

Description

Lip-reading translation device and lip-reading translation method based on 3D imaging technology

Technical field

The present invention relates to a lip-reading translation device and a lip-reading translation method based on 3D imaging technology.

Background art

Lip reading means understanding what other people say by watching the movement of their lips as they speak. It is a very difficult skill that requires a great deal of practice, and some hearing-impaired people use it to communicate with others.

Recognizing lip language is very difficult.

Summary of the invention

The technical problem to be solved by the present invention is to overcome the defect in the prior art that lip language is difficult to recognize, by providing a lip-reading translation device and a lip-reading translation method that acquire images of lip movements more accurately, so that a clearer and more accurate model can be built and the lip language can be translated accurately.

The present invention solves the above technical problem through the following technical solutions:

A lip-reading translation device based on 3D imaging technology, characterized in that the lip-reading translation device comprises at least one 3D camera, an identification module, a processing module and an output module,

The 3D camera is used to acquire a 3D image of a mouth;

The identification module is used to identify feature points on the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points;

The processing module is used to identify the mouth shape that matches the feature point combination;

The output module is used to output the information corresponding to the mouth shape matched by the 3D image.

Preferably, the camera is a 3D camera comprising an infrared emitter and an infrared receiver, the infrared emitter being used to emit an infrared speckle field and the infrared receiver being used to receive the feedback of the infrared speckle field. The number of 3D cameras is 2: the first 3D camera shoots the mouth from the front, and the second 3D camera shoots the mouth from below the first 3D camera. The angle between the shooting directions of the first 3D camera and the second 3D camera is an acute angle. The lip-reading translation device further comprises a stitching module,

The stitching module is used to identify feature points on the 3D images acquired by the 2 3D cameras, and to stitch the 2 3D images together by making identical feature points coincide, so as to generate a 3D model;

The identification module is used to identify spatial feature points on the structure layer of the 3D model, each 3D model corresponding to a feature point combination formed by its own spatial feature points.

Preferably, the feature points on the structure layer include lip feature points, tooth feature points, tongue feature points and chin feature points, and the feature point combination records the relative positional relationships of all the feature points in 3D space.

Preferably, the lip-reading translation device further comprises a recording module and an adjustment module,

The 3D camera is used to acquire several 3D images of the mouth in time sequence;

The recording module is used to record the pinyin corresponding to the mouth shape matched by each 3D image;

The adjustment module is used to choose the Chinese character corresponding to each pinyin according to the context of all the pinyin;

The output module is used to output all the Chinese characters in the time sequence.

Preferably, the lip-reading translation device further comprises a training module,

The training module is used to perform model training for a target mouth shape using a database, the database comprising several 3D images whose meaning is the target mouth shape.

The present invention also provides a lip-reading translation method implemented using a lip-reading translation device, characterized in that the lip-reading translation device comprises at least one 3D camera, and the lip-reading translation method comprises:

The 3D camera acquires a 3D image of a mouth;

Identifying feature points on the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points;

Identifying the mouth shape that matches the feature point combination;

Outputting the information corresponding to the mouth shape matched by the 3D image.

Preferably, the camera is a 3D camera comprising an infrared emitter and an infrared receiver, the infrared emitter being used to emit an infrared speckle field and the infrared receiver being used to receive the feedback of the infrared speckle field. The number of 3D cameras is 2: the first 3D camera shoots the mouth from the front, and the second 3D camera shoots the mouth from below the first 3D camera. The angle between the shooting directions of the first 3D camera and the second 3D camera is an acute angle. The lip-reading translation method comprises:

Identifying feature points on the 3D images acquired by the 2 3D cameras, and stitching the 2 3D images together by making identical feature points coincide, so as to generate a 3D model;

Identifying spatial feature points on the structure layer of the 3D model, each 3D model corresponding to a feature point combination formed by its own spatial feature points.

Preferably, the lip-reading translation method comprises:

The 3D camera acquires several 3D images of the mouth in time sequence;

Recording the pinyin corresponding to the mouth shape matched by each 3D image;

Choosing the Chinese character corresponding to each pinyin according to the context of all the pinyin;

Outputting all the Chinese characters in the time sequence.

Preferably, the lip-reading translation method comprises:

Performing model training for a target mouth shape using a database, the database comprising several 3D images whose meaning is the target mouth shape.

On the basis of common knowledge in the art, the above preferred conditions can be combined arbitrarily to obtain the preferred embodiments of the present invention.

The positive effect of the present invention is that the lip-reading translation device and lip-reading translation method based on 3D imaging technology can recognize lip language and are convenient for users with speech impairments. They acquire images of lip movements more accurately, so that a clearer and more accurate model can be built and the lip language can be translated accurately.

Brief description of the drawings

Fig. 1 is a flow chart of the lip-reading translation method of Embodiment 1 of the present invention.

Fig. 2 is another flow chart of the lip-reading translation method of Embodiment 1 of the present invention.

Detailed description of the embodiments

The present invention is further illustrated below by way of embodiments, but the invention is not thereby limited to the scope of the described embodiments.

Embodiment 1

This embodiment provides a lip-reading translation device based on 3D imaging technology, the lip-reading translation device comprising 1 3D camera.

The camera is a 3D camera comprising an infrared emitter and an infrared receiver; the infrared emitter is used to emit an infrared speckle field, and the infrared receiver is used to receive the feedback of the infrared speckle field.

The lip-reading translation device further comprises an identification module, a processing module, an output module, a recording module, an adjustment module and a training module.

The 3D camera is used to acquire a 3D image of a mouth.

The identification module is used to identify feature points on the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points.

The processing module is used to identify the mouth shape that matches the feature point combination.

The 3D image comprises a structure layer and a pixel layer.

The present invention can distinguish mouth shapes by means of the spatial model of the mouth.

By identifying the feature points, the feature point combination of a mouth can be obtained; the feature point combinations of similar mouth shapes follow regular patterns and are very similar to one another.

In addition, the lip-reading translation device of this embodiment also has an automatic correction function, specifically:

The 3D camera is used to acquire several 3D images of the mouth in time sequence.

The recording module is used to record the pinyin corresponding to the mouth shape matched by each 3D image.

The adjustment module is used to choose the Chinese character corresponding to each pinyin according to the context of all the pinyin.

The recording module can obtain information on multiple mouth shapes in time sequence. Some mouth shapes are easy to recognize, so their meaning can be obtained accurately; other mouth shapes correspond to the pronunciation of several characters, and in that case the sentence can be obtained accurately through contextual analysis of the whole sentence.

By inputting a large number of 3D mouth-shape images corresponding to mouth images as training data, this embodiment can improve the accuracy of recognizing mouth shapes in 3D images.

Referring to Fig. 1, this embodiment also provides a lip-reading translation method using the lip-reading translation device of this embodiment, comprising:

Step 100: the 3D camera acquires a 3D image of a mouth;

Step 101: identify the feature points in the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points;

Step 102: identify the mouth shape that matches the feature point combination;

Step 103: output the information corresponding to the mouth shape matched by the 3D image.
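Steps 101 to 103 can be sketched for a single frame as follows. This is a minimal illustration under stated assumptions, not the implementation described in the patent: the feature points are assumed to be already identified as named 3D coordinates, and the template table `TEMPLATES`, its coordinates and the nearest-template rule are all hypothetical.

```python
import math

# Hypothetical templates: mouth shape label -> feature point combination.
# Coordinates are invented for illustration (arbitrary units).
TEMPLATES = {
    "a": {"upper_lip": (0.0, 1.5, 0.0), "lower_lip": (0.0, -1.5, 0.0), "chin": (0.0, -4.0, -0.5)},
    "u": {"upper_lip": (0.0, 0.5, 1.0), "lower_lip": (0.0, -0.5, 1.0), "chin": (0.0, -3.0, -0.5)},
}


def combination_distance(a, b):
    """Sum of Euclidean distances between same-named feature points."""
    return sum(math.dist(a[name], b[name]) for name in a)


def match_mouth_shape(points, templates=TEMPLATES):
    """Step 102: return the label of the template closest to the observed points."""
    return min(templates, key=lambda label: combination_distance(points, templates[label]))


def translate_frame(points):
    """Steps 102-103 for one frame whose feature points (step 101) are given."""
    label = match_mouth_shape(points)
    return label  # step 103: the information output for the matched mouth shape
```

A frame whose points lie close to the hypothetical "a" template is reported as "a"; a real system would need templates learned from data rather than hand-written values.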

Further, referring to Fig. 2, the lip-reading translation method of this embodiment comprises:

Step 200: the 3D camera acquires several 3D images of the mouth in time sequence.

Step 201: identify the feature points in each 3D image, each 3D image corresponding to a feature point combination formed by its own feature points.

Step 202: for each feature point combination, identify the mouth shape that matches it.

Step 203: record the pinyin corresponding to the mouth shape matched by each 3D image.

Step 204: choose the Chinese character corresponding to each pinyin according to the context of all the pinyin.

Step 205: output all the Chinese characters in the time sequence.
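Steps 203 to 205 can be illustrated with a small context-based selection. The patent does not say how the context is evaluated; the sketch below assumes a hypothetical candidate table `CANDIDATES` and hypothetical pair scores `BIGRAM`, and uses a Viterbi-style search to pick the character sequence with the highest total score.

```python
# Hypothetical lexicon: pinyin syllable -> candidate Chinese characters.
CANDIDATES = {"ni": ["泥", "你"], "hao": ["号", "好"], "ma": ["马", "吗"]}
# Hypothetical context scores for adjacent character pairs (higher = more plausible).
BIGRAM = {("你", "好"): 3.0, ("好", "吗"): 2.0, ("泥", "号"): 0.1}


def choose_characters(pinyin_seq, candidates=CANDIDATES, bigram=BIGRAM):
    """Pick one character per syllable so that the summed pair score is maximal."""
    if not pinyin_seq:
        return ""
    # best[c] = (score of the best path ending in character c, that path)
    best = {c: (0.0, [c]) for c in candidates[pinyin_seq[0]]}
    for syllable in pinyin_seq[1:]:
        nxt = {}
        for c in candidates[syllable]:
            # Find the predecessor that gives the best score when followed by c.
            prev, (score, path) = max(
                best.items(),
                key=lambda kv: kv[1][0] + bigram.get((kv[0], c), 0.0),
            )
            nxt[c] = (score + bigram.get((prev, c), 0.0), path + [c])
        best = nxt
    # Step 205: output the characters in their original time order.
    return "".join(max(best.values(), key=lambda sp: sp[0])[1])
```

With these invented scores, the syllables `ni hao ma` resolve to 你好吗 because the pairs 你好 and 好吗 score higher than any alternative.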

The lip-reading translation method of this embodiment further comprises: performing model training for a target mouth shape using a database, the database comprising several 3D images whose meaning is the target mouth shape.

The lip-reading translation device and lip-reading translation method based on 3D imaging technology of this embodiment can recognize lip language and are convenient for users with speech impairments. They acquire images of lip movements more accurately, so that a clearer and more accurate model can be built and the lip language can be translated accurately.
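The patent does not name a training algorithm. As one simple stand-in, the sketch below averages the feature vectors of all database samples labelled with a mouth shape into a template (a nearest-centroid classifier). The vector layout and the function names `train_template`, `train_all` and `classify` are assumptions made for illustration.

```python
def train_template(samples):
    """Average equal-length feature vectors from the database into one template."""
    n = len(samples)
    return [sum(values) / n for values in zip(*samples)]


def train_all(database):
    """database: mouth shape label -> list of feature vectors labelled with it."""
    return {label: train_template(vectors) for label, vectors in database.items()}


def classify(vector, templates):
    """Return the label whose template is closest in squared Euclidean distance."""
    def sq_dist(template):
        return sum((a - b) ** 2 for a, b in zip(vector, template))
    return min(templates, key=lambda label: sq_dist(templates[label]))
```

Adding more labelled 3D images for a target mouth shape moves its template toward the average of the observed samples, which is one way a larger database can improve recognition.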

Embodiment 2

This embodiment is substantially the same as Embodiment 1, differing only in the following:

The camera is a 3D camera comprising an infrared emitter and an infrared receiver; the infrared emitter is used to emit an infrared speckle field, and the infrared receiver is used to receive the feedback of the infrared speckle field. The number of 3D cameras is 2: the first 3D camera shoots the mouth from the front, and the second 3D camera shoots the mouth from below the first 3D camera. The angle between the shooting directions of the first 3D camera and the second 3D camera is 30 degrees. The first camera and the second camera are set at an angle so that changes in the chin can be observed, from which the shape of the tongue can be inferred.

The lip-reading translation device further comprises a stitching module,

The processing module is used to identify the mouth shape that matches the above feature point combination.

The spatial feature points can be obtained by first finding feature points in the pixel layer (including the lip contour, the lines of the triangular area and the philtrum) and then taking the structure-layer feature points corresponding to those pixel-layer feature points as the spatial feature points.
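The mapping from a pixel-layer feature point to a structure-layer point can be sketched as follows. The patent does not specify the data layout; this sketch assumes the structure layer is a depth map registered pixel-for-pixel with the pixel layer and uses a pinhole camera model whose intrinsics `fx`, `fy`, `cx`, `cy` are hypothetical parameters.

```python
def to_spatial_point(pixel_rc, depth_map, fx, fy, cx, cy):
    """Map a pixel-layer feature at (row, col) to a 3D point (x, y, z).

    Assumes depth_map[row][col] holds the distance along the optical axis
    for the same pixel (an assumption, not stated in the patent).
    """
    row, col = pixel_rc
    z = depth_map[row][col]
    x = (col - cx) * z / fx  # horizontal offset from the principal point
    y = (row - cy) * z / fy  # vertical offset from the principal point
    return (x, y, z)
```

A feature found at the principal point maps to a point directly on the optical axis, at the depth stored in the structure layer for that pixel.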

Alternatively, the spatial feature points can be identified directly from the structure layer, taking structural features such as protrusions and recesses as the spatial feature points.
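One simple way to find protrusions and recesses directly in the structure layer is to look for strict local extrema. The sketch below assumes the structure layer is a 2D depth map in which a smaller value means the surface is closer to the camera; that convention and the function name `find_peak_points` are assumptions.

```python
def find_peak_points(depth_map):
    """Return (row, col, kind) for strict local extrema among the 8 neighbours.

    Assumption: smaller depth = closer to the camera, so a local minimum of
    depth is a protrusion and a local maximum is a recess.
    """
    peaks = []
    rows, cols = len(depth_map), len(depth_map[0])
    for r in range(1, rows - 1):
        for c in range(1, cols - 1):
            centre = depth_map[r][c]
            neighbours = [
                depth_map[r + dr][c + dc]
                for dr in (-1, 0, 1)
                for dc in (-1, 0, 1)
                if (dr, dc) != (0, 0)
            ]
            if all(centre < n for n in neighbours):
                peaks.append((r, c, "protrusion"))
            elif all(centre > n for n in neighbours):
                peaks.append((r, c, "recess"))
    return peaks
```

Border cells are skipped because they lack a full neighbourhood; real depth data would normally be smoothed first so that sensor noise does not produce spurious extrema.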

The feature points on the structure layer include lip feature points, tooth feature points, tongue feature points and chin feature points, and the feature point combination records the relative positional relationships of all the feature points in 3D space.
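One possible encoding of "relative positional relationships in 3D space" is the set of pairwise distances between named feature points, which does not change when the head moves or turns. The patent does not prescribe this encoding; the function name `feature_combination` and the point names are hypothetical.

```python
import math
from itertools import combinations


def feature_combination(points):
    """points: name -> (x, y, z). Returns {(name_a, name_b): distance} for all pairs."""
    names = sorted(points)
    return {
        (a, b): math.dist(points[a], points[b])
        for a, b in combinations(names, 2)
    }
```

Because only distances between points are stored, shifting every point by the same offset yields the same combination.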

The 3D model can also be generated by identifying at least 3 peak points on the structure layers of the 3D images and then stitching the structure layers of the 3D images together by making identical peak points coincide. The peak points include convex points and concave points, and at least 3 peak points coincide.
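Three non-collinear matching peak points are enough to fix a rigid transform between two 3D images. The sketch below builds an orthonormal frame from each triple and derives the rotation and translation that make the triples coincide. It assumes exact, noise-free correspondences; with noisy data a least-squares fit over more points would be the usual choice. All function names are hypothetical.

```python
import math


def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))


def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def _unit(v):
    n = math.sqrt(sum(x * x for x in v))
    if n == 0:
        raise ValueError("peak points must be distinct and non-collinear")
    return tuple(x / n for x in v)


def _frame(p0, p1, p2):
    """Orthonormal basis attached to a triple of points."""
    e1 = _unit(_sub(p1, p0))
    e3 = _unit(_cross(e1, _sub(p2, p0)))
    e2 = _cross(e3, e1)
    return (e1, e2, e3)


def rigid_transform_from_three_points(src, dst):
    """Return (R, t) such that R @ src[i] + t == dst[i] for congruent triples."""
    fs, fd = _frame(*src), _frame(*dst)
    # R maps each source basis vector onto the matching destination basis vector.
    R = [[sum(fd[k][i] * fs[k][j] for k in range(3)) for j in range(3)]
         for i in range(3)]
    rotated_p0 = tuple(sum(R[i][j] * src[0][j] for j in range(3)) for i in range(3))
    t = _sub(dst[0], rotated_p0)
    return R, t


def apply_transform(R, t, p):
    """Move a point from the source image into the destination image's frame."""
    return tuple(sum(R[i][j] * p[j] for j in range(3)) + t[i] for i in range(3))
```

Once `R` and `t` are known, every point of the second camera's structure layer can be passed through `apply_transform` to bring it into the first camera's coordinate frame before the two surfaces are merged.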

Correspondingly, the lip-reading translation method of this embodiment comprises:

Identifying feature points on the 3D images acquired by the 2 3D cameras, and stitching the 2 3D images together by making identical feature points coincide, so as to generate a 3D model.

Identifying the mouth shape that matches the feature point combination.

Outputting the information corresponding to the mouth shape matched by the 3D image.

The lip-reading translation device of this embodiment can obtain a complete 3D model of the mouth. The combination of spatial feature points of the 3D model corresponds better to the mouth shape, which makes the lip-reading translation more accurate.

Although specific embodiments of the present invention have been described above, those skilled in the art will understand that they are merely illustrative and that the protection scope of the present invention is defined by the appended claims. Those skilled in the art may make many changes and modifications to these embodiments without departing from the principle and substance of the present invention, and all such changes and modifications fall within the protection scope of the present invention.

Claims

1. A lip-reading translation device based on 3D imaging technology, characterized in that the lip-reading translation device comprises at least one 3D camera, an identification module, a processing module and an output module,

The 3D camera is used to acquire a 3D image of a mouth;

The identification module is used to identify feature points on the 3D image, each 3D image corresponding to a feature point combination formed by its own feature points;

2. The lip-reading translation device according to claim 1, characterized in that the camera is a 3D camera comprising an infrared emitter and an infrared receiver, the infrared emitter being used to emit an infrared speckle field and the infrared receiver being used to receive the feedback of the infrared speckle field; the number of 3D cameras is 2, the first 3D camera shooting the mouth from the front and the second 3D camera shooting the mouth from below the first 3D camera; the angle between the shooting directions of the first 3D camera and the second 3D camera is an acute angle; and the lip-reading translation device further comprises a stitching module,

The stitching module is used to identify feature points on the 3D images acquired by the 2 3D cameras, and to stitch the 2 3D images together by making identical feature points coincide, so as to generate a 3D model;

The identification module is used to identify spatial feature points on the structure layer of the 3D model, each 3D model corresponding to a feature point combination formed by its own spatial feature points.

3. The lip-reading translation device according to claim 2, characterized in that the feature points on the structure layer include lip feature points, tooth feature points, tongue feature points and chin feature points, and the feature point combination records the relative positional relationships of all the feature points in 3D space.

4. The lip-reading translation device according to claim 1, characterized in that the lip-reading translation device further comprises a recording module and an adjustment module,

5. The lip-reading translation device according to claim 1, characterized in that the lip-reading translation device further comprises a training module,

The training module is used to perform model training for a target mouth shape using a database, the database comprising several 3D images whose meaning is the target mouth shape.

6. A lip-reading translation method implemented using a lip-reading translation device, characterized in that the lip-reading translation device comprises at least one 3D camera, and the lip-reading translation method comprises:

The 3D camera acquires a 3D image of a mouth;

Identifying the mouth shape that matches the feature point combination;

Outputting the information corresponding to the mouth shape matched by the 3D image.

7. The lip-reading translation method according to claim 6, characterized in that the camera is a 3D camera comprising an infrared emitter and an infrared receiver, the infrared emitter being used to emit an infrared speckle field and the infrared receiver being used to receive the feedback of the infrared speckle field; the number of 3D cameras is 2, the first 3D camera shooting the mouth from the front and the second 3D camera shooting the mouth from below the first 3D camera; the angle between the shooting directions of the first 3D camera and the second 3D camera is an acute angle; and the lip-reading translation method comprises:

Identifying feature points on the 3D images acquired by the 2 3D cameras, and stitching the 2 3D images together by making identical feature points coincide, so as to generate a 3D model;

Identifying spatial feature points on the structure layer of the 3D model, each 3D model corresponding to a feature point combination formed by its own spatial feature points.

8. The lip-reading translation method according to claim 7, characterized in that the feature points on the structure layer include lip feature points, tooth feature points, tongue feature points and chin feature points, and the feature point combination records the relative positional relationships of all the feature points in 3D space.

9. The lip-reading translation method according to claim 6, characterized in that the lip-reading translation method comprises:

The 3D camera acquires several 3D images of the mouth in time sequence;

Recording the pinyin corresponding to the mouth shape matched by each 3D image;

Outputting all the Chinese characters in the time sequence.

10. The lip-reading translation method according to claim 6, characterized in that the lip-reading translation method comprises:

Performing model training for a target mouth shape using a database, the database comprising several 3D images whose meaning is the target mouth shape.