CN105338282B - A kind of information processing method and electronic equipment - Google Patents

A kind of information processing method and electronic equipment Download PDF

Info

Publication number
CN105338282B
CN105338282B CN201410283199.3A CN201410283199A CN105338282B CN 105338282 B CN105338282 B CN 105338282B CN 201410283199 A CN201410283199 A CN 201410283199A CN 105338282 B CN105338282 B CN 105338282B
Authority
CN
China
Prior art keywords
information
image information
image
electronic equipment
lip
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410283199.3A
Other languages
Chinese (zh)
Other versions
CN105338282A (en
Inventor
张磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CN201410283199.3A priority Critical patent/CN105338282B/en
Publication of CN105338282A publication Critical patent/CN105338282A/en
Application granted granted Critical
Publication of CN105338282B publication Critical patent/CN105338282B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses a kind of information processing methods, are applied to the first electronic equipment, the electronic equipment includes image acquisition units, which comprises carry out Image Acquisition by operating body of the image acquisition units to the first electronic equipment, obtain the first image information;The image information for extracting at least one predetermined sub-region in the first image information is analyzed, and the second image information is obtained, and parses the second image information, generates the first information of corresponding first image information;The first information and the first image information are sent to the second electronic equipment.The invention also discloses a kind of electronic equipment.

Description

A kind of information processing method and electronic equipment
Technical field
The present invention relates to video communication technology more particularly to a kind of information processing methods and electronic equipment.
Background technique
Current video communication using video communication that is very extensive, generalling use at present are as follows: an electronic equipment Acquisition is sent to the opposite end electronic equipment of video communication just after the image and voice information of the user of video communication, described right End electronic equipment performs image display after receiving image and voice information and voice broadcast.And under some scenes, user needs Video communication is carried out under mute environment, that is to say, that electronic equipment will not acquire voice messaging or can not collect language Message breath;So, in such a scenario, how to guarantee normal video communication, i.e. the guarantee normal language of video communication both sides Interaction, this is a technical problem to be solved urgently.
Summary of the invention
To solve existing technical problem, an embodiment of the present invention is intended to provide a kind of information processing methods and electronics to set It is standby.
The embodiment of the invention provides a kind of information processing methods, are applied to the first electronic equipment, the electronic equipment packet Include image acquisition units, which comprises
Image Acquisition is carried out by operating body of the described image acquisition unit to first electronic equipment, obtains the first figure As information;
The image information for extracting at least one predetermined sub-region in the first image information is analyzed, and the second figure is obtained As information, second image information is parsed, generates the first information of corresponding the first image information;
The first information and the first image information are sent to the second electronic equipment.
The embodiment of the invention also provides a kind of information processing methods, are applied to the second electronic equipment, which comprises
The first image information and corresponding with the first image information first for receiving the transmission of the first electronic equipment are believed Breath;
The first image information, and the mould according to locating for second electronic equipment are shown by display broadcast unit The first information of match-type is played simultaneously in formula.
The embodiment of the invention also provides a kind of first electronic equipments, comprising:
Image acquisition units carry out Image Acquisition for the operating body to first electronic equipment, obtain the first image Information;
Information acquisition unit, for extract the image information of at least one predetermined sub-region in the first image information into Row analysis, obtains the second image information, parses second image information, generates the first letter of corresponding the first image information Breath;
Information transmitting unit, for the first information and the first image information to be sent to the second electronic equipment.
The embodiment of the invention also provides a kind of second electronic equipments, comprising:
Information receiving unit, for receive the first electronic equipment transmission the first image information and with first figure As the corresponding first information of information;
Broadcast unit is shown, for showing the first image information, and the mould according to locating for second electronic equipment The first information of match-type is played simultaneously in formula.
A kind of information processing method and electronic equipment provided by the embodiment of the present invention, being in electronic equipment can not acquire In the case where voice, it still is able to turn over the lip reading expression of the user of electronic equipment by the lip reading identification technology to acquisition image It is translated into corresponding text and/or voice messaging is sent to the opposite end of video communication, make the user of opposite end it will be appreciated that electronic equipment User semantic meaning representation, to guarantee being normally carried out for video communication.It is logical that the embodiment of the present invention proposes a kind of new video Letter mode is a kind of completely new experience.
Detailed description of the invention
Fig. 1 is the flow chart of the information processing method of the embodiment of the present invention one;
Fig. 2 is the flow chart of the information processing method of the embodiment of the present invention two;
Fig. 3 is the composed structure schematic diagram of the first electronic equipment of the embodiment of the present invention three;
Fig. 4 is the composed structure schematic diagram of the second electronic equipment of the embodiment of the present invention four;
Fig. 5 is the composed structure schematic diagram of the electronic equipment of the embodiment of the present invention five.
Specific embodiment
The technical solution of the present invention is further elaborated in the following with reference to the drawings and specific embodiments.
Embodiment one
To realize the normal video communication under squelch, the embodiment of the present invention one provides a kind of information processing method, This method is applied in the first electronic equipment, and first electronic equipment includes image acquisition units;In the present embodiment, first Electronic equipment refers in particular to the electronic equipment of the Image Acquisition transmitting terminal in video communication.As shown in Figure 1, this method comprises:
Step 101, Image Acquisition is carried out by operating body of the image acquisition units to the first electronic equipment, obtains the first figure As information.
The operating body of first electronic equipment refers to and operates the user that first electronic equipment carries out video communication. First electronic equipment carries out figure to the user for operating the first electronic equipment progress video communication by its image acquisition units As acquisition, the first image information is obtained.It should be noted that image acquisition units are mainly the face-image letter for acquiring user Breath, that is to say, that mainly include the facial image information of user in the first image information.
Step 102, it extracts the image information of at least one predetermined sub-region in the first image information to be analyzed, obtains the Two image informations parse the second image information, generate the first information of corresponding first image information.
Preferably, the first electronic equipment can carry out sub-zone dividing to the first image information, and from having divided subregion The first image information in extract the image information of at least one predetermined sub-region;It is special that image is carried out to extracted image information Sign analysis and lip image recognition, and according to described image signature analysis and lip image recognition as a result, generating the second image Information.Mainly include in second image information is the lip image information identified.
Wherein, the image information of at least one predetermined sub-region is extracted from the first image information for having divided subregion, It can be realized using mode below: facial characteristics identification being carried out to the first image information for having divided subregion, extracts and knows The not image information of subregion shared by gained facial characteristics.
Preferably, the second image information of the parsing, generates the first information of corresponding first image information, comprising: according to Second image information inquires scheduled Image Database, obtain in Image Database with the matched third image of the second image information Information, and obtain the corresponding first information of third image information;It wherein, include third image information and first in Image Database The mapping relations of information.
Wherein, the first information is the first information of literal type and/or the first information of sound-type, that is to say, that institute It states in Image Database either include the mapping relations of third image information and corresponding text information, is also possible to include the The mapping relations of three image informations and corresponding voice messaging can also be including third image information and corresponding text information and language The mapping relations of message breath.Third image information in described image information bank refers to lip image corresponding to various lip readings Information, that is to say, that is stored in described image information bank is the mapping relations of various lip readings with corresponding text and/or pronunciation. So, according to the second image information query image information bank, can obtain lip reading corresponding with the second image information text and/ Or pronunciation.
Step 103, the first information and the first image information are sent to the second electronic equipment.
Preferably, the first electronic equipment mode according to locating for the second electronic equipment, if the second electronic equipment is in the One mode then sends the first information of the first image information and literal type to the second electronic equipment;
If the second electronic equipment is in second mode, the first image information and voice class are sent to the second electronic equipment The first information of type;
If the second electronic equipment is in the third mode, the first image information, Yi Jiwen are sent to the second electronic equipment The first information of word type and sound-type.
Wherein, in the flrst mode, the second electronic equipment allows playing the first image information while showing corresponding text Word information, such as the second electronic equipment are under squelch;Under the second mode, the second electronic equipment allows playing the first figure As information plays corresponding voice messaging simultaneously;In a third mode, the second electronic equipment allows playing the first image information It shows corresponding text information simultaneously and plays corresponding voice messaging.Mode locating for second electronic equipment, can be by first Electronic equipment obtains from the second electronic equipment, specifically, can be the second electronic equipment is actively supplied to the first electronic equipment, It is also possible to the second electronic equipment and the first electronic equipment is supplied to according to the request of the first electronic equipment.
Preferably, described be sent to the second electronic equipment for the first information and the first image information, comprising: will be continuously every First image information of one frame and its corresponding first information are sent to the second electronic equipment.That is, in video communication Cheng Zhong, the first electronic equipment are the first image informations of continuous acquisition multiframe, then, for the first figure of each frame of acquisition As information, the first electronic equipment requires to execute the operation in abovementioned steps 102 and 103, so that the first electronic equipment is sent to Second electronic equipment be continuous multiple frames the first image information and institute corresponding with the first image information of each frame State the first information.
Embodiment one through the invention, in the case where the first electronic equipment, which is in, to acquire voice, the first electronics Equipment still is able to through the lip reading identification technology to acquisition image, and the lip reading expression of the user of the first electronic equipment is translated as phase The text and/or voice messaging answered are sent to the second electronic equipment, make the user of the second electronic equipment it will be appreciated that the first electronics The semantic meaning representation of the user of equipment, to guarantee being normally carried out for video communication.
Embodiment two
The embodiment of the present invention two additionally provides a kind of information processing method, is applied to the second electronic equipment, in this implementation In example, the second electronic equipment refers in particular to the electronic equipment that the image information in video communication receives display end.As shown in Fig. 2, the party Method includes:
Step 201, the first image information and corresponding with the first image information that the first electronic equipment is sent are received One information.
The received first information of second electronic equipment can be text information corresponding with the first image information and/or voice Information.In implementation process, the first electronic equipment sends the what type of first information to the second electronic equipment, depending on concrete condition Depending on.
Step 202, by showing that broadcast unit shows the first image information, and the mould according to locating for the second electronic equipment The first information of match-type is played simultaneously in formula.
Preferably, the second electronic equipment needs to judge itself current place after receiving the first image information and the first information In which kind of video communication mode;If the second electronic equipment is in first mode, the second electronic equipment is broadcast except through display It puts unit and shows the first image information, also simultaneous display text information corresponding with the first image information;If the second electronics is set Standby to be in second mode, then the second electronic equipment shows the first image information except through display broadcast unit, is also played simultaneously Voice messaging corresponding with the first image information;If the second electronic equipment is in the third mode, the second electronic equipment in addition to By showing that broadcast unit shows the first image information, simultaneous display text information corresponding with the first image information is gone back, and same Step plays voice messaging corresponding with the first image information.
Preferably, the first information for playing match-type, comprising: the first figure of continuous each frame based on the received It is continuous to play the first information corresponding with the first image information of each frame as information and the first information.That is, regarding In frequency communication process, the first electronic equipment is the first image information of continuous acquisition multiframe, then, for each frame of acquisition The first image information, the first electronic equipment require execute abovementioned steps 102 and 103 in operation, so that the first electronics is set What preparation gave the second electronic equipment is the first image information of continuous multiple frames and distinguishes with the first image information of each frame The corresponding first information;So correspondingly, the second electronic equipment it is received be also successive frame the first image information and The first information, the second electronic equipment need to show the first image information of successive frame, and continuously broadcasting and the first of each frame The corresponding first information of image information.
Embodiment two through the invention, in the case where the first electronic equipment, which is in, to acquire voice, the first electronics Equipment still is able to through the lip reading identification technology to acquisition image, and the lip reading expression of the user of the first electronic equipment is translated as phase The text and/or voice messaging answered are sent to the second electronic equipment, and the user of the second electronic equipment also can be based on the received Text and/or voice messaging understand the semantic meaning representation of the user of the first electronic equipment, to guarantee being normally carried out for video communication.
Embodiment three
The information processing method of the corresponding embodiment of the present invention one, the embodiment of the present invention three additionally provide a kind of first electronics Equipment, first electronic equipment refer to the electronic equipment of the Image Acquisition transmitting terminal in video communication.As shown in figure 3, this One electronic equipment includes:
Image acquisition units 10 carry out Image Acquisition for the operating body to the first electronic equipment, obtain the first image letter Breath;
Information acquisition unit 20, the image information for extracting at least one predetermined sub-region in the first image information carry out Analysis obtains the second image information, parses the second image information, generates the first information of corresponding first image information;
Information transmitting unit 30, for the first information and the first image information to be sent to the second electronic equipment.
Preferably, information acquisition unit 20 is further used for, sub-zone dividing carried out to the first image information, and from having drawn The image information of at least one predetermined sub-region is extracted in first image information of molecular domains;To extracted image information into Row image characteristic analysis and lip image recognition, and according to image characteristic analysis and lip image recognition as a result, generating second Image information.Mainly include in second image information is the lip image information identified.
Wherein, the image information of at least one predetermined sub-region is extracted from the first image information for having divided subregion, It can be realized using mode below: facial characteristics identification being carried out to the first image information for having divided subregion, extracts and knows The not image information of subregion shared by gained facial characteristics.
Preferably, information acquisition unit 20 is further used for, scheduled Image Database is inquired according to the second image information, It obtains in Image Database with the matched third image information of the second image information, and obtains third image information corresponding first Information;
It wherein, include the mapping relations of third image information and the first information in Image Database.
Third image information in Image Database refers to lip image information corresponding to various lip readings, that is to say, that What is stored in described image information bank is the mapping relations of various lip readings with corresponding text and/or pronunciation.So, according to the second figure As information query image information bank, text and/or the pronunciation of lip reading corresponding with the second image information can be obtained.
Preferably, the shown first information is the first information of literal type and/or the first information of sound-type;
The information transmitting unit 30 is further used for,
It is set when the second electronic equipment is in first mode to the second electronics according to mode locating for the second electronic equipment Preparation send the first information of the first image information and corresponding literal type;
When the second electronic equipment is in second mode, the second electronic equipment of Xiang Suoshu send the first image information and The first information of corresponding sound-type;
When second electronic equipment is in the third mode, the first image information of the second electronic equipment of Xiang Suoshu transmission, And the first information of corresponding text and sound-type.
Wherein, in the flrst mode, the second electronic equipment allows playing the first image information while showing corresponding text Word information, such as the second electronic equipment are under squelch;Under the second mode, the second electronic equipment allows playing the first figure As information plays corresponding voice messaging simultaneously;In a third mode, the second electronic equipment allows playing the first image information It shows corresponding text information simultaneously and plays corresponding voice messaging.Mode locating for second electronic equipment, can be by first Electronic equipment obtains from the second electronic equipment, specifically, can be the second electronic equipment is actively supplied to the first electronic equipment, It is also possible to the second electronic equipment and the first electronic equipment is supplied to according to the request of the first electronic equipment.
Preferably, information transmitting unit 30 is further used for, by the first image information and its correspondence of continuous each frame The first information be sent to the second electronic equipment.
Embodiment three through the invention, in the case where the first electronic equipment, which is in, to acquire voice, the first electronics Equipment still is able to through the lip reading identification technology to acquisition image, and the lip reading expression of the user of the first electronic equipment is translated as phase The text and/or voice messaging answered are sent to the second electronic equipment, make the user of the second electronic equipment it will be appreciated that the first electronics The semantic meaning representation of the user of equipment, to guarantee being normally carried out for video communication.
It should be noted that above-mentioned image acquisition units 10 can realize that information obtains by the camera of the first electronic equipment Take unit 20 can be by central processing unit (CPU, Central Processing Unit), the microprocessor of the first electronic equipment (MPU, Micro Processing Unit), digital signal processor (DSP, Digital Signal Processor) can Programmed logic array (PLA) (FPGA, Field-Programmable Gate Array) realizes that information transmitting unit 30 can be by the The communication function chip of one electronic equipment is realized.
Example IV
The information processing method of the corresponding embodiment of the present invention two, the embodiment of the present invention four additionally provide a kind of second electronics Equipment, the second electronic equipment refer to that the image information in video communication receives the electronic equipment of display end.As shown in figure 4, this Two electronic equipments include:
Information receiving unit 40, for receive the first electronic equipment transmission the first image information and with the first image The corresponding first information of information;
Show broadcast unit 50, it is synchronous for showing the first image information, and the mode according to locating for the second electronic equipment Play the first information of match-type.
Wherein, the received first information of the second electronic equipment can be text information corresponding with the first image information and/ Or voice messaging.In implementation process, the first electronic equipment sends the what type of first information to the second electronic equipment, depending on tool Depending on body situation.
Preferably, display broadcast unit 50 is further used for,
When the second electronic equipment is in first mode, the first information of synchronization display words type;
When the second electronic equipment is in second mode, the first information of sound-type is played simultaneously;
When the second electronic equipment is in the third mode, the first information of synchronization display words type, and language is played simultaneously The first information of sound type.
That is, showing broadcast unit 50 after information receiving unit 40 receives the first image information and the first information Need to judge which kind of video communication mode the second electronic equipment is currently at;If the second electronic equipment is in first mode, Display broadcast unit 50 goes back simultaneous display text information corresponding with the first image information in addition to showing the first image information;Such as The second electronic equipment of fruit is in second mode, then shows broadcast unit 50 in addition to showing the first image information, be also played simultaneously with The corresponding voice messaging of first image information;If the second electronic equipment is in the third mode, show broadcast unit 50 in addition to It shows the first image information, also simultaneous display text information corresponding with the first image information, and is played simultaneously and the first image The corresponding voice messaging of information.
Preferably, display broadcast unit 50 is further used for, the first image information of continuous each frame based on the received And the first information, it is continuous to play the first information corresponding with the first image information of each frame.
Example IV through the invention, in the case where the first electronic equipment, which is in, to acquire voice, the first electronics Equipment still is able to through the lip reading identification technology to acquisition image, and the lip reading expression of the user of the first electronic equipment is translated as phase The text and/or voice messaging answered are sent to the second electronic equipment, and the user of the second electronic equipment also can be based on the received Text and/or voice messaging understand the semantic meaning representation of the user of the first electronic equipment, to guarantee being normally carried out for video communication.
It should be noted that above- mentioned information receiving unit 40 can be realized by the communication function chip of the second electronic equipment, Display broadcast unit 50 can be realized jointly by the display and CPU, MPU, DSP or FPGA of the second electronic equipment.
It should also be noted that, in practical applications, an electronic equipment in video communication should usually acquire local terminal Image information be sent to opposite end, also to receive opposite end transmission image information shown in local terminal;That is that is, practical An electronic equipment in can be provided simultaneously with the function and example IV of first electronic equipment of above-described embodiment three The second electronic functionalities.
Embodiment five
The present embodiment provides a kind of electronic equipment for being provided simultaneously with the first electronic functionalities and the second electronic functionalities, As shown in figure 5, the electronic equipment includes:
Image acquisition units 10 carry out Image Acquisition for the operating body to video communication local terminal, obtain the first image letter Breath;
Information acquisition unit 20, the image information for extracting at least one predetermined sub-region in the first image information carry out Analysis obtains the second image information, parses the second image information, generates the first information of corresponding first image information;
Information transmitting unit 30, the electronics for the first information and the first image information to be sent to video communication opposite end are set It is standby.
Preferably, information acquisition unit 20 is further used for, sub-zone dividing carried out to the first image information, and from having drawn The image information of at least one predetermined sub-region is extracted in first image information of molecular domains;To extracted image information into Row image characteristic analysis and lip image recognition, and according to image characteristic analysis and lip image recognition as a result, generating second Image information.Mainly include in second image information is the lip image information identified.
Preferably, information acquisition unit 20 is further used for, scheduled Image Database is inquired according to the second image information, It obtains in Image Database with the matched third image information of the second image information, and obtains third image information corresponding first Information;
It wherein, include the mapping relations of third image information and the first information in Image Database.
Preferably, the shown first information is the first information of literal type and/or the first information of sound-type;
The information transmitting unit 30 is further used for,
According to mode locating for the electronic equipment of video communication opposite end, when opposite end is in first mode, sent to opposite end The first information of first image information and corresponding literal type;
When opposite end is in second mode, the first letter of the first image information and corresponding sound-type is sent to opposite end Breath;
When opposite end is in the third mode, the first image information and corresponding text and sound-type are sent to opposite end The first information.
Preferably, information transmitting unit 30 is further used for, by the first image information and its correspondence of continuous each frame The first information be sent to the electronic equipment of video communication opposite end.
The electronic equipment further include:
Information receiving unit 40, for receive video communication opposite end electronic equipment send the first image information and The first information corresponding with the first image information;
It shows broadcast unit 50, for showing the first image information, and the mode according to locating for opposite end, matching is played simultaneously The first information of type.
Preferably, display broadcast unit 50 is further used for,
When opposite end is in first mode, the first information of synchronization display words type;
When opposite end is in second mode, the first information of sound-type is played simultaneously;
When opposite end is in the third mode, the first information of synchronization display words type, and sound-type is played simultaneously The first information.
Preferably, display broadcast unit 50 is further used for, the first image information of continuous each frame based on the received And the first information, it is continuous to play the first information corresponding with the first image information of each frame.
It should be noted that above-mentioned image acquisition units 10 can be realized by the camera of electronic equipment, acquisition of information list Member 20 can realize by CPU, MPU, DSP or FPGA of electronic equipment, and information transmitting unit 30 and information receiving unit 40 can be with By electronic equipment communication function chip realize, display broadcast unit 50 can by electronic equipment display and CPU, MPU, DSP or FPGA are realized jointly.
It illustrates again below and the information processing method and electronic equipment of the embodiment of the present invention is further elaborated on.
Embodiment six
In the embodiment of the present invention six, the first electronic equipment is under squelch, i.e., not can be carried out voice collecting, the Two electronic equipments are in first mode, and Image Database is preset in the first electronic equipment, include that lip reading is believed in Image Database The mapping relations of breath and corresponding text information.Information processing method in video communication specifically includes that
1, the first electronic equipment carries out Image Acquisition to this end subscriber by its image acquisition units, obtains the first image letter Breath;For the first image information of each frame of acquisition, the image letter of at least one predetermined sub-region in the first image information is extracted Breath carries out image characteristic analysis and lip image recognition, obtains the lip image information identified;According to the lip figure identified As the scheduled Image Database of information inquiry, obtaining the lip reading information to match in image data base (is also each of expression lip Kind of image information) corresponding to text information;
2, the first electronic equipment knows that the second electronic equipment is in first mode, thus by the first figure of continuous each frame As information and its corresponding text information are sent to the second electronic equipment;Alternatively, the first electronic equipment is without knowing the second electronics Mode locating for equipment, and the first image information of continuous each frame and its corresponding text information are directly sent to second Electronic equipment;
3, the second electronic equipment receives the first image information and text information from the first electronic equipment, judges the second electricity Sub- equipment is currently at first mode, thus show that broadcast unit continuously shows the first image information of each frame by it, And simultaneous display text information corresponding with the first image information of each frame.
Embodiment seven
In the embodiment of the present invention seven, the first electronic equipment is under squelch, i.e., not can be carried out voice collecting, the Two electronic equipments are in second mode, and Image Database is preset in the first electronic equipment, include that lip reading is believed in Image Database The mapping relations of breath and corresponding text information.Information processing method in video communication specifically includes that
1, the first electronic equipment carries out Image Acquisition to this end subscriber by its image acquisition units, obtains the first image letter Breath;For the first image information of each frame of acquisition, the image letter of at least one predetermined sub-region in the first image information is extracted Breath carries out image characteristic analysis and lip image recognition, obtains the lip image information identified;According to the lip figure identified As the scheduled Image Database of information inquiry, obtaining the lip reading information to match in image data base (is also each of expression lip Kind of image information) corresponding to text information;
2, the first electronic equipment knows that the second electronic equipment is in second mode, thus by the first image of each frame of correspondence The text information of information is converted to corresponding voice messaging, and by the first image information of continuous each frame and its corresponding language Message breath is sent to the second electronic equipment;Alternatively, the first electronic equipment is not necessarily to know mode locating for the second electronic equipment, and it is straight It connects and the first image information of continuous each frame and its corresponding text information is sent to the second electronic equipment, the second electronics is set It is standby to judge that itself is in second mode, so that the text information of the first image information of each frame of the correspondence received is converted to phase The voice messaging answered;
3, the second electronic equipment is in second mode, shows that broadcast unit continuously shows the first figure of each frame by it As information, and voice messaging corresponding with the first image information of each frame is played simultaneously.
Embodiment eight
In the embodiment of the present invention eight, the first electronic equipment is under squelch, i.e., not can be carried out voice collecting, the Two electronic equipments are in the third mode, and Image Database is preset in the first electronic equipment, include that lip reading is believed in Image Database The mapping relations of breath and corresponding text information.Information processing method in video communication specifically includes that
1, the first electronic equipment carries out Image Acquisition to this end subscriber by its image acquisition units, obtains the first image letter Breath;For the first image information of each frame of acquisition, the image letter of at least one predetermined sub-region in the first image information is extracted Breath carries out image characteristic analysis and lip image recognition, obtains the lip image information identified;According to the lip figure identified As the scheduled Image Database of information inquiry, obtaining the lip reading information to match in image data base (is also each of expression lip Kind of image information) corresponding to text information;
2, the first electronic equipment knows that the second electronic equipment is in the third mode, thus by the first image of each frame of correspondence The text information of information is converted to corresponding voice messaging, and by the first image information of continuous each frame and its corresponding text Word and voice messaging are sent to the second electronic equipment;Alternatively, the first electronic equipment is without knowing mould locating for the second electronic equipment Formula, and the first image information of continuous each frame and its corresponding text information are directly sent to the second electronic equipment, the Two electronic equipments judge that itself is in the third mode, thus by the text information of the first image information of each frame of the correspondence received Be converted to corresponding voice messaging;
3, the second electronic equipment is in the third mode, shows that broadcast unit continuously shows the first figure of each frame by it As information, simultaneous display text information corresponding with the first image information of each frame, and be played simultaneously with each frame first The corresponding voice messaging of image information.
Embodiment six, seven, eight through the invention, in the case where the first electronic equipment, which is in, to acquire voice, the One electronic equipment still is able to turn over the lip reading expression of the user of the first electronic equipment by the lip reading identification technology to acquisition image It is translated into corresponding text and/or voice messaging is sent to the second electronic equipment, and the user of the second electronic equipment also being capable of basis Received text and/or voice messaging understand the semantic meaning representation of the user of the first electronic equipment, to guarantee video communication just Often carry out.
In addition, in a preferred embodiment, the first electronic equipment can recorde the time for generating text information, and Text information generated is locally saved as into text chat record according to the sequencing of time;Second electronic equipment can be remembered Record receives the time of text information, and the received text information of institute is locally saved as text chat according to the sequencing of time Record.
Second electronic equipment can also record to the content of voice broadcast, and record the time of recording, according to the time Sequencing each section of recording is ranked up, locally save as voice-enabled chat record.
In several embodiments provided by the present invention, it should be understood that disclosed method, apparatus and electronic equipment, It may be implemented in other ways.Apparatus embodiments described above are merely indicative, for example, the unit is drawn Point, only a kind of logical function partition, there may be another division manner in actual implementation, such as: multiple units or components can To combine, or it is desirably integrated into another system, or some features can be ignored or not executed.In addition, shown or discussed The mutual coupling of each component part or direct-coupling or communication connection can be through some interfaces, equipment or unit Indirect coupling or communication connection can be electrical, mechanical or other forms.
Above-mentioned unit as illustrated by the separation member, which can be or may not be, to be physically separated, aobvious as unit The component shown can be or may not be physical unit, it can and it is in one place, it may be distributed over multiple network lists In member;Some or all of units can be selected to achieve the purpose of the solution of this embodiment according to the actual needs.
In addition, each functional unit in various embodiments of the present invention can be fully integrated in one processing unit, it can also To be each unit individually as a unit, can also be integrated in one unit with two or more units;It is above-mentioned Integrated unit both can take the form of hardware realization, can also realize in the form of hardware adds SFU software functional unit.
Those of ordinary skill in the art will appreciate that: realize that all or part of the steps of above method embodiment can pass through The relevant hardware of program instruction is completed, and program above-mentioned can be stored in a computer readable storage medium, the program When being executed, step including the steps of the foregoing method embodiments is executed;And storage medium above-mentioned include: movable storage device, it is read-only Memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or The various media that can store program code such as person's CD.
If alternatively, the above-mentioned integrated unit of the embodiment of the present invention is realized in the form of software function module and as independence Product when selling or using, also can store in a computer readable storage medium.Based on this understanding, this hair Substantially the part that contributes to existing technology can body in the form of software products in other words for the technical solution of bright embodiment Reveal and, which is stored in a storage medium, including some instructions are with so that a computer is set Standby (can be personal computer, server or network equipment etc.) executes the whole of each embodiment the method for the present invention Or part.And storage medium above-mentioned includes: that movable storage device, ROM, RAM, magnetic or disk etc. are various can store journey The medium of sequence code.
The above description is merely a specific embodiment, but scope of protection of the present invention is not limited thereto, any Those familiar with the art in the technical scope disclosed by the present invention, can easily think of the change or the replacement, and should all contain Lid is within protection scope of the present invention.Therefore, protection scope of the present invention should be based on the protection scope of the described claims.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.

Claims (9)

1. a kind of information processing method is applied to the first electronic equipment, the electronic equipment includes image acquisition units, the side Method includes:
Image Acquisition is carried out by operating body of the described image acquisition unit to first electronic equipment, obtains the first image letter Breath;
The image information for extracting at least one predetermined sub-region in the first image information is analyzed, and the second image letter is obtained It ceases, includes the lip image information identified in second image information;
Inquire scheduled Image Database according to second image information, obtain in described image information bank with second figure As the third image information of information matches, and obtain the corresponding first information of the third image information;Wherein, described image is believed The mapping relations in library including third image information and the first information are ceased, the third image information includes corresponding to various lip readings Lip image information;
The first information and the first image information are sent to the second electronic equipment.
2. information processing method according to claim 1, which is characterized in that at least one in the first image information of the extraction The image information of predetermined sub-region is analyzed, and the second image information is obtained, comprising:
Sub-zone dividing is carried out to the first image information, and is extracted at least from the first image information for having divided subregion The image information of one predetermined sub-region;
Image characteristic analysis and lip image recognition carried out to extracted image information, and according to described image signature analysis and Lip image recognition as a result, generate the second image information.
3. information processing method according to claim 1, which is characterized in that the first information is the first letter of literal type The first information of breath and/or sound-type;
It is described the first information is sent to the second electronic equipment to include:
According to mode locating for second electronic equipment, if second electronic equipment is in first mode, Xiang Suoshu The first information of second electronic equipment transmission literal type;
If second electronic equipment is in second mode, the first letter of sound-type is sent to second electronic equipment Breath;
If second electronic equipment is in the third mode, literal type and voice class are sent to second electronic equipment The first information of type.
4. information processing method according to any one of the claim 1 to 3, which is characterized in that described by the first information and first Image information is sent to the second electronic equipment, comprising:
The first image information of continuous each frame and its corresponding first information are sent to second electronic equipment.
5. a kind of information processing method is applied to the second electronic equipment, which comprises
Receive the first image information and the first information corresponding with the first image information that the first electronic equipment is sent;
The first image information, and the mode according to locating for second electronic equipment are shown by display broadcast unit, together Step plays the first information of match-type;
Wherein, the first information is the second image letter extracted in the first image information by first electronic equipment Breath inquires scheduled Image Database according to second image information, obtain in described image information bank with second figure As the third image information of information matches, the obtained corresponding first information of the third image information;Wherein, described image is believed The mapping relations in library including third image information and the first information are ceased, the third image information includes corresponding to various lip readings Lip image information;It include the lip image information identified in second image information.
6. information processing method according to claim 5, which is characterized in that the mould according to locating for the second electronic equipment The first information of match-type is played simultaneously in formula, comprising:
If second electronic equipment is in first mode, the first information of synchronization display words type;
If second electronic equipment is in second mode, the first information of sound-type is played simultaneously;
If second electronic equipment is in the third mode, the first information of synchronization display words type, and synchronizes Play the first information of sound-type.
7. according to the information processing method of claim 5 or 6, which is characterized in that the first information for playing match-type, Include:
The first image information and the first information of continuous each frame based on the received, it is continuous to play and each frame The corresponding first information of the first image information.
8. a kind of first electronic equipment, comprising:
Image acquisition units carry out Image Acquisition for the operating body to first electronic equipment, obtain the first image information;
Information acquisition unit, the image information for extracting at least one predetermined sub-region in the first image information are divided Analysis obtains the second image information, includes the lip image information identified in second image;According to second image Information inquires scheduled Image Database, obtain in described image information bank with the matched third image of second image information Information, and obtain the corresponding first information of the third image information;It wherein, include that third image is believed in described image information bank The mapping relations of breath and the first information, the third image information includes lip image information corresponding to various lip readings;
Information transmitting unit, for the first information and the first image information to be sent to the second electronic equipment.
9. a kind of second electronic equipment, comprising:
Information receiving unit, for receiving the first image information of the first electronic equipment transmission and believing with the first image Cease the corresponding first information, wherein the first information is to extract the first image information by first electronic equipment In the second image information, scheduled Image Database is inquired according to second image information, obtains described image information bank In with the matched third image information of second image information, the obtained corresponding first information of the third image information; It wherein, include the mapping relations of third image information and the first information, the third image information packet in described image information bank Include lip image information corresponding to various lip readings;It include the lip image information identified in second image information;
Show broadcast unit, for showing the first image information, and the mode according to locating for second electronic equipment, together Step plays the first information of match-type.
CN201410283199.3A 2014-06-23 2014-06-23 A kind of information processing method and electronic equipment Active CN105338282B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410283199.3A CN105338282B (en) 2014-06-23 2014-06-23 A kind of information processing method and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410283199.3A CN105338282B (en) 2014-06-23 2014-06-23 A kind of information processing method and electronic equipment

Publications (2)

Publication Number Publication Date
CN105338282A CN105338282A (en) 2016-02-17
CN105338282B true CN105338282B (en) 2019-07-26

Family

ID=55288530

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410283199.3A Active CN105338282B (en) 2014-06-23 2014-06-23 A kind of information processing method and electronic equipment

Country Status (1)

Country Link
CN (1) CN105338282B (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107071328B (en) * 2016-12-16 2019-12-03 维沃移动通信有限公司 A kind of video calling processing method and mobile terminal
CN107087133B (en) * 2017-03-24 2020-07-03 宇龙计算机通信科技(深圳)有限公司 Safety control method and terminal equipment
CN110213431B (en) * 2019-04-30 2021-06-25 维沃移动通信有限公司 Message sending method and mobile terminal

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101510256A (en) * 2009-03-20 2009-08-19 深圳华为通信技术有限公司 Mouth shape language conversion method and device

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4761568B2 (en) * 2004-05-12 2011-08-31 貴司 吉峰 Conversation support device

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101510256A (en) * 2009-03-20 2009-08-19 深圳华为通信技术有限公司 Mouth shape language conversion method and device

Also Published As

Publication number Publication date
CN105338282A (en) 2016-02-17

Similar Documents

Publication Publication Date Title
US11568876B2 (en) Method and device for user registration, and electronic device
CN104992709B (en) A kind of the execution method and speech recognition apparatus of phonetic order
CN103680497B (en) Speech recognition system and method based on video
CN109254669B (en) Expression picture input method and device, electronic equipment and system
US20070188657A1 (en) Synchronizing method and system
CN107682249B (en) Method, device and equipment for counting personnel and joining group under group chat scene
CN106024009A (en) Audio processing method and device
CN109117233A (en) Method and apparatus for handling information
CN108847214A (en) Method of speech processing, client, device, terminal, server and storage medium
CN109036416B (en) Simultaneous interpretation method and system, storage medium and electronic device
CN110853615B (en) Data processing method, device and storage medium
CN111883168B (en) Voice processing method and device
CN104598502A (en) Method, device and system for obtaining background music information in played video
CN112653902B (en) Speaker recognition method and device and electronic equipment
JP6432177B2 (en) Interactive communication system, terminal device and program
CN107943914A (en) Voice information processing method and device
CN105338282B (en) A kind of information processing method and electronic equipment
CN107862071A (en) The method and apparatus for generating minutes
CN109509472A (en) Method, apparatus and system based on voice platform identification background music
CN110072140A (en) A kind of video information reminding method, device, equipment and storage medium
CN110347869B (en) Video generation method and device, electronic equipment and storage medium
CN106558311A (en) Voice content reminding method and device
CN107193922B (en) A kind of method and device of information processing
CN107196979A (en) Pre- system for prompting of calling out the numbers based on speech recognition
CN111583932A (en) Sound separation method, device and equipment based on human voice model

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant