CN105389318A - Information processing method and electronic equipment - Google Patents

Information processing method and electronic equipment Download PDF

Info

Publication number
CN105389318A
CN105389318A CN201410455557.4A CN201410455557A CN105389318A CN 105389318 A CN105389318 A CN 105389318A CN 201410455557 A CN201410455557 A CN 201410455557A CN 105389318 A CN105389318 A CN 105389318A
Authority
CN
China
Prior art keywords
information
video image
primary importance
user
image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201410455557.4A
Other languages
Chinese (zh)
Other versions
CN105389318B (en
Inventor
杨文建
洪世洋
许沐锌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lenovo Beijing Ltd
Original Assignee
Lenovo Beijing Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lenovo Beijing Ltd filed Critical Lenovo Beijing Ltd
Priority to CN201410455557.4A priority Critical patent/CN105389318B/en
Publication of CN105389318A publication Critical patent/CN105389318A/en
Application granted granted Critical
Publication of CN105389318B publication Critical patent/CN105389318B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Abstract

The invention discloses an information processing method, and aims to solve the technical problem of poor prompt effect of electronic equipment. The method comprises the following steps: acquiring image information within a predetermined range through at least one image acquiring unit, and determining first position information in a direction corresponding to first sound information when the existence of the first sound information is detected through one of the sound acquiring units; marking the first sound information according to the first position information, and determining a user corresponding to the first position information as a first user; acquiring a first video image corresponding to the first position information, and marking the first video image according to the first position information, wherein the first video image includes an image of the first user; and establishing a first association relationship between the first sound information and the first video image which are marked according the first position information. The invention also discloses corresponding electronic equipment.

Description

A kind of information processing method and electronic equipment
Technical field
The present invention relates to field of computer technology, particularly a kind of information processing method and electronic equipment.
Background technology
In daily work, some user needs to participate in a lot of teleconferences, when participating in teleconference, sometimes whole conference process is recorded, the entirety of 360 ° such as generally can be adopted to record mode, the all personnel participated in a conference under complete recording, so that the situation can watching meeting at that time later at any time.
When carrying out meeting, different users may serve as speaker in the different time periods, such as, when carrying out meeting, may have the speech of meeting presider, having the speech that some are led, and the speech of some employees etc.And the conference content recorded now, generally can only find out is that multiple people is participating in a conference, and can only hear the sound of speech wherein when someone makes a speech, for specifically who, in speech, may not be very good identification.Particularly distant view is recorded sometimes, and people seems smaller in video, at this moment, want to recognize spokesman just more difficult, and electronic equipment also cannot be pointed out to user.
Visible, the prompting effect of electronic equipment of the prior art is poor, if user wants to know that specifically who, in speech, may need the sound characteristic of very familiar each spokesman, or may need repeatedly to watch video just can find out, obviously bring inconvenience to user.
Summary of the invention
The embodiment of the present invention provides a kind of information processing method and electronic equipment, the technical matters that the prompting effect for solving electronic equipment is poor.
A kind of information processing method, is applied to electronic equipment, and described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Said method comprising the steps of:
Gathering the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determining the primary importance information in the corresponding orientation of described first acoustic information;
The first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user;
Obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
Optionally, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information, comprise: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
Optionally, obtain first video image corresponding with described primary importance information, comprising:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
Optionally, after described first acoustic information and described first video image for marked described primary importance information equally sets up the first incidence relation, also comprise:
If existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
If described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
For described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
Optionally, if after described second place information and described primary importance information are not same position information, also comprise:
For described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
Optionally, after existence first acoustic information being detected by one of them sound collection unit, also comprise: noise reduction process is carried out to described first acoustic information.
Optionally, after obtaining first video image corresponding with described primary importance information, also comprise:
The area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
Optionally, determining that user corresponding to described primary importance information is for after first user, also comprises: by facial recognition techniques, determines the first user information corresponding to described first user;
For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation, comprising: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
A kind of electronic equipment, described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Described electronic equipment also comprises:
First determination module, for being gathered the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information;
Second determination module, for the first acoustic information according to described primary importance information flag, and determines that user corresponding to described primary importance information is for first user;
Acquisition module, for obtaining first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
First sets up module, sets up the first incidence relation for described first acoustic information and described first video image for marked described primary importance information equally.
Optionally, described first determination module specifically for: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
Optionally, described acquisition module, for obtaining first video image corresponding with described primary importance information, is specially:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
Optionally, described first determination module also for: setting up module described first is marked after described first acoustic information of described primary importance information and described first video image set up described first incidence relation equally, if existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
Described second determination module also for: if described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Described acquisition module also for: obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
Described first set up module also for: for described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
Optionally, described electronic equipment also comprises second and sets up module, for: if after described second place information and described primary importance information are not same position information, for described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
Optionally, described electronic equipment also comprises processing module, for: after being detected by one of them sound collection unit and there is described first acoustic information, noise reduction process is carried out to described first acoustic information.
Optionally, described electronic equipment also comprises control module, after obtaining first video image corresponding with described primary importance information at described acquisition module, the area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
Optionally, described electronic equipment also comprises identification module, for: determine that user corresponding to described primary importance information is for after first user, by facial recognition techniques, determines the first user information corresponding to described first user at described second determination module;
Described first set up module specifically for: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
In the embodiment of the present invention, when existence the first acoustic information being detected, the first video image corresponding with this first acoustic information in orientation can be determined, and according to same positional information described first acoustic information of mark and described first video image, thus described first incidence relation can be set up for described first acoustic information and described first video image, like this, when user is when checking, as long as find described first acoustic information, just can see described first video image according to described first incidence relation simultaneously, or, as long as find described first video image, just can listen to described first acoustic information according to described first incidence relation simultaneously.Such as, when carrying out meeting, different users may serve as speaker in the different time periods, if be user A the first period speaker, then when hearing the sound of user A, the video image seeing user A that just can be corresponding according to incidence relation, like this, can determine it is who is in speech on earth easily, enhance the prompting effect of electronic equipment, repeatedly check without the need to user or try to figure out, saving the running time, improving Consumer's Experience.
Accompanying drawing explanation
Fig. 1 is the main flow figure of information processing method in the embodiment of the present invention;
Fig. 2 is the primary structure block diagram of electronic equipment in the embodiment of the present invention.
Embodiment
The embodiment of the present invention provides a kind of information processing method, is applied to electronic equipment, and described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Described method comprises: gather the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information; The first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user; Obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image; For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
In the embodiment of the present invention, when existence the first acoustic information being detected, the first video image corresponding with this first acoustic information in orientation can be determined, and according to same positional information described first acoustic information of mark and described first video image, thus described first incidence relation can be set up for described first acoustic information and described first video image, like this, when user is when checking, as long as find described first acoustic information, just can see described first video image according to described first incidence relation simultaneously, or, as long as find described first video image, just can listen to described first acoustic information according to described first incidence relation simultaneously.Such as, when carrying out meeting, different users may serve as speaker in the different time periods, if be user A the first period speaker, then when hearing the sound of user A, the video image seeing user A that just can be corresponding according to incidence relation, like this, can determine it is who is in speech on earth easily, enhance the prompting effect of electronic equipment, repeatedly check without the need to user or try to figure out, saving the running time, improving Consumer's Experience.
For making the object of the embodiment of the present invention, technical scheme and advantage clearly, below in conjunction with the accompanying drawing in the embodiment of the present invention, technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is the present invention's part embodiment, instead of whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art, not making the every other embodiment obtained under creative work prerequisite, belong to the scope of protection of the invention.
In the embodiment of the present invention, described electronic equipment can be such as mobile phone, PAD (panel computer), PC (personal computer), notebook, video camera, intelligent television, or can be such as special recording arrangement, etc., the present invention does not limit.
In addition, term "and/or" herein, being only a kind of incidence relation describing affiliated partner, can there are three kinds of relations in expression, and such as, A and/or B, can represent: individualism A, exists A and B simultaneously, these three kinds of situations of individualism B.In addition, character "/" herein, if no special instructions, general expression forward-backward correlation is to the relation liking a kind of "or".
Below in conjunction with accompanying drawing, the preferred embodiment of the present invention is described in detail.
Refer to Fig. 1, the embodiment of the present invention provides a kind of information processing method, and described method can be applied to electronic equipment, and described electronic equipment can comprise at least one image acquisition units and at least one sound collection unit.The main flow of described method is described below:
Step 101: gather the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determines the primary importance information in the corresponding orientation of described first acoustic information.
In the embodiment of the present invention, image acquisition units can be such as camera, and sound collection unit can be such as Mike.
In the embodiment of the present invention, described electronic equipment can be in the recording carrying out multimedia file, can both comprise video and also comprise audio frequency in described multimedia file.Such as scene is: holding a group session, and described electronic equipment carries out whole process to meeting and records.Then when recording, be that at least one sound collection unit described and at least one image acquisition units described carry out work all at the same time.
Why described electronic equipment can comprise at least one image acquisition units described and at least one sound collection unit described, to record multimedia file better, different image acquisition units can gather the image of different azimuth, and different sound collection unit also can gather the sound of different azimuth.Be such as in recorded meeting, then can ensure it is carrying out 360 ° of omnibearing recordings as far as possible, in everyone attending a meeting being recorded in, avoid drain message.
Described preset range such as can refer to the spatial dimension corresponding to video in the multimedia file of recording.When gathering the image information in described preset range by least one image acquisition units described, if collected acoustic information by one of them sound collection unit, this acoustic information is called described first acoustic information, then can determine the information in the orientation corresponding to described first acoustic information, this information is called described primary importance information, that is, described primary importance information refers to the position at the sound source place of described first acoustic information.
Optionally, in the embodiment of the present invention, determine the described primary importance information that described first acoustic information is corresponding, some general auditory localization algorithms in prior art can be adopted, because these algorithms all comparative maturity, just seldom repeat herein.
Optionally, in the embodiment of the present invention, determine the described primary importance information that described first acoustic information is corresponding except adopting general auditory localization algorithm, can also additive method be adopted.Concrete, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information, can comprise: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
That is, in the embodiment of the present invention, when collecting described first acoustic information, the positional information corresponding to sound collection unit gathering described first acoustic information can be determined, can using this positional information directly as described primary importance information.
In the embodiment of the present invention, when arranging at least one image acquisition units described and at least one sound collection unit described, such as, can make the corresponding orientation of each image acquisition units, and make the corresponding orientation of each sound collection unit.Preferably, each sound collection unit correspondence can be made to gather the sound of a user as far as possible, or all corresponding sound gathering a user of multiple sound collection unit can be made, positioning result can be made so more accurate.Same, each image acquisition units correspondence also can be made as far as possible to gather the image of a user, or all corresponding image gathering a user of multiple image acquisition units can be made, positioning result can be made so more accurate.
Optionally, in the embodiment of the present invention, after being detected by one of them sound collection unit and there is described first acoustic information, can also comprise: noise reduction process is carried out to described first acoustic information.
This noise reduction process can be carried throughout, that is, until described first acoustic information gathers complete, can carry out noise reduction process to described first acoustic information before always.
For the concrete processing mode of noise reduction, had a lot in prior art, such as Noise Elimination from Wavelet Transform etc., the present invention is not restricted this.
Step 102: the first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user.
After determining described primary importance information, can mark for described first acoustic information, the mode of marking can be such as carry out marking according to described primary importance information, be the upper described primary importance information of described first acoustic information mark, represent the primary importance of the position at the sound source place of described first acoustic information corresponding to described primary importance information.
Further, after determining described primary importance information, naturally also can determine the user that described primary importance information is corresponding, such as, this user is called described first user.
Certainly, in this case, be the corresponding user of acquiescence sound collection unit, or the corresponding user of multiple sound collection unit, and a sound collection unit is only with user's correspondence.
Step 103: obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image.
After determining described primary importance information, described first video image corresponding with described primary importance information can be obtained by least one image acquisition units described, because user corresponding to described primary importance information is described first user, therefore, the image comprising described first user is needed in described first video image.
After described first video image of acquisition, described first video image can be similarly mark, the mode of marking continues to be carry out marking according to described primary importance information, be described primary importance information on described first Video Image Marker, represent the primary importance of position corresponding to described primary importance information corresponding to described first video image.
Like this, just marked described primary importance information for described first acoustic information and described first video image simultaneously, this also just shows that the sound corresponding to described primary importance is described first acoustic information, and the video corresponding to described primary importance is described first video image.
Certainly, because the restriction of hardware condition, in some electronic equipments, the quantity of possible image acquisition units does not have the quantity of sound collection unit many, in this case, also likely an image acquisition units needs the corresponding image gathering multiple orientation, and namely an image acquisition units may need the corresponding image gathering multiple user specifically.
Therefore, optionally, in the embodiment of the present invention, obtain first video image corresponding with described primary importance information, can comprise: according to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit; In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
Described first video image is gathered by an image acquisition units at least one image acquisition units described, therefore, if the quantity of image acquisition units is not less than 2, obtain described first video image, first will determine this image acquisition units from multiple image acquisition units.
Such as, be previously stored with the position corresponding relation between sound collection unit and image acquisition units in described electronic equipment, in this position corresponding relation, it can be such as the corresponding image acquisition units of a sound collection unit, i.e. sound collection unit and image acquisition units one_to_one corresponding, this is a kind of more satisfactory corresponding relation, or, if the quantity of image acquisition units is less than the quantity of sound collection unit, so in this position corresponding relation, can be the corresponding multiple sound collection unit of an image acquisition units, for example, see table 1:
Table 1
As can be seen from Table 1, if the quantity of image acquisition units is less than the quantity of sound collection unit, the image acquisition units then had can corresponding multiple sound collection unit, some image acquisition units can a corresponding sound collection unit, the corresponding several sound collection unit of a concrete image acquisition units, relevant to the position of image acquisition units and sound collection unit, the present invention is not restricted.
Such as, for table 1.What obtain described first acoustic information if determine is sound collection unit 4, then can determine that the image acquisition units corresponding with sound collection unit 4 is image acquisition units 2 according to table 1.Simultaneously, also can see from table 1, the sound collection unit of image acquisition units 2 correspondence has 3, that is, image acquisition units 2 may gather the image of 3 users simultaneously, and these 3 users are respectively: the user 2 corresponding with sound collection unit 2, the user 3 corresponding with sound collection unit 3 and the user 4 corresponding with sound collection unit.And, define corresponding user according to described first acoustic information, be the user 4 corresponding with sound collection unit 4, therefore, need the image intercepting out user 4 image of each user gathered from image acquisition units 2.Because the position corresponding with described first acoustic information, the position corresponding to user 4 is identical, be described primary importance, the area image that then image of user 4 is namely corresponding with described primary importance information, after intercepting out described area image, just can using described area image as described first video image.
Certainly, after determining image acquisition units according to sound collection unit, if this image acquisition units correspondence acquires the image of multiple user, so, also can from determining described image acquisition units, make described image acquisition units only gather the image of single user, this user is described first user, and the image of the described first user described image acquisition units gathered is as described first video image.
That is, obtaining described first video image can have two kinds of modes, and a kind of is the image intercepting out corresponding single user the image of the multiple users gathered from image acquisition units, and this mode can retain the image of multi-user, for other purposes.Another kind is the image making image acquisition units only take corresponding single user, the user images that this mode gathers comparatively complete display, is convenient to user's viewing.
Optionally, in the embodiment of the present invention, after obtaining described first video image corresponding with described primary importance information, can also comprise: the area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
That is, after described first video image of acquisition, described first video image can be carried out amplification display, be convenient to user's viewing.
Certainly, after described first video image of acquisition, the display position of described first video image can also be adjusted, namely, after obtaining described first video image corresponding with described primary importance information, can also comprise: the 4th position is changed into by the 3rd position in the position controlling described first video image viewing area occupied on the display unit of described electronic equipment.Such as described first video image is presented at the marginal position of described display unit originally, and the center can being adjusted to described display unit shows, and is convenient to user's viewing.
Step 104: for described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
For described first acoustic information and described first video image all marked described primary importance information, then can set up described first incidence relation for described first acoustic information and described first video image.Like this, when user just when determining described first acoustic information can determine according to described first incidence relation and check described first video image, same, when user just when determining described first video image can determine according to described first incidence relation and listen to described first acoustic information.Such as if a user is watching the meeting of recording, when speaker A makes a speech, user just can determine according to the acoustic information of speaker A the video image that there is incidence relation with this acoustic information, the i.e. video image of speaker A, thus that can know speech soon is speaker A, without the need to searching by repeating the modes such as viewing video again, save query time, improve operating efficiency, also improve Consumer's Experience.
Further, in the embodiment of the present invention, after described first acoustic information and described first video image for marked described primary importance information equally sets up described first incidence relation, can also comprise:
If existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
If described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
For described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
Namely, after described first acoustic information and described first video image set up described first incidence relation, record also in continuation, if detect again to there is described second acoustic information by one of them sound collection unit, then can determine the described second place information that described second acoustic information is corresponding.Determine the mode of described second place information and determine that the mode of described primary importance information can be similar, seldom repeating.
After determining described second place information, can judge whether described second place information and described primary importance information are same position information, if described second place information and described primary importance information are same position information, so can determine that described second acoustic information is a part for described first acoustic information, can continue to gather.
If described second place information and described primary importance information are not same position information, so can according to described second place information flag the second acoustic information, and determine the user that described second place information is corresponding, this user is called described second user.Afterwards, obtain described second video image, described second video image comprises the image of described second user, and be similarly second place information described in described second Video Image Marker, and, for described second video image and described second acoustic information set up described second incidence relation, to enable user determine described second video image according to described second acoustic information, or user is enable to determine described second acoustic information according to described second video image.So just can be corresponding determine different speaker and corresponding video image, be convenient to user and check.
That is, in recording process, until when positional information corresponding to acoustic information is different, the acoustic information recorded is an entirety before, before corresponding video image is also an entirety, can unify to set up an incidence relation.When positional information corresponding to acoustic information is different, be associated relation again, the step not only but also before starting repetition.
Optionally, in the embodiment of the present invention, if after described second place information and described primary importance information are not same position information, can also comprise: for described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
Storing described first acoustic information and described first video image according to common storage tags, together with described first acoustic information can being stored into described first video image like this, also just can check together when checking.
In the embodiment of the present invention, when storing described first acoustic information and described first video image, described first acoustic information and described first video image can be taken out independent storage, be described first acoustic information and described first video image sets up described storage tags, store according to described storage tags.This storage mode is more clear, facilitates user to search at any time.
Or, when storing described first acoustic information and described first video image, also can not store separately, but still described first acoustic information and described first video image are carried out overall storage as a part for the multimedia file recorded, but in the multimedia file stored, there is described first incidence relation in described first acoustic information and described first video image, as long as so user can determine that described first acoustic information just can determine described first video image, as long as or user can determine that described first video image just can determine described first acoustic information.This storage mode, without the need to storing separately multiple information, comparatively saves storage space, also makes storage space more orderly.
Optionally, in the embodiment of the present invention, determining that user corresponding to described primary importance information is for after first user, can also comprise: by facial recognition techniques, determine the first user information corresponding to described first user.So, in this case, for described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation, can comprise: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
Namely, after determining described first user, in order to be convenient to identify described first user better, the described first user information can determining corresponding to described first user by facial recognition techniques, it is one or more that described first user information such as can comprise in the name, age, department, position, telephone number, mailbox etc. of described first user.So natural, when setting up described first incidence relation, namely set up described first incidence relation for described first acoustic information, described first video image and described first user information.
Such as, user has found described first acoustic information, when playing described first acoustic information, described electronic equipment can determine described first video information according to described first incidence relation automatically, and play simultaneously, and when playing described first video information, described display unit will demonstrate described first user information, thus user can understand some information of speaker easily, more may contribute to user and understand the content that speaker says, and be convenient to user and speaker contacts etc.
Refer to Fig. 2, based on same inventive concept, the embodiment of the present invention provides a kind of electronic equipment, described electronic equipment can comprise at least one image acquisition units and at least one sound collection unit, and described electronic equipment can also comprise the first determination module 201, second determination module 202, acquisition module 203 and first sets up module 204.
First determination module 201 is for gathering the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information;
Second determination module 202 for the first acoustic information according to described primary importance information flag, and determines that user corresponding to described primary importance information is for first user;
Acquisition module 203 is for obtaining first video image corresponding with described primary importance information, and described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
First sets up module 204 sets up the first incidence relation for described first acoustic information and described first video image for marked described primary importance information equally.
Optionally, in the embodiment of the present invention, first determination module 201 specifically for: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
Optionally, in the embodiment of the present invention, acquisition module 203, for obtaining described first video image corresponding with described primary importance information, is specially:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
Optionally, in the embodiment of the present invention, first determination module 201 also for: setting up module 204 first is marked after described first acoustic information of described primary importance information and described first video image set up described first incidence relation equally, if existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
Second determination module 202 also for: if described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Acquisition module 203 also for: obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
First set up module 204 also for: for described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
Optionally, in the embodiment of the present invention, described electronic equipment also comprises second and sets up module, and described second sets up module sets up module 204 be all connected with the first determination module 201, second determination module 202, acquisition module 203 and first.Described second set up module for: if after described second place information and described primary importance information are not same position information, for described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
Optionally, in the embodiment of the present invention, described electronic equipment also comprises processing module, and described processing module and the first determination module 201, second determination module 202, acquisition module 203, first are set up module 204 and described second and set up module and be all connected.Described processing module is used for: after being detected by one of them sound collection unit and there is described first acoustic information, carry out noise reduction process to described first acoustic information.
Optionally, in the embodiment of the present invention, described electronic equipment also comprises control module, and described control module and the first determination module 201, second determination module 202, acquisition module 203, first are set up module 204, described second and set up module and be all connected with described processing module.Described control module is used for: after acquisition module 203 obtains first video image corresponding with described primary importance information, the area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
Optionally, in the embodiment of the present invention, described electronic equipment also comprises identification module, and described identification module is with the first determination module 201, second determination module 202, acquisition module 203, first sets up module 204, described second sets up module, described processing module is all connected with described control module.Described identification module is used for: determine that user corresponding to described primary importance information is for after first user, by facial recognition techniques, determines the first user information corresponding to described first user at the second determination module 202; First set up module 204 specifically for: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
The embodiment of the present invention provides a kind of information processing method, is applied to electronic equipment, and described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Described method comprises: gather the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information; The first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user; Obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image; For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
In the embodiment of the present invention, when existence the first acoustic information being detected, the first video image corresponding with this first acoustic information in orientation can be determined, and according to same positional information described first acoustic information of mark and described first video image, thus described first incidence relation can be set up for described first acoustic information and described first video image, like this, when user is when checking, as long as find described first acoustic information, just can see described first video image according to described first incidence relation simultaneously, or, as long as find described first video image, just can listen to described first acoustic information according to described first incidence relation simultaneously.Such as, when carrying out meeting, different users may serve as speaker in the different time periods, if be user A the first period speaker, then when hearing the sound of user A, the video image seeing user A that just can be corresponding according to incidence relation, like this, can determine it is who is in speech on earth easily, enhance the prompting effect of electronic equipment, repeatedly check without the need to user or try to figure out, saving the running time, improving Consumer's Experience.
Those skilled in the art can be well understood to, for convenience and simplicity of description, only be illustrated with the division of above-mentioned each functional module, in practical application, can distribute as required and by above-mentioned functions and be completed by different functional modules, inner structure by device is divided into different functional modules, to complete all or part of function described above.The system of foregoing description, the specific works process of device and unit, with reference to the corresponding process in preceding method embodiment, can not repeat them here.
In several embodiments that the application provides, should be understood that, disclosed system, apparatus and method, can realize by another way.Such as, device embodiment described above is only schematic, such as, the division of described module or unit, be only a kind of logic function to divide, actual can have other dividing mode when realizing, such as multiple unit or assembly can in conjunction with or another system can be integrated into, or some features can be ignored, or do not perform.Another point, shown or discussed coupling each other or direct-coupling or communication connection can be by some interfaces, and the indirect coupling of device or unit or communication connection can be electrical, machinery or other form.
The described unit illustrated as separating component or can may not be and physically separates, and the parts as unit display can be or may not be physical location, namely can be positioned at a place, or also can be distributed in multiple network element.Some or all of unit wherein can be selected according to the actual needs to realize the object of the present embodiment scheme.
In addition, each functional unit in each embodiment of the application can be integrated in a processing unit, also can be that the independent physics of unit exists, also can two or more unit in a unit integrated.Above-mentioned integrated unit both can adopt the form of hardware to realize, and the form of SFU software functional unit also can be adopted to realize.
If described integrated unit using the form of SFU software functional unit realize and as independently production marketing or use time, can be stored in a computer read/write memory medium.Based on such understanding, the part that the technical scheme of the application contributes to prior art in essence in other words or all or part of of this technical scheme can embody with the form of software product, this computer software product is stored in a storage medium, comprising some instructions in order to make a computer equipment (can be personal computer, server, or the network equipment etc.) or processor (processor) perform all or part of step of method described in each embodiment of the application.And aforesaid storage medium comprises: USB flash disk, portable hard drive, ROM (read-only memory) (Read-OnlyMemory, ROM), random access memory (RandomAccessMemory, RAM), magnetic disc or CD etc. various can be program code stored medium.
Specifically, the computer program instructions that a kind of information processing method in the embodiment of the present application is corresponding can be stored in CD, hard disk, on the storage mediums such as USB flash disk, when the computer program instructions corresponding with a kind of information processing method in storage medium is read by an electronic equipment or be performed, comprise the steps:
Gathering the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determining the primary importance information in the corresponding orientation of described first acoustic information;
The first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user;
Obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
Optionally, that store in described storage medium and step: when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information, corresponding computer instruction, in the process be performed, specifically comprises:
When detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
Optionally, store in described storage medium with step: obtain first video image corresponding with described primary importance information, corresponding computer instruction, in the process be performed, specifically comprises:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
Optionally, store in described storage medium with step: described first acoustic information that marked described primary importance information for same and described first video image set up the first incidence relation, and corresponding computer instruction, after being specifically performed, also comprises:
If existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
If described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
For described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
Optionally, store in described storage medium and step: if described second place information and described primary importance information are not same position information, corresponding computer instruction after being specifically performed in, also comprise:
For described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
Optionally, that store in described storage medium and step: really existence first acoustic information detected by one of them sound collection unit, corresponding computer instruction, after being specifically performed, also comprises:
Noise reduction process is carried out to described first acoustic information.
Optionally, store in described storage medium with step: obtain first video image corresponding with described primary importance information, corresponding computer instruction, after being specifically performed, also comprises:
The area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
Optionally, store in described storage medium with step: determine that user corresponding to described primary importance information is for first user, corresponding computer instruction, after being specifically performed, also comprising: by facial recognition techniques, determines the first user information corresponding to described first user; Optionally, store in described storage medium with step: described first acoustic information that marked described primary importance information for same and described first video image set up the first incidence relation, corresponding computer instruction, in the process be specifically performed, comprising: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
The above, above embodiment is only in order to be described in detail the technical scheme of the application, but the explanation of above embodiment just understands method of the present invention and core concept thereof for helping, and should not be construed as limitation of the present invention.Those skilled in the art are in the technical scope that the present invention discloses, and the change that can expect easily or replacement, all should be encompassed within protection scope of the present invention.

Claims (16)

1. an information processing method, is applied to electronic equipment, and described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Said method comprising the steps of:
Gathering the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determining the primary importance information in the corresponding orientation of described first acoustic information;
The first acoustic information according to described primary importance information flag, and determine that user corresponding to described primary importance information is for first user;
Obtain first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation.
2. the method for claim 1, it is characterized in that, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information, comprise: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
3. method as claimed in claim 2, is characterized in that, obtain first video image corresponding with described primary importance information, comprising:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
4. the method for claim 1, is characterized in that, after described first acoustic information and described first video image for marked described primary importance information equally sets up the first incidence relation, also comprises:
If existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
If described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
For described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
5. method as claimed in claim 4, is characterized in that, if after described second place information and described primary importance information are not same position information, also comprise:
For described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
6. method as claimed in claim 5, is characterized in that, after existence first acoustic information being detected by one of them sound collection unit, also comprise: carry out noise reduction process to described first acoustic information.
7. the method as described in as arbitrary in claim 1-6, is characterized in that, after obtaining first video image corresponding with described primary importance information, also comprises:
The area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
8. the method as described in as arbitrary in claim 1-6, is characterized in that, determining that user corresponding to described primary importance information is for after first user, also comprises: by facial recognition techniques, determines the first user information corresponding to described first user;
For described first acoustic information and described first video image that marked described primary importance information equally set up the first incidence relation, comprising: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
9. an electronic equipment, described electronic equipment comprises at least one image acquisition units and at least one sound collection unit; Described electronic equipment also comprises:
First determination module, for being gathered the image information in preset range by least one image acquisition units described, when existence the first acoustic information being detected by one of them sound collection unit, determine the primary importance information in the corresponding orientation of described first acoustic information;
Second determination module, for the first acoustic information according to described primary importance information flag, and determines that user corresponding to described primary importance information is for first user;
Acquisition module, for obtaining first video image corresponding with described primary importance information, described first video image comprises the image of described first user, and according to described primary importance information flag the first video image;
First sets up module, sets up the first incidence relation for described first acoustic information and described first video image for marked described primary importance information equally.
10. electronic equipment as claimed in claim 9, it is characterized in that, described first determination module specifically for: when detected by one of them sound collection unit there is described first acoustic information time, determine the positional information in the orientation corresponding to described sound collection unit, and using the positional information determined as described primary importance information.
11. electronic equipments as claimed in claim 10, is characterized in that, described acquisition module, for obtaining first video image corresponding with described primary importance information, is specially:
According to the position corresponding relation between sound collection unit and image acquisition units, determine the image acquisition units corresponding with described sound collection unit;
In the image that described image acquisition units gathers, choose the area image corresponding with described primary importance information, and using described area image as described first video image.
12. electronic equipments as claimed in claim 9, it is characterized in that, described first determination module also for: setting up module described first is marked after described first acoustic information of described primary importance information and described first video image set up described first incidence relation equally, if existence second acoustic information detected by one of them sound collection unit, determine the second place information in the corresponding orientation of described second acoustic information;
Described second determination module also for: if described second place information and described primary importance information are not same position information, then the second acoustic information according to described second place information flag, and determine that user corresponding to described second place information is the second user;
Described acquisition module also for: obtain second video image corresponding with described second place information, described second video image comprises the image of described second user, and according to described second video image of described second place information flag mark;
Described first set up module also for: for described second acoustic information and described second video image that marked described second place information equally set up the second incidence relation.
13. electronic equipments as claimed in claim 12, it is characterized in that, described electronic equipment also comprises second and sets up module, for: if after described second place information and described primary importance information are not same position information, for described first acoustic information and described first video image that there is described first incidence relation set up common storage tags, store described first acoustic information and described first video image according to the storage tags set up.
14. electronic equipments as claimed in claim 13, it is characterized in that, described electronic equipment also comprises processing module, for: after being detected by one of them sound collection unit and there is described first acoustic information, noise reduction process is carried out to described first acoustic information.
15. as arbitrary in claim 9-14 as described in electronic equipment, it is characterized in that, described electronic equipment also comprises control module, after obtaining first video image corresponding with described primary importance information at described acquisition module, the area controlling described first video image viewing area occupied on the display unit of described electronic equipment changes into second area by the first area, and described first area is less than described second area.
16. as arbitrary in claim 9-14 as described in electronic equipment, it is characterized in that, described electronic equipment also comprises identification module, for: determine that user corresponding to described primary importance information is for after first user at described second determination module, by facial recognition techniques, determine the first user information corresponding to described first user;
Described first set up module specifically for: for described first acoustic information, described first video image and described first user information set up described first incidence relation.
CN201410455557.4A 2014-09-09 2014-09-09 A kind of information processing method and electronic equipment Active CN105389318B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410455557.4A CN105389318B (en) 2014-09-09 2014-09-09 A kind of information processing method and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410455557.4A CN105389318B (en) 2014-09-09 2014-09-09 A kind of information processing method and electronic equipment

Publications (2)

Publication Number Publication Date
CN105389318A true CN105389318A (en) 2016-03-09
CN105389318B CN105389318B (en) 2019-09-24

Family

ID=55421615

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410455557.4A Active CN105389318B (en) 2014-09-09 2014-09-09 A kind of information processing method and electronic equipment

Country Status (1)

Country Link
CN (1) CN105389318B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107995543A (en) * 2017-12-27 2018-05-04 广东小天才科技有限公司 A kind of method for controlling microphone apparatus to close and microphone apparatus
CN108376058A (en) * 2018-02-09 2018-08-07 斑马网络技术有限公司 Sound control method and device and electronic equipment and storage medium
CN112153461A (en) * 2020-09-25 2020-12-29 北京百度网讯科技有限公司 Method and device for positioning sound production object, electronic equipment and readable storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002271741A (en) * 2001-03-13 2002-09-20 Matsushita Electric Ind Co Ltd Video sound contents compiling apparatus and method for imparting index to video sound contents
KR20090107712A (en) * 2008-04-10 2009-10-14 (주)엠앤소프트 Korean data searching method and system using the double indexing
CN201426153Y (en) * 2009-05-27 2010-03-17 中山佳时光电科技有限公司 Intelligent camera control system for video conference
CN101771814A (en) * 2009-12-29 2010-07-07 天津市亚安科技电子有限公司 Pan and tilt camera with sound identification and positioning function
CN103986901A (en) * 2013-02-08 2014-08-13 中兴通讯股份有限公司 Method for obtaining needed video streams in video conference and corresponding devices

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2002271741A (en) * 2001-03-13 2002-09-20 Matsushita Electric Ind Co Ltd Video sound contents compiling apparatus and method for imparting index to video sound contents
KR20090107712A (en) * 2008-04-10 2009-10-14 (주)엠앤소프트 Korean data searching method and system using the double indexing
CN201426153Y (en) * 2009-05-27 2010-03-17 中山佳时光电科技有限公司 Intelligent camera control system for video conference
CN101771814A (en) * 2009-12-29 2010-07-07 天津市亚安科技电子有限公司 Pan and tilt camera with sound identification and positioning function
CN103986901A (en) * 2013-02-08 2014-08-13 中兴通讯股份有限公司 Method for obtaining needed video streams in video conference and corresponding devices

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107995543A (en) * 2017-12-27 2018-05-04 广东小天才科技有限公司 A kind of method for controlling microphone apparatus to close and microphone apparatus
CN107995543B (en) * 2017-12-27 2019-12-03 广东小天才科技有限公司 A kind of method and microphone apparatus of the closing of control microphone apparatus
CN108376058A (en) * 2018-02-09 2018-08-07 斑马网络技术有限公司 Sound control method and device and electronic equipment and storage medium
CN112153461A (en) * 2020-09-25 2020-12-29 北京百度网讯科技有限公司 Method and device for positioning sound production object, electronic equipment and readable storage medium

Also Published As

Publication number Publication date
CN105389318B (en) 2019-09-24

Similar Documents

Publication Publication Date Title
CN103327181B (en) Voice chatting method capable of improving efficiency of voice information learning for users
CN103905474B (en) A kind of information sharing method, terminal, server and system
US7433327B2 (en) Method and system for coordinating communication devices to create an enhanced representation of an ongoing event
US9064160B2 (en) Meeting room participant recogniser
CN104270806B (en) Transmission power adjustment method and device
CN105975241A (en) Volume regulation method and device
CN105657537A (en) Video editing method and device
US9549295B2 (en) System and method for broadcasting audio tweets
US20130326575A1 (en) Social Media Driven Generation of a Highlight Clip from a Media Content Stream
CN105100521A (en) Method and server for realizing ordered speech in teleconference
CN105376515A (en) Method, apparatus and system for presenting communication information in video communication
CN102081501A (en) Method and device for providing shortcut operation application programs for user and mobile terminal
CN104335591A (en) System for adaptive delivery of context-based media
CN106302997A (en) A kind of output control method, electronic equipment and system
JP2019536070A (en) User positioning method, information push method, and related apparatus
CN103902040A (en) Processing device and method for mobile terminal and electronic device
CN105323243A (en) Method and device for secure voice communication based on instant messaging
CN105389318A (en) Information processing method and electronic equipment
CN105933784A (en) Bullet screen play and conversion method, bullet screen player, server, and play system
CN103702222A (en) Interactive information generation method and video file playing method for mobile terminal
CN105959823A (en) Message presentation method and device for video direct broadcast application
CN105992029A (en) Wallpaper recommendation method and system, server, and mobile terminal
CN109257498B (en) Sound processing method and mobile terminal
CN105554386A (en) Mobile terminal and camera shooting control method thereof
CN108320761A (en) Audio recording method, intelligent sound pick-up outfit and computer readable storage medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant