CN106792341A - Audio output method and device and terminal equipment - Google Patents

Audio output method and device and terminal equipment Download PDF

Info

Publication number
CN106792341A
CN106792341A CN201611056298.3A CN201611056298A CN106792341A CN 106792341 A CN106792341 A CN 106792341A CN 201611056298 A CN201611056298 A CN 201611056298A CN 106792341 A CN106792341 A CN 106792341A
Authority
CN
China
Prior art keywords
user
orientation
image
loudspeaker
collection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201611056298.3A
Other languages
Chinese (zh)
Inventor
汤中良
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201611056298.3A priority Critical patent/CN106792341A/en
Publication of CN106792341A publication Critical patent/CN106792341A/en
Pending legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/403Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers loud-speakers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/10Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Otolaryngology (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The embodiment of the invention discloses an audio output method, an audio output device and terminal equipment. The method comprises the following steps: when the loudspeaker is detected to be in an audio output state, determining the direction of a user; and controlling the loudspeaker to output audio to the direction of the user. The embodiment of the invention solves the technical problem that the directional loudspeaker cannot automatically identify the direction of the user by determining the direction of the user and outputting the audio to the direction of the user, and realizes the technical effects of automatically identifying the direction of the user and outputting the audio to the direction in a directional manner.

Description

A kind of audio-frequency inputting method, device and terminal device
Technical field
The present embodiments relate to intelligent terminal technical field, more particularly to a kind of audio-frequency inputting method, device and terminal Equipment.
Background technology
With the fast development of intelligent terminal, intelligent terminal (for example, smart mobile phone and Intelligent worn device etc.) is wide It is general to be applied to people's work, the every field of life.
Loudspeaker is equipped with current intelligent terminal, speaker sound output function is supported.And ventional loudspeakers send Sound to all the winds propagate, in order to reduce the interference to surrounding population, occur in that a kind of with ventional loudspeakers work The different directional loudspeaker of principle, first directional loudspeaker by low frequency sound signals be loaded in the very strong high-frequency signal of directive property it On, then by amplifying, being transmitted into air, then, air can be high-frequency signal rapid filtration, and audible signal thereon is just Meeting nature is leached, and realizes the direction propagation as laser.
But, once existing directional loudspeaker or the intelligent terminal equipped with directional loudspeaker, its position are right after fixed The direction of the loudspeaker output sound answered is fixed.Under many scenes, for example, user is back to loudspeaker sound propagation side Xiang Shi, the sound of above-mentioned output can not well be received by user.
The content of the invention
The present invention provides a kind of audio-frequency inputting method, device and terminal device, to realize automatic identification audio output direction, Sound is exported towards user direction.
In a first aspect, the embodiment of the invention provides a kind of audio-frequency inputting method, the method includes:
When loudspeaker is detected in audio output state, orientation where user is determined;
The loudspeaker is controlled to output audio in orientation where the user.
Further, orientation where determining user includes:
Carry out IMAQ to the space where the loudspeaker, and image to gathering carries out image recognition;
If including characteristics of human body's information in the image of the collection, the image according to the collection determines characteristics of human body's Orientation, using the orientation of the characteristics of human body as orientation where user.
Further, orientation where determining user includes:
IMAQ is carried out to the space where the loudspeaker using rotating camera, and in rotating camera rotation The image of Real time identification collection during turning;
If comprising characteristics of human body's information in recognizing the image of collection, controlling the rotating camera to stop the rotation, will The orientation of the rotating camera direction is used as orientation where user when stopping the rotation.
Further, orientation where determining user includes:
IMAQ, and the image that will be gathered and the advance user for gathering are carried out to the space where the loudspeaker Image matched;
If the match is successful, the image according to the collection determines the orientation of the user.
Further, orientation where determining user includes:
IMAQ is carried out to the space where the loudspeaker, if comprising multiple users' in recognizing the image of collection During characteristics of human body's information, then the distance between the loudspeaker and each user are determined using range sensor;
Image according to the collection determines orientation where the user nearest apart from the loudspeaker.
Further, before determining orientation where user, also include:
The user is identified using iris recognition sensor.
Second aspect, the embodiment of the present invention additionally provides a kind of audio output device, and the device includes:
Orientation determining module, for when loudspeaker is detected in audio output state, determining orientation where user;
Dio Output Modules, for controlling the loudspeaker to output audio in orientation where the user.
Further, the orientation determining module specifically for, IMAQ is carried out to the space where the loudspeaker, And the image to gathering carries out image recognition;If characteristics of human body's information is included in the image of the collection, according to the collection Image determine the orientation of characteristics of human body, using the orientation of the characteristics of human body as orientation where user.
Further, the orientation determining module is specifically for using rotating camera to the sky where the loudspeaker Between carry out IMAQ, and during the rotating camera rotates Real time identification collection image;If recognizing collection Image in include characteristics of human body's information, then control the rotating camera to stop the rotation, by when stopping the rotation it is described rotation take the photograph As the orientation of head direction is used as orientation where user.
Further, the orientation determining module specifically for, IMAQ is carried out to the space where the loudspeaker, And the image of collection is matched with the image of the user of advance collection;If the match is successful, according to the collection Image determines the orientation of the user.
Further, the orientation determining module specifically for, IMAQ is carried out to the space where the loudspeaker, If characteristics of human body's information of multiple users is included in recognizing the image of collection, raised one's voice using described in range sensor determination The distance between device and each user;Image according to the collection determines orientation where the user nearest apart from the loudspeaker.
Further, the audio output device also includes:
Iris recognition module, before orientation where for determining user in the orientation determining module, using iris recognition Sensor identifies the user.
The third aspect, the embodiment of the present invention additionally provides a kind of terminal device, including any that above-mentioned second aspect is provided The item audio output device and loudspeaker;
The loudspeaker is arranged in the terminal device.
Further, the terminal device includes camera and range sensor;Or, camera and iris recognition sensing Device;
The camera, the image in the space where for gathering the terminal device, and determined according to the image of collection Orientation where user;
The range sensor, for determining the distance between the terminal device and user;
The iris recognition sensor, for identifying user.
Further, the camera is rotary pick-up head.
The technical scheme of the embodiment of the present invention, is exported by the orientation of terminal automatic identification user and to orientation where user Audio, solves the technical problem that directional loudspeaker is unable to automatic identification user direction, realizes the orientation of automatic identification user, And the technique effect of audio is exported to the azimuthal orientation.
Brief description of the drawings
Fig. 1 is the flow chart of the audio-frequency inputting method in the embodiment of the present invention one;
Fig. 2 is the flow chart of the audio-frequency inputting method in the embodiment of the present invention two;
Fig. 3 is the flow chart of the audio-frequency inputting method in the embodiment of the present invention three;
Fig. 4 is the flow chart of the audio-frequency inputting method in the embodiment of the present invention four;
Fig. 5 is the flow chart of the audio-frequency inputting method in the embodiment of the present invention five;
Fig. 6 is the structural representation of the audio output device in the embodiment of the present invention six;
Fig. 7 is the structural representation of the terminal device in the embodiment of the present invention seven.
Specific embodiment
The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just Part rather than entire infrastructure related to the present invention is illustrate only in description, accompanying drawing.
Embodiment one
Fig. 1 is a kind of flow chart of audio-frequency inputting method that the embodiment of the present invention one is provided, and the present embodiment is applicable to fixed To the situation of output audio, the method can perform by audio output device provided in an embodiment of the present invention, and the device can be with Realized by the way of software and/or hardware, the device can be integrated in the terminal with audio output function, for example, raising Sound device, mobile terminal (such as mobile phone, panel computer), car-mounted terminal, notebook computer and fixed terminal (such as desktop computer) In.Specifically include following steps:
S110, when detecting loudspeaker and being in audio output state, determine orientation where user.
The loudspeaker can be the loudspeaker being arranged in terminal, or loudspeaker apparatus.When the loudspeaker When setting terminal, audio output state refers to the state that terminal exports sound by loudspeaker, for example, can be talking state Or music state etc..When the loudspeaker is loudspeaker apparatus, audio output state refers to loudspeaker apparatus and broadcasts Play a record or audio out sound state.Orientation where user refers to the recipient position of audio relative to terminal Direction.
S120, the control loudspeaker are to output audio in orientation where the user.
In the present embodiment, when it is determined that after the orientation of user, control instruction is sent to loudspeaker so that audio is towards user Direction output.For example, user can send control instruction by control device (for example, remote control or mobile phone) to loudspeaker, tool Body can send control instruction using wifi network, bluetooth or 4G networks to loudspeaker, after the loudspeaker receives control instruction, can Loudspeaker are exported into audio towards user by rotating.
Wherein, controlling loudspeaker orientation output audio can be realized by active directional loudspeaker or matrix loudspeaker array. The operation principle of active directional loudspeaker is that low frequency sound signals are loaded on the very strong high-frequency signal of directive property, then by putting Greatly, it is transmitted into air, then, air will can naturally leach high-frequency signal rapid filtration, audible signal thereon, Realize the direction propagation as laser;The operation principle of matrix loudspeaker array be by some loudspeakers matrix arrangement at equal intervals, Each loudspeaker unit radiates a same-phase wave surface for plane, and the combination of multiple units is formed and can provide single main extension Sound source, the wave surface of the loudspeaker array produces quality by the coupling in whole audiorange in certain area coverage Consistent sound, makes it be propagated in a certain direction in the form of wave beam.
What deserves to be explained is, it is determined that, it is necessary to judge whether the audio in output state needs before orientation where user Orient output.Specifically, the distance of applications distances sensor detection terminal and user face, when the distance is less than predeterminable range When, it is not necessary to carry out the identification of user location, be normally carried out the broadcasting of voice, otherwise, when the distance more than predeterminable range or When the loudspeaker of terminal is in hands-free outer mode playback, the orientation of user can be determined automatically or the manually opened station-keeping mode of user, and Controlling loudspeaker is to output audio in orientation where user.Wherein, predeterminable range typically can be 10cm or 20cm.
The technical scheme of the present embodiment, sound is exported by the orientation of terminal automatic identification user and to orientation where user Frequently, the technical problem that directional loudspeaker is unable to automatic identification user direction is solved, the orientation of automatic identification user is realized, and The technique effect of audio is exported to the azimuthal orientation.
Embodiment two
Fig. 2 is a kind of flow chart of audio-frequency inputting method that the embodiment of the present invention two is provided, in the base of above-described embodiment one Audio-frequency inputting method is optimized on plinth, there is provided the method for determining orientation where user, specifically to the loudspeaker institute Space carry out IMAQ, and image to gathering carries out image recognition;If special comprising human body in the image of the collection Reference ceases, then the image according to the collection determines the orientation of characteristics of human body, using the orientation of the characteristics of human body as user institute In orientation.Accordingly, the method for the present embodiment includes:
S210, IMAQ is carried out to the space where the loudspeaker, and image to gathering carries out image recognition.
Wherein, it can, by camera collection image, image be entered that IMAQ is carried out to the space where loudspeaker Whether row identification refers to being identified the image information included in image, it is determined that including user in the image of collection.
If including characteristics of human body's information in S220, the image of the collection, the image according to the collection determines human body The orientation of feature, using the orientation of the characteristics of human body as orientation where user.
Wherein, characteristics of human body's information refers to being able to confirm that the information comprising human body in image, for example, can be human body head Portion, face or face etc., if containing above-mentioned any one characteristics of human body's information in identifying image, it is possible to determine image In contain user.Side of the user relative to terminal is calculated and determined by characteristics of human body's information relative position in the picture Position.
Terminal can separated in time, for example can be 30 seconds or 1 minute, the figure in the space where continuous acquisition loudspeaker Picture simultaneously recognizes that acquisition characteristics of human body's information, determines the orientation of user in real time.
S230, the control loudspeaker are to output audio in orientation where the user.
The technical scheme of the present embodiment, by gathering the image in space where loudspeaker, identification characteristics of human body's information is with certainly The orientation of dynamic identifying user, and audio is exported to the azimuthal orientation, the orientation of automatic identification user is realized, and it is fixed to the orientation To the technique effect of output audio.
Embodiment three
Fig. 3 is a kind of flow chart of audio-frequency inputting method that the embodiment of the present invention three is provided, on the basis of above-described embodiment On audio-frequency inputting method is optimized, there is provided the method for determining orientation where user, specifically using rotating camera pair Space where the loudspeaker carries out IMAQ, and the Real time identification collection during the rotating camera rotates Image;If comprising characteristics of human body's information in recognizing the image of collection, controlling the rotating camera to stop the rotation, will stop The orientation of the rotating camera direction is used as orientation where user during rotation.Accordingly, the method for the present embodiment includes:
S310, IMAQ is carried out to the space where the loudspeaker using rotating camera, and taken the photograph in the rotation The image of Real time identification collection during being rotated as head.
Wherein, rotating camera is the camera for being capable of rotary taking.Specifically, being in audio output when terminal is detected State, and judging it needs to be determined that during the orientation of user, terminal is automatic or the manually opened rotating camera of user, obtains terminal institute In the image in space, and the image to rotating camera acquisition is identified in real time, automatic to catch characteristics of human body's information.
If S320, recognizing in the image of collection comprising characteristics of human body's information, the rotating camera is controlled to stop rotation Turn, using the orientation of rotating camera direction when stopping the rotation as orientation where user.
Specifically, in rotating camera during rotary taking, when having recognized characteristics of human body's information and occurring, for example Can occur in that human body head in image, control rotating camera is stopped the rotation, rotating camera is stopped the rotation the moment Direction be defined as user where direction;Otherwise, the image in rotating camera collection space is continued, until recognizing someone Body characteristicses information, determines user direction.What deserves to be explained is, when the characteristics of human body's information in image disappears, rotating camera Automatic opening rotates and continues to gather the image in space, until having recognized characteristics of human body's information, determines user direction.
S330, the control loudspeaker are to output audio in orientation where the user.
The technical scheme of the present embodiment, obtains and recognizes the image in space where loudspeaker in real time by rotating camera, Automatically catch the characteristics of human body of user to determine the orientation of user, realize the orientation of automatic identification user, and it is fixed to the orientation To the technique effect of output audio.
Example IV
Fig. 4 is a kind of flow chart of audio-frequency inputting method that the embodiment of the present invention four is provided, on the basis of above-described embodiment On audio-frequency inputting method is optimized, there is provided the method for determining orientation where user, specifically to where the loudspeaker Space carry out IMAQ, and the image of collection is matched with the image of the user of advance collection;If matching into Work(, then the image according to the collection determine the orientation of the user.Accordingly, the method for the present embodiment includes:
S410, IMAQ is carried out to the space where the loudspeaker, and the image that will gather and the institute of collection in advance The image for stating user is matched.
Wherein, the user images of collection refer to what is compared for the image gathered with terminal in advance, pre-existing The image of the user in terminal for example can be the autonomous image for shooting of user, or terminal in images match before During automatic storage user image.
Specifically, terminal can be matched using face recognition algorithms to the image for gathering.The principle of face recognition algorithms It is the face information in the image for extracting terminal collection, including eyes, nose, face or ear etc., and will be to face information Matched with the face information in the image of the user of advance collection, when similarity reaches preset value, determined that terminal is gathered Image in there is terminal user.Wherein, the preset value of matching similarity can be terminal recommendation, or user from The adjusted value of definition, for example, can be 80% or 90%.When matching similarity preset value is higher, matching accuracy is higher, More long with elapsed time, accordingly, when matching similarity preset value is relatively low, matching speed is fast, and matching accuracy is low, easily goes out Now recognize the situation of mistake.
If S420, the match is successful, the image according to the collection determines the orientation of the user.
Wherein, the match is successful refer to terminal collection image and in advance collection user images in face information phase Matching similarity preset value is reached like degree, there is terminal user in the image for confirming terminal collection.Can be used by the terminal The face information at family relative position in the picture is calculated and determined orientation of the user relative to terminal.
In the present embodiment, by the image information match cognization of information in the image that is gathered to terminal and default user, Identification terminal user, determines the orientation of user, improves the degree of accuracy of orientation determination.
Optionally, before determining orientation where user, the method also includes:
The user is identified using iris recognition sensor.
Wherein, iris recognition technology refers to carrying out identification by eyes.Iris is the black for being located at human eye Annular formations between pupil and white sclera, it comprises many interlaced spots, filament, coronal, striped and hidden The minutia of nest etc.;Iris keeps constant in the whole life course after prenatal development stage is formed.According to the thin of iris Section feature is capable of the identity of the identifying user of uniqueness.
Iris recognition sensor is the sensor that can obtain human eye iris image and identifying user identity.Iris recognition is passed The operation principle of sensor is acquisition iris image;Iris image is pre-processed, it is met the demand for extracting iris feature; Extract iris feature;Characteristic matching, identifying user identity are carried out to extract and model.
Specifically, in the present embodiment, the iris image in space where loudspeaker is obtained by iris recognition sensor, and The iris image for obtaining is identified in real time, and is matched with the iris image of the terminal user for prestoring, when the match is successful When, it is determined that the iris image for obtaining belongs to terminal user, and calculate the orientation for determining terminal user.
In the present embodiment, by the identity of iris recognition technology unique identification terminal user, the orientation of user is determined, improve The degree of accuracy that orientation determines.
S430, the control loudspeaker are to output audio in orientation where the user.
The technical scheme of the present embodiment, by the image information of information in the image that is gathered to terminal and default user With identification, identification terminal user determines the orientation of user, solves the technology that directional loudspeaker is unable to automatic identification user direction Problem, realizes the orientation of automatic identification user, and the technique effect of audio is exported to the azimuthal orientation.
Embodiment five
Fig. 5 is a kind of flow chart of audio-frequency inputting method that the embodiment of the present invention five is provided, on the basis of above-described embodiment On audio-frequency inputting method is optimized, there is provided the method for determining orientation where user, specifically to where the loudspeaker Space carry out IMAQ, if during characteristics of human body's information comprising multiple users in recognizing the image of collection, using away from Determine the distance between the loudspeaker and each user from sensor;Image according to the collection is determined apart from the loudspeaker Orientation where nearest user.Accordingly, the method for the present embodiment includes:
S510, IMAQ is carried out to the space where the loudspeaker, if comprising multiple in recognizing the image of collection During characteristics of human body's information of user, then the distance between the loudspeaker and each user are determined using range sensor.
Wherein, range sensor is a kind of sensor that can detect physical distance, for example, can be passed by optoelectronic distance Distance between sensor or ultrasonic distance sensor detection user and terminal speaker.Specifically, when the figure for recognizing terminal collection There is characteristics of human body's information that is multiple and being not belonging to same user in different azimuth as in, terminal cannot determine the output side of audio To can detect by range sensor and recognize unique audio output user to determine the outbound course of audio.
S520, the orientation according to where the image of the collection determines the user nearest apart from the loudspeaker.
Specifically, detecting the distance between each user and terminal speaker by range sensor, the distance is compared Compared with according to comparative result, the nearest user of chosen distance is defined as audio output user.According to the orientation of audio output user Determine the outbound course of audio.
S530, the control loudspeaker are to output audio in orientation where the user.
The technical scheme of the present embodiment, when characteristics of human body's information of many people is recognized, is detected by range sensor and known Other each user and the distance of terminal speaker, are defined as audio output user to determine the output of audio by nearest user Direction, realizes in the presence of multiple users, the effect in automatic identification audio output direction.
Embodiment six
Fig. 6 is the structural representation of the audio output device that the embodiment of the present invention six is provided, and the device is adapted for carrying out this The audio-frequency inputting method that inventive embodiments are provided, as shown in fig. 6, the device can specifically include:
Orientation determining module 610, for when loudspeaker is detected in audio output state, determining user place side Position;
Dio Output Modules 620, for controlling the loudspeaker to output audio in orientation where the user.
Optionally, orientation determining module 610 specifically for, IMAQ is carried out to the space where the loudspeaker, and Image to gathering carries out image recognition;If characteristics of human body's information is included in the image of the collection, according to the collection Image determines the orientation of characteristics of human body, using the orientation of the characteristics of human body as orientation where user.
Optionally, orientation determining module 610 to the space where the loudspeaker using rotating camera specifically for being entered Row IMAQ, and the image that Real time identification is gathered during the rotating camera rotates;If recognizing the figure of collection Characteristics of human body's information is included as in, then controls the rotating camera to stop the rotation, by rotating camera when stopping the rotation The orientation of direction is used as orientation where user.
Optionally, orientation determining module 610 specifically for, IMAQ is carried out to the space where the loudspeaker, and The image of collection is matched with the image of the user of advance collection;If the match is successful, according to the figure of the collection Orientation as determining the user.
Optionally, orientation determining module 610 specifically for, IMAQ is carried out to the space where the loudspeaker, if When recognizing the characteristics of human body's information comprising multiple users in the image of collection, then the loudspeaker is determined using range sensor The distance between with each user;Image according to the collection determines orientation where the user nearest apart from the loudspeaker.
Optionally, the audio output device also includes:
Iris recognition module, before orientation where for determining user in the orientation determining module, using iris recognition Sensor identifies the user.
The present embodiment passes through the orientation of terminal automatic identification user and exports audio to orientation where user, solves orientation Loudspeaker is unable to the technical problem in automatic identification user direction, realizes the orientation of automatic identification user, and to the azimuthal orientation Export the technique effect of audio.
Embodiment seven
Fig. 7 is the structural representation of the terminal device that the embodiment of the present invention seven is provided, based on the sound that above-described embodiment is provided Frequency output device, present embodiments provides the terminal device of any one audio output device provided comprising above-described embodiment 700.Audio output device 600 with the automatic identification user direction of control terminal equipment 700, and can orient output to user direction Audio.Specifically, the terminal device includes audio output device 600 and loudspeaker 710, the loudspeaker 710 is arranged on terminal and sets In standby 700.
Wherein, terminal device 700 can be the Intelligent worn devices such as intelligent watch or Intelligent bracelet, smart mobile phone or shifting Dynamic flat board etc..
The audio output direction instruction orientation output audio that loudspeaker 710 is formed according to audio output device 600.It is exemplary , loudspeaker can be realized using MEMS matrixes loudspeaker array in the present embodiment, and MEMS speaker sizes are micron order, MEMS squares MEMS number of loudspeakers can be typically 50-200 in battle array loudspeaker array, and MEMS number of loudspeakers is preferably in the present embodiment 100 or so, MEMS matrix loudspeaker array are preferably dimensioned to be 10mm.
MEMS matrix loudspeakers are different from classical matrix piezo-electric loudspeaker, and small volume is Miniaturized, volume production and can apply In terminal device 700.
Optionally, the terminal device 700 includes camera and range sensor;Or, camera and iris recognition sensor;
The camera, the image in the space where for gathering the terminal device, and determined according to the image of collection Orientation where user;
The range sensor, for determining the distance between the terminal device and user;
The iris recognition sensor, for identifying user.
Optionally, camera is rotary pick-up head.
The present embodiment is on the basis of above-described embodiment, there is provided a kind of terminal device, and the embodiment passes through audio output Device determines user location, and, to user location orientation output audio, solving directional loudspeaker can not be automatic for controlling loudspeaker The technical problem in identifying user direction, realizes the orientation of automatic identification user, and the technology of audio is exported to the azimuthal orientation Effect.
Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also More other Equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims (15)

1. a kind of audio-frequency inputting method, it is characterised in that including:
When loudspeaker is detected in audio output state, orientation where user is determined;
The loudspeaker is controlled to output audio in orientation where the user.
2. method according to claim 1, it is characterised in that orientation where determining user includes:
Carry out IMAQ to the space where the loudspeaker, and image to gathering carries out image recognition;
If including characteristics of human body's information in the image of the collection, the image according to the collection determines the side of characteristics of human body Position, using the orientation of the characteristics of human body as orientation where user.
3. method according to claim 1, it is characterised in that orientation where determining user includes:
IMAQ is carried out to the space where the loudspeaker using rotating camera, and in rotating camera rotation During Real time identification collection image;
If comprising characteristics of human body's information in recognizing the image of collection, controlling the rotating camera to stop the rotation, will stop The orientation of the rotating camera direction is used as orientation where user during rotation.
4. method according to claim 1, it is characterised in that orientation where determining user includes:
Carry out IMAQ to the space where the loudspeaker, and the image that will be gathered and the user of advance collection figure As being matched;
If the match is successful, the image according to the collection determines the orientation of the user.
5. method according to claim 1, it is characterised in that orientation where determining user includes:
IMAQ is carried out to the space where the loudspeaker, if comprising the human body of multiple users in recognizing the image of collection During characteristic information, then the distance between the loudspeaker and each user are determined using range sensor;
Image according to the collection determines orientation where the user nearest apart from the loudspeaker.
6. the method according to claim any one of 1-3, it is characterised in that before determining orientation where user, also include:
The user is identified using iris recognition sensor.
7. a kind of audio output device, it is characterised in that including:
Orientation determining module, for when loudspeaker is detected in audio output state, determining orientation where user;
Dio Output Modules, for controlling the loudspeaker to output audio in orientation where the user.
8. device according to claim 7, it is characterised in that the orientation determining module to described specifically for raising one's voice Space where device carries out IMAQ, and image to gathering carries out image recognition;If including people in the image of the collection Body characteristicses information, then the image according to the collection determine the orientation of characteristics of human body, using the orientation of the characteristics of human body as with Orientation where family.
9. device according to claim 7, it is characterised in that the orientation determining module using rotation specifically for being taken the photograph IMAQ, and the Real time identification during the rotating camera rotates are carried out to the space where the loudspeaker as head The image of collection;If comprising characteristics of human body's information in recognizing the image of collection, controlling the rotating camera to stop the rotation, Using the orientation of rotating camera direction when stopping the rotation as orientation where user.
10. device according to claim 7, it is characterised in that the orientation determining module to described specifically for raising one's voice Space where device carries out IMAQ, and the image of collection is matched with the image of the user of advance collection;If The match is successful, then the image according to the collection determines the orientation of the user.
11. devices according to claim 7, it is characterised in that the orientation determining module to described specifically for raising one's voice Space where device carries out IMAQ, if during characteristics of human body's information comprising multiple users in recognizing the image of collection, The distance between the loudspeaker and each user are determined using range sensor;Image according to the collection is determined described in distance Orientation where the nearest user of loudspeaker.
12. device according to claim any one of 7-9, it is characterised in that also include:
Iris recognition module, before orientation where for determining user in the orientation determining module, is sensed using iris recognition Device identifies the user.
13. a kind of terminal devices, it is characterised in that including audio output device and loudspeaker described in any one of 7-12;
The loudspeaker is arranged in the terminal device.
14. terminal devices according to claim 13, it is characterised in that including camera and range sensor;Or, shooting Head and iris recognition sensor;
The camera, the image in the space where for gathering the terminal device, and user is determined according to the image of collection Place orientation;
The range sensor, for determining the distance between the terminal device and user;
The iris recognition sensor, for identifying user.
15. terminal devices according to claim 14, it is characterised in that the camera is rotary pick-up head.
CN201611056298.3A 2016-11-23 2016-11-23 Audio output method and device and terminal equipment Pending CN106792341A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201611056298.3A CN106792341A (en) 2016-11-23 2016-11-23 Audio output method and device and terminal equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201611056298.3A CN106792341A (en) 2016-11-23 2016-11-23 Audio output method and device and terminal equipment

Publications (1)

Publication Number Publication Date
CN106792341A true CN106792341A (en) 2017-05-31

Family

ID=58910740

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201611056298.3A Pending CN106792341A (en) 2016-11-23 2016-11-23 Audio output method and device and terminal equipment

Country Status (1)

Country Link
CN (1) CN106792341A (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107656718A (en) * 2017-08-02 2018-02-02 宇龙计算机通信科技(深圳)有限公司 A kind of audio signal direction propagation method, apparatus, terminal and storage medium
CN108399916A (en) * 2018-01-08 2018-08-14 蔚来汽车有限公司 Vehicle intelligent voice interactive system and method, processing unit and storage device
CN108536418A (en) * 2018-03-26 2018-09-14 深圳市冠旭电子股份有限公司 A kind of method, apparatus and wireless sound box of the switching of wireless sound box play mode
CN108595145A (en) * 2018-04-26 2018-09-28 广东小天才科技有限公司 Voice playing control method and device of wearable device and wearable device
CN108810742A (en) * 2018-08-01 2018-11-13 奇酷互联网络科技(深圳)有限公司 Speaker control method, device, readable storage medium storing program for executing and mobile terminal
CN109257682A (en) * 2018-09-29 2019-01-22 歌尔科技有限公司 Pickup adjusting method, controlling terminal and computer readable storage medium
CN110139246A (en) * 2019-05-22 2019-08-16 广州小鹏汽车科技有限公司 Treating method and apparatus, automobile and the machine readable media of on-vehicle Bluetooth call
CN110611861A (en) * 2019-09-06 2019-12-24 Oppo广东移动通信有限公司 Directional sound production control method and device, sound production equipment, medium and electronic equipment
CN111193987A (en) * 2019-12-27 2020-05-22 新石器慧通(北京)科技有限公司 Method and device for directionally playing sound by vehicle and unmanned vehicle
CN111385649A (en) * 2018-12-28 2020-07-07 深圳Tcl新技术有限公司 Television sound transmission control method and device, smart television and storage medium
CN111486491A (en) * 2020-01-04 2020-08-04 于贵庆 Intelligent control system and method based on content identification
CN111823241A (en) * 2019-05-27 2020-10-27 广东小天才科技有限公司 Intelligent security robot, method and device and storage medium
CN111866674A (en) * 2019-04-25 2020-10-30 北京小米移动软件有限公司 Speaker assembly control method, device and storage medium
CN113504890A (en) * 2021-07-14 2021-10-15 炬佑智能科技(苏州)有限公司 ToF camera-based speaker assembly control method, apparatus, device, and medium

Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1984507A (en) * 2005-12-16 2007-06-20 乐金电子(沈阳)有限公司 Voice-frequency/video-frequency equipment and method for automatically adjusting loundspeaker position
CN101030244A (en) * 2006-03-03 2007-09-05 中国科学院自动化研究所 Automatic identity discriminating method based on human-body physiological image sequencing estimating characteristic
US20110050643A1 (en) * 2009-08-28 2011-03-03 INVENTEC APPLIANCES (Shanghai) CO., LTD./ INVENTEC APPLIANCES CORP. Passive infrared sensing user interface and device using the same
CN102004881A (en) * 2010-11-24 2011-04-06 东莞宇龙通信科技有限公司 Mobile terminal and switching device and method of working modes thereof
CN102025945A (en) * 2009-09-16 2011-04-20 宏碁股份有限公司 Electronic device and control method thereof
CN103002376A (en) * 2011-09-09 2013-03-27 联想(北京)有限公司 Method for orientationally transmitting voice and electronic equipment
CN204031425U (en) * 2014-05-05 2014-12-17 马建敏 Active directional loudspeaker
CN104732396A (en) * 2015-03-24 2015-06-24 广东欧珀移动通信有限公司 Payment control method and device
CN104902203A (en) * 2015-05-19 2015-09-09 广东欧珀移动通信有限公司 Video recording method based on rotary camera, and terminal
CN104935718A (en) * 2015-06-11 2015-09-23 广东欧珀移动通信有限公司 Control method and mobile terminal
CN104935717A (en) * 2015-06-11 2015-09-23 广东欧珀移动通信有限公司 Call reminder method based on rotary camera and terminal
CN105007368A (en) * 2015-06-11 2015-10-28 广东欧珀移动通信有限公司 Method of controlling loudspeaker, and mobile terminal
CN105007553A (en) * 2015-07-23 2015-10-28 惠州Tcl移动通信有限公司 Sound oriented transmission method of mobile terminal and mobile terminal
CN105245732A (en) * 2015-11-12 2016-01-13 浪潮(北京)电子信息产业有限公司 Switching method and device for terminal equipment and terminal equipment
CN205620939U (en) * 2016-05-05 2016-10-05 广东小天才科技有限公司 Wearable equipment of intelligence
CN207835776U (en) * 2018-03-12 2018-09-07 上海慧声科技有限公司 A kind of adjustable directional loudspeaker of output power

Patent Citations (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1984507A (en) * 2005-12-16 2007-06-20 乐金电子(沈阳)有限公司 Voice-frequency/video-frequency equipment and method for automatically adjusting loundspeaker position
CN101030244A (en) * 2006-03-03 2007-09-05 中国科学院自动化研究所 Automatic identity discriminating method based on human-body physiological image sequencing estimating characteristic
US20110050643A1 (en) * 2009-08-28 2011-03-03 INVENTEC APPLIANCES (Shanghai) CO., LTD./ INVENTEC APPLIANCES CORP. Passive infrared sensing user interface and device using the same
CN102025945A (en) * 2009-09-16 2011-04-20 宏碁股份有限公司 Electronic device and control method thereof
CN102004881A (en) * 2010-11-24 2011-04-06 东莞宇龙通信科技有限公司 Mobile terminal and switching device and method of working modes thereof
CN103002376A (en) * 2011-09-09 2013-03-27 联想(北京)有限公司 Method for orientationally transmitting voice and electronic equipment
CN204031425U (en) * 2014-05-05 2014-12-17 马建敏 Active directional loudspeaker
CN104732396A (en) * 2015-03-24 2015-06-24 广东欧珀移动通信有限公司 Payment control method and device
CN104902203A (en) * 2015-05-19 2015-09-09 广东欧珀移动通信有限公司 Video recording method based on rotary camera, and terminal
CN104935718A (en) * 2015-06-11 2015-09-23 广东欧珀移动通信有限公司 Control method and mobile terminal
CN104935717A (en) * 2015-06-11 2015-09-23 广东欧珀移动通信有限公司 Call reminder method based on rotary camera and terminal
CN105007368A (en) * 2015-06-11 2015-10-28 广东欧珀移动通信有限公司 Method of controlling loudspeaker, and mobile terminal
CN105007553A (en) * 2015-07-23 2015-10-28 惠州Tcl移动通信有限公司 Sound oriented transmission method of mobile terminal and mobile terminal
CN105245732A (en) * 2015-11-12 2016-01-13 浪潮(北京)电子信息产业有限公司 Switching method and device for terminal equipment and terminal equipment
CN205620939U (en) * 2016-05-05 2016-10-05 广东小天才科技有限公司 Wearable equipment of intelligence
CN207835776U (en) * 2018-03-12 2018-09-07 上海慧声科技有限公司 A kind of adjustable directional loudspeaker of output power

Cited By (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10694291B2 (en) 2017-08-02 2020-06-23 Yulong Computer Telecommunication Scientific (Shenzhen) Co., Ltd. (Cn) Directional propagation method and apparatus for audio signal, a terminal device and a storage medium
CN107656718A (en) * 2017-08-02 2018-02-02 宇龙计算机通信科技(深圳)有限公司 A kind of audio signal direction propagation method, apparatus, terminal and storage medium
CN108399916A (en) * 2018-01-08 2018-08-14 蔚来汽车有限公司 Vehicle intelligent voice interactive system and method, processing unit and storage device
CN108536418A (en) * 2018-03-26 2018-09-14 深圳市冠旭电子股份有限公司 A kind of method, apparatus and wireless sound box of the switching of wireless sound box play mode
CN108595145A (en) * 2018-04-26 2018-09-28 广东小天才科技有限公司 Voice playing control method and device of wearable device and wearable device
CN108810742A (en) * 2018-08-01 2018-11-13 奇酷互联网络科技(深圳)有限公司 Speaker control method, device, readable storage medium storing program for executing and mobile terminal
CN108810742B (en) * 2018-08-01 2021-03-19 奇酷互联网络科技(深圳)有限公司 Sound box control method and device, readable storage medium and mobile terminal
CN109257682A (en) * 2018-09-29 2019-01-22 歌尔科技有限公司 Pickup adjusting method, controlling terminal and computer readable storage medium
CN111385649B (en) * 2018-12-28 2022-01-04 深圳Tcl新技术有限公司 Television sound transmission control method and device, smart television and storage medium
CN111385649A (en) * 2018-12-28 2020-07-07 深圳Tcl新技术有限公司 Television sound transmission control method and device, smart television and storage medium
CN111866674B (en) * 2019-04-25 2022-02-22 北京小米移动软件有限公司 Speaker assembly control method, device and storage medium
CN111866674A (en) * 2019-04-25 2020-10-30 北京小米移动软件有限公司 Speaker assembly control method, device and storage medium
CN110139246A (en) * 2019-05-22 2019-08-16 广州小鹏汽车科技有限公司 Treating method and apparatus, automobile and the machine readable media of on-vehicle Bluetooth call
CN111823241A (en) * 2019-05-27 2020-10-27 广东小天才科技有限公司 Intelligent security robot, method and device and storage medium
CN110611861A (en) * 2019-09-06 2019-12-24 Oppo广东移动通信有限公司 Directional sound production control method and device, sound production equipment, medium and electronic equipment
CN111193987A (en) * 2019-12-27 2020-05-22 新石器慧通(北京)科技有限公司 Method and device for directionally playing sound by vehicle and unmanned vehicle
CN111486491A (en) * 2020-01-04 2020-08-04 于贵庆 Intelligent control system and method based on content identification
CN111486491B (en) * 2020-01-04 2021-04-13 董峰 Intelligent control system and method based on content identification
CN113504890A (en) * 2021-07-14 2021-10-15 炬佑智能科技(苏州)有限公司 ToF camera-based speaker assembly control method, apparatus, device, and medium

Similar Documents

Publication Publication Date Title
CN106792341A (en) Audio output method and device and terminal equipment
US20230308067A1 (en) Intelligent audio output devices
US10405081B2 (en) Intelligent wireless headset system
EP3120298B1 (en) Method and apparatus for establishing connection between electronic devices
US10027888B1 (en) Determining area of interest in a panoramic video or photo
CN102902505B (en) Device with enhancing audio
CN102045618B (en) Automatically adjusted microphone array, method for automatically adjusting microphone array, and device carrying microphone array
CN108766438B (en) Man-machine interaction method and device, storage medium and intelligent terminal
US20190028817A1 (en) System and method for a directional speaker selection
US11482237B2 (en) Method and terminal for reconstructing speech signal, and computer storage medium
CN108428452A (en) Terminal support and far field voice interactive system
US20130120243A1 (en) Display apparatus and control method thereof
CN107920263A (en) Volume adjusting method and device
CN109284081B (en) Audio output method and device and audio equipment
US20160314785A1 (en) Sound reproduction method, speech dialogue device, and recording medium
US12061278B1 (en) Acoustic identification of audio products
CN109460072A (en) A kind of audio frequency apparatus orientation display methods, device and audio frequency apparatus
EP3195618B1 (en) A method for operating a hearing system as well as a hearing system
CN112533070B (en) Video sound and picture adjusting method, terminal and computer readable storage medium
CN110351629B (en) Radio reception method, radio reception device and terminal
US20230171493A1 (en) Electronic device for autofocusing and method of operating the same
CN109473096B (en) Intelligent voice equipment and control method thereof
CN115035187A (en) Sound source direction determining method, device, terminal, storage medium and product
US10713002B1 (en) Electronic devices and corresponding methods for adjusting audio output devices to mimic received audio input
CN112887770A (en) Photo transmission method and device, television and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170531

RJ01 Rejection of invention patent application after publication