CN104834377A

CN104834377A - Audio control method based on 3D (3-Dimensional) gesture recognition

Info

Publication number: CN104834377A
Application number: CN201510222339.0A
Authority: CN
Inventors: 杨天虎; 杨伟茂; 孙国辉
Original assignee: Living Network Science And Technology Ltd On Chengdu
Current assignee: Living Network Science And Technology Ltd On Chengdu
Priority date: 2015-05-05
Filing date: 2015-05-05
Publication date: 2015-08-12

Abstract

The invention discloses an audio control method based on 3D (3-Dimensional) gesture recognition. The audio control method based on 3D gesture recognition comprises the following steps: S1) acquiring electric field data in a gesture recognition area; S2) establishing a spatial 3D coordinate system in the gesture recognition area; S3) acquiring position coordinates of an electric field change area in the gesture recognition area; S4) repeating step S3 to obtain dynamic change data of the position coordinates of the electric field change area; S5) adjusting audio volume in real time according to the dynamic change data of an X-axis coordinate; S6) controlling audio switching in real time according to the dynamic change data of a Y-axis coordinate; S7) controlling audio play and pause in real time according to the dynamic change data of a Z-axis coordinate. By recognizing gesture actions in a 3D space, operations such as audio play/pause, volume adjustment and audio switching on smart devices are realized, and the audio control method based on 3D gesture recognition has the characteristics of naturalness, simplicity, novelty and the like.

Description

A kind of audio control method based on 3D gesture identification

Technical field

The invention belongs to embedded software technology field, be specifically related to a kind of design of the audio control method based on 3D gesture identification.

Background technology

In the reciprocal process of user and smart machine, input mode seems particularly important, and input mode can strengthen the experience effect of user easily.In prior art, on smart machine, voice-operated input mode is generally adopted as input through keyboard or touches input.On the one hand, these two kinds of input modes are ripe and stable implementations, have substantially been easily accepted by a user; On the other hand, these two kinds of input modes lack certain novelty, are difficult to realize the personalized customization of user to smart machine.

Recent years; along with the fast development of computer technology; the novel human-computer interaction technology that research meets interpersonal communication custom becomes Showed Very Brisk; also make encouraging progress, these researchs comprise recognition of face, human facial expression recognition, labiomaney, head movement tracking, stare tracking, gesture identification and body posture identification etc.Generally speaking. from progressively transferring to centered by computing machine, focus be put on man for human-computer interaction technology, is the interaction technique of multimedia, various modes.

Gesture refers under the consciousness domination of people, and all kinds of actions that staff is made, if digital flexion, stretching, extension and hand are in the motion etc. in space, can be perform a certain task, also can be and the exchanging, to express certain implication or intention of people.Gesture be a kind of natural, directly perceived, be easy to learn man-machine interaction means, using staff directly as the input equipment of computing machine, the communication of between humans and machines will no longer need middle media, and user can define the machine of a kind of suitable gesture to surrounding simply and control.Using staff directly as input medium compared with other input mode, there is naturality, terseness, rich and direct feature.

Summary of the invention

The object of the invention is to lack certain novelty to solve in prior art voice-operated input mode on smart machine, being difficult to realize the problem of user to the personalized customization of smart machine, proposing a kind of audio control method based on 3D gesture identification.

Technical scheme of the present invention is: a kind of audio control method based on 3D gesture identification, comprises the following steps:

S1, the electric field data obtained in gesture identification region;

S2, in gesture identification region, set up space 3D coordinate system;

The position coordinates of S3, acquisition gesture identification region internal electric field region of variation;

S4, repetition step S3, obtain the dynamic changing data of electric field change regional location coordinate;

S5, adjust the volume of audio frequency in real time according to the dynamic changing data of X-axis coordinate;

S6, control the switching between audio frequency in real time according to the dynamic changing data of Y-axis coordinate;

S7, control broadcasting and the time-out of audio frequency in real time according to the dynamic changing data of Z axis coordinate.

Further, step S2 specifically comprises step by step following:

S21, selected a bit as true origin in gesture identification region;

S22, determine the positive dirction of X-axis, Y-axis and Z axis, set up space 3D coordinate system.

Further, step S5 specifically comprises step by step following:

S51, the variable quantity of setting X-axis coordinate data and the corresponding relation of volume variable quantity;

The acquisition time interval delta T of S52, setting X-axis coordinate data _x;

S53, calculate each acquisition time interval delta T according to formula (1) _xthe variable quantity of interior X-axis coordinate data:

ΔX _n＝X _n-X _n-1(n＝1,2,3…) (1)；

S54, to adjust in real time according to the volume of the corresponding relation set in step S51 to audio frequency.

Further, step S6 specifically comprises step by step following:

The acquisition time interval delta T of S61, setting Y-axis coordinate data _y;

S62, calculate each acquisition time interval delta T according to formula (2) _ythe variable quantity of interior Y-axis coordinate data:

ΔY _n＝Y _n-Y _n-1(n＝1,2,3…) (2)；

S63, setting audio handover trigger threshold value Y _maxwith Y _min;

S64, by the variation delta Y of Y-axis coordinate data _nrespectively with Y _maxand Y _mincompare,

If Δ Y _n>=Y _max, then the next audio frequency in audio playlist is switched to;

If Δ Y _n<=Y _min, then the upper audio frequency in audio playlist is switched to;

If Y _min< Δ Y _n<Y _max, then continue to play present video.

Further, Y _maxvalue is just, Y _minvalue is negative.

Further, step S7 specifically comprises step by step following:

Activation threshold value Z is clicked in S71, definition _m;

Trigger condition is clicked in S72, definition: when first Z axis coordinate data reduces, and reduction exceedes and clicks activation threshold value Z _m, Z axis coordinate data increases again subsequently, and recruitment exceedes and clicks activation threshold value Z _m, be then defined as triggering and once click, note clicks times N _z=1;

Number of times determination time interval delta T is clicked in S73, setting _z;

S74, basis click number of times determination time interval delta T _zinterior clicks times N _zthe broadcasting of real-time control audio frequency and time-out:

If N _z=1, then audio plays;

If N _z=2, then suspend audio frequency;

If N _z≠ 1 and N _z≠ 2, then keep audio frequency current state.

The invention has the beneficial effects as follows: the present invention is by the identification to gesture motion in 3d space, achieve the operations such as the switching between the broadcasting/time-out to audio frequency on smart machine, volume adjustment and audio frequency, the personalized customization function of product can be realized, there is the features such as naturality, terseness, novelty.

Accompanying drawing explanation

Fig. 1 is a kind of audio control method process flow diagram based on 3D gesture identification provided by the invention.

Fig. 2 is the process flow diagram step by step of step S2 of the present invention.

Fig. 3 is the process flow diagram step by step of step S5 of the present invention.

Fig. 4 is the process flow diagram step by step of step S6 of the present invention.

Fig. 5 is the process flow diagram step by step of step S7 of the present invention.

Embodiment

Below in conjunction with accompanying drawing, embodiments of the invention are further described.

The invention provides a kind of audio control method based on 3D gesture identification, as shown in Figure 1, comprise the following steps:

S1, the electric field data obtained in gesture identification region;

Here adopt electric field strength transducer[sensor to measure gesture identified region, obtain the initial electric field data in gesture identification region, its object is to:

(1) reference is provided for setting up space 3D coordinate system subsequently in gesture identification region;

(2) dynamic changing data obtaining electric field signal is subsequently convenient to.

S2, in gesture identification region, set up space 3D coordinate system;

As shown in Figure 2, this step specifically comprises step by step following:

S21, selected a bit as true origin in gesture identification region;

In the present invention, clearly limit selected there is no of true origin position, usual true origin can be selected in the position near gesture identification regional center.

In the embodiment of the present invention, the positive dirction back to direction as Y-axis of electric field strength transducer[sensor is set up Y-axis; Using electric field strength transducer[sensor just to the right in direction as the positive dirction of X-axis, set up X-axis perpendicular to Y-axis; Using electric field strength transducer[sensor just to the top in direction as the positive dirction of Z axis, set up Z axis perpendicular to X-axis and Y-axis place plane, set up space 3D coordinate system with this.

Change due to user's gesture can cut the electric field line in gesture identification region, thus cause the change of electric field signal data, therefore the position coordinates in electric field change region can react the position of user's gesture, and the physical action of user's gesture change just can be characterized by the dynamic changing data of electric field change regional location coordinate.

As shown in Figure 3, this step specifically comprises step by step following:

In the embodiment of the present invention, the variable quantity of X-axis coordinate data and the corresponding relation of volume variable quantity are set as: X-axis coordinate data often increases 1cm, and volume increases 1dB; X-axis coordinate data often reduces 1cm, and volume reduces 1dB.

In the embodiment of the present invention, the acquisition time interval delta T of X-axis coordinate data _x=0.1s.

ΔX _n＝X _n-X _n-1(n＝1,2,3…) (1)；

Such as, if Δ X ₁=5cm, then the volume of audio frequency increases 5dB;

If Δ X ₂=-7cm, then the volume of audio frequency reduces 7dB.

As shown in Figure 4, this step specifically comprises step by step following:

In the embodiment of the present invention, the acquisition time interval delta T of Y-axis coordinate data _y=0.5s.

ΔY _n＝Y _n-Y _n-1(n＝1,2,3…) (2)；

S63, setting audio handover trigger threshold value Y _maxwith Y _min;

Wherein, Y _maxvalue is just, Y _minvalue is negative.

In the embodiment of the present invention, Audio conversion activation threshold value Y _max=20cm, Y _min=-20cm.

If Y _min< Δ Y _n<Y _max, then continue to play present video.

Such as, if Δ Y ₁=18cm, then continue to play present video;

If Δ Y ₂=22cm, then the next audio frequency switched in audio playlist is play;

If Δ Y ₃=-25cm, then the upper audio frequency switched in audio playlist is play.

As shown in Figure 5, this step specifically comprises step by step following:

Activation threshold value Z is clicked in S71, definition _m;

In the embodiment of the present invention, click activation threshold value Z _m=10cm.

In the embodiment of the present invention, click number of times determination time interval delta T _z=1s.

If N _z=1, then audio plays;

If N _z=2, then suspend audio frequency;

If N _z≠ 1 and N _z≠ 2, namely when clicking number of times and being other value outside 1 or 2, then keep audio frequency current state.

Audio frequency current state is kept to refer to: if audio frequency is current be in broadcast state, then to keep broadcast state; If audio frequency is current be in halted state, then keep halted state.

Those of ordinary skill in the art will appreciate that, embodiment described here is to help reader understanding's principle of the present invention, should be understood to that protection scope of the present invention is not limited to so special statement and embodiment.Those of ordinary skill in the art can make various other various concrete distortion and combination of not departing from essence of the present invention according to these technology enlightenment disclosed by the invention, and these distortion and combination are still in protection scope of the present invention.

Claims

1. based on an audio control method for 3D gesture identification, it is characterized in that, comprise the following steps:

S1, the electric field data obtained in gesture identification region;

S2, in gesture identification region, set up space 3D coordinate system;

2. the audio control method based on 3D gesture identification according to claim 1, is characterized in that, described step S2 specifically comprises step by step following:

S21, selected a bit as true origin in gesture identification region;

3. the audio control method based on 3D gesture identification according to claim 1, is characterized in that, described step S5 specifically comprises step by step following:

ΔX _n＝X _n-X _n-1(n＝1,2,3…) (1)；

4. the audio control method based on 3D gesture identification according to claim 1, is characterized in that, described step S6 specifically comprises step by step following:

ΔY _n＝Y _n-Y _n-1(n＝1,2,3…) (2)；

S63, setting audio handover trigger threshold value Y _maxwith Y _min;

If Y _min< Δ Y _n<Y _max, then continue to play present video.

5. the audio control method based on 3D gesture identification according to claim 4, is characterized in that, described Y _maxvalue is just, described Y _minvalue is negative.

6. the audio control method based on 3D gesture identification according to claim 1, is characterized in that, described step S7 specifically comprises step by step following:

Activation threshold value Z is clicked in S71, setting _m;

If N _z=1, then audio plays;

If N _z=2, then suspend audio frequency;

If N _z≠ 1 and N _z≠ 2, then keep audio frequency current state.