CN110427167B

CN110427167B - Audio information processing method for handheld device

Info

Publication number: CN110427167B
Application number: CN201910499687.0A
Authority: CN
Inventors: 李新宇
Original assignee: Zhuhai Shengyuan Intelligent Technology Co ltd
Current assignee: Zhuhai Shengyuan Intelligent Technology Co ltd
Priority date: 2019-06-11
Filing date: 2019-06-11
Publication date: 2024-01-02
Anticipated expiration: 2039-06-11
Also published as: CN110427167A

Abstract

The invention relates to a handheld device audio information processing method, a central processing unit sets one face of a handheld device as a positive direction according to an included angle between an attitude sensor and a horizontal plane, and judges whether the handheld device is in the positive direction or the negative direction; starting a corresponding audio processing engine according to the position of the handheld device; the handheld device collects audio information; the audio processing engine analyzes and processes the audio information; and storing the acquired audio information and the analyzed and processed audio information in a set storage area. The invention discloses a handheld device audio information processing method, which ingeniously utilizes the characteristic that the handheld device points differently during dialogue, obtains the self-gesture of the handheld device by using an electronic gyroscope commonly applied by the handheld electronic device at present, judges the current sound acquisition object by a corresponding algorithm, and has the advantages of simple and ingenious conception, no need of a large amount of computing power, low cost and high accuracy.

Description

Audio information processing method for handheld device

Technical Field

The invention relates to an audio processing method, in particular to an audio information processing method of handheld equipment.

Background

With the development of technology, more and more cases of recording or translating voice using handheld devices are required, and voice recognition technology is required. Under the condition that two persons talk, if the voice input by the current handheld device is what person in the conversation is said, the voice input by the current handheld device has great effect on the subsequent voice processing and subsequent processing, for example, in the use of a translator, if the person who inputs the voice at present is known, the language which he speaks can be judged according to the setting, recognition engines of different languages can be purposefully called, and more accurate recognition results can be obtained; for example, in the interview recording device, if the current recorded object is an interviewee or an interviewee, the results of speech recognition and transcription can be recorded in the records of the corresponding persons respectively, and if two persons have different local accents, the engines aiming at the different local accents can be called according to the setting, so that better recognition results can be obtained. Therefore, a method for distinguishing the current handheld device sound collection object is found, and the method has important practical value.

At present, some solutions are available to distinguish sound collection objects, one method is to add a hardware button or a software button on a handheld device, when recording, a specific key is set for a person, the sound of the person is recorded, the corresponding key is pressed, for example, a translator is used, the person presses the key to represent the person speaking, and the person presses the key to represent the person speaking; one method is to use software algorithm, design through artificial intelligence method, use voiceprint to distinguish which speaker is in the dialogue; one method is to automatically identify the language type spoken by the speaker, also by a software artificial intelligence algorithm.

The two methods are complex to operate, and a user must press a designated button each time of recording, so that the method is inconvenient to use; meanwhile, frequent operation also has the condition of misoperation, if the misoperation happens, the voice recognition engine can call the wrong recognition method, and the use accuracy is reduced.

The second artificial intelligence software algorithm is difficult to realize, has high technical threshold, consumes a large amount of computing power in computing, and has high misjudgment rate and is still immature.

Disclosure of Invention

Aiming at the defects in the prior art, the invention provides the handheld equipment audio information processing method which is simple and feasible, small in calculated amount, low in cost, high in accuracy and convenient to use.

The technical scheme adopted by the invention is as follows:

a method for processing audio information of a handheld device comprises the handheld device,

the handheld device acquires the gesture information of the handheld device through a gesture sensor, and determines the currently acquired voice as a corresponding acquisition object according to the gesture information of the handheld device;

and according to different collected objects, correspondingly processing the collected voice.

When the handheld device points to the user and points to the other side, the included angle between the pointing direction of the front surface of the handheld device and the horizontal plane is greatly changed; the attitude sensor is determined to be different collected objects through the change of the included angle between the pointing direction of the front face of the handheld device and the horizontal plane.

The handheld device is determined to be two postures through a posture sensor, and the two postures are respectively: the handheld device points towards each other and towards itself.

The attitude sensor is an electronic gyroscope or an electronic gyroscope chip.

Usually when a user uses a sound collection handheld device, he speaks, the handheld device points to himself, the other party speaks, and the handheld device points to the other party.

The angle between the front direction of the handheld device and the horizontal plane is 0-90 degrees, which is the forward direction, and 90-180 degrees, which is the reverse direction.

The angle between the front direction of the handheld device and the horizontal plane is 0-80 degrees, which is the forward direction, and 80-180 degrees, which is the reverse direction.

A handheld device audio information processing method comprises the following steps:

step 1, a central processing unit sets one face of a handheld device as a positive direction according to an included angle between an attitude sensor and a horizontal plane, and sets one direction along the front face of the handheld device as a reverse direction;

step 2, starting the handheld device;

step 3, judging whether the handheld device is in the forward direction or the reverse direction;

step 4, starting a corresponding audio processing engine according to the position of the handheld device;

step 5, the handheld device collects audio information;

step 6, the audio processing engine analyzes and processes the audio information;

step 7, storing the collected audio information and the analyzed and processed audio information in a set storage area;

step 8, judging whether the information collection of the handheld device is finished or not, and executing the step 3 if the information collection is not finished; the acquisition is finished and the step 9 is executed;

and 9, ending.

The angle may be expressed as a negative number or may be reversed with respect to the two objects defined above, depending on the definition of the front of the device.

The handheld device points to itself and points to the other side, the angle range of the front face of the device and the horizontal plane is quite different, and the current sound collection object is judged to be one of the two sides by defining the angle ranges of the front faces of different handheld devices and the horizontal plane.

The method for processing the audio information of the handheld device can realize the distinction of two dialogue roles in many occasions by combining with the prior convention, including but not limited to the use in interview recorders, translators and mobile phones.

The method for processing the audio information of the handheld device can distinguish the object for collecting the sound, further determine the information such as accent, language, name and the like of the object for collecting the sound according to the convention, and facilitate the subsequent speech recognition and transcription.

Compared with the prior art, the invention has the beneficial effects that:

the invention discloses a handheld device audio information processing method, which ingeniously utilizes the characteristic that the handheld device points differently during dialogue, obtains the self-gesture of the handheld device by using an electronic gyroscope commonly applied by the handheld electronic device at present, judges the current sound acquisition object by a corresponding algorithm, and has the advantages of simple and ingenious conception, no need of a large amount of computing power, low cost and high accuracy.

Drawings

FIG. 1 is an audio processing flow chart of a handheld device audio information processing method of the present invention;

FIG. 2 is a schematic diagram of the use status of the method for processing audio information of a handheld device according to the present invention;

FIG. 3 is a schematic diagram of the handheld device usage orientation of the handheld device audio information processing method of the present invention.

The main component symbols in the drawings illustrate:

Detailed Description

The invention is described in detail below with reference to the attached drawings and examples:

as can be seen from fig. 1-3, a method for processing audio information of a handheld device, comprising a handheld device,

The attitude sensor is an electronic gyroscope or an electronic gyroscope chip.

step 2, starting the handheld device;

step 5, the handheld device collects audio information;

and 9, ending.

Depending on the definition of the front of the device, the angle may be expressed as a negative number or may be opposite to the two objects defined above.

The present invention, in combination with prior conventions, allows for the differentiation of two-person conversational characters in many applications, including but not limited to use in interview recorders, translators and cell phones.

The invention can distinguish the object for collecting the sound, further determine the information of accent, language, name and the like of the object for collecting the sound according to the convention, and is convenient for the subsequent voice recognition and transcription.

The method has the advantages of simplicity, easiness, small calculated amount, high accuracy and convenience in use; under the condition of not influencing the behavior habit of the user, the current sound collection object can be accurately judged to be one of the two parties without very accurately measuring the gesture of the handheld device. Similarly, if the method is used for a translator, after the languages of the two selected parties are selected, the current gesture of the translator can also be used for judging the speaking object to be the person, so that the language of the speaking object is known, and the corresponding voice recognition engine can be conveniently called in the later voice recognition.

At present, many handheld devices are provided with electronic gyroscope chips, most of common mobile phones are provided with electronic gyroscope chips, and the included angle between the front surface of the handheld device and the horizontal plane can be conveniently obtained. For example, using an android mobile phone, the angle between the front surface of the mobile phone and the horizontal plane can be calculated and obtained by calling a sensor manager.

The invention skillfully utilizes the habit of holding the sound collection equipment by the human and utilizes the electronic gyroscope chip, thereby simply, efficiently and cheaply completing the problem of distinguishing the speakers between two people, which is not easy to solve by an artificial intelligent algorithm.

The invention is used for interview recording pens, translators and other devices, can greatly improve the use experience of users and increase the added value of products.

Compared with the prior art, the invention has the beneficial effects that:

The above description is only of the preferred embodiment of the present invention, and is not intended to limit the structure of the present invention in any way. Any simple modification, equivalent variation and modification of the above embodiments according to the technical substance of the present invention fall within the technical scope of the present invention.

Claims

1. A method for processing audio information of a handheld device comprises the handheld device and is characterized in that,

according to different collected objects, correspondingly processing collected voices;

the attitude sensor determines different collected objects through the change of the included angle between the pointing direction of the front surface of the handheld device and the horizontal plane;

the handheld device is determined to be two postures through a posture sensor, and the two postures are respectively: the handheld device points to the other party and points to the person;

the method comprises the following steps:

step 2, starting the handheld device;

step 5, the handheld device collects audio information;

and 9, ending.

2. The method for processing audio information of a handheld device according to claim 1, wherein:

the attitude sensor is an electronic gyroscope or an electronic gyroscope chip.

3. The method for processing audio information of a handheld device according to claim 1, wherein:

4. The method for processing audio information of a handheld device according to claim 1, wherein: