CN108280067B - Earphone translation method and system - Google Patents

Earphone translation method and system Download PDF

Info

Publication number
CN108280067B
CN108280067B CN201810162974.8A CN201810162974A CN108280067B CN 108280067 B CN108280067 B CN 108280067B CN 201810162974 A CN201810162974 A CN 201810162974A CN 108280067 B CN108280067 B CN 108280067B
Authority
CN
China
Prior art keywords
audio
earphone
translation
storage box
headset
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810162974.8A
Other languages
Chinese (zh)
Other versions
CN108280067A (en
Inventor
李建明
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Boomtech Industrial Co ltd
Original Assignee
Shenzhen Boomtech Industrial Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Boomtech Industrial Co ltd filed Critical Shenzhen Boomtech Industrial Co ltd
Priority to CN201810162974.8A priority Critical patent/CN108280067B/en
Publication of CN108280067A publication Critical patent/CN108280067A/en
Application granted granted Critical
Publication of CN108280067B publication Critical patent/CN108280067B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/42Data-driven translation
    • G06F40/45Example-based machine translation; Alignment
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1033Cables or cables storage, e.g. cable reels
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1083Reduction of ambient noise
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/10Earpieces; Attachments therefor ; Earphones; Monophonic headphones
    • H04R1/1091Details not provided for in groups H04R1/1008 - H04R1/1083
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02163Only one microphone
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2420/00Details of connection covered by H04R, not provided for in its groups
    • H04R2420/07Applications of wireless loudspeakers or wireless microphones

Abstract

The embodiment of the invention discloses an earphone translation method, which is characterized by being applied to an earphone system comprising a first earphone, a second earphone and an earphone storage box, wherein the translation method comprises the following steps: the first earphone acquires a first audio, adds a first identifier to the first audio, and sends the first audio with the first identifier to the earphone storage box; the earphone storage box acquires a second audio obtained by translating the first audio according to the first audio, a first character corresponding to the first audio after voice recognition and a second character corresponding to the second audio after voice recognition, and displays the first character, the second character and the first identification; and sending the second audio to a second earphone for playing. By adopting the method and the device, the translation efficiency among different languages can be improved.

Description

Earphone translation method and system
Technical Field
The invention relates to the field of wireless communication, in particular to an earphone translation method and an earphone translation system.
Background
Today, with rapid development of science and technology, people are increasingly open in communication, and in the process of tourism, entertainment and academic communication, not only information is more and more globalized, but also languages are more and more diversified, so that language communication barriers are common. How to conveniently and quickly provide translation contents for two parties who have communication with different languages and realize barrier-free communication between the two parties is a problem worthy of thinking.
In the existing translation method, a large amount of human resources are consumed for manually obtaining translation results, the obtaining is difficult, time and labor are wasted, and how to quickly and efficiently obtain the translation results is a problem to be solved urgently.
Disclosure of Invention
In view of this, embodiments of the present invention provide an earphone translation method and system, which enable a user to quickly obtain required translated voice or text information, facilitate communication among users of different languages, and improve translation efficiency.
In order to solve the above technical problem, a first aspect of an embodiment of the present invention provides an earphone translation system, including a first earphone, a second earphone, and an earphone storage box, where:
the first earphone comprises a first wireless communication module, a first audio acquisition module and a first identifier, the first wireless communication module is used for establishing wireless connection with the earphone storage box, and the first audio acquisition module is used for acquiring first audio and adding the first identifier to the first audio;
the second earphone comprises a second wireless communication module and a second identifier, the second wireless communication module is used for establishing wireless connection with the earphone storage box, and the second identifier is used for marking the second earphone;
the earphone storage box comprises a first earphone seat, a second earphone seat, a box body wireless communication module, a translation module, a character acquisition module and a display screen, wherein the first earphone seat is used for placing a first earphone, the second earphone seat is used for placing a second earphone, and the box body wireless communication module is used for establishing wireless connection with the first earphone and/or the second earphone and identifying a first identification of the first earphone and a second identification of the second earphone;
the translation module is used for acquiring a second audio obtained by translating the first audio according to the first audio; the character acquisition module is used for acquiring first characters corresponding to the first audio according to the first audio and acquiring second characters corresponding to the second audio according to the second audio; the display screen is used for correspondingly displaying the first characters and the first identification and displaying the second characters;
the box body wireless communication module is used for sending the second audio to a second earphone for playing according to the second identification.
Optionally, the first earphone is used for establishing wireless connection with the terminal and acquiring a first audio from the terminal; or:
the first earpiece includes a primary microphone, a secondary microphone, and a first processor, wherein:
the primary microphone is used for picking up primary voice close to a target sound source;
the first processor is used for detecting whether a noise reduction mode is started or not;
the auxiliary microphone is used for picking up auxiliary voice far away from a target sound source;
the first processor is further configured to filter noise in the main speech according to the auxiliary speech to obtain a first audio.
Optionally, the translation module is connected with the box body wireless communication module and used for receiving a second audio obtained by translating the first audio from the cloud server; or:
the translation module comprises a local translation engine and a corpus, and is used for translating the first audio to obtain a second audio obtained by translating the first audio.
Optionally, the display screen is configured to receive a modification instruction of a user for the first text, to obtain a modified first text, and the translation module is configured to obtain, according to the modified first text and the first audio, a second audio obtained by translating the first audio.
Optionally, the earphone storage box includes a memory and a search module, where the memory is used to correspondingly store the first audio and a second audio obtained by translating the first audio;
the searching module is used for searching and acquiring the first audio and the second audio corresponding to the first audio which are pre-stored in the memory when the first audio is received.
Optionally, the translation module is further configured to set a translation mode, where the translation mode includes a first language type corresponding to the first earphone and a second language type corresponding to the second earphone.
Optionally, the first headphone is further configured to play the second audio.
The second aspect of the embodiment of the present invention provides an earphone translation method, which is applied to an earphone system including a first earphone, a second earphone and an earphone storage box, and the translation method includes:
when the first earphone and the second earphone are taken out of the earphone containing box, the earphone containing box is triggered to establish wireless connection with the first earphone and the second earphone, the earphone containing box identifies the first earphone and the second earphone, and a first identification of the first earphone and a second identification of the second earphone are obtained;
the first earphone acquires a first audio, adds a first identifier to the first audio, and sends the first audio with the first identifier to the earphone storage box;
the earphone storage box acquires a second audio obtained by translating the first audio according to the first audio, a first character corresponding to the first audio after voice recognition and a second character corresponding to the second audio after voice recognition, and displays the first character, the second character and the first identification; and sending the second audio to a second earphone for playing.
Optionally, the acquiring, by the first headset, the first audio includes:
acquiring a first audio from a terminal which establishes wireless connection with the first earphone; or
Picking up a main voice close to a target sound source;
whether a noise reduction mode is started is detected, if so, the method comprises the following steps:
picking up auxiliary voices far away from a target sound source;
and filtering the noise in the main voice according to the auxiliary voice to obtain a first audio.
Optionally, before the detecting whether to turn on the noise reduction mode, the method includes:
the first earphone receives noise reduction operation and starts a noise reduction mode; or:
the earphone storage box receives noise reduction operation, generates a noise reduction instruction and sends the noise reduction instruction to the first earphone, and the first earphone starts a noise reduction mode.
Optionally, the acquiring, by the earphone storage box according to the first audio, a second audio obtained by translating the first audio includes:
sending the first audio to a cloud server, and receiving a second audio obtained by translating the first audio from the cloud server; or:
and translating the first audio according to a translation engine and a corpus local to the earphone storage box to obtain a second audio.
Optionally, the translating the first audio according to a translation engine and a corpus local to the earphone storage box to obtain a second audio further includes:
when first audio is received, confirming that the earphone storage box stores the first audio in advance;
and acquiring a second audio which is stored corresponding to the first audio.
Optionally, the translating the first audio according to a translation engine and a corpus local to the earphone storage box further includes, before obtaining the second audio:
receiving a modification instruction of the first character, obtaining the modified first character, and obtaining a second audio translated by the first audio according to the modified first character and the first audio;
and correspondingly storing the first audio and the second audio in an earphone storage box.
The embodiment of the invention has the following beneficial effects:
in the earphone translation system containing first earphone, in the earphone translation system of second earphone and earphone receiver, when first earphone and second earphone were taken out the receiver, first earphone and second earphone all establish wireless connection with the earphone receiver, then one or two receive simultaneously and correspond earphone user's audio frequency, and the earphone of this receipt audio frequency carries out the sign to this audio frequency, audio transmission after the sign to the earphone receiver, obtain this audio frequency and correspond the translation result, send the translation result back to the earphone of another non-receipt audio frequency and play, at this in-process, two earphones have the same authority when communicating with the earphone receiver, make the earphone receiver can receive the audio frequency that two earphones sent simultaneously, both sides user's conversation efficiency has been promoted, in addition the earphone carries out the sign to the received audio frequency, can guarantee that the earphone receiver obtains correct translation result according to the audio frequency that receives, and send correct earphone to and play, thereby avoid the cross talk and signal interference problem. Secondly, the earphone receiver receives the user and corresponds the modification of characters to the audio frequency to reacquire the translation result according to characters after the modification and former audio frequency, promoted the rate of accuracy of translation, finally, the earphone receiver corresponds the storage with audio frequency and its translation result, when making things convenient for the earphone receiver to receive this audio frequency once more, directly transfers the translation result, has improved the intelligence and the translation efficiency of translation earphone system.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a block diagram of a headset translation system according to an embodiment of the present invention;
fig. 2 is a schematic structural diagram of an earphone translation system according to an embodiment of the present invention: fig. 2 (a) is a schematic view of a second earphone placed in an earphone storage box, and fig. 2 (B) is a schematic view of a first earphone structure;
fig. 3 is a schematic flow chart of a method for interpreting an earphone according to an embodiment of the present invention;
fig. 4 is a schematic flow chart of a method for reducing noise in a translation process according to an embodiment of the present invention;
FIG. 5 is a flowchart illustration of a method for improving translation quality according to an embodiment of the present invention;
fig. 6 is a flowchart illustrating a specific earphone translation method according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be obtained by a person skilled in the art without making any creative effort based on the embodiments in the present invention, belong to the protection scope of the present invention.
It will be understood that the terms "comprises" and/or "comprising," when used in this specification and the appended claims, specify the presence of stated features, integers, steps, operations, elements, and/or components, but do not preclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or groups thereof.
The following describes embodiments of the present application in detail.
Referring to fig. 1, fig. 1 is a block diagram of a headset translating system according to an embodiment of the present invention, as shown in fig. 1, the headset translating system includes a first headset 11, a second headset 12, and a headset receiving box 13, where the first headset 11 includes a first wireless communication module 110, a first audio obtaining module 112, and a first identifier 1101, the first wireless communication module 110 is configured to establish a wireless connection with the headset receiving box 13, and the first audio obtaining module 112 is configured to obtain a first audio and add the first identifier 1101 to the first audio;
the second earphone 12 comprises a second wireless communication module 120 and a second mark 1201, the second wireless communication module 120 is used for establishing wireless connection with the earphone storage box 13, and the second mark 1201 is used for marking the second earphone 12;
the earphone storage box 13 comprises a first earphone seat 130, a second earphone seat 131, a box body wireless communication module 132, a translation module 133, a character acquisition module 135 and a display screen 136, wherein the first earphone seat 130 is used for placing a first earphone 11, the second earphone seat 131 is used for placing a second earphone 12, and the box body wireless communication module 132 is used for establishing wireless connection with the first earphone 11 and the second earphone 12 and identifying a first identification 1101 of the first earphone 11 and a second identification 1201 of the second earphone 12;
the translation module 133 is configured to obtain, according to the first audio, a second audio obtained by translating the first audio; the text acquiring module 135 is configured to acquire a first text corresponding to a first audio according to the first audio, and acquire a second text corresponding to a second audio according to the second audio; the display screen 136 is configured to correspondingly display the first text and the first identifier, and display the second text;
the box body wireless communication module 132 is configured to send the second audio to the second earphone 12 for playing according to the second identifier 1201.
The wireless connection method between the first wireless communication module 110 and the second wireless communication module 120 and the cartridge wireless communication module 132 may include bluetooth, wifi, zigbee, NB-IOT, etc., the first identifier 1101 is included in the first wireless communication module 110, and may be a communication name or a communication model of the first wireless communication module 110, and similarly, the second identifier 1201 may also be a communication name or a communication model of the second wireless communication module 120.
In addition, the display screen 136 may include a cathode ray tube display, a plasma display, a liquid crystal display, etc. according to the different materials, and may be a curved screen, a flexible screen, a full-screen or a 3D display screen, etc. according to the different structural forms.
In one possible example, the first earpiece 11 is used to establish a wireless connection with the terminal 15 and to obtain first audio from the terminal; or: the first earphone 11 includes a primary microphone 1121, a secondary microphone 1122, and a first processor 114, wherein the primary microphone 1121 is used to pick up primary voice close to a target sound source; the first processor 114 is configured to detect whether the noise reduction mode is turned on; secondary microphone 1122 is used to pick up secondary speech that is far from the target sound source; the first processor 114 is further configured to filter noise in the main speech according to the auxiliary speech to obtain a first audio.
It can be seen that in this example, by setting the main microphone and the auxiliary microphone for the earphone, and performing noise reduction processing on the voice according to the difference between the voices picked up by the main microphone and the auxiliary microphone, high-quality audio needing translation is obtained, and the subsequent translation quality is indirectly improved.
In one possible example, the translation module 133 is connected to the box wireless communication module 132, and is configured to receive the second audio obtained by translating the first audio from the cloud server 14; or:
the translation module 133 includes a local translation engine 1331 and a corpus 1332, and is configured to translate the first audio to obtain a second audio obtained by translating the first audio.
In a possible example, the display 136 is configured to receive a modification instruction of the first text from a user, obtain a modified first text, and the translation module 133 is configured to obtain a second audio translated by the first audio according to the modified first text and the first audio.
It can be seen that in this example, the translated second audio of the new first audio is obtained through the modified first text and the modified first audio, so that the accuracy of translation is ensured.
In one possible example, the headphone storage box 13 includes a memory 137 and a search module 138, where the memory 137 is used for correspondingly storing the first audio and the second audio obtained by translating the first audio; the searching module is used for searching and acquiring the first audio pre-stored in the memory and the second audio corresponding to the first audio when the first audio is received.
It is thus clear that in this example, through carrying out corresponding storage with translation result and former audio frequency, make follow-up when the earphone receiver receives the audio frequency the same with former audio frequency again, can directly call the translation result from the earphone receiver, promoted translation efficiency and degree of accuracy.
In one possible example, the translation module 133 is further configured to set a translation mode, which includes a language type one corresponding to the first earpiece and a language type two corresponding to the second earpiece.
In one possible example, the first headphone is also for playing the second audio.
It should be noted that, the function of the second earphone is completely the same as that of the first earphone, and the second earphone may also be wirelessly connected to the earphone storage box, and the picked-up audio is sent to the earphone storage box and the corresponding translation result is obtained, which is not described herein again.
In a specific example, as shown in fig. 2, fig. 2 is a schematic structural diagram of a headset translation system according to an embodiment of the present invention: fig. 2 (a) is a schematic view of a second earphone placed in an earphone storage box, and fig. 2 (B) is a schematic view of a first earphone structure. As shown in the figure, the earphone translation system is composed of an earphone storage box 200, a first earphone 201 and a second earphone 202, the earphone storage box 200 is composed of a box of a first storage box part 205 and a box drawer of a second storage box part 206, and the box drawer 206 can be put into a space formed by the box 205 and also can be pulled out. The box drawer 206 further comprises a first earphone seat 203 and a second earphone seat 204, wherein the first earphone seat 203 is used for placing a first earphone 201, the second earphone seat 204 is used for placing a second earphone 202, the second earphone seat comprises a first contact point 211, the second earphone 202 comprises a second contact point 212, when the second earphone is placed on the second earphone seat, the first contact point 211 contacts the second contact point 212, a battery in the second earphone stops supplying power to each function module of the first earphone, and the earphone storage box can charge the battery of the second earphone.
In addition, after the second earphone is taken out, the main microphone 210, the auxiliary microphone 209 and the second loudspeaker 213 are arranged on the second earphone 202, and the display screen 207 and the function keys 208 are arranged on the earphone containing box 200. The headset translation system includes the functional modules corresponding to fig. 1, and can implement the headset translation process in the embodiment corresponding to fig. 1.
It can be seen that, in the embodiment of the present application, in the earphone translation system including the first earphone, the second earphone and the earphone storage box, when the first earphone and the second earphone are taken out of the storage box, the first earphone and the second earphone both establish wireless connection with the earphone storage box, then one or two earphones receive the audio of the corresponding earphone user at the same time, and the earphone receiving the audio identifies the audio, send the identified audio to the earphone storage box, obtain the translation result corresponding to the audio, and send the translation result back to the other earphone not receiving the audio for playing. Secondly, the earphone receiver receives the user and corresponds the modification of characters to the audio frequency to reacquire the translation result according to characters after the modification and former audio frequency, promoted the rate of accuracy of translation, at last, the earphone receiver corresponds the storage with audio frequency and its translation result, when making things convenient for the earphone receiver to receive this audio frequency once more, directly transfers the translation result, has improved translation earphone system's intelligence and translation efficiency.
Referring to fig. 3, fig. 3 is a schematic flow chart of an earphone translation method according to an embodiment of the present invention, and as shown in fig. 3, the earphone translation method includes:
s301, the first earphone and the earphone containing box are in wireless connection according to the trigger instruction, and the earphone containing box identifies a first identification of the first earphone.
Specifically, before first earphone and earphone receiver received trigger command, both were in shutdown state or communication network disconnection state, and first earphone is placed in the earphone receiver, and the earphone receiver can be for first earphone charging. When the first earphone or the earphone storage box receives a trigger instruction, the first earphone or the earphone storage box establishes wireless connection, the earphone storage box identifies a first identifier of the first earphone, wherein the trigger instruction can be that the first earphone takes out the earphone storage box, or the first earphone receives a key operation or a touch operation to trigger the first earphone to initiate a wireless connection request to the earphone storage box, and correspondingly, the earphone storage box can also receive corresponding operations and initiate a wireless connection request to the first earphone.
Optionally, the connection modes of the first earphone and the second earphone with the earphone storage box include wireless connection modes such as bluetooth, wi f I, z I gbee or NB-I OT, taking bluetooth connection as an example, the first identifier of the first earphone may be a bluetooth name or a bluetooth model, and when the earphone storage box establishes bluetooth connection with the first earphone, the first identifier is identified and obtained.
S302, the second earphone and the earphone containing box are in wireless connection according to the triggering instruction, and the earphone containing box identifies a second identification of the second earphone.
Specifically, the connection mode and the trigger mode of the second earphone and the earphone storage box are the same as those of the first earphone, and the process of identifying the second earphone identifier by the earphone storage box is also the same as that in S301, which is not repeated herein. In addition, when the first earphone and the second earphone are connected with the earphone storage box, clear precedence relation does not exist, and the specific connection sequence is determined according to specific operation conditions.
S303, the first earphone acquires a first audio, a first identifier is added to the first audio, and the first audio with the first identifier is sent to the earphone containing box.
In an alternative embodiment, the second headphone acquires the first audio, i.e. the audio to be translated, and the method includes two methods: firstly, a microphone is arranged on a second earphone, and the microphone picks up audio to be translated; secondly, the earphone is connected with the terminal, and receives the audio data sent by the terminal to obtain the audio to be translated. The first method is generally used to pick up the sound uttered by the user, and the application scenarios include talking with different language objects, bilingual learning, etc., and the second method is generally used to obtain a piece of existing audio that needs to be translated, and the application scenarios include watching movies, dramas, or bilingual cross learning. Optionally, noise is likely to exist in the audio picked up by the microphone, so that noise reduction processing needs to be performed on the picked-up audio, where the noise reduction method includes setting a master microphone and a slave microphone, and the master voice picked up by the master microphone is subjected to noise reduction processing according to the slave voice picked up by the slave microphone, and the specific method is shown in the embodiment corresponding to fig. 4.
After the first audio needing to be translated is obtained, the user operates the second earphone to enable the second earphone to send the first audio to the earphone containing box, or the second earphone automatically sends the first audio to the earphone containing box according to the receiving time interval.
S304, the earphone containing box acquires second audio obtained by translating the first audio according to the first audio, first characters corresponding to the first audio after voice recognition and second characters corresponding to the second audio after voice recognition, and displays the first characters, the second characters and the first identification.
In an alternative embodiment, the earphone containing box obtains the first audio according to the first audio, and the translated second audio includes two meanings: firstly, the earphone storage box comprises a local translation engine and a semantic library, and can be used for directly translating received first audio locally to obtain translated second audio; and secondly, the earphone storage box is connected with the cloud server, after the earphone storage box receives the first audio needing to be translated, the first audio is sent to the cloud server for translation, and then the second audio obtained by translating the first audio is obtained from the cloud server. The first method can enable the earphone translation system to obtain translation results quickly, translation efficiency is improved, the second method can reduce storage and load pressure of an earphone storage box, and design complexity of earphone storage is reduced.
In an alternative embodiment, after the first audio is sent to the earphone storage box, the language type of translation needs to be set, for example, the language type of the first earphone is set to "chinese", the language type of the second earphone is set to "english", which indicates that the audio received from the first earphone needs to be recognized as chinese, the audio received from the second earphone needs to be recognized as english, and correspondingly, the chinese received from the first earphone needs to be translated into english, and the audio received from the second earphone needs to be translated into english.
Correspondingly, chinese-French, chinese-Japanese, english-Japanese and the like can be set to represent the inter-translation of the corresponding language types. This process can improve the efficiency of recognition of the first audio and obtain a target language type that meets the demand.
After the earphone receiver receives the first audio frequency, the first characters after the voice recognition can be obtained according to the first audio frequency, and after the second audio frequency obtained after the first audio frequency is translated is obtained, the second characters after the voice recognition can be obtained according to the second audio frequency. The display screen of earphone receiver shows first characters and second characters. In a specific embodiment, for example, if the earphone storage box receives the first audio from the first earphone, the content of the first audio is identified as "hello, ask you for us is american" according to the "chinese" language type? And the earphone receiver recognizes that the first identification is "H1", then the display screen correspondingly displays the first character and the first identification as "H1: do you ask you for you to be a american? ", the translated second word is English, and is displayed as" Hello! Are you from America? ".
In addition, the user can modify the first characters, so that the recognition result of the first audio is more accurate, and the translation result is obtained again according to the modified first characters and the modified first audio, so that the translation accuracy is improved. This process will be described in detail in the corresponding embodiment of fig. 5.
S305, the earphone storage box sends the second audio to a second earphone for playing.
Specifically, the earphone receiver sends the second audio to the second earphone, and the second earphone plays the second audio, so that the user of the second earphone can listen to the translation result of the audio sent by the first earphone user, and communication between users with two different language types is realized.
It should be noted that the functional structure of the second earphone is the same as that of the first earphone, and the earphone translation method described in S301 to S305 can also be implemented, and the specific process is not described again.
In the embodiment of the application, in the earphone translation system including the first earphone, the second earphone and the earphone storage box, wireless connection is established between the first earphone and the second earphone and the earphone storage box, then one or two earphones receive the audio corresponding to the earphone user at the same time, and the earphone receiving the audio identifies the audio, the audio after identification is sent to the earphone storage box, the translation result corresponding to the audio is obtained, and then the translation result is sent back to the other earphone not receiving the audio to be played, in the process, the two earphones have the same authority when communicating with the earphone storage box, so that the earphone storage box can receive the audio sent by the two earphones at the same time, the conversation efficiency of the two earphones is improved, in addition, the earphones identify the received audio, the earphone storage box can be ensured to obtain the correct translation result according to the received audio, and send the correct earphones to be played, the problems of channel crosstalk and signal interference are avoided, and the translation accuracy of the translation system is improved.
Referring to fig. 4, fig. 4 is a schematic flow chart of a translation process denoising method according to an embodiment of the present invention, and as shown in fig. 4, the translation process denoising method includes:
s401, main voice close to a target sound source is picked up.
In an alternative embodiment, two microphones are provided for the first earpiece: the microphone system comprises a main microphone and an auxiliary microphone, wherein the main microphone is used for picking up main voice. The sound picked up by the microphone of the earphone comes from the speaking sound of the user, so the target sound source is the mouth of the user, and the sound close to the target sound source is the mouth close to the user. If the side where the speaker is located is the front side of the earphone and the side opposite to the speaker is the back side of the earphone, when the user wears the earphone, the front side of the earphone is closer to the mouth of the user and the back side of the earphone is farther from the mouth of the user, so that the main microphone can be arranged on the front side of the earphone, the auxiliary microphone is arranged on the back side of the earphone, and the sound picked up by the main microphone is the main voice.
S402, detecting whether the noise reduction mode is started or not.
Specifically, the noise reduction mode is turned on, that is, the received voice needs to be subjected to noise reduction processing by default.
In an alternative embodiment, the method for turning on the noise reduction mode of the first headphone includes two methods: the method comprises the steps that firstly, a first earphone directly receives noise reduction operation of a user and starts a noise reduction mode according to the noise reduction operation, wherein the noise reduction operation comprises key operation, touch operation or knocking operation and the like; secondly, the earphone receiver receives the operation of making an uproar that falls of user, then will fall the instruction of making an uproar and send for first earphone through wireless connection, and the mode of making an uproar falls in first earphone opening according to the instruction of making an uproar that receives, and wherein the operation of making an uproar falls the earphone receiver the same with first earphone.
And S403, picking up auxiliary voice far away from the target sound source.
In an alternative embodiment, if the first earphone does not receive the noise reduction instruction, the first earphone only needs to pick up the main voice, and if the first earphone receives the noise reduction instruction, the auxiliary earphone also needs to pick up the auxiliary voice.
S404, filtering noise in the main voice according to the auxiliary voice to obtain a first audio.
In an optional embodiment, the content of the main speech includes effective sound and environmental sound emitted by the user, the content of the auxiliary speech also includes effective sound and environmental sound emitted by the user, the main speech and the auxiliary speech are different in that the effective sound in the main speech has a larger volume than the effective sound in the auxiliary speech, the content is more and clearer, the main speech is filtered according to the content of the auxiliary speech, the environmental noise in the main speech can be filtered, and the remaining sound with excellent tone quality is the first audio meeting the requirement.
Optionally, according to the contents of S401 to S404, another feasible scheme exists, that is, the main microphone is disposed on the second earphone, and the auxiliary microphone is disposed on the earphone storage box, and the subsequent process is not changed, so that the ambient noise in the main voice can be effectively filtered out, and the first audio is obtained.
In the embodiment of the invention, the main microphone and the auxiliary microphone are arranged on the earphone to pick up the corresponding main voice and the auxiliary voice, and then the noise reduction processing is carried out on the main voice according to the content of the auxiliary voice, so that the environmental noise in the picked-up audio can be effectively filtered, and the audio quality needing to be translated can be improved.
Referring to fig. 5, fig. 5 is a schematic view of a flowchart of a method for improving translation quality according to an embodiment of the present invention, where as shown in the figure, the method for improving translation quality includes:
s501, the earphone storage box receives first audio.
S502, acquiring and displaying first characters corresponding to the first audio.
In an alternative embodiment, the first audio needs to be subjected to speech recognition, and a first text corresponding to the first audio is obtained. In this process, the characters acquisition module of earphone receiver can acquire first characters according to the built-in speech recognition module of earphone receiver, also can send first audio frequency to the high in the clouds server and carry out speech recognition and obtain first characters.
S503, searching whether the earphone storage box stores the first audio in advance.
In an optional embodiment, the storage module is included in the earphone storage box, and can correspondingly store the received audio needing to be translated and the translation result thereof, so that the stored content can be directly called when the same voice is received in the subsequent process. Optionally, before storing the audio to be translated and the translation result thereof, a storage condition may be further set, for example, when the same audio is received N (N > 0) times or more, it is described that the audio to be translated is a common term, and the audio to be translated may be stored. And the audio which is received for less than N times and needs to be translated is not stored, so that the consumption of the storage space of the earphone storage box can be reduced. If the first audio is found to be stored in the earphone storage box in advance, S508 is executed, otherwise, S504 is executed.
S504, the first audio and the first characters are sent to a cloud server, and a second audio obtained after the first audio is translated is obtained.
In an optional embodiment, according to S503, the first audio is not stored in the earphone storage box, so that the first audio and the corresponding first text thereof need to be sent to a cloud server that establishes a wireless connection with the earphone storage box, and a second audio obtained by translating the first audio is obtained. A language type, such as "chinese-english," may also be set before sending the first audio to the cloud server, indicating that chinese of the first audio is translated into english of the second audio. After the second audio is obtained, S505 is performed.
And S505, acquiring second characters corresponding to a second audio and displaying the second characters.
S506, correspondingly storing the first audio and the second audio.
In an optional embodiment, assuming that the first audio already satisfies the storage condition of the earphone storage box, the earphone storage box correspondingly stores the first audio and the translated second audio, so that when the earphone storage box receives the first audio with the same content next time, the earphone storage box executes a corresponding instruction.
And S507, receiving a modification instruction of the first character to obtain the modified first character.
In an optional embodiment, the first audio corresponding to the first text is the user's speaking voice picked up by the first earphone, and in the process of identifying the first audio, the identification result may be inaccurate due to problems such as noise, pronunciation, or mismatch of corresponding language types, and the user corrects the first text according to the display result on the earphone storage box to obtain the corrected first text. Because the recognition result of the first audio is modified and the translation result is different, S504-S506 needs to be repeated, the first audio and the modified first text are sent to the cloud server again for translation, the translation result of the previous time is updated, the second text corresponding to the second audio and the second audio obtained by translating the new first audio is obtained, the first audio and the new second audio are correspondingly stored, and the related content of the previously stored first audio is replaced.
And S508, acquiring a second audio which is stored correspondingly to the first audio.
In an optional embodiment, according to S503, it can be known that the first audio is found to be stored in the earphone storage box, and then the translation content of the first audio can be obtained only by obtaining the second audio stored corresponding to the first audio, so as to complete the translation process.
And S509, acquiring second characters corresponding to the second audio and displaying the second characters.
The process is the same as S505 and will not be described herein.
In the embodiment of the invention, the earphone storage box receives the pre-translation audio, performs voice recognition on the pre-translation audio to obtain pre-translation characters, and searches whether the earphone storage box stores the pre-translation audio in advance, if so, directly obtains a translation result corresponding to the pre-translation audio from the earphone storage box; if not, the pre-translation audio and the corresponding pre-translation characters are sent to the cloud server together to obtain a translation result, and then the characters corresponding to the translation result are obtained according to the translation result and displayed. And if the correction of the user on the pre-translation characters is received, re-acquiring the translation result from the cloud server according to the corrected pre-translation characters and the corrected pre-translation voice. In the process, the translation efficiency is improved by storing the translation result through the earphone storage box, and the translation accuracy of the earphone translation system is improved by receiving the modification of the characters before translation.
Referring to fig. 6, a flowchart of a specific earphone translation method according to an embodiment of the present invention is shown, and as shown in fig. 6, the earphone translation method includes:
s601, establishing wireless connection between the first earphone and the earphone storage box according to the trigger instruction;
s602, establishing wireless connection between the second earphone and the earphone storage box according to the trigger instruction;
s603, the first earphone receives a first audio needing to be translated from the terminal, and adds a first identifier to the first audio;
s604, the first earphone sends the first audio to an earphone storage box, the earphone storage box obtains a translation mode, and the first audio is translated according to the translation mode;
s605, the earphone storage box obtains a second audio frequency obtained by translating the first audio frequency from the local, and a first character corresponding to the first audio frequency and a second character corresponding to the second audio frequency, and displays the first character and the second character;
s606, the second earphone acquires a second audio from the earphone storage box and plays the second audio;
s607, the second earphone picks up the audio, and carries out noise reduction processing on the audio to obtain a third audio;
s608, the second earphone sends the third audio to the earphone storage box;
s609, the earphone storage box obtains third characters according to the third audio and displays the third characters;
s610, the earphone storage box sends the third audio and the third characters to a cloud server for translation, a fourth audio obtained after the third audio is translated and fourth characters corresponding to the fourth audio are obtained, and the fourth characters are displayed;
s611, the earphone storage box receives the modification of the third characters, the modified third characters and the modified third audio are sent to the cloud server, a new translated fourth audio of the third audio and a fourth character corresponding to the fourth audio are obtained, and the fourth characters are displayed;
s612, the earphone containing box correspondingly stores the third audio and the fourth audio and sends the fourth audio to the first earphone for playing;
and S613, the earphone storage box sends the fourth audio to the second earphone for playing.
In the embodiment of the application, the process that in the earphone translation system comprising the first earphone, the second earphone and the earphone containing box, the two earphones transmit audio to the earphone containing box and obtain the translation result is completely expressed. In the process, the two earphones have the same authority when communicating with the earphone storage box, so that the earphone storage box can simultaneously receive the audio transmitted by the two earphones, the conversation efficiency of users of both the earphones is improved, the noise reduction processing is performed on the audio by the translation earphone system, the audio quality is improved, and the translation efficiency and the translation accuracy are improved; in addition, the earphone marks the received audio, so that the earphone storage box can obtain a correct translation result according to the received audio and sends the translation result to the correct earphone for playing, and the problems of channel crossing and signal interference are avoided. Secondly, the earphone receiver receives the user and corresponds the modification of characters to the audio frequency to reacquire the translation result according to characters after the modification and former audio frequency, promoted the rate of accuracy of translation, at last, the earphone receiver corresponds the storage with audio frequency and its translation result, when making things convenient for the earphone receiver to receive this audio frequency once more, directly transfers the translation result, has improved translation earphone system's intelligence and translation efficiency.
The above disclosure is only for the purpose of illustrating the preferred embodiments of the present invention, and it is therefore to be understood that the invention is not limited by the scope of the appended claims.

Claims (10)

1. The utility model provides an earphone translation system, its characterized in that, earphone translation system includes first earphone, second earphone and earphone receiver, wherein:
the first earphone comprises a first wireless communication module, a first audio acquisition module and a first identifier, the first wireless communication module is used for establishing wireless connection with the earphone storage box, and the first audio acquisition module is used for acquiring first audio and adding the first identifier to the first audio;
the second earphone comprises a second wireless communication module and a second identifier, the second wireless communication module is used for establishing wireless connection with the earphone storage box, and the second identifier is used for marking the second earphone;
the earphone storage box comprises a first earphone seat, a second earphone seat, a box body wireless communication module, a translation module, a character acquisition module and a display screen, wherein the first earphone seat is used for placing a first earphone, the second earphone seat is used for placing a second earphone, and the box body wireless communication module is used for establishing wireless connection with the first earphone and/or the second earphone and identifying a first identification of the first earphone and a second identification of the second earphone;
the translation module is used for acquiring a second audio obtained by translating the first audio according to the first audio; the character acquisition module is used for acquiring first characters corresponding to the first audio according to the first audio and acquiring second characters corresponding to the second audio according to the second audio; the display screen is used for correspondingly displaying the first characters and the first identification and displaying the second characters;
the box body wireless communication module is used for sending the second audio to a second earphone for playing according to the second identification.
2. The headset translation system of claim 1, wherein the first headset is configured to establish a wireless connection with a terminal and obtain the first audio from the terminal; or:
the first earpiece includes a primary microphone, a secondary microphone, and a first processor, wherein:
the primary microphone is used for picking up primary voice close to a target sound source;
the first processor is used for detecting whether a noise reduction mode is started or not;
the auxiliary microphone is used for picking up auxiliary voice far away from a target sound source;
the first processor is further configured to filter noise in the main speech according to the auxiliary speech to obtain a first audio.
3. The headset translation system of claim 2, wherein the translation module is connected to the box wireless communication module and configured to receive a second audio translated from the first audio from the cloud server; or:
the translation module comprises a local translation engine and a corpus and is used for translating the first audio to obtain a second audio obtained by translating the first audio.
4. The headset translation system according to any one of claims 1 to 3, wherein the display screen is configured to receive a modification instruction of the first text from a user to obtain a modified first text, and the translation module is configured to obtain a second audio translated from the first audio according to the modified first text and the first audio.
5. The earphone translation system according to claim 4, wherein the earphone storage box comprises a memory and a search module, the memory is used for correspondingly storing the first audio and a second audio obtained by translating the first audio;
the searching module is used for searching and acquiring the first audio and the second audio corresponding to the first audio which are pre-stored in the memory when the first audio is received.
6. The headset translation system of claim 5, wherein the translation module is further configured to set a translation mode, the translation mode comprising a first language type corresponding to the first headset and a second language type corresponding to the second headset.
7. The headset translation system of claim 5 or 6 wherein the first headset is further configured to play the second audio.
8. A headset translation method is applied to a headset system comprising a first headset, a second headset and a headset storage box, and comprises the following steps:
when the first earphone and the second earphone are taken out of the earphone containing box, the earphone containing box is triggered to establish wireless connection with the first earphone and the second earphone, the earphone containing box identifies the first earphone and the second earphone, and a first identification of the first earphone and a second identification of the second earphone are obtained;
the first earphone acquires a first audio, adds a first identifier to the first audio, and sends the first audio with the first identifier to the earphone storage box;
the earphone storage box acquires a second audio obtained by translating the first audio according to the first audio, a first character corresponding to the first audio after voice recognition and a second character corresponding to the second audio after voice recognition, and displays the first character, the second character and the first identification; and sending the second audio to a second earphone for playing.
9. The method of claim 8, wherein the headset receiver obtaining translated second audio of the first audio from the first audio comprises:
sending the first audio to a cloud server, and receiving a second audio obtained by translating the first audio from the cloud server; or:
and translating the first audio according to a translation engine and a corpus local to the earphone storage box to obtain a second audio.
10. The method of claim 9, wherein translating the first audio according to a translation engine and corpus local to a headset receiver to obtain a second audio further comprises:
when first audio is received, confirming that the earphone storage box stores the first audio in advance;
and acquiring a second audio which is stored corresponding to the first audio.
CN201810162974.8A 2018-02-26 2018-02-26 Earphone translation method and system Active CN108280067B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810162974.8A CN108280067B (en) 2018-02-26 2018-02-26 Earphone translation method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810162974.8A CN108280067B (en) 2018-02-26 2018-02-26 Earphone translation method and system

Publications (2)

Publication Number Publication Date
CN108280067A CN108280067A (en) 2018-07-13
CN108280067B true CN108280067B (en) 2023-04-18

Family

ID=62808725

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810162974.8A Active CN108280067B (en) 2018-02-26 2018-02-26 Earphone translation method and system

Country Status (1)

Country Link
CN (1) CN108280067B (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109005480A (en) * 2018-07-19 2018-12-14 Oppo广东移动通信有限公司 Information processing method and related product
CN109286725B (en) * 2018-10-15 2021-10-19 华为技术有限公司 Translation method and terminal
CN109614628A (en) * 2018-11-16 2019-04-12 广州市讯飞樽鸿信息技术有限公司 A kind of interpretation method and translation system based on Intelligent hardware
CN109618258A (en) * 2018-12-10 2019-04-12 深圳市友杰智新科技有限公司 A kind of the voice real time translating method and system of bluetooth headset
CN109819367A (en) * 2019-02-21 2019-05-28 日照职业技术学院 One kind being used for the English-Chinese conversion earphone of International Business Negotiation
CN110265061B (en) * 2019-06-26 2021-08-20 广州三星通信技术研究有限公司 Method and equipment for translating call voice in real time
CN110650408B (en) * 2019-10-29 2022-04-19 歌尔股份有限公司 Earphone, anti-lost method, anti-lost system and computer readable storage medium
CN111739530A (en) * 2020-06-05 2020-10-02 北京搜狗科技发展有限公司 Interaction method and device, earphone and earphone storage device
CN111739529A (en) * 2020-06-05 2020-10-02 北京搜狗科技发展有限公司 Interaction method and device, earphone and server
CN111783481A (en) * 2020-06-30 2020-10-16 歌尔科技有限公司 Earphone control method, translation method, earphone and cloud server

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104303177A (en) * 2012-04-25 2015-01-21 寇平公司 Instant translation system
CN106412813A (en) * 2016-11-30 2017-02-15 深圳市高为通信技术有限公司 Real-time communication translation method with bluetooth headsets
CN206117947U (en) * 2016-08-31 2017-04-19 深圳超多维科技有限公司 Wear -type display device
CN206313964U (en) * 2016-12-28 2017-07-07 江苏麦客创和网络技术有限公司 A kind of multifunctional headphone receiver
CN107333201A (en) * 2017-07-24 2017-11-07 歌尔科技有限公司 One kind translation earphone storage box, wireless translation earphone and wireless translation system
CN107506353A (en) * 2017-10-17 2017-12-22 深圳市沃特沃德股份有限公司 Translate box and translation system

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104303177A (en) * 2012-04-25 2015-01-21 寇平公司 Instant translation system
CN206117947U (en) * 2016-08-31 2017-04-19 深圳超多维科技有限公司 Wear -type display device
CN106412813A (en) * 2016-11-30 2017-02-15 深圳市高为通信技术有限公司 Real-time communication translation method with bluetooth headsets
CN206313964U (en) * 2016-12-28 2017-07-07 江苏麦客创和网络技术有限公司 A kind of multifunctional headphone receiver
CN107333201A (en) * 2017-07-24 2017-11-07 歌尔科技有限公司 One kind translation earphone storage box, wireless translation earphone and wireless translation system
CN107506353A (en) * 2017-10-17 2017-12-22 深圳市沃特沃德股份有限公司 Translate box and translation system

Also Published As

Publication number Publication date
CN108280067A (en) 2018-07-13

Similar Documents

Publication Publication Date Title
CN108280067B (en) Earphone translation method and system
CN108509428B (en) Earphone translation method and system
US11240376B2 (en) Transcription of communications through a device
WO2020019848A1 (en) Method for performing switching with respect to primary earpiece, mobile terminal, and computer readable storage medium
WO2020238635A1 (en) Mobile terminal and sound outlet switching method
US10659730B2 (en) Systems and methods for improved video call handling
CN110033769B (en) Recorded voice processing method, terminal and computer readable storage medium
US20210329370A1 (en) Method for providing service using earset
CN209103286U (en) Earphone translation system
WO2019164321A1 (en) Method for intelligent assistance
CN111836149A (en) Wireless earphone device and using method of wireless earphone box
US11138390B2 (en) Concurrent reception of multiple user speech input for translation
US20210249007A1 (en) Conversation assistance device, conversation assistance method, and program
CN108009116B (en) MicroUSB interface circuit and mobile terminal thereof
CN208353534U (en) A kind of translation earphone system
CN110913070B (en) Call method and terminal equipment
CN109543193B (en) Translation method, translation device and terminal equipment
CN109453526B (en) Sound processing method, terminal and computer readable storage medium
CN113905128B (en) Call reminding method and terminal
CN211403431U (en) Device for realizing simultaneous translation by extensible connection based on network
CN112866855A (en) Hearing aid method and system, earphone charging box and storage medium
CN111694539A (en) Method, apparatus and medium for switching between earpiece and speaker
TW201403357A (en) Wireless transmission headphones with language translation function
TWI779337B (en) Bluetooth earphone system, and storage and charging case for bluetooth earphones
KR20080007966A (en) Simultaneous interpretation service system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant