WO2019214299A1 - Automatic translation apparatus and method, and computer device - Google Patents

Automatic translation apparatus and method, and computer device Download PDF

Info

Publication number
WO2019214299A1
WO2019214299A1 PCT/CN2019/073534 CN2019073534W WO2019214299A1 WO 2019214299 A1 WO2019214299 A1 WO 2019214299A1 CN 2019073534 W CN2019073534 W CN 2019073534W WO 2019214299 A1 WO2019214299 A1 WO 2019214299A1
Authority
WO
WIPO (PCT)
Prior art keywords
voice information
language
voice
information
pickup
Prior art date
Application number
PCT/CN2019/073534
Other languages
French (fr)
Chinese (zh)
Inventor
张立新
Original Assignee
深圳市沃特沃德股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市沃特沃德股份有限公司 filed Critical 深圳市沃特沃德股份有限公司
Publication of WO2019214299A1 publication Critical patent/WO2019214299A1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/40Processing or translation of natural language
    • G06F40/58Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L2015/088Word spotting
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Definitions

  • the present invention relates to the field of translation machines, and in particular, to an automatic translation apparatus, method, and computer apparatus.
  • the technical problem to be solved by the present invention is to provide an automatic translation apparatus, method and computer apparatus in view of the deficiencies in the above background art.
  • the technical means adopted by the present invention to solve the technical problem is to provide an automatic translation apparatus, including:
  • a pickup assembly comprising a first mounted pickup and a second pickup, the first pickup for collecting the first voice information, and the second pickup for collecting the second voice information;
  • a controller component coupled to the first pickup and the second pickup, for receiving and comparing the magnitude of the voice amplitude of the first voice information and the second voice information, and controlling the sound pickup component to output a voice information with a large voice amplitude;
  • the main processor component is connected to the pickup component for receiving one voice information with a large amplitude of voice output by the pickup component, and according to the predetermined first language and the second language, the voice information having a larger voice amplitude
  • the translation is performed to generate translation information, and the first language and the second language respectively correspond to the first voice information and the second voice information.
  • the present invention also provides an automatic translation method for use in an automatic translation apparatus as described above, comprising:
  • the first voice information is translated from the first language to the second language to generate the first translation information
  • the second voice information is translated from the second language into the first language to generate the second translation information, where the first language and The second language corresponds to the first voice information and the second voice information, respectively.
  • the present invention also provides a computer device comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, the processor implementing the method of any of the above.
  • the present invention has at least the following beneficial effects: the embodiment of the present invention is provided with a first pickup and a second pickup that are reversely mounted, thereby respectively acquiring the first voice information and the second voice information, when the first user is facing When the first pickup is speaking and the second user is speaking to the second pickup, the voice amplitude of the first user's voice signal collected by the first pickup is greater than the voice amplitude of the first user's voice signal collected by the second pickup. Similarly, the voice amplitude of the second user's voice signal collected by the second pickup may be greater than the voice amplitude of the second user's voice signal collected by the first pickup, and the controller component determines the first voice information and the second voice.
  • the magnitude of the speech amplitude of the information to determine whether the first user is speaking or the second user; if the first user is speaking, the controller component controls the pickup component to transmit the first voice information to the main processor component, the first The voice information is translated from the first language into the second language to generate translation information; if the second user is speaking, the controller component The pickup component transmits the second voice information to the main processor component, and the second voice information is translated into the first language by the second language to generate translation information, wherein the first language and the second language respectively correspond to the first voice information and the first voice information
  • the second voice information is automatically translated according to the size of the sound received by the two-way pickup, which reduces the manual setting steps and improves the user experience.
  • FIG. 1 is a schematic structural view of an embodiment of an automatic translation apparatus of the present invention
  • FIG. 2 is a schematic structural view of another embodiment of an automatic translation apparatus of the present invention.
  • FIG. 3 is a schematic diagram showing the circuit structure of an embodiment of an automatic translation apparatus of the present invention.
  • FIG. 4 is a schematic diagram showing the circuit structure of another embodiment of the automatic translation apparatus of the present invention.
  • FIG. 5 is a block diagram showing the flow of an embodiment of the automatic translation method of the present invention.
  • FIG. 6 is a schematic block diagram of a module of an embodiment of a computer device according to the present invention.
  • an automatic translation apparatus including:
  • the pickup assembly 1 includes a first pickup 11 and a second pickup 12 which are oppositely mounted, the first pickup 11 is for collecting the first voice information, and the second pickup 12 is for collecting the second voice information;
  • the controller component 2 is connected to the first pickup 11 and the second pickup 12 for receiving and comparing the magnitude of the speech amplitude of the first speech information and the second speech information, and controlling the pickup component 1 to output a path with a larger amplitude of the speech. voice message;
  • the main processor component 3 is connected to the pickup component 1 for receiving one voice information with a large amplitude of the voice output by the pickup component 1, and has a large amplitude of the voice according to the predetermined first language and the second language.
  • the voice information is translated all the way to generate translation information, and the first language and the second language respectively correspond to the first voice information and the second voice information.
  • the first language and the second language respectively correspond to the languages of the first user and the second user, and before the conversation, the first user and the second user
  • the language can be input separately.
  • the first user and the second user can also speak the language wake-up words corresponding to the language of the first pickup 11 and the second pickup 12 respectively, and the present invention automatically translates
  • the device may respectively identify the first language corresponding to the first pickup 11 and the second language corresponding to the second pickup 12 through a preset offline voice library, thereby predetermining the first language and the second language.
  • the first pickup 11 and the second pickup 12 are reversely mounted to each other such that the first pickup 11 is in different directions.
  • the first user is facing the first pickup 11 and the second user is facing the second pickup 12.
  • the first pickup 11 and the second pickup 12 both collect the sound signal of the first user, and the sound signal collected by the first pickup 11 is the first voice information, and the second pickup 12 collects the sound signal.
  • the sound signal is the second voice information, and because the first user is facing the first pickup 11, the voice amplitude of the first voice information collected by the first pickup 11 is greater than the voice of the second voice information collected by the second pickup 12.
  • the amplitude, the controller component 2 compares the voice amplitude of the first voice information with the voice amplitude of the second voice message, and transmits the first voice information with a larger voice amplitude to the main processor component, the main processor The component receives the first voice information sent by the first pickup 11, thereby translating the first voice information from Chinese into English to generate translation information.
  • the first pickup 11 and the second pickup 12 both collect the sound signal of the second user, and the sound signal collected by the first pickup 11 is the first voice information, and the second pickup 12
  • the collected sound signal is the second voice information, and the second user is facing the second pickup 12, so that the voice amplitude of the first voice information collected by the first pickup 11 is smaller than the second voice collected by the second pickup 12.
  • the voice amplitude of the information the controller component 2 compares the voice amplitude of the first voice information with the voice amplitude of the second voice information, and sends the second voice information with a larger voice amplitude to the main processor component.
  • the main processor component receives the second voice information sent by the second pickup 11, so that the second voice information is translated from English into Chinese to generate translation information, and the user language can be automatically recognized when the first user and the second user perform dialogue communication. And the language that needs to be translated and translated, without the need for the first user and the second user to take the hand and hold the translation key of each party to speak, reducing the user's For improving the user experience.
  • the first pickup 11 and the second pickup 12 are installed in the reverse direction by the pickup assembly 1, thereby respectively acquiring the first voice information and the second voice information, when the first user faces the first pickup 11 and the second user
  • the voice amplitude of the first user's voice signal collected by the first pickup 11 is greater than the voice amplitude of the first user's voice signal collected by the second pickup 12, similarly,
  • the voice amplitude of the second user's voice signal collected by the second pickup 12 is greater than the voice amplitude of the second user's voice signal collected by the first pickup 11, and the controller component 2 determines the first voice information and the second voice information.
  • the magnitude of the voice amplitude thereby determining whether the first user or the second user is speaking.
  • the control The component 2 controls the pickup component 1 to transmit the first voice information to the main processor component 3, and the main processor component 3 receives the first voice information and the first voice information is Translating a language into a second language to generate translation information; if the second user is speaking, the speech amplitude of the second speech information is greater than the speech amplitude of the first speech information, and at this time, the controller component 2 controls the pickup component 1 to be the second
  • the voice information is sent to the main processor component 3, and the main processor component 3 receives the second voice information and translates the second voice information from the second language into the first language to generate translation information, where the first language and the second language respectively correspond to
  • the first voice information and the second voice information are automatically translated according to the size of the sound received by the two-way pickup, which reduces the steps of manual setting and improves the user experience.
  • the automatic translation device of the present invention further includes:
  • the translation output component 4 is connected to the main processor component 3 for outputting translation information, and the translation information includes at least one of translating speech information and translating text information.
  • the translation output component 4 includes a speaker, and the translated voice information is played through the speaker, wherein the first language of the first user is Japanese and the second language of the second user is Malay, and the first user is facing the first
  • the controller component 2 controls the pickup component 1 to output the first voice information to the main processor component 3, and the main processor component 3 translates the first voice information from Japanese into Malay to generate translation information, and finally passes through the speaker.
  • the controller component 2 controls the pickup component 1 to output the second voice information to the main processor component 3, and the main processor component 3 will be the second
  • the voice information is translated into Japanese by Malay to generate translation information, and finally the translated voice information in the translation information is played through the speaker, and the second user can communicate with the first user.
  • the translation output component 4 can also be designed with a display screen, and the display screen can display translated text information.
  • the first user and the second user can also communicate through the display screen to improve the user experience.
  • the first pickup 11 and the second pickup 12 employ a unidirectional pickup.
  • the pickup is also called the monitor head.
  • the monitor pickup is a device for collecting the sound of the live environment and then transmitting it to the back-end device. It is composed of a microphone (microphone) and an audio amplifier circuit.
  • Pickups are generally classified into digital pickups and analog pickups. Sound pickups are sound sensing devices that convert analog audio signals into digital signals and perform corresponding digital signal processing through a digital signal processing system.
  • both the first pickup 11 and the second pickup 12 employ a unidirectional pickup
  • the first pickup 11 and the second pickup 12 are mounted opposite each other with the highest sensitivity in the direct direction of the unidirectional pickup, and single pointing The pickup has the lowest sensitivity in the back direction.
  • the unidirectional pickup receives a positively opposite sound signal that is greater than the sound signal that faces away from the unidirectional pickup, so that the first user faces the first pickup 11
  • the two users are facing the second pickup 11 as an example. At this time, the first user faces the second pickup 12 and the second user faces the first pickup 11.
  • the first pickup 11 and the second pickup 12 are both The voice information of the first user can be received, but the voice amplitude of the first user's voice information received by the first user is greater than the voice amplitude of the first user's voice information received by the second camera, and the controller component 2
  • the first voice information collected by the first pickup 11 is compared with the voice amplitude of the second voice information collected by the second pickup 12, thereby controlling the one-way pickup with a large voice amplitude.
  • the voice information is output to the main processor component 3, and the main processor component determines that the voice information is sent from the pickup device, thereby determining the language corresponding to the voice information and the language to be translated, thereby automatically recognizing the language and automatically translating the language. Function, saving user's operation process and convenient for users.
  • the pickup assembly 1 further comprises a first amplification filtering unit 13 and a second amplification filtering unit 14, the first amplification filtering unit 13 being connected to the output of the first pickup 11 for receiving the first voice information And outputting the first voice information through the amplification filtering process; the second amplification filtering unit 14 is connected to the output end of the second pickup 12 for receiving the second voice information, and after the second voice information is subjected to the amplification filtering process Make the output.
  • the controller component 2 outputs the control to the main processor component 3, specifically, when the second user speaks, the second voice information collected by the second sounder 12 has a larger voice amplitude than the first voice collected by the first pickup device 11.
  • the controller component 2 After the voice amplitude of the information, the controller component 2 compares the voice amplitudes of the first voice information and the second voice information, and controls the pickup component 1 to output the second voice information to the main processor component 3 for translation to generate translation information, and finally
  • the translation output component 4 performs playback to realize automatic language recognition and translation functions, and the voice information can effectively remove noise and interference information in the voice information after being amplified and filtered, thereby improving the accuracy of voice recognition and language recognition.
  • the automatic translation device of the present invention further includes:
  • the analog-to-digital conversion component 5 is connected to the first amplification filtering unit 13 and the second amplification filtering unit 14 for receiving the first voice information and the second voice information that are output after the amplification filtering process, and respectively respectively, the first voice information And converting the second voice information into the first digital voice information and the second digital voice information; the analog to digital conversion component 4 is further coupled to the controller component 2 and the main processor component 3 for receiving and controlling according to the output of the controller component 2 The signal outputs first digital voice information or second digital voice information to the main processor component 3.
  • the pickup amplifies the sound collected by the microphone through a general analog circuit, and then performs analog-to-digital conversion by the analog-to-digital conversion component 5 and outputs the result to the main processor 3 for translation, which can effectively improve the accuracy of identifying the voice information, and improve the product. quality.
  • the analog to digital conversion component 5 includes a first analog to digital conversion unit 51 and a second analog to digital conversion unit 52.
  • the first analog to digital conversion unit 51 is coupled to the first amplification filtering unit 13 for receiving amplification filtering. Processing, outputting the first voice information, and converting the first voice information into the first digital voice information;
  • the second analog-to-digital conversion unit 52 is connected to the second amplification filtering unit 14 for receiving the output after the amplification filtering process The second voice information and convert the second voice information into the second digital voice information.
  • the first analog-to-digital conversion unit 51 is connected to the first amplification filtering unit 13 and the controller component 2
  • the second analog-to-digital conversion unit 52 is connected to the second amplification filtering unit 14 and the main controller component 2, and the controller component 2
  • the voice information with a large amplitude of the voice is sent to the main processor component 3, and finally the main processor component 3 translates and generates the translation information.
  • the first voice information collected by the first pickup 11 is much larger than the voice signal of the second voice information collected by the second pickup 12, and the first voice information is passed through An amplification filtering processing unit 13 performs amplification filtering processing, and then transmits to the first analog-to-digital conversion unit 51 to convert the first voice information after the amplification filtering processing into the first digital voice information, and the second voice information passes the second amplification filtering.
  • the processing unit 14 performs amplification filtering processing, and then sends the second voice information to the second digital voice information after being subjected to the amplification filtering process, and the amplified and filtered voice information is performed by the controller component 2; Comparing the amplitude of the voice, determining that the signal of the voice information of the first voice information and the second voice information is larger, thereby determining that the signal of the digital voice information of the one road is larger, and then outputting the digital voice information with a larger control signal to the a main processor component 3, the main processor component 3 is based on an analog to digital conversion unit that outputs the digital voice information,
  • the language corresponding to the digital voice information and the language to be translated may be known; the language corresponding to the first user is X and the language corresponding to the second user is Y, and the user may operate the corresponding APP application corresponding to the first selection.
  • the pickup 11 corresponds to the language X and the second pickup 12 corresponds to the language Y.
  • the system automatically sets the language of the translation to X and Y. Since the first user is facing the first pickup 11, when the first user speaks, the first user A voice message is sent to the main processor component 3, and the main processor component 3 can translate the first voice information from X to Y. Similarly, when the second user speaks, the main processor component 3 can The second voice information is translated from Y to X, which can automatically recognize the user's language and automatically translate.
  • the language corresponding to the first user is X and the language corresponding to the second user is Y.
  • the first user is speaking the first language wake-up word to the first pickup 11 and the second user is talking to the second pickup 12
  • the second language wake-up word the main processor component 3 can determine the first user corresponding language X according to the first language wake-up word and the second language wake-up word, and the second user corresponding language Y, the system automatically sets the translated language For X and Y translation, since the first user is facing the first pickup 11, when the first user speaks, the first voice information is sent to the main processor component 3, and the main processor component 3 can be the first The voice information is translated from X to Y.
  • the main processor component 3 can translate the second voice information from Y to X, can automatically recognize the user's language and automatically translate.
  • the controller component 2 comprises a comparison unit 21 and a control unit 22 connected to the comparison unit 21, the comparison unit 21 being connected to the first amplification filtering unit 13 and the second amplification filtering unit 14, for comparing a voice amplitude of the voice information and the second voice information and outputting a comparison signal to the control unit 22;
  • the control unit 22 is coupled to the first analog to digital conversion unit 51 and the second analog to digital conversion unit 52 for controlling the first signal according to the comparison signal
  • An analog-to-digital conversion unit 51 and a second analog-to-digital conversion unit 52 output digital voice information having a large amplitude of speech to the main processor unit 3.
  • the first pickup 11 and the second pickup 12 are respectively facing the first user and the second user, and the first user is facing the first pickup 11 speaking the language wake-up words corresponding to the first language, the first pickup 11 and the second pickup 12 both receive the language wake-up words, but the signal of the first voice information collected by the first pickup 11 is larger, thereby determining The first pickup 11 corresponds to the first language.
  • the second user is speaking the second wake-up word corresponding to the second language, and the first pickup 11 and the second pickup 12 both receive the voice wake-up. The word, but because the signal of the second voice information collected by the second pickup 12 is large, it is determined that the second pickup 12 corresponds to the second language.
  • the silence detection threshold may also be set in the system.
  • the voice signal amplitude of the first voice information of the first user is detected to exceed the silence detection threshold, if the system recognizes that the first voice information is in the first language, In the language awakening word, it is determined that the first voice information having a larger signal corresponds to the first language, and similarly, when the voice signal amplitude of the second voice information exceeds the silence detection threshold, if the system recognizes that the second voice information is the second In the language of the language, the second speech information corresponding to the signal is determined to correspond to the second language, and the system automatically sets the language of the translation as the first language and the second language, and the first voice information is The first language is translated into the second language, and the second voice information is translated into the first language for the second language, and the corresponding translation function is automatically activated.
  • the first voice information is amplified and filtered by the first amplification filtering processing unit 13 and then sent to the first analog to digital conversion unit 51 for conversion to the first digital voice information, and the second voice.
  • the information is amplified and filtered by the second amplification filtering processing unit 14 and then sent to the second analog-to-digital conversion unit 52 for conversion to the second digital voice information.
  • the first amplification filtering unit 13 and the second amplification filtering unit 14 are connected to the comparison unit 21.
  • the comparing unit 21 receives the first speech information and the second speech information subjected to the amplification filtering process and compares them to determine which one of the signals is large, and transmits the digital voice information of the larger one of the signals by the control unit 22 to the main unit.
  • Processor component 3 performs the processing.
  • the main processor component 3 when translating the output, can also output a signal to the control unit 22 to control the analog-to-digital conversion component 5 to stop the conversion by the control unit 22, so that the speech and translation outputs are half-duplex, avoiding each other. influences.
  • analog to digital conversion component 5 can also be designed to have only the first analog to digital conversion unit 51, the first analog to digital conversion unit 51 is connected to the control unit 22 and the main processor component 3, and the control unit 22 The first amplification filtering unit 13 and the second amplification filtering unit 14 are connected.
  • the control unit 22 controls the first amplification filtering unit 13 and the second amplification filtering unit 14
  • the voice information with a large amplitude of the voice information is transmitted to the first analog-to-digital conversion unit 51 for analog-to-digital conversion, and the control unit 22 outputs a high-low level signal to the main processor component 3 to notify the main processor component 3 that the main processor component 3 is receiving.
  • Which one is the voice signal so that the main processor component 3 can determine whether the voice signal is the first voice signal or the second voice signal, thereby realizing automatic language recognition and translation functions, and reducing the cost investment.
  • the main processor component 3 can also output a signal to the control unit 22, and the first analog-to-digital conversion unit 51 is controlled by the control unit 22 to stop the conversion, so that the speech and translation are output to a half-duplex state, thereby avoiding mutual influence.
  • the present invention further provides an automatic translation method, which is applied to the automatic translation apparatus as described above, and includes:
  • Step S1 acquiring first voice information collected by the first pickup and second voice information collected by the second pickup;
  • Step S2 determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information
  • Step S3 if yes, translating the first speech information from the first language to the second language to generate the first translation information, otherwise translating the second speech information from the second language to the first language to generate the second translation information
  • the first language and the second language respectively correspond to the first voice information and the second voice information.
  • the first pickup and the second pickup adopt a unidirectional pickup and are installed opposite to each other, by acquiring the first voice information and the second voice information, and determining whether the voice amplitude of the first voice information is greater than the second voice information.
  • the voice amplitude if yes, determining that the first user is speaking, because the first user corresponds to the first language and the second user corresponds to the second language, the first voice information can be translated from the first language to the second language.
  • the first translation information similarly, if the voice amplitude of the first voice information is smaller than the voice amplitude of the second voice information, determining that the second user is speaking, and translating the second voice information from the second language to the first language Generating the second translation information to implement a translation function between the first language and the second language, wherein the first language and the second voice are respectively selected by the first user and the second user through the translation machine.
  • the two-way unidirectional pickups respectively receive the voices of both parties of the conversation.
  • the signal output from the speaker of the speaker is larger than the signal output by the other pickup, and it is easy to distinguish which party is by the comparator.
  • speech it is not easy to malfunction, and only the voice information of the speaking party is sent to the main processor component for translation processing.
  • the unidirectional pickup also helps to reduce the influence of surrounding noise, improve the translation accuracy and the translation output effect, and improve the user. Experience.
  • the method before the step of acquiring the first voice information collected by the first pickup and the second voice information collected by the second pickup, the method includes:
  • Step S4 acquiring first wake-up voice information input by the first user and second wake-up voice information input by the second user;
  • Step S5 Acquire a first language corresponding to the first wake-up voice information and a second language corresponding to the second wake-up voice information according to the preset voice library.
  • the user before the formal conversation conversation, the user can set the respective language by the wake-up words.
  • the first user is speaking the language wake-up words of the first language to the first pickup, and the first pickup collects the first user.
  • the first wake-up voice information and then acquire the first language of the first user according to the first wake-up voice information and the preset voice library.
  • the second user is speaking the second-language language wake-up word to the second pickup,
  • the second pickup acquires the second wake-up voice information of the second user, and then acquires the second language of the second user according to the second wake-up voice information and the preset voice library, so that the next communication conversation step can be performed, from the first of the two sides of the talk.
  • the step of acquiring the first language corresponding to the first wake-up voice information and the second language corresponding to the second wake-up voice information according to the preset voice library includes:
  • Step S51 determining whether the voice amplitudes of the first wake-up voice information and the second wake-up voice information are both greater than a preset silence detection threshold
  • Step S52 if yes, acquiring a second language wake-up word in the first language wake-up word and the second wake-up voice information in the first wake-up voice information;
  • Step S53 Acquire a first language that matches the first wake-up word and a second language that matches the second wake-up word according to the preset voice library.
  • a preset silence detection (VAD) threshold when the amplitudes of the first wake-up voice information and the second wake-up voice information respectively collected by the first pickup and the second pickup exceed the preset silence detection
  • VAD preset silence detection
  • the system performs the language recognition.
  • the amplitudes of the first voice information and the second voice information collected by the first pickup and the second pickup respectively exceed the preset silence detection threshold
  • the system The translation function will be performed to avoid the misoperation caused by the user's breathing or the surrounding environment, and improve the accuracy of speech recognition and language recognition.
  • the method further includes:
  • step S54 the prompt sound is turned on by playing the translation function in the first language and the second language respectively.
  • the automatic translation device reminds the user that the mutual translation function is currently enabled by means of a voice prompt. Specifically, the automatic translation device performs voice through the first language and the second language respectively. The broadcast translation function turns on the prompt tone to inform the user that the conversation can be opened. In another optional embodiment, the automatic translation device may further enable the text display translation function by using the first language and the second language on the display screen to inform the user that the conversation can be opened.
  • the step of determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information includes:
  • Step S21 performing amplification filtering on the first voice information and the second voice information respectively;
  • Step S22 it is determined whether the amplitude of the voice of the first voice information after the amplification and filtering process is greater than the voice amplitude of the second voice information after the amplification and filtering process.
  • the automatic translation device receives the first voice information acquired by the first pickup through the first amplification filtering unit, and performs output after the amplification filtering process; and receives the second voice information acquired by the second pickup through the second amplification filtering unit, And after the amplification filtering process, the output is performed; finally, the amplitude of the speech of the first speech information and the second speech information after the amplification filtering process is compared.
  • a specific comparison signal is obtained, for the control unit to control, according to the comparison signal, the output digital speech with a large amplitude of the speech in the first analog to digital conversion unit and the second mode conversion unit according to the comparison signal.
  • Information to the main processor component After the amplification and filtering process, the voice information can effectively remove the noise and interference information in the voice information, and improve the accuracy of voice recognition and language recognition.
  • the method further includes:
  • Step S23 converting the first voice information after the amplification and filtering processing into the first digital voice information and converting the second voice information after the amplification and filtering processing into the second digital voice information.
  • the automatic translation device converts the first voice information after the amplification and filtering processing into the first digital voice information, and converts the second voice information after the amplification and filtering processing into the second digital voice information, in some embodiments.
  • the first digital voice information and the second digital voice information are both digital signals.
  • the pickup amplifies the voice information collected by the microphone through a general analog circuit, and then performs analog-to-digital conversion of the voice information through the analog-to-digital conversion component, and then outputs the digital voice information to the main processor for translation, which can effectively improve the recognition voice information. Accuracy and improve product quality.
  • the first voice information is translated from the first language to the second language to generate the first translation information, otherwise the second voice information is translated from the second language into the first language.
  • the step of generating the second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second voice information including:
  • Step S31 if the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information, translating the first digital voice information from the first language to the second language generation Decoding the first translation information, otherwise translating the second digital speech information from the second language into the first language to generate the second translation information, wherein the first language and the second language respectively Corresponding to the first digital voice information and the second digital voice information.
  • the comparison signal determines that the signal of the voice information of the first voice information and the second voice information is larger, thereby determining that the signal of the digital voice information of the one channel is larger, and then the control signal is larger.
  • the digital voice information is output, and the main processor component can know the language corresponding to the digital voice information and the language to be translated according to the analog-to-digital conversion unit that outputs the digital voice information.
  • the method further includes:
  • step S6 the first translation information or the second translation information is outputted in a manner of translating the voice information and/or translating the text information.
  • the automatic translation device converts the first voice information from the first language into the second language to generate the first translation information and the second voice information from the second language to the first language to generate the second translation information, and finally passes The speaker plays the translated voice information or displays the translated text information through the display screen, so that the second user can communicate with the first user, and the translation information is output in various ways.
  • the speaker fails and does not work, the first user and the second user You can also communicate through the display to improve the user experience.
  • the present invention further provides a computer device including a memory 1003 and a processor 1002.
  • the memory 1003 stores a computer program 1004.
  • the processor 1002 executes the computer program 1004, the steps of any of the above methods are implemented.
  • the method includes: acquiring first voice information collected by the first pickup and second voice information collected by the second sounder; determining whether a voice amplitude of the first voice information is greater than a voice amplitude of the second voice information; if yes, using the first voice
  • the information is translated from the first language into the second language to generate the first translation information, otherwise the second voice information is translated from the second language into the first language to generate the second translation information, wherein the first language and the second language respectively correspond to the first language Voice information and second voice information.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Theoretical Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Machine Translation (AREA)

Abstract

Disclosed by the present invention are an automatic translation apparatus and method, and a computer device, the method comprising: respectively acquiring first voice information and second voice information collected by a first pickup and a second pickup; comparing the magnitudes of the voice amplitudes of the first voice information and the second voice information; outputting the voice information having the greater voice amplitude as translation information. By means of automatically translating according to the magnitudes of voice information received by two sound pickups, steps of manual configuration are reduced, thereby improving user experience.

Description

自动翻译装置、方法及计算机设备Automatic translation device, method and computer device 技术领域Technical field
本发明涉及翻译机技术领域,特别涉及一种自动翻译装置、方法及计算机设备。The present invention relates to the field of translation machines, and in particular, to an automatic translation apparatus, method, and computer apparatus.
背景技术Background technique
随着国际贸易以及地球村的发展,跨国之间或者跨语言之间的交流也日益频繁,当交流的双方不懂彼此之间的语言时,往往需要借助翻译机来进行交流,但是,现在市面上的智能翻译机需要先通过按键来设置翻译语种,并且由于语音识别比较难识别混合语种,故需按住各自一方的翻译键来讲话并启动对应的语种翻译功能,因此当两人交谈时需要两人轮流伸手按住各自一方的翻译键讲话,翻译键可能在主机上也可能在耳机或其它可穿戴设备上,在沟通交流时需要不断的进行按键操作,这个操作很不人性化,用户体验较差。With the development of international trade and the global village, cross-border or cross-language communication is becoming more frequent. When the two sides of the communication do not understand each other's language, they often need to use the translation machine to communicate, but now on the market. The intelligent translation machine needs to set the translation language by pressing the button first, and since the speech recognition is more difficult to recognize the mixed language, it is necessary to press and hold the translation key of each side to speak and start the corresponding language translation function, so when the two people talk, two are needed. People take turns to hold down the translation keys of their respective sides. The translation keys may be on the main unit or on the headphones or other wearable devices. In the communication, the button operations need to be performed continuously. This operation is very unhumanized and the user experience is more user-friendly. difference.
技术问题technical problem
本发明要解决的技术问题在于针对上述背景技术中的不足之处,提供一种自动翻译装置、方法及计算机设备。The technical problem to be solved by the present invention is to provide an automatic translation apparatus, method and computer apparatus in view of the deficiencies in the above background art.
技术解决方案Technical solution
本发明解决技术问题采用的技术手段是提供一种自动翻译装置,包括:The technical means adopted by the present invention to solve the technical problem is to provide an automatic translation apparatus, including:
拾音器组件,包括反向安装的第一拾音器和第二拾音器,第一拾音器用于采集第一语音信息,第二拾音器用于采集第二语音信息;a pickup assembly comprising a first mounted pickup and a second pickup, the first pickup for collecting the first voice information, and the second pickup for collecting the second voice information;
控制器组件,与第一拾音器和第二拾音器连接,用于接收并比较第一语音信息和第二语音信息的语音幅值大小,并控制拾音器组件输出语音幅值较大的一路语音信息;a controller component, coupled to the first pickup and the second pickup, for receiving and comparing the magnitude of the voice amplitude of the first voice information and the second voice information, and controlling the sound pickup component to output a voice information with a large voice amplitude;
主处理器组件,与拾音器组件连接,用于接收拾音器组件输出的语音幅值较大的一路语音信息,并根据预确定的第一语种和第二语种,对语音幅值较大的一路语音信息进行翻译,生成翻译信息,第一语种和第二语种分别对应第一语音信息和第二语音信息。The main processor component is connected to the pickup component for receiving one voice information with a large amplitude of voice output by the pickup component, and according to the predetermined first language and the second language, the voice information having a larger voice amplitude The translation is performed to generate translation information, and the first language and the second language respectively correspond to the first voice information and the second voice information.
另一方面,本发明还提供一种自动翻译方法,应用于如上述的自动翻译装置中,包括:In another aspect, the present invention also provides an automatic translation method for use in an automatic translation apparatus as described above, comprising:
获取第一拾音器采集的第一语音信息以及第二拾音器采集的第二语音信息;Acquiring the first voice information collected by the first pickup and the second voice information collected by the second pickup;
判断第一语音信息的语音幅值是否大于第二语音信息的语音幅值;Determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information;
若是,则将第一语音信息由第一语种翻译为第二语种生成第一翻译信息,否则将第二语音信息由第二语种翻译为第一语种生成第二翻译信息,其中,第一语种和第二语种分别对应第一语音信息和第二语音信息。If yes, the first voice information is translated from the first language to the second language to generate the first translation information, otherwise the second voice information is translated from the second language into the first language to generate the second translation information, where the first language and The second language corresponds to the first voice information and the second voice information, respectively.
本发明还提出了一种计算机设备,包括存储器、处理器以及存储在存储器上并可在处理器上运行的计算机程序,处理器执行计算机程序时实现上述任一项的方法。The present invention also provides a computer device comprising a memory, a processor, and a computer program stored on the memory and operable on the processor, the processor implementing the method of any of the above.
有益效果Beneficial effect
采用上述技术方案,本发明至少具有以下有益效果:本发明实施例设有反向安装的第一拾音器和第二拾音器,从而分别采集第一语音信息和第二语音信息,当第一用户正对第一拾音器而第二用户正对第二拾音器进行讲话时,第一拾音器采集到的第一用户的声音信号的语音幅值会大于第二拾音器采集的第一用户的声音信号的语音幅值,同理,第二拾音器采集到的第二用户的声音信号的语音幅值会大于第一拾音器采集的第二用户的声音信号的语音幅值,控制器组件通过判断第一语音信息和第二语音信息的语音幅值大小,从而确定正在讲话的是第一用户还是第二用户;若是第一用户正在讲话,控制器组件控制拾音器组件将第一语音信息发送至主处理器组件,将该第一语音信息由第一语种翻译成第二语种生成翻译信息;若是第二用户正在讲话,控制器组件控制拾音器组件将第二语音信息发送至主处理器组件,将该第二语音信息由第二语种翻译成第一语种生成翻译信息,其中第一语种和第二语种分别对应第一语音信息和第二语音信息,通过根据两路拾音器接收的声音大小自动进行翻译,减少了人工设置的步骤,提高用户体验。With the above technical solution, the present invention has at least the following beneficial effects: the embodiment of the present invention is provided with a first pickup and a second pickup that are reversely mounted, thereby respectively acquiring the first voice information and the second voice information, when the first user is facing When the first pickup is speaking and the second user is speaking to the second pickup, the voice amplitude of the first user's voice signal collected by the first pickup is greater than the voice amplitude of the first user's voice signal collected by the second pickup. Similarly, the voice amplitude of the second user's voice signal collected by the second pickup may be greater than the voice amplitude of the second user's voice signal collected by the first pickup, and the controller component determines the first voice information and the second voice. The magnitude of the speech amplitude of the information to determine whether the first user is speaking or the second user; if the first user is speaking, the controller component controls the pickup component to transmit the first voice information to the main processor component, the first The voice information is translated from the first language into the second language to generate translation information; if the second user is speaking, the controller component The pickup component transmits the second voice information to the main processor component, and the second voice information is translated into the first language by the second language to generate translation information, wherein the first language and the second language respectively correspond to the first voice information and the first voice information The second voice information is automatically translated according to the size of the sound received by the two-way pickup, which reduces the manual setting steps and improves the user experience.
附图说明DRAWINGS
图1是本发明自动翻译装置一个实施例的结构示意图;1 is a schematic structural view of an embodiment of an automatic translation apparatus of the present invention;
图2是本发明自动翻译装置另一个实施例结构示意图;2 is a schematic structural view of another embodiment of an automatic translation apparatus of the present invention;
图3是本发明自动翻译装置一个实施例的电路结构示意图;3 is a schematic diagram showing the circuit structure of an embodiment of an automatic translation apparatus of the present invention;
图4是本发明自动翻译装置另一个实施例的电路结构示意图;4 is a schematic diagram showing the circuit structure of another embodiment of the automatic translation apparatus of the present invention;
图5是本发明自动翻译方法一个实施例的流程方框示意图;Figure 5 is a block diagram showing the flow of an embodiment of the automatic translation method of the present invention;
图6为本发明计算机设备一实施例的模块示意框图。FIG. 6 is a schematic block diagram of a module of an embodiment of a computer device according to the present invention.
本发明目的的实现、功能特点及优点将结合实施例,参照附图做进一步说明。The implementation, functional features, and advantages of the present invention will be further described in conjunction with the embodiments.
本发明的最佳实施方式BEST MODE FOR CARRYING OUT THE INVENTION
请参阅图1至图4,本发明提供一种技术方案:一种自动翻译装置,包括:Referring to FIG. 1 to FIG. 4, the present invention provides a technical solution: an automatic translation apparatus, including:
拾音器组件1,包括反向安装的第一拾音器11和第二拾音器12,第一拾音器11用于采集第一语音信息,第二拾音器12用于采集第二语音信息;The pickup assembly 1 includes a first pickup 11 and a second pickup 12 which are oppositely mounted, the first pickup 11 is for collecting the first voice information, and the second pickup 12 is for collecting the second voice information;
控制器组件2,与第一拾音器11和第二拾音器连接12,用于接收并比较第一语音信息和第二语音信息的语音幅值大小,并控制拾音器组件1输出语音幅值较大的一路语音信息;The controller component 2 is connected to the first pickup 11 and the second pickup 12 for receiving and comparing the magnitude of the speech amplitude of the first speech information and the second speech information, and controlling the pickup component 1 to output a path with a larger amplitude of the speech. voice message;
主处理器组件3,与拾音器组件1连接,用于接收拾音器组件1输出的语音幅值较大的一路语音信息,并根据预确定的第一语种和第二语种,对语音幅值较大的一路语音信息进行翻译,生成翻译信息,第一语种和第二语种分别对应第一语音信息和第二语音信息。The main processor component 3 is connected to the pickup component 1 for receiving one voice information with a large amplitude of the voice output by the pickup component 1, and has a large amplitude of the voice according to the predetermined first language and the second language. The voice information is translated all the way to generate translation information, and the first language and the second language respectively correspond to the first voice information and the second voice information.
在一个实施例中,以用户包括第一用户和第二用户为例,第一语种和第二语种分别对应第一用户和第二用户的语言,在进行对话之前,第一用户和第二用户可以分别输入自己的语种,当然,在进行沟通交流之前,第一用户和第二用户还可以分别正对第一拾音器11和第二拾音器12讲出对应自己语种的语种唤醒词,本发明自动翻译装置可以通过预设的离线语音库,分别识别到对应第一拾音器11的第一语种以及对应第二拾音器12的第二语种,从而预确定出第一语种和第二语种。具体地,以第一用户的第一语种为汉语而第二用户的第二语种为英语为例,第一拾音器11和第二拾音器12彼此反向安装,从而使得在不同方向上第一拾音器11和第二拾音器12采集到的声音信号的语音幅值不同,第一用户正对第一拾音器11,第二用户正对第二拾音器12。In one embodiment, taking the user as the first user and the second user as an example, the first language and the second language respectively correspond to the languages of the first user and the second user, and before the conversation, the first user and the second user The language can be input separately. Of course, before the communication is performed, the first user and the second user can also speak the language wake-up words corresponding to the language of the first pickup 11 and the second pickup 12 respectively, and the present invention automatically translates The device may respectively identify the first language corresponding to the first pickup 11 and the second language corresponding to the second pickup 12 through a preset offline voice library, thereby predetermining the first language and the second language. Specifically, in a case where the first language of the first user is Chinese and the second language of the second user is English, the first pickup 11 and the second pickup 12 are reversely mounted to each other such that the first pickup 11 is in different directions. Unlike the voice amplitude of the sound signal collected by the second pickup 12, the first user is facing the first pickup 11 and the second user is facing the second pickup 12.
当第一用户进行讲话时,第一拾音器11和第二拾音器12均会采集到第一用户的声音信号,第一拾音器11采集到的声音信号为第一语音信息,第二拾音器12采集到的声音信号为第二语音信息,而由于第一用户正对第一拾音器11,使得第一拾音器11采集到的第一语音信息的语音幅值大于第二拾音器12采集到的第二语音信息的语音幅值,控制器组件2对第一语音信息的语音幅值和第二语音信息的语音幅值进行比较,并将语音幅值较大的第一语音信息发送至主处理器组件,主处理器组件接收第一拾音器11发送的第一语音信息,从而将第一语音信息由汉语翻译成英语生成翻译信息。When the first user speaks, the first pickup 11 and the second pickup 12 both collect the sound signal of the first user, and the sound signal collected by the first pickup 11 is the first voice information, and the second pickup 12 collects the sound signal. The sound signal is the second voice information, and because the first user is facing the first pickup 11, the voice amplitude of the first voice information collected by the first pickup 11 is greater than the voice of the second voice information collected by the second pickup 12. The amplitude, the controller component 2 compares the voice amplitude of the first voice information with the voice amplitude of the second voice message, and transmits the first voice information with a larger voice amplitude to the main processor component, the main processor The component receives the first voice information sent by the first pickup 11, thereby translating the first voice information from Chinese into English to generate translation information.
同理,当第二用户进行讲话时,第一拾音器11和第二拾音器12均会采集到第二用户的声音信号,第一拾音器11采集到的声音信号为第一语音信息,第二拾音器12采集到的声音信号为第二语音信息,而由于第二用户正对第二拾音器12,使得第一拾音器11采集到的第一语音信息的语音幅值小于第二拾音器12采集到的第二语音信息的语音幅值,控制器组件2对第一语音信息的语音幅值和第二语音信息的语音幅值进行比较,并将语音幅值较大的第二语音信息发送至主处理器组件,主处理器组件接收第二拾音器11发送的第二语音信息,从而将第二语音信息由英语翻译成汉语生成翻译信息,在第一用户和第二用户进行对话交流时,能自动识别用户的语种以及需要翻译的语种并进行翻译,不需要第一用户和第二用户轮流伸手按住各自一方的翻译键进行讲话,减少用户的操作,提高用户体验。Similarly, when the second user speaks, the first pickup 11 and the second pickup 12 both collect the sound signal of the second user, and the sound signal collected by the first pickup 11 is the first voice information, and the second pickup 12 The collected sound signal is the second voice information, and the second user is facing the second pickup 12, so that the voice amplitude of the first voice information collected by the first pickup 11 is smaller than the second voice collected by the second pickup 12. The voice amplitude of the information, the controller component 2 compares the voice amplitude of the first voice information with the voice amplitude of the second voice information, and sends the second voice information with a larger voice amplitude to the main processor component. The main processor component receives the second voice information sent by the second pickup 11, so that the second voice information is translated from English into Chinese to generate translation information, and the user language can be automatically recognized when the first user and the second user perform dialogue communication. And the language that needs to be translated and translated, without the need for the first user and the second user to take the hand and hold the translation key of each party to speak, reducing the user's For improving the user experience.
本实施例通过拾音器组件1设有反向安装的第一拾音器11和第二拾音器12,从而分别采集第一语音信息和第二语音信息,当第一用户正对第一拾音器11而第二用户正对第二拾音器12进行讲话时,第一拾音器11采集到的第一用户的声音信号的语音幅值会大于第二拾音器12采集的第一用户的声音信号的语音幅值,同理,第二拾音器12采集到的第二用户的声音信号的语音幅值会大于第一拾音器11采集的第二用户的声音信号的语音幅值,控制器组件2通过判断第一语音信息和第二语音信息的语音幅值大小,从而确定正在讲话的是第一用户还是第二用户,若是第一用户正在讲话,则第一语音信息的语音幅值大于第二语音信息的语音幅值,此时,控制器组件2控制拾音器组件1将第一语音信息发送至主处理器组件3,主处理器组件3接收第一语音信息并将该第一语音信息由第一语种翻译成第二语种生成翻译信息;若是第二用户正在讲话,则第二语音信息的语音幅值大于第一语音信息的语音幅值,此时控制器组件2控制拾音器组件1将第二语音信息发送至主处理器组件3,主处理器组件3接收第二语音信息并将该第二语音信息由第二语种翻译成第一语种生成翻译信息,其中第一语种和第二语种分别对应第一语音信息和第二语音信息,通过根据两路拾音器接收的声音大小自动进行翻译,减少了人工设置的步骤,提高用户体验。In this embodiment, the first pickup 11 and the second pickup 12 are installed in the reverse direction by the pickup assembly 1, thereby respectively acquiring the first voice information and the second voice information, when the first user faces the first pickup 11 and the second user When the second pickup 12 is being spoken, the voice amplitude of the first user's voice signal collected by the first pickup 11 is greater than the voice amplitude of the first user's voice signal collected by the second pickup 12, similarly, The voice amplitude of the second user's voice signal collected by the second pickup 12 is greater than the voice amplitude of the second user's voice signal collected by the first pickup 11, and the controller component 2 determines the first voice information and the second voice information. The magnitude of the voice amplitude, thereby determining whether the first user or the second user is speaking. If the first user is speaking, the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information. At this time, the control The component 2 controls the pickup component 1 to transmit the first voice information to the main processor component 3, and the main processor component 3 receives the first voice information and the first voice information is Translating a language into a second language to generate translation information; if the second user is speaking, the speech amplitude of the second speech information is greater than the speech amplitude of the first speech information, and at this time, the controller component 2 controls the pickup component 1 to be the second The voice information is sent to the main processor component 3, and the main processor component 3 receives the second voice information and translates the second voice information from the second language into the first language to generate translation information, where the first language and the second language respectively correspond to The first voice information and the second voice information are automatically translated according to the size of the sound received by the two-way pickup, which reduces the steps of manual setting and improves the user experience.
在一个可选实施例中,本发明自动翻译装置还包括:In an optional embodiment, the automatic translation device of the present invention further includes:
翻译输出组件4,翻译输出组件4与主处理器组件3连接,用于输出翻译信息,翻译信息至少包括翻译语音信息以及翻译文字信息中的一种。The translation output component 4 is connected to the main processor component 3 for outputting translation information, and the translation information includes at least one of translating speech information and translating text information.
在实施时,翻译输出组件4包括扬声器,通过扬声器播放翻译语音信息,以第一用户的第一语种为日语而第二用户的第二语种为马来语为例,第一用户正对第一拾音器11进行讲话时,控制器组件2控制拾音器组件1输出第一语音信息至主处理器组件3,主处理器组件3将第一语音信息由日语翻译成马来语生成翻译信息,最后通过扬声器播放翻译信息中的翻译语音信息,第二用户正对第二拾音器12进行讲话时,控制器组件2控制拾音器组件1输出第二语音信息至主处理器组件3,主处理器组件3将第二语音信息由马来语翻译成日语生成翻译信息,最后通过扬声器播放翻译信息中的翻译语音信息,第二用户即可与第一用户进行沟通交流。In implementation, the translation output component 4 includes a speaker, and the translated voice information is played through the speaker, wherein the first language of the first user is Japanese and the second language of the second user is Malay, and the first user is facing the first When the pickup 11 speaks, the controller component 2 controls the pickup component 1 to output the first voice information to the main processor component 3, and the main processor component 3 translates the first voice information from Japanese into Malay to generate translation information, and finally passes through the speaker. Playing the translated voice information in the translation information, when the second user is speaking to the second pickup 12, the controller component 2 controls the pickup component 1 to output the second voice information to the main processor component 3, and the main processor component 3 will be the second The voice information is translated into Japanese by Malay to generate translation information, and finally the translated voice information in the translation information is played through the speaker, and the second user can communicate with the first user.
当然,翻译输出组件4还可以设计有显示屏,显示屏能显示翻译文字信息,当扬声器出现故障而不工作时,第一用户和第二用户还可以通过显示屏进行沟通交流,提高用户体验。Of course, the translation output component 4 can also be designed with a display screen, and the display screen can display translated text information. When the speaker fails and does not work, the first user and the second user can also communicate through the display screen to improve the user experience.
在一个可选实施例中,第一拾音器11和第二拾音器12采用单指向性拾音器。In an alternative embodiment, the first pickup 11 and the second pickup 12 employ a unidirectional pickup.
拾音器又称监听头,监听拾音器是用来采集现场环境声音再传送到后端设备的一个器件,它是由咪头(麦克风)和音频放大电路构成。拾音器一般分为数字拾音器和模拟拾音器,数字拾音器就是通过数字信号处理系统将模拟的音频信号转换成数字信号并进行相应的数字信号处理的声音传感设备。The pickup is also called the monitor head. The monitor pickup is a device for collecting the sound of the live environment and then transmitting it to the back-end device. It is composed of a microphone (microphone) and an audio amplifier circuit. Pickups are generally classified into digital pickups and analog pickups. Sound pickups are sound sensing devices that convert analog audio signals into digital signals and perform corresponding digital signal processing through a digital signal processing system.
在实施时,第一拾音器11和第二拾音器12均采用单指向性拾音器,且第一拾音器11和第二拾音器12彼此反向安装,在单指向性拾音器正对方向的灵敏度最高,而单指向性拾音器后背方向的灵敏度最低,具体地,单指向性拾音器接收到的正对着的声音信号要大于背对着单指向性拾音器的声音信号,以第一用户正对第一拾音器11而第二用户正对第二拾音器11为例,此时,第一用户背对第二拾音器12而第二用户背对第一拾音器11,若第一用户讲话,第一拾音器11和第二拾音器12均能接收到第一用户的声音信息,但是第一拾音器11接收到第一用户的声音信息的语音幅值要大于第二拾音器接收到第一用户的声音信息的语音幅值,控制器组件2对第一拾音器11采集的第一语音信息和第二拾音器12采集的第二语音信息的语音幅值进行比较,从而控制语音幅值较大的一路拾音器输出语音信息至主处理器组件3,主处理器组件判断该语音信息是从那一路拾音器发出的,从而确定该语音信息对应的语种以及需要翻译的语种,进而能实现自动识别语种并进行自动翻译功能,节省用户的操作流程,方便用户使用。In implementation, both the first pickup 11 and the second pickup 12 employ a unidirectional pickup, and the first pickup 11 and the second pickup 12 are mounted opposite each other with the highest sensitivity in the direct direction of the unidirectional pickup, and single pointing The pickup has the lowest sensitivity in the back direction. Specifically, the unidirectional pickup receives a positively opposite sound signal that is greater than the sound signal that faces away from the unidirectional pickup, so that the first user faces the first pickup 11 The two users are facing the second pickup 11 as an example. At this time, the first user faces the second pickup 12 and the second user faces the first pickup 11. If the first user speaks, the first pickup 11 and the second pickup 12 are both The voice information of the first user can be received, but the voice amplitude of the first user's voice information received by the first user is greater than the voice amplitude of the first user's voice information received by the second camera, and the controller component 2 The first voice information collected by the first pickup 11 is compared with the voice amplitude of the second voice information collected by the second pickup 12, thereby controlling the one-way pickup with a large voice amplitude. The voice information is output to the main processor component 3, and the main processor component determines that the voice information is sent from the pickup device, thereby determining the language corresponding to the voice information and the language to be translated, thereby automatically recognizing the language and automatically translating the language. Function, saving user's operation process and convenient for users.
在一个可选实施例中,拾音器组件1还包括第一放大滤波单元13以及第二放大滤波单元14,第一放大滤波单元13连接至第一拾音器11的输出端,用于接收第一语音信息,并将第一语音信息经过放大滤波处理后进行输出;第二放大滤波单元14连接至第二拾音器12的输出端,用于接收第二语音信息,并将第二语音信息经过放大滤波处理后进行输出。In an alternative embodiment, the pickup assembly 1 further comprises a first amplification filtering unit 13 and a second amplification filtering unit 14, the first amplification filtering unit 13 being connected to the output of the first pickup 11 for receiving the first voice information And outputting the first voice information through the amplification filtering process; the second amplification filtering unit 14 is connected to the output end of the second pickup 12 for receiving the second voice information, and after the second voice information is subjected to the amplification filtering process Make the output.
本实施例通过设有第一放大滤波单元13和第二放大滤波单元14,以分别对第一拾音器11和第二拾音器12采集的第一语音信息和第二语音信息进行放大滤波处理,最后在控制器组件2的控制下输出至主处理器组件3,具体地,当第二用户讲话时,第二拾音器12采集到的第二语音信息的语音幅值大于第一拾音器11采集的第一语音信息的语音幅值,控制器组件2经过比较第一语音信息和第二语音信息的语音幅值后,控制拾音器组件1输出第二语音信息至主处理器组件3进行翻译生成翻译信息,最后由翻译输出组件4进行播放,实现语种自动识别和翻译功能,而且语音信息经过放大滤波处理后能有效去除语音信息中的杂音以及干扰信息,提高语音识别以及语种识别的准确度。In this embodiment, by providing the first amplification filtering unit 13 and the second amplification filtering unit 14, the first speech information and the second speech information collected by the first pickup 11 and the second pickup 12 are respectively subjected to amplification filtering processing, and finally The controller component 2 outputs the control to the main processor component 3, specifically, when the second user speaks, the second voice information collected by the second sounder 12 has a larger voice amplitude than the first voice collected by the first pickup device 11. After the voice amplitude of the information, the controller component 2 compares the voice amplitudes of the first voice information and the second voice information, and controls the pickup component 1 to output the second voice information to the main processor component 3 for translation to generate translation information, and finally The translation output component 4 performs playback to realize automatic language recognition and translation functions, and the voice information can effectively remove noise and interference information in the voice information after being amplified and filtered, thereby improving the accuracy of voice recognition and language recognition.
在一个可选实施例中,本发明自动翻译装置还包括:In an optional embodiment, the automatic translation device of the present invention further includes:
模数转换组件5,与第一放大滤波单元13和第二放大滤波单元14连接,用于接收经过放大滤波处理后进行输出的第一语音信息和第二语音信息,并分别将第一语音信息和第二语音信息转换为第一数字语音信息和第二数字语音信息;模数转换组件4还与控制器组件2和主处理器组件3连接,用于接收并根据控制器组件2输出的控制信号输出第一数字语音信息或第二数字语音信息至主处理器组件3。The analog-to-digital conversion component 5 is connected to the first amplification filtering unit 13 and the second amplification filtering unit 14 for receiving the first voice information and the second voice information that are output after the amplification filtering process, and respectively respectively, the first voice information And converting the second voice information into the first digital voice information and the second digital voice information; the analog to digital conversion component 4 is further coupled to the controller component 2 and the main processor component 3 for receiving and controlling according to the output of the controller component 2 The signal outputs first digital voice information or second digital voice information to the main processor component 3.
在实施时,拾音器通过一般的模拟电路放大麦克风采集到的声音,然后通过模数转换组件5进行模数转换后输出至主处理器3进行翻译,能有效提高识别语音信息的准确度,提高产品质量。In implementation, the pickup amplifies the sound collected by the microphone through a general analog circuit, and then performs analog-to-digital conversion by the analog-to-digital conversion component 5 and outputs the result to the main processor 3 for translation, which can effectively improve the accuracy of identifying the voice information, and improve the product. quality.
在一个实施例中,模数转换组件5包括第一模数转换单元51和第二模数转换单元52,第一模数转换单元51与第一放大滤波单元13连接,用于接收经过放大滤波处理后进行输出的第一语音信息,并将第一语音信息转换为第一数字语音信息;第二模数转换单元52与第二放大滤波单元14连接,用于接收经过放大滤波处理后进行输出的第二语音信息,并将第二语音信息转换为第二数字语音信息。In one embodiment, the analog to digital conversion component 5 includes a first analog to digital conversion unit 51 and a second analog to digital conversion unit 52. The first analog to digital conversion unit 51 is coupled to the first amplification filtering unit 13 for receiving amplification filtering. Processing, outputting the first voice information, and converting the first voice information into the first digital voice information; the second analog-to-digital conversion unit 52 is connected to the second amplification filtering unit 14 for receiving the output after the amplification filtering process The second voice information and convert the second voice information into the second digital voice information.
具体地,第一模数转换单元51与第一放大滤波单元13和控制器组件2连接,第二模数转换单元52与第二放大滤波单元14和主控制器组件2连接,控制器组件2接收到第一语音信息和第二语音信息后,对第一语音信息和第二语音信息进行语音幅值比较,判断哪一路语音信息的语音幅值信号较大,然后控制模数转换组件5输出经过模数转换后的语音幅值较大的一路语音信息至主处理器组件3,最后由主处理器组件3进行翻译生成翻译信息。Specifically, the first analog-to-digital conversion unit 51 is connected to the first amplification filtering unit 13 and the controller component 2, and the second analog-to-digital conversion unit 52 is connected to the second amplification filtering unit 14 and the main controller component 2, and the controller component 2 After receiving the first voice information and the second voice information, performing voice amplitude comparison on the first voice information and the second voice information, determining which voice information has a larger voice amplitude signal, and then controlling the analog-to-digital conversion component 5 to output After the analog-to-digital conversion, the voice information with a large amplitude of the voice is sent to the main processor component 3, and finally the main processor component 3 translates and generates the translation information.
具体地,当第一用户正对第一拾音器11讲话时,第一拾音器11采集的第一语音信息要比第二拾音器12采集的第二语音信息的语音信号大很多,第一语音信息经过第一放大滤波处理单元13进行放大滤波处理,然后发送至第一模数转换单元51将经过放大滤波处理后的第一语音信息转换为第一数字语音信息,而第二语音信息经过第二放大滤波处理单元14进行放大滤波处理,然后发送至第二模数转换单元52将经过放大滤波处理后第二语音信息转换为第二数字语音信息,同时,放大滤波后的语音信息通过控制器组件2进行语音幅值比较,判断第一语音信息和第二语音信息中那一路语音信息的信号较大,从而确定那一路的数字语音信息的信号较大,然后控制信号较大的一路数字语音信息输出至主处理器组件3,主处理器组件3根据输出该数字语音信息的模数转换单元,即可知道该数字语音信息对应的语种以及需要进行翻译的语种;以第一用户对应的语种为X而第二用户对应的语种为Y 为例,用户可以操作装置中对应的APP应用对应选择第一拾音器11对应语种X而第二拾音器12对应语种Y,系统自动将此次翻译的语种设为X与Y 互译,由于第一用户正对第一拾音器11,所以当第一用户讲话时,第一语音信息会被发送至主处理器组件3,主处理器组件3即可将第一语音信息由X翻译为Y,同理,当第二用户讲话时,主处理器组件3即可将第二语音信息由Y翻译为X,能自动识别用户的语种并自动进行翻译。Specifically, when the first user is speaking to the first pickup 11, the first voice information collected by the first pickup 11 is much larger than the voice signal of the second voice information collected by the second pickup 12, and the first voice information is passed through An amplification filtering processing unit 13 performs amplification filtering processing, and then transmits to the first analog-to-digital conversion unit 51 to convert the first voice information after the amplification filtering processing into the first digital voice information, and the second voice information passes the second amplification filtering. The processing unit 14 performs amplification filtering processing, and then sends the second voice information to the second digital voice information after being subjected to the amplification filtering process, and the amplified and filtered voice information is performed by the controller component 2; Comparing the amplitude of the voice, determining that the signal of the voice information of the first voice information and the second voice information is larger, thereby determining that the signal of the digital voice information of the one road is larger, and then outputting the digital voice information with a larger control signal to the a main processor component 3, the main processor component 3 is based on an analog to digital conversion unit that outputs the digital voice information, The language corresponding to the digital voice information and the language to be translated may be known; the language corresponding to the first user is X and the language corresponding to the second user is Y, and the user may operate the corresponding APP application corresponding to the first selection. The pickup 11 corresponds to the language X and the second pickup 12 corresponds to the language Y. The system automatically sets the language of the translation to X and Y. Since the first user is facing the first pickup 11, when the first user speaks, the first user A voice message is sent to the main processor component 3, and the main processor component 3 can translate the first voice information from X to Y. Similarly, when the second user speaks, the main processor component 3 can The second voice information is translated from Y to X, which can automatically recognize the user's language and automatically translate.
当然,以第一用户对应的语种为X而第二用户对应的语种为Y 为例,第一用户正对第一拾音器11讲出第一语种唤醒词而第二用户正对第二拾音器12讲出第二语种唤醒词,主处理器组件3即可根据第一语种唤醒词和第二语种唤醒词确定第一用户对应语种X而第二用户对应语种Y,系统自动将此次翻译的语种设为X与Y 互译,由于第一用户正对第一拾音器11,所以当第一用户讲话时,第一语音信息会被发送至主处理器组件3,主处理器组件3即可将第一语音信息由X翻译为Y,同理,当第二用户讲话时,主处理器组件3即可将第二语音信息由Y翻译为X,能自动识别用户的语种并自动进行翻译。Of course, the language corresponding to the first user is X and the language corresponding to the second user is Y. The first user is speaking the first language wake-up word to the first pickup 11 and the second user is talking to the second pickup 12 The second language wake-up word, the main processor component 3 can determine the first user corresponding language X according to the first language wake-up word and the second language wake-up word, and the second user corresponding language Y, the system automatically sets the translated language For X and Y translation, since the first user is facing the first pickup 11, when the first user speaks, the first voice information is sent to the main processor component 3, and the main processor component 3 can be the first The voice information is translated from X to Y. Similarly, when the second user speaks, the main processor component 3 can translate the second voice information from Y to X, can automatically recognize the user's language and automatically translate.
在一个可选实施例中,控制器组件2包括比较单元21以及与比较单元21连接的控制单元22,比较单元21与第一放大滤波单元13和第二放大滤波单元14连接,用于比较第一语音信息和第二语音信息的语音幅值大小并输出比较信号至控制单元22;控制单元22与第一模数转换单元51和第二模数转换单元52连接,用于根据比较信号控制第一模数转换单元51和第二模数转换单元52中语音幅值较大的输出数字语音信息至主处理器组件3。In an alternative embodiment, the controller component 2 comprises a comparison unit 21 and a control unit 22 connected to the comparison unit 21, the comparison unit 21 being connected to the first amplification filtering unit 13 and the second amplification filtering unit 14, for comparing a voice amplitude of the voice information and the second voice information and outputting a comparison signal to the control unit 22; the control unit 22 is coupled to the first analog to digital conversion unit 51 and the second analog to digital conversion unit 52 for controlling the first signal according to the comparison signal An analog-to-digital conversion unit 51 and a second analog-to-digital conversion unit 52 output digital voice information having a large amplitude of speech to the main processor unit 3.
在实施时,当本发明自动翻译装置放置于第一用户和第二用户中间时,第一拾音器11和第二拾音器12分别正对第一用户和第二用户,第一用户正对第一拾音器11讲出对应第一语种的语种唤醒词,第一拾音器11和第二拾音器12均会接收到该语种唤醒词,但是由于第一拾音器11采集到的第一语音信息的信号较大,从而确定第一拾音器11对应的是第一语种,同理,第二用户正对第二拾音器12讲出对应第二语种的语种唤醒词,第一拾音器11和第二拾音器12均会接收到该语音唤醒词,但是由于第二拾音器12采集到的第二语音信息的信号较大,从而确定第二拾音器12对应的是第二语种。In implementation, when the automatic translation apparatus of the present invention is placed between the first user and the second user, the first pickup 11 and the second pickup 12 are respectively facing the first user and the second user, and the first user is facing the first pickup 11 speaking the language wake-up words corresponding to the first language, the first pickup 11 and the second pickup 12 both receive the language wake-up words, but the signal of the first voice information collected by the first pickup 11 is larger, thereby determining The first pickup 11 corresponds to the first language. Similarly, the second user is speaking the second wake-up word corresponding to the second language, and the first pickup 11 and the second pickup 12 both receive the voice wake-up. The word, but because the signal of the second voice information collected by the second pickup 12 is large, it is determined that the second pickup 12 corresponds to the second language.
在具体实施时,还可以通过在系统中设置静音检测门限,当检测到第一用户的第一语音信息的语音信号幅度超过静音检测门限时,如果系统识别到第一语音信息为第一语种的语种唤醒词,则判断此时信号较大的第一语音信息对应第一语种,同样地,当第二语音信息的语音信号幅度超过静音检测门限时,如果系统识别到第二语音信息为第二语种的语种唤醒词,则判断此时信号较大的第二语音信息对应第二语种,系统自动将此次翻译的语种设为第一语种和第二语种互译,并且将第一语音信息为第一语种翻译为第二语种,第二语音信息为第二语种翻译为第一语种,并自动启动对应的翻译功能。In a specific implementation, the silence detection threshold may also be set in the system. When the voice signal amplitude of the first voice information of the first user is detected to exceed the silence detection threshold, if the system recognizes that the first voice information is in the first language, In the language awakening word, it is determined that the first voice information having a larger signal corresponds to the first language, and similarly, when the voice signal amplitude of the second voice information exceeds the silence detection threshold, if the system recognizes that the second voice information is the second In the language of the language, the second speech information corresponding to the signal is determined to correspond to the second language, and the system automatically sets the language of the translation as the first language and the second language, and the first voice information is The first language is translated into the second language, and the second voice information is translated into the first language for the second language, and the corresponding translation function is automatically activated.
当第一用户正对第一拾音器11讲话时,第一语音信息经过第一放大滤波处理单元13进行放大滤波后发送至第一模数转换单元51转换为第一数字语音信息,而第二语音信息经过第二放大滤波处理单元14进行放大滤波后发送至第二模数转换单元52转换为第二数字语音信息,同时,第一放大滤波单元13和第二放大滤波单元14与比较单元21连接,比较单元21接收经过放大滤波处理后的第一语音信息和第二语音信息并进行比较,判断哪一方的信号较大,并通过控制单元22控制信号较大的一方的数字语音信息传输给主处理器组件3进行处理。When the first user is speaking to the first pickup 11, the first voice information is amplified and filtered by the first amplification filtering processing unit 13 and then sent to the first analog to digital conversion unit 51 for conversion to the first digital voice information, and the second voice. The information is amplified and filtered by the second amplification filtering processing unit 14 and then sent to the second analog-to-digital conversion unit 52 for conversion to the second digital voice information. Meanwhile, the first amplification filtering unit 13 and the second amplification filtering unit 14 are connected to the comparison unit 21. The comparing unit 21 receives the first speech information and the second speech information subjected to the amplification filtering process and compares them to determine which one of the signals is large, and transmits the digital voice information of the larger one of the signals by the control unit 22 to the main unit. Processor component 3 performs the processing.
在具体实施时,当翻译输出时,主处理器组件3还可以输出信号至控制单元22以通过控制单元22控制模数转换组件5停止转换,使讲话与翻译输出为半双工状态,避免相互影响。In a specific implementation, when translating the output, the main processor component 3 can also output a signal to the control unit 22 to control the analog-to-digital conversion component 5 to stop the conversion by the control unit 22, so that the speech and translation outputs are half-duplex, avoiding each other. influences.
在另一个可选实施例中,模数转换组件5还可以设计成只有第一模数转换单元51,第一模数转换单元51与控制单元22以及主处理器组件3连接,控制单元22与第一放大滤波单元13和第二放大滤波单元14连接,第一语音信息和第二语音信息经过放大滤波和幅度比较后,控制单元22控制第一放大滤波单元13和第二放大滤波单元14中语音信息幅度较大的一路语音信息传输给第一模数转换单元51进行模数转换,同时控制单元22输出一高低电平信号至主处理器组件3,以通知主处理器组件3正在接收的是哪一方的语音信号,使得主处理器组件3可以确定该语音信号是第一语音信号还是第二语音信号,从而实现语种自动识别及翻译功能,降低成本投入。当然,主处理器组件3还可以输出信号至控制单元22,通过控制单元22控制第一模数转换单元51停止转换,使讲话与翻译输出为半双工状态,避免相互影响。In another alternative embodiment, the analog to digital conversion component 5 can also be designed to have only the first analog to digital conversion unit 51, the first analog to digital conversion unit 51 is connected to the control unit 22 and the main processor component 3, and the control unit 22 The first amplification filtering unit 13 and the second amplification filtering unit 14 are connected. After the first speech information and the second speech information are subjected to amplification filtering and amplitude comparison, the control unit 22 controls the first amplification filtering unit 13 and the second amplification filtering unit 14 The voice information with a large amplitude of the voice information is transmitted to the first analog-to-digital conversion unit 51 for analog-to-digital conversion, and the control unit 22 outputs a high-low level signal to the main processor component 3 to notify the main processor component 3 that the main processor component 3 is receiving. Which one is the voice signal, so that the main processor component 3 can determine whether the voice signal is the first voice signal or the second voice signal, thereby realizing automatic language recognition and translation functions, and reducing the cost investment. Of course, the main processor component 3 can also output a signal to the control unit 22, and the first analog-to-digital conversion unit 51 is controlled by the control unit 22 to stop the conversion, so that the speech and translation are output to a half-duplex state, thereby avoiding mutual influence.
另一方面,如图5所示,本发明还提供一种自动翻译方法,应用于如上述的自动翻译装置中,包括:On the other hand, as shown in FIG. 5, the present invention further provides an automatic translation method, which is applied to the automatic translation apparatus as described above, and includes:
步骤S1,获取第一拾音器采集的第一语音信息以及第二拾音器采集的第二语音信息;Step S1, acquiring first voice information collected by the first pickup and second voice information collected by the second pickup;
步骤S2,判断第一语音信息的语音幅值是否大于第二语音信息的语音幅值;Step S2, determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information;
步骤S3,若是,则将第一语音信息由第一语种翻译为第二语种,以生成第一翻译信息,否则将第二语音信息由第二语种翻译为第一语种,以生成第二翻译信息,其中,第一语种和第二语种分别对应第一语音信息和第二语音信息。Step S3, if yes, translating the first speech information from the first language to the second language to generate the first translation information, otherwise translating the second speech information from the second language to the first language to generate the second translation information The first language and the second language respectively correspond to the first voice information and the second voice information.
在实施时,第一拾音器和第二拾音器采用单指向性拾音器并彼此反向安装,通过获取第一语音信息和第二语音信息,并判断第一语音信息的语音幅值是否大于第二语音信息的语音幅值,若是,则判断第一用户在讲话,由于第一用户对应第一语种而第二用户对应第二语种,则可以将第一语音信息由第一语种向第二语种进行翻译生成第一翻译信息,同理,若第一语音信息的语音幅值小于第二语音信息的语音幅值,则判断第二用户在讲话,将第二语音信息由第二语种向第一语种进行翻译生成第二翻译信息,实现第一语种和第二语种之间的互译功能,其中,第一语种和第二语音分别由第一用户和第二用户通过翻译机进行选择。In implementation, the first pickup and the second pickup adopt a unidirectional pickup and are installed opposite to each other, by acquiring the first voice information and the second voice information, and determining whether the voice amplitude of the first voice information is greater than the second voice information. The voice amplitude, if yes, determining that the first user is speaking, because the first user corresponds to the first language and the second user corresponds to the second language, the first voice information can be translated from the first language to the second language. The first translation information, similarly, if the voice amplitude of the first voice information is smaller than the voice amplitude of the second voice information, determining that the second user is speaking, and translating the second voice information from the second language to the first language Generating the second translation information to implement a translation function between the first language and the second language, wherein the first language and the second voice are respectively selected by the first user and the second user through the translation machine.
本实施例采用两路单指向性拾音器分别接收交谈双方的语音,由于一方讲话时,正对讲话者的拾音器输出的信号要大于另一个拾音器输出的信号,通过比较器比较很容易区分是哪一方在讲话,不易误动作,并仅将讲话一方的语音信息发送至主处理器组件进行翻译处理,单指向性拾音器还有助于降低周围噪声的影响,提高翻译准确度以及翻译输出效果,提高用户体验。In this embodiment, the two-way unidirectional pickups respectively receive the voices of both parties of the conversation. When one party speaks, the signal output from the speaker of the speaker is larger than the signal output by the other pickup, and it is easy to distinguish which party is by the comparator. In speech, it is not easy to malfunction, and only the voice information of the speaking party is sent to the main processor component for translation processing. The unidirectional pickup also helps to reduce the influence of surrounding noise, improve the translation accuracy and the translation output effect, and improve the user. Experience.
在一个实施例中,获取第一拾音器采集的第一语音信息以及第二拾音器采集的第二语音信息的步骤之前,包括:In one embodiment, before the step of acquiring the first voice information collected by the first pickup and the second voice information collected by the second pickup, the method includes:
步骤S4,获取第一用户输入的第一唤醒语音信息以及第二用户输入的第二唤醒语音信息;Step S4, acquiring first wake-up voice information input by the first user and second wake-up voice information input by the second user;
步骤S5,根据预设语音库获取对应第一唤醒语音信息的第一语种和对应第二唤醒语音信息的第二语种。Step S5: Acquire a first language corresponding to the first wake-up voice information and a second language corresponding to the second wake-up voice information according to the preset voice library.
在实施时,在正式进行对话交谈之前,用户可以通过唤醒词来设置各自的语种,具体地,第一用户正对第一拾音器讲出第一语种的语种唤醒词,第一拾音器采集第一用户的第一唤醒语音信息,然后根据第一唤醒语音信息和预设语音库获取第一用户的第一语种,同理,第二用户正对第二拾音器讲出第二语种的语种唤醒词,第二拾音器采集第二用户的第二唤醒语音信息,然后根据第二唤醒语音信息和预设语音库获取第二用户的第二语种,即可进行接下来的沟通交谈步骤,从讲话双方的第一唤醒语音信息和第二唤醒语音信息中的语种唤醒词,即可自动判断讲话双方各自的语种以及需要翻译的语种,从而省去了人工设置各自语种和按键翻译的繁琐操作,提高用户体验。In the implementation, before the formal conversation conversation, the user can set the respective language by the wake-up words. Specifically, the first user is speaking the language wake-up words of the first language to the first pickup, and the first pickup collects the first user. The first wake-up voice information, and then acquire the first language of the first user according to the first wake-up voice information and the preset voice library. Similarly, the second user is speaking the second-language language wake-up word to the second pickup, The second pickup acquires the second wake-up voice information of the second user, and then acquires the second language of the second user according to the second wake-up voice information and the preset voice library, so that the next communication conversation step can be performed, from the first of the two sides of the talk. By awakening the speech information and the language wake-up words in the second wake-up speech information, the respective languages of the speech and the languages to be translated can be automatically determined, thereby eliminating the cumbersome operation of manually setting the respective languages and key translations, and improving the user experience.
在一个可选实施例中,根据预设语音库获取对应第一唤醒语音信息的第一语种和对应第二唤醒语音信息的第二语种的步骤,包括:In an optional embodiment, the step of acquiring the first language corresponding to the first wake-up voice information and the second language corresponding to the second wake-up voice information according to the preset voice library includes:
步骤S51,判断第一唤醒语音信息和第二唤醒语音信息的语音幅值是否均大于预设静音检测门限值;Step S51, determining whether the voice amplitudes of the first wake-up voice information and the second wake-up voice information are both greater than a preset silence detection threshold;
步骤S52,若是,则获取第一唤醒语音信息中的第一语种唤醒词和第二唤醒语音信息中的第二语种唤醒词;Step S52, if yes, acquiring a second language wake-up word in the first language wake-up word and the second wake-up voice information in the first wake-up voice information;
步骤S53,根据预设语音库获取与第一唤醒词匹配的第一语种以及与第二唤醒词匹配的第二语种。Step S53: Acquire a first language that matches the first wake-up word and a second language that matches the second wake-up word according to the preset voice library.
在实施时,通过设置预设静音检测(VAD)门限值,当分别由第一拾音器和第二拾音器采集到的第一唤醒语音信息和第二唤醒语音信息的语音幅值超过预设静音检测门限值时,系统才会进行语种识别,另外,分别由第一拾音器和第二拾音器采集到的第一语音信息和第二语音信息的语音幅值超过预设静音检测门限值时,系统才会进行翻译功能,避免因用户的喘息声或者周围环境声音而造成的误操作,提高语音识别以及语种识别的准确度。In implementation, by setting a preset silence detection (VAD) threshold, when the amplitudes of the first wake-up voice information and the second wake-up voice information respectively collected by the first pickup and the second pickup exceed the preset silence detection When the threshold is used, the system performs the language recognition. In addition, when the amplitudes of the first voice information and the second voice information collected by the first pickup and the second pickup respectively exceed the preset silence detection threshold, the system The translation function will be performed to avoid the misoperation caused by the user's breathing or the surrounding environment, and improve the accuracy of speech recognition and language recognition.
在一个可选实施例中,在根据预设语音库获取与第一唤醒词匹配的第一语种以及与第二唤醒词匹配的第二语种的步骤之后,还包括:In an optional embodiment, after the step of acquiring the first language matching the first wake-up word and the second language matching the second wake-up word according to the preset voice library, the method further includes:
步骤S54,分别以第一语种和第二语种语音播放互译功能开启提示音。In step S54, the prompt sound is turned on by playing the translation function in the first language and the second language respectively.
在实施时,自动翻译装置在获取第一语种以及第二语种之后,会通过语音提示的方式提醒用户当前已开启互译功能,具体地,自动翻译装置分别通过第一语种和第二语种进行语音播报互译功能开启提示音,以告知用户可以进行开启对话。在另一个可选的实施例中,自动翻译装置还可以在显示屏上分别通过第一语种和第二语种进行文字显示互译功能开启,以告知用户可以进行开启对话。In the implementation, after the first language and the second language are acquired, the automatic translation device reminds the user that the mutual translation function is currently enabled by means of a voice prompt. Specifically, the automatic translation device performs voice through the first language and the second language respectively. The broadcast translation function turns on the prompt tone to inform the user that the conversation can be opened. In another optional embodiment, the automatic translation device may further enable the text display translation function by using the first language and the second language on the display screen to inform the user that the conversation can be opened.
在一个可选的实施例中,判断第一语音信息的语音幅值是否大于第二语音信息的语音幅值的步骤,包括:In an optional embodiment, the step of determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information includes:
步骤S21,将第一语音信息和第二语音信息分别进行放大滤波;Step S21, performing amplification filtering on the first voice information and the second voice information respectively;
步骤S22,判断放大滤波处理后的第一语音信息的语音幅值是否大于放大滤波处理后的第二语音信息的语音幅值。Step S22, it is determined whether the amplitude of the voice of the first voice information after the amplification and filtering process is greater than the voice amplitude of the second voice information after the amplification and filtering process.
在实施时,自动翻译装置通过第一放大滤波单元接收第一拾音器获取的第一语音信息,并经过放大滤波处理后进行输出;通过第二放大滤波单元接收第二拾音器获取的第二语音信息,并经过放大滤波处理后进行输出;最后比较经过放大滤波处理后第一语音信息和第二语音信息的语音幅值大小。在一些实施例中,在经过比较之后,会得出具体的比较信号,用于供控制单元根据比较信号控制第一模数转换单元和第二模式转换单元中语音幅值较大的输出数字语音信息至主处理器组件。语音信息在经过放大滤波处理后能有效去除语音信息中的杂音以及干扰信息,提高语音识别以及语种识别的准确度。In an implementation, the automatic translation device receives the first voice information acquired by the first pickup through the first amplification filtering unit, and performs output after the amplification filtering process; and receives the second voice information acquired by the second pickup through the second amplification filtering unit, And after the amplification filtering process, the output is performed; finally, the amplitude of the speech of the first speech information and the second speech information after the amplification filtering process is compared. In some embodiments, after comparison, a specific comparison signal is obtained, for the control unit to control, according to the comparison signal, the output digital speech with a large amplitude of the speech in the first analog to digital conversion unit and the second mode conversion unit according to the comparison signal. Information to the main processor component. After the amplification and filtering process, the voice information can effectively remove the noise and interference information in the voice information, and improve the accuracy of voice recognition and language recognition.
在一个可选的实施例中,在将第一语音信息和第二语音信息分别进行放大滤波处理的步骤之后,还包括:In an optional embodiment, after the step of performing the amplification filtering process on the first voice information and the second voice information respectively, the method further includes:
步骤S23,将经过放大滤波处理后的第一语音信息转换为第一数字语音信息以及将经过放大滤波处理后的第二语音信息转换为第二数字语音信息。Step S23, converting the first voice information after the amplification and filtering processing into the first digital voice information and converting the second voice information after the amplification and filtering processing into the second digital voice information.
在实施时,自动翻译装置将经过放大滤波处理后的第一语音信息转换为第一数字语音信息,将经过放大滤波处理后的第二语音信息转换为第二数字语音信息,在一些实施例中,第一数字语音信息和第二数字语音信息都为数字信号。在具体应用中,拾音器通过一般的模拟电路放大麦克风采集到的语音信息,然后通过模数转换组件将语音信息进行模数转换后输出数字语音信息至主处理器进行翻译,能有效提高识别语音信息的准确度,提高产品质量。In an implementation, the automatic translation device converts the first voice information after the amplification and filtering processing into the first digital voice information, and converts the second voice information after the amplification and filtering processing into the second digital voice information, in some embodiments. The first digital voice information and the second digital voice information are both digital signals. In a specific application, the pickup amplifies the voice information collected by the microphone through a general analog circuit, and then performs analog-to-digital conversion of the voice information through the analog-to-digital conversion component, and then outputs the digital voice information to the main processor for translation, which can effectively improve the recognition voice information. Accuracy and improve product quality.
在一个可选的实施例中,若是,则将第一语音信息由第一语种翻译为第二语种,以生成第一翻译信息,否则将第二语音信息由第二语种翻译为第一语种,以生成第二翻译信息,其中,第一语种和第二语种分别对应第一语音信息和第二语音信息的步骤,包括:In an optional embodiment, if yes, the first voice information is translated from the first language to the second language to generate the first translation information, otherwise the second voice information is translated from the second language into the first language. The step of generating the second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second voice information, including:
步骤S31,若所述第一语音信息的语音幅值大于所述第二语音信息的语音幅值,则将所述第一数字语音信息由所述第一语种翻译为所述第二语种生成所述第一翻译信息,否则将所述第二数字语音信息由所述第二语种翻译为所述第一语种生成所述第二翻译信息,其中,所述第一语种和所述第二语种分别对应所述第一数字语音信息和所述第二数字语音信息。Step S31, if the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information, translating the first digital voice information from the first language to the second language generation Decoding the first translation information, otherwise translating the second digital speech information from the second language into the first language to generate the second translation information, wherein the first language and the second language respectively Corresponding to the first digital voice information and the second digital voice information.
在实施时,具体地,通过比较信号判断第一语音信息和第二语音信息中那一路语音信息的信号较大,从而确定那一路的数字语音信息的信号较大,然后控制信号较大的一路数字语音信息输出,主处理器组件根据输出该数字语音信息的模数转换单元,即可知道该数字语音信息对应的语种以及需要进行翻译的语种。In the implementation, specifically, the comparison signal determines that the signal of the voice information of the first voice information and the second voice information is larger, thereby determining that the signal of the digital voice information of the one channel is larger, and then the control signal is larger. The digital voice information is output, and the main processor component can know the language corresponding to the digital voice information and the language to be translated according to the analog-to-digital conversion unit that outputs the digital voice information.
在一个可选的实施例中,在若是,则将第一语音信息由第一语种翻译为第二语种生成第一翻译信息,否则将第二语音信息由第二语种翻译为第一语种生成第二翻译信息,其中,第一语种和第二语种分别对应第一语音信息和第二语音信息的步骤之后,还包括:In an optional embodiment, if yes, the first voice information is translated from the first language to the second language to generate the first translation information, otherwise the second voice information is translated from the second language to the first language generation The second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second voice information, the method further includes:
步骤S6,将第一翻译信息或第二翻译信息以翻译语音信息和/或翻译文字信息的方式输出。In step S6, the first translation information or the second translation information is outputted in a manner of translating the voice information and/or translating the text information.
在实施时,自动翻译装置将第一语音信息由第一语种翻译成第二语种生成第一翻译信息以及将第二语音信息由第二语种翻译成第一语种生成第二翻译信息后,最后通过扬声器播放翻译语音信息或者通过显示屏显示翻译文字信息,这样第二用户即可与第一用户进行沟通交流,翻译信息输出方式多样,当扬声器出现故障而不工作时,第一用户和第二用户还可以通过显示屏进行沟通交流,提高用户体验。In implementation, the automatic translation device converts the first voice information from the first language into the second language to generate the first translation information and the second voice information from the second language to the first language to generate the second translation information, and finally passes The speaker plays the translated voice information or displays the translated text information through the display screen, so that the second user can communicate with the first user, and the translation information is output in various ways. When the speaker fails and does not work, the first user and the second user You can also communicate through the display to improve the user experience.
如图6所示,本发明还提出了一种计算机设备,包括存储器1003和处理器1002,存储器1003存储有计算机程序1004,处理器1002执行计算机程序1004时实现上述中任一项方法的步骤,包括:获取第一拾音器采集的第一语音信息以及第二拾音器采集的第二语音信息;判断第一语音信息的语音幅值是否大于第二语音信息的语音幅值;若是,则将第一语音信息由第一语种翻译为第二语种生成第一翻译信息,否则将第二语音信息由第二语种翻译为第一语种生成第二翻译信息,其中,第一语种和第二语种分别对应第一语音信息和第二语音信息。As shown in FIG. 6, the present invention further provides a computer device including a memory 1003 and a processor 1002. The memory 1003 stores a computer program 1004. When the processor 1002 executes the computer program 1004, the steps of any of the above methods are implemented. The method includes: acquiring first voice information collected by the first pickup and second voice information collected by the second sounder; determining whether a voice amplitude of the first voice information is greater than a voice amplitude of the second voice information; if yes, using the first voice The information is translated from the first language into the second language to generate the first translation information, otherwise the second voice information is translated from the second language into the first language to generate the second translation information, wherein the first language and the second language respectively correspond to the first language Voice information and second voice information.

Claims (18)

  1. 一种自动翻译装置,其特征在于,包括:An automatic translation device, comprising:
    拾音器组件,包括反向安装的第一拾音器和第二拾音器,所述第一拾音器用于采集第一语音信息,所述第二拾音器用于采集第二语音信息;a pickup assembly comprising a first mounted pickup and a second pickup, the first pickup for collecting first voice information, and the second pickup for collecting second voice information;
    控制器组件,与所述第一拾音器和所述第二拾音器连接,用于接收并比较所述第一语音信息和所述第二语音信息的语音幅值大小,并控制所述拾音器组件输出语音幅值较大的一路语音信息;a controller component, coupled to the first pickup and the second pickup, for receiving and comparing a magnitude of a voice amplitude of the first voice information and the second voice information, and controlling the sound output of the pickup component a voice information with a large amplitude;
    主处理器组件,与所述拾音器组件连接,用于接收所述拾音器组件输出的所述语音幅值较大的一路语音信息,并根据预确定的第一语种和第二语种,对所述语音幅值较大的一路语音信息进行翻译,生成翻译信息,所述第一语种和所述第二语种分别对应所述第一语音信息和所述第二语音信息。 a main processor component, coupled to the pickup component, configured to receive a voice information of a larger amplitude of the voice output by the pickup component, and to the voice according to the predetermined first language and the second language The voice information of the larger amplitude is translated to generate translation information, and the first language and the second language respectively correspond to the first voice information and the second voice information.
  2. 根据权利要求 1所述的自动翻译装置,其特征在于,还包括:The automatic translation apparatus according to claim 1, further comprising:
    翻译输出组件,与所述主处理器组件连接,用于输出所述翻译信息。A translation output component coupled to the main processor component for outputting the translation information.
  3. 根据权利要求2所述的自动翻译装置,其特征在于,所述翻译输出组件包括扬声器,所述扬声器用于输出所述翻译信息中的翻译语音信息。The automatic translation apparatus according to claim 2, wherein said translation output component comprises a speaker for outputting translated voice information in said translation information.
  4. 根据权利要求2所述的自动翻译装置,其特征在于,所述翻译输出组件包括显示屏,所述显示屏用于输出所述翻译信息中的翻译文字信息。The automatic translation apparatus according to claim 2, wherein said translation output component comprises a display screen for outputting translated text information in said translation information.
  5. 根据权利要求1所述的自动翻译装置,其特征在于:所述第一拾音器和所述第二拾音器采用单指向性拾音器。The automatic translation apparatus according to claim 1, wherein said first pickup and said second pickup employ a unidirectional pickup.
  6. 根据权利要求1至5任一项所述的自动翻译装置,其特征在于:所述拾音器组件还包括第一放大滤波单元以及第二放大滤波单元,所述第一放大滤波单元连接至所述第一拾音器的输出端,用于接收所述第一语音信息并经过放大滤波处理后进行输出;所述第二放大滤波单元连接至所述第二拾音器的输出端,用于接收所述第二语音信息并经过放大滤波处理后进行输出。The automatic translation apparatus according to any one of claims 1 to 5, wherein the pickup assembly further includes a first amplification filtering unit and a second amplification filtering unit, wherein the first amplification filtering unit is connected to the first An output end of a pickup for receiving the first voice information and outputting after being subjected to amplification filtering processing; the second amplification filtering unit is connected to an output end of the second pickup for receiving the second voice The information is output after being amplified and filtered.
  7. 根据权利要求6所述的自动翻译装置,其特征在于,还包括:The automatic translation apparatus according to claim 6, further comprising:
    模数转换组件,与所述第一放大滤波单元和第二放大滤波单元连接,用于接收经过放大滤波处理后进行输出的所述第一语音信息和所述第二语音信息,并分别将所述第一语音信息和所述第二语音信息转换为第一数字语音信息和第二数字语音信息;所述模数转换组件还与所述控制器组件和所述主处理器组件连接,用于接收并根据所述控制器组件输出的控制信号输出所述第一数字语音信息或所述第二数字语音信息至所述主处理器组件。An analog-to-digital conversion component is connected to the first amplification filtering unit and the second amplification filtering unit, and configured to receive the first voice information and the second voice information that are output after being subjected to amplification filtering processing, and respectively Converting the first voice information and the second voice information into first digital voice information and second digital voice information; the analog to digital conversion component is further connected to the controller component and the main processor component, Receiving and outputting the first digital voice information or the second digital voice information to the main processor component according to a control signal output by the controller component.
  8. 根据权利要求7所述的自动翻译装置,其特征在于:所述模数转换组件包括第一模数转换单元和第二模数转换单元,所述第一模数转换单元与所述第一放大滤波单元连接,用于接收经过放大滤波处理后进行输出的所述第一语音信息并转换为第一数字语音信息;所述第二模数转换单元与所述第二放大滤波单元连接,用于接收经过放大滤波处理后进行输出的所述第二语音信息并转换为第二数字语音信息。The automatic translation apparatus according to claim 7, wherein said analog to digital conversion unit comprises a first analog to digital conversion unit and a second analog to digital conversion unit, said first analog to digital conversion unit and said first amplification a filtering unit connection, configured to receive the first voice information outputted after being subjected to amplification filtering processing, and converted into first digital voice information; the second analog-to-digital conversion unit is connected to the second amplification filtering unit, and configured to The second voice information outputted after being subjected to the amplification filtering process is received and converted into second digital voice information.
  9. 根据权利要求8所述的自动翻译装置,其特征在于:所述控制器组件包括比较单元以及与所述比较单元连接的控制单元,所述比较单元与所述第一放大滤波单元和所述第二放大滤波单元连接,用于比较所述第一语音信息和第二语音信息的语音幅值大小并输出比较信号;所述控制单元与所述第一模数转换单元和所述第二模数转换单元连接,用于根据所述比较信号控制所述第一模数转换单元和所述第二模式转换单元中语音幅值较大的输出数字语音信息至所述主处理器组件。The automatic translation apparatus according to claim 8, wherein said controller component comprises a comparison unit and a control unit connected to said comparison unit, said comparison unit and said first amplification filter unit and said a second amplification filtering unit connection, configured to compare a magnitude of the voice amplitude of the first voice information and the second voice information, and output a comparison signal; the control unit and the first analog to digital conversion unit and the second modulus And a conversion unit connection, configured to control, according to the comparison signal, output digital voice information having a large voice amplitude in the first analog-to-digital conversion unit and the second mode conversion unit to the main processor component.
  10. 一种自动翻译方法,应用于如权利要求1至9中任一项所述的自动翻译装置中,其特征在于,包括:An automatic translation method for use in an automatic translation apparatus according to any one of claims 1 to 9, characterized in that it comprises:
    获取所述第一拾音器采集的所述第一语音信息以及所述第二拾音器采集的所述第二语音信息;Acquiring the first voice information collected by the first pickup and the second voice information collected by the second pickup;
    判断所述第一语音信息的语音幅值是否大于所述第二语音信息的语音幅值;Determining whether a voice amplitude of the first voice information is greater than a voice amplitude of the second voice information;
    若是,则将所述第一语音信息由所述第一语种翻译为所述第二语种,以生成第一翻译信息,否则将所述第二语音信息由所述第二语种翻译为所述第一语种,以生成第二翻译信息,其中,所述第一语种和所述第二语种分别对应所述第一语音信息和所述第二语音信息。If yes, translating the first voice information from the first language to the second language to generate first translation information, otherwise translating the second voice information from the second language to the first a language to generate second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second voice information.
  11. 根据权利要求10所述的自动翻译方法,其特征在于,所述获取所述第一拾音器采集的所述第一语音信息以及所述第二拾音器采集的所述第二语音信息的步骤之前,包括:The automatic translation method according to claim 10, wherein the step of acquiring the first voice information collected by the first pickup and the second voice information collected by the second pickup includes :
    获取第一用户输入的第一唤醒语音信息以及第二用户输入的第二唤醒语音信息;Obtaining first wake-up voice information input by the first user and second wake-up voice information input by the second user;
    根据预设语音库获取对应所述第一唤醒语音信息的所述第一语种和对应所述第二唤醒语音信息的所述第二语种。Acquiring the first language corresponding to the first wake-up voice information and the second language corresponding to the second wake-up voice information according to a preset voice library.
  12. 根据权利要求11所述的自动翻译方法,其特征在于,所述根据预设语音库获取对应所述第一唤醒语音信息的所述第一语种和对应所述第二唤醒语音信息的所述第二语种的步骤,包括:The automatic translation method according to claim 11, wherein the acquiring the first language corresponding to the first wake-up voice information and the first corresponding to the second wake-up voice information according to a preset voice library Second language steps, including:
    判断所述第一唤醒语音信息和所述第二唤醒语音信息的语音幅值是否均大于预设静音检测门限值;Determining whether the voice amplitudes of the first wake-up voice information and the second wake-up voice information are greater than a preset silence detection threshold;
    若是,则获取所述第一唤醒语音信息中的第一语种唤醒词和所述第二唤醒语音信息中的第二语种唤醒词;If yes, acquiring a first language wake-up word in the first wake-up voice information and a second language wake-up word in the second wake-up voice information;
    根据所述预设语音库,获取与所述第一唤醒词匹配的所述第一语种以及与所述第二唤醒词匹配的所述第二语种。And acquiring, according to the preset voice library, the first language that matches the first wake-up word and the second language that matches the second wake-up word.
  13. 根据权利要求12所述的自动翻译方法,其特征在于,在所述根据所述预设语音库,获取与所述第一唤醒词匹配的所述第一语种以及与所述第二唤醒词匹配的所述第二语种的步骤之后,还包括:The automatic translation method according to claim 12, wherein the first language matching the first wake-up word and the second wake-up word are matched according to the preset voice library After the step of the second language, the method further includes:
    分别以所述第一语种和所述第二语种对应的语音播放互译功能开启提示音。The prompt sound is turned on by the voice play translation function corresponding to the first language and the second language, respectively.
  14. 根据权利要求10所述的自动翻译方法,其特征在于,所述判断所述第一语音信息的语音幅值是否大于所述第二语音信息的语音幅值的步骤,包括:The automatic translation method according to claim 10, wherein the step of determining whether the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information comprises:
    将所述第一语音信息和所述第二语音信息分别进行放大滤波处理;Performing amplification filtering processing on the first voice information and the second voice information respectively;
    判断放大滤波处理后的所述第一语音信息对应的语音幅值是否大于放大滤波处理后的所述第二语音信息对应的语音幅值。And determining whether the voice amplitude corresponding to the first voice information after the amplification filtering process is greater than the voice amplitude corresponding to the second voice information after the amplification filtering process.
  15. 根据权利要求14所述的自动翻译方法,其特征在于,在所述将所述第一语音信息和所述第二语音信息分别进行放大滤波处理的步骤之后,还包括:The automatic translation method according to claim 14, wherein after the step of performing the amplification and filtering processing on the first voice information and the second voice information, the method further comprises:
    将经过放大滤波处理后的所述第一语音信息转换为第一数字语音信息,并将经过放大滤波处理后的第二语音信息转换为第二数字语音信息。Converting the first voice information after the amplification and filtering processing into first digital voice information, and converting the second voice information after the amplification and filtering processing into the second digital voice information.
  16. 根据权利要求15所述的自动翻译方法,其特征在于,所述若是,则将所述第一语音信息由所述第一语种翻译为所述第二语种生成第一翻译信息,否则将所述第二语音信息由所述第二语种翻译为所述第一语种生成第二翻译信息,其中,所述第一语种和所述第二语种分别对应所述第一语音信息和所述第二语音信息的步骤,包括:The automatic translation method according to claim 15, wherein if yes, the first voice information is translated from the first language to the second language to generate first translation information, otherwise the Translating, by the second language, the second language information into the first language to generate second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second voice The steps of the information, including:
    若所述第一语音信息的语音幅值大于所述第二语音信息的语音幅值,则将所述第一数字语音信息由所述第一语种翻译为所述第二语种,以生成所述第一翻译信息,否则将所述第二数字语音信息由所述第二语种翻译为所述第一语种,以生成所述第二翻译信息,其中,所述第一语种和所述第二语种分别对应所述第一数字语音信息和所述第二数字语音信息。If the voice amplitude of the first voice information is greater than the voice amplitude of the second voice information, translating the first digital voice information from the first language to the second language to generate the First translating information, otherwise translating the second digital speech information from the second language to the first language to generate the second translation information, wherein the first language and the second language Corresponding to the first digital voice information and the second digital voice information, respectively.
  17. 根据权利要求10所述的自动翻译方法,其特征在于,在所述若是,则将所述第一语音信息由所述第一语种翻译为所述第二语种生成第一翻译信息,否则将所述第二语音信息由所述第二语种翻译为所述第一语种生成第二翻译信息,其中,所述第一语种和所述第二语种分别对应所述第一语音信息和所述第二语音信息的步骤之后,还包括:The automatic translation method according to claim 10, wherein if the first voice information is translated from the first language to the second language, the first translation information is generated, otherwise Translating, by the second language, the second language information into the first language to generate second translation information, wherein the first language and the second language respectively correspond to the first voice information and the second language After the steps of voice information, it also includes:
    将所述第一翻译信息或所述第二翻译信息通过翻译语音信息和/或翻译文字信息的输出方式进行输出。The first translation information or the second translation information is output by outputting the voice information and/or the translated text information.
  18. 一种计算机设备,其特征在于,包括存储器、处理器以及存储在所述存储器上并可在所述处理器上运行的计算机程序,所述处理器执行所述计算机程序时实现如权利要求10至17任一项所述的方法。A computer device, comprising: a memory, a processor, and a computer program stored on the memory and operable on the processor, the processor executing the computer program as claimed in claim 10 17. The method of any of the preceding claims.
PCT/CN2019/073534 2018-05-08 2019-01-28 Automatic translation apparatus and method, and computer device WO2019214299A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810432318.5A CN108899018A (en) 2018-05-08 2018-05-08 automatic translation device and method
CN201810432318.5 2018-05-08

Publications (1)

Publication Number Publication Date
WO2019214299A1 true WO2019214299A1 (en) 2019-11-14

Family

ID=64343828

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/073534 WO2019214299A1 (en) 2018-05-08 2019-01-28 Automatic translation apparatus and method, and computer device

Country Status (2)

Country Link
CN (1) CN108899018A (en)
WO (1) WO2019214299A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108899018A (en) * 2018-05-08 2018-11-27 深圳市沃特沃德股份有限公司 automatic translation device and method

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012038612A1 (en) * 2010-09-21 2012-03-29 Pedre Joel Built-in verbal translator having built-in speaker recognition
CN205121555U (en) * 2015-07-06 2016-03-30 北京市振隆科技股份有限公司 Terminal is translated in interactive plurilingual automation
CN106131292A (en) * 2016-06-03 2016-11-16 上海与德通讯技术有限公司 The system of the method for terminal wake-up, awakening method and correspondence is set
CN106486125A (en) * 2016-09-29 2017-03-08 安徽声讯信息技术有限公司 A kind of simultaneous interpretation system based on speech recognition technology
CN106940997A (en) * 2017-03-20 2017-07-11 海信集团有限公司 A kind of method and apparatus that voice signal is sent to speech recognition system
CN107766333A (en) * 2016-08-22 2018-03-06 万德洪 A kind of intelligent translation apparatus, system and method
CN108899018A (en) * 2018-05-08 2018-11-27 深圳市沃特沃德股份有限公司 automatic translation device and method

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH10116093A (en) * 1996-10-09 1998-05-06 Nec Corp Voice recognition device
CN202772966U (en) * 2012-09-03 2013-03-06 上海三旗通信科技股份有限公司 Mobile phone having global barrier-free communication function
CN103970734A (en) * 2014-05-21 2014-08-06 刘业兴 Interactive multi-language automatic interpretation terminal and realizing method thereof
CN105825853A (en) * 2015-01-07 2016-08-03 中兴通讯股份有限公司 Speech recognition device speech switching method and speech recognition device speech switching device
CN107247711B (en) * 2017-06-28 2020-10-02 河南拓恒电子科技有限公司 Bidirectional translation method, mobile terminal and computer readable storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2012038612A1 (en) * 2010-09-21 2012-03-29 Pedre Joel Built-in verbal translator having built-in speaker recognition
CN205121555U (en) * 2015-07-06 2016-03-30 北京市振隆科技股份有限公司 Terminal is translated in interactive plurilingual automation
CN106131292A (en) * 2016-06-03 2016-11-16 上海与德通讯技术有限公司 The system of the method for terminal wake-up, awakening method and correspondence is set
CN107766333A (en) * 2016-08-22 2018-03-06 万德洪 A kind of intelligent translation apparatus, system and method
CN106486125A (en) * 2016-09-29 2017-03-08 安徽声讯信息技术有限公司 A kind of simultaneous interpretation system based on speech recognition technology
CN106940997A (en) * 2017-03-20 2017-07-11 海信集团有限公司 A kind of method and apparatus that voice signal is sent to speech recognition system
CN108899018A (en) * 2018-05-08 2018-11-27 深圳市沃特沃德股份有限公司 automatic translation device and method

Also Published As

Publication number Publication date
CN108899018A (en) 2018-11-27

Similar Documents

Publication Publication Date Title
US10586534B1 (en) Voice-controlled device control using acoustic echo cancellation statistics
JP4557919B2 (en) Audio processing apparatus, audio processing method, and audio processing program
WO2018137704A1 (en) Microphone array-based pick-up method and system
TWI383377B (en) Multi-sensory speech recognition system and method
US7006974B2 (en) Voice controller and voice-controller system having a voice-controller apparatus
US20060028337A1 (en) Voice-operated remote control for TV and electronic systems
US9293134B1 (en) Source-specific speech interactions
US20100063820A1 (en) Correlating video images of lip movements with audio signals to improve speech recognition
US20070276658A1 (en) Apparatus and Method for Detecting Speech Using Acoustic Signals Outside the Audible Frequency Range
JPH096390A (en) Voice recognition interactive processing method and processor therefor
JP2005055668A (en) Speech processing device
JP2009178783A (en) Communication robot and its control method
US6959095B2 (en) Method and apparatus for providing multiple output channels in a microphone
WO2019214299A1 (en) Automatic translation apparatus and method, and computer device
JP2005055667A (en) Audio processing device
JP2005338454A (en) Speech interaction device
JP2005055666A (en) Audio processing device
JP2011199698A (en) Av equipment
CN108337620A (en) A kind of loudspeaker and its control method of voice control
US20210082456A1 (en) Speech processing apparatus and translation apparatus
CN208337877U (en) A kind of loudspeaker of voice control
JP2019139146A (en) Voice recognition system and voice recognition method
JP2010164992A (en) Speech interaction device
JP2003295892A (en) Interpretation system and program
JP3940895B2 (en) Speech recognition apparatus and method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19800016

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19800016

Country of ref document: EP

Kind code of ref document: A1