CN117787295B - Intelligent translation interaction method integrating audio-visual function and intelligent translation terminal - Google Patents

Info

Publication number
CN117787295B
Authority
CN
China
Prior art keywords
translation
language
user
host
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202410202041.2A
Other languages
Chinese (zh)
Other versions
CN117787295A (en)
Inventor
李智彪
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Wooask Technology Co ltd
Original Assignee
Shenzhen Wooask Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Wooask Technology Co ltd
Priority to CN202410202041.2A
Publication of CN117787295A
Application granted
Publication of CN117787295B
Active
Anticipated expiration

Landscapes

  • Machine Translation (AREA)

Abstract

The invention relates to the technical fields of voice recognition, image processing, machine translation, natural language processing, human-computer interaction, network communication, data transmission and the like, and provides an audio-visual integrated intelligent translation interaction method and an intelligent translation terminal.

Description

Intelligent translation interaction method integrating audio-visual function and intelligent translation terminal
Technical Field
The invention relates to the technical fields of machine translation, voice recognition, image processing, natural language processing, human-computer interaction, network communication, data transmission and the like, and in particular to an audio-visual integrated intelligent translation interaction method and an intelligent translation terminal.
Background
Translators are widely used for work and travel abroad; instant translation through a translator lets the user communicate with local people. Currently, translators mainly comprise visual translators and audio-translation Bluetooth headsets. An audio-translation Bluetooth headset typically has only audio processing functions, such as dialogue translation and music playback. A visual translator can not only process audio but also display visual information such as video and pictures through a display screen and a video processing module. For example, the prior art includes a multifunctional intelligent Bluetooth headset comprising a charging device. A Bluetooth headset slot holding a left Bluetooth earphone and a right Bluetooth earphone is provided inside the charging device, a cover is provided at the top of the rear side wall of the charging device, a display screen is provided at the center of the front of the charging device, and a left speaking key, a language selection key and a right speaking key are provided below the display screen. The user can take the Bluetooth headset out of the charging device and use it directly, set the languages used by the two parties in designated application software on an electronic device such as a mobile phone, and then start a conversation. When it is inconvenient to use the multifunctional intelligent Bluetooth headset together with an electronic device, it can be used independently offline: the user takes the Bluetooth headset out of the charging device, operates the device itself, and selects and confirms the languages on the display screen through the language selection key.
One user presses the key on his or her own side and speaks, then releases the key when finished, and the other party's Bluetooth earphone plays the translated speech. When the other user replies, that user likewise presses the key on his or her own side, speaks, and releases the key when finished, and the translated speech is received by the first party's Bluetooth earphone. The Bluetooth headset thus completes the translation task independently, without the electronic device. However, the display screen of this multifunctional intelligent Bluetooth headset can only display and has no touch operation function, which limits the diversity of product functions. Moreover, to select a language the user must press a physical key below the display screen, namely the language selection key, so the operation is neither intuitive nor simple.
In summary, existing Bluetooth headset translation technology suffers from technical problems such as limited functionality and language selection that is neither intuitive nor simple.
Disclosure of Invention
To solve the prior-art technical problems of limited functionality and of language selection that is neither intuitive nor simple, one object of the invention is to provide an audio-visual integrated intelligent translation interaction method and an intelligent translation terminal, so that the terminal offers diversified functions, language selection is intuitive and simple, and the user experience is improved.
In a first aspect, the invention provides an audio-visual integrated intelligent translation interaction method, applied to an intelligent translation terminal comprising a translation host and a Bluetooth headset, wherein a sound pickup and a loudspeaker are arranged on the translation host. The method is characterized in that a touch display screen is also arranged on the translation host, and the method comprises the following steps:
according to a power-on operation signal, the translation host controls the touch display screen to display a translation operation main interface; the power-on operation signal is generated by the user operating a power-on key, the power-on key is arranged on a side face of the translation host, and the touch display screen is arranged on the front face of the translation host;
when a first operation event occurs, the translation host controls the touch display screen to display a first function operation interface, on which translation controls supporting different translation functions are arranged; the first operation event occurs when the user operates the translation operation main interface;
when a second operation event occurs, the translation host translates the current language in the current user voice signal, according to a translation control operation signal and the current user voice signal, to obtain a target language, and controls the loudspeaker or the Bluetooth headset to play the target language; the second operation event occurs when the user selects, from the translation controls for the different translation functions, the translation control for a specific translation function, thereby selecting the specific translation function and the current language to be translated; the translation control operation signal is generated by the user operating a translation control key arranged on a side face of the translation host; when the user presses the translation control key and speaks, the translation host obtains the current user voice signal through the sound pickup.
In a second aspect, the invention provides an intelligent translation terminal, comprising:
a translation host, configured to run the audio-visual integrated intelligent translation interaction method described above;
a Bluetooth headset, which can be fitted into, and charged in, a charging storage slot provided at the top of the translation host, and which can be communicatively connected to the translation host.
Compared with the prior art, the invention has the beneficial effects that:
According to the audio-visual integrated intelligent translation interaction method and the intelligent translation terminal, the translation host controls the touch display screen to display the translation operation main interface according to the power-on operation signal. When a first operation event occurs, the translation host controls the touch display screen to display a first function operation interface on which translation controls supporting different translation functions are arranged. When a second operation event occurs, the translation host translates the current language in the current user voice signal, according to the translation control operation signal and the current user voice signal, to obtain a target language, and controls the loudspeaker or the Bluetooth headset to play the target language. The intelligent translation terminal thus offers diversified functions, language selection is intuitive and simple, and the user experience is improved.
Drawings
FIG. 1 is a schematic flow chart of an intelligent translation interaction method integrated with audio-visual function;
FIG. 2 is a schematic diagram of an interface layout of a translation operation main interface according to the present invention;
FIG. 3 is a schematic diagram of an interface layout of a default selection interface according to the present invention;
FIG. 4 is a schematic diagram of an interface layout of a first functional operation interface according to the present invention;
FIG. 5 is a schematic structural diagram of an intelligent translation terminal according to the present invention.
In the figures:
11. housing; 1111. charging storage slot; 12. outer cover;
3. touch display screen;
4. Bluetooth headset;
6. translation control key.
Detailed Description
The technical solutions of the invention are described clearly and completely below by way of examples. Obviously, the described examples are only some, not all, of the examples of the invention. Descriptions such as "first" and "second" in this disclosure are for descriptive purposes only and are not to be construed as indicating or implying relative importance or as implicitly indicating the number of the technical features referred to. In addition, the technical solutions of the embodiments may be combined with one another, provided that the combination can be realized by those skilled in the art; when a combination of technical solutions is contradictory or cannot be realized, the combination should be considered not to exist and falls outside the scope of protection claimed by the invention.
Example 1
Referring to FIG. 1, this embodiment provides an audio-visual integrated intelligent translation interaction method, applied to an intelligent translation terminal comprising a translation host and a Bluetooth headset, wherein a sound pickup and a loudspeaker are arranged on the translation host. The method comprises the following steps:
S101, according to a power-on operation signal, the translation host controls the touch display screen to display a translation operation main interface; the power-on operation signal is generated by the user operating a power-on key, the power-on key is arranged on a side face of the translation host, and the touch display screen is arranged on the front face of the translation host;
S102, when a first operation event occurs, the translation host controls the touch display screen to display a first function operation interface, on which translation controls supporting different translation functions are arranged; the first operation event occurs when the user operates the translation operation main interface;
S103, when a second operation event occurs, the translation host translates the current language in the current user voice signal, according to a translation control operation signal and the current user voice signal, to obtain a target language, and controls the loudspeaker or the Bluetooth headset to play the target language; the second operation event occurs when the user selects, from the translation controls for the different translation functions, the translation control for a specific translation function, thereby selecting the specific translation function and the current language to be translated; the translation control operation signal is generated by the user operating a translation control key arranged on a side face of the translation host; when the user presses the translation control key and speaks, the translation host obtains the current user voice signal through the sound pickup.
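Steps S101 to S103 can be read as a small event-driven flow. The following Python sketch is only an illustration under assumptions: the class `TranslationHost`, the screen names, the method names and the stub `translate` callable are all hypothetical and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class TranslationHost:
    """Hypothetical model of steps S101-S103; every name here is illustrative."""
    translate: Callable[[str, str], str]  # (speech, function) -> target-language text
    screen: str = "off"
    selected_function: Optional[str] = None
    played: List[str] = field(default_factory=list)

    def power_on(self) -> None:
        # S101: the power-on signal makes the host show the main interface.
        self.screen = "main"

    def first_operation_event(self) -> None:
        # S102: operating the main interface opens the first function interface.
        if self.screen == "main":
            self.screen = "first_function"

    def second_operation_event(self, function: str) -> None:
        # S103, part 1: the user selects one translation control.
        if self.screen == "first_function":
            self.selected_function = function

    def key_pressed_with_speech(self, speech: str, output: str = "speaker") -> Optional[str]:
        # S103, part 2: key signal plus voice signal -> translate, then play.
        if self.selected_function is None:
            return None  # no translation function selected yet
        result = self.translate(speech, self.selected_function)
        self.played.append(f"{output}:{result}")
        return result
```

In this sketch the translation step is gated on a selected control, mirroring the requirement that the second operation event precede translation; the `output` argument stands in for the choice between loudspeaker and Bluetooth headset.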
It should be noted that, for user interfaces such as the translation operation main interface and the first function operation interface in this embodiment, image display, image recognition and gesture recognition techniques from image processing may be used to process and optimize the displayed image so as to ensure its quality and clarity, to recognize and interpret image elements in the user interfaces such as icons, controls and menus, and to recognize and interpret user gestures such as sliding, zooming, rotating and touching. Image processing techniques such as image display, image recognition and gesture recognition are all prior art and are not described further in this embodiment.
It should be noted that the translation host can play the target language through either the loudspeaker or the Bluetooth headset, so as to meet the translation needs of different scenarios and users' hygiene habits.
In step S101, the power-on key is arranged on a side face of the translation host, so that the front face can be devoted to the touch display screen. The touch display screen thus occupies as much of the front face as possible, giving a larger screen area that makes human-computer interaction easier. After the user operates the power-on key, the power-on operation signal is generated, and the translation host controls the touch display screen to display the translation operation main interface according to that signal. It should be noted that, for reasons of cost and portability, the body of the translation host is not made very large; for example, it is not made as large as a smartphone. Under this constraint, letting the touch display screen occupy as much of the front face as possible yields a larger screen and facilitates user interaction. In particular, when part of the body at the top end of the translation host is occupied by the charging storage slot, little space remains on the front of the body for the touch display screen. To let the screen occupy as much of the front face as possible, in this embodiment the power-on key is arranged on a side face of the translation host, and no physical key is arranged on the front of the body.
In step S102, the first operation event occurs when the user operates the translation operation main interface, and the translation host controls the touch display screen to display the first function operation interface in response. Translation controls supporting different translation functions are arranged on the first function operation interface, for example a translation control supporting an online dialogue translation function, a translation control supporting an offline dialogue translation function, and a translation control supporting a Bluetooth headset dialogue translation function. This diversifies the functions of the intelligent translation terminal, meets the user's needs in different scenarios, and improves the user's product experience.
In step S103, the translation control key is arranged on a side face of the translation host, so that the front face can be devoted to the touch display screen. The touch display screen thus occupies as much of the front face as possible, giving a larger screen area that makes human-computer interaction easier. It should be noted that, for reasons of cost and portability, the body of the translation host is not made very large; for example, it is not made as large as a smartphone. Under this constraint, letting the touch display screen occupy as much of the front face as possible yields a larger screen and facilitates user interaction. In particular, when part of the body at the top end of the translation host is occupied by the charging storage slot, little space remains on the front of the body for the touch display screen. To let the screen occupy as much of the front face as possible, in this embodiment the translation control key is arranged on a side face of the translation host, and no physical key is arranged on the front of the body.
When the user presses the translation control key and speaks, the translation host obtains the current user voice signal through the sound pickup. The current user voice signal contains the current language of the user who is speaking, for example Chinese or English. Pressing the translation control key also generates the translation control operation signal, which the translation host acquires.
It should be noted that the second operation event occurs when the user selects, from the translation controls for the different translation functions, the translation control for a specific translation function. For example, the user may select the translation control supporting the online dialogue translation function, the offline dialogue translation function, or the Bluetooth headset dialogue translation function, whereupon a second operation event for online dialogue translation, offline dialogue translation, or Bluetooth headset dialogue translation occurs, respectively.
In some preferred embodiments, the audiovisual integrated intelligent translation interaction method further comprises:
when a third operation event occurs, the translation host controls the touch display screen to display a second function operation interface, on which content playback control keys supporting audio and/or video playback are arranged; the third operation event occurs when the user operates the first function operation interface;
when a content playback operation event occurs, the translation host controls playback of audio content or video content; the content playback operation event occurs when the user operates a content playback control key.
In this embodiment, the third operation event occurs when the user operates the first function operation interface, and the translation host then controls the touch display screen to display the second function operation interface, on which content playback control keys supporting audio and/or video playback are arranged. When the user operates a content playback control key, the content playback operation event occurs and the translation host controls playback of audio content or video content. The intelligent translation terminal can therefore also serve as an audio player and/or a video player, which diversifies its functions.
In some preferred embodiments, the translation control key includes a first language translation control key and a second language translation control key, and the translation controls for the different translation functions include a dialogue translation control. When the user operates the dialogue translation control, the translation host controls the touch display screen to display a first language selection interface located to the left or right of the first language translation control key and a second language selection interface located to the left or right of the second language translation control key. The user selects a first language as the current language of a first dialogue user through the first language selection interface, and selects a second language as the current language of a second dialogue user through the second language selection interface. The first language and the second language each serve as the target language for the other.
In this embodiment, the translation controls for the different translation functions include a dialogue translation control. When the user operates the dialogue translation control, the translation host controls the touch display screen to display the first language selection interface and the second language selection interface. The user may select a first language as the current language of the first dialogue user through the first language selection interface and a second language as the current language of the second dialogue user through the second language selection interface; the first language and the second language each serve as the target language for the other, thereby realizing language translation. For example, when the first dialogue user speaks Chinese and the second dialogue user speaks English, the Chinese of the first dialogue user is translated into English as the target language for the second dialogue user to hear, and the English of the second dialogue user is translated into Chinese as the target language for the first dialogue user to hear.
It should be noted that, in this embodiment, the first language selection interface is located to the left or right of the first language translation control key and the second language selection interface is located to the left or right of the second language translation control key. The physical keys that control translation of the different languages therefore correspond in physical position to the on-screen interfaces for selecting those languages, so that the user can select a language and control its translation simply and intuitively.
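The pairing of two physical keys with two selected languages can be sketched as a lookup that yields the translation direction for whichever key is pressed. This is a minimal illustration only; the class `DialogueSession`, the key identifiers `"first_key"` and `"second_key"`, and the language codes are assumptions, not names from the patent.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class DialogueSession:
    """Hypothetical pairing of two translation control keys with two languages."""
    first_language: str   # chosen on the first language selection interface
    second_language: str  # chosen on the second language selection interface

    def direction_for(self, key: str) -> Tuple[str, str]:
        """Return (source language, target language) for the pressed key."""
        if key == "first_key":
            return self.first_language, self.second_language
        if key == "second_key":
            return self.second_language, self.first_language
        raise ValueError(f"unknown key: {key}")
```

With the example from the text, a session created as `DialogueSession("zh", "en")` translates Chinese to English when the first key is pressed and English to Chinese when the second key is pressed.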
In some preferred embodiments, the dialogue translation control comprises an offline dialogue translation control. When the user operates the offline dialogue translation control, enters a language selection interface to select languages, and presses the translation control key to speak, the translation host translates the first language into the second language, or the second language into the first language, through a local translation database, and controls the loudspeaker or the Bluetooth headset to play the second language or the first language accordingly.
In some preferred embodiments, the dialogue translation control comprises an online dialogue translation control. When the user operates the online dialogue translation control, enters a language selection interface to select languages, and presses the translation control key to speak, the translation host translates the first language into the second language, or the second language into the first language, through a cloud translation database, and controls the loudspeaker or the Bluetooth headset to play the second language or the first language accordingly.
It should be noted that, in online dialogue translation, the translation host translates between the first language and the second language through the cloud translation database and controls the loudspeaker or the Bluetooth headset to play the result. Translation can thus draw on the large number of languages covered by the cloud translation database, which yields more accurate translation, enriches the functions of the intelligent translation terminal, and improves the user's product experience.
In offline dialogue translation, the translation host translates between the first language and the second language through the local translation database and controls the loudspeaker or the Bluetooth headset to play the result. Translation can thus be performed when no network connection is available or when network quality is poor, which enriches the functions of the intelligent translation terminal and improves the user's product experience.
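The choice between the local and the cloud translation database follows from which dialogue translation control the user operated. A minimal dispatch sketch is given below; the control names, the `Translator` signature and both backends are hypothetical stand-ins, since the patent does not specify any programming interface.

```python
from typing import Callable, Dict

# Hypothetical backend signature: (text, source language, target language) -> translated text
Translator = Callable[[str, str, str], str]


def make_backends(local: Translator, cloud: Translator) -> Dict[str, Translator]:
    """Map each dialogue translation control to the database it uses."""
    return {
        "offline_dialogue": local,  # local translation database
        "online_dialogue": cloud,   # cloud translation database
    }


def translate_with(control: str, backends: Dict[str, Translator],
                   text: str, source: str, target: str) -> str:
    """Translate using the backend tied to the control the user selected."""
    if control not in backends:
        raise ValueError(f"unsupported control: {control}")
    return backends[control](text, source, target)
```

Keeping the mapping in one table makes the user's explicit selection the only thing that decides the backend, which matches the text; automatic fallback from cloud to local is not described in the source and is deliberately left out.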
In some preferred embodiments, the first operation event occurs when the user performs a sliding operation on the translation operation main interface, and the third operation event occurs when the user performs a sliding operation on the first function operation interface. For example, the first operation event occurs when the user slides left on the translation operation main interface, and the third operation event occurs when the user slides left on the first function operation interface.
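The slide-left example amounts to stepping through an ordered list of interfaces. The sketch below assumes hypothetical screen identifiers and assumes that sliding left on the last interface leaves it unchanged, a behavior the source does not specify.

```python
# Hypothetical ordering of the interfaces reached by sliding left.
SCREENS = ["main", "first_function", "second_function"]


def on_swipe_left(current: str) -> str:
    """Return the interface shown after a left slide on `current`."""
    index = SCREENS.index(current)
    # Assumption: the last interface stays in place on a further left slide.
    return SCREENS[min(index + 1, len(SCREENS) - 1)]
```

Here the transition from `"main"` corresponds to the first operation event and the transition from `"first_function"` to the third operation event.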
In some preferred embodiments, a man-machine conversation control is also arranged on the first function operation interface. When the user operates the man-machine conversation control, the translation host controls the touch display screen to display an AI assistant man-machine interaction interface that includes a language selection control. When the user selects the user language of the man-machine conversation through the language selection control and speaks, the translation host obtains the user voice signal of the current man-machine conversation through the sound pickup and analyzes the speech in the selected user language contained in that signal through an AI language model to obtain a conversation result. The translation host then controls the loudspeaker or the Bluetooth headset to play the conversation result in the user language of the man-machine conversation, and controls the conversation result to be displayed on the man-machine conversation interface of the touch display screen. The AI language model may be ChatGPT.
It should be noted that prior-art translators have no interaction design involving an AI language model, while conventional AI language models have no interface design or language selection for voice interaction. In this embodiment, the man-machine conversation control arranged on the first function operation interface leads to an AI assistant man-machine interaction interface with a language selection control; the user selects the user language and speaks, the translation host captures the voice signal through the sound pickup, the AI language model analyzes it to obtain a conversation result, and the result is both played through the loudspeaker or the Bluetooth headset in the user language and displayed on the man-machine conversation interface of the touch display screen. The conversation result is thus delivered in both audio and visual form, which is convenient for the user and improves the user's experience of the intelligent translation terminal.
It should be noted that when the user selects the user language of the man-machine conversation through the language selection control, for example Chinese, the conversation language obtained by the AI language model is Chinese, the conversation result played by the loudspeaker or the Bluetooth headset is in Chinese, and the conversation result displayed on the man-machine conversation interface of the touch display screen is in Chinese. Because the user has selected the language, the AI language model only needs to analyze the selected language, for example only Chinese or only the Sichuan dialect, thereby improving the accuracy of machine translation.
In some preferred embodiments, the translation host controls the loudspeaker or the Bluetooth headset to play the target language and, at the same time, controls the touch display screen to display a current translation interface on which the target language is displayed synchronously.
In this embodiment, the translation host controls the touch display screen to display the current translation interface, on which the target language is displayed synchronously. The user can therefore not only listen to the translated target language but also read the translation result on the touch display screen, which improves the user's product experience.
Example 2
1-5, On the basis of the above embodiments, the present embodiment proposes some improved embodiments of the above audiovisual integrated intelligent translation interaction method, so that the intelligent translation terminal is further intuitive and convenient to use, and product use experience of a user is improved.
In some improved embodiments, the translation operation main interface comprises a foreign time display interface, a home time display interface and a translation key indication control; the foreign time display interface displays the national timing of the foreign country where the user is located, and the home time display interface displays the national timing of the home country of the user; the foreign time display interface, the home time display interface and the translation key indication control are displayed side by side; the translation key indication control indicates the position of the translation control key of the user through an arrow and characters, and indicates the user to speak when pressing the translation control key; and the user enters a default translation mode when pressing the translation control key and speaking according to the instruction, the translation host controls the touch display screen to flash a default selection interface containing a latest language selection history record and then jump to a current translation interface, translates the current language in the current user voice signal according to the default selection language recorded by the default selection interface so as to obtain a target language in the default translation mode, and controls the loudspeaker or the Bluetooth headset to play the target language in the default translation mode. The duration of the flash can be specifically set according to the needs.
In this embodiment, the foreign time display interface displays the national time of the foreign country where the user is located, and the home time display interface displays the national time of the user's home country, so that a user abroad can check both times at a glance. Illustratively, when a Chinese user is in the United States, the national time of the United States is shown as New York time 19:11 and the national time of China as Beijing time 8:11. In addition, the two time displays may also hint at the user's likely recent language selection history, for example, that the user has recently selected Chinese and English.
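The dual time display can be illustrated with a short sketch. It is not taken from the patent: the function name `dual_clock` and the fixed UTC offsets (UTC-5 for New York in winter, UTC+8 for Beijing) are assumptions made for illustration, and a real terminal would presumably rely on a time-zone database that handles daylight saving time.

```python
from datetime import datetime, timedelta, timezone

# Assumed fixed offsets for illustration only (no daylight saving handling).
NEW_YORK_WINTER = timezone(timedelta(hours=-5), "New York")
BEIJING = timezone(timedelta(hours=8), "Beijing")


def dual_clock(instant: datetime, foreign_tz: timezone, home_tz: timezone) -> tuple[str, str]:
    """Return (foreign time, home time) as HH:MM strings for one instant."""
    foreign = instant.astimezone(foreign_tz)
    home = instant.astimezone(home_tz)
    return foreign.strftime("%H:%M"), home.strftime("%H:%M")


# 00:11 UTC corresponds to 19:11 in New York (previous day) and 08:11 in Beijing.
instant = datetime(2024, 1, 15, 0, 11, tzinfo=timezone.utc)
foreign_time, home_time = dual_clock(instant, NEW_YORK_WINTER, BEIJING)
print(f"Foreign: {foreign_time}  Home: {home_time}")
```

Both strings are derived from the same instant, so the two displays cannot drift apart; only the offset differs.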
It should be noted that, when the user presses the translation control key and speaks, the default translation mode is entered, and the translation host controls the touch display screen to flash a default selection interface containing the latest language selection history and then jump to the current translation interface. This gives the user visual confirmation of the latest language selection and improves the user's product experience.
It should be further noted that the translation host translates the current language in the current user voice signal according to the default selected language recorded by the default selection interface to obtain the target language in the default translation mode, and controls the loudspeaker or the Bluetooth headset to play it. This achieves efficient translation, avoids a language selection operation before every translation, and reduces the number of operation steps.
It should be noted that the translation key indication control indicates the position of the translation control key to the user through an arrow and text and prompts the user to speak while pressing the translation control key, which makes the terminal intuitive and easy to use and improves the user's product experience.
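The default translation mode described above can be sketched roughly as follows. This is a hypothetical illustration rather than the patented implementation: the class `DefaultTranslationMode`, the `fake_translate` stand-in for the translation engine, the screen labels, and the Chinese-to-English fallback pair are all assumptions.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass(frozen=True)
class LanguagePair:
    source: str
    target: str


@dataclass
class DefaultTranslationMode:
    # translate(text, source, target) stands in for the host's translation engine.
    translate: Callable[[str, str, str], str]
    fallback: LanguagePair = LanguagePair("zh", "en")  # assumed factory default
    history: List[LanguagePair] = field(default_factory=list)

    def record_selection(self, pair: LanguagePair) -> None:
        """Remember a language pair the user picked explicitly."""
        self.history.append(pair)

    def default_pair(self) -> LanguagePair:
        """Latest selection in the history, or the fallback if none exists."""
        return self.history[-1] if self.history else self.fallback

    def on_translate_key(self, utterance: str) -> Tuple[List[str], str]:
        """Handle a key press: list the screens shown and return the translation."""
        pair = self.default_pair()
        screens = [
            f"default selection: {pair.source} -> {pair.target}",  # flashed briefly
            "current translation",
        ]
        return screens, self.translate(utterance, pair.source, pair.target)


def fake_translate(text: str, source: str, target: str) -> str:
    """Placeholder engine that only tags the text with the language pair."""
    return f"[{source}->{target}] {text}"


mode = DefaultTranslationMode(translate=fake_translate)
mode.record_selection(LanguagePair("zh", "en"))
mode.record_selection(LanguagePair("en", "ja"))
screens, result = mode.on_translate_key("hello")
```

Because the pair is read from the history at key-press time, the user never has to reselect languages between utterances, which is the reduction in operation steps the paragraph describes.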
In a further improved embodiment, when the user touches the translation key indication control of the translation operation main interface, the translation host controls the touch display screen to display a default selection interface containing the latest language selection history, and the default selection interface displays the historically selected languages and a translation key indication icon. The historically selected languages include a first language and a second language; for example, the first language is Chinese and the second language is English.
It should be noted that the translation key indication control thus serves two purposes: it indicates the position of the translation control key and prompts the user to speak while pressing it, and, when touched, it causes the translation host to control the touch display screen to display the default selection interface containing the latest language selection history. This makes it convenient for the user to confirm the historically selected languages while reducing the number and complexity of controls. In addition, the translation key indication icon continues to indicate the position of the translation control key, so that the user can find the key quickly.
In some improved embodiments, the first functional operation interface further comprises a translation headset control; when the user operates the translation headset control to select the translation headset function and the current language to be translated, the translation host controls the Bluetooth headset to play the target language.
It should be noted that playing the target language through the Bluetooth headset provides translated communication for the user wearing the Bluetooth headset and reduces external interference.
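The choice between the loudspeaker and the Bluetooth headset can be sketched as a small routing function. The names `AudioOutput` and `choose_output` are hypothetical, and falling back to the loudspeaker when the headset is not connected is an assumption; the text only states that the headset plays the target language when the translation headset function is selected.

```python
from enum import Enum


class AudioOutput(Enum):
    LOUDSPEAKER = "loudspeaker"
    BLUETOOTH_HEADSET = "bluetooth_headset"


def choose_output(headset_function_selected: bool, headset_connected: bool) -> AudioOutput:
    """Pick where the translated speech is played."""
    if headset_function_selected and headset_connected:
        return AudioOutput.BLUETOOTH_HEADSET
    # Assumed fallback: use the loudspeaker whenever the headset cannot be used.
    return AudioOutput.LOUDSPEAKER
```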
Example three
Referring to fig. 1-5, this embodiment provides an intelligent translation terminal, including:
a translation host for running the audiovisual integrated intelligent translation interaction method according to any of the above embodiments;
a Bluetooth headset 4, which can be fitted into the charging storage slot 1111 provided at the top of the translation host for charging, the Bluetooth headset being communicatively connectable with the translation host.
The translation host is provided with a touch display screen 3, a Bluetooth headset 4, a translation control key 6, and the like. The translation host may include a housing 11 and an outer cover 12. The translation control key 6 is arranged on one side of the housing 11, and the touch display screen 3 is arranged on the front of the housing 11. The housing 11 is provided with a charging storage slot 1111, which charges the Bluetooth headset 4 while storing it. The outer cover 12 covers the opening of the charging storage slot 1111 and is movably connected with the housing 11. The shape of the Bluetooth headset 4 matches the shape of the charging storage slot 1111. The outer cover 12 serves as a protective cover for the charging storage slot 1111 and effectively prevents the Bluetooth headset 4 from falling out during charging or storage.
The above embodiments are only preferred embodiments of the present invention, and the scope of the present invention is not limited thereto, but any insubstantial changes and substitutions made by those skilled in the art on the basis of the present invention are intended to be within the scope of the present invention as claimed.

Claims (5)

1. An audiovisual integrated intelligent translation interaction method, applied to an intelligent translation terminal comprising a translation host and a Bluetooth headset, wherein a pickup and a loudspeaker are arranged on the translation host, characterized in that a touch display screen is further arranged on the translation host, and the method comprises the following steps:
According to the starting-up operation signal, the translation host controls the touch display screen to display a translation operation main interface; the starting-up operation signal is generated by operating a starting-up key through a user, the starting-up key is arranged on the side face of the translation host, and the touch display screen is arranged on the front face of the translation host;
when a first operation event occurs, the translation host controls the touch display screen to display a first functional operation interface, and translation controls supporting different translation functions are distributed on the first functional operation interface; the first operation event occurs through the user operating the translation operation main interface;
When a second operation event occurs, the translation host translates the current language in the current user voice signal according to the translation control operation signal and the current user voice signal to obtain a target language, and controls the loudspeaker or the Bluetooth headset to play the target language; the second operation event occurs through the user selecting a translation control for operating a specific translation function in the translation controls with different translation functions, and the user selects the specific translation function and the current language to be translated; the translation control operation signal is generated by operating a translation control key by a user, and the translation control key is arranged on the side surface of the translation host; when a user presses the translation control key and speaks, the translation host obtains the current user voice signal through the sound pick-up;
when a third operation event occurs, the translation host controls the touch display screen to display a second functional operation interface, and the second functional operation interface distributes content playing control keys supporting audio and/or video playing; the third operation event occurs through the user operating the first function operation interface;
when a content playing operation event occurs, the translation host controls the playing of audio content or video content; the content playing operation event occurs through the user operating the content playing control key;
the translation control buttons comprise a first language translation control button and a second language translation control button, and the translation controls with different translation functions comprise dialogue translation controls; when a user operates the dialogue translation control, the translation host controls the touch display screen to display a first language selection interface positioned on the left side or the right side of the first language translation control key and a second language selection interface positioned on the left side or the right side of the second language translation control key; the user selects a first language as the current language of the first dialogue user through the first language selection interface, and selects a second language as the current language of the second dialogue user through the second language selection interface; the first language and the second language are the target language;
the conversation translation control comprises an offline conversation translation control; when the user operates the offline dialogue translation control to enter a language selection interface for language selection and presses the translation control key to speak, the translation host translates the first language into the second language or translates the second language into the first language through a local translation database and controls the loudspeaker or the Bluetooth headset to play the second language or the first language;
the conversation translation control comprises an online conversation translation control; when the user operates the online dialogue translation control to enter a language selection interface for language selection and presses the translation control key to speak, the translation host translates the first language into the second language or translates the second language into the first language through a cloud translation database and controls the loudspeaker or the Bluetooth headset to play the second language or the first language;
the first functional operation interface is also distributed with man-machine conversation controls; when the user operates the man-machine conversation control, the translation host controls the touch display screen to display an AI assistant man-machine interaction interface, wherein the AI assistant man-machine interaction interface comprises a language selection control; when a user selects the user language of the man-machine conversation through the language selection control and speaks, the translation host obtains the user voice signal of the current man-machine conversation through the pickup, analyzes the user language of the man-machine conversation in the user voice signal of the current man-machine conversation through the AI language model, controls the loudspeaker or the Bluetooth headset to play the conversation result through the user language of the man-machine conversation after the conversation result is obtained, and controls the conversation result to be displayed on the man-machine conversation interface of the touch display screen.
2. The audiovisual integrated intelligent translation interaction method according to claim 1, wherein the AI language model is ChatGPT.
3. The audiovisual integrated intelligent translation interaction method according to claim 1 or 2, wherein the translation host, while controlling the loudspeaker or the Bluetooth headset to play the target language, controls the touch display screen to display a current translation interface, and the current translation interface synchronously displays the target language.
4. The audiovisual integrated intelligent translation interaction method according to claim 1 or 2, wherein the first operation event occurs through the user performing a sliding operation on the translation operation main interface, and the third operation event occurs through the user performing a sliding operation on the first function operation interface.
5. An intelligent translation terminal, characterized by comprising:
a translation host for running the audiovisual integrated intelligent translation interaction method according to any one of claims 1-4;
a Bluetooth headset, which can be fitted into a charging storage slot provided at the top of the translation host for charging, the Bluetooth headset being communicatively connectable with the translation host.
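As an informal illustration that forms no part of the claims, the interface navigation recited in claims 1 and 4 (main interface, then first functional operation interface, then second functional operation interface, each reached by a sliding operation) can be sketched as a small state machine. The names `Screen`, `TRANSITIONS` and `next_screen`, and the behavior of ignoring unknown gestures, are assumptions made for this sketch.

```python
from enum import Enum, auto


class Screen(Enum):
    MAIN = auto()             # translation operation main interface
    FIRST_FUNCTION = auto()   # first functional operation interface
    SECOND_FUNCTION = auto()  # second functional operation interface


# (current screen, gesture) -> next screen
TRANSITIONS = {
    (Screen.MAIN, "slide"): Screen.FIRST_FUNCTION,             # first operation event
    (Screen.FIRST_FUNCTION, "slide"): Screen.SECOND_FUNCTION,  # third operation event
}


def next_screen(current: Screen, gesture: str) -> Screen:
    """Return the screen shown after a gesture; unknown gestures keep the current one."""
    return TRANSITIONS.get((current, gesture), current)
```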
CN202410202041.2A 2024-02-23 2024-02-23 Intelligent translation interaction method integrating audio-visual function and intelligent translation terminal Active CN117787295B (en)


Publications (2)

Publication Number Publication Date
CN117787295A (en) 2024-03-29
CN117787295B (en) 2024-05-03

Family

ID=90393003


Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20080007966A (en) * 2006-07-19 2008-01-23 현상수 Simultaneous interpretation service system
CN107656924A (en) * 2017-10-20 2018-02-02 泾县麦蓝网络技术服务有限公司 A kind of bilingual translation method and system applied to mobile terminal
CN207764801U (en) * 2018-02-07 2018-08-24 科大讯飞股份有限公司 Translator
CN110769345A (en) * 2019-11-04 2020-02-07 湖南文理学院 Portable translation device with Bluetooth headset and convenient to fix
CN111428515A (en) * 2020-03-30 2020-07-17 浙江大学 Simultaneous interpretation equipment and method
CN111447397A (en) * 2020-03-27 2020-07-24 深圳市贸人科技有限公司 Translation method and translation device based on video conference
CN112788175A (en) * 2021-03-03 2021-05-11 北京雅信诚医学信息科技有限公司 Machine translation method and machine translation device
CN215219952U (en) * 2021-06-03 2021-12-17 深圳市贸人科技有限公司 Portable translating machine

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9183199B2 (en) * 2011-03-25 2015-11-10 Ming-Yuan Wu Communication device for multiple language translation system
US20130173246A1 (en) * 2012-01-04 2013-07-04 Sheree Leung Voice Activated Translation Device




Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant