CN113377276A - System, method and device for quick recording and translation, electronic equipment and storage medium - Google Patents
System, method and device for quick recording and translation, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN113377276A CN113377276A CN202110547675.8A CN202110547675A CN113377276A CN 113377276 A CN113377276 A CN 113377276A CN 202110547675 A CN202110547675 A CN 202110547675A CN 113377276 A CN113377276 A CN 113377276A
- Authority
- CN
- China
- Prior art keywords
- text information
- language
- text
- translation
- information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000013519 translation Methods 0.000 title claims abstract description 84
- 238000000034 method Methods 0.000 title claims abstract description 53
- 238000004891 communication Methods 0.000 claims description 22
- 238000012545 processing Methods 0.000 claims description 15
- 238000004590 computer program Methods 0.000 claims description 13
- 238000009877 rendering Methods 0.000 claims 1
- 238000005516 engineering process Methods 0.000 abstract description 12
- 238000010586 diagram Methods 0.000 description 7
- 230000008569 process Effects 0.000 description 6
- 230000000694 effects Effects 0.000 description 5
- 238000013518 transcription Methods 0.000 description 5
- 230000035897 transcription Effects 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 4
- 238000012986 modification Methods 0.000 description 4
- 230000004048 modification Effects 0.000 description 4
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 238000012549 training Methods 0.000 description 3
- 230000006870 function Effects 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 1
- 238000003058 natural language processing Methods 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000000047 product Substances 0.000 description 1
- 238000012552 review Methods 0.000 description 1
- 239000013589 supplement Substances 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/01—Input arrangements or combined input and output arrangements for interaction between user and computer
- G06F3/048—Interaction techniques based on graphical user interfaces [GUI]
- G06F3/0487—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
- G06F3/0489—Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using dedicated keyboard keys or combinations thereof
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/40—Processing or translation of natural language
- G06F40/58—Use of machine translation, e.g. for multi-lingual retrieval, for server-side translation for client devices or for real-time translation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Abstract
The application provides a system, a method, a device, an electronic device and a storage medium for quick recording and translation, wherein the system comprises: the method comprises the following steps that an operation area is provided with an input keyboard of the rapid recorder, wherein the input keyboard of the rapid recorder is used for inputting first text information matched with voice information of an environment where the rapid recorder is located, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language; and the processor is used for translating the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information. By the method and the device, the problem that the voice recognition is inaccurate or the voice recognition is different in the related technology is solved.
Description
Technical Field
The present application relates to the field of natural language processing technologies, and in particular, to a system, a method, an apparatus, an electronic device, and a storage medium for fast recording and translation.
Background
At present, as the Chinese market is internationalized, trade transactions among countries are closer, which causes the meeting requirements for translation work to increase, and various international meetings, forum sites, large-scale remote meetings and international course training activities often need simultaneous translation.
In order to meet the conference with the requirement of multilingual real-time communication such as enterprises, public institutions, colleges, courts, large conferences and the like, communication translation is usually completed by using an artificial intelligent voice recognition technology, but the artificial intelligent voice recognition has low quality on various different accents and dialects and is easy to generate errors, so that the voice recognition is inaccurate or is different, and the understanding error in the aspect of languages is brought to participants.
Therefore, the related art has problems in that the speech recognition is inaccurate or the speech recognition is ambiguous.
Disclosure of Invention
The application provides a system, a method, a device, an electronic device and a storage medium for quick transcription and translation, which at least solve the problems of inaccurate voice recognition or ambiguous voice recognition in the related art.
According to an aspect of an embodiment of the present application, there is provided a system for fast-record translation, including:
the method comprises the steps that an operation area is provided with an input keyboard of the video recorder, the input keyboard of the video recorder is used for inputting first text information matched with voice information of an environment where the input keyboard is located, wherein the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language;
and the processor is used for translating the first text information according to a target scheme to obtain second text information belonging to the second language, wherein the target scheme is used for indicating the translation of the first text information.
Optionally, the system further comprises:
a display area for displaying the first text information and the second text information.
Optionally, the operation area includes an operation screen for displaying the first text information.
According to another aspect of the embodiments of the present application, there is also provided a method for fast-record translation, including:
acquiring first text information, wherein the first text information is information matched with voice information of an environment where a quick recorder input keyboard is used for inputting, the quick recorder input keyboard is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language;
and translating the first text message according to a target scheme to obtain second text message belonging to the second language, wherein the target scheme is used for indicating the translation of the first text message.
Optionally, after the first text information is translated according to a target scheme to obtain second text information belonging to the second language, the method further includes:
receiving the second text information;
and displaying the first text information and the second text information by using visualization software.
Optionally, translating the first text information according to a target scheme, and obtaining second text information belonging to the second language includes:
matching the first text information with preset text information in a target scheme;
and under the condition that target text information matched with the preset text information exists in the first text information, translating the first text information by using an intelligent translation engine to obtain the second text information.
Optionally, the method further comprises:
after first text information is acquired, similarity comparison is carried out on the first text information and each text information stored in a text database, wherein the text database is used for correcting the first text information;
determining that the first text information is wrong text information under the condition that the similarity comparison result between the first text information and reference text information is smaller than a preset threshold value, wherein the reference text information is any one of the text information;
replacing the first text information with the reference text information.
According to another aspect of the embodiments of the present application, there is also provided an apparatus for fast-record translation, including:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring first text information, the first text information is information matched with voice information of an environment where a quick recorder input keyboard is used for inputting, the quick recorder input keyboard is provided by a system operation area, the voice information belongs to a first user, the first user uses a first language to interact with a second user, the first text information belongs to the first language, and the second user uses a second language to interact with each other;
and the translation unit is used for translating the first text information according to a target scheme to obtain second text information belonging to the second language, wherein the target scheme is used for indicating the translation of the first text information.
Optionally, the apparatus further comprises:
a receiving unit, configured to perform translation processing on the first text information according to a target scheme, and receive second text information belonging to the second language after obtaining the second text information;
and the display unit is used for displaying the first text information and the second text information by using visualization software.
Optionally, the translation unit comprises:
the matching module is used for matching the first text information with preset text information in a target scheme;
and the translation module is used for translating the first text information by using an intelligent translation engine under the condition that the first text information has the target text information matched with the preset text information to obtain the second text information.
Optionally, the apparatus further comprises:
the comparison unit is used for comparing the similarity of the first text information with each text information stored in a text database after the first text information is acquired, wherein the text database is used for correcting the first text information;
a determining unit, configured to determine that the first text information is wrong text information when a similarity comparison result between the first text information and reference text information is smaller than a preset threshold, where the reference text information is any one of the text information;
a replacing unit configured to replace the first text information with the reference text information.
According to another aspect of the embodiments of the present application, there is also provided an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory communicate with each other through the communication bus; wherein the memory is used for storing the computer program; a processor for executing the method steps of shorthand translation in any of the above embodiments by executing the computer program stored on the memory.
According to a further aspect of the embodiments of the present application, there is also provided a computer-readable storage medium, in which a computer program is stored, where the computer program is configured to execute the method steps of fast record translation in any of the above embodiments when the computer program is executed.
In the embodiment of the application, a mode of inputting voice information by a speed recorder is adopted, and first text information is obtained, wherein the first text information is information matched with the voice information of an environment where a keyboard is input by the speed recorder, the keyboard is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is the text information belonging to the first language, and the second user interacts by using a second language; and translating the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information. Because the voice information is input by adopting the keyboard of the speed recorder, the method can improve the efficiency of the current industry voice recognition simultaneous transmission system on different accents, dialects and low reception quality, achieves the technical effect of reducing the error rate of voice recognition, and further solves the problems of inaccurate voice recognition or different voice recognition in the related technology.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate embodiments consistent with the invention and together with the description, serve to explain the principles of the invention.
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without inventive exercise.
FIG. 1 is a schematic diagram of an alternative fast-record translation system, according to an embodiment of the present invention;
FIG. 2 is a flow diagram illustrating an alternative method of shorthand translation according to an embodiment of the present application;
FIG. 3 is a block diagram of an alternative apparatus for fast-record translation according to an embodiment of the present application;
fig. 4 is a block diagram of an alternative electronic device according to an embodiment of the present application.
Detailed Description
In order to make the technical solutions better understood by those skilled in the art, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is obvious that the described embodiments are only partial embodiments of the present application, but not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and claims of this application and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the application described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In order to meet the conference with the requirement of multilingual real-time communication such as enterprises, public institutions, colleges, courts, large conferences and the like, communication translation is usually completed by using an artificial intelligent voice recognition technology, but the artificial intelligent voice recognition has low quality on various different accents and dialects and is easy to generate errors, so that the voice recognition is inaccurate or is different, and the understanding error in the aspect of languages is brought to participants. In order to solve the above problem, an embodiment of the present application provides a system for fast recording and translation, including:
the method comprises the following steps that an operation area is provided with an input keyboard of the rapid recorder, wherein the input keyboard of the rapid recorder is used for inputting first text information matched with voice information of an environment where the rapid recorder is located, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language;
and the processor is used for translating the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information.
Optionally, as shown in fig. 1, the shorthand translation system provided in this embodiment of the present application is composed of an operation area 1 and a processor 2, where the operation area 1 is connected to the processor 2, where an input keyboard of the shorthand recorder is provided in the operation area 1, and various operation systems for operating the shorthand recorder are also provided, and after receiving a voice message in a field environment, a shorthand recorder may input a first text message that matches (is consistent with) the voice message, for example, inputting a chinese message, and may also input an english message, and the like, by using the input keyboard of the shorthand recorder.
It should be noted that the rapid recording and translation system according to the embodiment of the present application is applied to scenes such as an international conference, a forum site, a large-scale teleconference, and an international course training, so that at this time, there exists a situation where a first user interacts with a second user in a first language, and at this time, the second user interacts with the second user in a second language, for example, when a first user (a country person) needs to interact with a second user (B country person) in a certain international conference, the rapid recording and translation system according to the embodiment of the present application may be used to translate voice information of the first user. The speech obtained by the language used by the first user is called first text information, and the speech obtained by the language used by the second user is called second text information.
In the rapid recording and translation system according to the embodiment of the application, the processor is used to process the first text message, and at this time, translation processing on the first text message needs to be executed according to a target scheme set in the processor, and the translated text message is used as the second text message in the second language.
In the embodiment of the application, a mode that the speed recorder inputs voice information is adopted, an input keyboard of the speed recorder is provided in an operation area, the input keyboard of the speed recorder is used for inputting first text information matched with the voice information of the environment where the speed recorder is located, and the processor translates the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating translation of the first text information. Because the voice information is input by adopting the keyboard of the speed recorder, the method can improve the efficiency of the current industry voice recognition simultaneous transmission system on different accents, dialects and low reception quality, achieves the technical effect of reducing the error rate of voice recognition, and further solves the problems of inaccurate voice recognition or different voice recognition in the related technology.
As an alternative embodiment, the system further comprises:
and the display area is used for displaying the first text information and the second text information.
Optionally, as shown in fig. 1, the system further includes a display area 3, the display area 3 is connected to the operation area 1 and the processor 2, and the display area 3 is configured to scroll and display the first text information and the second text information in real time, so that the accuracy of the translated content and the original text is ensured, and meanwhile, on-site conference participants can watch bilingual scrolling live broadcast. In addition, the conference contents (i.e. the first text message and the second text message) are stored in the fast-recording translation system in real time for later checking and verification. As in fig. 1, the display area 3 may be a second screen displayed to the participant.
In the present embodiment, Vue + Electron front-rear end bonding technology is used. With the continuous development of the front-end technology, the front-end technology can be used for developing client software of each system nowadays, and the rapid recording and translation system in the embodiment of the application is novel cross-end client software developed by adopting the front-end technology.
In the embodiment of the application, the first text information of the original text and the translated second text information can be displayed in real time through the display area, so that the participants can watch the first text information and the translated second text information in real time conveniently.
As an alternative embodiment, the operation area includes an operation screen for displaying the first text information.
Optionally, as shown in fig. 1, an operation screen for the stenographer to view the operation condition is included in the operation area, and the operation screen is mainly convenient for the stenographer to view the first text information input by the stenographer. As shown in fig. 1, the operation screen may be a first screen operated by the stenographer.
According to an aspect of the embodiments of the present application, there is provided a method for quick transcription and translation, where the method runs on a system server for quick transcription and translation, as shown in fig. 2, fig. 2 is a schematic flowchart of an alternative method for quick transcription and translation according to an embodiment of the present application, and the flowchart of the method may include the following steps:
step S201, obtaining first text information, wherein the first text information is information matched with voice information of an environment where the input keyboard of the rapid recorder is used for inputting, the input keyboard of the rapid recorder is provided by a system operation area, the voice information belongs to a first user, the first user uses a first language to interact with a second user, the first text information is text information belonging to the first language, and the second user uses the second language to interact.
Optionally, the system server obtains that the stenographer inputs first text information by using an input keyboard of the stenograph, where the first text information is voice information spoken by a first user and received by the stenograph in a field environment, and then the system server performs a translation operation based on the first text information.
It should be noted that the rapid recording and translation system according to the embodiment of the present application is applied to scenes such as an international conference, a forum site, a large-scale teleconference, and an international course training, so that at this time, there exists a situation where a first user interacts with a second user in a first language, and at this time, the second user interacts with the second user in a second language, for example, when a first user (a country person) needs to interact with a second user (B country person) in a certain international conference, the rapid recording and translation system according to the embodiment of the present application may be used to translate voice information of the first user. The speech obtained by the language used by the first user is called first text information, and the speech obtained by the language used by the second user is called second text information.
Step S202, performing a translation process on the first text information according to a target scheme to obtain second text information belonging to a second language, where the target scheme is used to instruct to translate the first text information.
Optionally, after receiving the first text message, the system server translates the first text message according to the stored target scheme, and uses the translated text message as a second text message in a second language.
In the embodiment of the application, a mode of inputting voice information by a speed recorder is adopted, and first text information is obtained, wherein the first text information is information matched with the voice information of an environment where a keyboard is input by the speed recorder, the keyboard is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is the text information belonging to the first language, and the second user interacts by using a second language; and translating the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information. Because the voice information is input by adopting the keyboard of the speed recorder, the method can improve the efficiency of the current industry voice recognition simultaneous transmission system on different accents, dialects and low reception quality, achieves the technical effect of reducing the error rate of voice recognition, and further solves the problems of inaccurate voice recognition or different voice recognition in the related technology.
As an alternative embodiment, after the first text information is translated according to the target scheme to obtain the second text information belonging to the second language, the method further includes:
receiving second text information;
and displaying the first text information and the second text information by using visualization software.
Optionally, in the embodiment of the application, in order to facilitate the participants to view the original text and the translated text, the first text information and the second text information are displayed on the screen by using the visualization software. Therefore, the participants can check and correct the original text and the translation text, and the translation accuracy is improved.
As an alternative embodiment, the translating the first text information according to the target solution to obtain the second text information belonging to the second language includes:
matching the first text information with preset text information in a target scheme;
and under the condition that the target text information matched with the preset text information exists in the first text information, translating the first text information by using an intelligent translation engine to obtain second text information.
Alternatively, before the first text information is translated, a translation rule (i.e., a target scheme) is usually set, and the translation operation on the first text is performed only when the translation rule is satisfied. Specifically, in the embodiment of the present application, the processing parameters may be set as some punctuation marks, for example: and regarding the punctuation parameters as preset text information in a target scheme, and then checking whether the first text information has target text information matched with the preset text information, wherein the target text information is the sentence or the semicolon or the exclamation mark or the question mark.
If the target text information matched with the preset text information exists in the first text information, it is indicated that the paragraph where the current first text information is located is already ended, at this time, the translation processing operation can be performed on the whole paragraph, at this time, the intelligent translation engine self-developed by each enterprise can be used for performing translation processing on the first text information, and then translated second text information is obtained.
In the embodiment of the application, whether the first text information is translated or not is determined according to the matching result between the first text information and the preset text information in the target scheme, and meanwhile, the self-researched intelligent translation engine is used for translating the first text information, so that powerful supplement and guarantee can be provided.
As an optional embodiment, after acquiring the first text information, the method further includes:
comparing the similarity of the first text information with each text information stored in a text database, wherein the text database is used for correcting the first text information;
determining that the first text information is wrong text information under the condition that the similarity comparison result between the first text information and the reference text information is smaller than a preset threshold value, wherein the reference text information is any one text information in each text information;
the first text information is replaced with the reference text information.
Alternatively, the embodiments of the present application provide a way to correct the original text and correct the translated text, and in particular, storing various correct text information in a text database of the system, comparing the similarity of the first text information with the text information, when the similarity comparison result between the reference text information and the first text information is smaller than a preset threshold value after the first text information is compared with any one of the text information, the first text message may be a wrong text message, may be a shorthand input error, in case of an error in the first text information, the translated second text information will also have an error accordingly, therefore, the reference text information should be replaced by the first text information, and then the reference text should be used as the text information to be translated, so as to obtain the correct translation text (i.e. the second text information).
The preset threshold value related in the embodiment of the present application is a lowest value for comparing similarity between the first text message and each text message, and only if the preset threshold value is greater than or equal to the preset threshold value, it indicates that the first text message is a correct text message, and the preset threshold value may be set to 85%, and the like.
The correct first text information and the correct second text information may then be stored for subsequent review and verification.
In the embodiment of the application, the original text content can be modified in time in the meeting process, and then the sentence segment with the translation error can be corrected in time, so that the error rate in the voice recognition process which needs manual modification in the co-transmission industry is solved, the modification and arrangement work of bilingual text content after meeting is reduced, and the function of looking up bilingual meeting summary after meeting is quickly formed.
It should be noted that, for simplicity of description, the above-mentioned method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present application is not limited by the order of acts described, as some steps may occur in other orders or concurrently depending on the application. Further, those skilled in the art should also appreciate that the embodiments described in the specification are preferred embodiments and that the acts and modules referred to are not necessarily required in this application.
Through the above description of the embodiments, those skilled in the art can clearly understand that the method according to the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but the former is a better implementation mode in many cases. Based on such understanding, the technical solutions of the present application may be embodied in the form of a software product, which is stored in a storage medium (e.g., a ROM (Read-Only Memory)/RAM (Random Access Memory), a magnetic disk, an optical disk) and includes several instructions for enabling a terminal device (e.g., a mobile phone, a computer, a server, or a network device) to execute the methods of the embodiments of the present application.
According to another aspect of the embodiments of the present application, there is also provided an apparatus for fast record translation for implementing the above method for fast record translation. Fig. 3 is a block diagram of an alternative apparatus for fast record translation according to an embodiment of the present application, and as shown in fig. 3, the apparatus may include:
an obtaining unit 301, configured to obtain first text information, where the first text information is information that matches voice information of an environment where a user inputs the information by using a shorthand recorder input keyboard, the shorthand recorder input keyboard is provided in a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is text information belonging to the first language, and the second user interacts with the second language;
the translating unit 302 is connected to the obtaining unit 301, and is configured to perform translation processing on the first text information according to a target scheme, so as to obtain second text information belonging to a second language, where the target scheme is used to instruct to translate the first text information.
It should be noted that the obtaining unit 301 in this embodiment may be configured to execute the step S201, and the translating unit 302 in this embodiment may be configured to execute the step S202.
Through the module, a mode of inputting voice information by a speed recorder is adopted, and first text information is obtained, wherein the first text information is information matched with the voice information of the environment where the input keyboard of the speed recorder is used for inputting, the input keyboard of the speed recorder is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is the text information belonging to the first language, and the second user interacts by using a second language; and translating the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information. Because the voice information is input by adopting the keyboard of the speed recorder, the method can improve the efficiency of the current industry voice recognition simultaneous transmission system on different accents, dialects and low reception quality, achieves the technical effect of reducing the error rate of voice recognition, and further solves the problems of inaccurate voice recognition or different voice recognition in the related technology.
As an alternative embodiment, the apparatus further comprises:
the receiving unit is used for translating the first text information according to the target scheme to obtain second text information belonging to a second language and then receiving the second text information;
and the display unit is used for displaying the first text information and the second text information by using visualization software.
As an alternative embodiment, the translation unit 302 includes:
the matching module is used for matching the first text information with preset text information in the target scheme;
the translation module is used for translating the first text information by using the intelligent translation engine under the condition that the target text information matched with the preset text information exists in the first text information to obtain second text information.
As an alternative embodiment, the apparatus further comprises:
the comparison unit is used for comparing the similarity of the first text information with each text information stored in a text database after the first text information is acquired, wherein the text database is used for correcting the first text information;
the determining unit is used for determining that the first text information is wrong text information under the condition that the similarity comparison result between the first text information and the reference text information is smaller than a preset threshold value, wherein the reference text information is any one of the text information;
a replacing unit for replacing the first text information with the reference text information.
It should be noted here that the modules described above are the same as the examples and application scenarios implemented by the corresponding steps, but are not limited to the disclosure of the above embodiments.
According to another aspect of the embodiments of the present application, there is also provided an electronic device for implementing the above fast recording and translation method, where the electronic device may be a server, a terminal, or a combination thereof.
Fig. 4 is a block diagram of an alternative electronic device according to an embodiment of the present application, as shown in fig. 4, including a processor 401, a communication interface 402, a memory 403, and a communication bus 404, where the processor 401, the communication interface 402, and the memory 403 communicate with each other through the communication bus 404, where,
a memory 403 for storing a computer program;
the processor 401, when executing the computer program stored in the memory 403, implements the following steps:
s1, acquiring first text information, wherein the first text information is information matched with voice information of an environment where the input keyboard of the rapid recorder is used for inputting, the input keyboard of the rapid recorder is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is text information belonging to the first language, and the second user interacts by using a second language;
and S2, performing translation processing on the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information.
Alternatively, in this embodiment, the communication bus may be a PCI (Peripheral Component Interconnect) bus, an EISA (Extended Industry Standard Architecture) bus, or the like. The communication bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one thick line is shown in FIG. 4, but this does not indicate only one bus or one type of bus.
The communication interface is used for communication between the electronic equipment and other equipment.
The memory may include RAM, and may also include non-volatile memory (non-volatile memory), such as at least one disk memory. Alternatively, the memory may be at least one memory device located remotely from the processor.
As an example, as shown in fig. 4, the memory 403 may include, but is not limited to, an obtaining unit 301 and a translating unit 302 in the apparatus for fast recording and translating. In addition, the apparatus may further include, but is not limited to, other module units in the apparatus for fast recording and translating, which is not described in detail in this example.
The processor may be a general-purpose processor, and may include but is not limited to: a CPU (Central Processing Unit), an NP (Network Processor), and the like; but also a DSP (Digital Signal Processing), an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component.
In addition, the electronic device further includes: and the display is used for displaying the translation result of the rapid transcription system.
Optionally, the specific examples in this embodiment may refer to the examples described in the above embodiments, and this embodiment is not described herein again.
It can be understood by those skilled in the art that the structure shown in fig. 4 is only an illustration, and the device implementing the fast recording and translation method may be a terminal device, and the terminal device may be a terminal device such as a smart phone (e.g., an Android phone, an iOS phone, etc.), a tablet computer, a palm computer, a Mobile Internet Device (MID), a PAD, and the like. Fig. 4 is a diagram illustrating a structure of the electronic device. For example, the terminal device may also include more or fewer components (e.g., network interfaces, display devices, etc.) than shown in FIG. 4, or have a different configuration than shown in FIG. 4.
Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by a program instructing hardware associated with the terminal device, where the program may be stored in a computer-readable storage medium, and the storage medium may include: flash disk, ROM, RAM, magnetic or optical disk, and the like.
According to still another aspect of an embodiment of the present application, there is also provided a storage medium. Alternatively, in this embodiment, the storage medium may be a program code for executing a method of fast record translation.
Optionally, in this embodiment, the storage medium may be located on at least one of a plurality of network devices in a network shown in the above embodiment.
Optionally, in this embodiment, the storage medium is configured to store program code for performing the following steps:
s1, acquiring first text information, wherein the first text information is information matched with voice information of an environment where the input keyboard of the rapid recorder is used for inputting, the input keyboard of the rapid recorder is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information is text information belonging to the first language, and the second user interacts by using a second language;
and S2, performing translation processing on the first text information according to a target scheme to obtain second text information belonging to a second language, wherein the target scheme is used for indicating the translation of the first text information.
Optionally, the specific example in this embodiment may refer to the example described in the above embodiment, which is not described again in this embodiment.
Optionally, in this embodiment, the storage medium may include, but is not limited to: various media capable of storing program codes, such as a U disk, a ROM, a RAM, a removable hard disk, a magnetic disk, or an optical disk.
According to yet another aspect of an embodiment of the present application, there is also provided a computer program product or a computer program comprising computer instructions stored in a computer readable storage medium; the processor of the computer device reads the computer instructions from the computer-readable storage medium, and the processor executes the computer instructions to cause the computer device to perform the method steps of the snapshot translation in any of the above embodiments.
The above-mentioned serial numbers of the embodiments of the present application are merely for description and do not represent the merits of the embodiments.
The integrated unit in the above embodiments, if implemented in the form of a software functional unit and sold or used as a separate product, may be stored in the above computer-readable storage medium. Based on such understanding, the technical solution of the present application may be substantially implemented or contributed to by the prior art, or all or part of the technical solution may be embodied in the form of a software product, stored in a storage medium, including instructions for causing one or more computer devices (which may be personal computers, servers, or network devices) to execute all or part of the steps of the method for fast-forwarding and translating the embodiments of the present application.
In the above embodiments of the present application, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the several embodiments provided in the present application, it should be understood that the disclosed client may be implemented in other manners. The above-described embodiments of the apparatus are merely illustrative, and for example, a division of a unit is merely a division of a logic function, and an actual implementation may have another division, for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, and may also be distributed on a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution provided in the embodiment.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The foregoing is only a preferred embodiment of the present application and it should be noted that those skilled in the art can make several improvements and modifications without departing from the principle of the present application, and these improvements and modifications should also be considered as the protection scope of the present application.
Claims (10)
1. A system for shorthand translation, the system comprising:
the method comprises the steps that an operation area is provided with an input keyboard of the video recorder, the input keyboard of the video recorder is used for inputting first text information matched with voice information of an environment where the input keyboard is located, wherein the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language;
and the processor is used for translating the first text information according to a target scheme to obtain second text information belonging to the second language, wherein the target scheme is used for indicating the translation of the first text information.
2. The system of claim 1, further comprising:
a display area for displaying the first text information and the second text information.
3. The system of claim 1, wherein the operating area comprises an operating screen for displaying the first text information.
4. A method of shorthand translation, the method comprising:
acquiring first text information, wherein the first text information is information matched with voice information of an environment where a quick recorder input keyboard is used for inputting, the quick recorder input keyboard is provided by a system operation area, the voice information belongs to a first user, the first user interacts with a second user by using a first language, the first text information belongs to the text information of the first language, and the second user interacts by using a second language;
and translating the first text message according to a target scheme to obtain second text message belonging to the second language, wherein the target scheme is used for indicating the translation of the first text message.
5. The method according to claim 4, wherein after the translating the first text message according to the target solution to obtain the second text message belonging to the second language, the method further comprises:
receiving the second text information;
and displaying the first text information and the second text information by using visualization software.
6. The method according to claim 4, wherein said translating the first text message according to the target solution to obtain the second text message belonging to the second language comprises:
matching the first text information with preset text information in a target scheme;
and under the condition that target text information matched with the preset text information exists in the first text information, translating the first text information by using an intelligent translation engine to obtain the second text information.
7. The method according to any one of claims 4 to 6, wherein after the obtaining the first text information, the method further comprises:
comparing the similarity of the first text information with each text information stored in a text database, wherein the text database is used for correcting the first text information;
determining that the first text information is wrong text information under the condition that the similarity comparison result between the first text information and reference text information is smaller than a preset threshold value, wherein the reference text information is any one of the text information;
replacing the first text information with the reference text information.
8. An apparatus for shorthand rendering, the apparatus comprising:
the device comprises an acquisition unit, a processing unit and a processing unit, wherein the acquisition unit is used for acquiring first text information, the first text information is information matched with voice information of an environment where a quick recorder input keyboard is used for inputting, the quick recorder input keyboard is provided by a system operation area, the voice information belongs to a first user, the first user uses a first language to interact with a second user, the first text information belongs to the first language, and the second user uses a second language to interact with each other;
and the translation unit is used for translating the first text information according to a target scheme to obtain second text information belonging to the second language, wherein the target scheme is used for indicating the translation of the first text information.
9. An electronic device comprising a processor, a communication interface, a memory and a communication bus, wherein said processor, said communication interface and said memory communicate with each other via said communication bus,
the memory for storing a computer program;
the processor for performing the method steps of the shorthand translation of any of claims 4 to 7 by executing the computer program stored on the memory.
10. A computer-readable storage medium, in which a computer program is stored, wherein the computer program is configured to carry out the method steps of the shorthand translation of any of claims 4 to 7 when executed.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110547675.8A CN113377276A (en) | 2021-05-19 | 2021-05-19 | System, method and device for quick recording and translation, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202110547675.8A CN113377276A (en) | 2021-05-19 | 2021-05-19 | System, method and device for quick recording and translation, electronic equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN113377276A true CN113377276A (en) | 2021-09-10 |
Family
ID=77571352
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202110547675.8A Pending CN113377276A (en) | 2021-05-19 | 2021-05-19 | System, method and device for quick recording and translation, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN113377276A (en) |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0728821A (en) * | 1993-07-09 | 1995-01-31 | Fujitsu Ltd | Text processor |
US20090076792A1 (en) * | 2005-12-16 | 2009-03-19 | Emil Ltd | Text editing apparatus and method |
US7539619B1 (en) * | 2003-09-05 | 2009-05-26 | Spoken Translation Ind. | Speech-enabled language translation system and method enabling interactive user supervision of translation and speech recognition accuracy |
JP2010170303A (en) * | 2009-01-22 | 2010-08-05 | Toshiba Corp | Machine translation device and program |
US20140114642A1 (en) * | 2012-10-19 | 2014-04-24 | Laurens van den Oever | Statistical linguistic analysis of source content |
CN107679032A (en) * | 2017-09-04 | 2018-02-09 | 百度在线网络技术(北京)有限公司 | Voice changes error correction method and device |
US20180101366A1 (en) * | 2016-10-11 | 2018-04-12 | Sap Se | Reducing translation volume and ensuring consistent text strings in software development |
CN109522564A (en) * | 2018-12-17 | 2019-03-26 | 北京百度网讯科技有限公司 | Voice translation method and device |
CN111862940A (en) * | 2020-07-15 | 2020-10-30 | 百度在线网络技术(北京)有限公司 | Earphone-based translation method, device, system, equipment and storage medium |
-
2021
- 2021-05-19 CN CN202110547675.8A patent/CN113377276A/en active Pending
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH0728821A (en) * | 1993-07-09 | 1995-01-31 | Fujitsu Ltd | Text processor |
US7539619B1 (en) * | 2003-09-05 | 2009-05-26 | Spoken Translation Ind. | Speech-enabled language translation system and method enabling interactive user supervision of translation and speech recognition accuracy |
US20090076792A1 (en) * | 2005-12-16 | 2009-03-19 | Emil Ltd | Text editing apparatus and method |
JP2010170303A (en) * | 2009-01-22 | 2010-08-05 | Toshiba Corp | Machine translation device and program |
US20140114642A1 (en) * | 2012-10-19 | 2014-04-24 | Laurens van den Oever | Statistical linguistic analysis of source content |
US20180101366A1 (en) * | 2016-10-11 | 2018-04-12 | Sap Se | Reducing translation volume and ensuring consistent text strings in software development |
CN107679032A (en) * | 2017-09-04 | 2018-02-09 | 百度在线网络技术(北京)有限公司 | Voice changes error correction method and device |
CN109522564A (en) * | 2018-12-17 | 2019-03-26 | 北京百度网讯科技有限公司 | Voice translation method and device |
CN111862940A (en) * | 2020-07-15 | 2020-10-30 | 百度在线网络技术(北京)有限公司 | Earphone-based translation method, device, system, equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020186778A1 (en) | Error word correction method and device, computer device, and storage medium | |
US8504350B2 (en) | User-interactive automatic translation device and method for mobile device | |
CN108460026B (en) | Translation method and device | |
CN110059313B (en) | Translation processing method and device | |
CN111832449A (en) | Engineering drawing display method and related device | |
JP7199061B2 (en) | translation device | |
CN110728156B (en) | Translation method and device, electronic equipment and readable storage medium | |
KR20150117914A (en) | Language learning system by a plurality of Users | |
CN112447168A (en) | Voice recognition system and method, sound box, display device and interaction platform | |
CN111062221A (en) | Data processing method, data processing device, electronic equipment and storage medium | |
CN113889092A (en) | Training method, processing method and device of post-processing model of voice recognition result | |
CN113377276A (en) | System, method and device for quick recording and translation, electronic equipment and storage medium | |
CN116303937A (en) | Reply method, reply device, electronic equipment and readable storage medium | |
CN114155841A (en) | Voice recognition method, device, equipment and storage medium | |
CN114417834A (en) | Text processing method and device, electronic equipment and readable storage medium | |
CN113254579A (en) | Voice retrieval method and device and electronic equipment | |
CN113221514A (en) | Text processing method and device, electronic equipment and storage medium | |
CN112466286A (en) | Data processing method and device and terminal equipment | |
CN111161737A (en) | Data processing method and device, electronic equipment and storage medium | |
CN114818748B (en) | Method for generating translation model, translation method and device | |
CN111508484B (en) | Voice data processing method and device | |
US20240153485A1 (en) | Systems and methods for machine-learning based multi-lingual pronunciation generation | |
CN116719914A (en) | Text extraction method, system and related device | |
CN114254630A (en) | Translation method, translation device, electronic equipment and readable storage medium | |
CN112395863A (en) | Text processing method and device |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination |