WO2023197949A1 - Chinese translation method and electronic device - Google Patents

Publication number
WO2023197949A1
Authority
WO
WIPO (PCT)
Prior art keywords
electronic device
user
oral
text information
data
Prior art date
Application number
PCT/CN2023/086870
Other languages
French (fr)
Chinese (zh)
Inventor
谢雨晨
常亚
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2023197949A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G06F16/3329 Natural language query formulation or dialogue systems
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33 Querying
    • G06F16/332 Query formulation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30 Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35 Clustering; Classification
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/279 Recognition of textual entities
    • G06F40/289 Phrasal analysis, e.g. finite state techniques or chunking
    • G06F40/295 Named entity recognition
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00 Teaching, or communicating with, the blind, deaf or mute
    • G PHYSICS
    • G09 EDUCATION; CRYPTOGRAPHY; DISPLAY; ADVERTISING; SEALS
    • G09B EDUCATIONAL OR DEMONSTRATION APPLIANCES; APPLIANCES FOR TEACHING, OR COMMUNICATING WITH, THE BLIND, DEAF OR MUTE; MODELS; PLANETARIA; GLOBES; MAPS; DIAGRAMS
    • G09B21/00 Teaching, or communicating with, the blind, deaf or mute
    • G09B21/009 Teaching or communicating with deaf persons

Definitions

  • This application provides a Chinese translation method that only adds oral movements to keywords when translating text information into sign language, which is beneficial to improving the accuracy of language expression when translating Chinese into sign language.
  • the first aspect provides a Chinese translation method, including: in response to user input, an electronic device obtains text information, the text information includes keywords; the electronic device displays the hand movements corresponding to the text information; the electronic device displays Oral movements corresponding to keywords.
  • The keyword here may be one or more words contained in the text information.
  • the order of word expressions of sign language users when performing sign language movements may be different from the word order of natural spoken language.
  • The order of the hand movements corresponding to the text information displayed on the electronic device may be determined based on the habits of sign language users.
  • This technical solution translates text information into sign language that better matches the expression habits of sign language users, which helps improve the accuracy of the translation result, reduce the probability that sign language users misunderstand the translated sign language, and enhance communication with sign language users.
  • the keyword is determined based on the language habits of the sign language user.
  • Oral movements are not displayed for ordinary words that are not keywords, which helps reduce the amount of data transmitted during translation, improve the efficiency of data transmission and processing, and improve the application experience of electronic device users.
  • the electronic device displays the mouth movement while displaying the hand movement corresponding to the keyword.
  • This technical solution displays the oral movement corresponding to a keyword while the hand movement corresponding to that keyword is displayed.
  • Implementing this technical solution helps keep the hand movements and the oral movements in correspondence, further improves the accuracy of the translation result, and helps sign language users understand the translated sign language.
  • the keyword is a proper noun.
  • the proper nouns may include one or more of the following words: names of people, places, institutions, works, and other proper nouns.
  • Attaching oral movements to proper nouns can help improve sign language users' understanding of difficult-to-understand proper nouns, and can help enhance mutual communication with sign language users.
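  • The keyword-only rule described above can be sketched as follows. This is a minimal illustration, assuming the text has already been segmented into words and the proper nouns already identified; `SignUnit`, `plan_display`, and the sample sentence are hypothetical names chosen for the example and do not come from the application.

```python
from dataclasses import dataclass


@dataclass
class SignUnit:
    word: str
    show_hand: bool   # every word gets a hand movement
    show_mouth: bool  # only keywords get an oral movement


def plan_display(words, keywords):
    """Attach oral movements to keywords only; ordinary words get hand movements alone."""
    keyword_set = set(keywords)
    return [SignUnit(w, True, w in keyword_set) for w in words]


# Hypothetical segmented sentence; "Beijing" stands in for a proper-noun keyword.
plan = plan_display(["I", "go", "Beijing"], ["Beijing"])
for unit in plan:
    print(unit.word, "hand+mouth" if unit.show_mouth else "hand only")
```

    Keeping the decision in one flag per word makes it easy to skip oral movement data for ordinary words, which is the source of the reduced data transmission mentioned above.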
  • Before displaying the oral movements corresponding to the keywords, the electronic device displays a first vocabulary, where the text information includes the first vocabulary and the first vocabulary is a vocabulary recommended for additional oral movements; in response to the user's confirmation operation, the electronic device determines that the first vocabulary is a keyword.
  • Before displaying the oral movements corresponding to the keywords, the electronic device acquires a second vocabulary in response to the user's first input, where the second vocabulary is a vocabulary for which the user requests additional oral movements; when the text information contains the second vocabulary, the electronic device determines the second vocabulary as the keyword; when the text information does not contain the second vocabulary, the electronic device displays update request information, which prompts the user that the text information does not contain the second vocabulary; in response to the user's second input, the electronic device obtains the updated second vocabulary.
  • This technical solution identifies words input by the user that request additional oral movements, and notifies the user of the recognition result of whether the text information contains the words that the user requests additional oral movements.
  • the implementation of this technical solution is conducive to improving the efficiency of Chinese translation into sign language, improving the accuracy of text information translation, and improving the user's application experience.
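  • The check on the user-requested vocabulary can be sketched as below. This is only an illustration under stated assumptions: `resolve_keyword` and the `ask_again` callback are hypothetical names, the containment test is a plain substring match, and re-prompting in a loop is an assumption (the text above describes a single update request).

```python
def resolve_keyword(text, requested_word, ask_again):
    """Return a requested word that the text actually contains.

    `ask_again` is a hypothetical callback standing in for the update request
    prompt; it receives a message and returns the user's updated word.
    """
    word = requested_word
    while word not in text:
        word = ask_again(f"The text does not contain '{word}'. Please enter another word.")
    return word


# Simulated user who first asks for an absent word, then corrects it.
replies = iter(["北京"])
keyword = resolve_keyword("我去北京", "上海", lambda message: next(replies))
print(keyword)
```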
  • The first vocabulary is determined based on the user's translation history; the translation history includes a second vocabulary input by the user, and the second vocabulary is a vocabulary for which the user requests additional oral movements.
  • the second words included in the translation history can, to a certain extent, reflect the language habits and application usage habits of electronic device users.
  • This technical solution recommends vocabulary for additional oral movements to the user based on the user's history of requesting additional oral movements.
  • Implementing this technical solution helps determine the Chinese translation result according to the user's habits, improves translation efficiency, and improves the application experience of electronic device users.
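  • One possible way to derive a recommendation from the translation history is sketched below. Ranking by request frequency is an assumption made for the example, as are the function name `recommend_first_vocabulary`, the `top_n` parameter, and the sample data; the application does not specify how the history is scored.

```python
from collections import Counter


def recommend_first_vocabulary(text_words, history, top_n=3):
    """Suggest words in the current text that the user requested most often before."""
    counts = Counter(history)
    # dict.fromkeys removes duplicates while keeping the order of the text.
    candidates = [w for w in dict.fromkeys(text_words) if counts[w] > 0]
    candidates.sort(key=lambda w: counts[w], reverse=True)
    return candidates[:top_n]


# Hypothetical past requests for additional oral movements.
history = ["Beijing", "Huawei", "Beijing"]
suggested = recommend_first_vocabulary(["I", "visit", "Beijing", "and", "Huawei"], history)
print(suggested)
```

    The suggested words would still be shown to the user for confirmation before being treated as keywords.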
  • the oral movement is determined according to the pronunciation mouth shape of the Chinese pinyin of the keyword.
  • The blend shape (mixed shape) value corresponding to the pronunciation mouth shape is stored in the mouth action database.
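  • A lookup from pinyin to stored mouth-shape values might look like the sketch below. Everything concrete in it is an assumption for illustration: the database is a plain dictionary keyed by syllable, and the shape names and weights are invented rather than taken from the application.

```python
# Hypothetical oral action database: pinyin syllable -> blend shape weights in [0, 1].
# Shape names and numbers are illustrative only.
MOUTH_DB = {
    "bei": {"jawOpen": 0.35, "mouthStretch": 0.40},
    "jing": {"jawOpen": 0.15, "mouthStretch": 0.60},
}
NEUTRAL = {"jawOpen": 0.0}  # closed mouth used when a syllable is not in the database


def oral_movement(pinyin_syllables, db=MOUTH_DB):
    """Return one blend shape frame per syllable, falling back to a neutral mouth."""
    return [db.get(syllable, NEUTRAL) for syllable in pinyin_syllables]


frames = oral_movement(["bei", "jing"])
print(frames[0]["jawOpen"])
```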
  • The hand movement includes a first hand movement and a second hand movement, and the first hand movement precedes the second hand movement; the electronic device receives first hand movement data from the server, where the first hand movement data is used to display the first hand movement; while displaying the first hand movement, the electronic device receives second hand movement data from the server, where the second hand movement data is used to display the second hand movement.
  • The electronic device first receives the hand movement data that needs to be displayed first and, while displaying that hand movement, receives the hand movement data to be displayed later. Transmitting the hand movement data in fragments and displaying while transmitting helps shorten the waiting time for data transmission and improves the user's application experience.
  • The oral action includes a first oral action and a second oral action, and the first oral action precedes the second oral action; the electronic device receives first oral movement data from the server, where the first oral movement data is used to display the first oral action; while displaying the first oral action, the electronic device receives second oral movement data from the server, where the second oral movement data is used to display the second oral action.
  • first oral movement or the second oral movement here may be a specific action, or may be an action picture contained in one or more frames of a specific action.
  • The electronic device first receives the oral action data that needs to be displayed first and, while displaying that oral action, receives the oral action data to be displayed later. Transmitting the oral action data in fragments and displaying while transmitting helps shorten the waiting time for data transmission and improves the user's application experience.
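  • The fragmented "display while transmitting" idea can be sketched with a producer thread and a queue. This is a simplified stand-in, assuming segments arrive in display order; `receive_segments`, `display_while_receiving`, and the segment labels are hypothetical, and a real client would read from the network and render frames instead of appending to a list.

```python
import queue
import threading


def receive_segments(segments, buffer):
    """Stand-in for the network receiver: push segments in display order."""
    for segment in segments:
        buffer.put(segment)
    buffer.put(None)  # sentinel: no more data


def display_while_receiving(segments):
    """Show each segment as soon as it arrives instead of waiting for all of them."""
    buffer = queue.Queue()
    receiver = threading.Thread(target=receive_segments, args=(segments, buffer))
    receiver.start()
    shown = []
    while (segment := buffer.get()) is not None:
        shown.append(segment)  # a real client would render the frames here
    receiver.join()
    return shown


shown = display_while_receiving(["hand-1", "hand-2", "mouth-1"])
print(shown)
```

    Because display starts after the first segment arrives, the perceived waiting time depends on the size of one fragment rather than on the size of the whole translation.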
  • A Chinese translation method is provided, including: the server receives a translation request message, where the translation request message includes text information, the text information includes keywords, the keywords are determined according to the language habits of the sign language user, the translation request message is used to request the hand movement data corresponding to the text information, and the translation request message is also used to request the oral movement data corresponding to the keywords; the server determines, based on the text information, whether to send the hand movement data and/or the oral movement data, where the hand movement data is used to display the hand movement corresponding to the text information and the oral movement data is used to display the oral movement corresponding to the keywords.
  • The keyword is identified by the electronic device from the text information input by the user in one or more of the following ways: based on the content of the text information, based on the user's translation history information, or by another method in which the user determines the keywords in the text information.
  • the keyword here may be one or more words contained in the text information or one or more words contained in the text information.
  • the keyword is a proper noun.
  • The server determines whether the text information contains sensitive information; if the text information contains sensitive information, the server sends a first response message, which indicates that the text information contains sensitive information; if the text information does not contain sensitive information, the server sends a second response message, which includes the hand movement data and/or the mouth movement data.
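  • The two server responses can be sketched as follows. The sketch assumes a simple block-list check and dictionary-shaped messages; `SENSITIVE_TERMS`, `handle_translation_request`, and the field names are hypothetical, since the application does not define how sensitive information is detected or how the messages are encoded.

```python
SENSITIVE_TERMS = {"secret-term"}  # hypothetical block list, not from the application


def handle_translation_request(text, keywords):
    """Return a first response for sensitive text, otherwise a second response with data."""
    if any(term in text for term in SENSITIVE_TERMS):
        return {"message": "first_response", "contains_sensitive_info": True}
    return {
        "message": "second_response",
        "hand_movement_data": f"hand-data({text})",
        # Oral data is produced only for keywords that actually occur in the text.
        "oral_movement_data": [f"oral-data({k})" for k in keywords if k in text],
    }


ok = handle_translation_request("I go Beijing", ["Beijing"])
blocked = handle_translation_request("a secret-term here", [])
print(ok["message"], blocked["message"])
```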
  • The hand movement data includes first hand movement data and second hand movement data; the first hand movement data is used to display the first hand movement, the second hand movement data is used to display the second hand movement, and the server sends the second hand movement data after sending the first hand movement data.
  • first hand movement or the second hand movement here may be a specific action, or may be an action picture contained in one or more frames of a specific action.
  • The server first sends the hand movement data that needs to be displayed first, and sends the hand movement data to be displayed later while the earlier hand movement is being displayed. Transmitting the hand movement data in fragments and transmitting while displaying helps shorten the waiting time for data transmission and improves the user's application experience.
  • The mouth action data includes first mouth action data and second mouth action data; the first mouth action data is used to display the first mouth action, the second mouth action data is used to display the second mouth action, the first mouth action precedes the second mouth action, and the server sends the second mouth action data after sending the first mouth action data.
  • first oral movement or the second oral movement here may be a specific action, or may be an action picture contained in one or more frames of a specific action.
  • The server first sends the mouth movement data that needs to be displayed first, and sends the mouth movement data to be displayed later while the earlier mouth movement is being displayed. Transmitting the mouth movement data in fragments and transmitting while displaying helps shorten the waiting time for data transmission and improves the user's application experience.
  • The server obtains the oral action data from an oral action database, where the oral action database contains the blend shape (mixed shape) values corresponding to the pronunciation mouth shapes of Chinese Pinyin.
  • By establishing a database for the mouth movement data, when a mouth movement needs to be displayed, the electronic device sends a request message to the server, and the server retrieves the required mouth movement data from the database.
  • An electronic device is provided, including a processor and a memory, where the memory stores one or more computer programs, and the one or more computer programs include instructions that, when executed by the processor, cause the processor to: obtain text information in response to user input, where the text information includes keywords and the keywords are determined according to the language habits of the sign language user; display the hand movements corresponding to the text information; and display the oral movements corresponding to the keywords.
  • the processor is also configured to not display oral movements corresponding to common words, the text information includes the common words, and the common words are different from the keywords.
  • The processor is further configured to display a first vocabulary, where the text information includes the first vocabulary and the first vocabulary is a vocabulary recommended for additional oral movements; in response to the user's confirmation operation, the processor is also configured to determine that the first vocabulary is a keyword.
  • In response to the user's first input, the processor is configured to obtain a second vocabulary, where the second vocabulary is a vocabulary for which the user requests additional oral movements; when the text information contains the second vocabulary, the processor is also used to determine that the second vocabulary is a keyword; when the text information does not contain the second vocabulary, the processor is also used to display update request information, which prompts the user that the text information does not contain the second vocabulary; in response to the user's second input, the processor is also used to obtain the updated second vocabulary.
  • The hand movement includes a first hand movement and a second hand movement, and the first hand movement precedes the second hand movement; the processor is also used to receive first hand movement data from the server, where the first hand movement data is used to display the first hand movement; while displaying the first hand movement, the processor is also used to receive second hand movement data from the server, where the second hand movement data is used to display the second hand movement.
  • The oral action includes a first oral action and a second oral action, and the first oral action precedes the second oral action; the processor is also used to receive first oral movement data from the server, where the first oral movement data is used to display the first oral action; while displaying the first oral action, the processor is also used to receive second oral movement data from the server, where the second oral movement data is used to display the second oral action.
  • A server is provided, including a processor and a memory, where the memory stores one or more computer programs and the one or more computer programs include instructions. When the instructions are executed by the processor, the processor is used to: receive a translation request message, where the translation request message includes text information, the translation request message is used to request the hand movement data corresponding to the text information, the text information includes keywords, the keywords are determined according to the language habits of the sign language user, and the translation request message is also used to request the oral movement data corresponding to the keywords; and determine, based on the text information, whether to send the hand movement data and/or the mouth movement data.
  • The processor is also used to determine whether the text information contains sensitive information; when the text information contains sensitive information, the processor is also used to send a first response message, which indicates that the text information contains sensitive information; when the text information does not contain sensitive information, the processor is also used to send a second response message, which includes the hand movement data and/or the oral movement data.
  • The hand movement data includes first hand movement data and second hand movement data; the first hand movement data is used to display the first hand movement, the second hand movement data is used to display the second hand movement, and the processor is further configured to send the second hand movement data after sending the first hand movement data.
  • The mouth action data includes first mouth action data and second mouth action data; the first mouth action data is used to display the first mouth action, the second mouth action data is used to display the second mouth action, the first mouth action precedes the second mouth action, and the processor is further configured to send the second mouth action data after sending the first mouth action data.
  • The processor is further configured to obtain the oral action data from an oral action database, where the oral action database contains the blend shape (mixed shape) values corresponding to the pronunciation mouth shapes of Chinese Pinyin.
  • A Chinese translation device is provided, including an acquisition unit and a processing unit. The acquisition unit is used to acquire text information in response to user input, where the text information includes keywords and the keywords are determined according to the language habits of the sign language user; the processing unit is used to display the hand movements corresponding to the text information; the processing unit is also used to display the mouth movements corresponding to the keywords.
  • the processing unit is also configured to not display oral movements corresponding to common words, the text information includes the common words, and the common words are different from the keywords.
  • the processing unit is further configured to display the mouth movement while displaying the hand movement corresponding to the keyword.
  • The processing unit is also used to display a first vocabulary, where the text information includes the first vocabulary and the first vocabulary is a vocabulary recommended for additional oral movements; the processing unit is also used to determine that the first vocabulary is a keyword.
  • The acquisition unit is further configured to acquire a second vocabulary in response to the user's first input, where the second vocabulary is a vocabulary for which the user requests additional oral movements; when the text information contains the second vocabulary, the processing unit is also used to determine that the second vocabulary is a keyword; when the text information does not contain the second vocabulary, the processing unit is also used to display an update request message, which prompts the user that the text information does not contain the second vocabulary; the acquisition unit is also used to obtain the updated second vocabulary in response to the user's second input.
  • The Chinese translation device further includes a communication unit. The hand movement includes a first hand movement and a second hand movement, and the first hand movement precedes the second hand movement; before the hand movement corresponding to the text information is displayed, the communication unit is used to receive first hand movement data from the server, where the first hand movement data is used to display the first hand movement; the communication unit is further configured to receive second hand movement data from the server while the first hand movement is displayed, where the second hand movement data is used to display the second hand movement.
  • The oral action includes a first oral action and a second oral action, and the first oral action precedes the second oral action; the communication unit is also used to receive first oral movement data from the server, where the first oral movement data is used to display the first oral action; while the first oral action is displayed, the communication unit is also used to receive second oral movement data from the server, where the second oral movement data is used to display the second oral action.
  • Before the hand movement corresponding to the text information is displayed, the communication unit is also used to receive a response message from the server, where the response message indicates that the text information does not contain sensitive information.
  • A Chinese translation device is provided, including a communication unit and a processing unit. The communication unit is used to receive a translation request message, where the translation request message includes text information, the translation request message is used to request the hand movement data corresponding to the text information, the text information includes keywords, the keywords are determined according to the language habits of the sign language user, and the translation request message is also used to request the oral movement data corresponding to the keywords. The processing unit is used to determine, based on the text information, whether to send the hand movement data and/or the mouth movement data.
  • The processing unit is also used to determine whether the text information contains sensitive information; when the text information contains sensitive information, the communication unit is also used to send a first response message, which indicates that the text information contains sensitive information; when the text information does not contain sensitive information, the communication unit is also used to send a second response message, which includes the hand movement data and/or the oral movement data.
  • The processing unit is further configured to obtain the oral action data from an oral action database, where the oral action database contains the blend shape (mixed shape) values corresponding to the pronunciation mouth shapes of Chinese Pinyin.
  • a computer program product includes computer program code.
  • A computer-readable storage medium is provided, in which computer instructions are stored; when the computer instructions are run, the method in the first aspect or any possible implementation thereof is executed.
  • A computer-readable storage medium is provided, in which computer instructions are stored; when the computer instructions are run, the method in the second aspect or any possible implementation thereof is executed.
  • An eleventh aspect provides a chip, including a processor for reading instructions stored in a memory; when the processor executes the instructions, the chip implements the method in the first aspect or any possible implementation thereof.
  • FIG. 1 is a schematic diagram of the hardware architecture of an electronic device applicable to an embodiment of the present application.
  • Figure 4 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • Figure 5 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • Figure 9 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • Figure 10 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • Figure 11 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • Figure 13 is a schematic diagram of a Chinese translation device provided by an embodiment of the present application.
  • Figure 14 is a schematic diagram of another Chinese translation device provided by an embodiment of the present application.
  • Figure 15 is a schematic diagram of an electronic device provided by an embodiment of the present application.
  • Figure 16 is a schematic diagram of a server provided by an embodiment of the present application.
  • FIG. 1 shows a schematic structural diagram of an electronic device 100 .
  • The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headphone interface 170D, a sensor module 180, a button 190, a motor 191, an indicator 192, a camera 193, a display screen 194, and a subscriber identification module (SIM) card interface 195, etc.
  • The sensor module 180 may include a pressure sensor 180A, a gyro sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, an ambient light sensor 180L, a bone conduction sensor 180M, etc.
  • the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100 .
  • the electronic device 100 may include more or fewer components than shown in the figures, or some components may be combined, some components may be separated, or some components may be arranged differently.
  • the components illustrated may be implemented in hardware, software, or a combination of software and hardware.
  • the processor 110 may include one or more processing units.
  • The processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), etc.
  • different processing units can be independent devices or integrated in one or more processors.
  • the controller may be the nerve center and command center of the electronic device 100 .
  • the controller can generate operation control signals based on the instruction operation code and timing signals to complete the control of fetching and executing instructions.
  • the processor 110 may also be provided with a memory for storing instructions and data.
  • The memory in the processor 110 is a cache memory. This memory may hold instructions or data that the processor 110 has just used or uses repeatedly. If the processor 110 needs to use the instructions or data again, it can fetch them directly from this memory, which avoids repeated access, reduces the waiting time of the processor 110, and thus improves the efficiency of the system.
  • processor 110 may include one or more interfaces.
  • Interfaces may include integrated circuit (inter-integrated circuit, I2C) interface, integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, pulse code modulation (pulse code modulation, PCM) interface, universal asynchronous receiver and transmitter (universal asynchronous receiver/transmitter (UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and /or universal serial bus (USB) interface, etc.
  • the I2C interface is a bidirectional synchronous serial bus, including a serial data line (SDA) and a serial clock line (SCL).
  • processor 110 may include multiple sets of I2C buses.
  • the processor 110 can separately couple the touch sensor 180K, charger, flash, camera 193, etc. through different I2C bus interfaces.
  • the processor 110 can be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to implement the touch function of the electronic device 100 .
  • the I2S interface can be used for audio communication.
  • processor 110 may include multiple sets of I2S buses.
  • the processor 110 can be coupled with the audio module 170 through the I2S bus to implement communication between the processor 110 and the audio module 170 .
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface to implement the function of answering calls through a Bluetooth headset.
  • the PCM interface can also be used for audio communications to sample, quantize and encode analog signals.
  • the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
  • the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface to implement the function of answering calls through a Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
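The sample, quantize and encode steps mentioned for the PCM interface can be illustrated with a minimal sketch. It assumes 16-bit signed little-endian samples and an 8 kHz sampling rate; neither value comes from this application, and the function names are hypothetical.

```python
import math
import struct

def sample_sine(freq_hz, rate_hz, count):
    """Sample a unit-amplitude sine wave at rate_hz (stand-in for an analog input)."""
    return [math.sin(2 * math.pi * freq_hz * n / rate_hz) for n in range(count)]

def pcm_encode_16bit(samples):
    """Quantize floats in [-1.0, 1.0] to 16-bit codes and pack them little-endian."""
    max_code = 32767  # largest positive 16-bit signed value
    codes = [round(max(-1.0, min(1.0, s)) * max_code) for s in samples]
    payload = struct.pack(f"<{len(codes)}h", *codes)
    return codes, payload

# Assumed example values: a 1 kHz tone sampled at 8 kHz, 8 samples.
codes, payload = pcm_encode_16bit(sample_sine(1000, 8000, 8))
print(codes)
print(len(payload), "bytes")
```

Each sample becomes two bytes in the payload, which is the kind of data stream an audio bus would carry.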
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus can be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • a UART interface is generally used to connect the processor 110 and the wireless communication module 160 .
  • the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface to implement the function of playing music through a Bluetooth headset.
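The conversion between parallel and serial data performed by a UART can be sketched as follows. The sketch assumes the common 8N1 framing (one start bit, eight data bits sent least significant bit first, one stop bit); this application does not specify a frame format, and the function names are hypothetical.

```python
def uart_frame(byte):
    """Serialize one byte into a 10-bit 8N1 frame: start bit, 8 data bits (LSB first), stop bit."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("byte must be in range 0..255")
    data_bits = [(byte >> i) & 1 for i in range(8)]
    return [0] + data_bits + [1]

def uart_unframe(bits):
    """Rebuild the byte from a 10-bit 8N1 frame, checking the start and stop bits."""
    if len(bits) != 10 or bits[0] != 0 or bits[9] != 1:
        raise ValueError("invalid 8N1 frame")
    return sum(bit << i for i, bit in enumerate(bits[1:9]))

frame = uart_frame(0x41)
print(frame)
print(hex(uart_unframe(frame)))
```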
  • the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 .
  • MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc.
  • the processor 110 and the camera 193 communicate through the CSI interface to implement the shooting function of the electronic device 100 .
  • the processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100 .
  • the GPIO interface can be configured through software.
  • the GPIO interface can be configured as a control signal or as a data signal.
  • the GPIO interface can be used to connect the processor 110 with the camera 193, display screen 194, wireless communication module 160, audio module 170, sensor module 180, etc.
  • the GPIO interface can also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface 130 is an interface that complies with the USB standard specification, and may be a Mini USB interface, a Micro USB interface, a USB Type C interface, etc.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through them. This interface can also be used to connect other electronic devices, such as AR devices, etc.
  • the interface connection relationships between the modules illustrated in the embodiments of the present application are only schematic illustrations and do not constitute a structural limitation of the electronic device 100 .
  • the electronic device 100 may also adopt interface connection methods different from those in the above embodiments, or a combination of multiple interface connection methods.
  • the charging management module 140 is used to receive charging input from the charger.
  • the charger can be a wireless charger or a wired charger.
  • the charging management module 140 may receive charging input from the wired charger through the USB interface 130 .
  • the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device 100 . While the charging management module 140 charges the battery 142, it can also provide power to the electronic device through the power management module 141.
  • the power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110.
  • the power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, internal memory 121, external memory, display screen 194, camera 193, wireless communication module 160, etc.
  • the power management module 141 can also be used to monitor battery capacity, battery cycle times, battery health status (leakage, impedance) and other parameters.
  • the power management module 141 may also be provided in the processor 110 .
  • the power management module 141 and the charging management module 140 may also be provided in the same device.
  • the wireless communication function of the electronic device 100 can be implemented through the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor and the baseband processor.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be multiplexed to improve antenna utilization. For example: Antenna 1 can be multiplexed as a diversity antenna for a wireless LAN. In other embodiments, antennas may be used in conjunction with tuning switches.
  • the mobile communication module 150 can provide solutions for wireless communication, including 2G/3G/4G/5G, applied on the electronic device 100.
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), etc.
  • the mobile communication module 150 can receive electromagnetic waves through the antenna 1, perform filtering, amplification and other processing on the received electromagnetic waves, and transmit them to the modem processor for demodulation.
  • the mobile communication module 150 can also amplify the signal modulated by the modem processor and convert it into electromagnetic waves through the antenna 1 for radiation.
  • at least part of the functional modules of the mobile communication module 150 may be disposed in the processor 110 .
  • at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be provided in the same device.
  • a modem processor may include a modulator and a demodulator.
  • the modulator is used to modulate the low-frequency baseband signal to be sent into a medium-high frequency signal.
  • the demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal.
  • the demodulator then transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
  • the application processor outputs sound signals through audio devices (including but not limited to speaker 170A, receiver 170B, etc.), or displays images or videos through display screen 194.
  • the modem processor may be a stand-alone device.
  • the modem processor may be independent of the processor 110 and may be provided in the same device as the mobile communication module 150 or other functional modules.
  • the wireless communication module 160 can provide wireless communication solutions applied on the electronic device 100, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology.
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
  • the wireless communication module 160 can also receive the signal to be sent from the processor 110, frequency modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 for radiation.
  • the GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the Beidou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS) and/or satellite based augmentation systems (SBAS).
  • the ISP is used to process the data fed back by the camera 193. For example, when taking a photo, the shutter is opened, the light is transmitted to the camera sensor through the lens, the optical signal is converted into an electrical signal, and the camera sensor passes the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin color. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 193.
  • Camera 193 is used to capture still images or video.
  • the object passes through the lens to produce an optical image that is projected onto the photosensitive element.
  • the photosensitive element can be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • the photosensitive element converts the optical signal into an electrical signal, and then passes the electrical signal to the ISP to convert it into a digital image signal.
  • ISP outputs digital image signals to DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other format image signals.
  • the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
  • Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy.
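As a rough illustration of performing a Fourier transform on the energy at a frequency point, the sketch below evaluates a single discrete Fourier transform (DFT) bin and returns its squared magnitude. The bin indices, signal length and function name are assumptions for illustration; the application does not describe how the digital signal processor implements this step.

```python
import cmath
import math

def bin_energy(samples, k):
    """Return the squared magnitude of DFT bin k for a real-valued sample block."""
    n = len(samples)
    coeff = sum(x * cmath.exp(-2j * cmath.pi * k * i / n) for i, x in enumerate(samples))
    return abs(coeff) ** 2

# Assumed test signal: a cosine that completes exactly 2 cycles in 16 samples.
tone = [math.cos(2 * math.pi * 2 * i / 16) for i in range(16)]
print(bin_energy(tone, 2))  # large: the tone sits on this bin
print(bin_energy(tone, 5))  # close to zero: no energy at this bin
```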
  • Video codecs are used to compress or decompress digital video.
  • Electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos in multiple encoding formats, such as moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
  • NPU is a neural network (NN) computing processor.
  • Intelligent cognitive applications of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, etc.
  • the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100.
  • the external memory card communicates with the processor 110 through the external memory interface 120 to implement the data storage function. For example, files such as music and videos are saved on the external memory card.
  • Internal memory 121 may be used to store computer executable program code, which includes instructions.
  • the processor 110 executes instructions stored in the internal memory 121 to execute various functional applications and data processing of the electronic device 100 .
  • the internal memory 121 may include a program storage area and a data storage area. The program storage area can store an operating system and at least one application program required for a function (such as a sound playback function, an image playback function, etc.).
  • the data storage area may store data created during use of the electronic device 100 (such as audio data, phone book, etc.).
  • the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one disk storage device, flash memory device, universal flash storage (UFS), etc.
  • the electronic device 100 can implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, and the application processor.
  • Speaker 170A, also called a "loudspeaker", is used to convert audio electrical signals into sound signals.
  • the electronic device 100 can play music or hands-free calls through the speaker 170A.
  • Receiver 170B, also called an "earpiece", is used to convert audio electrical signals into sound signals.
  • when the electronic device 100 answers a call or plays a voice message, the voice can be heard by bringing the receiver 170B close to the human ear.
  • Microphone 170C, also called a "mic", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which in addition to collecting sound signals, may also implement a noise reduction function. In other embodiments, the electronic device 100 can also be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions, etc.
  • the headphone interface 170D is used to connect wired headphones.
  • the headphone interface 170D may be a USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, or a Cellular Telecommunications Industry Association of the USA (CTIA) standard interface.
  • the keys 190 include a power key, a volume key, etc.
  • A key 190 may be a mechanical key or a touch key.
  • the electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
  • the motor 191 can generate vibration prompts.
  • the motor 191 can be used for vibration prompts for incoming calls and can also be used for touch vibration feedback.
  • touch operations for different applications can correspond to different vibration feedback effects.
  • touch operations in different areas of the display screen 194 can also correspond to different vibration feedback effects of the motor 191.
  • Different application scenarios (such as time reminders, receiving information, alarm clocks, games, etc.) can also correspond to different vibration feedback effects.
  • the touch vibration feedback effect can also be customized.
  • the indicator 192 may be an indicator light, which may be used to indicate charging status, power changes, or may be used to indicate messages, missed calls, notifications, etc.
  • the SIM card interface 195 is used to connect a SIM card.
  • the SIM card can be connected to or separated from the electronic device 100 by inserting it into the SIM card interface 195 or pulling it out from the SIM card interface 195 .
  • the electronic device 100 can support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
  • SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card, etc. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different.
  • the SIM card interface 195 is also compatible with different types of SIM cards.
  • the SIM card interface 195 is also compatible with external memory cards.
  • the electronic device 100 interacts with the network through the SIM card to implement functions such as calls and data communications.
  • the electronic device 100 uses an eSIM, that is, an embedded SIM card.
  • the eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device.
  • the phone card in the embodiment of the present application includes but is not limited to SIM card, eSIM card, universal subscriber identity module (USIM), universal integrated circuit card (UICC), etc.
  • the software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture.
  • the embodiment of this application takes the Android system with a layered architecture as an example to illustrate the software structure of the electronic device 100 .
  • the application framework layer provides an application programming interface (API) and programming framework for applications in the application layer.
  • the application framework layer includes some predefined functions.
  • the application framework layer can include a window manager, content provider, view system, phone manager, resource manager, notification manager, etc.
  • a window manager is used to manage window programs.
  • the window manager can obtain the display size, determine whether there is a status bar, lock the screen, capture the screen, etc.
  • Content providers are used to store and retrieve data and make this data accessible to applications.
  • Said data can include videos, images, audio, calls made and received, browsing history and bookmarks, phone books, etc.
  • the view system includes visual controls, such as controls that display text, controls that display pictures, etc.
  • a view system can be used to build applications.
  • the display interface can be composed of one or more views.
  • a display interface including a text message notification icon may include a view for displaying text and a view for displaying pictures.
  • the phone manager is used to provide communication functions of the electronic device 100 .
  • for example, the phone manager manages call status (including connected, hung up, etc.).
  • the resource manager provides various resources to applications, such as localized strings, icons, pictures, layout files, video files, etc.
  • the notification manager allows applications to display notification information in the status bar, which can be used to convey notification-type messages and can automatically disappear after a short stay without user interaction.
  • the notification manager is used to notify download completion, message reminders, etc.
  • the notification manager can also present notifications that appear in the status bar at the top of the system in the form of charts or scroll bar text, such as notifications for applications running in the background, or notifications that appear on the screen in the form of dialog windows. For example, text information is prompted in the status bar, a beep sounds, the electronic device vibrates, or the indicator light flashes.
  • Android runtime includes core libraries and virtual machines. Android runtime is responsible for the scheduling and management of the Android system.
  • the application layer and application framework layer run in virtual machines.
  • the virtual machine executes the Java files of the application layer and application framework layer as binary files.
  • the virtual machine is used to perform object life cycle management, stack management, thread management, security and exception management, and garbage collection and other functions.
  • System libraries can include multiple functional modules. For example: surface manager (surface manager), media libraries (media libraries), 3D graphics processing libraries (for example: OpenGL ES), 2D graphics engines (for example: SGL), etc.
  • the surface manager is used to manage the display subsystem and provides the fusion of 2D and 3D layers for multiple applications.
  • the media library supports playback and recording of a variety of commonly used audio and video formats, as well as static image files, etc.
  • the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
  • the 3D graphics processing library is used to implement 3D graphics drawing, image rendering, composition, and layer processing.
  • 2D Graphics Engine is a drawing engine for 2D drawing.
  • the kernel layer is the layer between hardware and software.
  • the kernel layer contains at least display driver, camera driver, audio driver, and sensor driver.
  • Chinese sign language (CSL).
  • Automatic speech recognition (ASR): also called speech to text (STT). Its goal is to use computers to automatically convert human speech content into corresponding text.
  • Optical character recognition (OCR).
  • Software development kit (SDK).
  • Blendshape: a technology that operates on the vertices of a three-dimensional model mesh to achieve a defined shape, which can be used to control the facial expressions of virtual characters.
  • Digital human: refers to the use of computer technology to digitize the human body structure so that a visible and controllable virtual human body form appears on the computer screen. The functional information of the human body is further attached to this body-form framework, and through cross-fusion with virtual reality technology, this "digital human" can imitate real people and make various reactions. If equipped with sound and force feedback devices, it can also provide an intuitive and natural real-time sense of sight, hearing, touch, etc.
  • Sign language is a language that uses a visual-gestural mode rather than an auditory-vocal one: it uses body movements and facial expressions to express and convey meaning.
  • Part of speech refers to the grammatical characteristics of a word and serves as the basis for classifying words.
  • Modern Chinese words can be divided into two categories: content words and function words.
  • Content words are words that can act alone as syntactic components, or that mostly serve as the main components of sentences. They have both lexical and grammatical meaning. They include nouns, verbs, adjectives, adverbs, numerals, quantifiers, pronouns and onomatopoeia.
  • Function words cannot act alone as syntactic components, or mostly serve as auxiliary components of sentences. They have only grammatical meaning. They include prepositions, conjunctions, particles and interjections.
  • Table 1 gives a classification method for parts of speech, in which proper nouns can include: names of people, place names, institutional groups, work titles and other proper nouns.
  • Figure 3 is a schematic diagram of a Chinese translation method provided by an embodiment of the present application. The following takes the process of an electronic device using App1 to translate text information into corresponding sign language as an example to introduce the Chinese translation method provided by an embodiment of the present application.
  • the user of App1 (i.e., the electronic device user, or the user)
  • the input function control 304 can be used to input one or more of the following data types to App1: text (for example, the content shown in 303), images, documents, audio, video, etc.
  • App1 can directly obtain the text information contained in the text input by the electronic device user.
  • the text may be manually input by the electronic device user, or may be one or more texts provided by App1 (common sentences built into App1), and the electronic device user selects from the one or more texts.
  • When the electronic device user inputs an image, App1 recognizes the text information contained in the image through OCR after receiving the image data.
  • App1 parses the document after receiving the document data to obtain the text information contained in the document.
  • App1 recognizes the text information contained in the audio or video data through ASR and/or OCR after receiving the audio or video data. For example, when the video data received by App1 contains subtitles, App1 can identify the text information in the video through OCR. When the video data received by App1 contains audio data, App1 can identify the text information contained in the video data through ASR. When the video data received by App1 contains both subtitles and audio data, App1 can use ASR and OCR at the same time to identify the text information contained in the video, and proofread the two results against each other to improve the accuracy of text recognition.
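One possible form of the mutual proofreading between ASR and OCR results is sketched below: the two transcripts are aligned word by word, and every span where they disagree is reported so that it can be re-checked or shown to the user. This is only an illustration under assumed inputs; the application does not specify the proofreading algorithm, and `cross_check` is a hypothetical name.

```python
from difflib import SequenceMatcher

def cross_check(asr_text, ocr_text):
    """Align two transcripts word by word and list the spans that disagree.

    Returns a list of (asr_span, ocr_span) pairs; an empty list means the
    two recognition results agree.
    """
    asr_words, ocr_words = asr_text.split(), ocr_text.split()
    matcher = SequenceMatcher(None, asr_words, ocr_words, autojunk=False)
    conflicts = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag != "equal":
            conflicts.append((" ".join(asr_words[i1:i2]), " ".join(ocr_words[j1:j2])))
    return conflicts

# Assumed recognition results for the same video.
asr_result = "John went to the cinema this morning."
ocr_result = "John went to the cinema this afternoon."
print(cross_check(asr_result, ocr_result))
```

A non-empty result could trigger the user confirmation step rather than silently choosing one recognizer over the other.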
  • When App1 obtains the text information shown in 303, "John went to the cinema this afternoon.", App1 can obtain the hand movement data corresponding to the text information based on the text information, and then use the hand movement data to drive the virtual character model, so that the virtual character model displays the hand movements corresponding to the text information.
  • Before App1 translates the obtained text into sign language, App1 also identifies the parts of speech of the words contained in the data to be translated that the user input. For proper nouns, App1 also obtains the corresponding oral movement data. When the avatar shows the hand movements of a proper noun, the avatar, driven by the oral movement data, also shows the mouth movements corresponding to the proper noun.
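The step of attaching oral movement data only to proper nouns can be sketched as follows. The part-of-speech tags, the lookup tables and the `Clip` structure are all hypothetical placeholders; real motion data would be far richer than the strings used here, and the application does not disclose its data format.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Clip:
    word: str
    hand: str                    # placeholder for hand movement data
    mouth: Optional[str] = None  # placeholder for oral movement data

def build_clips(tagged_words, hand_data, mouth_data):
    """Pair every word with hand data; add mouth data only for proper nouns."""
    clips = []
    for word, pos in tagged_words:
        mouth = mouth_data.get(word) if pos == "proper_noun" else None
        clips.append(Clip(word, hand_data[word], mouth))
    return clips

# Assumed output of a part-of-speech tagger and assumed lookup tables.
tagged = [("John", "proper_noun"), ("cinema", "noun")]
hand_table = {"John": "hand-John", "cinema": "hand-cinema"}
mouth_table = {"John": "mouth-John", "cinema": "mouth-cinema"}
for clip in build_clips(tagged, hand_table, mouth_table):
    print(clip)
```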
  • App1 displays prompt information 302 in response to the user's operation.
  • the prompt information is used to prompt the electronic device user to use the method or step of App1.
  • the prompt information can be used to prompt the electronic device user to input data to be translated into App1 through the input function control 304.
  • the prompt information can also be used to prompt the user of the electronic device to input words that require additional oral movements.
  • App1 can also display processing status information. For example, as shown at 301, the status information indicates the action App1 is currently executing or the action the user is performing.
  • Figure 4 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
  • The following takes App1 using OCR to recognize text information in an image as an example to illustrate how App1 processes data to be translated, such as images and documents, from which text information must first be recognized.
  • the electronic device user inputs a picture containing the text information "John went to the cinema this afternoon.” to App1 through the input function control. After receiving the user's input image, App1 recognizes the text information in the image through OCR.
  • If App1 recognizes the text information correctly ("John went to the cinema this afternoon."), App1 displays the text recognition result confirmation prompt window (as shown in (a) in Figure 4). The user clicks "Confirm", and App1 obtains the user's confirmation instruction and performs the next operation, which is the operation shown in (c) in Figure 4.
  • If App1 recognizes the text information incorrectly ("John went to the cinema this morning."), App1 displays the text recognition result confirmation prompt window, and the user clicks "Modify" after confirming that the recognition is incorrect. App1 obtains the user's modification instruction and displays the prompt window for modifying the text recognition result, as shown in (b) in Figure 4.
  • the user clicks "Confirm” after inputting the correct text information ("John went to the cinema this afternoon.”).
  • App1 obtains the modified text information and performs the next operation, which is the operation shown in (c) in Figure 4.
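The confirm-or-modify flow described for the recognition result can be summarized in a short sketch. The action names "confirm" and "modify" mirror the buttons in the prompt window; the function name and error handling are assumptions made for illustration.

```python
def resolve_text(recognized, action, corrected=None):
    """Return the text to translate after the user confirms or modifies the OCR result."""
    if action == "confirm":
        return recognized
    if action == "modify":
        if not corrected:
            raise ValueError("a corrected text is required when modifying")
        return corrected
    raise ValueError(f"unknown action: {action}")

# Correct recognition: the user confirms.
print(resolve_text("John went to the cinema this afternoon.", "confirm"))
# Incorrect recognition: the user supplies the correct text.
print(resolve_text("John went to the cinema this morning.", "modify",
                   "John went to the cinema this afternoon."))
```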
  • App1 can translate the text information into corresponding hand movements after obtaining the text information confirmed by the user.
  • Before App1 translates the confirmed text information into hand movements, it displays the operation prompt message "Please enter the keywords that require additional oral movements:", and the electronic device user enters "John" into App1 through the input function control according to the prompt. In response to the user's input, App1 obtains the keyword "John", obtains the hand movement data corresponding to the text information based on the text information, and also obtains the oral movement data corresponding to the keyword "John", so that App1 can use the obtained hand movement data and the oral movement data of the keyword to drive the virtual character to display the corresponding hand movements and oral movements.
  • Alternatively, before App1 translates the confirmed text information into hand movements, it analyzes the text information and determines that it contains the proper noun "John". App1 uses the proper noun as a keyword that requires additional oral movements and, while obtaining the hand movement data corresponding to the text information, also obtains the oral movement data corresponding to the proper noun, so that App1 can use the obtained hand movement data and the oral movement data of the keyword to drive the virtual character to display the corresponding hand movements and oral movements.
  • keywords can be words containing one or more Chinese characters.
  • the electronic device user inputs text information to be translated as “Please turn off the faucet after using up water.”
  • App1 displays the prompt message “Please enter keywords that require additional oral movements:”.
  • the user of the electronic device inputs the keywords "turn off, faucet" according to the above prompt information.
  • App1 checks and determines that the text information contains the keyword entered by the user, and then performs the corresponding translation operation.
  • the user of the electronic device inputs the keyword: "faucet” according to the above prompt information.
  • App1 checks and determines that the text information contains the keyword entered by the user, but App1 cannot obtain the mouth movement data corresponding to the keyword, so App1 issues a prompt message.
  • the prompt message may be "Cannot find the mouth movement data corresponding to the 'faucet' you entered. Manual service has been requested for you in the background, please wait.".
  • App1 can establish a video connection with a human customer service agent for the user. After the connection is established, the agent can show the user the mouth movements of the keyword whose data could not be obtained. Alternatively, the agent can supplement the mouth movement data of that keyword in the background and send it to App1; after App1 obtains the mouth movement data, it displays the mouth movements to the electronic device user.
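The keyword checks described above can be sketched as a small classification step: each keyword is either usable, present in the text but without mouth movement data (the case that triggers the manual-service prompt), or absent from the text. The handling of keywords that do not appear in the text is an assumption, since the description only covers keywords that do; all names here are hypothetical.

```python
def check_keywords(text, keywords, mouth_data):
    """Sort user-entered keywords by whether they can receive oral movements."""
    result = {"ok": [], "no_mouth_data": [], "not_in_text": []}
    for keyword in keywords:
        if keyword not in text:
            result["not_in_text"].append(keyword)    # assumed case, not in the description
        elif keyword not in mouth_data:
            result["no_mouth_data"].append(keyword)  # would trigger the manual-service prompt
        else:
            result["ok"].append(keyword)
    return result

text = "Please turn off the faucet after using up water."
# Assumed lookup table that has data for "turn off" but not for "faucet".
mouth_table = {"turn off": "mouth-turn-off"}
print(check_keywords(text, ["turn off", "faucet"], mouth_table))
```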
  • the electronic device user inputs the entire content of the text information to be translated according to the above prompt information.
  • App1 detects that there are many keywords that require additional mouth movement data.
  • App1 can issue a prompt message to remind the user: "There are currently many keywords that require additional mouth movement data. You can re-enter the keywords that require additional mouth movement data."
  • If the user of the electronic device does not enter any keywords in response to the above prompt information, and App1 detects that no keywords have been entered by the user within the preset time period, App1 can analyze the content of the text information input by the user and issue a prompt message containing recommended keywords for additional oral movements.
  • the keywords for recommending additional oral actions can be determined based on the parts of speech of Chinese vocabulary as shown in Table 1, such as proper nouns, time words, etc.
  • the recommended keywords for additional oral movements can also be determined based on user habits. For example, the user has used "John” as a keyword to append oral movements multiple times in the translation query history in App1. Then, when App1 obtains that the user input data to be translated also contains "John”, "John” can be used as a keyword to recommend additional oral actions.
  • the subject and object in the text information input by the user to be translated can be determined as keywords that recommend additional oral movements.
  • The keywords for recommending additional oral movements may also be determined based on how other users determined them. For example, for the same video, 80% of users identified "cinema" and "playground" as keywords that require additional oral movements. When the user inputs the same video and App1 does not obtain keywords requiring additional oral movements from the user, App1 can use "cinema" and "playground" as recommended keywords for additional oral movements.
  • App1 can also send a prompt message asking the user whether to also add mouth movement data for "playground".
  • The electronic device user confirms adding mouth movement data for "playground".
  • App1 then uses "cinema" and "playground" as keywords that require additional oral movements.
  • App1 when App1 obtains the keywords confirmed by the user, App1 will also display prompt information, which is used to prompt the updated keywords.
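The three recommendation sources described above (part of speech, the user's own history, and other users' choices) could be combined roughly as in the sketch below. The tag names, the `min_history` threshold, and the function name are assumptions made for illustration; only the 80% ratio comes from the example in the text.

```python
from collections import Counter

# Assumed tag names; the actual tags are those listed in Table 1.
RECOMMENDED_POS = {"proper_noun", "time_word"}


def recommend_keywords(tagged_words, user_history, other_user_choices,
                       min_history=2, min_ratio=0.8):
    """Return words recommended for additional oral movements.

    tagged_words       -- list of (word, part_of_speech) pairs
    user_history       -- keywords this user chose in earlier translations
    other_user_choices -- one set of keywords per other user, same input
    """
    history_counts = Counter(user_history)
    peer_counts = Counter(w for choice in other_user_choices for w in set(choice))
    n_users = len(other_user_choices)

    recommended = []
    for word, pos in tagged_words:
        by_pos = pos in RECOMMENDED_POS
        by_history = history_counts[word] >= min_history
        by_peers = n_users > 0 and peer_counts[word] / n_users >= min_ratio
        if (by_pos or by_history or by_peers) and word not in recommended:
            recommended.append(word)
    return recommended


tagged = [("John", "proper_noun"), ("went", "verb"), ("cinema", "noun"),
          ("playground", "noun"), ("yesterday", "time_word")]
others = [{"cinema", "playground"}] * 4 + [{"John"}]  # 4 of 5 users = 80%
suggestions = recommend_keywords(tagged, ["John", "John"], others)
```

The user can still accept, reject, or extend the suggested list, as in the "playground" example above.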
  • App1 After App1 obtains the hand movement data and mouth movement data corresponding to the text information based on the user's input, App1 displays the translation result interface as shown in Figure 6.
  • the translation result interface may include processing prompt information 630, which is used to prompt that the translation of the text information has been completed.
  • the prompt information is also used to prompt the electronic device user how to use the translation result.
  • the translation result interface may also include an overall display area 611, which is used to display the overall situation of the virtual character when signing.
  • The overall display area is used to display the hand movements of the text information to be translated and the mouth movements of the keywords to which oral movement data is attached.
  • the translation result interface may also include a hand movement display area 613, which is used to display the details of the hand movement of the text information input by the user to be translated.
  • the hand movement display area may include auxiliary lines and/or auxiliary text, and the auxiliary lines and/or auxiliary text are used to help the user understand sign language details such as finger movement trajectories.
  • the above-mentioned overall display area 611, mouth movement display area 612, hand movement display area 613, text status display area 614 and prompt information 630 constitute a translation result interface.
  • the above-mentioned overall display area 611, mouth movement display area 612, hand movement display area 613 and text status display area 614 constitute the translation result display area 610 of the translation result interface, and the translation result display area 610 is Part of the translation results interface.
  • the translation result display area 610 may also include prompt information 630 and an input area 620.
  • The input area is used to display the text information to be translated that has been input by the user, the input prompt information issued by App1, the keywords for additional oral movements that the user has input, and so on.
  • the electronic device user can also re-enter keywords that require additional oral action data in the input area.
  • App1 obtains the oral movement data corresponding to the keywords re-entered by the user, and updates the overall display area, mouth movement display area, hand movement display area and text status display area in the translation result display area.
  • the electronic device user triggers App1 to display the functional options of the translation result display area by clicking, double-clicking or long-pressing a blank space in the translation result display area.
  • The electronic device user triggers App1 to display the function options of these areas by clicking, double-clicking or long-pressing the overall display area, the mouth movement display area, the hand movement display area, or the text status display area.
  • the above-mentioned function tabs may include one or more of the following functions: “View in full screen”, “Play at double speed”, “Insert into audio/video”, “Hide”, “Save” or “Share”, etc.
  • App1 displays the overall display area, mouth movement display area, hand movement display area, or text status display area in full screen.
  • When the electronic device user selects the "double speed playback" function option, App1 displays a playback rate adjustment function window in response to the user's operation, and the user can select or input the playback rate to be set in that window. After obtaining the playback rate selected or input by the user, App1 plays the content contained in the overall display area, mouth movement display area, hand movement display area, or text status display area at the corresponding rate (slow or fast).
  • When the electronic device user selects the "insert into audio/video" function option, in response to the user's operation, App1 inserts one or more of the overall display area, the mouth movement display area, the hand movement display area, or the text status display area into the corresponding audio or video. Optionally, after inserting any of the above areas into an audio file, App1 can save the modified audio file in the format of a video file.
  • When the electronic device user selects the "hide" function option, in response to the user's operation, App1 hides the overall display area, the mouth movement display area, the hand movement display area, or the text status display area.
  • the function options corresponding to the area may include the "Show” function option.
  • App1 displays the hidden area in response to the user's operation.
  • the oral movement display area can be hidden by default.
  • When the electronic device user selects the "save" function option, App1 saves the data corresponding to the area selected by the user in response to the user's operation.
  • App1 can also display a save prompt window.
  • the save prompt window is used to prompt the user whether to save data corresponding to other related areas at the same time.
  • the save prompt window is also used to obtain the user's instruction information. For example, when the user chooses to save data corresponding to other related areas at the same time, in response to the user's operation, App1 saves both the data corresponding to the user-selected area and the data corresponding to the related areas locally on the electronic device.
  • App1 displays a prompt message: "Do you want to save the data of the mouth movement display area, hand movement display area, and text status display area at the same time?"
  • App1 simultaneously saves data corresponding to the overall display area and the mouth movement display area in response to the user's selection.
  • When the electronic device user selects the "share" function option, App1 displays the sharing function control in response to the user's operation, and the sharing function control includes one or more sharing channels. The electronic device user can select one or more sharing channels. In response to the user's selection, App1 shares the data corresponding to the area selected by the user through the one or more sharing channels selected by the user.
  • App1 can also display a sharing prompt window in response to the user's operation.
  • the sharing prompt window is used to prompt the user whether to share data corresponding to other related areas at the same time.
  • The sharing prompt window is also used to obtain instructions from the user. For example, when the user chooses to share data corresponding to other related areas at the same time, in response to the user's operation, App1 uses the data corresponding to the area selected by the user and the data corresponding to the related areas as data to be shared.
  • App1 displays a prompt message: "Do you want to share the data of the mouth movement display area, hand movement display area, and text status display area at the same time?"
  • App1 simultaneously shares data corresponding to the overall display area and the mouth movement display area.
  • the user of the electronic device can open it again for viewing, sharing, editing, etc.
  • Figure 8 shows the interface of the resource library.
  • the resource library is used to classify, arrange and display data corresponding to different display areas saved locally on the electronic device according to certain rules.
  • the above rules include classification rules and arrangement rules.
  • the classification rules may include any of the following rules: area (overall display area, mouth movement display area or hand movement display area, etc.), time (time saved locally on the electronic device, for example: today, yesterday, One week ago, etc.) or source (for example: from the current electronic device, from electronic devices of the same account, or from home electronic devices, etc.), etc.
  • The arrangement rules may include any of the following rules: time (for example, from oldest to most recent or from most recent to oldest), the text information contained in the data (for example, alphabetical order of the text information), or the order of the keywords with additional oral movements (for example, the stroke order of the first character of the keyword).
  • the electronic device user can select the "classification method” function option 801 of the resource library to set different classification methods for the data stored locally.
  • the electronic device user can also select the "arrangement” function option 802 of the resource library to set different arrangements for the data stored locally.
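The classification and arrangement rules of the resource library can be illustrated with a short sketch. The `SavedItem` record and the `classify_and_arrange` function are hypothetical names introduced here; the sketch groups saved items by one classification rule (area, source, etc.) and arranges each group by save time.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class SavedItem:
    area: str            # e.g. "overall", "mouth", "hand"
    source: str          # e.g. "this device", "same account"
    saved_at: datetime   # time the item was saved locally
    text: str            # text information contained in the data


def classify_and_arrange(items, classify_by="area", newest_first=True):
    """Group items by one attribute, then sort each group by save time."""
    groups = {}
    for item in items:
        groups.setdefault(getattr(item, classify_by), []).append(item)
    for members in groups.values():
        members.sort(key=lambda item: item.saved_at, reverse=newest_first)
    return groups


items = [
    SavedItem("overall", "this device", datetime(2023, 4, 1), "hello"),
    SavedItem("mouth", "same account", datetime(2023, 4, 3), "faucet"),
    SavedItem("overall", "this device", datetime(2023, 4, 2), "water"),
]
library = classify_and_arrange(items, classify_by="area")
```

Switching `classify_by` to `"source"` or changing the sort key to `item.text` would model the other rules listed above.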
  • The resource library also includes a search box 805 in which the user of the electronic device can enter characters, words, time, area, source and other content to quickly find the corresponding data.
  • the resource library also includes a "Recycle Bin” function option 803, and the electronic device user can select the "Recycle Bin” function option to view the data that has been stored in the "Recycle Bin”.
  • The "Recycle Bin" is used to temporarily store data deleted by the user. Data that has not been restored by the user after a preset period of time, or data whose deletion the user has confirmed in the "Recycle Bin", will be erased from the storage medium of the electronic device by App1.
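As a rough sketch of the "Recycle Bin" behaviour, the function below erases items that have exceeded a preset retention period or whose deletion the user has confirmed. The 30-day default and all names are assumptions; the text only says "a preset period of time".

```python
from datetime import datetime, timedelta


def purge_recycle_bin(bin_items, now, retention=timedelta(days=30), confirmed=()):
    """Split recycle-bin items into those kept and those to erase.

    bin_items -- dict mapping item name to the time it was deleted
    confirmed -- names whose deletion the user has explicitly confirmed
    """
    kept, erased = {}, []
    for name, deleted_at in bin_items.items():
        if name in confirmed or now - deleted_at >= retention:
            erased.append(name)      # removed from the storage medium
        else:
            kept[name] = deleted_at  # still restorable by the user
    return kept, erased


now = datetime(2023, 5, 1)
bin_items = {
    "a": datetime(2023, 3, 1),   # older than the retention period
    "b": datetime(2023, 4, 25),  # recent, not confirmed
    "c": datetime(2023, 4, 28),  # recent, but deletion confirmed
}
kept, erased = purge_recycle_bin(bin_items, now, confirmed={"c"})
```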
  • the resource library also includes a "share" function option 804, which an electronic device user can select to share one or more data in the resource library.
  • the electronic device When the electronic device user selects any data to open in the resource library, in response to the user's operation, the electronic device can display a playback interface as shown in Figure 9.
  • The playback interface may include one or more of an overall display area, a mouth movement display area, a hand movement display area, and a text status display area.
  • the option functions corresponding to the areas shown in Figure 6 can also be opened.
  • For the triggering methods and specific functions of the option functions, refer to the relevant descriptions of Figure 6. To avoid repetition, they are not described again here.
  • the playback interface may include a playback function control 901.
  • the playback function control may control the start and stop of data playback.
  • the playback function control may also view the progress of the current data playback.
  • the playback function control may also include a prompt control 902 with a keyword attached to the oral movement data.
  • the electronic device user can directly view the oral movement of the keyword by selecting (for example, clicking) the prompt control.
  • the playback interface may include a "share” function option 903, and the electronic device user may select the "share" function option to share one or more types of data being played in the playback interface.
  • The sharing process can be triggered through the sharing function as shown in Figure 6, through the sharing function in the resource library interface as shown in Figure 8, through the sharing function of the playback interface in Figure 9, or through other methods, and this application does not impose restrictions on this.
  • the sharing selection prompt information 1001 is used to prompt information about the currently selected data to be shared.
  • the sharing selection prompt information may include the quantity of data to be shared.
  • the sharing selection prompt information may also include the types included in the data to be shared.
  • the sharing selection prompt information may be displayed: 11 items have been selected, including: data corresponding to the hand movement display area (manual), data corresponding to the mouth movement display area (mouth movement), and data corresponding to the text status display area (text).
  • the shared data preview area may also include a function check box 1004.
  • the electronic device user can select data to be shared or deselect data to be shared by clicking the function check box.
  • the sharing channel selection window 1003 is used to display one or more available sharing channels, and the sharing channel window is also used to obtain one or more sharing channels selected by the electronic device user.
  • the sharing methods can include: Bluetooth sharing, uploading to cloud disk, or sending via email.
  • This setting function option can include the "automatic keyword recognition and conversion" function option. Electronic device users can use this function option to turn on or off automatic recognition and conversion of the keywords in the text, video, and audio data input to App1 during the input process.
  • The keywords refer to keywords that require additional mouth movement data.
  • The setting function option may also include a "keyword auto-correction" function option, through which the electronic device user can turn on or off App1's prompting for and/or automatic correction of errors in the keywords entered by the user during the input process.
  • The setting function options may also include a "translation acceleration function" function option. Electronic device users can use this function option to turn on a function that improves the efficiency of text information translation. Details of how to improve the efficiency of text information translation are introduced in the following embodiments.
  • The setting function options may also include a "result display content" function option, through which electronic device users can select the content to be displayed on the translation results display interface. For example, if the electronic device user selects "hand movements" and "mouth movements" in this function option, then in the interface shown in Figure 6, the overall display area and the text status display area are not displayed by default, and the hand movement display area and mouth movement display area are displayed by default.
  • This setting function option may also include a "resource library default classification method" function option, through which the electronic device user can select the default classification method in the resource library for different data that the user saves locally on the electronic device.
  • This setting function option may also include a "resource library default sorting method" function option, through which the electronic device user can select the default sorting method in the resource library for different data saved locally on the electronic device.
  • the above describes the Chinese translation method provided by the embodiment of the present application from the perspective of an electronic device user.
  • the following describes the Chinese translation method provided by the embodiment of the present application and the implementation process within the electronic device with reference to FIG. 12 .
  • the electronic device obtains the text information to be translated.
  • the text information to be translated may be directly input to the electronic device by the user of the electronic device, or may be recognized by the electronic device based on data such as text, pictures, audio, or video input by the user.
  • the electronic device also obtains keywords that require additional mouth action data.
  • the electronic device sends a translation request to the server, and accordingly, the server receives the translation request.
  • This translation request is used to request the hand movement data corresponding to the text information to be translated.
  • If the electronic device also obtains a keyword that requires additional oral movement data in S1201, the translation request is also used to request the oral movement data corresponding to the keyword.
  • the translation request is used to request to obtain oral action data corresponding to a keyword that requires additional oral action data.
  • the server sends hand movement data and/or mouth movement data, and accordingly, the electronic device receives the hand movement data and/or mouth movement data.
  • the server determines to send the hand movement data and/or mouth movement data to the electronic device according to the content of the translation request message received in S1202.
  • Before sending the above hand movement data and/or mouth movement data to the electronic device, the server first performs a text risk control check on the text information to be translated requested by the electronic device.
  • the text risk control check is used to check whether the text information to be translated contains sensitive information, so as to filter out bad text information.
  • the server directly sends the above-mentioned hand movement data and/or mouth movement data to the electronic device after determining that the text information to be translated passes the text risk control check.
  • the server sends instruction information to the electronic device, and the instruction information is used to indicate that the text information to be translated passes the text risk control check.
  • After receiving the instruction information, the electronic device sends a text-to-sign language request corresponding to the text that passed the text risk control check to the server. After receiving the text-to-sign language request, the server sends the above-mentioned hand movement data and/or mouth movement data to the electronic device.
  • the server if the server determines that the text information to be translated does not pass the text risk control check, the server sends indication information to the electronic device, and the indication information is used to indicate that the text information to be translated does not pass the text risk control check.
  • the server can determine the hand movement data corresponding to the text information to be translated from the hand movement database, and send the hand movement data to the electronic device.
  • the server can also determine the oral action data corresponding to the keyword from the oral action database, and send the oral action data to the electronic device.
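The server-side handling of a translation request (risk control check first, then database lookups) might look like the sketch below. Everything here is hypothetical: the placeholder term list, the dictionary-based databases, and the response fields are stand-ins chosen for illustration, and the real risk control check is not specified in the text.

```python
# Placeholder list; the actual sensitive-information rules are not specified.
SENSITIVE_TERMS = {"<sensitive term>"}


def passes_risk_check(text):
    """Return True if the text contains none of the placeholder terms."""
    return not any(term in text for term in SENSITIVE_TERMS)


def handle_translation_request(text, keywords, hand_db, mouth_db):
    """Answer a translation request from the electronic device."""
    if not passes_risk_check(text):
        # Indication that the text did not pass the risk control check.
        return {"passed": False}
    response = {"passed": True, "hand": hand_db.get(text)}
    if keywords:
        # Oral movement data is returned only for the requested keywords.
        response["mouth"] = {kw: mouth_db[kw] for kw in keywords if kw in mouth_db}
    return response


hand_db = {"turn off the faucet": "<hand movement data>"}
mouth_db = {"faucet": "<oral movement data>"}
ok = handle_translation_request("turn off the faucet", ["faucet"], hand_db, mouth_db)
blocked = handle_translation_request("a <sensitive term> here", [], hand_db, mouth_db)
```

The two-step variant described above, in which the server first returns only the check result and the device then sends a separate text-to-sign-language request, would split this function into two calls.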
  • The server includes a part-of-speech tagging module and an oral movement database.
  • the part-of-speech tagging module is used to tag each word in the text information received from the electronic device with a part-of-speech tag.
  • the specific meaning of the part-of-speech tag is shown in Table 1.
  • The mouth movement database is used to store blend shape values corresponding to the mouth shapes of Chinese Pinyin. The blend shape values can be used to display mouth movements corresponding to keywords.
  • Table 2 shows the Chinese pinyin of the mouth movements video that needs to be recorded during the creation of the oral movement database.
  • the oral movements of different Chinese characters are determined based on their corresponding Chinese pinyin.
  • the mouth shape videos are then converted into data that can drive the mouth movements of the virtual character.
  • The server can call the mouth shape generation algorithm to obtain the data converted from the mouth shape video of the Chinese Pinyin pronunciation corresponding to the keyword, and send the data to the electronic device, so that the electronic device can use the obtained data to drive the virtual character to make corresponding mouth movements.
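A pinyin-keyed lookup of the kind described above can be sketched as follows. The two tables and their numeric values are invented placeholders, not data from the application; the sketch only shows how a keyword's characters could be mapped to pinyin and then to stored shape values.

```python
# Hypothetical character-to-pinyin table (tones omitted for brevity).
CHAR_TO_PINYIN = {"水": "shui", "龙": "long", "头": "tou"}

# Hypothetical shape values per pinyin syllable; the numbers are placeholders.
PINYIN_TO_SHAPES = {
    "shui": [{"jaw_open": 0.2, "lips_round": 0.8}],
    "long": [{"jaw_open": 0.5, "lips_round": 0.6}],
    "tou":  [{"jaw_open": 0.4, "lips_round": 0.7}],
}


def mouth_data_for_keyword(keyword):
    """Concatenate the stored shape frames for each character's pinyin.

    Returns None when any character has no pinyin entry or no recorded
    mouth shape, which corresponds to the "cannot obtain" case.
    """
    frames = []
    for ch in keyword:
        pinyin = CHAR_TO_PINYIN.get(ch)
        if pinyin is None or pinyin not in PINYIN_TO_SHAPES:
            return None
        frames.extend(PINYIN_TO_SHAPES[pinyin])
    return frames


faucet_frames = mouth_data_for_keyword("水龙头")  # "faucet"
unknown = mouth_data_for_keyword("猫")            # no entry in the tables
```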
  • the server sends the above-mentioned oral movement data and hand movement data together to the electronic device.
  • the server sends hand movement data in different time frames in sequence according to the order of hand movements.
  • the server sends oral action data in different time frames in sequence according to the order of the oral actions.
  • If the hand movement data and oral movement data have the same timestamp, the server sends the hand movement data and oral movement data of different time frames in segments, according to the order of the hand movements or oral movements.
  • the electronic device drives the virtual character model to display the hand movements and/or the mouth movements of the keywords corresponding to the text to be translated based on the hand movement data and/or mouth movement data received in S1203.
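One way to picture the frame-by-frame delivery is to pair hand and mouth frames by timestamp and emit them in time order. This is a sketch under the assumption that each stream is a mapping from timestamp to frame data; the function name `packetize` is hypothetical.

```python
def packetize(hand_frames, mouth_frames):
    """Yield (timestamp, hand_frame, mouth_frame) tuples in time order.

    hand_frames / mouth_frames -- dicts mapping a timestamp to frame data.
    A frame that exists in only one stream is paired with None, since
    mouth movements are attached to keywords only.
    """
    for ts in sorted(set(hand_frames) | set(mouth_frames)):
        yield ts, hand_frames.get(ts), mouth_frames.get(ts)


hand = {0: "h0", 40: "h1", 80: "h2"}   # timestamps in milliseconds (assumed)
mouth = {40: "m1"}                     # keyword spoken during the second frame
packets = list(packetize(hand, mouth))
```

On the receiving side, the electronic device would apply each packet's hand and mouth data to the virtual character model at the same moment.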
  • the embodiment of the present application also provides a Chinese translation device 1300.
  • the Chinese translation device 1300 includes an acquisition unit 1310 and a processing unit 1320.
  • the acquisition unit is used to acquire the data as shown in Figure 3 to Figure 13.
  • The processing unit is used to perform processing operations performed by the electronic device in the embodiments shown in Figure 3 to Figure 11, such as obtaining the corresponding hand movement data according to the text information input by the user.
  • the Chinese translation device may also include a communication unit 1330, which is used to perform communication and data transmission operations with the server performed by the electronic device in the embodiments shown in Figures 3 to 11.
  • the embodiment of the present application also provides another Chinese translation device 1400.
  • The Chinese translation device 1400 includes a processing unit 1410 and a communication unit 1420.
  • The processing unit is used to perform a text risk control check on the text information to be translated sent by the electronic device.
  • The communication unit is used to perform the communication and data transmission operations between the server and the electronic device in the embodiments shown in Figures 3 to 11.
  • the Chinese translation device may also include a storage unit 1430, which is used to store one or more computer programs, hand movement data, oral movement data, etc.
  • the embodiment of the present application also provides an electronic device 1500.
  • the electronic device includes a processor 1510 and a memory 1520.
  • The processor is used to execute the processing operations performed by the electronic device in the embodiments shown in Figures 3 to 11, such as obtaining corresponding hand movement data based on text information input by the user.
  • One or more computer programs are stored in the memory.
  • The one or more computer programs include instructions. When the instructions are executed by one or more processors, any of the Chinese translation methods mentioned above is executed.
  • the embodiment of the present application also provides a server 1600.
  • the server includes a processor 1610 and a memory 1620.
  • the processor is used to perform text risk control operations on text information to be translated sent by the electronic device.
  • the memory stores one or more computer programs, hand movement data, oral movement data, etc.
  • The one or more computer programs include instructions. When the instructions are executed by one or more processors, any of the above Chinese translation methods is implemented.
  • Embodiments of the present application also provide a computer program product.
  • the computer program product includes computer program code.
  • the computer program code When the computer program code is run on a computer, it causes the computer to implement the methods in the embodiments shown in FIGS. 3 to 12 .
  • Embodiments of the present application also provide a computer-readable storage medium.
  • The computer-readable medium stores computer instructions. When the computer instructions are run on the computer, the computer implements the methods in the embodiments shown in Figures 3 to 12.
  • An embodiment of the present application also provides a chip, including a processor for reading instructions stored in a memory.
  • When the processor executes the instructions, the chip implements the methods in the embodiments shown in Figures 3 to 12.
  • the disclosed systems, devices and methods can be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the units is only a logical function division. In actual implementation, there may be other division methods.
  • multiple units or components may be combined or can be integrated into another system, or some features can be ignored, or not implemented.
  • the coupling or direct coupling or communication connection between each other shown or discussed may be through some interfaces, and the indirect coupling or communication connection of the devices or units may be in electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physical units, that is, they may be located in one place, or they may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • Each functional unit in each embodiment of the present application can be integrated into one processing unit, or each unit can exist physically alone, or two or more units can be integrated into one unit.
  • If the functions are implemented in the form of software functional units and sold or used as independent products, they can be stored in a computer-readable storage medium.
  • The technical solution of the present application, in essence, or the part that contributes to the existing technology, or a part of the technical solution, can be embodied in the form of a software product.
  • The computer software product is stored in a storage medium and includes several instructions used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in various embodiments of this application.
  • The aforementioned storage media include media that can store program code, such as a U disk, a mobile hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disk.

Abstract

Provided in the present application are a Chinese translation method and an electronic device. The method comprises: in response to an input of a user, an electronic device acquiring text information, wherein the text information comprises a keyword; the electronic device displaying a hand action corresponding to the text information; and the electronic device displaying a mouth action corresponding to the keyword. By means of the translation method and the electronic device provided in the present application, a mouth action compatible with the habit of a sign language user is added when text information is translated into sign language, thereby facilitating an improvement in the accuracy of language expressions when Chinese is translated into sign language, a reduction in misunderstanding of a translation result by the sign language user, an enhancement in exchange and communication with the sign language user, and an improvement in the application experience for users of an electronic device.

Description

汉语翻译的方法和电子设备Methods and electronic devices for Chinese translation
本申请要求于2022年04月15日提交中国专利局、申请号为202210396448.4、发明名称为“汉语翻译的方法和电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application filed with the China Patent Office on April 15, 2022, with the application number 202210396448.4 and the invention title "Method and Electronic Device for Chinese Translation", the entire content of which is incorporated into this application by reference. .
技术领域Technical field
本申请涉及计算机领域,具体的,涉及一种汉语翻译的方法和电子设备。The present application relates to the field of computers, specifically, to a Chinese translation method and electronic equipment.
背景技术Background technique
手语数字人(digital human)可以通过手部动作和/或口部动作来帮助手语使用者理解语言信息。A sign language digital human can help sign language users understand language information through hand movements and/or oral movements.
当手语数字人在做手部动作的同时配合相应的口部动作来表达某句话或某个含义时,口部动作有时并不能起到辅助手部动作帮助理解的目的,反而有可能造成不必要的误解。例如,在做自然手语的手部动作时,配合听力无障碍人士说话的口部动作,由于手部动作与口部动作在同一时刻表达的可能并不是同一个词语,这样便会造成误解。When a digital signer makes hand movements and coordinates corresponding oral movements to express a certain sentence or a certain meaning, the oral movements sometimes do not serve the purpose of assisting the hand movements to help understanding, but may instead cause inaccuracies. Necessary misunderstanding. For example, when making hand movements in natural sign language and matching the oral movements of hearing-impaired people when speaking, since the hand movements and oral movements may not express the same word at the same time, this may cause misunderstandings.
Summary
This application provides a Chinese translation method in which mouth actions are added only for keywords when text information is translated into sign language. This helps improve the accuracy of language expression when Chinese is translated into sign language.
According to a first aspect, a Chinese translation method is provided, including: in response to an input of a user, an electronic device obtains text information, where the text information includes a keyword; the electronic device displays a hand action corresponding to the text information; and the electronic device displays a mouth action corresponding to the keyword.
In a possible implementation, the electronic device identifies the keyword from the text information input by the user on the basis of one or more of the following: the content of the text information, translation history information of the user, or the way other users have determined keywords in the text information.
It should be noted that the keyword here may be one or more characters included in the text information, or one or more words included in the text information.
It should also be noted that the order in which a sign language user expresses words when signing may differ from the word order of natural spoken language. The order in which the electronic device displays the hand actions corresponding to the text information may be determined according to the habits of sign language users.
Because corresponding mouth actions are added only for keywords during signing, this technical solution translates text information into sign language that better matches the expression habits of sign language users. This helps improve the accuracy of the translation result, reduce the probability that sign language users misunderstand the translated sign language, and enhance communication with sign language users.
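The core idea of the first aspect can be illustrated with a minimal Python sketch. It assumes that the text has already been segmented into words and reordered into sign language order; the names `SignFrame` and `plan_sign_output` and the string identifiers such as `hand:北京` are hypothetical and do not come from the application.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class SignFrame:
    word: str
    hand_action: str             # hypothetical identifier of the hand action to display
    mouth_action: Optional[str]  # None means no mouth action is displayed


def plan_sign_output(words: List[str], keywords: Set[str]) -> List[SignFrame]:
    """Give every word a hand action, but a mouth action only if it is a keyword."""
    frames = []
    for word in words:
        mouth = f"mouth:{word}" if word in keywords else None
        frames.append(SignFrame(word, f"hand:{word}", mouth))
    return frames


# Sample sentence "我 去 北京" with the place name "北京" treated as the keyword.
frames = plan_sign_output(["我", "去", "北京"], {"北京"})
```

Pairing each hand action with an optional mouth action in the same record is one simple way to keep the two synchronized for keywords.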
With reference to the first aspect, in some implementations of the first aspect, the keyword is determined according to the language habits of sign language users.
Because corresponding mouth actions are added during signing for keywords determined according to the language habits of sign language users, this technical solution translates text information into sign language that better matches the expression habits of sign language users. This helps improve the accuracy of the translation result, reduce the probability that sign language users misunderstand the translated sign language, and enhance communication with sign language users.
With reference to the first aspect, in some implementations of the first aspect, the electronic device does not display a mouth action corresponding to an ordinary word, where the text information includes the ordinary word and the ordinary word is different from the keyword.
In this technical solution, no mouth action is displayed for ordinary words that are not keywords. This helps reduce data transmission during translation, improve the efficiency of data transmission and processing during translation, and improve the application experience of the user of the electronic device.
With reference to the first aspect, in some implementations of the first aspect, the electronic device displays the mouth action while displaying the hand action corresponding to the keyword.
In this technical solution, the mouth action corresponding to the keyword is displayed while the hand action corresponding to the keyword is performed. This helps ensure the correspondence between the hand action and the mouth action, further improve the accuracy of the translation result, and improve sign language users' understanding of the translated sign language.
With reference to the first aspect, in some implementations of the first aspect, the keyword is a proper noun.
The proper noun may include one or more of the following: a person's name, a place name, the name of an institution or organization, the title of a work, or another proper noun.
Adding mouth actions for proper nouns helps sign language users understand proper nouns that are otherwise difficult to understand, and helps enhance communication with sign language users.
With reference to the first aspect, in some implementations of the first aspect, before displaying the mouth action corresponding to the keyword, the electronic device displays a first word, where the text information includes the first word and the first word is a word for which adding a mouth action is recommended; and in response to a confirmation operation of the user, the electronic device determines that the first word is the keyword.
In this technical solution, keywords are recommended to the user of the electronic device, and mouth actions are added for the recommended keywords after the user confirms them. This helps improve sign language learners' understanding of how sign language is used, improve the application experience of the user of the electronic device, and improve the efficiency with which sign language learners learn sign language.
With reference to the first aspect, in some implementations of the first aspect, before displaying the mouth action corresponding to the keyword, in response to a first input of the user, the electronic device obtains a second word, where the second word is a word for which the user requests that a mouth action be added; when the text information includes the second word, the electronic device determines that the second word is the keyword; when the text information does not include the second word, the electronic device displays update request information, where the update request information is used to indicate that the text information does not include the second word; and in response to a second input of the user, the electronic device obtains an updated second word.
In this technical solution, the word that the user inputs and for which the user requests a mouth action is checked against the text information, and the user is notified of whether the text information includes that word. This helps improve the efficiency of translating Chinese into sign language, the accuracy of translating the text information, and the user's application experience.
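The check on a user-requested word can be sketched as follows. This is an illustration under the assumption that a simple substring test is sufficient; the function name `resolve_requested_keyword` and the wording of the prompt are hypothetical.

```python
from typing import Optional, Tuple


def resolve_requested_keyword(text: str, requested: str) -> Tuple[Optional[str], Optional[str]]:
    """Return (keyword, update_request).

    If the text contains the requested word, the word becomes the keyword and no
    prompt is needed. Otherwise no keyword is set and a prompt asks the user to
    enter an updated word.
    """
    if requested and requested in text:
        return requested, None
    prompt = f'The text does not contain "{requested}". Please enter another word.'
    return None, prompt
```

A caller would repeat the call with the user's second input until a keyword is returned or the user cancels.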
With reference to the first aspect, in some implementations of the first aspect, the first word is determined according to the translation history of the user, where the translation history includes a second word input by the user, and the second word is a word for which the user requests that a mouth action be added.
The second word included in the translation history can, to some extent, reflect the language habits and application usage habits of the user of the electronic device. In this technical solution, words for which mouth actions may be added are recommended to the user according to the user's history of requesting mouth actions. This helps determine the Chinese translation result according to the user's habits, improve the translation effect, and improve the application experience of the user of the electronic device.
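One possible way to derive recommended first words from the translation history is sketched below. The application does not specify a ranking rule; ranking by request frequency, the function name `recommend_first_words`, and the `limit` parameter are assumptions made for illustration.

```python
from collections import Counter
from typing import Iterable, List


def recommend_first_words(text: str, history: Iterable[str], limit: int = 3) -> List[str]:
    """Recommend words from the user's past mouth-action requests that occur in the text.

    Words requested more often come first; ties are broken by position in the text.
    """
    counts = Counter(history)
    candidates = [word for word in counts if word in text]
    candidates.sort(key=lambda word: (-counts[word], text.index(word)))
    return candidates[:limit]
```

The recommended words would then be shown to the user, and only those the user confirms would be treated as keywords.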
With reference to the first aspect, in some implementations of the first aspect, the mouth action is determined according to the mouth shape used to pronounce the Chinese pinyin of the keyword.
With reference to the first aspect, in some implementations of the first aspect, the blend shape values corresponding to the pronunciation mouth shape are stored in a mouth action database.
With the mouth action database established, when a mouth action needs to be displayed, the electronic device sends a request message to the server, and the server retrieves the required mouth action data from the database. Compared with obtaining mouth action data through solutions such as deep learning, this helps simplify the process of obtaining mouth action data, improve translation efficiency, and improve the application experience on the electronic device.
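A mouth action database of this kind can be pictured as a lookup table from pinyin syllables to blend shape values. The sketch below is illustrative only: the blend shape names (`jaw_open`, `lips_wide`) and all numeric values are placeholders, not data from the application, and the conversion of a keyword to pinyin is assumed to happen elsewhere.

```python
from typing import Dict, List

BlendShapes = Dict[str, float]

# Placeholder entries for illustration only; real values would come from the database.
MOUTH_ACTION_DB: Dict[str, BlendShapes] = {
    "bei": {"jaw_open": 0.3, "lips_wide": 0.6},
    "jing": {"jaw_open": 0.2, "lips_wide": 0.7},
}


def lookup_mouth_actions(pinyin_syllables: List[str]) -> List[BlendShapes]:
    """Return the stored blend shape values for each syllable, in order."""
    actions = []
    for syllable in pinyin_syllables:
        if syllable not in MOUTH_ACTION_DB:
            raise KeyError(f"no mouth action stored for syllable {syllable!r}")
        actions.append(MOUTH_ACTION_DB[syllable])
    return actions
```

Because each syllable is resolved by a direct lookup, no model inference is needed at request time, which is the simplification the paragraph above describes.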
With reference to the first aspect, in some implementations of the first aspect, the hand action includes a first hand action and a second hand action, where the first hand action precedes the second hand action; the electronic device receives first hand action data from a server, where the first hand action data is used to display the first hand action; and while displaying the first hand action, the electronic device receives second hand action data from the server, where the second hand action data is used to display the second hand action.
It should be noted that the first hand action or the second hand action here may be a complete action, or may be the portion of an action contained in one or more frames of that action.
In this technical solution, the electronic device first receives the hand action data that needs to be displayed first and, while displaying that hand action, receives the hand action data to be displayed later. Transmitting the hand action data in fragments and displaying while transmitting helps shorten the waiting time caused by data transmission and improve the user's application experience.
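The display-while-receiving scheme can be sketched with a receiver thread and a queue. This is a simplified illustration, not the application's implementation: the iterable `source` stands in for the network connection to the server, and appending to a list stands in for rendering a fragment.

```python
import queue
import threading
from typing import Iterable, List

_END = object()  # sentinel marking the end of the stream


def receive_chunks(source: Iterable[str], buffer: "queue.Queue") -> None:
    """Receiver side: push each fragment into the buffer as soon as it arrives."""
    try:
        for chunk in source:
            buffer.put(chunk)
    finally:
        buffer.put(_END)


def play_while_receiving(source: Iterable[str]) -> List[str]:
    """Display fragments in order while later fragments are still being received."""
    buffer: "queue.Queue" = queue.Queue()
    receiver = threading.Thread(target=receive_chunks, args=(source, buffer))
    receiver.start()
    displayed = []
    while True:
        chunk = buffer.get()     # blocks only until the next fragment is available
        if chunk is _END:
            break
        displayed.append(chunk)  # stand-in for rendering the fragment
    receiver.join()
    return displayed
```

Display can start as soon as the first fragment arrives, so the wait before the first hand action appears does not depend on the total size of the action data.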
With reference to the first aspect, in some implementations of the first aspect, the mouth action includes a first mouth action and a second mouth action, where the first mouth action precedes the second mouth action; the electronic device receives first mouth action data from the server, where the first mouth action data is used to display the first mouth action; and while displaying the first mouth action, the electronic device receives second mouth action data from the server, where the second mouth action data is used to display the second mouth action.
It should be noted that the first mouth action or the second mouth action here may be a complete action, or may be the portion of an action contained in one or more frames of that action.
In this technical solution, the electronic device first receives the mouth action data that needs to be displayed first and, while displaying that mouth action, receives the mouth action data to be displayed later. Transmitting the mouth action data in fragments and displaying while transmitting helps shorten the waiting time caused by data transmission and improve the user's application experience.
With reference to the first aspect, in some implementations of the first aspect, before displaying the hand action corresponding to the text information, the electronic device receives a response message from the server, where the response message is used to indicate that the text information does not include sensitive information.
Before the text information is translated into hand actions and/or mouth actions, a text risk control check is first performed on the text information. This helps filter out undesirable text information and improve the application experience of the user of the electronic device.
According to a second aspect, a Chinese translation method is provided, including: a server receives a translation request message, where the translation request message includes text information, the text information includes a keyword, the keyword is determined according to the language habits of sign language users, the translation request message is used to request hand action data corresponding to the text information, and the translation request message is further used to request mouth action data corresponding to the keyword; and the server determines, according to the text information, whether to send the hand action data and/or the mouth action data.
Here, the hand action data is used to display the hand action corresponding to the text information, and the mouth action data is used to display the mouth action corresponding to the keyword.
In a possible implementation, the electronic device identifies the keyword from the text information input by the user on the basis of one or more of the following: the content of the text information, translation history information of the user, or the way other users have determined keywords in the text information.
It should be noted that the keyword here may be one or more characters included in the text information, or one or more words included in the text information.
In this technical solution, mouth actions are added only for keywords determined according to user habits. This helps reduce the amount of data transmitted between the electronic device and the server when text information is translated into sign language, and helps improve the efficiency with which the electronic device translates the text information.
With reference to the second aspect, in some implementations of the second aspect, the keyword is a proper noun.
With reference to the second aspect, in some implementations of the second aspect, the server determines whether the text information includes sensitive information; when the text information includes sensitive information, the server sends a first response message, where the first response message is used to indicate that the text information includes sensitive information; and when the text information does not include sensitive information, the server sends a second response message, where the second response message includes the hand action data and/or the mouth action data.
Before the text information is translated into hand actions and/or mouth actions, a text risk control check is first performed on the text information. This helps filter out undesirable text information and improve the application experience of the user of the electronic device.
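The server-side decision between the first and second response messages can be sketched as follows. The `Response` fields, the placeholder term in `SENSITIVE_TERMS`, and the function name `handle_translation_request` are hypothetical; the application does not describe how sensitive information is detected, so a simple set lookup stands in for the risk control check.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set

SENSITIVE_TERMS: Set[str] = {"forbidden"}  # placeholder list, for illustration only


@dataclass
class Response:
    contains_sensitive: bool
    hand_data: List[str] = field(default_factory=list)
    mouth_data: Dict[str, str] = field(default_factory=dict)


def handle_translation_request(words: List[str], keywords: Set[str]) -> Response:
    """Run the risk check first; return action data only if the text passes."""
    if any(word in SENSITIVE_TERMS for word in words):
        # First response message: flags sensitive content and carries no action data.
        return Response(contains_sensitive=True)
    # Second response message: hand data for every word, mouth data for keywords only.
    return Response(
        contains_sensitive=False,
        hand_data=[f"hand:{word}" for word in words],
        mouth_data={word: f"mouth:{word}" for word in words if word in keywords},
    )
```

Running the check before any action data is looked up means that rejected text never triggers database access or data transmission.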
With reference to the second aspect, in some implementations of the second aspect, the hand action data includes first hand action data and second hand action data, where the first hand action data is used to display a first hand action, the second hand action data is used to display a second hand action, and the first hand action precedes the second hand action; and the server sends the second hand action data after sending the first hand action data.
It should be noted that the first hand action or the second hand action here may be a complete action, or may be the portion of an action contained in one or more frames of that action.
In this technical solution, the server first sends the hand action data that needs to be displayed first and, while that hand action is being displayed, sends the hand action data to be displayed later. Transmitting the hand action data in fragments and displaying while transmitting helps shorten the waiting time caused by data transmission and improve the user's application experience.
With reference to the second aspect, in some implementations of the second aspect, the mouth action data includes first mouth action data and second mouth action data, where the first mouth action data is used to display a first mouth action, the second mouth action data is used to display a second mouth action, and the first mouth action precedes the second mouth action; and the server sends the second mouth action data after sending the first mouth action data.
It should be noted that the first mouth action or the second mouth action here may be a complete action, or may be the portion of an action contained in one or more frames of that action.
In this technical solution, the server first sends the mouth action data that needs to be displayed first and, while that mouth action is being displayed, sends the mouth action data to be displayed later. Transmitting the mouth action data in fragments and displaying while transmitting helps shorten the waiting time caused by data transmission and improve the user's application experience.
With reference to the second aspect, in some implementations of the second aspect, the server obtains the mouth action data from a mouth action database, where the mouth action database contains the blend shape values corresponding to the mouth shapes used to pronounce Chinese pinyin.
With a database established for the mouth action data, when a mouth action needs to be displayed, the electronic device sends a request message to the server, and the server retrieves the required mouth action data from the database. Compared with obtaining mouth action data through solutions such as deep learning, this helps simplify the process of obtaining mouth action data, improve translation efficiency, and improve the application experience on the electronic device.
According to a third aspect, an electronic device is provided, including a processor and a memory, where the memory stores one or more computer programs, the one or more computer programs include instructions, and when the instructions are executed by the processor, the processor is configured to: in response to an input of a user, obtain text information, where the text information includes a keyword and the keyword is determined according to the language habits of sign language users; display a hand action corresponding to the text information; and display a mouth action corresponding to the keyword.
With reference to the third aspect, in some implementations of the third aspect, the keyword is determined according to the language habits of sign language users.
With reference to the third aspect, in some implementations of the third aspect, the processor is further configured not to display a mouth action corresponding to an ordinary word, where the text information includes the ordinary word and the ordinary word is different from the keyword.
With reference to the third aspect, in some implementations of the third aspect, the processor is specifically configured to display the mouth action while displaying the hand action corresponding to the keyword.
With reference to the third aspect, in some implementations of the third aspect, the processor is further configured to display a first word, where the text information includes the first word and the first word is a word for which adding a mouth action is recommended; and in response to a confirmation operation of the user, the processor is further configured to determine that the first word is the keyword.
With reference to the third aspect, in some implementations of the third aspect, in response to a first input of the user, the processor is configured to obtain a second word, where the second word is a word for which the user requests that a mouth action be added; when the text information includes the second word, the processor is further configured to determine that the second word is the keyword; when the text information does not include the second word, the processor is further configured to display update request information, where the update request information is used to indicate that the text information does not include the second word; and in response to a second input of the user, the processor is further configured to obtain an updated second word.
With reference to the third aspect, in some implementations of the third aspect, the hand action includes a first hand action and a second hand action, where the first hand action precedes the second hand action; the processor is further configured to receive first hand action data from a server, where the first hand action data is used to display the first hand action; and while the first hand action is displayed, the processor is further configured to receive second hand action data from the server, where the second hand action data is used to display the second hand action.
With reference to the third aspect, in some implementations of the third aspect, the mouth action includes a first mouth action and a second mouth action, where the first mouth action precedes the second mouth action; the processor is further configured to receive first mouth action data from the server, where the first mouth action data is used to display the first mouth action; and while the first mouth action is displayed, the processor is further configured to receive second mouth action data from the server, where the second mouth action data is used to display the second mouth action.
With reference to the third aspect, in some implementations of the third aspect, the processor is further configured to receive a response message from the server, where the response message is used to indicate that the text information does not include sensitive information.
According to a fourth aspect, a server is provided, including a processor and a memory, where the memory stores one or more computer programs, the one or more computer programs include instructions, and when the instructions are executed by the processor, the processor is configured to: receive a translation request message, where the translation request message includes text information, the translation request message is used to request hand action data corresponding to the text information, the text information includes a keyword, the keyword is determined according to the language habits of sign language users, and the translation request message is further used to request mouth action data corresponding to the keyword; and determine, according to the text information, whether to send the hand action data and/or the mouth action data.
With reference to the fourth aspect, in some implementations of the fourth aspect, the processor is further configured to determine whether the text information includes sensitive information; when the text information includes sensitive information, the processor is further configured to send a first response message, where the first response message is used to indicate that the text information includes sensitive information; and when the text information does not include sensitive information, the processor is further configured to send a second response message, where the second response message includes the hand action data and/or the mouth action data.
With reference to the fourth aspect, in some implementations of the fourth aspect, the hand action data includes first hand action data and second hand action data, where the first hand action data is used to display a first hand action, the second hand action data is used to display a second hand action, and the first hand action precedes the second hand action; and the processor is further configured to send the second hand action data after sending the first hand action data.
With reference to the fourth aspect, in some implementations of the fourth aspect, the mouth action data includes first mouth action data and second mouth action data, where the first mouth action data is used to display a first mouth action, the second mouth action data is used to display a second mouth action, and the first mouth action precedes the second mouth action; and the processor is further configured to send the second mouth action data after sending the first mouth action data.
With reference to the fourth aspect, in some implementations of the fourth aspect, the processor is further configured to obtain the mouth action data from a mouth action database, where the mouth action database contains the blend shape values corresponding to the mouth shapes used to pronounce Chinese pinyin.
第五方面,提供一种汉语翻译装置,包括获取单元和处理单元,该获取单元用于响应于用户的输入,获取文字信息,该文字信息包括关键词,该关键词根据手语使用者的语言习惯确定;该处理单元用于显示该文字信息对应的手部动作;该处理单元还用显示关键词对应的口部动作。In a fifth aspect, a Chinese translation device is provided, including an acquisition unit and a processing unit. The acquisition unit is used to acquire text information in response to user input. The text information includes keywords, and the keywords are based on the language habits of the sign language user. OK; the processing unit is used to display the hand movements corresponding to the text information; the processing unit is also used to display the mouth movements corresponding to the keywords.
结合第五方面,在第五方面的某些实现方式中,该关键词根据手语使用者的语言习惯确定。Combined with the fifth aspect, in some implementations of the fifth aspect, the keyword is determined based on the language habits of the sign language user.
结合第五方面,在第五方面的某些实现方式中,该处理单元还用于不显示普通词汇对应的口部动作,该文字信息包括该普通词汇,该普通词汇与该关键词不同。Combined with the fifth aspect, in some implementations of the fifth aspect, the processing unit is also configured to not display oral movements corresponding to common words, the text information includes the common words, and the common words are different from the keywords.
结合第五方面,在第五方面的某些实现方式中,该处理单元还用于在显示关键词对应的手部动作的同时显示该口部动作。In conjunction with the fifth aspect, in some implementations of the fifth aspect, the processing unit is further configured to display the mouth movement while displaying the hand movement corresponding to the keyword.
结合第五方面,在第五方面的某些实现方式中,该处理单元还用于显示第一词汇,该文字信息包括该第一词汇,该第一词汇为推荐附加口部动作的词汇,响应于用户的确认操作,该处理单元还用于确定该第一词汇为关键词。Combined with the fifth aspect, in some implementations of the fifth aspect, the processing unit is also used to display a first vocabulary, the text information includes the first vocabulary, the first vocabulary is a vocabulary that recommends additional oral movements, and the response Based on the user's confirmation operation, the processing unit is also used to determine that the first vocabulary is a keyword.
结合第五方面,在第五方面的某些实现方式中,该获取单元还用于响应于用户的第一输入,获取第二词汇,该第二词汇为用户请求附加口部动作的词汇;在文字信息包含第二词汇的情况下,该处理单元还用于确定第二词汇为关键词;在文字信息不包含第二词汇的情况下,该处理单元还用于显示更新请求消息,该更新请求消息用于提示文字信息不包含该第二词汇;该获取单元还用于响应于用户的第二输入,获取更新后的第二词汇。In conjunction with the fifth aspect, in some implementations of the fifth aspect, the acquisition unit is further configured to acquire a second word in response to the user's first input, where the second word is a word for which the user requests an additional oral movement; when the text information contains the second word, the processing unit is further configured to determine that the second word is a keyword; when the text information does not contain the second word, the processing unit is further configured to display an update request message, where the update request message indicates that the text information does not contain the second word; the acquisition unit is further configured to acquire an updated second word in response to the user's second input.
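The branch described above can be sketched as follows. This is a minimal illustration, not the claimed implementation: the function name `resolve_requested_keyword`, the pre-segmented word list, and the wording of the prompt are all assumptions introduced here.

```python
def resolve_requested_keyword(text_words, requested_word):
    """Decide whether a user-requested word becomes a keyword.

    text_words: the text information, already split into words
                (word segmentation is assumed to happen elsewhere).
    requested_word: the "second word" the user wants a mouth movement for.

    Returns (keywords, update_request). update_request is None when the
    word was found; otherwise it is a message asking for a new word.
    """
    if requested_word in text_words:
        # The text contains the word, so it is treated as a keyword.
        return [requested_word], None
    # The text does not contain the word: prompt the user to update it.
    message = f"The text does not contain '{requested_word}'; please enter another word."
    return [], message
```

A caller would loop on the returned message, re-reading the user's second input until a word that appears in the text is supplied.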
结合第五方面,在第五方面的某些实现方式中,该汉语翻译装置还包括通信单元,该手部动作包括第一手部动作和第二手部动作,该第一手部动作在该第二手部动作之前,在显示文字信息对应的手部动作前,该通信单元用于从服务器接收第一手部动作数据,该第一手部动作数据用于显示第一手部动作;该通信单元还用于在显示第一手部动作的同时从服务器接收第二手部动作数据,该第二手部动作数据用于显示第二手部动作。In conjunction with the fifth aspect, in some implementations of the fifth aspect, the Chinese translation device further includes a communication unit. The hand movement includes a first hand movement and a second hand movement, and the first hand movement precedes the second hand movement. Before the hand movement corresponding to the text information is displayed, the communication unit is configured to receive first hand movement data from the server, where the first hand movement data is used to display the first hand movement; the communication unit is further configured to receive second hand movement data from the server while the first hand movement is displayed, where the second hand movement data is used to display the second hand movement.
结合第五方面,在第五方面的某些实现方式中,该口部动作包括第一口部动作和第二口部动作,该第一口部动作在第二口部动作之前,该通信单元还用于从服务器接收第一口部动作数据,该第一口部动作数据用于显示第一口部动作;在显示第一口部动作的同时该通信单元还用于从服务器接收第二口部动作数据,该第二口部动作数据用于显示第二口部动作。In conjunction with the fifth aspect, in some implementations of the fifth aspect, the oral movement includes a first oral movement and a second oral movement, and the first oral movement precedes the second oral movement. The communication unit is further configured to receive first oral movement data from the server, where the first oral movement data is used to display the first oral movement; while the first oral movement is displayed, the communication unit is further configured to receive second oral movement data from the server, where the second oral movement data is used to display the second oral movement.
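Receiving the second segment of movement data while the first is being displayed is a form of prefetching. The sketch below shows one possible way to arrange it with a background worker. The callables `fetch` and `display` are hypothetical stand-ins for the communication unit and the processing unit, and nothing in the source prescribes threads for this.

```python
from concurrent.futures import ThreadPoolExecutor


def play_with_prefetch(segment_ids, fetch, display):
    """Display movement segments in order, fetching the next one in the background.

    segment_ids: ordered identifiers of hand (or mouth) movement segments.
    fetch:       hypothetical callable that downloads one segment's data.
    display:     hypothetical callable that renders one segment.
    """
    if not segment_ids:
        return
    with ThreadPoolExecutor(max_workers=1) as pool:
        # The first segment must arrive before anything can be shown.
        pending = pool.submit(fetch, segment_ids[0])
        for next_id in segment_ids[1:]:
            data = pending.result()
            # Start receiving the next segment, then show the current one,
            # so that download and display overlap in time.
            pending = pool.submit(fetch, next_id)
            display(data)
        display(pending.result())
```

Only one segment is requested ahead of the one on screen, which keeps the display order fixed while hiding most of the download time.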
结合第五方面,在第五方面的某些实现方式中,在显示文字信息对应的手部动作前,该通信单元还用于从服务器接收响应消息,该响应消息用于指示文字信息不包含敏感信息。In conjunction with the fifth aspect, in some implementations of the fifth aspect, before the hand movement corresponding to the text information is displayed, the communication unit is further configured to receive a response message from the server, where the response message indicates that the text information does not contain sensitive information.
第六方面,提供一种汉语翻译装置,包括通信单元和处理单元,该通信单元用于,接收翻译请求消息,该翻译请求消息包括文字信息,该翻译请求消息用于请求获取该文字信息对应的手部动作数据,该文字信息包括关键词,该关键词根据手语使用者的语言习惯确定,该翻译请求消息还用于请求获取该关键词对应的口部动作数据;该处理单元用于根据文字信息确定是否发送手部动作数据和/或口部动作数据。In a sixth aspect, a Chinese translation device is provided, including a communication unit and a processing unit. The communication unit is configured to receive a translation request message, where the translation request message includes text information and is used to request the hand movement data corresponding to the text information; the text information includes keywords, the keywords are determined according to the language habits of sign language users, and the translation request message is further used to request the oral movement data corresponding to the keywords. The processing unit is configured to determine, according to the text information, whether to send the hand movement data and/or the oral movement data.
结合第六方面,在第六方面的某些实现方式中,该处理单元还用于确定该文字信息中是否包含敏感信息;在文字信息中包含敏感信息的情况下,该通信单元还用于发送第一响应消息,该第一响应消息用于指示该文字信息包含敏感信息;在文字信息不包含敏感信息的情况下,该通信单元还用于发送第二响应消息,该第二响应消息包括手部动作数据和/或口部动作数据。In conjunction with the sixth aspect, in some implementations of the sixth aspect, the processing unit is further configured to determine whether the text information contains sensitive information; when the text information contains sensitive information, the communication unit is further configured to send a first response message, where the first response message indicates that the text information contains sensitive information; when the text information does not contain sensitive information, the communication unit is further configured to send a second response message, where the second response message includes the hand movement data and/or the oral movement data.
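The server-side decision described above can be sketched as below. This is an illustrative assumption throughout: the source does not say how sensitive information is detected, so plain substring matching against a term set stands in for it, and the dictionary-based databases, field names, and function name are hypothetical.

```python
def handle_translation_request(words, keywords, sensitive_terms, hand_db, mouth_db):
    """Build the response to a translation request.

    words:           the text information as an ordered list of words.
    keywords:        the subset of words that should get mouth movements.
    sensitive_terms: hypothetical set of strings treated as sensitive.
    hand_db:         hypothetical mapping word -> hand movement data.
    mouth_db:        hypothetical mapping word -> mouth movement data.
    """
    text = "".join(words)
    # Assumed detection rule: any sensitive term appearing as a substring.
    if any(term in text for term in sensitive_terms):
        # First response message: report the problem, send no movement data.
        return {"kind": "first", "sensitive": True}
    # Second response message: carry the hand and/or mouth movement data.
    return {
        "kind": "second",
        "hand": [hand_db[w] for w in words if w in hand_db],
        "mouth": [mouth_db[k] for k in keywords if k in mouth_db],
    }
```

On the device side, receiving a response of the second kind corresponds to the indication that the text information contains no sensitive information, after which display can begin.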
结合第六方面,在第六方面的某些实现方式中,该手部动作数据包括第一手部动作数据和第二手部动作数据,该第一手部动作数据用于显示第一手部动作,该第二手部动作数据用于显示第二手部动作,该第一手部动作在该第二手部动作之前,该通信单元还用于在发送第一手部动作数据之后发送该第二手部动作数据。In conjunction with the sixth aspect, in some implementations of the sixth aspect, the hand movement data includes first hand movement data and second hand movement data, where the first hand movement data is used to display a first hand movement, the second hand movement data is used to display a second hand movement, and the first hand movement precedes the second hand movement; the communication unit is further configured to send the second hand movement data after sending the first hand movement data.
结合第六方面,在第六方面的某些实现方式中,该口部动作数据包括第一口部动作数据和第二口部动作数据,该第一口部动作数据用于显示第一口部动作,该第二口部动作数据用于显示第二口部动作,该第一口部动作在该第二口部动作之前,该通信单元还用于在发送第一口部动作数据之后发送该第二口部动作数据。In conjunction with the sixth aspect, in some implementations of the sixth aspect, the oral movement data includes first oral movement data and second oral movement data, where the first oral movement data is used to display a first oral movement, the second oral movement data is used to display a second oral movement, and the first oral movement precedes the second oral movement; the communication unit is further configured to send the second oral movement data after sending the first oral movement data.
结合第六方面,在第六方面的某些实现方式中,该处理单元还用于从口部动作数据库中获取该口部动作数据,该口部动作数据库包含汉语拼音发音口型对应的混合形状数值。In conjunction with the sixth aspect, in some implementations of the sixth aspect, the processing unit is further configured to obtain the oral movement data from an oral movement database, where the oral movement database contains blend shape values corresponding to the mouth shapes of Chinese Pinyin pronunciations.
第七方面,提供一种计算机程序产品,该计算机程序产品包括计算机程序代码,当计算机程序代码在计算机上运行时,使得第一方面或其任意可能的实现方式中的方法被执行。In a seventh aspect, a computer program product is provided. The computer program product includes computer program code. When the computer program code is run on a computer, the method in the first aspect or any possible implementation thereof is executed.
第八方面,提供一种计算机程序产品,该计算机程序产品包括计算机程序代码,当计算机程序代码在计算机上运行时,使得第二方面或其任意可能的实现方式中的方法被执行。In an eighth aspect, a computer program product is provided. The computer program product includes computer program code. When the computer program code is run on a computer, the method in the second aspect or any possible implementation thereof is executed.
第九方面,提供一种计算机可读存储介质,该计算机可读介质中存储有计算机指令,当计算机指令在计算机上运行时,使得第一方面或其任意可能的实现方式中的方法被执行。In a ninth aspect, a computer-readable storage medium is provided. Computer instructions are stored in the computer-readable medium. When the computer instructions are run on a computer, the method in the first aspect or any possible implementation thereof is executed.
第十方面,提供一种计算机可读存储介质,该计算机可读介质中存储有计算机指令,当计算机指令在计算机上运行时,使得第二方面或其任意可能的实现方式中的方法被执行。In a tenth aspect, a computer-readable storage medium is provided. Computer instructions are stored in the computer-readable medium. When the computer instructions are run on a computer, the method in the second aspect or any possible implementation thereof is executed.
第十一方面,提供一种芯片,包括处理器,用于读取存储器中存储的指令,当该处理器执行该指令时,使得该芯片实现第一方面或其任意可能的实现方式中的方法被执行。In an eleventh aspect, a chip is provided, including a processor configured to read instructions stored in a memory; when the processor executes the instructions, the chip is caused to perform the method in the first aspect or any possible implementation thereof.
第十二方面,提供一种芯片,包括处理器,用于读取存储器中存储的指令,当该处理器执行该指令时,使得该芯片实现第二方面或其任意可能的实现方式中的方法被执行。In a twelfth aspect, a chip is provided, including a processor configured to read instructions stored in a memory; when the processor executes the instructions, the chip is caused to perform the method in the second aspect or any possible implementation thereof.
附图说明Description of the drawings
图1是适用于本申请实施例中一种电子设备硬件架构示意图。FIG. 1 is a schematic diagram of the hardware architecture of an electronic device applicable to an embodiment of the present application.
图2是适用于本申请实施例中一种电子设备软件架构示意图。FIG. 2 is a schematic diagram of an electronic device software architecture applicable to an embodiment of the present application.
图3是本申请实施例提供的一种汉语翻译方法示意图。 Figure 3 is a schematic diagram of a Chinese translation method provided by an embodiment of the present application.
图4是本申请实施例提供的另一种汉语翻译方法示意图。Figure 4 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图5是本申请实施例提供的又一种汉语翻译方法示意图。Figure 5 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图6是本申请实施例提供的又一种汉语翻译方法示意图。Figure 6 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图7是本申请实施例提供的又一种汉语翻译方法示意图。Figure 7 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图8是本申请实施例提供的又一种汉语翻译方法示意图。Figure 8 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图9是本申请实施例提供的又一种汉语翻译方法示意图。Figure 9 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图10是本申请实施例提供的又一种汉语翻译方法示意图。Figure 10 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图11是本申请实施例提供的又一种汉语翻译方法示意图。Figure 11 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图12是本申请实施例提供的又一种汉语翻译方法示意图。Figure 12 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application.
图13是本申请实施例提供的一种汉语翻译装置示意图。Figure 13 is a schematic diagram of a Chinese translation device provided by an embodiment of the present application.
图14是本申请实施例提供的另一种汉语翻译装置示意图。Figure 14 is a schematic diagram of another Chinese translation device provided by an embodiment of the present application.
图15是本申请实施例提供的一种电子设备示意图。Figure 15 is a schematic diagram of an electronic device provided by an embodiment of the present application.
图16是本申请实施例提供的一种服务器示意图。Figure 16 is a schematic diagram of a server provided by an embodiment of the present application.
具体实施方式Detailed description
下面将结合附图,对本申请中的技术方案进行描述。The technical solutions in this application will be described below with reference to the accompanying drawings.
以下实施例中所使用的术语只是为了描述特定实施例的目的,而并非旨在作为对本申请的限制。如在本申请的说明书和所附权利要求书中所使用的那样,单数表达形式“一个”、“一种”、“所述”、“上述”、“该”和“这一”旨在也包括例如“一个或多个”这种表达形式,除非其上下文中明确地有相反指示。还应当理解,在本申请以下各实施例中,“至少一个”、“一个或多个”是指一个、两个或两个以上。术语“和/或”,用于描述关联对象的关联关系,表示可以存在三种关系;例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B的情况,其中A、B可以是单数或者复数。字符“/”一般表示前后关联对象是一种“或”的关系。The terminology used in the following embodiments is for the purpose of describing specific embodiments only and is not intended to limit this application. As used in the specification and appended claims of this application, the singular expressions "a", "an", "said", "the above", "the" and "this" are intended to also include expressions such as "one or more", unless the context clearly indicates otherwise. It should also be understood that in the following embodiments of this application, "at least one" and "one or more" mean one, two, or more than two. The term "and/or" describes the relationship between associated objects and indicates that three relationships are possible; for example, A and/or B can mean: A exists alone, A and B exist simultaneously, or B exists alone, where A and B can each be singular or plural. The character "/" generally indicates that the associated objects before and after it are in an "or" relationship.
在本说明书中描述的参考“一个实施例”或“一些实施例”等意味着在本申请的一个或多个实施例中包括结合该实施例描述的特定特征、结构或特点。由此,在本说明书中的不同之处出现的语句“在一个实施例中”、“在一些实施例中”、“在其他一些实施例中”、“在另外一些实施例中”等不是必然都参考相同的实施例,而是意味着“一个或多个但不是所有的实施例”,除非是以其他方式另外特别强调。术语“包括”、“包含”、“具有”及它们的变形都意味着“包括但不限于”,除非是以其他方式另外特别强调。Reference in this specification to "one embodiment" or "some embodiments" or the like means that a particular feature, structure, or characteristic described in connection with that embodiment is included in one or more embodiments of this application. Therefore, the phrases "in one embodiment", "in some embodiments", "in some other embodiments", "in still other embodiments", and the like appearing in different places in this specification do not necessarily all refer to the same embodiment, but rather mean "one or more but not all embodiments", unless specifically emphasized otherwise. The terms "including", "comprising", "having", and their variants all mean "including but not limited to", unless specifically emphasized otherwise.
本申请实施例提供的方法可以应用于手机、平板电脑、可穿戴设备、车载设备、增强现实(augmented reality,AR)/虚拟现实(virtual reality,VR)设备、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本、个人数字助理(personal digital assistant,PDA)等电子设备上,本申请实施例对电子设备的具体类型不作任何限制。The methods provided by the embodiments of this application can be applied to electronic devices such as mobile phones, tablet computers, wearable devices, vehicle-mounted devices, augmented reality (AR)/virtual reality (VR) devices, notebook computers, ultra-mobile personal computers (UMPC), netbooks, and personal digital assistants (PDA). The embodiments of this application do not place any restriction on the specific type of electronic device.
示例性的,图1示出了电子设备100的结构示意图。电子设备100可以包括处理器110,外部存储器接口120,内部存储器121,通用串行总线(universal serial bus,USB)接口130,充电管理模块140,电源管理模块141,电池142,天线1,天线2,移动通信模块150,无线通信模块160,音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,传感器模块180,按键190,马达191,指示器192,摄像头193,显示屏 194,以及用户身份识别(subscriber identification module,SIM)卡接口195等。其中传感器模块180可以包括压力传感器180A,陀螺仪传感器180B,气压传感器180C,磁传感器180D,加速度传感器180E,距离传感器180F,接近光传感器180G,指纹传感器180H,温度传感器180J,触摸传感器180K,环境光传感器180L,骨传导传感器180M等。By way of example, FIG. 1 shows a schematic structural diagram of an electronic device 100 . The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charging management module 140, a power management module 141, a battery 142, an antenna 1, and an antenna 2. , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone interface 170D, sensor module 180, button 190, motor 191, indicator 192, camera 193, display screen 194, and subscriber identification module (SIM) card interface 195, etc. The sensor module 180 may include a pressure sensor 180A, a gyro sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
可以理解的是,本申请实施例示意的结构并不构成对电子设备100的具体限定。在本申请另一些实施例中,电子设备100可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that the structure illustrated in the embodiment of the present application does not constitute a specific limitation on the electronic device 100 . In other embodiments of the present application, the electronic device 100 may include more or fewer components than shown in the figures, or some components may be combined, some components may be separated, or some components may be arranged differently. The components illustrated may be implemented in hardware, software, or a combination of software and hardware.
处理器110可以包括一个或多个处理单元,例如:处理器110可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processing unit (GPU), and an image signal processor. (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) wait. Among them, different processing units can be independent devices or integrated in one or more processors.
其中,控制器可以是电子设备100的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The controller may be the nerve center and command center of the electronic device 100 . The controller can generate operation control signals based on the instruction operation code and timing signals to complete the control of fetching and executing instructions.
处理器110中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器110中的存储器为高速缓冲存储器。该存储器可以保存处理器110刚用过或循环使用的指令或数据。如果处理器110需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复存取,减少了处理器110的等待时间,因而提高了系统的效率。The processor 110 may also be provided with a memory for storing instructions and data. In some embodiments, the memory in processor 110 is cache memory. This memory may hold instructions or data that have been recently used or recycled by processor 110 . If the processor 110 needs to use the instructions or data again, it can be called directly from the memory. Repeated access is avoided and the waiting time of the processor 110 is reduced, thus improving the efficiency of the system.
在一些实施例中,处理器110可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户身份识别(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, processor 110 may include one or more interfaces. Interfaces may include integrated circuit (inter-integrated circuit, I2C) interface, integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, pulse code modulation (pulse code modulation, PCM) interface, universal asynchronous receiver and transmitter (universal asynchronous receiver/transmitter (UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and /or universal serial bus (USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(serial clock line,SCL)。在一些实施例中,处理器110可以包含多组I2C总线。处理器110可以通过不同的I2C总线接口分别耦合触摸传感器180K,充电器,闪光灯,摄像头193等。例如:处理器110可以通过I2C接口耦合触摸传感器180K,使处理器110与触摸传感器180K通过I2C总线接口通信,实现电子设备100的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may include multiple sets of I2C buses. The processor 110 can be coupled to the touch sensor 180K, a charger, a flash, the camera 193, and so on through different I2C bus interfaces. For example, the processor 110 can be coupled to the touch sensor 180K through an I2C interface, so that the processor 110 and the touch sensor 180K communicate through the I2C bus interface to implement the touch function of the electronic device 100.
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, processor 110 may include multiple sets of I2S buses. The processor 110 can be coupled with the audio module 170 through the I2S bus to implement communication between the processor 110 and the audio module 170 . In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface to implement the function of answering calls through a Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communication, sampling, quantizing, and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface to implement the function of answering calls through a Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
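The three PCM steps named above (sampling, quantization, encoding) can be illustrated with a short sketch. The 16-bit little-endian sample format and the 8000 Hz default rate are assumptions chosen for the example, not values taken from this application, and the function name is hypothetical.

```python
import math
import struct


def pcm_encode(signal, num_samples, sample_rate=8000):
    """Convert a continuous signal into 16-bit little-endian PCM bytes.

    signal:      function of time in seconds returning an amplitude,
                 nominally in the range [-1.0, 1.0].
    num_samples: how many samples to take.
    sample_rate: samples per second (assumed value for illustration).
    """
    frames = bytearray()
    for i in range(num_samples):
        x = signal(i / sample_rate)        # sampling: read the signal at discrete times
        x = max(-1.0, min(1.0, x))         # clip so that quantization cannot overflow
        q = int(round(x * 32767))          # quantization: map to a 16-bit integer level
        frames += struct.pack("<h", q)     # encoding: 2 bytes per sample, little-endian
    return bytes(frames)


# Usage: 80 samples (10 ms at 8000 Hz) of a 440 Hz tone.
tone = pcm_encode(lambda t: math.sin(2 * math.pi * 440 * t), 80)
```

Each sample occupies two bytes, so the returned buffer is twice as long as the number of samples requested.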
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器110与无线通信模块160。例如:处理器110通过UART接口与无线通信模块160中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块170可以通过UART接口向无线通信模块160传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus can be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is generally used to connect the processor 110 and the wireless communication module 160 . For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface to implement the function of playing music through a Bluetooth headset.
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。The MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 . MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc. In some embodiments, the processor 110 and the camera 193 communicate through the CSI interface to implement the shooting function of the electronic device 100 . The processor 110 and the display screen 194 communicate through the DSI interface to implement the display function of the electronic device 100 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured through software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface can be used to connect the processor 110 with the camera 193, display screen 194, wireless communication module 160, audio module 170, sensor module 180, etc. The GPIO interface can also be configured as an I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 130 is an interface that complies with the USB standard specification, and may be a Mini USB interface, a Micro USB interface, a USB Type C interface, etc. The USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through them. This interface can also be used to connect other electronic devices, such as AR devices, etc.
可以理解的是,本申请实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在本申请另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationships between the modules illustrated in the embodiments of the present application are only schematic illustrations and do not constitute a structural limitation of the electronic device 100 . In other embodiments of the present application, the electronic device 100 may also adopt different interface connection methods in the above embodiments, or a combination of multiple interface connection methods.
充电管理模块140用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块140可以通过USB接口130接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块140可以通过电子设备100的无线充电线圈接收无线充电输入。充电管理模块140为电池142充电的同时,还可以通过电源管理模块141为电子设备供电。The charging management module 140 is used to receive charging input from the charger. Among them, the charger can be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from the wired charger through the USB interface 130 . In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device 100 . While the charging management module 140 charges the battery 142, it can also provide power to the electronic device through the power management module 141.
电源管理模块141用于连接电池142,充电管理模块140与处理器110。电源管理模块141接收电池142和/或充电管理模块140的输入,为处理器110,内部存储器121,外部存储器,显示屏194,摄像头193,和无线通信模块160等供电。电源管理模块141还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在其他一些实施例中,电源管理模块141也可以设置于处理器110中。在另一些实施例中,电源管理模块141和充电管理模块140也可以设置于同一个器件中。The power management module 141 is used to connect the battery 142, the charging management module 140 and the processor 110. The power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, internal memory 121, external memory, display screen 194, camera 193, wireless communication module 160, etc. The power management module 141 can also be used to monitor battery capacity, battery cycle times, battery health status (leakage, impedance) and other parameters. In some other embodiments, the power management module 141 may also be provided in the processor 110 . In other embodiments, the power management module 141 and the charging management module 140 may also be provided in the same device.
电子设备100的无线通信功能可以通过天线1,天线2,移动通信模块150,无线通信模块160,调制解调处理器以及基带处理器等实现。The wireless communication function of the electronic device 100 can be implemented through the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor and the baseband processor.
天线1和天线2用于发射和接收电磁波信号。电子设备100中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization. For example: Antenna 1 can be reused as a diversity antenna for a wireless LAN. In other embodiments, antennas may be used in conjunction with tuning switches.
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。The mobile communication module 150 can provide wireless communication solutions, including 2G/3G/4G/5G, applied on the electronic device 100. The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA), and so on. The mobile communication module 150 can receive electromagnetic waves through the antenna 1, perform filtering, amplification, and other processing on the received electromagnetic waves, and transmit them to the modem processor for demodulation. The mobile communication module 150 can also amplify the signal modulated by the modem processor and convert it into electromagnetic waves through the antenna 1 for radiation. In some embodiments, at least some of the functional modules of the mobile communication module 150 may be disposed in the processor 110. In some embodiments, at least some of the functional modules of the mobile communication module 150 and at least some of the modules of the processor 110 may be provided in the same device.
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。A modem processor may include a modulator and a demodulator. Among them, the modulator is used to modulate the low-frequency baseband signal to be sent into a medium-high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low-frequency baseband signal. The demodulator then transmits the demodulated low-frequency baseband signal to the baseband processor for processing. After the low-frequency baseband signal is processed by the baseband processor, it is passed to the application processor. The application processor outputs sound signals through audio devices (not limited to speaker 170A, receiver 170B, etc.), or displays images or videos through display screen 194. In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be independent of the processor 110 and may be provided in the same device as the mobile communication module 150 or other functional modules.
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) network), Bluetooth (bluetooth, BT), and global navigation satellites. System (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field communication technology (near field communication, NFC), infrared technology (infrared, IR) and other wireless communication solutions. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 . The wireless communication module 160 can also receive the signal to be sent from the processor 110, frequency modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 for radiation.
在一些实施例中,电子设备100的天线1和移动通信模块150耦合,天线2和无线通信模块160耦合,使得电子设备100可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidou navigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the electronic device 100 is coupled to the mobile communication module 150, and the antenna 2 is coupled to the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology. The wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code division multiple access (wideband code division multiple access, WCDMA), time division code division multiple access (time-division code division multiple access, TD-SCDMA), long term evolution (long term evolution, LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc. The GNSS may include global positioning system (GPS), global navigation satellite system (GLONASS), Beidou navigation satellite system (BDS), quasi-zenith satellite system (quasi) -zenith satellite system (QZSS) and/or satellite based augmentation systems (SBAS).
The electronic device 100 implements display functions through the GPU, the display screen 194, the application processor, and the like. The GPU is a microprocessor for image processing and is connected to the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
The display screen 194 is used to display images, videos, and the like. The display screen 194 includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), a Mini-LED, a Micro-LED, a Micro-OLED, a quantum dot light-emitting diode (QLED), or the like. In some embodiments, the electronic device 100 may include 1 or N display screens 194, where N is a positive integer greater than 1.
The electronic device 100 can implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and the like.
The ISP is used to process data fed back by the camera 193. For example, when a photo is taken, the shutter opens, light is transmitted through the lens to the photosensitive element of the camera, and the optical signal is converted into an electrical signal. The photosensitive element passes the electrical signal to the ISP, which processes it into an image visible to the naked eye. The ISP can also algorithmically optimize the noise, brightness, and skin tone of the image, and can optimize parameters such as the exposure and color temperature of the shooting scene. In some embodiments, the ISP may be provided in the camera 193.
The camera 193 is used to capture still images or video. An object generates an optical image through the lens, and the image is projected onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal and passes the electrical signal to the ISP, which converts it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard format such as RGB or YUV. In some embodiments, the electronic device 100 may include 1 or N cameras 193, where N is a positive integer greater than 1.
The digital signal processor is used to process digital signals. In addition to digital image signals, it can process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor performs a Fourier transform or similar operation on the frequency point energy.
The video codec is used to compress or decompress digital video. The electronic device 100 may support one or more video codecs, so that it can play or record video in multiple encoding formats, for example moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.
The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, for example the transmission pattern between neurons in the human brain, it processes input information quickly and can also learn continuously. The NPU enables intelligent cognition applications on the electronic device 100, such as image recognition, face recognition, speech recognition, and text understanding.
The external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100. The external memory card communicates with the processor 110 through the external memory interface 120 to implement the data storage function, for example saving music, video, and other files on the external memory card.
The internal memory 121 may be used to store computer-executable program code, which includes instructions. The processor 110 runs the instructions stored in the internal memory 121 to execute the various functional applications and data processing of the electronic device 100. The internal memory 121 may include a program storage area and a data storage area. The program storage area can store the operating system and the application programs required by at least one function (such as a sound playback function or an image playback function). The data storage area can store data created during use of the electronic device 100 (such as audio data and the phone book). In addition, the internal memory 121 may include high-speed random access memory and may also include non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS).
The electronic device 100 can implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the headphone interface 170D, the application processor, and the like.
The audio module 170 is used to convert digital audio information into an analog audio signal for output, and to convert analog audio input into a digital audio signal. The audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110, or some functional modules of the audio module 170 may be provided in the processor 110.
The speaker 170A, also called a "loudspeaker", is used to convert audio electrical signals into sound signals. Music or a hands-free call can be listened to through the speaker 170A of the electronic device 100.
The receiver 170B, also called an "earpiece", is used to convert audio electrical signals into sound signals. When the electronic device 100 answers a call or plays a voice message, the voice can be heard by holding the receiver 170B close to the ear.
The microphone 170C, also called a "mic" or "sound transducer", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement noise reduction in addition to collecting sound signals. In still other embodiments, the electronic device 100 may be provided with three, four, or more microphones 170C to collect sound signals, reduce noise, identify sound sources, implement directional recording, and so on.
The headphone interface 170D is used to connect wired headphones. The headphone interface 170D may be the USB interface 130, a 3.5 mm open mobile terminal platform (OMTP) standard interface, or a Cellular Telecommunications Industry Association of the USA (CTIA) standard interface.
The keys 190 include a power key, volume keys, and the like. The keys 190 may be mechanical keys or touch keys. The electronic device 100 can receive key input and generate key signal input related to the user settings and function control of the electronic device 100.
The motor 191 can generate vibration prompts. The motor 191 can be used for incoming-call vibration prompts and for touch vibration feedback. For example, touch operations acting on different applications (such as taking pictures or audio playback) can correspond to different vibration feedback effects. Touch operations acting on different areas of the display screen 194 can also correspond to different vibration feedback effects of the motor 191. Different application scenarios (such as time reminders, receiving messages, alarm clocks, and games) can likewise correspond to different vibration feedback effects. The touch vibration feedback effect can also be customized.
The indicator 192 may be an indicator light. It can be used to indicate the charging status and changes in battery level, and can also be used to indicate messages, missed calls, notifications, and the like.
The SIM card interface 195 is used to connect a SIM card. A SIM card can be brought into contact with or separated from the electronic device 100 by inserting it into or removing it from the SIM card interface 195. The electronic device 100 can support 1 or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 can support Nano SIM cards, Micro SIM cards, SIM cards, and the like. Multiple cards can be inserted into the same SIM card interface 195 at the same time, and the cards may be of the same type or of different types. The SIM card interface 195 is also compatible with different types of SIM cards and with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as calls and data communication. In some embodiments, the electronic device 100 uses an embedded SIM (eSIM) card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from it.
It should be understood that the phone card in the embodiments of this application includes, but is not limited to, a SIM card, an eSIM card, a universal subscriber identity module (USIM), a universal integrated circuit card (UICC), and so on.
The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture. The embodiments of this application take the Android system with a layered architecture as an example to illustrate the software structure of the electronic device 100.
Figure 2 is a block diagram of the software structure of the electronic device 100 according to an embodiment of this application. The layered architecture divides the software into several layers, each with a clear role and division of labor. The layers communicate through software interfaces. In some embodiments, the Android system is divided into four layers, which from top to bottom are the application layer, the application framework layer, the Android runtime and system libraries, and the kernel layer. The application layer can include a series of application packages.
As shown in Figure 2, the application packages can include applications such as Camera, Gallery, Calendar, Phone, Maps, Navigation, WLAN, Bluetooth, Music, Video, and Messages.
The application framework layer provides an application programming interface (API) and a programming framework for the applications in the application layer. The application framework layer includes some predefined functions.
As shown in Figure 2, the application framework layer can include a window manager, content providers, a view system, a phone manager, a resource manager, a notification manager, and so on.
The window manager is used to manage window programs. The window manager can obtain the display size, determine whether there is a status bar, lock the screen, take screenshots, and so on.
Content providers are used to store and retrieve data and make the data accessible to applications. The data can include videos, images, audio, calls made and received, browsing history and bookmarks, the phone book, and so on.
The view system includes visual controls, such as controls for displaying text and controls for displaying pictures. The view system can be used to build applications. A display interface can be composed of one or more views. For example, a display interface that includes a text message notification icon can include a view for displaying text and a view for displaying pictures.
The phone manager is used to provide the communication functions of the electronic device 100, for example management of the call status (including connected, hung up, and so on).
The resource manager provides applications with various resources, such as localized strings, icons, pictures, layout files, and video files.
The notification manager allows applications to display notification information in the status bar. It can be used to convey informational messages, which can disappear automatically after a short stay without user interaction. For example, the notification manager is used to announce that a download is complete or to give message reminders. A notification may also appear in the status bar at the top of the system in the form of a chart or scroll-bar text, such as a notification from an application running in the background, or appear on the screen in the form of a dialog window. Examples include text prompts in the status bar, a prompt sound, vibration of the electronic device, and a flashing indicator light.
The Android runtime includes the core libraries and a virtual machine. The Android runtime is responsible for the scheduling and management of the Android system.
The core libraries contain two parts: one part is the functions that the Java language needs to call, and the other part is the Android core library.
The application layer and the application framework layer run in the virtual machine. The virtual machine executes the Java files of the application layer and the application framework layer as binary files. The virtual machine performs functions such as object lifecycle management, stack management, thread management, security and exception management, and garbage collection.
The system libraries can include multiple functional modules, for example a surface manager, media libraries, a three-dimensional graphics processing library (for example, OpenGL ES), and a 2D graphics engine (for example, SGL).
The surface manager is used to manage the display subsystem and provides the fusion of 2D and 3D layers for multiple applications.
The media libraries support playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media libraries can support multiple audio, video, and image encoding formats, for example MPEG4, H.264, MP3, AAC, AMR, JPG, and PNG.
The three-dimensional graphics processing library is used to implement three-dimensional graphics drawing, image rendering, composition, layer processing, and so on.
The 2D graphics engine is a drawing engine for 2D drawing.
The kernel layer is the layer between hardware and software. The kernel layer contains at least a display driver, a camera driver, an audio driver, and a sensor driver.
It should be understood that the technical solutions in the embodiments of this application can be used in systems such as Android, iOS, and HarmonyOS.
The hardware and software architecture of an electronic device suitable for the translation method provided by this application has been introduced above with reference to Figures 1 and 2. The Chinese translation method provided by the embodiments of this application is described below with reference to Figures 3 to 16. Before the embodiments are formally introduced, some terms that may be used in the following embodiments are explained.
1. Chinese sign language (CSL): the common sign language of China, mainly used in mainland China.
2. Automatic speech recognition (ASR): also called speech to text (STT). Its goal is for a computer to automatically convert human speech content into the corresponding text.
3. Optical character recognition (OCR): the process of analyzing and recognizing image files of text material to obtain text and layout information.
4. Software development kit (SDK): a collection of development tools used to create application software for a specific software package, software framework, hardware platform, operating system, or the like.
5. Blendshape: a technique that operates on the mesh vertices of a three-dimensional model to produce defined shapes. It can be used to control the facial expressions of virtual characters.
6. Digital human: a visible, controllable virtual human body form displayed on a computer screen, obtained by digitizing the structure of the human body through computer technology. Functional information about the human body is then attached to this body-form framework. Combined with virtual reality technology, the "digital human" can imitate a real person and make a wide variety of responses. If sound and force feedback devices are provided, it can also offer an intuitive and natural real-time sense of sight, hearing, and touch.
7. Sign language (also "signed language" or "signing"): a language that conveys meaning through the visual-gestural modality, using body movements and facial expressions, rather than through the auditory-vocal modality.
8. Part of speech: the characteristics of a word that are used to divide words into classes. Words in modern Chinese can be divided into two major categories, content words and function words. Content words can serve alone as syntactic components, or mostly serve as the main components of sentences. They have both lexical meaning and grammatical meaning, and include nouns, verbs, adjectives, adverbs, numerals, quantifiers, pronouns, and onomatopoeia. Function words cannot serve alone as syntactic components, or mostly serve as auxiliary components of sentences. They have only grammatical meaning, and include prepositions, conjunctions, particles, and interjections.
Table 1 gives one method of classifying parts of speech, in which proper nouns can include names of people, place names, names of institutions and groups, titles of works, and other proper nouns.
Table 1 Part-of-speech tags and their meanings

Figure 3 is a schematic diagram of a Chinese translation method provided by an embodiment of this application. The method is introduced below using the example of an electronic device that uses App1 to translate text information into the corresponding sign language.
It should be noted that in the following embodiments the user of App1 (that is, the electronic device user, or simply the user) may be either a hearing-impaired person or a person without hearing impairment.
The electronic device user can enter the data to be translated through the input function control 304 of App1. The input function control 304 can be used to input one or more of the following data types into App1: text (for example, the content shown at 303), images, documents, audio, video, and so on.
When the electronic device user inputs text, App1 can directly obtain the text information contained in the input. The text may be entered manually by the user, or the user may select it from one or more texts provided by App1 (common sentences built into App1).
When the electronic device user inputs an image, App1 receives the image data and recognizes the text information contained in the image through OCR. When the electronic device user inputs a document (such as example.txt), App1 receives the document data and parses the document to obtain the text information it contains.
When the electronic device user inputs audio or video data, App1 receives the data and recognizes the text information it contains through ASR and/or OCR. For example, when the video data received by App1 contains subtitles, App1 can recognize the text information in the video through OCR. When the video data contains audio data, App1 can recognize the text information through ASR. When the video data contains both subtitles and audio data, App1 can use ASR and OCR at the same time to recognize the text information in the video and check the two results against each other, which improves the accuracy of text recognition.
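The routing of input types described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the application's implementation: the `InputData` type, the `extract_text` function, and the `ocr` and `asr` callables are hypothetical names, and the cross-check policy (report whether the two recognizers agree and keep the subtitle text) is an assumption, since the text only says that the two results are checked against each other.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class InputData:
    """Hypothetical container for what the user passes to App1."""
    kind: str                      # "text", "image", "document", "audio", or "video"
    text: Optional[str] = None     # typed text, or the parsed contents of a document
    has_subtitles: bool = False    # only meaningful for video
    has_audio: bool = False        # only meaningful for video


def extract_text(data: InputData,
                 ocr: Callable[[InputData], str],
                 asr: Callable[[InputData], str]) -> Tuple[str, bool]:
    """Return (text, agreed), choosing a recognizer by input type.

    `agreed` is False only when OCR and ASR were both run and disagreed.
    """
    if data.kind in ("text", "document"):
        return data.text or "", True          # text is available directly
    if data.kind == "image":
        return ocr(data), True                # images go through OCR
    if data.kind == "audio":
        return asr(data), True                # audio goes through ASR
    if data.kind == "video":
        if data.has_subtitles and data.has_audio:
            from_subtitles, from_speech = ocr(data), asr(data)
            # Assumed policy: keep the subtitle text and flag any disagreement.
            return from_subtitles, from_subtitles == from_speech
        if data.has_subtitles:
            return ocr(data), True
        if data.has_audio:
            return asr(data), True
        raise ValueError("video has neither subtitles nor audio")
    raise ValueError(f"unsupported input type: {data.kind!r}")
```

A caller could treat `agreed == False` as a signal to ask the user to confirm the recognized text before translation.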
When App1 obtains the text information shown at 303, "John went to the cinema this afternoon." (约翰今天下午去了电影院。), App1 can obtain the hand movement data corresponding to the text information and then use the hand movement data to drive a virtual character model, so that the virtual character model displays the hand movements corresponding to the text information.
In some embodiments, before App1 translates the obtained text into sign language, App1 also identifies the part of speech of each word contained in the data to be translated. For a proper noun, App1 also obtains the mouth movement data corresponding to that proper noun. When the virtual character displays the hand movements for the proper noun, it also displays the corresponding mouth movements, driven by the mouth movement data.
In some embodiments, when the electronic device user opens the interface shown in Figure 3, App1 displays prompt information 302 in response to the user's operation. The prompt information prompts the user with the method or steps for using App1.
For example, the prompt information can prompt the electronic device user to input the data to be translated into App1 through the input function control 304.
For example, the prompt information can also prompt the electronic device user to input the characters or words that require additional mouth movements.
Optionally, App1 can also display processing status information. For example, area 301 shows the operation that App1 is currently performing or the operation that the user is performing.
Figure 4 is a schematic diagram of another Chinese translation method provided by an embodiment of this application.
In this embodiment, App1's recognition of text information in an image through OCR is used as an example to illustrate how App1 processes data to be translated, such as images and documents, from which text information must first be recognized.
The electronic device user inputs into App1, through the input function control, a picture containing the text "John went to the cinema this afternoon." (约翰今天下午去了电影院。). After receiving the picture, App1 recognizes the text information in the picture through OCR.
In some embodiments, App1 recognizes the text information correctly ("John went to the cinema this afternoon.") and displays a confirmation prompt window for the text recognition result, as shown in (a) in Figure 4. The user taps "Confirm". App1 obtains the user's confirmation and performs the next operation, which is the operation shown in (c) in Figure 4.
In other embodiments, App1 recognizes the text information incorrectly ("John went to the cinema this morning.") and displays the confirmation prompt window for the text recognition result. After determining that the recognized text is wrong, the user taps "Modify". App1 obtains the user's modification instruction and displays the prompt window for modifying the text recognition result, as shown in (b) in Figure 4. The user enters the correct text ("John went to the cinema this afternoon.") and taps "Confirm". In response to the user's input, App1 obtains the modified text information and performs the next operation, which is the operation shown in (c) in Figure 4.
As shown in (c) in Figure 4, after obtaining the text information confirmed by the user, App1 can translate the text information into the corresponding hand movements.
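The confirm-or-modify step of Figure 4 can be sketched as a small function. This is only an illustration under assumptions: `resolve_recognized_text` and the action strings `"confirm"` and `"modify"` are hypothetical names standing in for the two buttons described above, and the actual user interface handling is not specified in the text.

```python
from typing import Optional


def resolve_recognized_text(recognized: str,
                            action: str,
                            corrected: Optional[str] = None) -> str:
    """Return the text that should go on to translation.

    `action` is the (hypothetical) button the user tapped in the prompt window.
    """
    if action == "confirm":
        # Figure 4 (a): the recognition result is accepted as is.
        return recognized
    if action == "modify":
        # Figure 4 (b): the user supplies the corrected text, then confirms.
        if not corrected:
            raise ValueError("a corrected text is required when modifying")
        return corrected
    raise ValueError(f"unknown action: {action!r}")
```

Either path ends with a single confirmed string, which matches the way both embodiments continue with the operation shown in (c) in Figure 4.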
In some embodiments, before translating the confirmed text information into hand movements, App1 displays the operation prompt "Please enter the keywords that require additional mouth movements:". Following the prompt, the electronic device user inputs "John" (约翰) into App1 through the input function control. In response, App1 obtains the keyword "John" and, while obtaining the hand movement data corresponding to the text information, also obtains the mouth movement data corresponding to the keyword. App1 can then use the hand movement data and the keyword's mouth movement data to drive the virtual character to display the corresponding hand movements and mouth movements.
In other embodiments, before translating the confirmed text information into sign language, App1 analyzes the text information and finds that it contains the proper noun "John". App1 takes the proper noun as a keyword that requires additional mouth movements and, while obtaining the hand movement data corresponding to the text information, also obtains the mouth movement data corresponding to the proper noun. App1 can then use the hand movement data and the keyword's mouth movement data to drive the virtual character to display the corresponding hand movements and mouth movements.
It should be noted that a keyword can be a word containing one or more Chinese characters.
Figure 5 is a schematic diagram of another Chinese translation method provided by an embodiment of the present application. In this embodiment, the electronic device user enters the keywords that require additional mouth movements, and App1 checks the entered keywords to reduce the probability of errors when the text information is translated into hand movements.
As shown in (a) of Figure 5, the electronic device user enters the text information to be translated: "Please turn off the faucet after using water." In response to the user's input, App1 displays the prompt "Please enter the keywords that require additional mouth movements:".
In some embodiments, following the above prompt, the electronic device user enters the keywords "turn off, faucet". In response to the user's input, App1 checks and determines that the text information contains the keywords entered by the user, and then performs the corresponding translation operation.
In other embodiments, following the above prompt, the electronic device user enters the keywords "turn off, fire faucet" ("火龙头", which differs by one character from "水龙头", "faucet"). In response to the user's input, App1 checks and determines that the text information contains the keyword "turn off" but not the keyword "fire faucet". App1 then displays the keyword confirmation prompt shown in (b) of Figure 5: "'Fire faucet' was not found. Please confirm whether you meant 'faucet'." Based on this prompt, the electronic device user determines that the entered keyword was wrong and that the keyword identified by App1 is correct, and clicks "Confirm". In response to the user's confirmation, App1 updates the keywords that require additional mouth movements to "turn off" and "faucet".
Alternatively, when the user determines that the entered keyword was wrong and that the keyword identified by App1 is also incorrect, the user can click "Modify" and enter the correct keywords that require additional mouth movements.
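The keyword check described above can be illustrated with a minimal sketch. The application does not specify how App1 finds the candidate keyword; the position-by-position character comparison, the function name `check_keywords`, and the `min_match` threshold below are all assumptions made for illustration only.

```python
def check_keywords(text, keywords, min_match=0.5):
    """Split keywords into those found in the text and those needing confirmation.

    Hypothetical helper: for a keyword that is not in the text, propose the
    same-length substring of the text that shares the most characters with it
    at the same positions (an assumed heuristic, not the claimed method).
    """
    confirmed, suggestions = [], {}
    for kw in keywords:
        if kw in text:
            confirmed.append(kw)
            continue
        best, best_score = None, 0.0
        # Slide a window of the keyword's length over the text.
        for i in range(len(text) - len(kw) + 1):
            candidate = text[i:i + len(kw)]
            score = sum(a == b for a, b in zip(kw, candidate)) / len(kw)
            if score > best_score:
                best, best_score = candidate, score
        # None means no plausible candidate; the user would click "Modify".
        suggestions[kw] = best if best_score >= min_match else None
    return confirmed, suggestions


confirmed, suggestions = check_keywords("用完水请关闭水龙头。", ["关闭", "火龙头"])
print(confirmed)    # keywords found verbatim in the text
print(suggestions)  # mistyped keyword mapped to the proposed replacement
```

In this sketch, a suggestion of `None` corresponds to the case where the user has to enter the keyword again, while a non-empty suggestion corresponds to the confirmation prompt of (b) in Figure 5.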
In still other embodiments, following the above prompt, the electronic device user enters the keyword "faucet". In response to the user's input, App1 checks and determines that the text information contains the keyword entered by the user but cannot obtain the mouth movement data corresponding to that keyword. App1 then issues a prompt, which may be: "The mouth movement data for 'faucet' could not be found. Manual service has been requested for you in the background. Please wait."
Optionally, App1 can establish a video connection between the user and a human customer service agent. Once the connection is established, the agent can demonstrate to the user the mouth movements of the keyword whose data could not be obtained. Alternatively, the agent adds the missing mouth movement data for that keyword in the background for App1 to call; after obtaining the mouth movement data, App1 displays the mouth movements to the electronic device user.
In still other embodiments, in response to the above prompt, the electronic device user enters the entire text information to be translated as keywords. In response to the user's input, App1 detects that a large number of keywords require additional mouth movement data. App1 can issue a prompt informing the user that many keywords currently require additional mouth movement data and that the user can re-enter the keywords that require additional mouth movements.
In still other embodiments, the electronic device user does not enter any keyword in response to the above prompt. When App1 detects that no keyword has been obtained from the user within a preset duration, App1 can issue a prompt based on the content of the recognized text information entered by the user. The prompt contains keywords recommended for additional mouth movements.
In one embodiment, the keywords recommended for additional mouth movements can be determined according to the parts of speech of Chinese words as shown in Table 1, for example, proper nouns and time words.
In another embodiment, the keywords recommended for additional mouth movements can also be determined according to user habits. For example, if the user's translation query history in App1 shows that the user has repeatedly attached mouth movements to "John" as a keyword, then when the data to be translated entered by the user also contains "John", App1 can use "John" as a keyword recommended for additional mouth movements.
As another example, if the user, when selecting keywords for additional mouth movements for the text information to be translated, has repeatedly chosen the subject and object of a sentence, then when App1 does not obtain any user-entered keyword within the preset duration, App1 can determine the subject and object of the text information to be translated as the keywords recommended for additional mouth movements.
In yet another embodiment, the keywords recommended for additional mouth movements can be determined according to how other users have chosen keywords. For example, if 80% of users chose "cinema" and "amusement park" as keywords that require additional mouth movements for a given video, then when the user inputs the same video and App1 does not obtain any user-entered keyword within the preset duration, App1 can use "cinema" and "amusement park" as the keywords recommended for additional mouth movements.
Optionally, when the keywords entered by the user include only "cinema", App1 can also issue a prompt asking the user whether to add mouth movement data for "amusement park" as well. When the electronic device user confirms that mouth movement data should be added for "amusement park", App1, in response to the user's operation, uses both "cinema" and "amusement park" as keywords that require additional mouth movements.
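The habit-based and crowd-based recommendations above can be sketched as follows. This is only an illustration under stated assumptions: the function name `recommend_keywords`, the data shapes, the `min_uses` threshold standing in for "repeatedly", and the use of 0.8 as a cut-off (taken from the 80% example) are hypothetical, and part-of-speech analysis per Table 1 is not modelled.

```python
def recommend_keywords(text, user_history, crowd_share, min_uses=2, min_share=0.8):
    """Recommend keywords for additional mouth movements (illustrative sketch).

    user_history: word -> number of times this user attached mouth movements
                  to the word (assumed data shape).
    crowd_share:  word -> fraction of other users who chose the word for the
                  same input (assumed data shape).
    """
    recommended = []
    # User habit: words the user has chosen repeatedly and that occur in the text.
    for word, uses in user_history.items():
        if uses >= min_uses and word in text and word not in recommended:
            recommended.append(word)
    # Other users: words chosen by a large enough share of users.
    for word, share in crowd_share.items():
        if share >= min_share and word in text and word not in recommended:
            recommended.append(word)
    return recommended


print(recommend_keywords(
    "John went to the cinema this afternoon.",
    user_history={"John": 3, "cinema": 1},
    crowd_share={"cinema": 0.8, "afternoon": 0.4},
))
```

Such a function would only be called when no keyword has been entered within the preset duration, or to propose further keywords in addition to those the user entered.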
As shown in (c) of Figure 5, after App1 obtains the keywords confirmed by the user, App1 also displays a prompt that shows the updated keywords.
The above describes, with reference to Figures 3 to 5, the text information input process of the Chinese translation method provided by the present application. The following describes, with reference to Figures 6 to 11, how the translation results of the Chinese translation method provided by the embodiments of the present application are displayed and used.
After App1 obtains, based on the user's input, the hand movement data and mouth movement data corresponding to the text information, App1 displays the translation result interface shown in Figure 6.
The translation result interface may include processing prompt information 630, which indicates that the translation of the text information has been completed. Optionally, the prompt information also tells the electronic device user how the translation result can be used.
The translation result interface may also include an overall display area 611, which shows the virtual character as a whole while signing. Optionally, when the user confirms that mouth movement data should be attached to one or more keywords, the overall display area shows the hand movements for the text information to be translated together with the mouth movements for the keywords to which mouth movement data is attached.
The translation result interface may also include a hand movement display area 613, which shows the details of the hand movements for the text information to be translated entered by the user. Optionally, the hand movement display area may include auxiliary lines and/or auxiliary text that help the user understand sign language details such as the movement trajectory of the fingers.
The translation result interface may also include a mouth movement display area 612, which shows the mouth movements of the words for which the user requested additional mouth movement data or for which additional mouth movement data was recommended. Optionally, the mouth movement display area may include auxiliary lines and/or auxiliary text that help the user understand mouth movement details such as the movement trajectory of the mouth.
The translation result interface may also include a text status display area 614, which shows the text corresponding to the hand movement and/or mouth movement currently being displayed. Optionally, the text status display area also includes a pinyin annotation area, which shows the pinyin annotation of that text.
In some embodiments, the text status display area displays the corresponding words in the order of the hand movements.
It should be noted here that the order of the hand movements may differ from the left-to-right reading order of people without hearing impairment.
For example, the sign language order of "I did not bring my mobile phone." is: mobile phone, I, bring, not. Therefore, if "mobile phone" is a keyword that requires additional mouth movements, the text status display area can display the sentence from left to right as "mobile phone", "I", "bring", "not". "Mobile phone" can be emphasized, for example by highlighting or bold type.
In other embodiments, the text status display area displays the text information in natural spoken-language order and highlights the word corresponding to each hand movement in the order of the hand movements.
For example, the default color of text in the text status display area is black, the text corresponding to the hand movement currently displayed is red, and the text corresponding to the mouth movement currently displayed is green and bold.
As another example, the sign language order of "I did not bring my mobile phone." is: mobile phone, I, bring, not. Therefore, if "mobile phone" is a keyword that requires additional mouth movements, the text status display area highlights the sentence as follows: "mobile phone" in bold green, then "I" in red, "bring" in red, and "not" in red.
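The highlighting scheme of this embodiment can be sketched as follows. The function name `highlight_frames`, the style labels, and the representation of the sentence as a list of English glosses are assumptions for illustration; the application does not prescribe a data structure.

```python
def highlight_frames(natural_order, sign_order, mouth_keywords):
    """For each signing step, return the style of every word in natural order.

    Styles follow the example above: black by default, red for the word whose
    hand movement is currently shown, green-bold when that word is also a
    keyword with attached mouth movements.
    """
    frames = []
    for current in sign_order:
        frame = []
        for word in natural_order:
            if word != current:
                style = "black"
            elif word in mouth_keywords:
                style = "green-bold"
            else:
                style = "red"
            frame.append((word, style))
        frames.append(frame)
    return frames


natural = ["I", "not", "bring", "mobile phone"]   # natural spoken-language order
signed = ["mobile phone", "I", "bring", "not"]    # order of the hand movements
for frame in highlight_frames(natural, signed, {"mobile phone"}):
    print(frame)
```

Each frame keeps the words in natural reading order, so only the highlighted word moves as the virtual character progresses through the hand movements.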
In some embodiments, the overall display area 611, the mouth movement display area 612, the hand movement display area 613, the text status display area 614, and the prompt information 630 together constitute the translation result interface.
In other embodiments, the overall display area 611, the mouth movement display area 612, the hand movement display area 613, and the text status display area 614 constitute the translation result display area 610 of the translation result interface; the translation result display area 610 is a part of the translation result interface.
Optionally, the translation result display area 610 may also include the prompt information 630 and an input area 620. The input area displays the text information to be translated that the user has entered, the input prompts issued by App1, the keywords requiring additional mouth movements that the user has entered, and so on. Optionally, the electronic device user can also re-enter keywords that require additional mouth movement data in the input area. When the user does so, App1, in response to the user's input, obtains the mouth movement data corresponding to the re-entered keywords and updates the overall display area, the mouth movement display area, the hand movement display area, and the text status display area in the translation result display area.
For example, after the user has determined that "mobile phone" in the text information "I did not bring my mobile phone." is a word that requires additional mouth movements, the user re-enters "I" in the input area shown in Figure 6. In response to the user's input, App1 determines "I" as a keyword that requires additional mouth movements.
The above describes, with reference to Figure 6, the basic composition of the translation result display interface. The following describes in detail, with reference to Figure 7, the functions that each component of the translation result display interface can have.
The electronic device user can single-click, double-click, or long-press a blank part of the translation result display area to trigger App1 to display the function options of the translation result display area.
The electronic device user can single-click, double-click, or long-press the overall display area, the mouth movement display area, the hand movement display area, or the text status display area to trigger App1 to display the function tab available for that area.
The function tab may include one or more of the following functions: "View in full screen", "Playback speed", "Insert into audio/video", "Hide", "Save", and "Share".
When the electronic device user selects the "View in full screen" function option, App1, in response to the user's operation, displays the overall display area, the mouth movement display area, the hand movement display area, or the text status display area in full screen.
When the electronic device user selects the "Playback speed" function option, App1, in response to the user's operation, displays a playback rate adjustment window in which the user can select or enter the desired playback rate. After obtaining the playback rate selected or entered by the user, App1 plays the content of the overall display area, the mouth movement display area, the hand movement display area, or the text status display area at the corresponding rate (slower or faster).
When the electronic device user selects the "Insert into audio/video" function option, App1, in response to the user's operation, inserts one or more of the overall display area, the mouth movement display area, the hand movement display area, and the text status display area into the corresponding audio or video. Optionally, after any of these areas is inserted into an audio file, App1 can save the modified audio file in a video file format.
When the electronic device user selects the "Hide" function option, App1, in response to the user's operation, hides the overall display area, the mouth movement display area, the hand movement display area, or the text status display area. When the user clicks a hidden area again, the function options for that area may include a "Show" function option. When the user selects "Show", App1, in response to the user's operation, displays the hidden area.
It should be noted here that if the text information to be translated does not involve mouth movements, or the user chooses not to attach mouth movements to any keyword, the mouth movement display area can be hidden by default.
When the electronic device user selects the "Save" function option, App1, in response to the user's operation, saves the data corresponding to the area selected by the user. Optionally, in response to the user's operation, App1 can also display a save prompt window, which asks the user whether to save the data corresponding to other related areas at the same time and obtains the user's instruction. For example, when the user chooses to save the data corresponding to other related areas at the same time, App1, in response to the user's operation, saves both the data corresponding to the selected area and the data corresponding to the related areas locally on the electronic device.
For example, when the user selects the "Save" function option in the overall display area, App1 displays the prompt: "Also save the data of the mouth movement display area, hand movement display area, and text status display area?" When the user chooses to save the mouth movement display area, App1, in response to the user's selection, saves the data corresponding to both the overall display area and the mouth movement display area.
When the electronic device user selects the "Share" function option, App1, in response to the user's operation, displays a sharing function control that includes one or more sharing channels. The user can select one or more sharing channels. In response to the user's selection, App1 shares the data corresponding to the area selected by the user through the selected sharing channel or channels.
Optionally, when the electronic device user selects the "Share" function option, App1, in response to the user's operation, can also display a sharing prompt window, which asks the user whether to share the data corresponding to other related areas at the same time and obtains the user's instruction. For example, when the user chooses to share the data corresponding to other related areas at the same time, App1, in response to the user's operation, treats both the data corresponding to the selected area and the data corresponding to the related areas as the data to be shared.
For example, when the user selects the "Share" function option in the overall display area, App1 displays the prompt: "Also share the data of the mouth movement display area, hand movement display area, and text status display area?" When the user chooses to share the mouth movement display area, App1, in response to the user's selection, shares the data corresponding to both the overall display area and the mouth movement display area.
The electronic device user can reopen the data corresponding to the different display areas saved locally on the electronic device in order to view, share, or edit it.
Figure 8 shows the interface of the resource library. The resource library classifies, sorts, and displays the data corresponding to the different display areas saved locally on the electronic device according to certain rules, which include classification rules and sorting rules.
The classification rules may include any of the following: area (overall display area, mouth movement display area, hand movement display area, and so on), time (the time the data was saved locally on the electronic device, for example, today, yesterday, or a week ago), or source (for example, from the current electronic device, from electronic devices under the same account, or from home electronic devices).
The sorting rules may include any of the following: time (for example, from oldest to newest or from newest to oldest), the order of the text information contained in the data (for example, alphabetical order of the first letter of the text information), or the order of the keywords with attached mouth movements (for example, the stroke order of the first character of the keyword).
The electronic device user can select the "Classification method" function option 801 of the resource library to set different classification methods for the locally saved data. The electronic device user can also select the "Sorting method" function option 802 of the resource library to set different sorting methods for the locally saved data.
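The classification and sorting rules of the resource library can be sketched as follows. The record type `SavedItem`, its field names, and the functions `classify` and `arrange` are hypothetical; stroke-order sorting of keywords is omitted because it would require a stroke-count table that the application does not describe.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class SavedItem:
    """One locally saved piece of display-area data (assumed fields)."""
    area: str       # "overall", "mouth", "hand", or "text"
    saved_on: date  # date the data was saved locally
    source: str     # e.g. "this device", "same account", "home device"
    text: str       # text information contained in the data


def classify(items, rule):
    """Group items by one classification rule: "area", "saved_on", or "source"."""
    groups = {}
    for item in items:
        groups.setdefault(getattr(item, rule), []).append(item)
    return groups


def arrange(items, rule="time", newest_first=True):
    """Sort items by save time or by the text they contain."""
    if rule == "time":
        return sorted(items, key=lambda i: i.saved_on, reverse=newest_first)
    if rule == "text":
        return sorted(items, key=lambda i: i.text.lower())
    raise ValueError(f"unsupported sorting rule: {rule}")


library = [
    SavedItem("hand", date(2023, 4, 1), "this device", "John went to the cinema."),
    SavedItem("mouth", date(2023, 4, 3), "same account", "Faucet"),
    SavedItem("hand", date(2023, 4, 2), "this device", "I did not bring my mobile phone."),
]
print(sorted(classify(library, "area")))
print([item.text for item in arrange(library, "time")])
```

Options 801 and 802 would then simply select which `rule` value is passed to `classify` and `arrange`.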
In some embodiments, the resource library also includes a search box 805, in which the electronic device user can enter characters, words, a time, an area, a source, or other content to quickly find the corresponding data.
In other embodiments, the resource library also includes a "Recycle Bin" function option 803, which the electronic device user can select to view the data that has been placed in the recycle bin. The recycle bin temporarily stores data deleted by the user. App1 erases from the storage medium of the electronic device any data that the user has not restored after a preset duration and any data whose deletion the user has confirmed in the recycle bin.
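The retention behaviour of the recycle bin can be sketched as follows. The function name `purge_recycle_bin`, the dictionary keys, and the 30-day default are assumptions; the application only states that a preset duration exists.

```python
from datetime import datetime, timedelta


def purge_recycle_bin(bin_items, now, retention=timedelta(days=30)):
    """Return (items kept in the bin, names of items to erase).

    An item is erased when it has stayed in the bin for at least the preset
    duration without being restored, or when the user confirmed its deletion.
    """
    kept, erased = [], []
    for item in bin_items:
        expired = now - item["deleted_at"] >= retention
        if expired or item.get("confirmed_delete", False):
            erased.append(item["name"])
        else:
            kept.append(item)
    return kept, erased


now = datetime(2023, 4, 30)
items = [
    {"name": "a", "deleted_at": datetime(2023, 3, 1)},
    {"name": "b", "deleted_at": datetime(2023, 4, 25)},
    {"name": "c", "deleted_at": datetime(2023, 4, 29), "confirmed_delete": True},
]
kept, erased = purge_recycle_bin(items, now)
print([i["name"] for i in kept], erased)
```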
In still other embodiments, the resource library also includes a "Share" function option 804, which the electronic device user can select to share one or more pieces of data in the resource library.
When the electronic device user selects and opens any data in the resource library, the electronic device, in response to the user's operation, can display the playback interface shown in Figure 9.
Similar to the translation result display area 610 shown in Figure 6, and depending on the type of data opened, the playback interface may include one or more of the overall display area, the mouth movement display area, the hand movement display area, and the text status display area. The function options corresponding to each area, as shown in Figure 6, can also be opened in these areas. For how the function options are triggered and what they do, refer to the description relating to Figure 6; to avoid repetition, the details are not repeated here.
In some embodiments, the playback interface may include a playback function control 901, which controls the start and stop of data playback and also shows the progress of the current playback.
Optionally, the playback function control may also contain a prompt control 902 for keywords with attached mouth movement data. The electronic device user can select (for example, click) the prompt control to view the mouth movements of the keyword directly.
In some embodiments, the playback interface may include a "Share" function option 903, which the electronic device user can select to share one or more kinds of the data being played in the playback interface.
The following describes the sharing process of translation data in detail with reference to Figure 10. It should be noted that the sharing process can be triggered through the sharing function shown in Figure 6, through the sharing function in the resource library interface shown in Figure 8, through the sharing function of the playback interface in Figure 9, or in other ways; the present application does not limit this.
Figure 10 shows the sharing interface, which includes sharing selection prompt information 1001, a shared data preview area 1002, and a sharing channel selection window 1003.
The sharing selection prompt information 1001 describes the data currently selected for sharing. It may include the number of data items to be shared and the types of data included among them.
For example, when the electronic device user selects the data corresponding to three hand movement display areas, four mouth movement display areas, and four text status display areas, the sharing selection prompt information may read: 11 items selected, including: data corresponding to the hand movement display area (hand movement), data corresponding to the mouth movement display area (mouth movement), and data corresponding to the text status display area (text).
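The count in this example (3 + 4 + 4 = 11) and the list of types can be derived mechanically from the selection. The following is a minimal sketch; the function name `selection_prompt`, the short type keys, and the exact wording of the prompt are assumptions for illustration.

```python
from collections import Counter

# Assumed short keys for the display-area types and their labels in the prompt.
LABELS = {
    "hand": "hand movement",
    "mouth": "mouth movement",
    "text": "text",
    "overall": "overall",
}


def selection_prompt(selected_kinds):
    """Build the sharing selection prompt from the kinds of the selected items."""
    counts = Counter(selected_kinds)
    kinds = [label for key, label in LABELS.items() if counts[key] > 0]
    return f"{sum(counts.values())} items selected, including: {', '.join(kinds)}"


print(selection_prompt(["hand"] * 3 + ["mouth"] * 4 + ["text"] * 4))
```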
The shared data preview area 1002 displays the data to be shared. For example, when the electronic device user chooses to share the data of the overall display area, the shared data preview area can display one frame of the overall display area as a preview of that data.
Optionally, the shared data preview area may also include a check box 1004, which the electronic device user can click to select or deselect data to be shared.
The sharing channel selection window 1003 displays the one or more available sharing channels and obtains the one or more sharing channels selected by the electronic device user. For example, as shown in Figure 10, the sharing channels may include sharing over Bluetooth, uploading to a cloud drive, or sending by email.
The above describes in detail, with reference to Figures 3 to 10 and taking App1 as an example, the translation method provided by the embodiments of the present application. One or more of the functions of App1 described above can be turned on or off through the settings function options of App1. The following describes the settings function of App1 with reference to Figure 11.
The settings function options may include an "Automatic keyword recognition and conversion" function option, through which the electronic device user can turn on or off App1's automatic recognition and conversion of keywords in the input text, video, and audio data during input. Here, a keyword is a keyword that requires additional mouth movement data.
The settings function options may also include a "Keyword auto-correction" function option, through which the electronic device user can turn on or off App1's prompting about and/or automatic correction of errors in the keywords entered by the user during input.
The settings function options may also include a "Translation acceleration" function option, through which the electronic device user can turn on a function that improves the efficiency of text information translation. How the efficiency of text information translation is improved is described in the embodiments below.
该设置功能选项中还可以包含“结果展示内容”功能选项,电子设备用户可以通过该功能选项来选择在翻译结果展示界面需要展示的内容。示例性的,电子设备用户在该功能选项中选择“手部动作”和“口部动作”,则在图6所示的界面中,整体展示区域和文字状态展示区域默认不显示,手部动作展示区域和口部动作展示区域默认显示。The setting function options may also include a "result display content" function option, through which electronic device users can select the content to be displayed on the translation results display interface. For example, if the electronic device user selects "hand movements" and "mouth movements" in this function option, then in the interface shown in Figure 6, the overall display area and the text status display area are not displayed by default, and the hand movements The display area and mouth movement display area are displayed by default.
该设置功能选项中还可以包含“资源库默认分类方式”功能选项,电子设备用户可以通过该功能选项来选择用户保存到电子设备本地的不同数据在资源库中的默认分类方式。This setting function option may also include a "resource library default classification method" function option, through which the electronic device user can select the default classification method in the resource library for different data that the user saves locally on the electronic device.
该设置功能选项中还可以包含“资源库默认排序方式”功能选项,电子设备用户可以通过该功能选项来选择用户保存到电子设备本地的不同数据在资源库中的默认排序方式。This setting function option may also include a "resource library default sorting method" function option, through which the electronic device user can select the default sorting method in the resource library for different data saved locally on the electronic device.
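The setting options described above could be modelled as a small configuration object. The following is a minimal sketch only: the class name, field names, default values, and area identifiers are hypothetical and do not come from this application.

```python
from dataclasses import dataclass, field

# Hypothetical identifiers for the four display areas of Figure 6.
ALL_AREAS = ("overall", "text_status", "hand", "mouth")


@dataclass
class App1Settings:
    """Illustrative container for the App1 setting options (all defaults are assumptions)."""
    keyword_auto_recognition: bool = True
    keyword_auto_correction: bool = True
    translation_acceleration: bool = False
    result_display_content: set = field(default_factory=lambda: set(ALL_AREAS))
    library_default_classification: str = "by_type"
    library_default_sorting: str = "by_date"


def visible_areas(settings: App1Settings) -> dict:
    """Map each display area to whether it is shown by default."""
    return {area: area in settings.result_display_content for area in ALL_AREAS}


# Selecting only hand and mouth movements hides the other two areas by default.
settings = App1Settings(result_display_content={"hand", "mouth"})
print(visible_areas(settings))
```

With the "hand" and "mouth" entries selected, the sketch reports the overall and text status areas as hidden, which mirrors the example given for the "result display content" option.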
The Chinese translation method provided by the embodiments of this application has been described above from the perspective of the user of the electronic device. The implementation flow inside the electronic device is described below with reference to Figure 12.
S1201. The electronic device obtains the text information to be translated.
The text information to be translated may be entered directly into the electronic device by the user, or may be recognized by the electronic device from data entered by the user, such as text, pictures, audio, or video. For specific methods of obtaining the text information to be translated, refer to the related descriptions of Figures 3 to 5.
In some embodiments, the electronic device also obtains keywords for which mouth movement data needs to be attached.
S1202. The electronic device sends a translation request to the server; correspondingly, the server receives the translation request.
The translation request is used to request the hand movement data corresponding to the text information to be translated. If the electronic device also obtained, in S1201, keywords that require mouth movement data, the translation request is also used to request the mouth movement data corresponding to those keywords.
In some embodiments, the translation request is used to request the mouth movement data corresponding to the keywords that require mouth movement data.
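As an illustration of S1201 and S1202, a translation request might be assembled as follows. This is a sketch under assumptions: the application does not specify a message format, so the JSON layout and the field names `text`, `want`, and `keywords` are hypothetical.

```python
import json


def build_translation_request(text, keywords=()):
    """Build a hypothetical translation request for S1202.

    Hand movement data is always requested for the text; mouth movement
    data is requested only for keywords that actually occur in the text.
    """
    request = {"text": text, "want": ["hand"]}
    present = [k for k in keywords if k in text]
    if present:
        request["keywords"] = present
        request["want"].append("mouth")
    return json.dumps(request, ensure_ascii=False)


print(build_translation_request("我去武汉", ["武汉"]))
```

Dropping keywords that do not occur in the text is a design choice of this sketch; it follows the idea that a keyword is part of the text information to be translated.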
S1203. The server sends hand movement data and/or mouth movement data; correspondingly, the electronic device receives the hand movement data and/or mouth movement data.
Based on the content of the translation request received in S1202, the server determines whether to send hand movement data, mouth movement data, or both to the electronic device.
Optionally, before sending the hand movement data and/or mouth movement data to the electronic device, the server first performs a text risk control check on the text information that the electronic device has asked to translate. The text risk control check determines whether the text information to be translated contains sensitive information, so that objectionable text can be filtered out.
In some embodiments, after determining that the text information to be translated passes the text risk control check, the server sends the hand movement data and/or mouth movement data directly to the electronic device.
In other embodiments, after determining that the text information to be translated passes the text risk control check, the server sends indication information to the electronic device, indicating that the text information passed the check. After receiving the indication information, the electronic device sends the server a text-to-sign-language request for the text that passed the check. After receiving the text-to-sign-language request, the server sends the hand movement data and/or mouth movement data to the electronic device.
In still other embodiments, if the server determines that the text information to be translated does not pass the text risk control check, the server sends indication information to the electronic device, indicating that the text information did not pass the check.
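The three server-side outcomes of the text risk control check can be sketched as one handler. Everything here is an assumption made for illustration: the term list, the status strings, and the `lookup` callback are hypothetical, and a real risk control check would be considerably more involved than a substring match.

```python
# Hypothetical list of sensitive terms; a placeholder for a real risk control service.
SENSITIVE_TERMS = {"forbidden"}


def passes_risk_check(text):
    """Return True if the text contains none of the sensitive terms."""
    return not any(term in text for term in SENSITIVE_TERMS)


def handle_translation_request(text, lookup, two_step=False):
    """Sketch of the server behaviour before S1203.

    lookup: callable that returns movement data for the text.
    two_step: if True, only report that the check passed and wait for a
              separate text-to-sign-language request from the device.
    """
    if not passes_risk_check(text):
        return {"status": "rejected"}      # indication: check failed
    if two_step:
        return {"status": "passed"}        # indication: check passed
    return {"status": "ok", "hand": lookup(text)}  # data sent directly


print(handle_translation_request("hello", lambda t: ["frame0", "frame1"]))
```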
The server may determine, from a hand movement database, the hand movement data corresponding to the text information to be translated, and send that hand movement data to the electronic device.
Similarly, the server may determine, from a mouth movement database, the mouth movement data corresponding to the keywords, and send that mouth movement data to the electronic device.
In some embodiments, the server includes a part-of-speech tagging module and a mouth movement database. The part-of-speech tagging module assigns a part-of-speech tag to each word in the text information received from the electronic device; the meanings of the tags are shown in Table 1. The mouth movement database stores the blend shape values corresponding to the mouth shapes of Chinese pinyin; these blend shape values can be used to display the mouth movements corresponding to keywords.
Specifically, a recording device first records video of a model's face forming the mouth shape of a single pinyin syllable, for example "wu". After recording, the blend shape values of each frame are saved in the mouth movement database.
Table 2 lists the Chinese pinyin for which mouth shape videos need to be recorded when building the mouth movement database. The mouth movement of a Chinese character is determined by its corresponding pinyin. Mouth shape videos are recorded for the different pinyin and then converted into data that can drive the mouth movements of a virtual character. When the server obtains keywords that require mouth movement data, it can call a mouth shape generation algorithm to obtain the data converted from the mouth shape videos of the pinyin pronunciation of those keywords, and send that data to the electronic device, so that the electronic device can use it to drive the virtual character to make the corresponding mouth movements.
Table 2. Chinese pinyin
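The lookup from keyword to mouth movement data described above can be sketched as follows. This is not the mouth shape generation algorithm of this application, which is not disclosed in detail. The database contents, the blend shape names (`jawOpen`, `mouthPucker`), the numeric values, and the character-to-pinyin table are all made-up placeholders.

```python
# Hypothetical mouth movement database: pinyin syllable -> per-frame blend shape values.
MOUTH_DB = {
    "wu": [{"jawOpen": 0.10, "mouthPucker": 0.80},
           {"jawOpen": 0.05, "mouthPucker": 0.90}],
    "han": [{"jawOpen": 0.60, "mouthPucker": 0.00},
            {"jawOpen": 0.30, "mouthPucker": 0.00}],
}

# Hypothetical character-to-pinyin table; a real system would use a full lexicon.
PINYIN = {"武": "wu", "汉": "han"}


def mouth_frames_for_keyword(keyword):
    """Concatenate the stored blend shape frames for each character's pinyin."""
    frames = []
    for char in keyword:
        syllable = PINYIN[char]           # raises KeyError for unknown characters
        frames.extend(MOUTH_DB[syllable])
    return frames


frames = mouth_frames_for_keyword("武汉")
print(len(frames), frames[0])
```

Simple concatenation ignores transitions between syllables; smoothing between adjacent mouth shapes would be a natural refinement but is outside what the text describes.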

In some embodiments, the server sends the mouth movement data and the hand movement data to the electronic device together.
In other embodiments, the server sends the hand movement data of different time frames in successive chunks, in the order of the hand movements.
In still other embodiments, the server sends the mouth movement data of different time frames in successive chunks, in the order of the mouth movements.
In still other embodiments, the hand movement data and the mouth movement data carry the same timestamps, and the server sends the hand movement data and mouth movement data of different time frames in chunks, in the order of the hand movements or mouth movements.
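The last variant, chunked sending of hand and mouth data that share timestamps, can be sketched with a generator. The frame representation as `(timestamp_ms, payload)` tuples and the fixed chunk length are assumptions made for this example.

```python
def chunk_by_time(hand_frames, mouth_frames, chunk_ms):
    """Group hand and mouth frames into time-ordered chunks.

    Each frame is a (timestamp_ms, payload) tuple. Frames whose timestamps
    fall in the same window of chunk_ms milliseconds are sent together, so
    the device can keep hand and mouth movements aligned.
    """
    chunks = {}
    for kind, frames in (("hand", hand_frames), ("mouth", mouth_frames)):
        for ts, payload in frames:
            index = ts // chunk_ms
            bucket = chunks.setdefault(index, {"hand": [], "mouth": []})
            bucket[kind].append((ts, payload))
    for index in sorted(chunks):
        yield index * chunk_ms, chunks[index]


hand = [(0, "h0"), (40, "h1"), (80, "h2")]
mouth = [(40, "m1"), (80, "m2")]
for start_ms, chunk in chunk_by_time(hand, mouth, chunk_ms=80):
    print(start_ms, chunk)
```

Because the chunks are yielded in time order, the device could begin displaying the first chunk while later chunks are still being received.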
S1204. The electronic device drives the virtual character.
Based on the hand movement data and/or mouth movement data received in S1203, the electronic device drives the virtual character model to display the hand movements corresponding to the text to be translated and/or the mouth movements of the keywords.
After obtaining the translation result corresponding to the text information, the user of the electronic device can save, share, edit, and configure the translation result. For the detailed procedure, refer to the related descriptions of Figures 6 to 11; for brevity, it is not repeated here.
Based on the same inventive concept, as shown in Figure 13, an embodiment of this application further provides a Chinese translation apparatus 1300. The Chinese translation apparatus 1300 includes an obtaining unit 1310 and a processing unit 1320. The obtaining unit is configured to obtain the information entered by the user of the electronic device in the embodiments shown in Figures 3 to 11. The processing unit is configured to perform the processing operations performed by the electronic device in the embodiments shown in Figures 3 to 11, such as obtaining the corresponding hand movement data based on the text information entered by the user.
Optionally, the Chinese translation apparatus may further include a communication unit 1330, configured to perform the communication and data transmission operations with the server that are performed by the electronic device in the embodiments shown in Figures 3 to 11.
As shown in Figure 14, an embodiment of this application further provides another Chinese translation apparatus 1400. The Chinese translation apparatus 1400 includes a processing unit 1410 and a communication unit 1420. The processing unit is configured to perform operations such as the text risk control check on the text information to be translated that is sent by the electronic device. The communication unit is configured to perform the communication and data transmission operations between the server and the electronic device in the embodiments shown in Figures 3 to 11.
Optionally, the Chinese translation apparatus may further include a storage unit 1430, configured to store one or more computer programs, hand movement data, mouth movement data, and the like.
As shown in Figure 15, an embodiment of this application further provides an electronic device 1500. The electronic device includes a processor 1510 and a memory 1520. The processor is configured to perform the processing operations performed by the electronic device in the embodiments shown in Figures 3 to 11, such as obtaining the corresponding hand movement data based on the text information entered by the user. The memory stores one or more computer programs that include instructions; when the instructions are executed by one or more processors, any of the Chinese translation methods described above is performed.
As shown in Figure 16, an embodiment of this application further provides a server 1600. The server includes a processor 1610 and a memory 1620. The processor is configured to perform operations such as the text risk control check on the text information to be translated that is sent by the electronic device. The memory stores one or more computer programs, hand movement data, mouth movement data, and the like. The one or more computer programs include instructions; when the instructions are executed by one or more processors, any of the Chinese translation methods described above is performed.
An embodiment of this application further provides a computer program product. The computer program product includes computer program code; when the computer program code runs on a computer, the computer implements the methods in the embodiments shown in Figures 3 to 12.
An embodiment of this application further provides a computer-readable storage medium. The computer-readable storage medium stores computer instructions; when the computer instructions run on a computer, the computer implements the methods in the embodiments shown in Figures 3 to 12.
An embodiment of this application further provides a chip, including a processor configured to read instructions stored in a memory. When the processor executes the instructions, the chip implements the methods in the embodiments shown in Figures 3 to 12.
A person of ordinary skill in the art will appreciate that the units and algorithm steps of the examples described in connection with the embodiments disclosed herein can be implemented in electronic hardware, or in a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and the design constraints of the technical solution. A skilled person may use different methods to implement the described functions for each specific application, but such implementations should not be considered beyond the scope of this application.
A person skilled in the art will clearly understand that, for convenience and brevity of description, the specific working processes of the systems, apparatuses, and units described above may be found in the corresponding processes of the foregoing method embodiments and are not repeated here.
In the several embodiments provided in this application, it should be understood that the disclosed systems, apparatuses, and methods may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into units is merely a division by logical function, and other divisions are possible in an actual implementation; for example, multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
The units described as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; that is, they may be located in one place or distributed across multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of an embodiment.
In addition, the functional units in the embodiments of this application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit.
If the functions are implemented in the form of software functional units and sold or used as a standalone product, they may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of this application in essence, or the part that contributes to the prior art, or a part of the technical solution, may be embodied in the form of a software product. The computer software product is stored in a storage medium and includes several instructions that cause a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of this application. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, read-only memory (ROM), random access memory (RAM), a magnetic disk, or an optical disc.
The foregoing is merely specific implementations of this application, but the protection scope of this application is not limited thereto. Any variation or replacement that a person skilled in the art could readily conceive of within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (18)

  1. A Chinese translation method, characterized by comprising:
    in response to a user input, an electronic device obtains text information, the text information including a keyword;
    the electronic device displays hand movements corresponding to the text information;
    the electronic device displays a mouth movement corresponding to the keyword.
  2. The method according to claim 1, characterized in that the keyword is determined according to the language habits of sign language users.
  3. The method according to claim 1 or 2, characterized in that the method further comprises: the electronic device does not display a mouth movement corresponding to an ordinary word, the text information includes the ordinary word, and the ordinary word is different from the keyword.
  4. The method according to any one of claims 1 to 3, characterized in that the electronic device displaying the mouth movement corresponding to the keyword comprises:
    the electronic device displays the mouth movement while displaying the hand movement corresponding to the keyword.
  5. The method according to any one of claims 1 to 4, characterized in that the keyword is a proper noun.
  6. The method according to any one of claims 1 to 5, characterized in that, before the mouth movement corresponding to the keyword is displayed, the method further comprises:
    the electronic device displays a first word, the text information includes the first word, and the first word is a word for which attaching a mouth movement is recommended;
    in response to a confirmation operation by the user, the electronic device determines that the first word is the keyword.
  7. The method according to any one of claims 1 to 6, characterized in that, before the mouth movement and hand movement corresponding to the keyword are displayed, the method further comprises:
    in response to a first input by the user, the electronic device obtains a second word, the second word being a word for which the user requests that a mouth movement be attached;
    if the text information includes the second word, the electronic device determines that the second word is the keyword;
    if the text information does not include the second word, the electronic device displays update request information, the update request information being used to indicate that the text information does not include the second word;
    in response to a second input by the user, the electronic device obtains an updated second word.
  8. The method according to claim 6, characterized in that the first word is determined based on the user's translation history, the translation history includes a second word entered by the user, and the second word is a word for which the user requests that a mouth movement be attached.
  9. The method according to any one of claims 1 to 8, characterized in that the mouth movement is determined according to the mouth shape of the pronunciation of the keyword's Chinese pinyin.
  10. The method according to claim 9, characterized in that the blend shape values corresponding to the pronunciation mouth shape are stored in a mouth movement database.
  11. The method according to any one of claims 1 to 10, characterized in that the hand movements include a first hand movement and a second hand movement, the first hand movement precedes the second hand movement, and the electronic device displaying the hand movements corresponding to the text information comprises:
    the electronic device receives first hand movement data from a server, the first hand movement data being used to display the first hand movement;
    while displaying the first hand movement, the electronic device receives second hand movement data from the server, the second hand movement data being used to display the second hand movement.
  12. The method according to any one of claims 1 to 11, characterized in that the mouth movement includes a first mouth movement and a second mouth movement, the first mouth movement precedes the second mouth movement, and the electronic device displaying the mouth movement corresponding to the keyword comprises:
    the electronic device receives first mouth movement data from a server, the first mouth movement data being used to display the first mouth movement;
    while displaying the first mouth movement, the electronic device receives second mouth movement data from the server, the second mouth movement data being used to display the second mouth movement.
  13. The method according to any one of claims 1 to 12, characterized in that, before the hand movements corresponding to the text information are displayed, the method further comprises:
    the electronic device receives a response message from a server, the response message being used to indicate that the text information does not contain sensitive information.
  14. An electronic device, characterized by comprising a processor and a memory, wherein the memory is configured to store program instructions, and the processor is configured to invoke the program instructions to perform the method according to any one of claims 1 to 13.
  15. A Chinese translation apparatus, characterized by comprising modules for implementing the method according to any one of claims 1 to 13.
  16. A computer program product, characterized in that the computer program product includes computer program code, and when the computer program code runs on a computer, the method according to any one of claims 1 to 13 is performed.
  17. A computer-readable storage medium, characterized in that a computer program is stored thereon, and when the computer program is executed by a computer, the method according to any one of claims 1 to 13 is implemented.
  18. A chip product, characterized by comprising a processor configured to read instructions stored in a memory, wherein when the processor executes the instructions, the chip implements the method according to any one of claims 1 to 13.
PCT/CN2023/086870 2022-04-15 2023-04-07 Chinese translation method and electronic device WO2023197949A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210396448.4 2022-04-15
CN202210396448.4A CN116932706A (en) 2022-04-15 2022-04-15 Chinese translation method and electronic equipment

Publications (1)

Publication Number Publication Date
WO2023197949A1 true WO2023197949A1 (en) 2023-10-19

Family

ID=88329034

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/086870 WO2023197949A1 (en) 2022-04-15 2023-04-07 Chinese translation method and electronic device

Country Status (2)

Country Link
CN (1) CN116932706A (en)
WO (1) WO2023197949A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20190333020A1 (en) * 2018-04-27 2019-10-31 Microsoft Technology Licensing, Llc Generating personalized smart responses
CN112287690A (en) * 2020-10-29 2021-01-29 中国科学技术大学 Sign language translation method based on conditional sentence generation and cross-modal rearrangement
CN113657101A (en) * 2021-07-20 2021-11-16 北京搜狗科技发展有限公司 Data processing method and device and data processing device
CN113835522A (en) * 2021-09-10 2021-12-24 阿里巴巴达摩院(杭州)科技有限公司 Sign language video generation, translation and customer service method, device and readable medium
CN113971837A (en) * 2021-10-27 2022-01-25 厦门大学 Knowledge-based multi-modal feature fusion dynamic graph neural sign language translation method

Also Published As

Publication number Publication date
CN116932706A (en) 2023-10-24

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23787593

Country of ref document: EP

Kind code of ref document: A1