WO2022062884A1 - Text input method, electronic device, and computer-readable storage medium - Google Patents

Text input method, electronic device, and computer-readable storage medium Download PDF

Info

Publication number
WO2022062884A1
WO2022062884A1 PCT/CN2021/116515 CN2021116515W WO2022062884A1 WO 2022062884 A1 WO2022062884 A1 WO 2022062884A1 CN 2021116515 W CN2021116515 W CN 2021116515W WO 2022062884 A1 WO2022062884 A1 WO 2022062884A1
Authority
WO
WIPO (PCT)
Prior art keywords
sequence
lip
text
user
initial
Prior art date
Application number
PCT/CN2021/116515
Other languages
French (fr)
Chinese (zh)
Inventor
刘浩
黄韬
胡粤麟
秦磊
张乐乐
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2022062884A1 publication Critical patent/WO2022062884A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/02Input arrangements using manually operated switches, e.g. using keyboards or dials
    • G06F3/023Arrangements for converting discrete items of information into a coded form, e.g. arrangements for interpreting keyboard generated codes as alphanumeric codes, operand codes or instruction codes
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/40Extraction of image or video features
    • G06V10/46Descriptors for shape, contour or point-related descriptors, e.g. scale invariant feature transform [SIFT] or bags of words [BoW]; Salient regional features
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/74Image or video pattern matching; Proximity measures in feature spaces
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/70Arrangements for image or video recognition or understanding using pattern recognition or machine learning
    • G06V10/77Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
    • G06V10/774Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V40/00Recognition of biometric, human-related or animal-related patterns in image or video data
    • G06V40/20Movements or behaviour, e.g. gesture recognition

Definitions

  • the present application relates to the field of artificial intelligence (Artificial Intelligence, AI), and in particular, to a text input method, an electronic device, and a computer-readable storage medium.
  • AI Artificial Intelligence
  • the mobile phone input method has added Wubi, handwriting, voice input and other methods to the initial pinyin input.
  • the pinyin input is still It is the most used input method by users, followed by handwriting input, and finally voice input. Since the speed of voice input is the fastest, more and more users choose the way of voice input. However, in some scenarios, such as in a noisy environment or in a private environment where it is inconvenient for users to make loud voices, the accuracy of speech recognition is low, resulting in a reduction in the accuracy of text input using the voice input method.
  • the present application provides a text input method, an electronic device, and a computer-readable storage medium, which can improve the accuracy of text input when it is inconvenient for a user to perform voice input.
  • a text input method comprising: when a text input operation by a user is detected, acquiring lip change information of the user and character information input by the user, wherein the lip change information includes the The lip feature sequence when the user speaks the text to be input; the text to be input by the user is determined according to the lip feature sequence and the character information.
  • the lip change information of the user and the character information input by the user are obtained, and the lip change information includes the lip feature sequence when the user speaks the text to be input.
  • the sequence of lip features and character information determine the text to be entered by the user. Since the text to be input determined according to the character information input by the user has a high accuracy rate, combining the lip feature sequence with the character information to determine the text to be recognized can improve text input when the user is inconvenient for voice input. 's accuracy.
  • the acquiring the lip change information of the user includes:
  • a camera is used to collect an image sequence including the lip region of the user, and lip features are extracted from each image of the image sequence to obtain a lip feature sequence.
  • the electronic device turns on the camera to capture the image of the lip area.
  • the image capture is completed.
  • the lip language input and the input of character information can be performed synchronously without affecting the efficiency of text input.
  • the lip feature sequence obtained from the image sequence can better reflect the change information of the user's lip shape, and the accuracy is high, and then the text to be input by the user is determined according to the lip feature sequence with high accuracy, which improves the text input. 's accuracy.
  • the acquiring the lip change information of the user includes:
  • Transmit a wireless signal and obtain a reflected signal sequence, wherein the reflected signal in the reflected signal sequence is the signal reflected back by the wireless signal after encountering an obstacle; determine the obstacle according to the reflected signal sequence, if the obstacle If the object is a lip, the lip feature is extracted from each reflected signal of the reflected signal sequence to obtain a lip feature sequence. Since the wireless signal has low requirements on the environment, for example, it is not affected by external light, the application range of the electronic device can be improved by using the wireless signal to obtain the lip feature sequence.
  • the character information includes the first letter of the text to be input by the user. Since the input speed is faster when only the first letter is input, determining the text to be input by the user according to the lip feature sequence and the first letter has higher input efficiency.
  • the determining the text to be input by the user according to the lip feature sequence and the character information includes:
  • determining the character sequence corresponding to the lip feature sequence correcting the character sequence according to the first initial to obtain at least one corrected candidate character sequence; determining the character sequence with the highest probability from the candidate character sequence
  • the candidate text sequence, the candidate text sequence with the highest probability is used as the text to be input by the user. Since multiple character sequences can be determined according to the lip feature sequence, there will be wrong character sequences in the determined multiple character sequences. Therefore, using the first letter to correct the character sequence can improve the accuracy of character input. It can reduce the number of candidate text sequences, thereby reducing the amount of calculation for determining the text to be input subsequently, and improving the calculation speed.
  • performing correction processing on the character sequence according to the first initial to obtain at least one corrected candidate character sequence including:
  • the determining the character sequence corresponding to the lip feature sequence includes:
  • the lip language recognition model Inputting the lip feature sequence into a trained lip language recognition model to obtain a text sequence output by the lip language recognition model, the lip language recognition model is used to recognize the text corresponding to the lip features, and the lip language recognition model It is trained based on lip features and the text corresponding to the lip features as training samples. Since the lip language recognition model is trained according to the training samples, it has universality. Therefore, the lip language recognition model is used to recognize the text sequence, which improves the accuracy of the output text sequence.
  • a text input device including:
  • an acquisition module configured to acquire the lip change information of the user and the character information input by the user when detecting the text input operation of the user, the lip change information including the user speaking the text to be input lip feature sequence at time;
  • a processing module configured to determine the text to be input by the user according to the lip feature sequence and the character information.
  • the obtaining module is specifically used for:
  • a camera is used to collect an image sequence including the lip region of the user, and lip features are extracted from each image of the image sequence to obtain a lip feature sequence.
  • the obtaining module is specifically used for:
  • An obstacle is determined according to the reflected signal sequence, and if the obstacle is a lip, a lip feature is extracted from each reflected signal of the reflected signal sequence to obtain a lip feature sequence.
  • the character information includes the first letter of the text to be input by the user.
  • the processing module includes:
  • a determining unit for determining the character sequence corresponding to the lip feature sequence
  • An error correction unit configured to perform correction processing on the character sequence according to the first initial to obtain at least one corrected candidate character sequence
  • An output unit configured to determine a candidate character sequence with the highest probability from the candidate character sequence, and use the candidate character sequence with the highest probability as the character to be input by the user.
  • the error correction unit is specifically used for:
  • the error correction unit is further configured to:
  • the determining unit is specifically configured to:
  • the lip language recognition model Inputting the lip feature sequence into a trained lip language recognition model to obtain a text sequence output by the lip language recognition model, the lip language recognition model is used to recognize the text corresponding to the lip features, and the lip language recognition model It is trained based on lip features and the text corresponding to the lip features as training samples.
  • an electronic device including a processor for executing a computer program stored in a memory, so as to implement the text input method according to the above-mentioned first aspect.
  • a computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements the text input method according to the first aspect.
  • a fifth aspect provides a computer program product that, when the computer program product runs on an electronic device, enables the electronic device to execute the text input method described in the first aspect.
  • FIG. 1 is a schematic flowchart of a text input method provided by an embodiment of the present application.
  • FIG. 2 is an application scenario diagram of the text input method provided by the embodiment of the present application.
  • FIG. 3 is a schematic diagram of a lip shape provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of a method for outputting a text sequence provided by an embodiment of the present application.
  • FIG. 7 is a schematic diagram of a radar wave signal provided by an embodiment of the present application.
  • FIG. 9 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
  • the term “if” may be contextually interpreted as “when” or “once” or “in response to determining” or “in response to detecting “.
  • the phrases “if it is determined” or “if the [described condition or event] is detected” may be interpreted, depending on the context, to mean “once it is determined” or “in response to the determination” or “once the [described condition or event] is detected. ]” or “in response to detection of the [described condition or event]”.
  • references in this specification to "one embodiment” or “some embodiments” and the like mean that a particular feature, structure or characteristic described in connection with the embodiment is included in one or more embodiments of the present application.
  • appearances of the phrases “in one embodiment,” “in some embodiments,” “in other embodiments,” “in other embodiments,” etc. in various places in this specification are not necessarily All refer to the same embodiment, but mean “one or more but not all embodiments” unless specifically emphasized otherwise.
  • the terms “including”, “including”, “having” and their variants mean “including but not limited to” unless specifically emphasized otherwise.
  • Existing input methods mainly include voice input method, pinyin input method and handwriting input method.
  • Pinyin input method and handwriting input method have higher input accuracy, but lower input efficiency.
  • the voice input method generally performs voice recognition first, and converts the recognized voice into corresponding text, which has high input efficiency. As a result, the accuracy of speech-to-text conversion is also reduced.
  • an embodiment of the present application provides a text input method.
  • a text input operation by a user is detected, the lip change information of the user and the character information input by the user are obtained, where the lip change information includes the user speaking the text to be input.
  • the lip feature sequence when writing is used to determine the text to be input by the user according to the lip feature sequence and character information. Since the accuracy of the characters to be input determined according to the character information input by the user is high, combining the lip feature sequence with the character information to determine the characters to be recognized can improve the accuracy of the input characters in the voice input mode.
  • the text input method provided in the embodiments of the present application is applied to electronic devices, and the electronic devices may be mobile phones, tablet computers, handheld computers, personal digital assistants (PDAs), speakers with screens, wearable devices, and the like.
  • PDAs personal digital assistants
  • a text input method provided by an embodiment of the present application includes:
  • the user can input character information by means of soft keyboard input or handwriting input.
  • the input character information can be text, letters corresponding to the text, or the first letter of the text, and the text can be Chinese, English or other foreign language.
  • the user may or may not make a sound when speaking the text to be input.
  • the lip feature sequence includes lip features at each moment in a continuous time.
  • the lip features are used to represent the lip shape of the user when speaking, and different lip shapes correspond to different pronunciations.
  • the electronic device collects an image sequence of the user's lip region through a camera, the image sequence is an image of the lip region at each moment in a continuous time, and then extracts lip features from each image in the image sequence , to obtain a sequence of lip features.
  • the electronic device transmits a wireless signal through a wireless sensor, and the wireless signal is reflected back after encountering an obstacle, and the reflected signal is a reflected signal.
  • the electronic device receives the reflected signal through the wireless sensor, obtains the reflected signal sequence according to the reflected signal at each moment in the continuous time, and determines whether the obstacle is a lip according to the reflected signal sequence.
  • a lip feature is extracted from each reflected signal in the reflected signal sequence to obtain a lip feature sequence.
  • the wireless signal may be a radar wave signal, an infrared signal, or an ultrasonic signal, or the like.
  • the electronic device acquires the user's lip change information after detecting that the user starts to input characters.
  • a preset input mode e.g, a multi-mode input mode
  • the user opens the text message editing page, and the electronic device opens the interface shown in Figure 2 after detecting the instruction of the multi-mode input mode.
  • the interface inputs the first letter of Chinese pinyin, and simultaneously performs lip language input.
  • the camera of the electronic device collects the lip image of the user, and extracts the lip feature from the lip image at each moment, that is, the lip feature sequence is obtained.
  • S102 Determine the text to be input by the user according to the lip feature sequence and the character information.
  • the lip shape when the lip features change, that is, when the lip shape changes, the pronunciation will also change. Therefore, according to the lip shape when the user speaks, the user's pronunciation can be determined, and the text spoken by the user can be further determined.
  • the corresponding lip shape may also be the same. For example, when the lip is in the shape as shown in (A) in Figure 3, the corresponding pronunciation may be the English letter "A” ", "E” or "I”, when the lips are in the shape shown in (B) in Figure 3, the corresponding pronunciation may be "Q" or "W” of English letters.
  • the same character may need a changed lip shape before it can be issued.
  • the English letters "M” and "L” need a changed lip shape before they can be issued. Therefore, according to the lip features of the user at each moment, it is possible to determine multiple characters or determine wrong characters. Then, according to the lip features at each moment, that is, the lip feature sequence, a variety of possible text sequences can be determined.
  • Each character information input by the user also corresponds to one or more characters.
  • Combining the lip feature sequence with the character information can correct the characters identified according to the lip feature sequence, or remove the wrong characters identified according to the lip feature sequence. , so as to obtain an accurate text sequence, that is, the text to be input by the user, so that the text input can be completed without acquiring the user's voice, and the accuracy of the text input is improved.
  • the electronic device may determine multiple candidate character sequences according to the lip feature sequence and character information, and after determining multiple candidate character sequences, select the one with the highest probability according to the semantics of each character sequence.
  • Candidate text sequence the candidate text sequence with the highest probability is used as the text to be input by the user.
  • the candidate text sequence may be input into the trained semantic recognition model, and the candidate text sequence with the highest probability output by the semantic recognition model is obtained.
  • the semantic recognition model is trained based on the text sequence and the text sequence with the highest probability as the training sample.
  • the combination of the lip feature sequence and the character information to determine the text to be recognized can improve the input accuracy in the voice input mode. accuracy of the text.
  • the character information input by the user includes the first letter of the text to be input by the user, wherein “first” is used to distinguish and describe the “initial letter", and “the first letter” refers to the first letter of the text It can be the first letter in Chinese Pinyin, such as "w” in “wen”, or the first letter of an English word, such as "G” in “Good".
  • the first letter includes the first letter of each of the words to be entered.
  • the electronic device After determining the text sequence according to the acquired lip feature sequence, the electronic device corrects the pinyin of the determined text sequence according to the first letter, and obtains the corrected pinyin. The resulting pinyin determines at least one corrected candidate character sequence.
  • the electronic device corrects the pinyin of the text sequence according to the first letter to obtain a corrected pinyin, and determines the corrected candidate text sequence according to the corrected pinyin. For example, if the character sequence determined according to the lip feature sequence is "Netherlands" and the first letter is "hn”, then the pinyin "helan” of the character sequence is corrected, and the corrected pinyin is "henan".
  • the candidate character sequences determined by the pronunciation of "henan” include “Henan", "Henan” and so on.
  • the electronic device corrects the text sequence according to the first letter, obtains a plurality of corrected pinyin, and determines the corrected candidate text sequence according to each corrected pinyin. For example, if the character sequence determined according to the lip feature sequence is "airplane" and the first letter is "hj”, then the pinyin “feiji” of the character sequence is corrected, and the corrected pinyin is "huijia”, "" huiji”, etc. For the pinyin "huijia”, the determined candidate text sequences are "home”, “exchange price”, etc. For the pinyin "huiji”, the determined candidate text sequences are "collection", "benefit” and so on.
  • the electronic device After obtaining the corrected candidate character sequence, the electronic device determines the candidate character sequence with the highest probability from the candidate character sequence, and uses the candidate character sequence with the highest probability as the character to be input by the user. For example, input the candidate text sequence into the semantic recognition model, and obtain the candidate text sequence with the highest probability output by the semantic recognition model. Since the initial letter input method has a low accuracy rate without historical association information, but the input speed is fast, the combination of the lip feature sequence and the initial letter improves the accuracy rate of text input, and the user uses a shorter The input of the initial letter can be completed in time, which improves the input efficiency.
  • the method for the electronic device to correct the character sequence determined according to the lip feature sequence according to the first initial letter is specifically: first extracting the second character sequence of each character in the determined character sequence Initials, where "second" is used to distinguish the description "initials".
  • the second letter can be directly extracted from the text sequence.
  • the replaced character sequence is used as a candidate character sequence.
  • the corresponding first letter refers to the first letter corresponding to the position of the second letter. For example, if the second letter is the first letter of the second letter in the text sequence, the corresponding first letter The second initial entered for the user.
  • the unmatched second letter is directly replaced with the first letter to obtain the replaced pinyin, and then the replaced at least one is determined according to the replaced pinyin.
  • a candidate text sequence For example, if the character sequence determined according to the lip feature sequence is "support”, the extracted second initial letter is "zc"; if the first initial letter is “zs”, there is a mismatched second initial letter” c", replace “c” with the first letter "s”, get the replaced pinyin as "zhishi”, and then determine the candidate word sequence according to the replaced pinyin as "knowledge”, “instruction”, etc.
  • the unmatched second letter is directly replaced with the first letter.
  • the replaced letter cannot form a text, or the replaced letter and the text sequence
  • the replaced letters are corrected, and at least one character sequence is obtained according to the correction result.
  • the character sequence determined according to the lip feature sequence is "wow", and the corresponding pronunciation is "wa”, then the extracted second initial letter is "w", if the first initial letter is "h”, then There is an unmatched second letter, if you directly replace "w” with “h”, the pinyin obtained according to the replaced letter is "ha”, because the pronunciation of "ha” and "wa” is quite different, therefore, it cannot be Direct replacement. Therefore, according to the preset correction rules, the replaced letters are corrected, and "ha” is corrected to "hua", so that the pronunciation of the corrected pinyin is close to the pronunciation of the text sequence, and then according to the corrected pinyin
  • the determined candidate character sequences are "flower", "hua” and so on.
  • an associated database is preset, and the associated database stores letters with associated relationships. Letters are letters that are pronounced close to each other and are easily confused.
  • the electronic device After determining that there is an unmatched second initial, the electronic device determines whether there is a letter associated with the corresponding first initial according to the associated letters stored in the associated database. If there is a letter associated with the corresponding first letter, replace the unmatched second letter with the corresponding first letter, and replace the unmatched second letter with the associated letter, and get the replacement Therefore, the range of the candidate character sequence can be expanded, and when the character to be input by the user is determined according to the candidate character sequence, the accuracy of character input is improved.
  • the candidate text sequence obtained according to the replaced pronunciation includes "driver”, “fourth level”, “actual”, “ timing” etc.
  • the lip feature sequence is obtained from the image sequence of the lip region of the user
  • the lip language recognition model is based on the lip feature sequence extracted from the lip region image sequence
  • the lip feature sequence The corresponding text sequences are obtained by training as training samples.
  • the specific flow of the text input method is shown in FIG. 4 , when the text input operation of the user is detected, the first letter of the text to be input input by the user is obtained, and the front camera on the electronic device is used to collect the face. Image, identify each face image collected, and identify the face in the image.
  • the image of the lip area is cut out from the face, and the images of the lip area at each moment in the continuous time form an image sequence, and the lips are extracted from each image in the image sequence. feature to get the lip feature sequence.
  • the lip language recognition model may be a Spatiotemporal Convolutional Neural Networks (STCNN) model. After the text sequence output by the lip language recognition model is obtained, the first letter is used to correct the text sequence, and the corrected candidate text sequence is obtained.
  • STCNN Spatiotemporal Convolutional Neural Networks
  • the candidate character sequence is determined according to the replaced pinyin.
  • the semantics of each candidate character sequence is determined, and according to the semantics of each candidate character sequence, the candidate character sequence with the highest probability is determined.
  • the determined word sequence can be compared with a pre-stored word database, and the candidate word sequence with the highest degree of matching with the word database can be selected as the candidate word sequence with the highest probability, and then the candidate word sequence with the highest probability can be selected as the candidate word sequence to be input by the user Text.
  • the replaced pinyin may also be input into a preset semantic recognition model to obtain a candidate character sequence with the highest probability output by the semantic recognition model.
  • the semantic recognition model is trained based on pronunciation and the text sequence with the highest probability as a training sample.
  • the lip feature sequence is obtained from the radar wave signal sequence reflected by the obstacle, and the lip language recognition model is based on the lip feature sequence extracted from the radar wave signal sequence, and the lip
  • the text corresponding to the partial feature sequence is obtained by training as the training sample.
  • the specific flow of the text input method is shown in FIG. 6 , when the text input operation of the user is detected, the first letter of the text to be input input by the user is obtained, and the radar in front of the electronic device is used to transmit radar waves. and receive the radar wave signal sequence reflected by the obstacle, that is, the reflected signal sequence, and the reflected signal sequence includes the reflected signals at each moment in the continuous time.
  • the radar can be a 60GHz millimeter-wave radar, and the radar antenna can be a single-transmission and multi-reception mode, or a multi-transmission and multi-reception mode. Since the delay of the reflected signal relative to the transmitted signal and the Doppler effect of the reflected signal can reflect the characteristics of the obstacle, including the size, shape, distance, speed and other information of the obstacle, it is possible to use the lip movement when speaking to the user.
  • the reflected signal sequence is processed to obtain the lip feature sequence of the user when speaking. Then input the lip feature sequence into the lip language recognition model to obtain the text sequence output by the lip language recognition model.
  • the radar wave may be modulated by a modulation format of Frequency Modulated Continuous Wave (FMCW), and the FMCW modulation format is modulated by a periodic sawtooth wave function.
  • FMCW Frequency Modulated Continuous Wave
  • the modulated radar wave as shown in Figure 7 is obtained, in which the reflected signal s2 is delayed relative to the transmitted signal s1, and the reflected signal and the transmitted signal have a frequency difference, and the reason for the frequency difference is an obstacle
  • the Doppler effect of objects in motion The reflected signal is multiplied by the transmitted signal, and the multiplied signal is low-pass filtered based on the analog signal to obtain the beat signal.
  • FFT fast Fourier transform
  • background removal such as filtering
  • the distance as shown in Figure 8 is obtained.
  • Range Doppler Map RDM
  • Each grid in the RDM corresponds to an element in the matrix.
  • the element in each column represents the distance of the obstacle
  • the element in each row represents the speed of the obstacle.
  • the speed and distance of the obstacle at the current moment can be determined according to the RDM.
  • the black area in Figure 8 represents the speed and distance of the obstacle at the current moment.
  • the arrival angle of the obstacle can be obtained, according to the arrival angle and distance of the obstacle, the spatial position information of the obstacle can be obtained, and then according to the spatial position information of the obstacle 3D reconstruction is performed on the obstacle to obtain the 3D depth map of the obstacle.
  • the 4D (3D space and velocity) vector signal of the obstacle can be obtained.
  • the electronic device after receiving the reflected signal, performs the above-mentioned processing on the reflected signal and the corresponding transmitted signal to obtain the four-dimensional vector signal of the lip, and the four-dimensional vector signal of the lip is used as the lip feature, according to
  • the lip feature sequence can be obtained from the four-dimensional vector signal of the lips at each moment. Then input the lip feature sequence into the lip language recognition model to obtain the text sequence output by the lip language recognition model. After the text sequence output by the lip language recognition model is obtained, the first letter is used to correct the text sequence, and the corrected candidate text sequence is obtained. After the candidate text sequences are obtained, the semantics of each candidate text sequence are determined, the candidate text sequence with the highest probability is determined according to the semantics of each candidate text sequence, and the candidate text sequence with the highest probability is used as the text to be input by the user.
  • the application range of the electronic device can be improved by using the radar wave signal to obtain the lip feature sequence.
  • FIG. 9 shows a schematic structural diagram of an electronic device 100 provided by an embodiment of the present application.
  • the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and Subscriber identification module (subscriber identification module, SIM) card interface 195 and so on.
  • SIM Subscriber identification module
  • the sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
  • the structures illustrated in the embodiments of the present invention do not constitute a specific limitation on the electronic device 100 .
  • the electronic device 100 may include more or less components than those shown, or some components may be combined, or some components may be separated, or different component arrangements.
  • the illustrated components may be implemented in hardware, software, or a combination of software and hardware.
  • the processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (neural-network processing unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
  • application processor application processor, AP
  • modem processor graphics processor
  • ISP image signal processor
  • controller video codec
  • digital signal processor digital signal processor
  • baseband processor baseband processor
  • neural-network processing unit neural-network processing unit
  • the controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
  • a memory may also be provided in the processor 110 for storing instructions and data.
  • the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and the latency of the processor 110 is reduced, thereby increasing the efficiency of the system.
  • the processor 110 may include one or more interfaces.
  • the interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
  • I2C integrated circuit
  • I2S integrated circuit built-in audio
  • PCM pulse code modulation
  • PCM pulse code modulation
  • UART universal asynchronous transceiver
  • MIPI mobile industry processor interface
  • GPIO general-purpose input/output
  • SIM subscriber identity module
  • USB universal serial bus
  • the I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL).
  • the processor 110 may contain multiple sets of I2C buses.
  • the processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces.
  • the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
  • the I2S interface can be used for audio communication.
  • the processor 110 may contain multiple sets of I2S buses.
  • the processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 .
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
  • the PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals.
  • the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
  • the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
  • the UART interface is a universal serial data bus used for asynchronous communication.
  • the bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
  • a UART interface is typically used to connect the processor 110 with the wireless communication module 160 .
  • the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
  • the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
  • the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 .
  • MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc.
  • the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 .
  • the processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
  • the GPIO interface can be configured by software.
  • the GPIO interface can be configured as a control signal or as a data signal.
  • the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like.
  • the GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
  • the USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like.
  • the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones.
  • the interface can also be used to connect other electronic devices, such as AR devices.
  • the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 .
  • the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
  • the charging management module 140 is used to receive charging input from the charger.
  • the charger may be a wireless charger or a wired charger.
  • the charging management module 140 may receive charging input from the wired charger through the USB interface 130 .
  • the charging management module 140 may receive wireless charging input through a wireless charging coil of the electronic device 100 . While the charging management module 140 charges the battery 142 , it can also supply power to the electronic device through the power management module 141 .
  • the power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 .
  • the power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, the internal memory 121, the display screen 194, the camera 193, and the wireless communication module 160.
  • the power management module 141 can also be used to monitor parameters such as battery capacity, battery cycle times, battery health status (leakage, impedance).
  • the power management module 141 may also be provided in the processor 110 .
  • the power management module 141 and the charging management module 140 may also be provided in the same device.
  • the wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization.
  • the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
  • the mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 .
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like.
  • the mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation.
  • the mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 .
  • at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 .
  • at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
  • the modem processor may include a modulator and a demodulator.
  • the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal.
  • the demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
  • the low frequency baseband signal is processed by the baseband processor and passed to the application processor.
  • the application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 .
  • the modem processor may be a stand-alone device.
  • the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
  • the wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), global navigation satellites Wireless communication solutions such as global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared technology (IR).
  • WLAN wireless local area networks
  • BT Bluetooth
  • GNSS global navigation satellite system
  • FM frequency modulation
  • NFC near field communication
  • IR infrared technology
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
  • the wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves for radiation through the antenna 2 .
  • the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology.
  • the wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc.
  • the GNSS may include a global positioning system (global positioning system, GPS), a global navigation satellite system (GLONASS), a Beidou navigation satellite system (BDS), a quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
  • GPS global positioning system
  • GLONASS global navigation satellite system
  • BDS Beidou navigation satellite system
  • QZSS quasi-zenith satellite system
  • SBAS satellite based augmentation systems
  • the electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like.
  • the GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor.
  • the GPU is used to perform mathematical and geometric calculations for graphics rendering.
  • Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
  • Display screen 194 is used to display images, videos, and the like.
  • Display screen 194 includes a display panel.
  • the display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light).
  • LED diode AMOLED
  • flexible light-emitting diode flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diode (quantum dot light emitting diodes, QLED) and so on.
  • the electronic device 100 may include one or N display screens 194 , where N is a positive integer greater than one.
  • the electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
  • the ISP is used to process the data fed back by the camera 193 .
  • the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye.
  • ISP can also perform algorithm optimization on image noise, brightness, and skin tone.
  • ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
  • the ISP may be provided in the camera 193 .
  • Camera 193 is used to capture still images or video.
  • the object is projected through the lens to generate an optical image onto the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • CMOS complementary metal-oxide-semiconductor
  • the photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signal to the DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other formats of image signals.
  • the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
  • the camera 193 is used to capture the face image of the user when the user's text input operation is detected.
  • a digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
  • Video codecs are used to compress or decompress digital video.
  • the electronic device 100 may support one or more video codecs.
  • the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
  • MPEG Moving Picture Experts Group
  • MPEG2 moving picture experts group
  • MPEG3 MPEG4
  • MPEG4 Moving Picture Experts Group
  • the NPU is a neural-network (NN) computing processor.
  • NN neural-network
  • Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
  • the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100 .
  • the external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example to save files like music, video etc in external memory card.
  • Internal memory 121 may be used to store computer executable program code, which includes instructions.
  • the internal memory 121 may include a storage program area and a storage data area.
  • the storage program area can store an operating system, an application program required for at least one function (such as a sound playback function, an image playback function, etc.), and the like.
  • the storage data area may store data (such as audio data, phone book, etc.) created during the use of the electronic device 100 and the like.
  • the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (UFS), and the like.
  • the processor 110 executes various functional applications and data processing of the electronic device 100 by executing instructions stored in the internal memory 121 and/or instructions stored in a memory provided in the processor.
  • the electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
  • the audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
  • Speaker 170A also referred to as a "speaker" is used to convert audio electrical signals into sound signals.
  • the electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
  • the receiver 170B also referred to as "earpiece" is used to convert audio electrical signals into sound signals.
  • the voice can be answered by placing the receiver 170B close to the human ear.
  • the microphone 170C also called “microphone” or “microphone” is used to convert sound signals into electrical signals.
  • the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C.
  • the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
  • the earphone jack 170D is used to connect wired earphones.
  • the earphone interface 170D may be the USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • OMTP open mobile terminal platform
  • CTIA cellular telecommunications industry association of the USA
  • the pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals.
  • the pressure sensor 180A may be provided on the display screen 194 .
  • the capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes.
  • the electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
  • the electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A.
  • touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
  • the gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 .
  • the angular velocity of electronic device 100 about three axes ie, x, y, and z axes
  • the gyro sensor 180B can be used for image stabilization.
  • the gyro sensor 180B detects the angle at which the electronic device 100 shakes, calculates the distance that the lens module needs to compensate for according to the angle, and allows the lens to counteract the shake of the electronic device 100 through reverse motion to achieve anti-shake.
  • the gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
  • the air pressure sensor 180C is used to measure air pressure.
  • the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
  • the magnetic sensor 180D includes a Hall sensor.
  • the electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D.
  • the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
  • the acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes).
  • the magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
  • the electronic device 100 can measure the distance through radar, infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing. In some embodiments, the electronic device 100 may also measure the distance and speed of obstacles using the distance sensor 180F.
  • Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes.
  • the light emitting diodes may be infrared light emitting diodes.
  • the electronic device 100 emits infrared light to the outside through the light emitting diode.
  • Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 .
  • the electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power.
  • the proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
  • the ambient light sensor 180L is used to sense ambient light brightness.
  • the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness.
  • the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
  • the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touch.
  • the fingerprint sensor 180H is used to collect fingerprints.
  • the electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, accessing application locks, taking pictures with fingerprints, answering incoming calls with fingerprints, and the like.
  • the temperature sensor 180J is used to detect the temperature.
  • the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection.
  • the electronic device 100 when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 caused by the low temperature.
  • the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
  • Touch sensor 180K also called “touch device”.
  • the touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”.
  • the touch sensor 180K is used to detect a touch operation on or near it.
  • the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
  • Visual output related to touch operations may be provided through display screen 194 .
  • the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
  • the bone conduction sensor 180M can acquire vibration signals.
  • the bone conduction sensor 180M can acquire the vibration signal of the vibrating bone mass of the human voice.
  • the bone conduction sensor 180M can also contact the pulse of the human body and receive the blood pressure beating signal.
  • the bone conduction sensor 180M can also be disposed in the earphone, combined with the bone conduction earphone.
  • the audio module 170 can analyze the voice signal based on the vibration signal of the vocal vibration bone block obtained by the bone conduction sensor 180M, so as to realize the voice function.
  • the application processor can analyze the heart rate information based on the blood pressure beat signal obtained by the bone conduction sensor 180M, and realize the function of heart rate detection.
  • the keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key.
  • the electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
  • Motor 191 can generate vibrating cues.
  • the motor 191 can be used for vibrating alerts for incoming calls, and can also be used for touch vibration feedback.
  • touch operations acting on different applications can correspond to different vibration feedback effects.
  • the motor 191 can also correspond to different vibration feedback effects for touch operations on different areas of the display screen 194 .
  • Different application scenarios for example: time reminder, receiving information, alarm clock, games, etc.
  • the touch vibration feedback effect can also support customization.
  • the indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
  • the SIM card interface 195 is used to connect a SIM card.
  • the SIM card can be contacted and separated from the electronic device 100 by inserting into the SIM card interface 195 or pulling out from the SIM card interface 195 .
  • the electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
  • the SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card and so on. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different.
  • the SIM card interface 195 can also be compatible with different types of SIM cards.
  • the SIM card interface 195 is also compatible with external memory cards.
  • the electronic device 100 interacts with the network through the SIM card to implement functions such as call and data communication.
  • the electronic device 100 employs an eSIM, ie: an embedded SIM card.
  • the eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100 .
  • the integrated unit if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium.
  • all or part of the processes in the methods of the above embodiments can be implemented by a computer program to instruct the relevant hardware.
  • the computer program can be stored in a computer-readable storage medium, and the computer program When executed by a processor, the steps of each of the above method embodiments can be implemented.
  • the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file or some intermediate form, and the like.
  • the computer-readable medium may include at least: any entity or device capable of carrying the computer program code to the photographing device/electronic device, recording medium, computer memory, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electrical carrier signals, telecommunication signals, and software distribution media.
  • ROM read-only memory
  • RAM random access memory
  • electrical carrier signals telecommunication signals
  • software distribution media For example, U disk, mobile hard disk, disk or CD, etc.
  • the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • the disclosed apparatus/network device and method may be implemented in other manners.
  • the apparatus/network device embodiments described above are only illustrative.
  • the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods, such as multiple units. Or components may be combined or may be integrated into another system, or some features may be omitted, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.

Abstract

The present application relates to the field of artificial intelligence (AI), and provides a text input method, an electronic device, and a computer-readable storage medium. The text input method comprises: upon detecting a text input operation of a user, acquiring lip change information of the user and character information inputted by the user, the lip change information comprising a lip feature sequence when the user speaks a text to be inputted; and determining, according to the lip feature sequence and the character information, the text to be inputted by the user. Because the accuracy of the text to be inputted that is determined according to the character information inputted by the user is relatively high, combining the lip feature sequence with the character information to determine a text to be recognized can improve the accuracy of text input in the case that the user is inconvenient for voice input.

Description

文字输入方法、电子设备及计算机可读存储介质Character input method, electronic device, and computer-readable storage medium
本申请要求于2020年09月27日提交国家知识产权局、申请号为202011036037.1、申请名称为“文字输入方法、电子设备及计算机可读存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202011036037.1 and the application name "Text input method, electronic device and computer-readable storage medium" submitted to the State Intellectual Property Office on September 27, 2020, the entire content of which is approved by Reference is incorporated in this application.
技术领域technical field
本申请涉及人工智能(Artificial Intelligence,AI)领域,尤其涉及一种文字输入方法、电子设备及计算机可读存储介质。The present application relates to the field of artificial intelligence (Artificial Intelligence, AI), and in particular, to a text input method, an electronic device, and a computer-readable storage medium.
背景技术Background technique
移动互联网用户以及市场规模的不断扩大,为手机输入法的快速发展奠定了基础,手机输入法在最初的拼音输入的基础上增加了五笔、手写、语音输入等多种方式,目前,拼音输入仍是用户使用最多的输入方式,其次是手写输入,最后是语音输入。由于语音输入的速度最快,越来越多的用户选择语音输入的方式。但是在一些场景下,例如在嘈杂环境下、用户不方便发出较大声音的隐私环境下,语音识别的准确率较低,从而导致采用语音输入方式进行文字输入的准确率降低。The continuous expansion of mobile Internet users and the market scale has laid the foundation for the rapid development of the mobile phone input method. The mobile phone input method has added Wubi, handwriting, voice input and other methods to the initial pinyin input. At present, the pinyin input is still It is the most used input method by users, followed by handwriting input, and finally voice input. Since the speed of voice input is the fastest, more and more users choose the way of voice input. However, in some scenarios, such as in a noisy environment or in a private environment where it is inconvenient for users to make loud voices, the accuracy of speech recognition is low, resulting in a reduction in the accuracy of text input using the voice input method.
发明内容SUMMARY OF THE INVENTION
本申请提供一种文字输入方法、电子设备及计算机可读存储介质,可以在用户不方便进行语音输入的情况下,提高文字输入的准确率。The present application provides a text input method, an electronic device, and a computer-readable storage medium, which can improve the accuracy of text input when it is inconvenient for a user to perform voice input.
为达到上述目的,本申请采用如下技术方案:To achieve the above object, the application adopts the following technical solutions:
第一方面,提供一种文字输入方法,包括:在检测到用户的文字输入操作时,获取所述用户的唇部变化信息以及所述用户输入的字符信息,所述唇部变化信息包括所述用户在说出待输入的文字时的唇部特征序列;根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字。In a first aspect, a text input method is provided, comprising: when a text input operation by a user is detected, acquiring lip change information of the user and character information input by the user, wherein the lip change information includes the The lip feature sequence when the user speaks the text to be input; the text to be input by the user is determined according to the lip feature sequence and the character information.
上述实施例中,在检测到用户的文字输入操作时,获取用户的唇部变化信息以及用户输入的字符信息,唇部变化信息包括用户在说出待输入的文字时的唇部特征序列,根据唇部特征序列以及字符信息确定用户待输入的文字。由于根据用户输入的字符信息确定出的待输入的文字的准确率较高,将唇部特征序列与字符信息结合确定待识别的文字,可以在用户不方便进行语音输入的情况下,提高文字输入的准确率。In the above-mentioned embodiment, when the user's text input operation is detected, the lip change information of the user and the character information input by the user are obtained, and the lip change information includes the lip feature sequence when the user speaks the text to be input. The sequence of lip features and character information determine the text to be entered by the user. Since the text to be input determined according to the character information input by the user has a high accuracy rate, combining the lip feature sequence with the character information to determine the text to be recognized can improve text input when the user is inconvenient for voice input. 's accuracy.
在第一方面的一种可能的实现方式中,所述获取所述用户的唇部变化信息,包括:In a possible implementation manner of the first aspect, the acquiring the lip change information of the user includes:
通过摄像头采集包含所述用户的嘴唇区域的图像序列,从所述图像序列的每张图像中提取唇部特征,获得唇部特征序列。在用户进行文字输入时,电子设备即开启摄像头采集嘴唇区域的图像,在用户完成唇语输入后,即完成图像采集,唇语输入和字符信息的输入可以同步进行,不影响文字输入效率。从图像序列中获取的唇部特征序列可以较好的反映用户唇部形状的变化信息,准确率较高,再根据准确率较高的唇部特征序列确定用户待输入的文字,提高了文字输入的准确率。A camera is used to collect an image sequence including the lip region of the user, and lip features are extracted from each image of the image sequence to obtain a lip feature sequence. When the user enters text, the electronic device turns on the camera to capture the image of the lip area. After the user completes the lip language input, the image capture is completed. The lip language input and the input of character information can be performed synchronously without affecting the efficiency of text input. The lip feature sequence obtained from the image sequence can better reflect the change information of the user's lip shape, and the accuracy is high, and then the text to be input by the user is determined according to the lip feature sequence with high accuracy, which improves the text input. 's accuracy.
在第一方面的一种可能的实现方式中,所述获取所述用户的唇部变化信息,包括:In a possible implementation manner of the first aspect, the acquiring the lip change information of the user includes:
发射无线信号,并获取反射信号序列,其中所述反射信号序列中的反射信号是所述无线信号在碰到障碍物后反射回来的信号;根据所述反射信号序列确定障碍物,若所述 障碍物为嘴唇,则从所述反射信号序列的每个反射信号中提取唇部特征,获得唇部特征序列。由于无线信号对环境的要求较低,例如,不受外界光线的影响,因此,采用无线信号获得唇部特征序列,可以提高电子设备的应用范围。Transmit a wireless signal, and obtain a reflected signal sequence, wherein the reflected signal in the reflected signal sequence is the signal reflected back by the wireless signal after encountering an obstacle; determine the obstacle according to the reflected signal sequence, if the obstacle If the object is a lip, the lip feature is extracted from each reflected signal of the reflected signal sequence to obtain a lip feature sequence. Since the wireless signal has low requirements on the environment, for example, it is not affected by external light, the application range of the electronic device can be improved by using the wireless signal to obtain the lip feature sequence.
在第一方面的一种可能的实现方式中,所述字符信息包括所述用户待输入的文字的第一首字母。由于只输入首字母时,输入速度较快,因此,根据唇部特征序列以及首字母确定用户待输入的文字具有较高的输入效率。In a possible implementation manner of the first aspect, the character information includes the first letter of the text to be input by the user. Since the input speed is faster when only the first letter is input, determining the text to be input by the user according to the lip feature sequence and the first letter has higher input efficiency.
在第一方面的一种可能的实现方式中,所述根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字,包括:In a possible implementation manner of the first aspect, the determining the text to be input by the user according to the lip feature sequence and the character information includes:
确定所述唇部特征序列对应的文字序列;根据所述第一首字母,对所述文字序列进行纠正处理,获得至少一个纠正后的候选文字序列;从所述候选文字序列中确定概率最大的候选文字序列,将所述概率最大的候选文字序列作为所述用户待输入的文字。由于根据唇部特征序列可以确定出多个文字序列,确定出的多个文字序列中,会存在错误的文字序列,因此,采用第一首字母对文字序列进行纠错,可以提高文字输入的准确率,且可以缩小候选文字序列的数量,进而减少后续确定待输入文字的计算量,提高计算速度。determining the character sequence corresponding to the lip feature sequence; correcting the character sequence according to the first initial to obtain at least one corrected candidate character sequence; determining the character sequence with the highest probability from the candidate character sequence The candidate text sequence, the candidate text sequence with the highest probability is used as the text to be input by the user. Since multiple character sequences can be determined according to the lip feature sequence, there will be wrong character sequences in the determined multiple character sequences. Therefore, using the first letter to correct the character sequence can improve the accuracy of character input. It can reduce the number of candidate text sequences, thereby reducing the amount of calculation for determining the text to be input subsequently, and improving the calculation speed.
在第一方面的一种可能的实现方式中,所述根据所述第一首字母,对所述文字序列进行纠正处理,获得至少一个纠正后的候选文字序列,包括:In a possible implementation manner of the first aspect, performing correction processing on the character sequence according to the first initial to obtain at least one corrected candidate character sequence, including:
提取所述文字序列中每个文字的第二首字母;将提取的所述第二首字母与所述第一首字母进行匹配;若存在不匹配的第二首字母,则将所述不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,将所述替换后的文字序列作为所述候选文字序列。采用首字母替换的方法,可以纠正根据唇部特征序列确定出的文字序列,提高文字输入效率。Extracting the second initial of each character in the character sequence; matching the extracted second initial with the first initial; if there is an unmatched second initial, the unmatched The second initial letter of is replaced with the corresponding first initial letter to obtain at least one replaced character sequence, and the replaced character sequence is used as the candidate character sequence. By adopting the method of initial letter replacement, the character sequence determined according to the lip feature sequence can be corrected, and the character input efficiency can be improved.
在第一方面的一种可能的实现方式中,所述若存在不匹配的第二首字母,则将所述不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,包括:In a possible implementation manner of the first aspect, if there is an unmatched second initial, replace the unmatched second initial with a corresponding first initial to obtain at least the replaced first letter. A literal sequence consisting of:
若存在不匹配的第二首字母,且存在与对应的第一首字母的关联的字母,则将所述不匹配的第二首字母替换为对应的第一首字母,以及将所述不匹配的第二首字母替换为关联的字母,得到替换后的至少一个文字序列。由于用户输入的首字母也可能存在输入错误的情况,因此,根据与第一首字母关联的字母确定替换后的文字序列,可以防止丢失有用信息,提高文字输入的准确率。If there is an unmatched second initial, and there is a letter associated with the corresponding first initial, then replace the unmatched second initial with the corresponding first initial, and replace the unmatched second initial with the corresponding first initial The second initial of is replaced with the associated letter, resulting in at least one literal sequence after the replacement. Since the first letter input by the user may also have input errors, determining the replaced text sequence according to the letters associated with the first initial can prevent loss of useful information and improve the accuracy of text input.
在第一方面的一种可能的实现方式中,所述确定所述唇部特征序列对应的文字序列,包括:In a possible implementation manner of the first aspect, the determining the character sequence corresponding to the lip feature sequence includes:
将所述唇部特征序列输入训练后的唇语识别模型,获得所述唇语识别模型输出的文字序列,所述唇语识别模型用于识别唇部特征对应的文字,所述唇语识别模型是基于唇部特征,以及唇部特征对应的文字作为训练样本训练得到的。由于唇语识别模型是根据训练样本训练得到的,具有通用性,因此,采用唇语识别模型识别文字序列,提高了输出的文字序列的准确度。Inputting the lip feature sequence into a trained lip language recognition model to obtain a text sequence output by the lip language recognition model, the lip language recognition model is used to recognize the text corresponding to the lip features, and the lip language recognition model It is trained based on lip features and the text corresponding to the lip features as training samples. Since the lip language recognition model is trained according to the training samples, it has universality. Therefore, the lip language recognition model is used to recognize the text sequence, which improves the accuracy of the output text sequence.
第二方面,提供一种文字输入装置,包括:In a second aspect, a text input device is provided, including:
获取模块,用于在检测到用户的文字输入操作时,获取所述用户的唇部变化信息以及所述用户输入的字符信息,所述唇部变化信息包括所述用户在说出待输入的文字时的唇部特征序列;an acquisition module, configured to acquire the lip change information of the user and the character information input by the user when detecting the text input operation of the user, the lip change information including the user speaking the text to be input lip feature sequence at time;
处理模块,用于根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字。and a processing module, configured to determine the text to be input by the user according to the lip feature sequence and the character information.
在第二方面的一种可能的实现方式中,所述获取模块具体用于:In a possible implementation manner of the second aspect, the obtaining module is specifically used for:
通过摄像头采集包含所述用户的嘴唇区域的图像序列,从所述图像序列的每张图像中提取唇部特征,获得唇部特征序列。A camera is used to collect an image sequence including the lip region of the user, and lip features are extracted from each image of the image sequence to obtain a lip feature sequence.
在第二方面的一种可能的实现方式中,所述获取模块具体用于:In a possible implementation manner of the second aspect, the obtaining module is specifically used for:
发射无线信号,并获取反射信号序列,其中所述反射信号序列中的反射信号是所述无线信号在碰到障碍物后反射回来的信号;transmitting a wireless signal, and acquiring a reflected signal sequence, wherein the reflected signal in the reflected signal sequence is a signal reflected back by the wireless signal after encountering an obstacle;
根据所述反射信号序列确定障碍物,若所述障碍物为嘴唇,则从所述反射信号序列的每个反射信号中提取唇部特征,获得唇部特征序列。An obstacle is determined according to the reflected signal sequence, and if the obstacle is a lip, a lip feature is extracted from each reflected signal of the reflected signal sequence to obtain a lip feature sequence.
在第二方面的一种可能的实现方式中,所述字符信息包括所述用户待输入的文字的第一首字母。In a possible implementation manner of the second aspect, the character information includes the first letter of the text to be input by the user.
在第二方面的一种可能的实现方式中,所述处理模块包括:In a possible implementation manner of the second aspect, the processing module includes:
确定单元,用于确定所述唇部特征序列对应的文字序列;a determining unit for determining the character sequence corresponding to the lip feature sequence;
纠错单元,用于根据所述第一首字母,对所述文字序列进行纠正处理,获得至少一个纠正后的候选文字序列;An error correction unit, configured to perform correction processing on the character sequence according to the first initial to obtain at least one corrected candidate character sequence;
输出单元,用于从所述候选文字序列中确定概率最大的候选文字序列,将所述概率最大的候选文字序列作为所述用户待输入的文字。An output unit, configured to determine a candidate character sequence with the highest probability from the candidate character sequence, and use the candidate character sequence with the highest probability as the character to be input by the user.
在第二方面的一种可能的实现方式中,所述纠错单元具体用于:In a possible implementation manner of the second aspect, the error correction unit is specifically used for:
提取所述文字序列中每个文字的第二首字母;extracting the second initial of each character in the sequence of characters;
将提取的所述第二首字母与所述第一首字母进行匹配;matching the extracted second initial with the first initial;
若存在不匹配的第二首字母,则将所述不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,将所述替换后的文字序列作为所述候选文字序列。If there is an unmatched second initial letter, replace the unmatched second initial letter with the corresponding first initial letter to obtain at least one replaced text sequence, and use the replaced text sequence as the Candidate text sequences.
在第二方面的一种可能的实现方式中,所述纠错单元具体还用于:In a possible implementation manner of the second aspect, the error correction unit is further configured to:
若存在不匹配的第二首字母,且存在与对应的第一首字母的关联的字母,则将所述不匹配的第二首字母替换为对应的第一首字母,以及将所述不匹配的第二首字母替换为关联的字母,得到替换后的至少一个文字序列。If there is an unmatched second initial, and there is a letter associated with the corresponding first initial, then replace the unmatched second initial with the corresponding first initial, and replace the unmatched second initial with the corresponding first initial The second initial of is replaced with the associated letter, resulting in at least one literal sequence after the replacement.
在第二方面的一种可能的实现方式中,所述确定单元具体用于:In a possible implementation manner of the second aspect, the determining unit is specifically configured to:
将所述唇部特征序列输入训练后的唇语识别模型,获得所述唇语识别模型输出的文字序列,所述唇语识别模型用于识别唇部特征对应的文字,所述唇语识别模型是基于唇部特征,以及唇部特征对应的文字作为训练样本训练得到的。Inputting the lip feature sequence into a trained lip language recognition model to obtain a text sequence output by the lip language recognition model, the lip language recognition model is used to recognize the text corresponding to the lip features, and the lip language recognition model It is trained based on lip features and the text corresponding to the lip features as training samples.
第三方面,提供一种电子设备,包括处理器,所述处理器用于执行存储在存储器中的计算机程序,以实现如上述第一方面所述的文字输入方法。In a third aspect, an electronic device is provided, including a processor for executing a computer program stored in a memory, so as to implement the text input method according to the above-mentioned first aspect.
第四方面,提供一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,所述计算机程序被处理器执行时实现如上述第一方面所述的文字输入方法。In a fourth aspect, a computer-readable storage medium is provided, where the computer-readable storage medium stores a computer program, and when the computer program is executed by a processor, implements the text input method according to the first aspect.
第五方面,提供一种计算机程序产品,当计算机程序产品在电子设备上运行时,使得电子设备执行上述第一方面中所述的文字输入方法。A fifth aspect provides a computer program product that, when the computer program product runs on an electronic device, enables the electronic device to execute the text input method described in the first aspect.
可以理解的是,上述第二方面至第五方面的有益效果可以参见上述第一方面中的相关描述,在此不再赘述。It can be understood that, for the beneficial effects of the second aspect to the fifth aspect, reference may be made to the relevant description in the first aspect, which is not repeated here.
附图说明Description of drawings
图1为本申请实施例提供的文字输入方法的流程示意图;1 is a schematic flowchart of a text input method provided by an embodiment of the present application;
图2为本申请实施例提供的文字输入方法的应用场景图;FIG. 2 is an application scenario diagram of the text input method provided by the embodiment of the present application;
图3为本申请实施例提供的唇部形状示意图;3 is a schematic diagram of a lip shape provided by an embodiment of the present application;
图4为本申请一实施例提供的文字输入方法的具体流程图;4 is a specific flowchart of a text input method provided by an embodiment of the present application;
图5为本申请一实施例提供的输出文字序列的方法示意图;5 is a schematic diagram of a method for outputting a text sequence provided by an embodiment of the present application;
图6为本申请另一实施例提供的文字输入方法的具体流程图;6 is a specific flowchart of a text input method provided by another embodiment of the present application;
图7为本申请实施例提供的雷达波信号的示意图;7 is a schematic diagram of a radar wave signal provided by an embodiment of the present application;
图8为本申请实施例提供的距离多普勒图像的示意图;8 is a schematic diagram of a range Doppler image provided by an embodiment of the present application;
图9为本申请实施例提供的电子设备的结构示意图。FIG. 9 is a schematic structural diagram of an electronic device provided by an embodiment of the present application.
具体实施方式detailed description
以下描述中,为了说明而不是为了限定,提出了诸如特定系统结构、技术之类的具体细节,以便透彻理解本申请实施例。然而,本领域的技术人员应当清楚,在没有这些具体细节的其它实施例中也可以实现本申请。在其它情况中,省略对众所周知的系统、装置、电路以及方法的详细说明,以免不必要的细节妨碍本申请的描述。In the following description, for the purpose of illustration rather than limitation, specific details such as a specific system structure and technology are set forth in order to provide a thorough understanding of the embodiments of the present application. However, it will be apparent to those skilled in the art that the present application may be practiced in other embodiments without these specific details. In other instances, detailed descriptions of well-known systems, devices, circuits, and methods are omitted so as not to obscure the description of the present application with unnecessary detail.
应当理解,当在本申请说明书和所附权利要求书中使用时,术语“包括”指示所描述特征、整体、步骤、操作、元素和/或组件的存在,但并不排除一个或多个其它特征、整体、步骤、操作、元素、组件和/或其集合的存在或添加。It is to be understood that, when used in this specification and the appended claims, the term "comprising" indicates the presence of the described feature, integer, step, operation, element and/or component, but does not exclude one or more other The presence or addition of features, integers, steps, operations, elements, components and/or sets thereof.
还应当理解,在本申请说明书和所附权利要求书中使用的术语“和/或”是指相关联列出的项中的一个或多个的任何组合以及所有可能组合,并且包括这些组合。It will also be understood that, as used in this specification and the appended claims, the term "and/or" refers to and including any and all possible combinations of one or more of the associated listed items.
如在本申请说明书和所附权利要求书中所使用的那样,术语“如果”可以依据上下文被解释为“当...时”或“一旦”或“响应于确定”或“响应于检测到”。类似地,短语“如果确定”或“如果检测到[所描述条件或事件]”可以依据上下文被解释为意指“一旦确定”或“响应于确定”或“一旦检测到[所描述条件或事件]”或“响应于检测到[所描述条件或事件]”。As used in the specification of this application and the appended claims, the term "if" may be contextually interpreted as "when" or "once" or "in response to determining" or "in response to detecting ". Similarly, the phrases "if it is determined" or "if the [described condition or event] is detected" may be interpreted, depending on the context, to mean "once it is determined" or "in response to the determination" or "once the [described condition or event] is detected. ]" or "in response to detection of the [described condition or event]".
另外,在本申请的描述中,术语“第一”、“第二”等仅用于区分描述,而不能理解为指示或暗示相对重要性。In addition, in the description of the present application, the terms "first", "second" and the like are only used to distinguish the description, and cannot be understood as indicating or implying relative importance.
在本申请说明书中描述的参考“一个实施例”或“一些实施例”等意味着在本申请的一个或多个实施例中包括结合该实施例描述的特定特征、结构或特点。由此,在本说明书中的不同之处出现的语句“在一个实施例中”、“在一些实施例中”、“在其他一些实施例中”、“在另外一些实施例中”等不是必然都参考相同的实施例,而是意味着“一个或多个但不是所有的实施例”,除非是以其他方式另外特别强调。术语“包括”、“包含”、“具有”及它们的变形都意味着“包括但不限于”,除非是以其他方式另外特别强调。References in this specification to "one embodiment" or "some embodiments" and the like mean that a particular feature, structure or characteristic described in connection with the embodiment is included in one or more embodiments of the present application. Thus, appearances of the phrases "in one embodiment," "in some embodiments," "in other embodiments," "in other embodiments," etc. in various places in this specification are not necessarily All refer to the same embodiment, but mean "one or more but not all embodiments" unless specifically emphasized otherwise. The terms "including", "including", "having" and their variants mean "including but not limited to" unless specifically emphasized otherwise.
现有的输入法主要有语音输入法、拼音输入法以及手写输入法等。拼音输入法以及手写输入法的输入准确率较高,但是输入效率较低。语音输入法一般是先进行语音识别,将识别出的语音转换为对应的文字,具有较高的输入效率,但是在一些噪音较大或者不能大声说话的场景下,语音识别的准确率较低,导致语音转化为文本的准确率也降低。Existing input methods mainly include voice input method, pinyin input method and handwriting input method. Pinyin input method and handwriting input method have higher input accuracy, but lower input efficiency. The voice input method generally performs voice recognition first, and converts the recognized voice into corresponding text, which has high input efficiency. As a result, the accuracy of speech-to-text conversion is also reduced.
为此,本申请实施例提供一种文字输入方法,在检测到用户的文字输入操作时,获取用户的唇部变化信息以及用户输入的字符信息,唇部变化信息包括用户在说出待输入的文字时的唇部特征序列,根据唇部特征序列以及字符信息确定用户待输入的文字。由 于根据用户输入的字符信息确定出的待输入的文字的准确率较高,因此将唇部特征序列与字符信息结合确定待识别的文字,可以在语音输入方式下提高输入的文字的准确率。To this end, an embodiment of the present application provides a text input method. When a text input operation by a user is detected, the lip change information of the user and the character information input by the user are obtained, where the lip change information includes the user speaking the text to be input. The lip feature sequence when writing is used to determine the text to be input by the user according to the lip feature sequence and character information. Since the accuracy of the characters to be input determined according to the character information input by the user is high, combining the lip feature sequence with the character information to determine the characters to be recognized can improve the accuracy of the input characters in the voice input mode.
下面结合具体实施例对本申请提供的文字输入方法进行示例性说明。The text input method provided by the present application will be exemplarily described below with reference to specific embodiments.
本申请实施例提供的文字输入方法应用于电子设备,电子设备可以是手机、平板电脑、手持计算机、个人数字助理(personal digital assistant,PDA)、带屏音箱、穿戴设备等。The text input method provided in the embodiments of the present application is applied to electronic devices, and the electronic devices may be mobile phones, tablet computers, handheld computers, personal digital assistants (PDAs), speakers with screens, wearable devices, and the like.
如图1所示,本申请一实施例提供的文字输入方法包括:As shown in FIG. 1 , a text input method provided by an embodiment of the present application includes:
S101:在检测到用户的文字输入操作时,获取用户的唇部变化信息以及用户输入的字符信息,唇部变化信息包括用户在说出待输入的文字时的唇部特征序列。S101: When a text input operation of the user is detected, obtain lip change information of the user and character information input by the user, where the lip change information includes a lip feature sequence when the user speaks the text to be input.
其中,用户可以采用软键盘输入或者手写输入的方式输入字符信息,输入的字符信息可以是文字,也可以是与文字对应的字母,也可以是文字的首字母,文字可以是中文、英文或其它外文。The user can input character information by means of soft keyboard input or handwriting input. The input character information can be text, letters corresponding to the text, or the first letter of the text, and the text can be Chinese, English or other foreign language.
用户在说出待输入的文字时可以发出声音,也可以不发出声音。唇部特征序列包括连续时间内,每个时刻的唇部特征,唇部特征用于表征是用户在说话时的唇部形状,不同的唇部形状对应不同的发音。The user may or may not make a sound when speaking the text to be input. The lip feature sequence includes lip features at each moment in a continuous time. The lip features are used to represent the lip shape of the user when speaking, and different lip shapes correspond to different pronunciations.
在一种可能的实现方式中,电子设备通过摄像头采集用户的嘴唇区域的图像序列,图像序列为连续时间内,每个时刻的嘴唇区域图像,再从图像序列的每张图像中提取唇部特征,获得唇部特征序列。In a possible implementation manner, the electronic device collects an image sequence of the user's lip region through a camera, the image sequence is an image of the lip region at each moment in a continuous time, and then extracts lip features from each image in the image sequence , to obtain a sequence of lip features.
在另一种可能的实现方式中,电子设备通过无线传感器发射无线信号,无线信号在碰到障碍物后反射回信号,反射回的信号即为反射信号。电子设备再通过无线传感器再接收反射信号,根据连续时间内每个时刻的反射信号,得到反射信号序列,根据反射信号序列确定障碍物是否为嘴唇,若确定出障碍物为嘴唇,从嘴唇对应的反射信号序列中的每个反射信号中提取唇部特征,获得唇部特征序列。其中,无线信号可以是雷达波信号、红外线信号或者超声波信号等。In another possible implementation manner, the electronic device transmits a wireless signal through a wireless sensor, and the wireless signal is reflected back after encountering an obstacle, and the reflected signal is a reflected signal. The electronic device then receives the reflected signal through the wireless sensor, obtains the reflected signal sequence according to the reflected signal at each moment in the continuous time, and determines whether the obstacle is a lip according to the reflected signal sequence. A lip feature is extracted from each reflected signal in the reflected signal sequence to obtain a lip feature sequence. The wireless signal may be a radar wave signal, an infrared signal, or an ultrasonic signal, or the like.
在一种可能的实现方式中,用户选择预设的输入模式(例如多模输入模式)后,电子设备在检测到用户开始输入字符后,获取用户的唇部变化信息。In a possible implementation manner, after the user selects a preset input mode (eg, a multi-mode input mode), the electronic device acquires the user's lip change information after detecting that the user starts to input characters.
例如,在一种应用场景中,用户打开短信编辑页面,电子设备在检测到多模输入模式的指令后,打开如图2所示的界面,若用户需要输入中文,可以通过图2所示的界面输入中文拼音的首字母,同时进行唇语输入。当用户开始进行输入时,电子设备的摄像头采集用户的唇部图像,从各个时刻的唇部图像中提取唇部特征,即得到唇部特征序列。For example, in an application scenario, the user opens the text message editing page, and the electronic device opens the interface shown in Figure 2 after detecting the instruction of the multi-mode input mode. The interface inputs the first letter of Chinese pinyin, and simultaneously performs lip language input. When the user starts to input, the camera of the electronic device collects the lip image of the user, and extracts the lip feature from the lip image at each moment, that is, the lip feature sequence is obtained.
S102:根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字。S102: Determine the text to be input by the user according to the lip feature sequence and the character information.
由于用户在说话时,当唇部特征变化,即唇部形状变化时,发音也会变化,因此,根据用户说话时的唇部形状,可以确定出用户的发音,进一步确定用户说出的文字。但是,当用户说出不同的文字时,对应的唇部形状也可能相同,例如,当唇部为如图3中的(A)所示的形状时,对应的发音可能是英文字母的“A”、“E”或者“I”,当唇部为如图3中的(B)所示的形状时,对应的发音可能是英文字母的“Q”或者“W”。而且同一个文字可能需要变化的唇部形状才可以发出,例如,英文字母“M”、“L”需要变化的唇部形状才可以发出。因此,根据用户的每个时刻唇部特征,均有可能确定出多个文字,或者确定出错误的文字。再根据各个时刻的唇部特征,即唇部特征序列,可以确定出多种可能的文字序列。When the user speaks, when the lip features change, that is, when the lip shape changes, the pronunciation will also change. Therefore, according to the lip shape when the user speaks, the user's pronunciation can be determined, and the text spoken by the user can be further determined. However, when the user speaks different words, the corresponding lip shape may also be the same. For example, when the lip is in the shape as shown in (A) in Figure 3, the corresponding pronunciation may be the English letter "A" ", "E" or "I", when the lips are in the shape shown in (B) in Figure 3, the corresponding pronunciation may be "Q" or "W" of English letters. Moreover, the same character may need a changed lip shape before it can be issued. For example, the English letters "M" and "L" need a changed lip shape before they can be issued. Therefore, according to the lip features of the user at each moment, it is possible to determine multiple characters or determine wrong characters. Then, according to the lip features at each moment, that is, the lip feature sequence, a variety of possible text sequences can be determined.
用户输入的每个字符信息也对应一个或者多个文字,将唇部特征序列与字符信息结 合,可以纠正根据唇部特征序列识别出的文字,或者去除根据唇部特征序列识别出的错误的文字,从而得到准确的文字序列,即用户待输入的文字,从而不用获取用户的语音,即可完成文字输入,提高了文字输入的准确率。Each character information input by the user also corresponds to one or more characters. Combining the lip feature sequence with the character information can correct the characters identified according to the lip feature sequence, or remove the wrong characters identified according to the lip feature sequence. , so as to obtain an accurate text sequence, that is, the text to be input by the user, so that the text input can be completed without acquiring the user's voice, and the accuracy of the text input is improved.
在一种可能的实现方式中,电子设备根据唇部特征序列和字符信息可以确定出多个候选文字序列,在确定出多个候选文字序列后,根据各文字序列的语义,选择出概率最大的候选文字序列,将概率最大的候选文字序列作为用户待输入的文字。In a possible implementation manner, the electronic device may determine multiple candidate character sequences according to the lip feature sequence and character information, and after determining multiple candidate character sequences, select the one with the highest probability according to the semantics of each character sequence. Candidate text sequence, the candidate text sequence with the highest probability is used as the text to be input by the user.
在一种可能的实现方式中,可以将候选文字序列输入训练好的语义识别模型中,获得语义识别模型输出的概率最大的候选文字序列。其中,语义识别模型是基于文字序列,以及概率最大的文字序列作为训练样本训练得到的。In a possible implementation manner, the candidate text sequence may be input into the trained semantic recognition model, and the candidate text sequence with the highest probability output by the semantic recognition model is obtained. Among them, the semantic recognition model is trained based on the text sequence and the text sequence with the highest probability as the training sample.
上述实施例中,由于根据用户输入的字符信息确定出的待输入的文字的准确率较高,因此将唇部特征序列与字符信息结合确定待识别的文字,可以在语音输入方式下提高输入的文字的准确率。In the above embodiment, since the accuracy of the text to be input determined according to the character information input by the user is high, the combination of the lip feature sequence and the character information to determine the text to be recognized can improve the input accuracy in the voice input mode. accuracy of the text.
在一种可能的实现方式中,用户输入的字符信息包括用户待输入的文字的第一首字母,其中,“第一”用于区分描述“首字母”,“首字母”指文字的第一个字母,可以是中文拼音中的第一个字母,例如“wen”中的“w”,也可以是英文单词的第一个字母,例如“Good”中的“G”。第一首字母包括待输入的文字中的每个文字的首字母。电子设备根据获取的唇部特征序列确定出文字序列后,根据第一首字母,对确定出的文字序列进行纠正处理,获得至少一个纠正后的候选文字序列。In a possible implementation manner, the character information input by the user includes the first letter of the text to be input by the user, wherein "first" is used to distinguish and describe the "initial letter", and "the first letter" refers to the first letter of the text It can be the first letter in Chinese Pinyin, such as "w" in "wen", or the first letter of an English word, such as "G" in "Good". The first letter includes the first letter of each of the words to be entered. After determining the character sequence according to the acquired lip feature sequence, the electronic device performs correction processing on the determined character sequence according to the first initial to obtain at least one corrected candidate character sequence.
若用户待输入的文字是中文,电子设备在根据获取的唇部特征序列确定出文字序列后,根据第一首字母,对确定出的文字序列的拼音进行纠正,得到纠正后的拼音,根据纠正后的拼音确定至少一个纠正后的候选文字序列。If the text to be input by the user is Chinese, after determining the text sequence according to the acquired lip feature sequence, the electronic device corrects the pinyin of the determined text sequence according to the first letter, and obtains the corrected pinyin. The resulting pinyin determines at least one corrected candidate character sequence.
例如,在一种应用场景中,电子设备根据第一首字母对文字序列的拼音进行纠正,得到一个纠正后的拼音,根据纠正后的拼音确定纠正后的候选文字序列。例如,根据唇部特征序列确定出的文字序列为“荷兰”,第一首字母为“hn”,则对文字序列的拼音“helan”进行纠正,得到的纠正后的拼音为“henan”,根据发音“henan”确定出的候选文字序列包括“河南”、“贺楠”等。For example, in an application scenario, the electronic device corrects the pinyin of the text sequence according to the first letter to obtain a corrected pinyin, and determines the corrected candidate text sequence according to the corrected pinyin. For example, if the character sequence determined according to the lip feature sequence is "Netherlands" and the first letter is "hn", then the pinyin "helan" of the character sequence is corrected, and the corrected pinyin is "henan". The candidate character sequences determined by the pronunciation of "henan" include "Henan", "Henan" and so on.
在另一种应用场景中,电子设备根据第一首字母对文字序列进行纠正,得到多个纠正后的拼音,根据每个纠正后的拼音确定纠正后的候选文字序列。例如,根据唇部特征序列确定出的文字序列为“飞机”,第一首字母为“hj”,则对文字序列的拼音“feiji”进行纠正,得到的纠正后的拼音为“huijia”、“huiji”等,对于拼音“huijia”,确定出的候选文字序列为“回家”、“汇价”等,对于拼音“huiji”,确定出的候选文字序列为“汇集”、“惠及”等。In another application scenario, the electronic device corrects the text sequence according to the first letter, obtains a plurality of corrected pinyin, and determines the corrected candidate text sequence according to each corrected pinyin. For example, if the character sequence determined according to the lip feature sequence is "airplane" and the first letter is "hj", then the pinyin "feiji" of the character sequence is corrected, and the corrected pinyin is "huijia", "" huiji", etc. For the pinyin "huijia", the determined candidate text sequences are "home", "exchange price", etc. For the pinyin "huiji", the determined candidate text sequences are "collection", "benefit" and so on.
电子设备在获得纠正后的候选文字序列后,再从候选文字序列中确定概率最大的候选文字序列,将概率最大的候选文字序列作为用户待输入的文字。例如,将候选文字序列输入语义识别模型中,获得语义识别模型输出的概率最大的候选文字序列。由于首字母输入法在没有历史联想信息的情况下,准确率较低,但是输入速度较快,通过唇部特征序列和首字母的结合,提高了文字输入的准确率,且用户使用较短的时间即可完成首字母的输入,提高了输入效率。例如,若用户想输入“中国人”三个字,只需要输入三个首字母“z”“g”“r”,同时嘴部默念“中国人”三个字,就可以完成文字输入。After obtaining the corrected candidate character sequence, the electronic device determines the candidate character sequence with the highest probability from the candidate character sequence, and uses the candidate character sequence with the highest probability as the character to be input by the user. For example, input the candidate text sequence into the semantic recognition model, and obtain the candidate text sequence with the highest probability output by the semantic recognition model. Since the initial letter input method has a low accuracy rate without historical association information, but the input speed is fast, the combination of the lip feature sequence and the initial letter improves the accuracy rate of text input, and the user uses a shorter The input of the initial letter can be completed in time, which improves the input efficiency. For example, if the user wants to input the three characters "Chinese", he only needs to input the three initials "z", "g", and "r", and at the same time, the words "Chinese" are silently recited in his mouth, and then the text input can be completed.
在一种可能的实现方式中,电子设备根据第一首字母对根据唇部特征序列确定出的 文字序列进行纠正处理的方法具体为:首先提取出确定出的文字序列中每个文字的第二首字母,其中,“第二”用于区分描述“首字母”。对于英文来说,可以直接从文字序列中提取出第二首字母,对于中文来说,首先需要将中文转换为对应的拼音,再从每个拼音中提取出第二首字母。In a possible implementation manner, the method for the electronic device to correct the character sequence determined according to the lip feature sequence according to the first initial letter is specifically: first extracting the second character sequence of each character in the determined character sequence Initials, where "second" is used to distinguish the description "initials". For English, the second letter can be directly extracted from the text sequence. For Chinese, it is first necessary to convert the Chinese into the corresponding pinyin, and then extract the second letter from each pinyin.
提取出第二首字母后,将第二首字母与第一首字母进行匹配,若存在不匹配的第二首字母,将不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,再将替换后的文字序列作为候选文字序列。其中,对应的第一首字母是指与第二首字母的位置对应的第一首字母,例如,若第二首字母是文字序列中第二个文字的首字母,则对应的第一首字母为用户输入的第二个首字母。After extracting the second letter, match the second letter with the first letter. If there is an unmatched second letter, replace the unmatched second letter with the corresponding first letter to get the replacement After at least one character sequence, the replaced character sequence is used as a candidate character sequence. The corresponding first letter refers to the first letter corresponding to the position of the second letter. For example, if the second letter is the first letter of the second letter in the text sequence, the corresponding first letter The second initial entered for the user.
下面以中文为例,介绍得到候选文字序列的过程。The following takes Chinese as an example to introduce the process of obtaining the candidate character sequence.
在一种应用场景中,若存在不匹配的第二首字母,则直接用第一首字母替换不匹配的第二首字母,得到替换后的拼音,再根据替换后的拼音确定替换后的至少一个候选文字序列。例如,根据唇部特征序列确定出的文字序列为“支持”,则提取出的第二首字母为“zc”,若第一首字母为“zs”,则存在不匹配的第二首字母“c”,用第一首字母“s”替换“c”,得到替换后的拼音为“zhishi”,再根据替换后的拼音确定的候选文字序列为“知识”、“指示”等。In an application scenario, if there is an unmatched second letter, the unmatched second letter is directly replaced with the first letter to obtain the replaced pinyin, and then the replaced at least one is determined according to the replaced pinyin. A candidate text sequence. For example, if the character sequence determined according to the lip feature sequence is "support", the extracted second initial letter is "zc"; if the first initial letter is "zs", there is a mismatched second initial letter" c", replace "c" with the first letter "s", get the replaced pinyin as "zhishi", and then determine the candidate word sequence according to the replaced pinyin as "knowledge", "instruction", etc.
在另一种应用场景中,若存在不匹配的第二首字母,直接用第一首字母替换不匹配的第二首字母,替换后的字母不能形成一个文字,或者替换后的字母与文字序列的发音差别较大时,则对替换后的字母进行修正,根据修正结果得到至少一个文字序列。例如,根据唇部特征序列确定出的文字序列为“发挥”,对应的拼音为“fahui”,则提取出的第二首字母为“fh”,若第一首字母为“fw”,则存在不匹配的第二首字母“h”,若直接用第一首字母“w”替换“h”,替换后的字母形成的发音为“fawui”,由于“wui”不能形成文字,因此,不能直接替换,因此,根据预设的修正规则,对替换后的字母进行修正,将“wui”修正为“wei”,得到修正后的拼音“fawei”,再根据修正后的拼音确定的候选文字序列为“乏味”、“发尾”等。In another application scenario, if there is an unmatched second letter, the unmatched second letter is directly replaced with the first letter. The replaced letter cannot form a text, or the replaced letter and the text sequence When the pronunciation difference of the letters is relatively large, the replaced letters are corrected, and at least one character sequence is obtained according to the correction result. For example, if the character sequence determined according to the lip feature sequence is "play" and the corresponding pinyin is "fahui", the second first letter extracted is "fh", if the first letter is "fw", then there is The unmatched second letter "h", if you directly replace "h" with the first letter "w", the pronunciation formed by the replaced letter is "fawui", because "wui" cannot form words, therefore, it cannot be directly Therefore, according to the preset correction rules, the replaced letters are corrected, "wui" is corrected to "wei", and the corrected pinyin "fawei" is obtained, and then the candidate word sequence determined according to the corrected pinyin is: "boring", "hair tail", etc.
又例如,根据唇部特征序列确定出的文字序列为“哇”,对应的发音为“wa”,则提取出的第二首字母为“w”,若第一首字母为“h”,则存在不匹配的第二首字母,若直接用“h”替换“w”,根据替换后的字母得到的拼音为“ha”,由于“ha”与“wa”的发音差别较大,因此,不能直接替换,因此,根据预设的修正规则,对替换后的字母进行修正,将“ha”修正为“hua”,使得修正后的拼音的发音与文字序列的发音接近,再根据修正后的拼音确定的候选文字序列为“花”、“华”等。For another example, the character sequence determined according to the lip feature sequence is "wow", and the corresponding pronunciation is "wa", then the extracted second initial letter is "w", if the first initial letter is "h", then There is an unmatched second letter, if you directly replace "w" with "h", the pinyin obtained according to the replaced letter is "ha", because the pronunciation of "ha" and "wa" is quite different, therefore, it cannot be Direct replacement. Therefore, according to the preset correction rules, the replaced letters are corrected, and "ha" is corrected to "hua", so that the pronunciation of the corrected pinyin is close to the pronunciation of the text sequence, and then according to the corrected pinyin The determined candidate character sequences are "flower", "hua" and so on.
由于一些字母序列对应的发音比较接近,例如,拼音中的“r”和“l”发音接近,“n”和“l”发音接近,“h”和“f”发音接近,“zh”和“z”发音接近,“ch”和“c”发音接近,“sh”和“s”发音接近。因此,用户输入的第一首字母可能存在错误,为了提高文字输入的准确率,在一种可能的实现方式中,预先设定关联数据库,关联数据库中存储存在关联关系的字母,存在关联关系的字母为发音接近、容易混淆的字母。电子设备在判定存在不匹配的第二首字母后,根据关联数据库中存储的存在关联关系的字母,判断是否存在与对应的第一首字母关联的字母。若存在与对应的第一首字母的关联的字母,则将不匹配的第二首字母替换为对应的第一首字母,以及将不匹配的第二首字母替换为关联的字母,得到替换后的至少一个文字序列,从而可以扩大候选文字序列的范围, 再根据候选文字序列确定用户待输入的文字时,提高了文字输入的准确率。Since the corresponding pronunciations of some letter sequences are relatively close, for example, in Pinyin, the pronunciations of "r" and "l" are close, "n" and "l" are close, "h" and "f" are close, and "zh" and " z" is pronounced close, "ch" and "c" are close, and "sh" and "s" are close. Therefore, there may be errors in the first letter input by the user. In order to improve the accuracy of text input, in a possible implementation, an associated database is preset, and the associated database stores letters with associated relationships. Letters are letters that are pronounced close to each other and are easily confused. After determining that there is an unmatched second initial, the electronic device determines whether there is a letter associated with the corresponding first initial according to the associated letters stored in the associated database. If there is a letter associated with the corresponding first letter, replace the unmatched second letter with the corresponding first letter, and replace the unmatched second letter with the associated letter, and get the replacement Therefore, the range of the candidate character sequence can be expanded, and when the character to be input by the user is determined according to the candidate character sequence, the accuracy of character input is improved.
例如,文字序列为“自己”,对应的发音为“ziji”,则提取出的第二首字母为“zj”,若第一首字母为“sj”,则存在不匹配的第二首字母“z”,对应的第一首字母为“s”,且存在与第一首字母“s”的关联的字母“sh”,则将“z”替换为“s”,得到替换后的发音为“siji”,同时,将“z”替换为“sh”得到替换后的发音为“shiji”,最终根据替换后的发音得到的候选文字序列包括“司机”、“四级”、“实际”、“时机”等。For example, if the text sequence is "self" and the corresponding pronunciation is "ziji", then the extracted second letter is "zj", if the first letter is "sj", there is a mismatched second letter" z", the corresponding first letter is "s", and there is a letter "sh" associated with the first letter "s", then "z" is replaced with "s", and the replaced pronunciation is " siji", at the same time, replace "z" with "sh" to obtain the replaced pronunciation as "shiji", and finally the candidate text sequence obtained according to the replaced pronunciation includes "driver", "fourth level", "actual", " timing" etc.
又例如,文字序列为“落”,对应的发音为“luo”,则提取出的第二首字母为“l”,若第一首字母为“r”,则存在不匹配的第二首字母,且存在与第一首字母“r”发音接近的字母“n”,则将“l”替换为“r”,得到替换后的发音“ruo”,同时,将“l”替换为“n”,得到替换后的发音“nuo”,最终根据替换后的发音得到的候选文字序列为“若”、“弱”、“挪”、“诺”等。For another example, if the text sequence is "fall" and the corresponding pronunciation is "luo", then the extracted second initial letter is "l", and if the first initial letter is "r", there is an unmatched second initial letter , and there is a letter "n" that sounds close to the first letter "r", then replace "l" with "r" to get the replaced pronunciation "ruo", and at the same time, replace "l" with "n" , to obtain the replaced pronunciation "nuo", and finally the candidate text sequences obtained according to the replaced pronunciation are "If", "Weak", "Nuo", "Nuo", etc.
在一种可能的实现方式中,唇部特征序列是从用户的嘴唇区域的图像序列中获取的,唇语识别模型是基于嘴唇区域的图像序列中提取的唇部特征序列,以及唇部特征序列对应的文字序列作为训练样本训练得到的。对应地,文字输入方法的具体流程如图4所示,在检测到用户的文字输入操作时,获取用户输入的待输入文字的第一首字母,同时采用电子设备上前置的摄像头采集人脸图像,对采集的每张人脸图像进行识别,识别出图像中的人脸。如图5所示,在识别出人脸后,从人脸中截取出嘴唇区域的图像,连续时间内各时刻的嘴唇区域的图像形成图像序列,从图像序列中的每张图像中提取唇部特征,得到唇部特征序列。然后将唇部特征序列输入唇语识别模型中,获得唇语识别模型输出的文字序列。其中,唇语识别模型可以是时空卷积神经网络(Spatiotemporal Convolutional Neural Networks,STCNN)模型。在得到唇语识别模型输出的文字序列后,采用第一首字母对文字序列进行纠错处理,得到纠正后的候选文字序列。例如,若待输入文字为中文,在得到唇语识别模型输出的文字序列后,将文字序列转换为拼音,提取拼音中的第二首字母,采用第一首字母替换第二首字母,得到替换后的拼音,根据替换后的拼音确定候选文字序列。得到候选文字序列后,确定每个候选文字序列的语义,根据每个候选文字序列的语义,确定出概率最大的候选文字序列。例如,可以将确定出的文字序列与预先存储的词语数据库进行比较,选择与词语数据库匹配程度最高的候选文字序列,作为概率最大的候选文字序列,再将概率最大的候选文字序列作为用户待输入的文字。In a possible implementation, the lip feature sequence is obtained from the image sequence of the lip region of the user, the lip language recognition model is based on the lip feature sequence extracted from the lip region image sequence, and the lip feature sequence The corresponding text sequences are obtained by training as training samples. Correspondingly, the specific flow of the text input method is shown in FIG. 4 , when the text input operation of the user is detected, the first letter of the text to be input input by the user is obtained, and the front camera on the electronic device is used to collect the face. Image, identify each face image collected, and identify the face in the image. As shown in Figure 5, after the face is recognized, the image of the lip area is cut out from the face, and the images of the lip area at each moment in the continuous time form an image sequence, and the lips are extracted from each image in the image sequence. feature to get the lip feature sequence. Then input the lip feature sequence into the lip language recognition model to obtain the text sequence output by the lip language recognition model. The lip language recognition model may be a Spatiotemporal Convolutional Neural Networks (STCNN) model. After the text sequence output by the lip language recognition model is obtained, the first letter is used to correct the text sequence, and the corrected candidate text sequence is obtained. For example, if the input text is Chinese, after obtaining the text sequence output by the lip language recognition model, convert the text sequence into pinyin, extract the second letter in the pinyin, replace the second letter with the first letter, and obtain the replacement After the pinyin is replaced, the candidate character sequence is determined according to the replaced pinyin. After the candidate character sequence is obtained, the semantics of each candidate character sequence is determined, and according to the semantics of each candidate character sequence, the candidate character sequence with the highest probability is determined. For example, the determined word sequence can be compared with a pre-stored word database, and the candidate word sequence with the highest degree of matching with the word database can be selected as the candidate word sequence with the highest probability, and then the candidate word sequence with the highest probability can be selected as the candidate word sequence to be input by the user Text.
在其他可能的实现方式中,也可以将替换后的拼音输入预设的语义识别模型中,获得语义识别模型输出的概率最大的候选文字序列。其中,语义识别模型是基于发音,以及概率最大的文字序列作为训练样本训练得到的。In other possible implementation manners, the replaced pinyin may also be input into a preset semantic recognition model to obtain a candidate character sequence with the highest probability output by the semantic recognition model. Among them, the semantic recognition model is trained based on pronunciation and the text sequence with the highest probability as a training sample.
在另一种可能的实现方式中,唇部特征序列是从经障碍物反射后的雷达波信号序列中获取的,唇语识别模型是基于雷达波信号序列中提取的唇部特征序列,以及唇部特征序列对应的文字作为训练样本训练得到的。对应地,文字输入方法的具体流程如图6所示,在检测到用户的文字输入操作时,获取用户输入的待输入文字的第一首字母,同时采用电子设备上前置的雷达发射雷达波信号,并接收经障碍物反射的雷达波信号序列,即反射信号序列,反射信号序列包括连续时间内各时刻的反射信号。其中,雷达可以是60GHz毫米波雷达,雷达天线可以是单发射、多接收的模式,也可以是多发射、多接收的模式。由于反射信号相对于发射信号的延时、以及反射信号的多普勒效应可以反映障碍物的特征,包括障碍物的大小、形状、距离、速度等信息,因此,通过对用户说话时 唇部的反射信号序列进行处理,可以获取用户在说话时的唇部特征序列。然后将唇部特征序列输入唇语识别模型中,获得唇语识别模型输出的文字序列。In another possible implementation, the lip feature sequence is obtained from the radar wave signal sequence reflected by the obstacle, and the lip language recognition model is based on the lip feature sequence extracted from the radar wave signal sequence, and the lip The text corresponding to the partial feature sequence is obtained by training as the training sample. Correspondingly, the specific flow of the text input method is shown in FIG. 6 , when the text input operation of the user is detected, the first letter of the text to be input input by the user is obtained, and the radar in front of the electronic device is used to transmit radar waves. and receive the radar wave signal sequence reflected by the obstacle, that is, the reflected signal sequence, and the reflected signal sequence includes the reflected signals at each moment in the continuous time. Among them, the radar can be a 60GHz millimeter-wave radar, and the radar antenna can be a single-transmission and multi-reception mode, or a multi-transmission and multi-reception mode. Since the delay of the reflected signal relative to the transmitted signal and the Doppler effect of the reflected signal can reflect the characteristics of the obstacle, including the size, shape, distance, speed and other information of the obstacle, it is possible to use the lip movement when speaking to the user. The reflected signal sequence is processed to obtain the lip feature sequence of the user when speaking. Then input the lip feature sequence into the lip language recognition model to obtain the text sequence output by the lip language recognition model.
在一种可能的实现方式中,可以采用调频连续波(Frequency Modulated Continuous Wave,FMCW)的调制格式对雷达波进行调制,FMCW调制格式是由周期性锯齿波函数进行调制的。对雷达波进行调制后,得到如图7所示的调制后的雷达波,其中,反射信号s2相对于发射信号s1存在延时,且反射信号与发射信号存在频率差异,频率差异的原因是障碍物运动时的多普勒效应。将反射信号与发射信号相乘,将相乘后的信号基于模拟信号进行低通滤波,得到beat信号。在得到beat信号后,对beat信号进行快速傅里叶变换(fast Fourier transform,FFT),并进行背景消除(例如滤波处理)以去掉静止不变的背景环境,得到如图8所示的距离多普勒图像(Range Doppler Map,RDM)。RDM中的每一格对应矩阵中的一个元素,RDM中,每列中的元素,代表障碍物的距离,每行中的元素,代表障碍物的速度。根据RDM可以确定出当前时刻障碍物的速度和距离,例如图8中的黑色区域表示当前时刻障碍物的速度和距离。得到RDM后,再根据各个接收天线接收到的反射信号,可以得到障碍物的到达角度,根据障碍物的到达角度以及距离,即可得到障碍物的空间位置信息,再根据障碍物的空间位置信息对障碍物进行三维重建,得到障碍物的三维深度图。三维深度图结合RDM中的速度值,即可得到障碍物的四维(三维空间和速度)矢量信号。In a possible implementation manner, the radar wave may be modulated by a modulation format of Frequency Modulated Continuous Wave (FMCW), and the FMCW modulation format is modulated by a periodic sawtooth wave function. After the radar wave is modulated, the modulated radar wave as shown in Figure 7 is obtained, in which the reflected signal s2 is delayed relative to the transmitted signal s1, and the reflected signal and the transmitted signal have a frequency difference, and the reason for the frequency difference is an obstacle The Doppler effect of objects in motion. The reflected signal is multiplied by the transmitted signal, and the multiplied signal is low-pass filtered based on the analog signal to obtain the beat signal. After the beat signal is obtained, fast Fourier transform (FFT) is performed on the beat signal, and background removal (such as filtering) is performed to remove the static background environment, and the distance as shown in Figure 8 is obtained. Range Doppler Map (RDM). Each grid in the RDM corresponds to an element in the matrix. In the RDM, the element in each column represents the distance of the obstacle, and the element in each row represents the speed of the obstacle. The speed and distance of the obstacle at the current moment can be determined according to the RDM. For example, the black area in Figure 8 represents the speed and distance of the obstacle at the current moment. After obtaining the RDM, according to the reflected signals received by each receiving antenna, the arrival angle of the obstacle can be obtained, according to the arrival angle and distance of the obstacle, the spatial position information of the obstacle can be obtained, and then according to the spatial position information of the obstacle 3D reconstruction is performed on the obstacle to obtain the 3D depth map of the obstacle. Combining the 3D depth map with the velocity value in the RDM, the 4D (3D space and velocity) vector signal of the obstacle can be obtained.
同理,本申请实施例中,电子设备接收反射信号后,对反射信号以及对应的发射信号做上述处理,可以得到唇部的四维矢量信号,将唇部的四维矢量信号作为唇部特征,根据唇部各时刻的四维矢量信号,可以得到唇部特征序列。然后将唇部特征序列输入唇语识别模型中,获得唇语识别模型输出的文字序列。在得到唇语识别模型输出的文字序列后,采用第一首字母对文字序列进行纠错处理,得到纠正后的候选文字序列。得到候选文字序列后,确定每个候选文字序列的语义,根据每个候选文字序列的语义,确定出概率最大的候选文字序列,再将概率最大的候选文字序列作为用户待输入的文字。Similarly, in the embodiment of the present application, after receiving the reflected signal, the electronic device performs the above-mentioned processing on the reflected signal and the corresponding transmitted signal to obtain the four-dimensional vector signal of the lip, and the four-dimensional vector signal of the lip is used as the lip feature, according to The lip feature sequence can be obtained from the four-dimensional vector signal of the lips at each moment. Then input the lip feature sequence into the lip language recognition model to obtain the text sequence output by the lip language recognition model. After the text sequence output by the lip language recognition model is obtained, the first letter is used to correct the text sequence, and the corrected candidate text sequence is obtained. After the candidate text sequences are obtained, the semantics of each candidate text sequence are determined, the candidate text sequence with the highest probability is determined according to the semantics of each candidate text sequence, and the candidate text sequence with the highest probability is used as the text to be input by the user.
由于雷达波信号对环境的要求较低,例如,不受外界光线的影响,因此,采用雷达波信号获得唇部特征序列,可以提高电子设备的应用范围。Since the radar wave signal has low requirements on the environment, for example, it is not affected by external light, the application range of the electronic device can be improved by using the radar wave signal to obtain the lip feature sequence.
应理解,上述实施例中各步骤的序号的大小并不意味着执行顺序的先后,各过程的执行顺序应以其功能和内在逻辑确定,而不应对本申请实施例的实施过程构成任何限定。It should be understood that the size of the sequence numbers of the steps in the above embodiments does not mean the sequence of execution, and the execution sequence of each process should be determined by its function and internal logic, and should not constitute any limitation to the implementation process of the embodiments of the present application.
图9示出了本申请实施例提供的电子设备100的结构示意图。FIG. 9 shows a schematic structural diagram of an electronic device 100 provided by an embodiment of the present application.
电子设备100可以包括处理器110,外部存储器接口120,内部存储器121,通用串行总线(universal serial bus,USB)接口130,充电管理模块140,电源管理模块141,电池142,天线1,天线2,移动通信模块150,无线通信模块160,音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,传感器模块180,按键190,马达191,指示器192,摄像头193,显示屏194,以及用户标识模块(subscriber identification module,SIM)卡接口195等。其中传感器模块180可以包括压力传感器180A,陀螺仪传感器180B,气压传感器180C,磁传感器180D,加速度传感器180E,距离传感器180F,接近光传感器180G,指纹传感器180H,温度传感器180J,触摸传感器180K,环境光传感器180L,骨传导传感器180M等。The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and Subscriber identification module (subscriber identification module, SIM) card interface 195 and so on. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
可以理解的是,本发明实施例示意的结构并不构成对电子设备100的具体限定。在本申请另一些实施例中,电子设备100可以包括比图示更多或更少的部件,或者组合某 些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that, the structures illustrated in the embodiments of the present invention do not constitute a specific limitation on the electronic device 100 . In other embodiments of the present application, the electronic device 100 may include more or less components than those shown, or some components may be combined, or some components may be separated, or different component arrangements. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
处理器110可以包括一个或多个处理单元,例如:处理器110可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (neural-network processing unit, NPU), etc. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
处理器110中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器110中的存储器为高速缓冲存储器。该存储器可以保存处理器110刚用过或循环使用的指令或数据。如果处理器110需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复存取,减少了处理器110的等待时间,因而提高了系统的效率。A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and the latency of the processor 110 is reduced, thereby increasing the efficiency of the system.
在一些实施例中,处理器110可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户标识模块(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, the processor 110 may include one or more interfaces. The interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器110可以包含多组I2C总线。处理器110可以通过不同的I2C总线接口分别耦合触摸传感器180K,充电器,闪光灯,摄像头193等。例如:处理器110可以通过I2C接口耦合触摸传感器180K,使处理器110与触摸传感器180K通过I2C总线接口通信,实现电子设备100的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may contain multiple sets of I2C buses. The processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces. For example, the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the processor 110 may contain multiple sets of I2S buses. The processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 . In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器110与无线通信模块160。例如:处理器110通过UART接口与无线通信模块160中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块170可以通过UART接口向无线通信模块160传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the processor 110 with the wireless communication module 160 . For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI 接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。The MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 . MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc. In some embodiments, the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 . The processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones. The interface can also be used to connect other electronic devices, such as AR devices.
可以理解的是,本发明实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在本申请另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules illustrated in the embodiment of the present invention is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 . In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
充电管理模块140用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块140可以通过USB接口130接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块140可以通过电子设备100的无线充电线圈接收无线充电输入。充电管理模块140为电池142充电的同时,还可以通过电源管理模块141为电子设备供电。The charging management module 140 is used to receive charging input from the charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from the wired charger through the USB interface 130 . In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through a wireless charging coil of the electronic device 100 . While the charging management module 140 charges the battery 142 , it can also supply power to the electronic device through the power management module 141 .
电源管理模块141用于连接电池142,充电管理模块140与处理器110。电源管理模块141接收电池142和/或充电管理模块140的输入,为处理器110,内部存储器121,显示屏194,摄像头193,和无线通信模块160等供电。电源管理模块141还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在其他一些实施例中,电源管理模块141也可以设置于处理器110中。在另一些实施例中,电源管理模块141和充电管理模块140也可以设置于同一个器件中。The power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 . The power management module 141 receives input from the battery 142 and/or the charging management module 140, and supplies power to the processor 110, the internal memory 121, the display screen 194, the camera 193, and the wireless communication module 160. The power management module 141 can also be used to monitor parameters such as battery capacity, battery cycle times, battery health status (leakage, impedance). In some other embodiments, the power management module 141 may also be provided in the processor 110 . In other embodiments, the power management module 141 and the charging management module 140 may also be provided in the same device.
电子设备100的无线通信功能可以通过天线1,天线2,移动通信模块150,无线通信模块160,调制解调处理器以及基带处理器等实现。The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
天线1和天线2用于发射和接收电磁波信号。电子设备100中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization. For example, the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。The mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 . The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like. The mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation. The mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。The modem processor may include a modulator and a demodulator. Wherein, the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and passed to the application processor. The application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 . In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), bluetooth (BT), global navigation satellites Wireless communication solutions such as global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared technology (IR). The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 . The wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves for radiation through the antenna 2 .
在一些实施例中,电子设备100的天线1和移动通信模块150耦合,天线2和无线通信模块160耦合,使得电子设备100可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications,GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidou navigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology. The wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc. The GNSS may include a global positioning system (global positioning system, GPS), a global navigation satellite system (GLONASS), a Beidou navigation satellite system (BDS), a quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
电子设备100通过GPU,显示屏194,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏194和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器110可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
显示屏194用于显示图像,视频等。显示屏194包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode的,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,电子设备100可以包括1个或N个显示屏194,N为大于1的正整数。Display screen 194 is used to display images, videos, and the like. Display screen 194 includes a display panel. The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light). emitting diode, AMOLED), flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diode (quantum dot light emitting diodes, QLED) and so on. In some embodiments, the electronic device 100 may include one or N display screens 194 , where N is a positive integer greater than one.
电子设备100可以通过ISP,摄像头193,视频编解码器,GPU,显示屏194以及应用处理器等实现拍摄功能。The electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
ISP用于处理摄像头193反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给 ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头193中。The ISP is used to process the data fed back by the camera 193 . For example, when taking a photo, the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin tone. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 193 .
摄像头193用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备100可以包括1个或N个摄像头193,N为大于1的正整数。本申请实施例中,摄像头193用于在检测的用户的文字输入操作时,捕获用户的人脸图像。Camera 193 is used to capture still images or video. The object is projected through the lens to generate an optical image onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. DSP converts digital image signals into standard RGB, YUV and other formats of image signals. In some embodiments, the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1. In the embodiment of the present application, the camera 193 is used to capture the face image of the user when the user's text input operation is detected.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备100在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
视频编解码器用于对数字视频压缩或解压缩。电子设备100可以支持一种或多种视频编解码器。这样,电子设备100可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU可以实现电子设备100的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, such as the transfer mode between neurons in the human brain, it can quickly process the input information, and can continuously learn by itself. Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
外部存储器接口120可以用于连接外部存储卡,例如Micro SD卡,实现扩展电子设备100的存储能力。外部存储卡通过外部存储器接口120与处理器110通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。The external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100 . The external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example to save files like music, video etc in external memory card.
内部存储器121可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。内部存储器121可以包括存储程序区和存储数据区。其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储电子设备100使用过程中所创建的数据(比如音频数据,电话本等)等。此外,内部存储器121可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等。处理器110通过运行存储在内部存储器121的指令,和/或存储在设置于处理器中的存储器的指令,执行电子设备100的各种功能应用以及数据处理。Internal memory 121 may be used to store computer executable program code, which includes instructions. The internal memory 121 may include a storage program area and a storage data area. The storage program area can store an operating system, an application program required for at least one function (such as a sound playback function, an image playback function, etc.), and the like. The storage data area may store data (such as audio data, phone book, etc.) created during the use of the electronic device 100 and the like. In addition, the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (UFS), and the like. The processor 110 executes various functional applications and data processing of the electronic device 100 by executing instructions stored in the internal memory 121 and/or instructions stored in a memory provided in the processor.
电子设备100可以通过音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
音频模块170用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块170还可以用于对音频信号编码和解码。在一些实施例中,音频模块170可以设置于处理器110中,或将音频模块170的部分功能模块设置于处理器110中。The audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
扬声器170A,也称“喇叭”,用于将音频电信号转换为声音信号。电子设备100可以通过扬声器170A收听音乐,或收听免提通话。Speaker 170A, also referred to as a "speaker", is used to convert audio electrical signals into sound signals. The electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
受话器170B,也称“听筒”,用于将音频电信号转换成声音信号。当电子设备100接听电话或语音信息时,可以通过将受话器170B靠近人耳接听语音。The receiver 170B, also referred to as "earpiece", is used to convert audio electrical signals into sound signals. When the electronic device 100 answers a call or a voice message, the voice can be answered by placing the receiver 170B close to the human ear.
麦克风170C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风170C发声,将声音信号输入到麦克风170C。电子设备100可以设置至少一个麦克风170C。在另一些实施例中,电子设备100可以设置两个麦克风170C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,电子设备100还可以设置三个,四个或更多麦克风170C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。The microphone 170C, also called "microphone" or "microphone", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
耳机接口170D用于连接有线耳机。耳机接口170D可以是USB接口130,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The earphone jack 170D is used to connect wired earphones. The earphone interface 170D may be the USB interface 130, or may be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
压力传感器180A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器180A可以设置于显示屏194。压力传感器180A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器180A,电极之间的电容改变。电子设备100根据电容的变化确定压力的强度。当有触摸操作作用于显示屏194,电子设备100根据压力传感器180A检测所述触摸操作强度。电子设备100也可以根据压力传感器180A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。The pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals. In some embodiments, the pressure sensor 180A may be provided on the display screen 194 . There are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, capacitive pressure sensors, and the like. The capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A. In some embodiments, touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
陀螺仪传感器180B可以用于确定电子设备100的运动姿态。在一些实施例中,可以通过陀螺仪传感器180B确定电子设备100围绕三个轴(即,x,y和z轴)的角速度。陀螺仪传感器180B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器180B检测电子设备100抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备100的抖动,实现防抖。陀螺仪传感器180B还可以用于导航,体感游戏场景。The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 . In some embodiments, the angular velocity of electronic device 100 about three axes (ie, x, y, and z axes) may be determined by gyro sensor 180B. The gyro sensor 180B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 180B detects the angle at which the electronic device 100 shakes, calculates the distance that the lens module needs to compensate for according to the angle, and allows the lens to counteract the shake of the electronic device 100 through reverse motion to achieve anti-shake. The gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
气压传感器180C用于测量气压。在一些实施例中,电子设备100通过气压传感器180C测得的气压值计算海拔高度,辅助定位和导航。The air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
磁传感器180D包括霍尔传感器。电子设备100可以利用磁传感器180D检测翻盖皮套的开合。在一些实施例中,当电子设备100是翻盖机时,电子设备100可以根据磁传感器180D检测翻盖的开合。进而根据检测到的皮套的开合状态或翻盖的开合状态,设置翻盖自动解锁等特性。The magnetic sensor 180D includes a Hall sensor. The electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip machine, the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
加速度传感器180E可检测电子设备100在各个方向上(一般为三轴)加速度的大小。当电子设备100静止时可检测出重力的大小及方向。还可以用于识别电子设备姿态,应用于横竖屏切换,计步器等应用。The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
距离传感器180F,用于测量距离。电子设备100可以通过雷达、红外或激光测量距离。在一些实施例中,拍摄场景,电子设备100可以利用距离传感器180F测距以实现快 速对焦。在一些实施例中,电子设备100也可以利用距离传感器180F测量障碍物的距离和速度。Distance sensor 180F for measuring distance. The electronic device 100 can measure the distance through radar, infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing. In some embodiments, the electronic device 100 may also measure the distance and speed of obstacles using the distance sensor 180F.
接近光传感器180G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管。电子设备100通过发光二极管向外发射红外光。电子设备100使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定电子设备100附近有物体。当检测到不充分的反射光时,电子设备100可以确定电子设备100附近没有物体。电子设备100可以利用接近光传感器180G检测用户手持电子设备100贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器180G也可用于皮套模式,口袋模式自动解锁与锁屏。Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes. The light emitting diodes may be infrared light emitting diodes. The electronic device 100 emits infrared light to the outside through the light emitting diode. Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 . The electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power. The proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
环境光传感器180L用于感知环境光亮度。电子设备100可以根据感知的环境光亮度自适应调节显示屏194亮度。环境光传感器180L也可用于拍照时自动调节白平衡。环境光传感器180L还可以与接近光传感器180G配合,检测电子设备100是否在口袋里,以防误触。The ambient light sensor 180L is used to sense ambient light brightness. The electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness. The ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures. The ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touch.
指纹传感器180H用于采集指纹。电子设备100可以利用采集的指纹特性实现指纹解锁,访问应用锁,指纹拍照,指纹接听来电等。The fingerprint sensor 180H is used to collect fingerprints. The electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, accessing application locks, taking pictures with fingerprints, answering incoming calls with fingerprints, and the like.
温度传感器180J用于检测温度。在一些实施例中,电子设备100利用温度传感器180J检测的温度,执行温度处理策略。例如,当温度传感器180J上报的温度超过阈值,电子设备100执行降低位于温度传感器180J附近的处理器的性能,以便降低功耗实施热保护。在另一些实施例中,当温度低于另一阈值时,电子设备100对电池142加热,以避免低温导致电子设备100异常关机。在其他一些实施例中,当温度低于又一阈值时,电子设备100对电池142的输出电压执行升压,以避免低温导致的异常关机。The temperature sensor 180J is used to detect the temperature. In some embodiments, the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 caused by the low temperature. In some other embodiments, when the temperature is lower than another threshold, the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
触摸传感器180K,也称“触控器件”。触摸传感器180K可以设置于显示屏194,由触摸传感器180K与显示屏194组成触摸屏,也称“触控屏”。触摸传感器180K用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。可以通过显示屏194提供与触摸操作相关的视觉输出。在另一些实施例中,触摸传感器180K也可以设置于电子设备100的表面,与显示屏194所处的位置不同。Touch sensor 180K, also called "touch device". The touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”. The touch sensor 180K is used to detect a touch operation on or near it. The touch sensor can pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to touch operations may be provided through display screen 194 . In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
骨传导传感器180M可以获取振动信号。在一些实施例中,骨传导传感器180M可以获取人体声部振动骨块的振动信号。骨传导传感器180M也可以接触人体脉搏,接收血压跳动信号。在一些实施例中,骨传导传感器180M也可以设置于耳机中,结合成骨传导耳机。音频模块170可以基于所述骨传导传感器180M获取的声部振动骨块的振动信号,解析出语音信号,实现语音功能。应用处理器可以基于所述骨传导传感器180M获取的血压跳动信号解析心率信息,实现心率检测功能。The bone conduction sensor 180M can acquire vibration signals. In some embodiments, the bone conduction sensor 180M can acquire the vibration signal of the vibrating bone mass of the human voice. The bone conduction sensor 180M can also contact the pulse of the human body and receive the blood pressure beating signal. In some embodiments, the bone conduction sensor 180M can also be disposed in the earphone, combined with the bone conduction earphone. The audio module 170 can analyze the voice signal based on the vibration signal of the vocal vibration bone block obtained by the bone conduction sensor 180M, so as to realize the voice function. The application processor can analyze the heart rate information based on the blood pressure beat signal obtained by the bone conduction sensor 180M, and realize the function of heart rate detection.
按键190包括开机键,音量键等。按键190可以是机械按键。也可以是触摸式按键。电子设备100可以接收按键输入,产生与电子设备100的用户设置以及功能控制有关的键信号输入。The keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key. The electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
马达191可以产生振动提示。马达191可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏194不同区域的触摸操作,马达191也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的 振动反馈效果。触摸振动反馈效果还可以支持自定义。Motor 191 can generate vibrating cues. The motor 191 can be used for vibrating alerts for incoming calls, and can also be used for touch vibration feedback. For example, touch operations acting on different applications (such as taking pictures, playing audio, etc.) can correspond to different vibration feedback effects. The motor 191 can also correspond to different vibration feedback effects for touch operations on different areas of the display screen 194 . Different application scenarios (for example: time reminder, receiving information, alarm clock, games, etc.) can also correspond to different vibration feedback effects. The touch vibration feedback effect can also support customization.
指示器192可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
SIM卡接口195用于连接SIM卡。SIM卡可以通过插入SIM卡接口195,或从SIM卡接口195拔出,实现和电子设备100的接触和分离。电子设备100可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口195可以支持Nano SIM卡,Micro SIM卡,SIM卡等。同一个SIM卡接口195可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口195也可以兼容不同类型的SIM卡。SIM卡接口195也可以兼容外部存储卡。电子设备100通过SIM卡和网络交互,实现通话以及数据通信等功能。在一些实施例中,电子设备100采用eSIM,即:嵌入式SIM卡。eSIM卡可以嵌在电子设备100中,不能和电子设备100分离。The SIM card interface 195 is used to connect a SIM card. The SIM card can be contacted and separated from the electronic device 100 by inserting into the SIM card interface 195 or pulling out from the SIM card interface 195 . The electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card and so on. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 195 can also be compatible with different types of SIM cards. The SIM card interface 195 is also compatible with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as call and data communication. In some embodiments, the electronic device 100 employs an eSIM, ie: an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100 .
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述或记载的部分,可以参见其它实施例的相关描述。In the foregoing embodiments, the description of each embodiment has its own emphasis. For parts that are not described or described in detail in a certain embodiment, reference may be made to the relevant descriptions of other embodiments.
所属领域的技术人员可以清楚地了解到,为了描述的方便和简洁,仅以上述各功能单元、模块的划分进行举例说明,实际应用中,可以根据需要而将上述功能分配由不同的功能单元、模块完成,即将所述装置的内部结构划分成不同的功能单元或模块,以完成以上描述的全部或者部分功能。实施例中的各功能单元、模块可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中,上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。另外,各功能单元、模块的具体名称也只是为了便于相互区分,并不用于限制本申请的保护范围。上述系统中单元、模块的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and simplicity of description, only the division of the above-mentioned functional units and modules is used as an example for illustration. In practical applications, the above-mentioned functions can be allocated to different functional units, Module completion, that is, dividing the internal structure of the device into different functional units or modules to complete all or part of the functions described above. Each functional unit and module in the embodiment may be integrated in one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit, and the above-mentioned integrated units may adopt hardware. It can also be realized in the form of software functional units. In addition, the specific names of the functional units and modules are only for the convenience of distinguishing from each other, and are not used to limit the protection scope of the present application. For the specific working processes of the units and modules in the above-mentioned system, reference may be made to the corresponding processes in the foregoing method embodiments, which will not be repeated here.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请实现上述实施例方法中的全部或部分流程,可以通过计算机程序来指令相关的硬件来完成,所述的计算机程序可存储于一计算机可读存储介质中,该计算机程序在被处理器执行时,可实现上述各个方法实施例的步骤。其中,所述计算机程序包括计算机程序代码,所述计算机程序代码可以为源代码形式、对象代码形式、可执行文件或某些中间形式等。所述计算机可读介质至少可以包括:能够将计算机程序代码携带到拍照装置/电子设备的任何实体或装置、记录介质、计算机存储器、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、电载波信号、电信信号以及软件分发介质。例如U盘、移动硬盘、磁碟或者光盘等。The integrated unit, if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium. Based on this understanding, all or part of the processes in the methods of the above embodiments can be implemented by a computer program to instruct the relevant hardware. The computer program can be stored in a computer-readable storage medium, and the computer program When executed by a processor, the steps of each of the above method embodiments can be implemented. Wherein, the computer program includes computer program code, and the computer program code may be in the form of source code, object code, executable file or some intermediate form, and the like. The computer-readable medium may include at least: any entity or device capable of carrying the computer program code to the photographing device/electronic device, recording medium, computer memory, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), electrical carrier signals, telecommunication signals, and software distribution media. For example, U disk, mobile hard disk, disk or CD, etc.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
在本申请所提供的实施例中,应该理解到,所揭露的装置/网络设备和方法,可以通过其它的方式实现。例如,以上所描述的装置/网络设备实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不 执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通讯连接可以是通过一些接口,装置或单元的间接耦合或通讯连接,可以是电性,机械或其它的形式。In the embodiments provided in this application, it should be understood that the disclosed apparatus/network device and method may be implemented in other manners. For example, the apparatus/network device embodiments described above are only illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods, such as multiple units. Or components may be combined or may be integrated into another system, or some features may be omitted, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art can realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of this application.
最后应说明的是:以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何在本申请揭露的技术范围内的变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。Finally, it should be noted that: the above are only specific embodiments of the present application, but the protection scope of the present application is not limited to this, and any changes or replacements within the technical scope disclosed in the present application should be covered by the present application. within the scope of protection of the application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.

Claims (10)

  1. 一种文字输入方法,其特征在于,包括:A text input method, comprising:
    在检测到用户的文字输入操作时,获取所述用户的唇部变化信息以及所述用户输入的字符信息,所述唇部变化信息包括所述用户在说出待输入的文字时的唇部特征序列;When a text input operation by the user is detected, obtain the lip change information of the user and the character information input by the user, where the lip change information includes the lip features of the user when he speaks the text to be input sequence;
    根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字。The text to be input by the user is determined according to the lip feature sequence and the character information.
  2. 根据权利要求1所述的方法,其特征在于,所述获取所述用户的唇部变化信息,包括:The method according to claim 1, wherein the acquiring the lip change information of the user comprises:
    通过摄像头采集包含所述用户的嘴唇区域的图像序列,从所述图像序列的每张图像中提取唇部特征,获得唇部特征序列。A camera is used to collect an image sequence including the lip region of the user, and lip features are extracted from each image of the image sequence to obtain a lip feature sequence.
  3. 根据权利要求1所述的方法,其特征在于,所述获取所述用户的唇部变化信息,包括:The method according to claim 1, wherein the acquiring the lip change information of the user comprises:
    发射无线信号,并获取反射信号序列,其中所述反射信号序列中的反射信号是所述无线信号在碰到障碍物后反射回来的信号;transmitting a wireless signal, and acquiring a reflected signal sequence, wherein the reflected signal in the reflected signal sequence is a signal reflected back by the wireless signal after encountering an obstacle;
    根据所述反射信号序列确定障碍物,若所述障碍物为嘴唇,则从所述反射信号序列的每个反射信号中提取唇部特征,获得唇部特征序列。An obstacle is determined according to the reflected signal sequence, and if the obstacle is a lip, a lip feature is extracted from each reflected signal of the reflected signal sequence to obtain a lip feature sequence.
  4. 根据权利要求1~3任一项所述的方法,其特征在于,所述字符信息包括所述用户待输入的文字的第一首字母。The method according to any one of claims 1 to 3, wherein the character information includes the first letter of the character to be input by the user.
  5. 根据权利要求4所述的方法,其特征在于,所述根据所述唇部特征序列以及所述字符信息确定所述用户待输入的文字,包括:The method according to claim 4, wherein the determining the text to be input by the user according to the lip feature sequence and the character information comprises:
    确定所述唇部特征序列对应的文字序列;determining the character sequence corresponding to the lip feature sequence;
    根据所述第一首字母,对所述文字序列进行纠正处理,获得至少一个纠正后的候选文字序列;Correcting the character sequence according to the first initial to obtain at least one corrected candidate character sequence;
    从所述候选文字序列中确定概率最大的候选文字序列,将所述概率最大的候选文字序列作为所述用户待输入的文字。A candidate character sequence with the highest probability is determined from the candidate character sequence, and the candidate character sequence with the highest probability is used as the character to be input by the user.
  6. 根据权利要求5所述的方法,其特征在于,所述根据所述第一首字母,对所述文字序列进行纠正处理,获得至少一个纠正后的候选文字序列,包括:The method according to claim 5, wherein, performing correction processing on the character sequence according to the first initial to obtain at least one corrected candidate character sequence, comprising:
    提取所述文字序列中每个文字的第二首字母;extracting the second initial of each character in the sequence of characters;
    将提取的所述第二首字母与所述第一首字母进行匹配;matching the extracted second initial with the first initial;
    若存在不匹配的第二首字母,则将所述不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,将所述替换后的文字序列作为所述候选文字序列。If there is an unmatched second initial letter, replace the unmatched second initial letter with the corresponding first initial letter to obtain at least one replaced text sequence, and use the replaced text sequence as the Candidate text sequences.
  7. 根据权利要求6所述的方法,其特征在于,所述若存在不匹配的第二首字母,则将所述不匹配的第二首字母替换为对应的第一首字母,得到替换后的至少一个文字序列,包括:The method according to claim 6, wherein if there is an unmatched second initial, replacing the unmatched second initial with a corresponding first initial to obtain at least the replaced first letter. A literal sequence consisting of:
    若存在不匹配的第二首字母,且存在与对应的第一首字母的关联的字母,则将所述不匹配的第二首字母替换为对应的第一首字母,以及将所述不匹配的第二首字母替换为关联的字母,得到替换后的至少一个文字序列。If there is an unmatched second initial, and there is a letter associated with the corresponding first initial, then replace the unmatched second initial with the corresponding first initial, and replace the unmatched second initial with the corresponding first initial The second initial of is replaced with the associated letter, resulting in at least one literal sequence after the replacement.
  8. 根据权利要求5~7任一项所述的方法,其特征在于,所述确定所述唇部特征序 列对应的文字序列,包括:The method according to any one of claims 5 to 7, wherein the determining the character sequence corresponding to the lip feature sequence comprises:
    将所述唇部特征序列输入训练后的唇语识别模型,获得所述唇语识别模型输出的文字序列,所述唇语识别模型用于识别唇部特征对应的文字,所述唇语识别模型是基于唇部特征,以及唇部特征对应的文字作为训练样本训练得到的。Inputting the lip feature sequence into a trained lip language recognition model to obtain a text sequence output by the lip language recognition model, the lip language recognition model is used to recognize the text corresponding to the lip features, and the lip language recognition model It is trained based on lip features and the text corresponding to the lip features as training samples.
  9. 一种电子设备,其特征在于,包括处理器,所述处理器用于执行存储在存储器中的计算机程序,以实现如权利要求1~8任一项所述的方法。An electronic device, characterized in that it includes a processor, and the processor is configured to execute a computer program stored in a memory, so as to implement the method according to any one of claims 1 to 8.
  10. 一种计算机可读存储介质,所述计算机可读存储介质存储有计算机程序,其特征在于,所述计算机程序被处理器执行时实现如权利要求1~8任一项所述的方法。A computer-readable storage medium storing a computer program, characterized in that, when the computer program is executed by a processor, the method according to any one of claims 1 to 8 is implemented.
PCT/CN2021/116515 2020-09-27 2021-09-03 Text input method, electronic device, and computer-readable storage medium WO2022062884A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011036037.1 2020-09-27
CN202011036037.1A CN114356109A (en) 2020-09-27 2020-09-27 Character input method, electronic device and computer readable storage medium

Publications (1)

Publication Number Publication Date
WO2022062884A1 true WO2022062884A1 (en) 2022-03-31

Family

ID=80844894

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/116515 WO2022062884A1 (en) 2020-09-27 2021-09-03 Text input method, electronic device, and computer-readable storage medium

Country Status (2)

Country Link
CN (1) CN114356109A (en)
WO (1) WO2022062884A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115601575A (en) * 2022-10-25 2023-01-13 扬州市职业大学(扬州开放大学)(Cn) Method and system for assisting in commonly used expression of aphasia and typographer

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11615781B2 (en) * 2019-10-18 2023-03-28 Google Llc End-to-end multi-speaker audio-visual automatic speech recognition

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1704877A (en) * 2004-05-26 2005-12-07 华为技术有限公司 Words input method and apparatus for hand-held devices
CN102117115A (en) * 2009-12-31 2011-07-06 上海量科电子科技有限公司 System for realizing text entry selection by using lip-language and realization method thereof
JP2011186994A (en) * 2010-03-11 2011-09-22 Fujitsu Ltd Character input device and character input method
CN104217218A (en) * 2014-09-11 2014-12-17 广州市香港科大霍英东研究院 Lip language recognition method and system
JP2015172848A (en) * 2014-03-12 2015-10-01 株式会社ゼンリンデータコム lip reading input device, lip reading input method and lip reading input program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1704877A (en) * 2004-05-26 2005-12-07 华为技术有限公司 Words input method and apparatus for hand-held devices
CN102117115A (en) * 2009-12-31 2011-07-06 上海量科电子科技有限公司 System for realizing text entry selection by using lip-language and realization method thereof
JP2011186994A (en) * 2010-03-11 2011-09-22 Fujitsu Ltd Character input device and character input method
JP2015172848A (en) * 2014-03-12 2015-10-01 株式会社ゼンリンデータコム lip reading input device, lip reading input method and lip reading input program
CN104217218A (en) * 2014-09-11 2014-12-17 广州市香港科大霍英东研究院 Lip language recognition method and system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115601575A (en) * 2022-10-25 2023-01-13 扬州市职业大学(扬州开放大学)(Cn) Method and system for assisting in commonly used expression of aphasia and typographer
CN115601575B (en) * 2022-10-25 2023-10-31 扬州市职业大学(扬州开放大学) Method and system for assisting expression of common expressions of aphasia and aphasia writers

Also Published As

Publication number Publication date
CN114356109A (en) 2022-04-15

Similar Documents

Publication Publication Date Title
CN114365476A (en) Shooting method and equipment
WO2022193989A1 (en) Operation method and apparatus for electronic device and electronic device
CN111742539B (en) Voice control command generation method and terminal
WO2022100685A1 (en) Drawing command processing method and related device therefor
CN113542580B (en) Method and device for removing light spots of glasses and electronic equipment
WO2022062884A1 (en) Text input method, electronic device, and computer-readable storage medium
CN110742580A (en) Sleep state identification method and device
CN114650363A (en) Image display method and electronic equipment
CN114242037A (en) Virtual character generation method and device
WO2022042768A1 (en) Index display method, electronic device, and computer readable storage medium
CN113672756A (en) Visual positioning method and electronic equipment
CN114880251A (en) Access method and access device of storage unit and terminal equipment
CN115589051A (en) Charging method and terminal equipment
CN111104295A (en) Method and equipment for testing page loading process
CN112584037B (en) Method for saving image and electronic equipment
CN113467735A (en) Image adjusting method, electronic device and storage medium
CN115641867B (en) Voice processing method and terminal equipment
CN109285563B (en) Voice data processing method and device in online translation process
WO2022214004A1 (en) Target user determination method, electronic device and computer-readable storage medium
WO2022095752A1 (en) Frame demultiplexing method, electronic device and storage medium
WO2022022319A1 (en) Image processing method, electronic device, image processing system and chip system
WO2022007757A1 (en) Cross-device voiceprint registration method, electronic device and storage medium
CN114120987B (en) Voice wake-up method, electronic equipment and chip system
CN115393676A (en) Gesture control optimization method and device, terminal and storage medium
CN114822525A (en) Voice control method and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21871256

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21871256

Country of ref document: EP

Kind code of ref document: A1