WO2022135254A1 - Text editing method, electronic device and system - Google Patents
Text editing method, electronic device and system Download PDFInfo
- Publication number
- WO2022135254A1 WO2022135254A1 PCT/CN2021/138655 CN2021138655W WO2022135254A1 WO 2022135254 A1 WO2022135254 A1 WO 2022135254A1 CN 2021138655 W CN2021138655 W CN 2021138655W WO 2022135254 A1 WO2022135254 A1 WO 2022135254A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- electronic device
- text content
- content
- text
- information
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 107
- 230000004044 response Effects 0.000 claims abstract description 121
- 230000006870 function Effects 0.000 claims description 62
- 230000015654 memory Effects 0.000 claims description 29
- 238000004590 computer program Methods 0.000 claims description 18
- 230000004048 modification Effects 0.000 claims description 4
- 238000012986 modification Methods 0.000 claims description 4
- 238000004891 communication Methods 0.000 description 36
- 230000006854 communication Effects 0.000 description 36
- 101100264195 Caenorhabditis elegans app-1 gene Proteins 0.000 description 34
- 230000008569 process Effects 0.000 description 23
- 238000007726 management method Methods 0.000 description 22
- 238000012545 processing Methods 0.000 description 17
- 230000008859 change Effects 0.000 description 13
- 230000005236 sound signal Effects 0.000 description 13
- 238000010586 diagram Methods 0.000 description 12
- 238000010295 mobile communication Methods 0.000 description 11
- 210000000988 bone and bone Anatomy 0.000 description 10
- 230000000694 effects Effects 0.000 description 10
- 230000005540 biological transmission Effects 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 210000004027 cell Anatomy 0.000 description 6
- 235000012149 noodles Nutrition 0.000 description 6
- 238000013528 artificial neural network Methods 0.000 description 4
- 238000001514 detection method Methods 0.000 description 4
- 235000019633 pungent taste Nutrition 0.000 description 4
- 230000001360 synchronised effect Effects 0.000 description 4
- 229920001621 AMOLED Polymers 0.000 description 3
- 230000001133 acceleration Effects 0.000 description 3
- 238000006243 chemical reaction Methods 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 230000003287 optical effect Effects 0.000 description 3
- 230000002159 abnormal effect Effects 0.000 description 2
- 230000036772 blood pressure Effects 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 2
- 235000013305 food Nutrition 0.000 description 2
- 230000003993 interaction Effects 0.000 description 2
- 230000002093 peripheral effect Effects 0.000 description 2
- 239000002096 quantum dot Substances 0.000 description 2
- 230000005855 radiation Effects 0.000 description 2
- 238000009877 rendering Methods 0.000 description 2
- 238000005070 sampling Methods 0.000 description 2
- 230000000007 visual effect Effects 0.000 description 2
- 230000003416 augmentation Effects 0.000 description 1
- 230000003190 augmentative effect Effects 0.000 description 1
- 238000010009 beating Methods 0.000 description 1
- 230000007175 bidirectional communication Effects 0.000 description 1
- 230000002457 bidirectional effect Effects 0.000 description 1
- 238000013529 biological neural network Methods 0.000 description 1
- 210000004556 brain Anatomy 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 230000019771 cognition Effects 0.000 description 1
- 230000000295 complement effect Effects 0.000 description 1
- 239000004020 conductor Substances 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 230000004927 fusion Effects 0.000 description 1
- 230000005484 gravity Effects 0.000 description 1
- 230000003862 health status Effects 0.000 description 1
- 230000001939 inductive effect Effects 0.000 description 1
- 239000010985 leather Substances 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 230000007774 longterm Effects 0.000 description 1
- 239000011159 matrix material Substances 0.000 description 1
- 210000002569 neuron Anatomy 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000003238 somatosensory effect Effects 0.000 description 1
- 230000006641 stabilisation Effects 0.000 description 1
- 238000011105 stabilization Methods 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
- 230000001755 vocal effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/103—Formatting, i.e. changing of presentation of documents
- G06F40/109—Font handling; Temporal or kinetic typography
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/166—Editing, e.g. inserting or deleting
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
Definitions
- the present application relates to the field of terminals, and more particularly, to a method, electronic device and system for editing text.
- ASR automatic speech recognition
- the present application provides a method, electronic device and system for editing text, which help to improve the editing efficiency of text editing by users.
- a system in a first aspect, includes a first electronic device and a second electronic device, wherein the first electronic device is used to acquire audio content; the first electronic device is further configured to report to the second electronic device Send first information, where the first information is the audio content, or the first information is the first text content corresponding to the audio content; the second electronic device is configured to display the first text content according to the first information ; the second electronic device is further configured to display the second text content in response to the user's editing operation on the first text content, where the second text content is the edited text content of the first text content.
- the first electronic device can send the acquired text content corresponding to the audio content to the second electronic device, so that the text content can be displayed on the second electronic device, so that it is convenient for the user to display the text content on the second electronic device Edit the text content to help improve the efficiency of the user when editing the text content.
- the first electronic device may store information of one or more electronic devices, and when the first electronic device acquires audio, the first electronic device may select a suitable one from the one or more electronic devices A second electronic device that performs text editing, thereby sending the first information to the second electronic device.
- the second electronic device in response to receiving the first information, may open a first application, and display the first text content in the first application; the second electronic device may The input method of the second electronic device edits the first text content.
- the second electronic device is further configured to send the second text content to the first electronic device.
- the second electronic device is further configured to send the second text content to the first electronic device when the first operation of the user is detected.
- the first operation is an operation in which the user clicks to save.
- the second electronic device may acquire the text content edited by the user and send the edited text content to the first electronic device, so that the first electronic device may save the edited text content.
- the edited text content can also be obtained by the first electronic device without any additional operation by the user.
- the editing operation includes a format modification operation for the first text content, wherein the second electronic device is further configured to send the first electronic device the Format information of the second text content.
- the second electronic device may also send the format information of the edited text content to the first electronic device, so that the first electronic device can restore the second electronic device according to the format information.
- User-edited text content on an electronic device when the user modifies the format of the text content, the second electronic device may also send the format information of the edited text content to the first electronic device, so that the first electronic device can restore the second electronic device according to the format information.
- the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- One or more types of carriage return operations in the text content include the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- the first electronic device is further configured to: display the first electronic device according to the audio content before receiving the second text content sent by the second electronic device a text content; after receiving the second text content sent by the second electronic device, replace the first text content with the second text content.
- the first electronic device may display the corresponding first text content when acquiring the audio content, and after the first electronic device receives the user-edited second text content sent by the second electronic device, the first electronic device The electronic device can replace the previous first text content with the second text content, so that it is convenient for the user to watch the edited text content on the first electronic device and the second electronic device at the same time, which helps to improve the user's experience.
- the first electronic device is further configured to send a query request, and the query request is used for the device that received the query request to determine whether it has a text editing function; in response to receiving the query request To the response sent by the second electronic device, the first information is sent to the second electronic device, where the response is used to indicate that the second electronic device has a text editing function.
- the first electronic device before sending the first information to the second electronic device, may inquire about a device with a text editing function. After determining that the second electronic device has a text editing function, the first electronic device can send the first information to the second electronic device, so as to facilitate the user to edit the text content on the second electronic device and help improve the user's ability to Efficiency when editing text.
- the second electronic device is specifically configured to: in response to receiving the first information, prompt the user whether to perform text editing on the second electronic device; The user determines to perform a text editing operation on the second electronic device to display the first text content.
- the second electronic device when the second electronic device receives the first information, it may first prompt the user whether to agree to perform text editing on the second electronic device. If the second electronic device detects the operation agreed by the user, it can display the text content, which helps to avoid interference to the user, and the user can choose a device he thinks is suitable for text editing, which helps to improve the user's experience.
- the first electronic device may also send request information to the second electronic device, where the request information is used to request the second electronic device to edit the first text content; the second electronic device may respond to The request information prompts the user whether to agree to edit the text content on the second electronic device; if the second electronic device detects that the user agrees to edit the text content on the second electronic device, the second electronic device can display the First text content.
- the first information is the first text content
- the first electronic device is specifically configured to: in response to acquiring the audio content, convert the audio content into the first text content; sending the first text content to the second electronic device.
- the first electronic device after acquiring the audio content, can convert the audio content into text content, so as to send the text content to the second electronic device, and the second electronic device can display the corresponding text content, thereby facilitating the The user edits the text content.
- the first information is the audio content
- the second electronic device is specifically configured to: in response to receiving the audio content, convert the audio content into the audio content a text content; display the first text content.
- the first electronic device may send the acquired audio content to the second electronic device, and the second electronic device may convert the audio content into text content and display the text content, thereby facilitating the user to edit the text content .
- the account of the first electronic device is associated with the account of the second electronic device.
- a text editing method is provided, the method is applied to a first electronic device, the method includes: the first electronic device acquires audio content; the first electronic device sends first information to the second electronic device, The first information is the audio content, or the first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects the user's An editing operation for text content.
- the method further includes: receiving, by the first electronic device, second text content sent by the second electronic device, where the second text content Text content after editing the first text content on the electronic device.
- the method further includes: receiving, by the first electronic device, format information of the second text content sent by the second electronic device.
- the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- One or more types of carriage return operations in the text content include the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- the method further includes: before receiving the second text content sent by the second electronic device, the first electronic device displays the first electronic device according to the audio content a text content; after receiving the second text content sent by the second electronic device, the first electronic device replaces the first text content with the second text content.
- the method includes: before sending the first information to the second electronic device, the first electronic device sends a query request, where the query request is used to receive The device of the query request determines whether it has a text editing function; in response to receiving a response sent by the second electronic device, the first electronic device sends the first information to the second electronic device, and the response is used to indicate the second electronic device.
- Electronic devices have text editing capabilities.
- the first information is the first text content
- the method further includes: in response to acquiring the audio content, the first electronic device converts the audio content is the first text content; the first electronic device sends the first text content to the second electronic device.
- the account of the first electronic device is associated with the account of the second electronic device.
- a text editing method is provided, the method is applied to a second electronic device, and the method includes: the second electronic device receives first information sent by the first electronic device, where the first information is the first information The audio content obtained by the electronic device, or the first information is the first text content corresponding to the audio content; the second electronic device displays the first text content according to the first information; In the content editing operation, the second electronic device displays the second text content, and the second text content is the edited text content of the first text content.
- the method further includes: the second electronic device sending the second text content to the first electronic device.
- the editing operation includes a format modification operation for the first text content
- the method further includes: the second electronic device sends the first electronic device the first electronic device. 2.
- the format information of the text content is included in some implementations of the third aspect.
- the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- One or more types of carriage return operations in the text content include the font color, font size, font background color, font inclination, font underline and the second text content of the second text content.
- the method further includes: before receiving the first information sent by the first electronic device, receiving, by the second electronic device, a query sent by the first electronic device request, the query requests the second electronic device to determine whether it has a text editing function; the second electronic device sends a response to the first electronic device, where the response is used to indicate that the second electronic device has a text editing function.
- the second electronic device displays the first text content according to the first information, including: in response to receiving the first information, the second electronic device Prompting the user whether to perform text editing on the second electronic device; in response to the user determining that the text editing is performed on the second electronic device, the second electronic device displays the first text content.
- the first information is the audio content
- the method further includes: in response to receiving the audio content , and convert the audio content into the first text content.
- the account of the first electronic device is associated with the account of the second electronic device.
- an apparatus comprising: an acquiring unit for acquiring audio content; a sending unit for transmitting first information to a second electronic device, where the first information is the audio content, or the The first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects a user's editing operation on the first text content.
- an apparatus comprising: a receiving unit configured to receive first information sent by a first electronic device, where the first information is audio content acquired by the first electronic device, or the first information The information is the first text content corresponding to the audio content; the display unit is used to display the first text content according to the first information; the detection unit is used to detect the user's editing operation on the first text content; the display unit, It is also used to display the second text content, where the second text content is the edited text content of the first text content.
- an electronic device comprising: one or more processors; a memory; and one or more computer programs. Wherein, one or more computer programs are stored in the memory, the one or more computer programs comprising instructions. When the instructions are executed by the electronic device, the electronic device is caused to execute the method in any one of the possible implementations of the second aspect above.
- an electronic device comprising: one or more processors; a memory; and one or more computer programs. Wherein, one or more computer programs are stored in the memory, the one or more computer programs comprising instructions. When the instructions are executed by the electronic device, the electronic device is caused to execute the method in any one of the possible implementations of the third aspect above.
- a computer program product containing instructions, when the computer program product is run on a first electronic device, the electronic device causes the electronic device to perform the method described in the second aspect; or, when the computer program product is executed on a first electronic device
- the computer program product when running on the second electronic device, causes the electronic device to perform the method of the third aspect above.
- a computer-readable storage medium comprising instructions, when the instructions are executed on a first electronic device, the electronic device causes the electronic device to perform the method described in the second aspect; or, when the instructions are executed When the instruction is executed on the second electronic device, the electronic device is caused to execute the method described in the third aspect.
- a chip for executing an instruction, and when the chip is running, the chip executes the method described in the second aspect; or, the chip executes the method described in the third aspect.
- FIG. 1 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
- FIG. 2 is a block diagram of a software structure provided by an embodiment of the present application.
- FIG. 3 is a set of graphical user interfaces provided by an embodiment of the present application.
- FIG. 4 is another set of graphical user interfaces provided by an embodiment of the present application.
- FIG. 5 is another set of graphical user interfaces provided by the embodiments of the present application.
- FIG. 6 is another set of graphical user interfaces provided by an embodiment of the present application.
- FIG. 7 is another set of graphical user interfaces provided by the embodiments of the present application.
- FIG. 8 is another set of graphical user interfaces provided by the embodiments of the present application.
- FIG. 9 is another set of graphical user interfaces provided by the embodiments of the present application.
- FIG. 10 is another set of graphical user interfaces provided by the embodiments of the present application.
- FIG. 11 is a schematic diagram of a system architecture provided by an embodiment of the present application.
- FIG. 12 is a schematic flow chart of a method for editing text in accordance with an embodiment of the present application.
- FIG. 13 is a schematic diagram of another system architecture provided by an embodiment of the present application.
- FIG. 14 is a schematic structural diagram of an apparatus provided by an embodiment of the present application.
- FIG. 15 is another schematic structural diagram of an apparatus provided by an embodiment of the present application.
- FIG. 16 is another schematic structural diagram of an electronic device provided by an embodiment of the present application.
- first and second are only used for descriptive purposes, and should not be construed as indicating or implying relative importance or implicitly indicating the number of indicated technical features.
- a feature defined as “first” or “second” may expressly or implicitly include one or more of that feature.
- plural means two or more.
- the methods provided in the embodiments of the present application can be applied to mobile phones, tablet computers, wearable devices, in-vehicle devices, augmented reality (AR)/virtual reality (VR) devices, notebook computers, ultra-mobile personal computers (ultra mobile personal computers) -mobile personal computer, UMPC), netbook, personal digital assistant (personal digital assistant, PDA) and other electronic devices, the embodiments of the present application do not impose any restrictions on the specific types of electronic devices.
- AR augmented reality
- VR virtual reality
- notebook computers notebook computers
- netbook personal digital assistant
- PDA personal digital assistant
- FIG. 1 shows a schematic structural diagram of an electronic device 100 .
- the electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and User identification (subscriber identification module, SIM) card interface 195 and so on.
- SIM Subscriber identification module
- the sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
- the structures illustrated in the embodiments of the present application do not constitute a specific limitation on the electronic device 100 .
- the electronic device 100 may include more or less components than shown, or combine some components, or separate some components, or arrange different components.
- the illustrated components may be implemented in hardware, software, or a combination of software and hardware.
- the processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) Wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
- application processor application processor, AP
- modem processor graphics processor
- graphics processor graphics processor
- ISP image signal processor
- controller memory
- video codec digital signal processor
- DSP digital signal processor
- NPU neural-network processing unit
- the controller may be the nerve center and command center of the electronic device 100 .
- the controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
- a memory may also be provided in the processor 110 for storing instructions and data.
- the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and the latency of the processor 110 is reduced, thereby increasing the efficiency of the system.
- the processor 110 may include one or more interfaces.
- the interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
- I2C integrated circuit
- I2S integrated circuit built-in audio
- PCM pulse code modulation
- PCM pulse code modulation
- UART universal asynchronous transceiver
- MIPI mobile industry processor interface
- GPIO general-purpose input/output
- SIM subscriber identity module
- USB universal serial bus
- the I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL).
- the processor 110 may contain multiple sets of I2C buses.
- the processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces.
- the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
- the I2S interface can be used for audio communication.
- the processor 110 may contain multiple sets of I2S buses.
- the processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 .
- the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
- the PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals.
- the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface.
- the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
- the UART interface is a universal serial data bus used for asynchronous communication.
- the bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication.
- a UART interface is typically used to connect the processor 110 with the wireless communication module 160 .
- the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function.
- the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
- the MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 .
- MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc.
- the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 .
- the processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
- the GPIO interface can be configured by software.
- the GPIO interface can be configured as a control signal or as a data signal.
- the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like.
- the GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
- the USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like.
- the USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones.
- the interface can also be used to connect other electronic devices, such as AR devices.
- the interface connection relationship between the modules illustrated in the embodiments of the present application is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 .
- the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
- the charging management module 140 is used to receive charging input from the charger.
- the charger may be a wireless charger or a wired charger.
- the charging management module 140 may receive charging input from the wired charger through the USB interface 130 .
- the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device 100. While the charging management module 140 charges the battery 142 , it can also supply power to the electronic device through the power management module 141 .
- the power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 .
- the power management module 141 receives input from the battery 142 and/or the charging management module 140 and supplies power to the processor 110 , the internal memory 121 , the external memory, the display screen 194 , the camera 193 , and the wireless communication module 160 .
- the power management module 141 can also be used to monitor parameters such as battery capacity, battery cycle times, battery health status (leakage, impedance).
- the power management module 141 may also be provided in the processor 110 .
- the power management module 141 and the charging management module 140 may also be provided in the same device.
- the wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
- Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
- Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization.
- the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
- the mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 .
- the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like.
- the mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation.
- the mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 .
- at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 .
- at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
- the modem processor may include a modulator and a demodulator.
- the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal.
- the demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing.
- the low frequency baseband signal is processed by the baseband processor and passed to the application processor.
- the application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 .
- the modem processor may be a stand-alone device.
- the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
- the wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), Bluetooth (BT), global navigation satellites System (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field communication technology (near field communication, NFC), infrared technology (infrared, IR) and other wireless communication solutions.
- WLAN wireless local area networks
- BT Bluetooth
- GNSS global navigation satellites System
- frequency modulation frequency modulation, FM
- NFC near field communication technology
- infrared technology infrared, IR
- the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
- the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
- the wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves
- the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology.
- the wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc.
- the GNSS may include a global positioning system (global positioning system, GPS), a global navigation satellite system (GLONASS), a Beidou navigation satellite system (BDS), a quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
- GPS global positioning system
- GLONASS global navigation satellite system
- BDS Beidou navigation satellite system
- QZSS quasi-zenith satellite system
- SBAS satellite based augmentation systems
- the electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like.
- the GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor.
- the GPU is used to perform mathematical and geometric calculations for graphics rendering.
- Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
- Display screen 194 is used to display images, videos, and the like.
- Display screen 194 includes a display panel.
- the display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light).
- LED diode AMOLED
- flexible light-emitting diode flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diode (quantum dot light emitting diodes, QLED) and so on.
- the electronic device 100 may include one or N display screens 194 , where N is a positive integer greater than one.
- the electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
- the ISP is used to process the data fed back by the camera 193 .
- the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye.
- ISP can also perform algorithm optimization on image noise, brightness, and skin tone.
- ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
- the ISP may be provided in the camera 193 .
- the camera 193 is used to capture still images or video.
- the object is projected through the lens to generate an optical image onto the photosensitive element.
- the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
- CMOS complementary metal-oxide-semiconductor
- the photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
- the ISP outputs the digital image signal to the DSP for processing.
- DSP converts digital image signals into standard RGB, YUV and other formats of image signals.
- the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
- a digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
- Video codecs are used to compress or decompress digital video.
- the electronic device 100 may support one or more video codecs.
- the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
- MPEG Moving Picture Experts Group
- MPEG2 moving picture experts group
- MPEG3 MPEG4
- MPEG4 Moving Picture Experts Group
- the NPU is a neural-network (NN) computing processor.
- NN neural-network
- Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
- the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100 .
- the external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example to save files like music, video etc in external memory card.
- Internal memory 121 may be used to store computer executable program code, which includes instructions.
- the processor 110 executes various functional applications and data processing of the electronic device 100 by executing the instructions stored in the internal memory 121 .
- the internal memory 121 may include a storage program area and a storage data area.
- the storage program area can store an operating system, an application program required for at least one function (such as a sound playback function, an image playback function, etc.), and the like.
- the storage data area may store data (such as audio data, phone book, etc.) created during the use of the electronic device 100 and the like.
- the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (UFS), and the like.
- the electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
- the audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
- Speaker 170A also referred to as a "speaker" is used to convert audio electrical signals into sound signals.
- the electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
- the receiver 170B also referred to as "earpiece" is used to convert audio electrical signals into sound signals.
- the voice can be answered by placing the receiver 170B close to the human ear.
- the microphone 170C also called “microphone” or “microphone” is used to convert sound signals into electrical signals.
- the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C.
- the electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
- the earphone jack 170D is used to connect wired earphones.
- the earphone interface 170D can be the USB interface 130, or can be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
- OMTP open mobile terminal platform
- CTIA cellular telecommunications industry association of the USA
- the pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals.
- the pressure sensor 180A may be provided on the display screen 194 .
- the capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes.
- the electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A.
- the electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A.
- touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
- the gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 .
- the angular velocity of electronic device 100 about three axes ie, x, y, and z axes
- the gyro sensor 180B can be used for image stabilization.
- the gyro sensor 180B detects the angle at which the electronic device 100 shakes, calculates the distance that the lens module needs to compensate for according to the angle, and allows the lens to counteract the shake of the electronic device 100 through reverse motion to achieve anti-shake.
- the gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
- the air pressure sensor 180C is used to measure air pressure.
- the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
- the magnetic sensor 180D includes a Hall sensor.
- the electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D.
- the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
- the acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes).
- the magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
- the electronic device 100 can measure the distance through infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing.
- Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes.
- the light emitting diodes may be infrared light emitting diodes.
- the electronic device 100 emits infrared light to the outside through the light emitting diode.
- Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 .
- the electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power.
- Proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
- the ambient light sensor 180L is used to sense ambient light brightness.
- the electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness.
- the ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures.
- the ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touch.
- the fingerprint sensor 180H is used to collect fingerprints.
- the electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, accessing application locks, taking pictures with fingerprints, answering incoming calls with fingerprints, and the like.
- the temperature sensor 180J is used to detect the temperature.
- the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection.
- the electronic device 100 when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 caused by the low temperature.
- the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
- Touch sensor 180K also called “touch panel”.
- the touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”.
- the touch sensor 180K is used to detect a touch operation on or near it.
- the touch sensor can pass the detected touch operation to the application processor to determine the type of touch event.
- Visual output related to touch operations may be provided through display screen 194 .
- the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
- the bone conduction sensor 180M can acquire vibration signals.
- the bone conduction sensor 180M can acquire the vibration signal of the vibrating bone mass of the human voice.
- the bone conduction sensor 180M can also contact the pulse of the human body and receive the blood pressure beating signal.
- the bone conduction sensor 180M can also be disposed in the earphone, combined with the bone conduction earphone.
- the audio module 170 can analyze the voice signal based on the vibration signal of the vocal vibration bone block obtained by the bone conduction sensor 180M, so as to realize the voice function.
- the application processor can analyze the heart rate information based on the blood pressure beat signal obtained by the bone conduction sensor 180M, so as to realize the function of heart rate detection.
- the keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key.
- the electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
- Motor 191 can generate vibrating cues.
- the motor 191 can be used for vibrating alerts for incoming calls, and can also be used for touch vibration feedback.
- touch operations acting on different applications can correspond to different vibration feedback effects.
- the motor 191 can also correspond to different vibration feedback effects for touch operations on different areas of the display screen 194 .
- Different application scenarios for example: time reminder, receiving information, alarm clock, games, etc.
- the touch vibration feedback effect can also support customization.
- the indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
- the SIM card interface 195 is used to connect a SIM card.
- the SIM card can be contacted and separated from the electronic device 100 by inserting into the SIM card interface 195 or pulling out from the SIM card interface 195 .
- the electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1.
- the SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card and so on. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different.
- the SIM card interface 195 can also be compatible with different types of SIM cards.
- the SIM card interface 195 is also compatible with external memory cards.
- the electronic device 100 interacts with the network through the SIM card to implement functions such as call and data communication.
- the electronic device 100 adopts an embedded SIM (embedded-SIM, eSIM) card, that is, an embedded SIM card.
- the eSIM card can be embedded in the electronic device 100 and cannot be separated from the
- the telephone cards in the embodiments of the present application include but are not limited to SIM cards, eSIM cards, universal subscriber identity modules (USIM), universal integrated telephone cards (universal integrated circuit cards, UICC) and the like.
- the software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture.
- the embodiments of the present application take an Android system with a layered architecture as an example to exemplarily describe the software structure of the electronic device 100 .
- FIG. 2 is a block diagram of the software structure of the electronic device 100 according to the embodiment of the present application.
- the layered architecture divides the software into several layers, and each layer has a clear role and division of labor. Layers communicate with each other through software interfaces.
- the Android system is divided into four layers, which are, from top to bottom, an application layer, an application framework layer, an Android runtime (Android runtime) and a system library, and a kernel layer.
- the application layer can include a series of application packages.
- the application package can include applications such as camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and so on.
- the application framework layer provides an application programming interface (application programming interface, API) and a programming framework for applications in the application layer.
- the application framework layer includes some predefined functions.
- the application framework layer may include window managers, content providers, view systems, telephony managers, resource managers, notification managers, and the like.
- a window manager is used to manage window programs.
- the window manager can get the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
- Content providers are used to store and retrieve data and make these data accessible to applications.
- the data may include video, images, audio, calls made and received, browsing history and bookmarks, phone book, etc.
- the view system includes visual controls, such as controls for displaying text, controls for displaying pictures, and so on. View systems can be used to build applications.
- a display interface can consist of one or more views.
- the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures.
- the phone manager is used to provide the communication function of the electronic device 100 .
- the management of call status including connecting, hanging up, etc.).
- the resource manager provides various resources for the application, such as localization strings, icons, pictures, layout files, video files and so on.
- the notification manager enables applications to display notification information in the status bar, which can be used to convey notification-type messages, and can disappear automatically after a brief pause without user interaction. For example, the notification manager is used to notify download completion, message reminders, etc.
- the notification manager can also display notifications in the status bar at the top of the system in the form of graphs or scroll bar text, such as notifications of applications running in the background, and notifications on the screen in the form of dialog windows. For example, text information is prompted in the status bar, a prompt sound is issued, the electronic device vibrates, and the indicator light flashes.
- Android Runtime includes core libraries and a virtual machine. Android runtime is responsible for scheduling and management of the Android system.
- the core library consists of two parts: one is the function functions that the java language needs to call, and the other is the core library of Android.
- the application layer and the application framework layer run in virtual machines.
- the virtual machine executes the java files of the application layer and the application framework layer as binary files.
- the virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, safety and exception management, and garbage collection.
- a system library can include multiple functional modules. For example: surface manager (surface manager), media library (media library), 3D graphics processing library (eg: OpenGL ES), 2D graphics engine (eg: SGL), etc.
- surface manager surface manager
- media library media library
- 3D graphics processing library eg: OpenGL ES
- 2D graphics engine eg: SGL
- the Surface Manager is used to manage the display subsystem and provides a fusion of 2D and 3D layers for multiple applications.
- the media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files.
- the media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
- the 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, and layer processing.
- 2D graphics engine is a drawing engine for 2D drawing.
- the kernel layer is the layer between hardware and software.
- the kernel layer contains at least display drivers, camera drivers, audio drivers, and sensor drivers.
- FIG. 3 is a set of graphical user interfaces (graphical user interface, GUI) provided by the embodiments of the present application.
- the GUI is the desktop of the mobile phone.
- the desktop of the mobile phone includes icons of multiple applications, the multiple applications include clock, calendar, gallery, memo, file management, e-mail, music, calculator and so on.
- the GUI as shown in (b) of FIG. 3 may be displayed.
- the GUI is the display interface of the memo application.
- a note recorded on September 1 is displayed on the display interface, and the content of the note is "Mom's birthday (September 28)".
- the display interface also includes a control 301 for creating a new note.
- the mobile phone detects that the user clicks on the control 301, the mobile phone can display a GUI as shown in (c) of FIG. 3 .
- GUI is another display interface of the memo application. Users can edit the title and body content of the note on this interface.
- the display interface also includes controls for list, text editing, inserting pictures, audio-to-text, and handwriting controls.
- the mobile phone when the mobile phone detects that the user clicks the operation of the recording-to-text control 302, the mobile phone can send request information to the notebook computer, and the request information is used to request the notebook computer to correspond to the voice content acquired by the mobile phone. Edit for text content.
- the laptop computer In response to receiving the request information, the laptop computer can display a reminder box 303, wherein the reminder box 303 includes the prompt information "it is detected that the mobile phone is recording to text, whether to perform text editing on the laptop computer".
- the notebook computer can start App1 in response to the user's operation of clicking on the control 304 .
- App1 is an application that can do text editing.
- the mobile phone can detect the voice content in the environment, and convert the detected voice content into text content. After the mobile phone converts the voice content into text content, the mobile phone can also send the converted text content to the laptop. Thereby, the display interface of the memo application of the mobile phone and the display interface of App1 of the notebook computer are simultaneously displayed with the text content "the popularity changes again... social mentality".
- the mobile phone may include a microphone and a speech recognition (automatic speech recognition, ASR) module, wherein the microphone is used to collect speech content in the environment, and the role of the ASR module is to convert the received speech content into text content.
- ASR automatic speech recognition
- the text content on the display interface of App1 of the notebook computer and the display interface of the memo application of the mobile phone may be synchronized in real time.
- the mobile phone can convert the voice content into text content at a certain time interval (for example, 5 seconds). In 0-5 seconds, the mobile phone converts the collected voice content into text content, "reflecting the demands of people's death while the popularity changes and remains unchanged", so that the mobile phone can display the text content on the display interface of the memo, and the mobile phone can display the text content at the same time.
- the content is sent to the laptop, so that the laptop can display the text content on the display interface of App1; in 5-10 seconds, the mobile phone converts the collected voice content into text content.
- the laptop computer can display the cursor 306 after opening App1, wherein after the laptop computer receives the text content sent by the mobile phone, the laptop computer can display the cursor 306 at the back of the text content.
- the mobile phone can continue to detect the voice content and convert the voice content into text content.
- the mobile phone can synchronize the text content to the App1 of the laptop in real time.
- the notebook computer can edit the text content in App1 in response to the user's editing operation.
- users can add content to the text content sent by the mobile phone (add “1,” before “Hotness", and “2,” before “Guangzhi Think Tank”).
- the user can modify the inaccurate content of the text (for example, "name” is modified to "public opinion", and "breakthrough” is modified to "top10").
- the user can also modify the format of the text content (for example, the laptop detects the user's carriage return operation after the "reputation appeal").
- the notebook computer in response to the notebook computer detecting that the user clicks the save control 307, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone displays the notebook computer on the display interface of the memo. Edited text content.
- the mobile phone converts the voice content into text content after detecting the voice content in the environment, and sends the text content to the laptop computer, while the mobile phone may not display the text content. After the mobile phone receives the edited text content from the laptop computer, the mobile phone can display the edited text content on the display interface of the memo application.
- the notebook computer can edit the text content near the position. For example, when the current cursor 306 is behind "Social Mindset”, when the laptop detects that the user has added a symbol "?” after "Social Mindset”, the cursor 306 can move to the back of "Social Mindset?". At the same time, after the notebook computer receives other text content sent by the mobile phone, it can continue to display the received text content.
- the notebook computer when the notebook computer continues to detect the user's editing operation on the text content, it can edit the text content received from the mobile phone.
- the notebook computer can detect the user's operation of changing "positive success” to "Zheng Chenggong”; for another example, the notebook computer can detect the user's operation of adding a symbol ".” after "annual feature”; another example, The laptop can detect the user inserting the symbol "," in “Medical Education Housing Food Safety". As another example, a laptop can detect the user's press enter after "Social Mindset?".
- the notebook computer when the notebook computer detects that the user clicks the save control 307, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone displays the notebook computer on the display interface of the memo Edited text content.
- the GUI is another display interface of the memo application.
- the mobile phone detects that the user clicks the control 308 to stop recording, the mobile phone can display a GUI as shown in (i) in FIG. 3 .
- the GUI is another display interface of the memo application.
- the user can continue to edit the text content edited by the laptop on the mobile phone (for example, the user can insert a picture into the text content).
- the mobile phone detects that the user clicks on the control 309, the mobile phone can save the text content displayed on the current interface and display the GUI shown in (j) in FIG. 3 .
- the GUI is another display interface of the memo application.
- the mobile phone can update the title of the note to "1, hotspot".
- the mobile phone can display the GUI as shown in (k) in Figure 3.
- the GUI is another display interface of the memo application.
- the mobile phone can display 2 notes included in the current memo application, and the newly added note content can be displayed at the top.
- the mobile phone can start to convert audio to text, and the laptop computer can continue to display the prompt box 301 .
- the notebook computer can send a response to the mobile phone, where the response is used to indicate that the notebook computer can perform text editing.
- the mobile phone can send all the text content 1 that is voice-to-text before receiving the response to the laptop, and the laptop can open App1 and display the text content 1.
- the mobile phone can continue to send the text content 2 corresponding to the voice content input by the user to the notebook computer.
- the notebook computer can append the text content 2 sent by the mobile phone to the back of the text content 1.
- the mobile phone when the mobile phone performs the recording-to-text operation, the mobile phone can notify the notebook computer that the recording-to-text function is being performed, so that the notebook computer can prompt the user whether to edit the text content on the notebook computer.
- the laptop computer can display the text content sent by the mobile phone in real time, thereby facilitating the user to edit the text content and helping to improve the user experience.
- FIG. 4 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone when the mobile phone detects that the user clicks the recording-to-text control, the mobile phone may display the interface shown in (b) in FIG. 4 .
- the mobile phone may send request information to the notebook computer, where the request information is used to request the notebook computer to edit the text content corresponding to the voice content obtained by the mobile phone.
- the notebook computer in response to receiving the request information sent by the mobile phone, the notebook computer can start App1.
- the mobile phone may display a reminder box 401, wherein the reminder box 401 includes prompt information "detecting the surrounding laptop computer, whether to send the text content to the laptop computer for editing".
- the mobile phone detects that the user clicks the control 402, the mobile phone starts to detect the voice content in the environment.
- the mobile phone after detecting the voice content in the environment, the mobile phone can start to convert the recording into text.
- the mobile phone can collect the voice content and convert the voice content into text content to "change the popularity...social mentality", and the mobile phone can also send the converted text content to the laptop, so that the laptop can display the text content through App1.
- the notebook computer when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo.
- processes (f) to (g) in FIG. 3 which are not repeated here for brevity.
- the mobile phone when the mobile phone performs the recording-to-text operation, if it is determined that there is a device (eg, a laptop) that is convenient for the user to edit text, the mobile phone can prompt the user whether to perform text editing on the laptop.
- the laptop computer can display the text content sent by the mobile phone in real time, thereby facilitating the user to edit the text content and helping to improve the user experience.
- FIG. 5 is another set of GUIs provided by the embodiments of the present application.
- the GUI is the desktop of the mobile phone.
- the desktop of the mobile phone includes icons of multiple applications, and the multiple applications include a voice recorder application and the like.
- the GUI as shown in (b) of FIG. 5 can be displayed.
- the GUI is the display interface of the recorder application.
- a recording file recorded on October 3 is displayed on the display interface, and the recording file is named "20201003_100832".
- the display interface also includes a control 501 for creating a new recording file.
- the mobile phone when the mobile phone detects that the user clicks on the control 501, the mobile phone can send instruction information to the notebook computer, where the instruction information is used to indicate that the mobile phone is recording.
- the laptop computer In response to receiving the instruction information, the laptop computer can display a reminder box 502, wherein the reminder box 502 includes a prompt message "It is detected that the mobile phone is recording, do you want to convert the recording to text on the laptop computer?".
- the notebook computer in response to the notebook computer detecting that the user clicks on the control 503, the notebook computer can launch App1.
- the mobile phone In response to the mobile phone detecting that the user clicks on the control 501, the mobile phone can display a recording interface, wherein the recording interface shows that 25 seconds have been recorded and the mobile phone is expected to record more than 10 hours.
- the mobile phone When the mobile phone collects the voice content, it can also send the voice content to the laptop computer. After the laptop computer receives the voice content, it can convert the voice content into text content. The text content is displayed in App1.
- an ASR module may be included in the notebook computer, wherein the function of the ASR module is to convert the received voice content into text content.
- the notebook computer can edit the text content in App1 in response to the user's editing operation.
- users can add content to the text content sent by the mobile phone (add “1,” before “Hotness”, and “2," before “Guangzhi Think Tank”).
- the user can modify the inaccurate content of the text (for example, "name” is modified to "public opinion", and “breakthrough” is modified to "top10”).
- the user can also modify the format of the text content (for example, the laptop detects the user's carriage return operation after the "reputation appeal").
- the notebook computer when it detects that the user clicks the save control, it can send the edited text content to the mobile phone, so that the mobile phone can save the edited text content in a certain application layer sequence (eg, memo).
- a certain application layer sequence eg, memo
- the mobile phone when the mobile phone detects that the user starts recording, the mobile phone can send instruction information to the surrounding notebook computers, so that the notebook computer prompts the user whether the recording needs to be converted into text on the notebook computer.
- the notebook computer prompts the user whether the recording needs to be converted into text on the notebook computer.
- FIG. 6 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone displays a caller ID interface, and the caller ID interface shows that the mobile phone number "XXX-XXXXXXX" from Shenzhen, Guangdong is making a call.
- the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect the user's incoming call.
- the laptop computer can display a reminder box 601, wherein the reminder box 601 includes the prompt information "The mobile phone has received an incoming call, do you want to convert the content of the call to text on the laptop computer?".
- the notebook computer in response to the notebook computer detecting that the user clicks on the control 602, the notebook computer can open App1 and send a response to the mobile phone, which is used to indicate that the notebook computer can be used as a text content editing device.
- the mobile phone In response to the mobile phone detecting the operation of the user accepting the incoming call, the mobile phone can establish a call connection to the terminal device.
- the mobile phone can receive the voice content of another user sent by the peer device.
- the cell phone In response to receiving another user's voice content, the cell phone can send the voice content to the laptop.
- the mobile phone may also send indication information to the notebook computer when detecting the operation of the user accepting the incoming call, the indication information is used to indicate that the mobile phone is on a call, and request the notebook computer to edit the text content corresponding to the call content.
- the laptop can prompt the user "The mobile phone is on a call, do you want to convert the content of the call to text on the laptop?".
- the notebook computer detects that the user determines to use the notebook computer to convert the call content to text, the notebook computer can open App1 and receive the voice content of another user from the mobile phone. Thus, the notebook computer can convert the voice content into text content.
- the mobile phone after receiving the response, can convert the acquired voice content of another user into text content, so that the mobile phone can send the text content to the laptop.
- the notebook computer in response to receiving the voice content from the mobile phone, can convert the voice content into text content "Hi, there is a meeting tomorrow morning...please attend on time", thereby displaying through App1 the text content.
- the notebook computer when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo.
- processes (f) to (g) in FIG. 3 which are not repeated here for brevity.
- the mobile phone when the mobile phone detects an incoming call, if it is determined that there is a device (for example, a laptop computer) that is convenient for the user to edit text nearby, the mobile phone can indicate to the laptop computer that the mobile phone has detected an incoming call.
- the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
- FIG. 7 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone displays a video call interface of a social application, and the video call interface shows that another user is inviting the user to make a video call.
- the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect another user's video call request.
- the laptop computer may display a prompt box 701, wherein the prompt box 701 includes prompt information "The mobile phone has received a video call request, whether to convert the call content to text on the laptop computer?".
- the notebook computer in response to the notebook computer detecting that the user clicks on the control 702, the notebook computer can open App1 and send a response to the mobile phone, which is used to indicate that the notebook computer can be used as a text content editing device.
- the mobile phone In response to the mobile phone detecting the user's operation of accepting the video call request, the mobile phone can establish a video call connection to the terminal device.
- the mobile phone can receive the voice content of another user sent by the peer device.
- the cell phone In response to receiving another user's voice content, the cell phone can send the voice content to the laptop.
- the mobile phone after receiving the response, can convert the acquired voice content of another user into text content, so that the mobile phone can send the text content to the laptop.
- the notebook computer in response to receiving the voice content from the mobile phone, can convert the voice content into text content "Hi, there is a meeting tomorrow morning...please attend on time", thereby displaying through App1 the text content.
- the notebook computer when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo.
- processes (f) to (g) in FIG. 3 which are not repeated here for brevity.
- the mobile phone when the mobile phone detects a video call, if it is determined that there is a device (for example, a laptop computer) that is convenient for the user to edit text nearby, the mobile phone can indicate to the laptop computer that the mobile phone detects an incoming call.
- the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
- the mobile phone when the mobile phone receives an incoming call and a video call request, the mobile phone indicates the surrounding laptops, so that the laptop prompts the user whether to convert the call content to text on the laptop.
- the mobile phone may also instruct the laptop computer after receiving a voice call request from another user, so that the laptop computer prompts the user whether to convert the call content to text on the laptop computer.
- FIG. 8 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone displays a caller ID interface, and the caller ID interface displays that the mobile phone number "XXX-XXXXXXX" from Shenzhen, Guangdong is making a call.
- the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect the user's incoming call.
- the laptop computer can display a reminder box 801, wherein the reminder box 801 includes the prompt information "The mobile phone has received an incoming call, do you want to convert the content of the call to text on the laptop computer?".
- the reminder box 801 can be hidden.
- the mobile phone can establish a call connection to the terminal device.
- the user detects that the user clicks the operation of the recording control 803, and can send instruction information to the notebook computer, and the instruction information is used to indicate that the mobile phone is making a call, Request the laptop to edit the text content corresponding to the call content.
- the notebook computer may display the reminder box 801 again.
- the laptop computer detects that the user clicks on the control 802
- the laptop computer can send a response to the mobile phone, where the response is used to instruct the laptop computer to edit the text content corresponding to the call content.
- the cell phone can send the call content to the laptop.
- the ASR module of the notebook computer in response to receiving the call content sent by the mobile phone, can convert the call content into text content “Hi, tomorrow morning... join” and display it in App1.
- the laptop detects the user's editing operation, can edit the text content, and displays the edited text content in App1 ("Leader: Hi, tomorrow morning... see”).
- the mobile phone when the mobile phone detects that the user clicks the recording operation during the process of answering the call, the mobile phone sends instruction information to the notebook computer, thereby instructing the notebook computer to edit the text content corresponding to the call content.
- the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
- FIG. 9 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone can display the recording-to-text interface in the memo, and the mobile phone detects the voice content input by the user "reflecting the people's livelihood demands between the change of popularity and the constant change", and the mobile phone can convert the voice content to the voice content. Convert to text content and send that text content to the laptop.
- the notebook computer can display the text content to the user by means of subtitles.
- the notebook computer can display the text content "reflecting the demands of people's livelihood between the change of popularity and the constant change" in the subtitle display area 901 .
- the mobile phone when the mobile phone detects the voice content "We have just obtained statistics from the Guangzhi Think Tank” input by the user, the mobile phone can convert the voice content into text content and send the text content to the laptop.
- the notebook computer After receiving the text content, the notebook computer can display the text content to the user by means of subtitles.
- the notebook computer can display the text content "We have just obtained statistics on Guangzhi Think Tank” in the subtitle display area 901 .
- FIG. 10 is another set of GUIs provided by the embodiments of the present application.
- the mobile phone can display the recording-to-text interface in the memo, and the mobile phone detects the voice content input by the user "reflecting the people's death demands between the hotness and the constant change", and the mobile phone can convert the voice content to the voice content. Convert to text content and send that text content to the laptop.
- the notebook computer can display the text content to the user by means of subtitles, and at the same time, the notebook computer can also open App1 and display the text content in App1.
- the laptop computer can display the text content "Reflecting the demands of people's livelihood between the change of popularity and the constant change" in the subtitle display area 1001, and simultaneously display the text content in App1.
- the user can edit the text content in App1.
- the mobile phone when the mobile phone detects the voice content "We have just obtained statistics from the Guangzhi Think Tank" input by the user, the mobile phone can convert the voice content into text content and send the text content to the laptop.
- the notebook computer After the notebook computer receives the text content, it can display the text content to the user through subtitles. At the same time, the notebook computer can also display the text in App1 after "reflecting the demands of people's death between the hotness and the constant change". content.
- the notebook computer can display the text content "We have just obtained the statistics of Guangzhi Think Tank” in the subtitle display area 1001, and display in App1 "The demand for people's death is reflected in the change of popularity and the constant change, we have just calculated the Guangzhi Think Tank. get”.
- the system architecture includes device A and device B, where device A may be a device with a pickup function (eg, a mobile phone), and device B may be a device with an efficient editing function (eg, a laptop).
- Device A includes a pickup module 1110 , an ASR module 1120 , a transceiver control module 1130 and a replacement module 1140 ;
- device B includes a transceiver control module 1150 , a display unit 1160 and an editing control module 1170 .
- the sound pickup module 1110 is used to acquire the voice content to be processed.
- the manner in which the device A acquires the voice content is not specifically limited, and the acquisition manner may be various.
- it can be real-time voice recorded from the surrounding environment of device A; or, it can also be the audio of the audio and video played by the user on the mobile phone; or, it can also be the audio (including remote and Near-end); it can also be the audio and video files in the mobile phone.
- the pickup module is responsible for converting the relevant audio into a specific audio format, such as a pulse code modulation (PCM) audio stream of a certain sampling rate, which is used as an input to the ASR module.
- PCM pulse code modulation
- the ASR module 1120 may convert the speech content to textual content.
- the deep neural network acoustic model is trained to obtain the phoneme sequence with the highest probability, and then through the corresponding language model, the text series with the highest probability is obtained, completing the process from speech content to text. content conversion.
- the transceiver control module 1130 is used to control information synchronization between device A and other devices.
- the transceiver control module 1130 may be responsible for transparent transmission of information, and the transceiver control module 1130 is not involved in the logical processing of the transmission content.
- the transmission of information may be through a local area network, such as Bluetooth, Wi-Fi, or other transmission protocols (eg, the internet).
- Device A establishes a connection network link with other devices (eg, device B) through a network communication protocol, and then initiates a start transmission.
- transceiver control module 1150 For the functional description of the transceiver control module 1150, reference may be made to the aforementioned transceiver control module 1130, which is not repeated here for brevity.
- the display unit 1160 is used to prompt the user whether to start text editing on the device B.
- the transceiver control module 1150 receives the instruction sent by the transceiver control module 1130 for device A to acquire audio content (eg, voice content or audio files), or, the transceiver control module 1150 receives the instruction sent by the transceiver control module 1130 to perform text editing on device B.
- a prompt can be popped up in the form of a notification through the display unit 1160 .
- the device B If the device B detects that the user determines to perform text editing through the device B, it can send a response to the transceiver control module 1130 through the transceiver control module 1150, and the response can be used to indicate that the device B can be used as a text editing device. After the transceiver control module 1130 of device A receives the response, device A starts to transmit the ASR result to device B in real time.
- the transceiver control module 1130 may send a query request, where the query request is used to query surrounding devices with text editing functions.
- device A may store information such as device type, device name, and MAC address of device B.
- device A can send BLE data packets to device B according to the MAC address of device B.
- the BLE data packet may include a PDU, and the query request may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU.
- the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits.
- Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device B can obtain device A and query whether device B has the text editing function.
- the mobile phone when the mobile phone detects that the user clicks on the control 501 (the control 501 is the control for starting recording), the mobile phone can send a BLE data packet, and the BLE data packet can carry the query ask.
- device A when device A detects that the user performs an operation of converting speech to text, device A can send a BLE data packet to device B according to the MAC address of device B.
- the mobile phone when the mobile phone detects that the user clicks on the control 302 (the control 302 is a control for the mobile phone to perform voice-to-text conversion), the mobile phone can send a BLE data packet, and the BLE data packet can carry the query request.
- the control 302 is a control for the mobile phone to perform voice-to-text conversion
- the transceiver control module of device B can call the interface for querying the text editing function (for example, the content provider interface) to send a request to one or more applications in the application layer, and the request is used to request the application to determine whether it has the text editing function; If an application program has a text editing function, the application program can send a response to the data synchronization module, and the response is used to indicate that app2 has logged into the account. Therefore, device B can determine that device B has installed and logged in app2.
- the text editing function for example, the content provider interface
- device B can send a response to device A, and the response can be carried in a BLE data packet.
- the BLE data packet may include a PDU, and the query request may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU.
- the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits.
- Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device A can obtain the text clipping function of device B.
- device A After device A determines that device B has the text editing function, device A can send the acquired text content and instruction information corresponding to the audio content to device B, where the instruction information is used to instruct device B to edit the text content.
- device A may send a BLE data packet to device B according to the MAC address of device B.
- the BLE data packet may include a PDU, and the text content and indication information may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU.
- the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits.
- Device A and Device B can agree on the content of a certain extensible bit.
- Device A can use encoding methods such as GBK, ISO8859-1 or Unicode (for example, UTF-8, UTF-16) to encode the text content output by the ASR module of device A, and carry the encoded information in an or on multiple scalable bits.
- Device A may also set an extensible bit to 1.
- the device B After receiving the BLE data packet, the device B can obtain the text content and the indication information by decoding, so that the device B can display the text content according to the indication information.
- device A may send text content corresponding to the acquired audio content to device B.
- device A may send a BLE data packet to device B according to the MAC address of device B.
- the BLE data packet may include a PDU, and the text content may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU.
- the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits.
- Device A and Device B can agree on the content of a certain extensible bit.
- Device A can use encoding methods such as GBK, ISO8859-1 or Unicode (for example, UTF-8, UTF-16) to encode the text content output by the ASR module of device A, and carry the encoded information in an or on multiple scalable bits.
- device B After receiving the BLE data packet, device B can obtain the text content by decoding, so that device B can display the text content.
- device A and device B are devices under the same account, then device A can store information such as device type, device name, MAC address, and whether it has a text editing function of device B. Then, when device A acquires the audio content, device A can send the text content and indication information corresponding to the audio content to device B, where the indication information is used to instruct device B to edit the text content. In response to receiving the text content and the indication information, the device B may prompt the user whether to perform text editing on the device B through the display unit 1160 . In response to detecting that the user performs text editing on the device B, the device B can open an application that can be used for text editing, so as to display the text content obtained from the device A.
- device A may send the text content corresponding to the audio content to device B.
- the device B can edit the text content on the device B, so that the device B can prompt the user whether to perform text editing on the device B through the display unit 1160 .
- the device B can open an application that can be used for text editing, so as to display the text content obtained from the device A.
- the display unit 1160 can also be used to display the text content output by the ASR module (including the intermediate result and the final result); the display unit 1160 can also be used to display the text content edited by the user on the device B.
- the transceiver control module 1150 of the device B can append the text content to the back of the previously displayed text content.
- the transceiver control module 1150 of device B can call the QT interface and perform the following steps: (1) select all the text content displayed by App1; ( 2) Move the cursor to the end of the text; (3) Insert the new text content sent by device A; (4) Save all the text content in the current App1.
- the editing control module 1170 is used for saving, editing and displaying control of the received information.
- the results sent to device B through the ASR module 1120 may include two types:
- the first type is an intermediate result. For example, before a sentence is finished, the text content determined by the ASR module 1120 is not finalized and can be used as an intermediate result. In order to reflect the real-time performance, the intermediate results are also displayed, but are not saved as the final results.
- the ASR module of device A detects that every time the user speaks a word, it sends a corresponding word to device B, so that text content can be displayed on device A and device B synchronously. For example, the user says “I am XX", when device A detects that the user has said "I”, the ASR module of device A can send the text content ("I” to device B after determining the corresponding text "I” ), so that device B can display the text content "I”. When device A detects that the user has said "yes”, the ASR module of device A can send the text content ("yes") to device B after determining the corresponding text "yes”, so that device B can Append the text "Yes” after "I” until "I am XX” is displayed.
- the ASR module of device A detects that every time the user speaks a phrase, it sends a corresponding phrase to device B, so that text content can be displayed on device A and device B synchronously. For example, the user says “no hardship can stop us from moving forward", when device A detects that the user has said "any", the ASR module of device A can send the corresponding word "any” to device B after determining the corresponding word "any” The text content ("any"), so that device B can display the text content "any”.
- the ASR module of device A can send the text content ("difficult") to device B after determining the corresponding word "difficult", so that device B can Append the text "difficulty” after any” until it shows "no hardship can stop us from moving forward".
- the content displayed by device A can be synchronized with the content displayed on device B, and device A may correct the text content previously converted by the ASR module when judging that the user has finished speaking.
- the ASR module of device A detects that every time the user speaks a word or a phrase, it sends the corresponding text content to device B.
- device A detects that the user says " When "thinking”, it is possible that the ASR module determines that the corresponding text is "xiang”, then device A will send the corresponding text content (“xiang”) to device B, and device B can display the text content "xiang”.
- device A After judging that the user has finished saying this sentence, device A can determine that there is an error in the conversion of "Xiang” in the previously converted text content "Today I'm eating noodles", then device A can automatically correct the "Xiang” to "Xiang”. I want to”, thereby updating the text content spoken by the user (from "Today I want to eat noodles” to "Today I want to eat noodles”). Device A can send the updated text content to device B, so that device B can also update "Xiang” in the previously displayed text content "Today I want to eat noodles” to "Want", thereby displaying the updated text content "Today I want to eat noodles”.
- the second is that after a sentence is spoken, the recognition result of the sentence determined by the ASR module 1120 will not be changed back in the future, and such a result will replace the previous intermediate result and display it as the saved result.
- the ASR module of device A can convert each word or phrase spoken by the user into text content, but device A may not send text content to device B before judging that the user has not finished a sentence. Instead, when device A determines that the user has finished speaking a sentence and corrects the previously converted text content, it sends the text content corresponding to the sentence to device B.
- the ASR module of device A can convert each word or phrase spoken by the user into text content and send it to device B.
- Device B can display the text content sent by device A in the form of subtitles, but the device B may not display the text content in app1.
- device A judges that the user has finished speaking a sentence (and corrects the previously converted text content)
- device A can send the text content corresponding to the sentence and indication information to device B, and the indication information instructs device B to display in app1
- the text content corresponding to the sentence After receiving the text content and the instruction information, device B can display the text content corresponding to the sentence in app1.
- the editing control module 1170 will also re-save the edited result and display it, and the subsequent ASR The result is appended on top of that.
- the editing control module 1170 is further configured to send the text content edited by the user to the sending and receiving control module 1150 , so that the sending and receiving control module 1150 sends the edited text content to the sending and receiving control module 1130 .
- the replacement module 1140 is configured to replace the original displayed text content with the edited text content after receiving the edited text content sent by the transceiver control module 1130 .
- the device A when the recording ends or the audio acquisition ends, the device A may indicate the end of the recording or the audio acquisition to the device B. After the user finishes editing on device B, the editing result can be synchronized to device A at one time, and the entire synchronization process is completed.
- FIG. 12 shows a schematic flowchart of a text editing method 1200 according to an embodiment of the present application.
- the method 1200 can be implemented by the above-mentioned device A and device B, and the method 1200 includes:
- S1201 device A acquires audio content.
- the mobile phone when the mobile phone detects the user's operation of accepting the incoming call, the mobile phone can acquire the voice content of another user.
- the mobile phone when the mobile phone detects that the user accepts the operation of the video call request, the mobile phone can acquire the voice content of another user.
- device A sends first information to device B according to the audio content.
- the first information is text content corresponding to the audio content.
- device A may first convert the voice content into text content through the ASR module, so as to send the text content to device B.
- ASR module For the process of converting the voice content into text content by the device A, reference may be made to the descriptions in the foregoing embodiments, which are not repeated here for brevity.
- the method before device A sends the voice content to device B, the method further includes:
- Device A sends a query request to request the device that received the query request to determine whether it has a text editing function
- device B In response to receiving the request information sent by device A, device B sends a response to device A, where the response is used to indicate that device B has a text editing function;
- device A In response to receiving the response, device A sends device B the first information.
- device A broadcasts the query request to surrounding devices.
- device A may store device information of device B (eg, device A and device B are devices under the same account, or device A and device B are devices under different accounts in the same family group).
- device B in response to receiving the query request, may prompt the user whether to perform text editing on device B; in response to the user determining to perform text editing on device B, device B sends the response to device A.
- the laptop computer when the laptop computer detects the query request sent by the mobile phone, the laptop computer can display a prompt box 303, which includes the prompt message “It is detected that the mobile phone is recording and transferring. text, whether text editing is done on a laptop". In response to the user clicking on control 304, the laptop can send the response to the phone.
- device A may send request information to device A, where the request information is used to request device B to edit the text content output by device A; Request device B to edit the text content corresponding to the audio content.
- the mobile phone when the mobile phone detects that the user clicks on the control 302, the mobile phone can send a query request to the surrounding devices, and the query request is used to query whether the surrounding devices have a text editing function; In response to receiving the response sent by the laptop, the mobile phone can determine that the laptop has text editing capabilities. The mobile phone can continue to send request information to the notebook computer, and the request information is used to edit the text content output by the mobile phone. In response to receiving the request information from the cell phone, the laptop computer may display a prompt box 303 . In response to the user clicking on the control 304, the notebook computer can send a response to the mobile phone, the response being used to indicate that the notebook computer can edit the text content. In response to receiving the response, the cell phone can send the textual content to the laptop.
- the mobile phone when the mobile phone detects the operation of the user clicking on the control 501, it sends a query request to the surrounding devices; in response to receiving the response sent by the notebook computer, the mobile phone can determine that the notebook computer has text Edit function.
- the mobile phone can send request information to the laptop computer, where the request information is used to request the laptop computer to edit the text content corresponding to the audio content acquired by the mobile phone.
- the laptop computer may display a prompt box 502, wherein the prompt box 502 includes prompt information "it is detected that the mobile phone is recording, whether to convert the recording to text on the laptop computer".
- the notebook computer can send a response to the mobile phone, where the response is used to instruct the notebook computer to edit the text content corresponding to the recording.
- the cell phone can send the textual content to the laptop.
- device B can send the response information to device A through a BLE data packet, and the specific sending process can refer to the above-mentioned process of device A sending a BLE data packet to device B, which is not repeated here for brevity.
- the request information may be carried in a user datagram protocol (user datagram protocol, UDP) data packet.
- UPD packet includes the data portion of the IP datagram.
- the data portion of an IP datagram may include extensible bits.
- Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device B can know that device A requests to edit the text content.
- the UDP data packet can also carry the IP address and port number of device A (including source port number and destination port number, where the source port number refers to the port number used by device A to send data, and the destination port number refers to the port number that device A receives The port used by the data), the IP address and port number of device A can be carried in the UDP header of the data part of the IP datagram.
- device B can establish a transmission control protocol (transmission control protocol, TCP) connection with device A.
- TCP transmission control protocol
- device B can send the response to device A through the TCP connection.
- the UDP data packet carries the IP address and destination port number of device A.
- Device B can establish a TCP connection with device A through the IP address and destination port number. After that, device A can send the text content input by the ASR module of device A to device B through the TCP connection.
- the method 1200 further includes: when the device A converts the voice content into text content, the device A displays the text content.
- the mobile phone after detecting the voice content in the environment, the mobile phone can convert the voice content into text content, so as to display the text content through the display interface of the memo application.
- S1203 Device B displays the text content corresponding to the audio content according to the first information.
- the notebook computer after receiving the text content sent by the mobile phone, the notebook computer can display the text content through App1.
- the notebook computer may display the edited text content.
- the method 1200 further includes: device B sending the edited text content to device A.
- device B detects the user's first operation, and sends the edited text content to device A.
- the notebook computer when the notebook computer detects that the user clicks on the control 307, the notebook computer can send the edited text content to the mobile phone.
- the notebook computer when the notebook computer detects that the user clicks on the control 307, the notebook computer can send the edited text content to the mobile phone.
- device A may use the edited text content to edit the previously displayed text content.
- the notebook computer may send the edited text content to the mobile phone.
- the phone can replace the previously displayed text content with the text content edited by the user.
- device B when device B detects an editing operation by the user on the text content sent by device A, it can edit the text content accordingly.
- the notebook computer when the notebook computer detects that the user has edited the text content, it can edit the text content received from the mobile phone.
- the notebook computer when the notebook computer can detect the user's operation of changing "positive success” to "Zheng Chenggong”; for another example, the notebook computer can detect the user's operation of adding a symbol ".” after "annual feature”; another example, The laptop can detect the user inserting the symbol "," in "Medical Education Housing Food Safety". As another example, a laptop can detect the user's press enter after "Social Mindset?".
- Device B can send the edited text content to device A through a BLE data packet or through a TCP connection.
- a BLE data packet or through a TCP connection.
- TCP connection For the sending method, reference may be made to the description in the above embodiment, which is not repeated here for brevity.
- device B if device B detects that the user has edited the format of the text content, when device B sends the edited text content to device A, it can also indicate the format information of the text content to device A.
- the format of the text content may include a line feed (or carriage return) operation between certain two words in the text content; or, there is a space between certain two words.
- the notebook computer while sending the edited text content to the mobile phone, the notebook computer can also indicate to the mobile phone that there is a line break between "Q" and "2", and "?” There is a newline between "3".
- device B may send the edited text content and format information of the text content to device A.
- the implementation manner of the device B sending the edited text content to the device A may refer to the description in the foregoing embodiment, which is not repeated here for brevity.
- the following describes an implementation manner in which device B sends the format information of the edited text content to device A.
- the text content format includes font size, font color, font inclination, font underline, font background color, and a carriage return operation after a certain text in this content.
- device A may send the format information of the edited text content to device B through a BLE data packet.
- This textual content may be carried in a service data field or a vendor specific data field in the PDU.
- the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits.
- device A may agree with device B on the content of some extensible bits. Exemplarily, when a certain extensible bit is 000, the device A can know that the character is not oblique, there is no underscore, and there is no carriage return operation after the character.
- a certain extensible bit when a certain extensible bit is 100, device A can learn that the character is oblique, has no underline, and there is no carriage return operation after the character.
- a certain extensible bit when a certain extensible bit is 010, device A can know that the character is not oblique, has an underline, and there is no carriage return operation after the character.
- a certain extensible bit when a certain extensible bit is 001, device A can know that the character is not slanted, has no underscore, and there is a carriage return operation after the character.
- device A may agree with device B on the content of some extensible bits.
- a certain expandable bit is 000
- device A can learn that the font color of the character is black.
- a certain extensible bit is 001
- device A can learn that the font color of the character is gray.
- a certain expandable bit is 010
- device A can know that the color of the character is blue.
- a certain expandable bit is 100
- device A can learn that the font color of the character is blue.
- device A may agree with device B on the content of some extensible bits.
- a certain expandable bit is 000
- device A can learn that the font size of the character is 10.
- a certain extensible bit is 001
- device A can learn that the font size of the character is 12.
- a certain extensible bit is 010
- device A can learn that the font size of the character is 14.
- a certain expandable bit is 100
- device A can learn that the font size of the character is 18.
- the format of the text content is not specifically limited in the embodiment of the present application.
- the device B After receiving the format information of the edited text content, the device B can to display the edited text content.
- the text content displayed on device B corresponds to the text content displayed on device A.
- the method 1200 further includes: device A replaces the original text content with the edited text content received from device B; in response to receiving the edited text content from device B, device A displays a second interface , the second interface includes the edited text content.
- the notebook computer sends the edited text content to the mobile phone in response to the operation of the user clicking the save control 307 .
- the mobile phone can replace the text content displayed on the original interface with the edited text content.
- device B when device B sends the edited text content to device A, it may also send the identification information of the edited text content to device A.
- S1204 is an optional step, and device B may also not send the edited text content to device A. Instead, save the edited text content locally on device B.
- FIG. 13 shows another system architecture provided by an embodiment of the present application.
- the system architecture includes device A and device B, where device A may be a device with a pickup function (eg, a mobile phone), and device B may be a device with an efficient editing function (eg, a laptop).
- Device A includes a pickup module 1310 and a transceiver control module 1320 ;
- device B includes a receive transceiver control module 1330 , an ASR module 1340 , a display module 1350 and an editing control module 1360 .
- the ASR module can be included in device B.
- Device A can be responsible for acquiring voice content or audio files, and device A sends the acquired voice content or audio files to the transceiver control module 1330 of device B through the transceiver control module 1320, so that the ASR module 1340 of device B sends the voice content or audio files to the device B. Converted to text content, so that device B can display the converted text content to the user. Device B can also display the edited text content through the display module 1340 after detecting an editing operation of the text content by the user through the editing control module 1360 .
- FIG. 14 shows a schematic block diagram of an apparatus 1400 provided by an embodiment of the present application.
- the apparatus 1400 can be set in the above-mentioned device A in FIG. 12 , and the apparatus 1400 includes: an acquiring unit 1410 for acquiring audio content; a sending unit 1420 for transmitting first information to the second electronic device, the first The information is the audio content, or the first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects the user's first text content. edit operation.
- FIG. 15 shows a schematic block diagram of an apparatus 1200 provided by an embodiment of the present application.
- the apparatus 1500 can be set in the above-mentioned device B in FIG. 12 , and the apparatus 1500 includes: a receiving unit 1510 for receiving first information sent by a first electronic device, where the first information is acquired by the first electronic device audio content, or the first information is the first text content corresponding to the audio content; the display unit 1520 is used to display the first text content according to the first information; the detection unit 1530 is used to detect the user's An editing operation of text content; the display unit 1520 is further configured to display the second text content, where the second text content is the edited text content of the first text content.
- FIG. 16 shows a schematic structural diagram of an electronic device 1600 provided by an embodiment of the present application.
- the electronic device includes: one or more processors 1610, one or more memories 1620, and the one or more memories 1620 stores one or more computer programs, and the one or more computer programs include instruction.
- the instruction is executed by the one or more processors 1610, the first electronic device or the second electronic device is caused to execute the technical solutions in the foregoing embodiments.
- An embodiment of the present application provides a system including a device A and a device B, and the system is used to implement the technical solutions in the foregoing embodiments.
- the implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
- the embodiments of the present application provide a computer program product, which enables the device A to execute the technical solutions in the foregoing embodiments when the computer program product runs on the device A (or the mobile phone in the foregoing embodiment).
- the implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
- An embodiment of the present application provides a computer program product, which enables the device B to execute the technical solutions in the foregoing embodiments when the computer program product runs on the device B (or the notebook computer in the foregoing embodiment).
- the implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
- An embodiment of the present application provides a readable storage medium, where the readable storage medium contains instructions, when the instructions are executed on device A (or, the mobile phone in the above embodiment), the device A is caused to execute the above embodiment technical solution.
- the implementation principle and technical effect thereof are similar, and are not repeated here.
- An embodiment of the present application provides a readable storage medium, where the readable storage medium contains instructions, when the instructions are executed on a device B (or a notebook computer in the above embodiment), the device B is caused to perform the above implementation.
- a device B or a notebook computer in the above embodiment
- the device B is caused to perform the above implementation.
- example technical solution The implementation principle and technical effect thereof are similar, and are not repeated here.
- An embodiment of the present application provides a chip, which is used for executing an instruction, and when the chip is running, executes the technical solutions in the foregoing embodiments.
- the implementation principle and technical effect thereof are similar, and are not repeated here.
- the disclosed system, apparatus and method may be implemented in other manners.
- the apparatus embodiments described above are only illustrative.
- the division of the units is only a logical function division. In actual implementation, there may be other division methods.
- multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented.
- the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
- the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
- each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
- the functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium.
- the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution.
- the computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application.
- the aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program codes .
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- Theoretical Computer Science (AREA)
- General Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- User Interface Of Digital Computer (AREA)
- Telephone Function (AREA)
Abstract
A text editing method, an electronic device (100, 1600) and a system. The method comprises: a device A acquiring audio content (S1201); the device A sending first information to a device B (S1202), wherein the first information is the audio content, or the first information is first text content corresponding to the audio content; the device B displaying the first text content according to the first information (S1203); and the device B displaying second text content in response to an editing operation of a user on the first text content (S1204), wherein the second text content is text content obtained after the first text content is edited (S1205). The present invention is conducive to improving the editing efficiency of a user for text content.
Description
本申请要求于2020年12月22日提交中国专利局、申请号为202011529621.0、申请名称为“一种编辑文本的方法、电子设备和系统”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese patent application with the application number 202011529621.0 and the application title "A method, electronic device and system for editing text" filed with the China Patent Office on December 22, 2020, the entire contents of which are incorporated by reference in this application.
本申请涉及终端领域,并且更具体地,涉及一种编辑文本的方法、电子设备和系统。The present application relates to the field of terminals, and more particularly, to a method, electronic device and system for editing text.
语音识别(automatic speech recognition,ASR)技术已经被广泛使用,非常大程度提高了人们做纪要的效率,但是由于某些条件下,ASR技术无法做到100%的准确,就需要人为对ASR结果进行人工编辑后才能使用。但是一般情况下,ASR模块都存在手机这样的便捷移动设备上,它没有相对人类交互比较友好的屏幕显示和输入方式,使得ASR结果的编辑效率低下。Speech recognition (automatic speech recognition, ASR) technology has been widely used, which greatly improves the efficiency of people making minutes. However, due to certain conditions, ASR technology cannot be 100% accurate, and it is necessary to manually perform ASR results. It can only be used after manual editing. However, in general, ASR modules are stored on convenient mobile devices such as mobile phones, and there is no screen display and input method that is relatively friendly to human interaction, which makes the editing efficiency of ASR results low.
发明内容SUMMARY OF THE INVENTION
本申请提供一种编辑文本的方法、电子设备和系统,有助于提升用户对文本编辑的编辑效率。The present application provides a method, electronic device and system for editing text, which help to improve the editing efficiency of text editing by users.
第一方面,提供了一种系统,该系统包括第一电子设备和第二电子设备,其中,该第一电子设备用于获取音频内容;该第一电子设备还用于向该第二电子设备发送第一信息,该第一信息为该音频内容,或者,该第一信息为该音频内容对应的第一文本内容;该第二电子设备用于根据该第一信息,显示该第一文本内容;该第二电子设备还用于响应于用户对该第一文本内容的编辑操作,显示第二文本容,该第二文本内容为对该第一文本内容编辑后的文本内容。In a first aspect, a system is provided, the system includes a first electronic device and a second electronic device, wherein the first electronic device is used to acquire audio content; the first electronic device is further configured to report to the second electronic device Send first information, where the first information is the audio content, or the first information is the first text content corresponding to the audio content; the second electronic device is configured to display the first text content according to the first information ; the second electronic device is further configured to display the second text content in response to the user's editing operation on the first text content, where the second text content is the edited text content of the first text content.
本申请实施例中,第一电子设备可以将获取的音频内容对应的文本内容发送给第二电子设备,从而使得该文本内容可以显示在第二电子设备上,从而方便用户在第二电子设备上对文本内容进行编辑,从而有助于提升用户在编辑文本内容时的效率。In this embodiment of the present application, the first electronic device can send the acquired text content corresponding to the audio content to the second electronic device, so that the text content can be displayed on the second electronic device, so that it is convenient for the user to display the text content on the second electronic device Edit the text content to help improve the efficiency of the user when editing the text content.
在一些可能的实现方式中,第一电子设备中可以保存有一个或者多个电子设备的信息,当第一电子设备获取音频时,第一电子设备可以从该一个或者多个电子设备选择一个合适进行文本编辑的第二电子设备,从而将第一信息发送给第二电子设备。In some possible implementations, the first electronic device may store information of one or more electronic devices, and when the first electronic device acquires audio, the first electronic device may select a suitable one from the one or more electronic devices A second electronic device that performs text editing, thereby sending the first information to the second electronic device.
在一些可能的实现方式中,第二电子设备响应于接收到该第一信息,可以打开第一应用程序,并在该第一应用程序中显示该第一文本内容;第二电子设备可以通过第二电子设备的输入法对该第一文本内容进行编辑。In some possible implementations, in response to receiving the first information, the second electronic device may open a first application, and display the first text content in the first application; the second electronic device may The input method of the second electronic device edits the first text content.
结合第一方面,在第一方面的某些实现方式中,该第二电子设备还用于向该第一电子设备发送该第二文本内容。With reference to the first aspect, in some implementations of the first aspect, the second electronic device is further configured to send the second text content to the first electronic device.
在一些可能的实现方式中,该第二电子设备还用于在检测到用户的第一操作时,向该 第一电子设备发送该第二文本内容。In some possible implementations, the second electronic device is further configured to send the second text content to the first electronic device when the first operation of the user is detected.
在一些可能的实现方式中,该第一操作为用户点击保存的操作。In some possible implementations, the first operation is an operation in which the user clicks to save.
本申请实施例中,第二电子设备可以获取到用户编辑后的文本内容后向第一电子设备发送编辑后的文本内容,从而使得第一电子设备可以保存该编辑后的文本内容。无需用户的额外操作,就可使得第一电子设备也可以获得编辑后的文本内容In this embodiment of the present application, the second electronic device may acquire the text content edited by the user and send the edited text content to the first electronic device, so that the first electronic device may save the edited text content. The edited text content can also be obtained by the first electronic device without any additional operation by the user.
结合第一方面,在第一方面的某些实现方式中,该编辑操作包括对该第一文本内容的格式修改操作,其中,该第二电子设备,还用于向该第一电子设备发送该第二文本内容的格式信息。With reference to the first aspect, in some implementations of the first aspect, the editing operation includes a format modification operation for the first text content, wherein the second electronic device is further configured to send the first electronic device the Format information of the second text content.
本申请实施例中,当用户修改了文本内容的格式时,第二电子设备还可以向第一电子设备发送编辑后的文本内容的格式信息,从而使得第一电子设备根据该格式信息还原第二电子设备上用户编辑后的文本内容。In this embodiment of the present application, when the user modifies the format of the text content, the second electronic device may also send the format information of the edited text content to the first electronic device, so that the first electronic device can restore the second electronic device according to the format information. User-edited text content on an electronic device.
结合第一方面,在第一方面的某些实现方式中,该第二文本内容的格式信息包括该第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及该第二文本内容中的回车操作种的一种或者多种。With reference to the first aspect, in some implementations of the first aspect, the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
结合第一方面,在第一方面的某些实现方式中,该第一电子设备,还用于:在接收该第二电子设备发送的该第二文本内容之前,根据该音频内容,显示该第一文本内容;在接收该第二电子设备发送的该第二文本内容之后,使用该第二文本内容替换该第一文本内容。With reference to the first aspect, in some implementations of the first aspect, the first electronic device is further configured to: display the first electronic device according to the audio content before receiving the second text content sent by the second electronic device a text content; after receiving the second text content sent by the second electronic device, replace the first text content with the second text content.
本申请实施例中,第一电子设备可以在获取该音频内容时显示对应的第一文本内容,在第一电子设备接收到第二电子设备发送的用户编辑后的第二文本内容后,第一电子设备可以使用第二文本内容替换之前的第一文本内容,从而方便用户在第一电子设备和第二电子设备上同时观看到编辑后的文本内容,有助于提高用户的体验。In this embodiment of the present application, the first electronic device may display the corresponding first text content when acquiring the audio content, and after the first electronic device receives the user-edited second text content sent by the second electronic device, the first electronic device The electronic device can replace the previous first text content with the second text content, so that it is convenient for the user to watch the edited text content on the first electronic device and the second electronic device at the same time, which helps to improve the user's experience.
结合第一方面,在第一方面的某些实现方式中,该第一电子设备还用于发送查询请求,该查询请求用于接收到该查询请求的设备确定是否具有文本编辑功能;响应于接收到该第二电子设备发送的响应,向该第二电子设备发送该第一信息,该响应用于指示该第二电子设备具备文本编辑功能。With reference to the first aspect, in some implementations of the first aspect, the first electronic device is further configured to send a query request, and the query request is used for the device that received the query request to determine whether it has a text editing function; in response to receiving the query request To the response sent by the second electronic device, the first information is sent to the second electronic device, where the response is used to indicate that the second electronic device has a text editing function.
本申请实施例中,第一电子设备可以在向第二电子设备发送第一信息之前,查询具备文本编辑功能的设备。第一电子设备在确定第二电子设备具备文本编辑功能后,可以向该第二电子设备发送该第一信息,从而方便用户在第二电子设备上对文本内容进行编辑,有助于提升用户在文本编辑时的效率。In this embodiment of the present application, before sending the first information to the second electronic device, the first electronic device may inquire about a device with a text editing function. After determining that the second electronic device has a text editing function, the first electronic device can send the first information to the second electronic device, so as to facilitate the user to edit the text content on the second electronic device and help improve the user's ability to Efficiency when editing text.
结合第一方面,在第一方面的某些实现方式中,该第二电子设备具体用于:响应于接收到该第一信息,提示用户是否在该第二电子设备上进行文本编辑;响应于用户确定在该第二电子设备上进行文本编辑的操作,显示该第一文本内容。With reference to the first aspect, in some implementations of the first aspect, the second electronic device is specifically configured to: in response to receiving the first information, prompt the user whether to perform text editing on the second electronic device; The user determines to perform a text editing operation on the second electronic device to display the first text content.
本申请实施例中,第二电子设备在接收到该第一信息时,可以先提示用户是否同意在第二电子设备上进行文本编辑。若第二电子设备检测到用户同意的操作,则可以显示该文本内容,这样有助于避免对用户造成干扰,用户可以选择一个自己认为合适的设备进行文本编辑,有助于提升用户的体验。In this embodiment of the present application, when the second electronic device receives the first information, it may first prompt the user whether to agree to perform text editing on the second electronic device. If the second electronic device detects the operation agreed by the user, it can display the text content, which helps to avoid interference to the user, and the user can choose a device he thinks is suitable for text editing, which helps to improve the user's experience.
在一些可能的实现方式中,该第一电子设备还可以向第二电子设备发送请求信息,该请求信息用于请求第二电子设备对该第一文本内容进行编辑;第二电子设备可以响应于该请求信息,提示用户是否同意在第二电子设备上对文本内容进行编辑;若第二电子设备检 测到用户同意在第二电子设备上对文本内容进行编辑的操作,第二电子设备可以显示该第一文本内容。In some possible implementations, the first electronic device may also send request information to the second electronic device, where the request information is used to request the second electronic device to edit the first text content; the second electronic device may respond to The request information prompts the user whether to agree to edit the text content on the second electronic device; if the second electronic device detects that the user agrees to edit the text content on the second electronic device, the second electronic device can display the First text content.
结合第一方面,在第一方面的某些实现方式中,该第一信息为该第一文本内容,该第一电子设备具体用于:响应于获取到该音频内容,将该音频内容转化为该第一文本内容;向该第二电子设备发送该第一文本内容。With reference to the first aspect, in some implementations of the first aspect, the first information is the first text content, and the first electronic device is specifically configured to: in response to acquiring the audio content, convert the audio content into the first text content; sending the first text content to the second electronic device.
本申请实施例中,第一电子设备可以在获取音频内容后,将该音频内容转化为文本内容,从而向第二电子设备发送该文本内容,第二电子设备可以显示对应的文本内容,从而方便用户对文本内容进行编辑。In this embodiment of the present application, after acquiring the audio content, the first electronic device can convert the audio content into text content, so as to send the text content to the second electronic device, and the second electronic device can display the corresponding text content, thereby facilitating the The user edits the text content.
结合第一方面,在第一方面的某些实现方式中,该第一信息为该音频内容,该第二电子设备具体用于:响应于接收到该音频内容,将该音频内容转化为该第一文本内容;显示该第一文本内容。With reference to the first aspect, in some implementations of the first aspect, the first information is the audio content, and the second electronic device is specifically configured to: in response to receiving the audio content, convert the audio content into the audio content a text content; display the first text content.
本申请实施例中,第一电子设备可以将获取到的音频内容发送给第二电子设备,第二电子设备可以将音频内容转化为文本内容后显示该文本内容,从而方便用户对文本内容进行编辑。In the embodiment of the present application, the first electronic device may send the acquired audio content to the second electronic device, and the second electronic device may convert the audio content into text content and display the text content, thereby facilitating the user to edit the text content .
结合第一方面,在第一方面的某些实现方式中,该第一电子设备的账号和该第二电子设备的账号相关联。With reference to the first aspect, in some implementations of the first aspect, the account of the first electronic device is associated with the account of the second electronic device.
第二方面,提供了一种文本编辑的方法,该方法应用于第一电子设备,该方法包括:该第一电子设备获取音频内容;该第一电子设备向第二电子设备发送第一信息,该第一信息为该音频内容,或者,该第一信息为该音频内容对应的第一文本内容,以使得该第二电子设备根据该第一信息显示该第一文本内容并检测用户对该第一文本内容的编辑操作。In a second aspect, a text editing method is provided, the method is applied to a first electronic device, the method includes: the first electronic device acquires audio content; the first electronic device sends first information to the second electronic device, The first information is the audio content, or the first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects the user's An editing operation for text content.
结合第二方面,在第二方面的某些实现方式中,该方法还包括:该第一电子设备接收该第二电子设备发送的第二文本内容,该第二文本内容为用户在该第二电子设备上对该第一文本内容进行编辑后的文本内容。With reference to the second aspect, in some implementations of the second aspect, the method further includes: receiving, by the first electronic device, second text content sent by the second electronic device, where the second text content Text content after editing the first text content on the electronic device.
结合第二方面,在第二方面的某些实现方式中,该方法还包括:该第一电子设备接收该第二电子设备发送的该第二文本内容的格式信息。With reference to the second aspect, in some implementations of the second aspect, the method further includes: receiving, by the first electronic device, format information of the second text content sent by the second electronic device.
结合第二方面,在第二方面的某些实现方式中,该第二文本内容的格式信息包括该第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及该第二文本内容中的回车操作种的一种或者多种。With reference to the second aspect, in some implementations of the second aspect, the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
结合第二方面,在第二方面的某些实现方式中,该方法还包括:在接收该第二电子设备发送的该第二文本内容之前,该第一电子设备根据该音频内容,显示该第一文本内容;在接收该第二电子设备发送的该第二文本内容之后,该第一电子设备使用该第二文本内容替换该第一文本内容。With reference to the second aspect, in some implementations of the second aspect, the method further includes: before receiving the second text content sent by the second electronic device, the first electronic device displays the first electronic device according to the audio content a text content; after receiving the second text content sent by the second electronic device, the first electronic device replaces the first text content with the second text content.
结合第二方面,在第二方面的某些实现方式中,该方法包括:在向该第二电子设备发送该第一信息之前,该第一电子设备发送查询请求,该查询请求用于接收到该查询请求的设备确定是否具有文本编辑功能;响应于接收到该第二电子设备发送的响应,该第一电子设备向该第二电子设备发送该第一信息,该响应用于指示该第二电子设备具备文本编辑功能。With reference to the second aspect, in some implementations of the second aspect, the method includes: before sending the first information to the second electronic device, the first electronic device sends a query request, where the query request is used to receive The device of the query request determines whether it has a text editing function; in response to receiving a response sent by the second electronic device, the first electronic device sends the first information to the second electronic device, and the response is used to indicate the second electronic device. Electronic devices have text editing capabilities.
结合第二方面,在第二方面的某些实现方式中,该第一信息为该第一文本内容,该方法还包括:响应于获取到该音频内容,该第一电子设备将该音频内容转化为该第一文本内 容;该第一电子设备向该第二电子设备发送该第一文本内容。With reference to the second aspect, in some implementations of the second aspect, the first information is the first text content, and the method further includes: in response to acquiring the audio content, the first electronic device converts the audio content is the first text content; the first electronic device sends the first text content to the second electronic device.
结合第二方面,在第二方面的某些实现方式中,该第一电子设备的账号和该第二电子设备的账号相关联。With reference to the second aspect, in some implementations of the second aspect, the account of the first electronic device is associated with the account of the second electronic device.
第三方面,提供了一种文本编辑的方法,该方法应用于第二电子设备,该方法包括:该第二电子设备接收第一电子设备发送的第一信息,该第一信息为该第一电子设备获取的音频内容,或者,该第一信息为该音频内容对应的第一文本内容;该第二电子设备根据该第一信息,显示该第一文本内容;响应于用户对该第一文本内容的编辑操作,该第二电子设备显示第二文本容,该第二文本内容为对该第一文本内容编辑后的文本内容。In a third aspect, a text editing method is provided, the method is applied to a second electronic device, and the method includes: the second electronic device receives first information sent by the first electronic device, where the first information is the first information The audio content obtained by the electronic device, or the first information is the first text content corresponding to the audio content; the second electronic device displays the first text content according to the first information; In the content editing operation, the second electronic device displays the second text content, and the second text content is the edited text content of the first text content.
结合第三方面,在第三方面的某些实现方式中,该方法还包括:该第二电子设备向该第一电子设备发送该第二文本内容。With reference to the third aspect, in some implementations of the third aspect, the method further includes: the second electronic device sending the second text content to the first electronic device.
结合第三方面,在第三方面的某些实现方式中,该编辑操作包括对该第一文本内容的格式修改操作,该方法还包括:该第二电子设备向该第一电子设备发送该第二文本内容的格式信息。With reference to the third aspect, in some implementations of the third aspect, the editing operation includes a format modification operation for the first text content, and the method further includes: the second electronic device sends the first electronic device the first electronic device. 2. The format information of the text content.
结合第三方面,在第三方面的某些实现方式中,该第二文本内容的格式信息包括该第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及该第二文本内容中的回车操作种的一种或者多种。With reference to the third aspect, in some implementations of the third aspect, the format information of the second text content includes the font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
结合第三方面,在第三方面的某些实现方式中,该方法还包括:在接收该第一电子设备发送的该第一信息之前,该第二电子设备接收该第一电子设备发送的查询请求,该查询请求该第二电子设备确定是否具有文本编辑功能;该第二电子设备向该第一电子设备发送响应,该响应用于指示该第二电子设备具备文本编辑功能。With reference to the third aspect, in some implementations of the third aspect, the method further includes: before receiving the first information sent by the first electronic device, receiving, by the second electronic device, a query sent by the first electronic device request, the query requests the second electronic device to determine whether it has a text editing function; the second electronic device sends a response to the first electronic device, where the response is used to indicate that the second electronic device has a text editing function.
结合第三方面,在第三方面的某些实现方式中,该第二电子设备根据该第一信息,显示该第一文本内容,包括:响应于接收到该第一信息,该第二电子设备提示用户是否在该第二电子设备上进行文本编辑;响应于用户确定在该第二电子设备上进行文本编辑的操作,该第二电子设备显示该第一文本内容。With reference to the third aspect, in some implementations of the third aspect, the second electronic device displays the first text content according to the first information, including: in response to receiving the first information, the second electronic device Prompting the user whether to perform text editing on the second electronic device; in response to the user determining that the text editing is performed on the second electronic device, the second electronic device displays the first text content.
结合第三方面,在第三方面的某些实现方式中,该第一信息为该音频内容,该第二电子设备显示该第一文本内容之前,该方法还包括:响应于接收到该音频内容,将该音频内容转化为该第一文本内容。With reference to the third aspect, in some implementations of the third aspect, the first information is the audio content, and before the second electronic device displays the first text content, the method further includes: in response to receiving the audio content , and convert the audio content into the first text content.
结合第三方面,在第三方面的某些实现方式中,该第一电子设备的账号和该第二电子设备的账号相关联。With reference to the third aspect, in some implementations of the third aspect, the account of the first electronic device is associated with the account of the second electronic device.
第四方面,提供了一种装置,该装置包括:获取单元,用于获取音频内容;发送单元,用于向第二电子设备发送第一信息,该第一信息为该音频内容,或者,该第一信息为该音频内容对应的第一文本内容,以使得该第二电子设备根据该第一信息显示该第一文本内容并检测用户对该第一文本内容的编辑操作。In a fourth aspect, an apparatus is provided, the apparatus comprising: an acquiring unit for acquiring audio content; a sending unit for transmitting first information to a second electronic device, where the first information is the audio content, or the The first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects a user's editing operation on the first text content.
第五方面,提供了一种装置,该装置包括:接收单元,用于接收第一电子设备发送的第一信息,该第一信息为该第一电子设备获取的音频内容,或者,该第一信息为该音频内容对应的第一文本内容;显示单元,用于根据该第一信息,显示该第一文本内容;检测单元,用于检测用户对该第一文本内容的编辑操作;显示单元,还用于显示第二文本容,该第二文本内容为对该第一文本内容编辑后的文本内容。In a fifth aspect, an apparatus is provided, the apparatus comprising: a receiving unit configured to receive first information sent by a first electronic device, where the first information is audio content acquired by the first electronic device, or the first information The information is the first text content corresponding to the audio content; the display unit is used to display the first text content according to the first information; the detection unit is used to detect the user's editing operation on the first text content; the display unit, It is also used to display the second text content, where the second text content is the edited text content of the first text content.
第六方面,提供了一种电子设备,包括:一个或多个处理器;存储器;以及一个或多 个计算机程序。其中,一个或多个计算机程序被存储在存储器中,一个或多个计算机程序包括指令。当指令被电子设备执行时,使得电子设备执行上述第二方面中任一项可能的实现中的方法。In a sixth aspect, an electronic device is provided, comprising: one or more processors; a memory; and one or more computer programs. Wherein, one or more computer programs are stored in the memory, the one or more computer programs comprising instructions. When the instructions are executed by the electronic device, the electronic device is caused to execute the method in any one of the possible implementations of the second aspect above.
第七方面,提供了一种电子设备,包括:一个或多个处理器;存储器;以及一个或多个计算机程序。其中,一个或多个计算机程序被存储在存储器中,一个或多个计算机程序包括指令。当指令被电子设备执行时,使得电子设备执行上述第三方面中任一项可能的实现中的方法。In a seventh aspect, an electronic device is provided, comprising: one or more processors; a memory; and one or more computer programs. Wherein, one or more computer programs are stored in the memory, the one or more computer programs comprising instructions. When the instructions are executed by the electronic device, the electronic device is caused to execute the method in any one of the possible implementations of the third aspect above.
第八方面,提供了一种包含指令的计算机程序产品,当所述计算机程序产品在第一电子设备上运行时,使得所述电子设备执行上述第二方面所述的方法;或者,当所述计算机程序产品在第二电子设备上运行时,使得所述电子设备执行上述第三方面所述的方法。In an eighth aspect, there is provided a computer program product containing instructions, when the computer program product is run on a first electronic device, the electronic device causes the electronic device to perform the method described in the second aspect; or, when the computer program product is executed on a first electronic device The computer program product, when running on the second electronic device, causes the electronic device to perform the method of the third aspect above.
第九方面,提供了一种计算机可读存储介质,包括指令,当所述指令在第一电子设备上运行时,使得所述电子设备执行上述第二方面所述的方法;或者,当所述指令在第二电子设备上运行时,使得所述电子设备执行上述第三方面所述的方法。In a ninth aspect, a computer-readable storage medium is provided, comprising instructions, when the instructions are executed on a first electronic device, the electronic device causes the electronic device to perform the method described in the second aspect; or, when the instructions are executed When the instruction is executed on the second electronic device, the electronic device is caused to execute the method described in the third aspect.
第十方面,提供了一种芯片用于执行指令,当所述芯片运行时,所述芯片执行上述第二方面所述的方法;或者,所述芯片执行上述第三方面所述的方法。In a tenth aspect, a chip is provided for executing an instruction, and when the chip is running, the chip executes the method described in the second aspect; or, the chip executes the method described in the third aspect.
图1是本申请实施例提供的一种电子设备的硬件结构示意图。FIG. 1 is a schematic diagram of a hardware structure of an electronic device provided by an embodiment of the present application.
图2是本申请实施例提供的软件结构框图。FIG. 2 is a block diagram of a software structure provided by an embodiment of the present application.
图3是本申请实施例提供的一组图形用户界面。FIG. 3 is a set of graphical user interfaces provided by an embodiment of the present application.
图4是本申请实施例提供的另一组图形用户界面。FIG. 4 is another set of graphical user interfaces provided by an embodiment of the present application.
图5是本申请实施例提供的另一组图形用户界面。FIG. 5 is another set of graphical user interfaces provided by the embodiments of the present application.
图6是本申请实施例提供的另一组图形用户界面。FIG. 6 is another set of graphical user interfaces provided by an embodiment of the present application.
图7是本申请实施例提供的另一组图形用户界面。FIG. 7 is another set of graphical user interfaces provided by the embodiments of the present application.
图8是本申请实施例提供的另一组图形用户界面。FIG. 8 is another set of graphical user interfaces provided by the embodiments of the present application.
图9是本申请实施例提供的另一组图形用户界面。FIG. 9 is another set of graphical user interfaces provided by the embodiments of the present application.
图10是本申请实施例提供的另一组图形用户界面。FIG. 10 is another set of graphical user interfaces provided by the embodiments of the present application.
图11是本申请实施例提供的系统架构的示意图。FIG. 11 is a schematic diagram of a system architecture provided by an embodiment of the present application.
图12是本申请实施例提供的调文本编辑的方法的示意性流程图。FIG. 12 is a schematic flow chart of a method for editing text in accordance with an embodiment of the present application.
图13是本申请实施例提供的另一系统架构的示意图。FIG. 13 is a schematic diagram of another system architecture provided by an embodiment of the present application.
图14是本申请实施例提供的装置的示意性结构图。FIG. 14 is a schematic structural diagram of an apparatus provided by an embodiment of the present application.
图15是本申请实施例提供的装置的另一示意性结构图。FIG. 15 is another schematic structural diagram of an apparatus provided by an embodiment of the present application.
图16是本申请实施例提供的电子设备的另一结构示意图。FIG. 16 is another schematic structural diagram of an electronic device provided by an embodiment of the present application.
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行描述。其中,在本申请实施例的描述中,除非另有说明,“/”表示或的意思,例如,A/B可以表示A或B;本文中的“和/或”仅仅是一种描述关联对象的关联关系,表示可以存在三种关系,例如,A和/或B,可以表示:单独存在A,同时存在A和B,单独存在B这三种情况。另外,在 本申请实施例的描述中,“复数个”或者“多个”是指两个或多于两个。The technical solutions in the embodiments of the present application will be described below with reference to the accompanying drawings in the embodiments of the present application. Wherein, in the description of the embodiments of the present application, unless otherwise stated, “/” means or means, for example, A/B can mean A or B; “and/or” in this document is only a description of the associated object The association relationship of , indicates that there can be three kinds of relationships, for example, A and/or B, can indicate that A exists alone, A and B exist at the same time, and B exists alone. In addition, in the description of the embodiments of the present application, "plurality" or "plurality" refers to two or more than two.
以下,术语“第一”、“第二”仅用于描述目的,而不能理解为指示或暗示相对重要性或者隐含指明所指示的技术特征的数量。由此,限定有“第一”、“第二”的特征可以明示或者隐含地包括一个或者更多个该特征。在本实施例的描述中,除非另有说明,“多个”的含义是两个或两个以上。Hereinafter, the terms "first" and "second" are only used for descriptive purposes, and should not be construed as indicating or implying relative importance or implicitly indicating the number of indicated technical features. Thus, a feature defined as "first" or "second" may expressly or implicitly include one or more of that feature. In the description of this embodiment, unless otherwise specified, "plurality" means two or more.
本申请实施例提供的方法可以应用于手机、平板电脑、可穿戴设备、车载设备、增强现实(augmented reality,AR)/虚拟现实(virtual reality,VR)设备、笔记本电脑、超级移动个人计算机(ultra-mobile personal computer,UMPC)、上网本、个人数字助理(personal digital assistant,PDA)等电子设备上,本申请实施例对电子设备的具体类型不作任何限制。The methods provided in the embodiments of the present application can be applied to mobile phones, tablet computers, wearable devices, in-vehicle devices, augmented reality (AR)/virtual reality (VR) devices, notebook computers, ultra-mobile personal computers (ultra mobile personal computers) -mobile personal computer, UMPC), netbook, personal digital assistant (personal digital assistant, PDA) and other electronic devices, the embodiments of the present application do not impose any restrictions on the specific types of electronic devices.
示例性的,图1示出了电子设备100的结构示意图。电子设备100可以包括处理器110,外部存储器接口120,内部存储器121,通用串行总线(universal serial bus,USB)接口130,充电管理模块140,电源管理模块141,电池142,天线1,天线2,移动通信模块150,无线通信模块160,音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,传感器模块180,按键190,马达191,指示器192,摄像头193,显示屏194,以及用户身份识别(subscriber identification module,SIM)卡接口195等。其中传感器模块180可以包括压力传感器180A,陀螺仪传感器180B,气压传感器180C,磁传感器180D,加速度传感器180E,距离传感器180F,接近光传感器180G,指纹传感器180H,温度传感器180J,触摸传感器180K,环境光传感器180L,骨传导传感器180M等。Exemplarily, FIG. 1 shows a schematic structural diagram of an electronic device 100 . The electronic device 100 may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, a charge management module 140, a power management module 141, a battery 142, an antenna 1, an antenna 2 , mobile communication module 150, wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, headphone jack 170D, sensor module 180, buttons 190, motor 191, indicator 192, camera 193, display screen 194, and User identification (subscriber identification module, SIM) card interface 195 and so on. The sensor module 180 may include a pressure sensor 180A, a gyroscope sensor 180B, an air pressure sensor 180C, a magnetic sensor 180D, an acceleration sensor 180E, a distance sensor 180F, a proximity light sensor 180G, a fingerprint sensor 180H, a temperature sensor 180J, a touch sensor 180K, and ambient light. Sensor 180L, bone conduction sensor 180M, etc.
可以理解的是,本申请实施例示意的结构并不构成对电子设备100的具体限定。在本申请另一些实施例中,电子设备100可以包括比图示更多或更少的部件,或者组合某些部件,或者拆分某些部件,或者不同的部件布置。图示的部件可以以硬件,软件或软件和硬件的组合实现。It can be understood that the structures illustrated in the embodiments of the present application do not constitute a specific limitation on the electronic device 100 . In other embodiments of the present application, the electronic device 100 may include more or less components than shown, or combine some components, or separate some components, or arrange different components. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
处理器110可以包括一个或多个处理单元,例如:处理器110可以包括应用处理器(application processor,AP),调制解调处理器,图形处理器(graphics processing unit,GPU),图像信号处理器(image signal processor,ISP),控制器,存储器,视频编解码器,数字信号处理器(digital signal processor,DSP),基带处理器,和/或神经网络处理器(neural-network processing unit,NPU)等。其中,不同的处理单元可以是独立的器件,也可以集成在一个或多个处理器中。The processor 110 may include one or more processing units, for example, the processor 110 may include an application processor (application processor, AP), a modem processor, a graphics processor (graphics processing unit, GPU), an image signal processor (image signal processor, ISP), controller, memory, video codec, digital signal processor (digital signal processor, DSP), baseband processor, and/or neural-network processing unit (NPU) Wait. Wherein, different processing units may be independent devices, or may be integrated in one or more processors.
其中,控制器可以是电子设备100的神经中枢和指挥中心。控制器可以根据指令操作码和时序信号,产生操作控制信号,完成取指令和执行指令的控制。The controller may be the nerve center and command center of the electronic device 100 . The controller can generate an operation control signal according to the instruction operation code and timing signal, and complete the control of fetching and executing instructions.
处理器110中还可以设置存储器,用于存储指令和数据。在一些实施例中,处理器110中的存储器为高速缓冲存储器。该存储器可以保存处理器110刚用过或循环使用的指令或数据。如果处理器110需要再次使用该指令或数据,可从所述存储器中直接调用。避免了重复存取,减少了处理器110的等待时间,因而提高了系统的效率。A memory may also be provided in the processor 110 for storing instructions and data. In some embodiments, the memory in processor 110 is cache memory. This memory may hold instructions or data that have just been used or recycled by the processor 110 . If the processor 110 needs to use the instruction or data again, it can be called directly from the memory. Repeated accesses are avoided and the latency of the processor 110 is reduced, thereby increasing the efficiency of the system.
在一些实施例中,处理器110可以包括一个或多个接口。接口可以包括集成电路(inter-integrated circuit,I2C)接口,集成电路内置音频(inter-integrated circuit sound,I2S)接口,脉冲编码调制(pulse code modulation,PCM)接口,通用异步收发传输器(universal asynchronous receiver/transmitter,UART)接口,移动产业处理器接口(mobile industry processor interface,MIPI),通用输入输出(general-purpose input/output,GPIO)接口,用户 身份识别(subscriber identity module,SIM)接口,和/或通用串行总线(universal serial bus,USB)接口等。In some embodiments, the processor 110 may include one or more interfaces. The interface may include an integrated circuit (inter-integrated circuit, I2C) interface, an integrated circuit built-in audio (inter-integrated circuit sound, I2S) interface, a pulse code modulation (pulse code modulation, PCM) interface, a universal asynchronous transceiver (universal asynchronous transmitter) receiver/transmitter, UART) interface, mobile industry processor interface (MIPI), general-purpose input/output (GPIO) interface, subscriber identity module (SIM) interface, and / or universal serial bus (universal serial bus, USB) interface, etc.
I2C接口是一种双向同步串行总线,包括一根串行数据线(serial data line,SDA)和一根串行时钟线(derail clock line,SCL)。在一些实施例中,处理器110可以包含多组I2C总线。处理器110可以通过不同的I2C总线接口分别耦合触摸传感器180K,充电器,闪光灯,摄像头193等。例如:处理器110可以通过I2C接口耦合触摸传感器180K,使处理器110与触摸传感器180K通过I2C总线接口通信,实现电子设备100的触摸功能。The I2C interface is a bidirectional synchronous serial bus that includes a serial data line (SDA) and a serial clock line (SCL). In some embodiments, the processor 110 may contain multiple sets of I2C buses. The processor 110 can be respectively coupled to the touch sensor 180K, the charger, the flash, the camera 193 and the like through different I2C bus interfaces. For example, the processor 110 may couple the touch sensor 180K through the I2C interface, so that the processor 110 and the touch sensor 180K communicate with each other through the I2C bus interface, so as to realize the touch function of the electronic device 100 .
I2S接口可以用于音频通信。在一些实施例中,处理器110可以包含多组I2S总线。处理器110可以通过I2S总线与音频模块170耦合,实现处理器110与音频模块170之间的通信。在一些实施例中,音频模块170可以通过I2S接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。The I2S interface can be used for audio communication. In some embodiments, the processor 110 may contain multiple sets of I2S buses. The processor 110 may be coupled with the audio module 170 through an I2S bus to implement communication between the processor 110 and the audio module 170 . In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the I2S interface, so as to realize the function of answering calls through a Bluetooth headset.
PCM接口也可以用于音频通信,将模拟信号抽样,量化和编码。在一些实施例中,音频模块170与无线通信模块160可以通过PCM总线接口耦合。在一些实施例中,音频模块170也可以通过PCM接口向无线通信模块160传递音频信号,实现通过蓝牙耳机接听电话的功能。所述I2S接口和所述PCM接口都可以用于音频通信。The PCM interface can also be used for audio communications, sampling, quantizing and encoding analog signals. In some embodiments, the audio module 170 and the wireless communication module 160 may be coupled through a PCM bus interface. In some embodiments, the audio module 170 can also transmit audio signals to the wireless communication module 160 through the PCM interface, so as to realize the function of answering calls through the Bluetooth headset. Both the I2S interface and the PCM interface can be used for audio communication.
UART接口是一种通用串行数据总线,用于异步通信。该总线可以为双向通信总线。它将要传输的数据在串行通信与并行通信之间转换。在一些实施例中,UART接口通常被用于连接处理器110与无线通信模块160。例如:处理器110通过UART接口与无线通信模块160中的蓝牙模块通信,实现蓝牙功能。在一些实施例中,音频模块170可以通过UART接口向无线通信模块160传递音频信号,实现通过蓝牙耳机播放音乐的功能。The UART interface is a universal serial data bus used for asynchronous communication. The bus may be a bidirectional communication bus. It converts the data to be transmitted between serial communication and parallel communication. In some embodiments, a UART interface is typically used to connect the processor 110 with the wireless communication module 160 . For example, the processor 110 communicates with the Bluetooth module in the wireless communication module 160 through the UART interface to implement the Bluetooth function. In some embodiments, the audio module 170 can transmit audio signals to the wireless communication module 160 through the UART interface, so as to realize the function of playing music through the Bluetooth headset.
MIPI接口可以被用于连接处理器110与显示屏194,摄像头193等外围器件。MIPI接口包括摄像头串行接口(camera serial interface,CSI),显示屏串行接口(display serial interface,DSI)等。在一些实施例中,处理器110和摄像头193通过CSI接口通信,实现电子设备100的拍摄功能。处理器110和显示屏194通过DSI接口通信,实现电子设备100的显示功能。The MIPI interface can be used to connect the processor 110 with peripheral devices such as the display screen 194 and the camera 193 . MIPI interfaces include camera serial interface (CSI), display serial interface (DSI), etc. In some embodiments, the processor 110 communicates with the camera 193 through a CSI interface, so as to realize the photographing function of the electronic device 100 . The processor 110 communicates with the display screen 194 through the DSI interface to implement the display function of the electronic device 100 .
GPIO接口可以通过软件配置。GPIO接口可以被配置为控制信号,也可被配置为数据信号。在一些实施例中,GPIO接口可以用于连接处理器110与摄像头193,显示屏194,无线通信模块160,音频模块170,传感器模块180等。GPIO接口还可以被配置为I2C接口,I2S接口,UART接口,MIPI接口等。The GPIO interface can be configured by software. The GPIO interface can be configured as a control signal or as a data signal. In some embodiments, the GPIO interface may be used to connect the processor 110 with the camera 193, the display screen 194, the wireless communication module 160, the audio module 170, the sensor module 180, and the like. The GPIO interface can also be configured as I2C interface, I2S interface, UART interface, MIPI interface, etc.
USB接口130是符合USB标准规范的接口,具体可以是Mini USB接口,Micro USB接口,USB Type C接口等。USB接口130可以用于连接充电器为电子设备100充电,也可以用于电子设备100与外围设备之间传输数据。也可以用于连接耳机,通过耳机播放音频。该接口还可以用于连接其他电子设备,例如AR设备等。The USB interface 130 is an interface that conforms to the USB standard specification, and may specifically be a Mini USB interface, a Micro USB interface, a USB Type C interface, and the like. The USB interface 130 can be used to connect a charger to charge the electronic device 100, and can also be used to transmit data between the electronic device 100 and peripheral devices. It can also be used to connect headphones to play audio through the headphones. The interface can also be used to connect other electronic devices, such as AR devices.
可以理解的是,本申请实施例示意的各模块间的接口连接关系,只是示意性说明,并不构成对电子设备100的结构限定。在本申请另一些实施例中,电子设备100也可以采用上述实施例中不同的接口连接方式,或多种接口连接方式的组合。It can be understood that the interface connection relationship between the modules illustrated in the embodiments of the present application is only a schematic illustration, and does not constitute a structural limitation of the electronic device 100 . In other embodiments of the present application, the electronic device 100 may also adopt different interface connection manners in the foregoing embodiments, or a combination of multiple interface connection manners.
充电管理模块140用于从充电器接收充电输入。其中,充电器可以是无线充电器,也可以是有线充电器。在一些有线充电的实施例中,充电管理模块140可以通过USB接口130接收有线充电器的充电输入。在一些无线充电的实施例中,充电管理模块140可以通 过电子设备100的无线充电线圈接收无线充电输入。充电管理模块140为电池142充电的同时,还可以通过电源管理模块141为电子设备供电。The charging management module 140 is used to receive charging input from the charger. The charger may be a wireless charger or a wired charger. In some wired charging embodiments, the charging management module 140 may receive charging input from the wired charger through the USB interface 130 . In some wireless charging embodiments, the charging management module 140 may receive wireless charging input through the wireless charging coil of the electronic device 100. While the charging management module 140 charges the battery 142 , it can also supply power to the electronic device through the power management module 141 .
电源管理模块141用于连接电池142,充电管理模块140与处理器110。电源管理模块141接收电池142和/或充电管理模块140的输入,为处理器110,内部存储器121,外部存储器,显示屏194,摄像头193,和无线通信模块160等供电。电源管理模块141还可以用于监测电池容量,电池循环次数,电池健康状态(漏电,阻抗)等参数。在其他一些实施例中,电源管理模块141也可以设置于处理器110中。在另一些实施例中,电源管理模块141和充电管理模块140也可以设置于同一个器件中。The power management module 141 is used for connecting the battery 142 , the charging management module 140 and the processor 110 . The power management module 141 receives input from the battery 142 and/or the charging management module 140 and supplies power to the processor 110 , the internal memory 121 , the external memory, the display screen 194 , the camera 193 , and the wireless communication module 160 . The power management module 141 can also be used to monitor parameters such as battery capacity, battery cycle times, battery health status (leakage, impedance). In some other embodiments, the power management module 141 may also be provided in the processor 110 . In other embodiments, the power management module 141 and the charging management module 140 may also be provided in the same device.
电子设备100的无线通信功能可以通过天线1,天线2,移动通信模块150,无线通信模块160,调制解调处理器以及基带处理器等实现。The wireless communication function of the electronic device 100 may be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modulation and demodulation processor, the baseband processor, and the like.
天线1和天线2用于发射和接收电磁波信号。电子设备100中的每个天线可用于覆盖单个或多个通信频带。不同的天线还可以复用,以提高天线的利用率。例如:可以将天线1复用为无线局域网的分集天线。在另外一些实施例中,天线可以和调谐开关结合使用。 Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in electronic device 100 may be used to cover a single or multiple communication frequency bands. Different antennas can also be reused to improve antenna utilization. For example, the antenna 1 can be multiplexed as a diversity antenna of the wireless local area network. In other embodiments, the antenna may be used in conjunction with a tuning switch.
移动通信模块150可以提供应用在电子设备100上的包括2G/3G/4G/5G等无线通信的解决方案。移动通信模块150可以包括至少一个滤波器,开关,功率放大器,低噪声放大器(low noise amplifier,LNA)等。移动通信模块150可以由天线1接收电磁波,并对接收的电磁波进行滤波,放大等处理,传送至调制解调处理器进行解调。移动通信模块150还可以对经调制解调处理器调制后的信号放大,经天线1转为电磁波辐射出去。在一些实施例中,移动通信模块150的至少部分功能模块可以被设置于处理器110中。在一些实施例中,移动通信模块150的至少部分功能模块可以与处理器110的至少部分模块被设置在同一个器件中。The mobile communication module 150 may provide wireless communication solutions including 2G/3G/4G/5G etc. applied on the electronic device 100 . The mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (LNA) and the like. The mobile communication module 150 can receive electromagnetic waves from the antenna 1, filter and amplify the received electromagnetic waves, and transmit them to the modulation and demodulation processor for demodulation. The mobile communication module 150 can also amplify the signal modulated by the modulation and demodulation processor, and then turn it into an electromagnetic wave for radiation through the antenna 1 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the processor 110 . In some embodiments, at least part of the functional modules of the mobile communication module 150 may be provided in the same device as at least part of the modules of the processor 110 .
调制解调处理器可以包括调制器和解调器。其中,调制器用于将待发送的低频基带信号调制成中高频信号。解调器用于将接收的电磁波信号解调为低频基带信号。随后解调器将解调得到的低频基带信号传送至基带处理器处理。低频基带信号经基带处理器处理后,被传递给应用处理器。应用处理器通过音频设备(不限于扬声器170A,受话器170B等)输出声音信号,或通过显示屏194显示图像或视频。在一些实施例中,调制解调处理器可以是独立的器件。在另一些实施例中,调制解调处理器可以独立于处理器110,与移动通信模块150或其他功能模块设置在同一个器件中。The modem processor may include a modulator and a demodulator. Wherein, the modulator is used to modulate the low frequency baseband signal to be sent into a medium and high frequency signal. The demodulator is used to demodulate the received electromagnetic wave signal into a low frequency baseband signal. Then the demodulator transmits the demodulated low-frequency baseband signal to the baseband processor for processing. The low frequency baseband signal is processed by the baseband processor and passed to the application processor. The application processor outputs sound signals through audio devices (not limited to the speaker 170A, the receiver 170B, etc.), or displays images or videos through the display screen 194 . In some embodiments, the modem processor may be a stand-alone device. In other embodiments, the modem processor may be independent of the processor 110, and may be provided in the same device as the mobile communication module 150 or other functional modules.
无线通信模块160可以提供应用在电子设备100上的包括无线局域网(wireless local area networks,WLAN)(如无线保真(wireless fidelity,Wi-Fi)网络),蓝牙(Bluetooth,BT),全球导航卫星系统(global navigation satellite system,GNSS),调频(frequency modulation,FM),近距离无线通信技术(near field communication,NFC),红外技术(infrared,IR)等无线通信的解决方案。无线通信模块160可以是集成至少一个通信处理模块的一个或多个器件。无线通信模块160经由天线2接收电磁波,将电磁波信号调频以及滤波处理,将处理后的信号发送到处理器110。无线通信模块160还可以从处理器110接收待发送的信号,对其进行调频,放大,经天线2转为电磁波辐射出去。The wireless communication module 160 can provide applications on the electronic device 100 including wireless local area networks (WLAN) (such as wireless fidelity (Wi-Fi) networks), Bluetooth (BT), global navigation satellites System (global navigation satellite system, GNSS), frequency modulation (frequency modulation, FM), near field communication technology (near field communication, NFC), infrared technology (infrared, IR) and other wireless communication solutions. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 . The wireless communication module 160 can also receive the signal to be sent from the processor 110 , perform frequency modulation on it, amplify it, and convert it into electromagnetic waves for radiation through the antenna 2 .
在一些实施例中,电子设备100的天线1和移动通信模块150耦合,天线2和无线通信模块160耦合,使得电子设备100可以通过无线通信技术与网络以及其他设备通信。所述无线通信技术可以包括全球移动通讯系统(global system for mobile communications, GSM),通用分组无线服务(general packet radio service,GPRS),码分多址接入(code division multiple access,CDMA),宽带码分多址(wideband code division multiple access,WCDMA),时分码分多址(time-division code division multiple access,TD-SCDMA),长期演进(long term evolution,LTE),BT,GNSS,WLAN,NFC,FM,和/或IR技术等。所述GNSS可以包括全球卫星定位系统(global positioning system,GPS),全球导航卫星系统(global navigation satellite system,GLONASS),北斗卫星导航系统(beidou navigation satellite system,BDS),准天顶卫星系统(quasi-zenith satellite system,QZSS)和/或星基增强系统(satellite based augmentation systems,SBAS)。In some embodiments, the antenna 1 of the electronic device 100 is coupled with the mobile communication module 150, and the antenna 2 is coupled with the wireless communication module 160, so that the electronic device 100 can communicate with the network and other devices through wireless communication technology. The wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), broadband Code Division Multiple Access (WCDMA), Time Division Code Division Multiple Access (TD-SCDMA), Long Term Evolution (LTE), BT, GNSS, WLAN, NFC , FM, and/or IR technology, etc. The GNSS may include a global positioning system (global positioning system, GPS), a global navigation satellite system (GLONASS), a Beidou navigation satellite system (BDS), a quasi-zenith satellite system (quasi -zenith satellite system, QZSS) and/or satellite based augmentation systems (SBAS).
电子设备100通过GPU,显示屏194,以及应用处理器等实现显示功能。GPU为图像处理的微处理器,连接显示屏194和应用处理器。GPU用于执行数学和几何计算,用于图形渲染。处理器110可包括一个或多个GPU,其执行程序指令以生成或改变显示信息。The electronic device 100 implements a display function through a GPU, a display screen 194, an application processor, and the like. The GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. The GPU is used to perform mathematical and geometric calculations for graphics rendering. Processor 110 may include one or more GPUs that execute program instructions to generate or alter display information.
显示屏194用于显示图像,视频等。显示屏194包括显示面板。显示面板可以采用液晶显示屏(liquid crystal display,LCD),有机发光二极管(organic light-emitting diode,OLED),有源矩阵有机发光二极体或主动矩阵有机发光二极体(active-matrix organic light emitting diode的,AMOLED),柔性发光二极管(flex light-emitting diode,FLED),Miniled,MicroLed,Micro-oLed,量子点发光二极管(quantum dot light emitting diodes,QLED)等。在一些实施例中,电子设备100可以包括1个或N个显示屏194,N为大于1的正整数。Display screen 194 is used to display images, videos, and the like. Display screen 194 includes a display panel. The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode or an active-matrix organic light-emitting diode (active-matrix organic light). emitting diode, AMOLED), flexible light-emitting diode (flex light-emitting diode, FLED), Miniled, MicroLed, Micro-oLed, quantum dot light-emitting diode (quantum dot light emitting diodes, QLED) and so on. In some embodiments, the electronic device 100 may include one or N display screens 194 , where N is a positive integer greater than one.
电子设备100可以通过ISP,摄像头193,视频编解码器,GPU,显示屏194以及应用处理器等实现拍摄功能。The electronic device 100 may implement a shooting function through an ISP, a camera 193, a video codec, a GPU, a display screen 194, an application processor, and the like.
ISP用于处理摄像头193反馈的数据。例如,拍照时,打开快门,光线通过镜头被传递到摄像头感光元件上,光信号转换为电信号,摄像头感光元件将所述电信号传递给ISP处理,转化为肉眼可见的图像。ISP还可以对图像的噪点,亮度,肤色进行算法优化。ISP还可以对拍摄场景的曝光,色温等参数优化。在一些实施例中,ISP可以设置在摄像头193中。The ISP is used to process the data fed back by the camera 193 . For example, when taking a photo, the shutter is opened, the light is transmitted to the camera photosensitive element through the lens, the light signal is converted into an electrical signal, and the camera photosensitive element transmits the electrical signal to the ISP for processing, and converts it into an image visible to the naked eye. ISP can also perform algorithm optimization on image noise, brightness, and skin tone. ISP can also optimize the exposure, color temperature and other parameters of the shooting scene. In some embodiments, the ISP may be provided in the camera 193 .
摄像头193用于捕获静态图像或视频。物体通过镜头生成光学图像投射到感光元件。感光元件可以是电荷耦合器件(charge coupled device,CCD)或互补金属氧化物半导体(complementary metal-oxide-semiconductor,CMOS)光电晶体管。感光元件把光信号转换成电信号,之后将电信号传递给ISP转换成数字图像信号。ISP将数字图像信号输出到DSP加工处理。DSP将数字图像信号转换成标准的RGB,YUV等格式的图像信号。在一些实施例中,电子设备100可以包括1个或N个摄像头193,N为大于1的正整数。The camera 193 is used to capture still images or video. The object is projected through the lens to generate an optical image onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. DSP converts digital image signals into standard RGB, YUV and other formats of image signals. In some embodiments, the electronic device 100 may include 1 or N cameras 193 , where N is a positive integer greater than 1.
数字信号处理器用于处理数字信号,除了可以处理数字图像信号,还可以处理其他数字信号。例如,当电子设备100在频点选择时,数字信号处理器用于对频点能量进行傅里叶变换等。A digital signal processor is used to process digital signals, in addition to processing digital image signals, it can also process other digital signals. For example, when the electronic device 100 selects a frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy and so on.
视频编解码器用于对数字视频压缩或解压缩。电子设备100可以支持一种或多种视频编解码器。这样,电子设备100可以播放或录制多种编码格式的视频,例如:动态图像专家组(moving picture experts group,MPEG)1,MPEG2,MPEG3,MPEG4等。Video codecs are used to compress or decompress digital video. The electronic device 100 may support one or more video codecs. In this way, the electronic device 100 can play or record videos of various encoding formats, such as: Moving Picture Experts Group (moving picture experts group, MPEG) 1, MPEG2, MPEG3, MPEG4 and so on.
NPU为神经网络(neural-network,NN)计算处理器,通过借鉴生物神经网络结构,例如借鉴人脑神经元之间传递模式,对输入信息快速处理,还可以不断的自学习。通过NPU 可以实现电子设备100的智能认知等应用,例如:图像识别,人脸识别,语音识别,文本理解等。The NPU is a neural-network (NN) computing processor. By drawing on the structure of biological neural networks, such as the transfer mode between neurons in the human brain, it can quickly process the input information and can continuously learn by itself. Applications such as intelligent cognition of the electronic device 100 can be implemented through the NPU, such as image recognition, face recognition, speech recognition, text understanding, and the like.
外部存储器接口120可以用于连接外部存储卡,例如Micro SD卡,实现扩展电子设备100的存储能力。外部存储卡通过外部存储器接口120与处理器110通信,实现数据存储功能。例如将音乐,视频等文件保存在外部存储卡中。The external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the electronic device 100 . The external memory card communicates with the processor 110 through the external memory interface 120 to realize the data storage function. For example to save files like music, video etc in external memory card.
内部存储器121可以用于存储计算机可执行程序代码,所述可执行程序代码包括指令。处理器110通过运行存储在内部存储器121的指令,从而执行电子设备100的各种功能应用以及数据处理。内部存储器121可以包括存储程序区和存储数据区。其中,存储程序区可存储操作系统,至少一个功能所需的应用程序(比如声音播放功能,图像播放功能等)等。存储数据区可存储电子设备100使用过程中所创建的数据(比如音频数据,电话本等)等。此外,内部存储器121可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件,闪存器件,通用闪存存储器(universal flash storage,UFS)等。Internal memory 121 may be used to store computer executable program code, which includes instructions. The processor 110 executes various functional applications and data processing of the electronic device 100 by executing the instructions stored in the internal memory 121 . The internal memory 121 may include a storage program area and a storage data area. The storage program area can store an operating system, an application program required for at least one function (such as a sound playback function, an image playback function, etc.), and the like. The storage data area may store data (such as audio data, phone book, etc.) created during the use of the electronic device 100 and the like. In addition, the internal memory 121 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (UFS), and the like.
电子设备100可以通过音频模块170,扬声器170A,受话器170B,麦克风170C,耳机接口170D,以及应用处理器等实现音频功能。例如音乐播放,录音等。The electronic device 100 may implement audio functions through an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, an earphone interface 170D, an application processor, and the like. Such as music playback, recording, etc.
音频模块170用于将数字音频信息转换成模拟音频信号输出,也用于将模拟音频输入转换为数字音频信号。音频模块170还可以用于对音频信号编码和解码。在一些实施例中,音频模块170可以设置于处理器110中,或将音频模块170的部分功能模块设置于处理器110中。The audio module 170 is used for converting digital audio information into analog audio signal output, and also for converting analog audio input into digital audio signal. Audio module 170 may also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110 , or some functional modules of the audio module 170 may be provided in the processor 110 .
扬声器170A,也称“喇叭”,用于将音频电信号转换为声音信号。电子设备100可以通过扬声器170A收听音乐,或收听免提通话。Speaker 170A, also referred to as a "speaker", is used to convert audio electrical signals into sound signals. The electronic device 100 can listen to music through the speaker 170A, or listen to a hands-free call.
受话器170B,也称“听筒”,用于将音频电信号转换成声音信号。当电子设备100接听电话或语音信息时,可以通过将受话器170B靠近人耳接听语音。The receiver 170B, also referred to as "earpiece", is used to convert audio electrical signals into sound signals. When the electronic device 100 answers a call or a voice message, the voice can be answered by placing the receiver 170B close to the human ear.
麦克风170C,也称“话筒”,“传声器”,用于将声音信号转换为电信号。当拨打电话或发送语音信息时,用户可以通过人嘴靠近麦克风170C发声,将声音信号输入到麦克风170C。电子设备100可以设置至少一个麦克风170C。在另一些实施例中,电子设备100可以设置两个麦克风170C,除了采集声音信号,还可以实现降噪功能。在另一些实施例中,电子设备100还可以设置三个,四个或更多麦克风170C,实现采集声音信号,降噪,还可以识别声音来源,实现定向录音功能等。The microphone 170C, also called "microphone" or "microphone", is used to convert sound signals into electrical signals. When making a call or sending a voice message, the user can make a sound by approaching the microphone 170C through a human mouth, and input the sound signal into the microphone 170C. The electronic device 100 may be provided with at least one microphone 170C. In other embodiments, the electronic device 100 may be provided with two microphones 170C, which can implement a noise reduction function in addition to collecting sound signals. In other embodiments, the electronic device 100 may further be provided with three, four or more microphones 170C to collect sound signals, reduce noise, identify sound sources, and implement directional recording functions.
耳机接口170D用于连接有线耳机。耳机接口170D可以是USB接口130,也可以是3.5mm的开放移动电子设备平台(open mobile terminal platform,OMTP)标准接口,美国蜂窝电信工业协会(cellular telecommunications industry association of the USA,CTIA)标准接口。The earphone jack 170D is used to connect wired earphones. The earphone interface 170D can be the USB interface 130, or can be a 3.5mm open mobile terminal platform (OMTP) standard interface, a cellular telecommunications industry association of the USA (CTIA) standard interface.
压力传感器180A用于感受压力信号,可以将压力信号转换成电信号。在一些实施例中,压力传感器180A可以设置于显示屏194。压力传感器180A的种类很多,如电阻式压力传感器,电感式压力传感器,电容式压力传感器等。电容式压力传感器可以是包括至少两个具有导电材料的平行板。当有力作用于压力传感器180A,电极之间的电容改变。电子设备100根据电容的变化确定压力的强度。当有触摸操作作用于显示屏194,电子设备100根据压力传感器180A检测所述触摸操作强度。电子设备100也可以根据压力传感器180A的检测信号计算触摸的位置。在一些实施例中,作用于相同触摸位置,但不同触摸 操作强度的触摸操作,可以对应不同的操作指令。例如:当有触摸操作强度小于第一压力阈值的触摸操作作用于短消息应用图标时,执行查看短消息的指令。当有触摸操作强度大于或等于第一压力阈值的触摸操作作用于短消息应用图标时,执行新建短消息的指令。The pressure sensor 180A is used to sense pressure signals, and can convert the pressure signals into electrical signals. In some embodiments, the pressure sensor 180A may be provided on the display screen 194 . There are many types of pressure sensors 180A, such as resistive pressure sensors, inductive pressure sensors, capacitive pressure sensors, and the like. The capacitive pressure sensor may be comprised of at least two parallel plates of conductive material. When a force is applied to the pressure sensor 180A, the capacitance between the electrodes changes. The electronic device 100 determines the intensity of the pressure according to the change in capacitance. When a touch operation acts on the display screen 194, the electronic device 100 detects the intensity of the touch operation according to the pressure sensor 180A. The electronic device 100 may also calculate the touched position according to the detection signal of the pressure sensor 180A. In some embodiments, touch operations acting on the same touch position but with different touch operation intensities may correspond to different operation instructions. For example, when a touch operation whose intensity is less than the first pressure threshold acts on the short message application icon, the instruction for viewing the short message is executed. When a touch operation with a touch operation intensity greater than or equal to the first pressure threshold acts on the short message application icon, the instruction to create a new short message is executed.
陀螺仪传感器180B可以用于确定电子设备100的运动姿态。在一些实施例中,可以通过陀螺仪传感器180B确定电子设备100围绕三个轴(即,x,y和z轴)的角速度。陀螺仪传感器180B可以用于拍摄防抖。示例性的,当按下快门,陀螺仪传感器180B检测电子设备100抖动的角度,根据角度计算出镜头模组需要补偿的距离,让镜头通过反向运动抵消电子设备100的抖动,实现防抖。陀螺仪传感器180B还可以用于导航,体感游戏场景。The gyro sensor 180B may be used to determine the motion attitude of the electronic device 100 . In some embodiments, the angular velocity of electronic device 100 about three axes (ie, x, y, and z axes) may be determined by gyro sensor 180B. The gyro sensor 180B can be used for image stabilization. Exemplarily, when the shutter is pressed, the gyro sensor 180B detects the angle at which the electronic device 100 shakes, calculates the distance that the lens module needs to compensate for according to the angle, and allows the lens to counteract the shake of the electronic device 100 through reverse motion to achieve anti-shake. The gyro sensor 180B can also be used for navigation and somatosensory game scenarios.
气压传感器180C用于测量气压。在一些实施例中,电子设备100通过气压传感器180C测得的气压值计算海拔高度,辅助定位和导航。The air pressure sensor 180C is used to measure air pressure. In some embodiments, the electronic device 100 calculates the altitude through the air pressure value measured by the air pressure sensor 180C to assist in positioning and navigation.
磁传感器180D包括霍尔传感器。电子设备100可以利用磁传感器180D检测翻盖皮套的开合。在一些实施例中,当电子设备100是翻盖机时,电子设备100可以根据磁传感器180D检测翻盖的开合。进而根据检测到的皮套的开合状态或翻盖的开合状态,设置翻盖自动解锁等特性。The magnetic sensor 180D includes a Hall sensor. The electronic device 100 can detect the opening and closing of the flip holster using the magnetic sensor 180D. In some embodiments, when the electronic device 100 is a flip machine, the electronic device 100 can detect the opening and closing of the flip according to the magnetic sensor 180D. Further, according to the detected opening and closing state of the leather case or the opening and closing state of the flip cover, characteristics such as automatic unlocking of the flip cover are set.
加速度传感器180E可检测电子设备100在各个方向上(一般为三轴)加速度的大小。当电子设备100静止时可检测出重力的大小及方向。还可以用于识别电子设备姿态,应用于横竖屏切换,计步器等应用。The acceleration sensor 180E can detect the magnitude of the acceleration of the electronic device 100 in various directions (generally three axes). The magnitude and direction of gravity can be detected when the electronic device 100 is stationary. It can also be used to identify the posture of electronic devices, and can be used in applications such as horizontal and vertical screen switching, pedometers, etc.
距离传感器180F,用于测量距离。电子设备100可以通过红外或激光测量距离。在一些实施例中,拍摄场景,电子设备100可以利用距离传感器180F测距以实现快速对焦。Distance sensor 180F for measuring distance. The electronic device 100 can measure the distance through infrared or laser. In some embodiments, when shooting a scene, the electronic device 100 can use the distance sensor 180F to measure the distance to achieve fast focusing.
接近光传感器180G可以包括例如发光二极管(LED)和光检测器,例如光电二极管。发光二极管可以是红外发光二极管。电子设备100通过发光二极管向外发射红外光。电子设备100使用光电二极管检测来自附近物体的红外反射光。当检测到充分的反射光时,可以确定电子设备100附近有物体。当检测到不充分的反射光时,电子设备100可以确定电子设备100附近没有物体。电子设备100可以利用接近光传感器180G检测用户手持电子设备100贴近耳朵通话,以便自动熄灭屏幕达到省电的目的。接近光传感器180G也可用于皮套模式,口袋模式自动解锁与锁屏。Proximity light sensor 180G may include, for example, light emitting diodes (LEDs) and light detectors, such as photodiodes. The light emitting diodes may be infrared light emitting diodes. The electronic device 100 emits infrared light to the outside through the light emitting diode. Electronic device 100 uses photodiodes to detect infrared reflected light from nearby objects. When sufficient reflected light is detected, it can be determined that there is an object near the electronic device 100 . When insufficient reflected light is detected, the electronic device 100 may determine that there is no object near the electronic device 100 . The electronic device 100 can use the proximity light sensor 180G to detect that the user holds the electronic device 100 close to the ear to talk, so as to automatically turn off the screen to save power. Proximity light sensor 180G can also be used in holster mode, pocket mode automatically unlocks and locks the screen.
环境光传感器180L用于感知环境光亮度。电子设备100可以根据感知的环境光亮度自适应调节显示屏194亮度。环境光传感器180L也可用于拍照时自动调节白平衡。环境光传感器180L还可以与接近光传感器180G配合,检测电子设备100是否在口袋里,以防误触。The ambient light sensor 180L is used to sense ambient light brightness. The electronic device 100 can adaptively adjust the brightness of the display screen 194 according to the perceived ambient light brightness. The ambient light sensor 180L can also be used to automatically adjust the white balance when taking pictures. The ambient light sensor 180L can also cooperate with the proximity light sensor 180G to detect whether the electronic device 100 is in a pocket, so as to prevent accidental touch.
指纹传感器180H用于采集指纹。电子设备100可以利用采集的指纹特性实现指纹解锁,访问应用锁,指纹拍照,指纹接听来电等。The fingerprint sensor 180H is used to collect fingerprints. The electronic device 100 can use the collected fingerprint characteristics to realize fingerprint unlocking, accessing application locks, taking pictures with fingerprints, answering incoming calls with fingerprints, and the like.
温度传感器180J用于检测温度。在一些实施例中,电子设备100利用温度传感器180J检测的温度,执行温度处理策略。例如,当温度传感器180J上报的温度超过阈值,电子设备100执行降低位于温度传感器180J附近的处理器的性能,以便降低功耗实施热保护。在另一些实施例中,当温度低于另一阈值时,电子设备100对电池142加热,以避免低温导致电子设备100异常关机。在其他一些实施例中,当温度低于又一阈值时,电子设备100对电池142的输出电压执行升压,以避免低温导致的异常关机。The temperature sensor 180J is used to detect the temperature. In some embodiments, the electronic device 100 uses the temperature detected by the temperature sensor 180J to execute a temperature processing strategy. For example, when the temperature reported by the temperature sensor 180J exceeds a threshold value, the electronic device 100 reduces the performance of the processor located near the temperature sensor 180J in order to reduce power consumption and implement thermal protection. In other embodiments, when the temperature is lower than another threshold, the electronic device 100 heats the battery 142 to avoid abnormal shutdown of the electronic device 100 caused by the low temperature. In some other embodiments, when the temperature is lower than another threshold, the electronic device 100 boosts the output voltage of the battery 142 to avoid abnormal shutdown caused by low temperature.
触摸传感器180K,也称“触控面板”。触摸传感器180K可以设置于显示屏194,由触摸传感器180K与显示屏194组成触摸屏,也称“触控屏”。触摸传感器180K用于检测作用于其上或附近的触摸操作。触摸传感器可以将检测到的触摸操作传递给应用处理器,以确定触摸事件类型。可以通过显示屏194提供与触摸操作相关的视觉输出。在另一些实施例中,触摸传感器180K也可以设置于电子设备100的表面,与显示屏194所处的位置不同。Touch sensor 180K, also called "touch panel". The touch sensor 180K may be disposed on the display screen 194 , and the touch sensor 180K and the display screen 194 form a touch screen, also called a “touch screen”. The touch sensor 180K is used to detect a touch operation on or near it. The touch sensor can pass the detected touch operation to the application processor to determine the type of touch event. Visual output related to touch operations may be provided through display screen 194 . In other embodiments, the touch sensor 180K may also be disposed on the surface of the electronic device 100 , which is different from the location where the display screen 194 is located.
骨传导传感器180M可以获取振动信号。在一些实施例中,骨传导传感器180M可以获取人体声部振动骨块的振动信号。骨传导传感器180M也可以接触人体脉搏,接收血压跳动信号。在一些实施例中,骨传导传感器180M也可以设置于耳机中,结合成骨传导耳机。音频模块170可以基于所述骨传导传感器180M获取的声部振动骨块的振动信号,解析出语音信号,实现语音功能。应用处理器可以基于所述骨传导传感器180M获取的血压跳动信号解析心率信息,实现心率检测功能。The bone conduction sensor 180M can acquire vibration signals. In some embodiments, the bone conduction sensor 180M can acquire the vibration signal of the vibrating bone mass of the human voice. The bone conduction sensor 180M can also contact the pulse of the human body and receive the blood pressure beating signal. In some embodiments, the bone conduction sensor 180M can also be disposed in the earphone, combined with the bone conduction earphone. The audio module 170 can analyze the voice signal based on the vibration signal of the vocal vibration bone block obtained by the bone conduction sensor 180M, so as to realize the voice function. The application processor can analyze the heart rate information based on the blood pressure beat signal obtained by the bone conduction sensor 180M, so as to realize the function of heart rate detection.
按键190包括开机键,音量键等。按键190可以是机械按键。也可以是触摸式按键。电子设备100可以接收按键输入,产生与电子设备100的用户设置以及功能控制有关的键信号输入。The keys 190 include a power-on key, a volume key, and the like. Keys 190 may be mechanical keys. It can also be a touch key. The electronic device 100 may receive key inputs and generate key signal inputs related to user settings and function control of the electronic device 100 .
马达191可以产生振动提示。马达191可以用于来电振动提示,也可以用于触摸振动反馈。例如,作用于不同应用(例如拍照,音频播放等)的触摸操作,可以对应不同的振动反馈效果。作用于显示屏194不同区域的触摸操作,马达191也可对应不同的振动反馈效果。不同的应用场景(例如:时间提醒,接收信息,闹钟,游戏等)也可以对应不同的振动反馈效果。触摸振动反馈效果还可以支持自定义。Motor 191 can generate vibrating cues. The motor 191 can be used for vibrating alerts for incoming calls, and can also be used for touch vibration feedback. For example, touch operations acting on different applications (such as taking pictures, playing audio, etc.) can correspond to different vibration feedback effects. The motor 191 can also correspond to different vibration feedback effects for touch operations on different areas of the display screen 194 . Different application scenarios (for example: time reminder, receiving information, alarm clock, games, etc.) can also correspond to different vibration feedback effects. The touch vibration feedback effect can also support customization.
指示器192可以是指示灯,可以用于指示充电状态,电量变化,也可以用于指示消息,未接来电,通知等。The indicator 192 can be an indicator light, which can be used to indicate the charging state, the change of the power, and can also be used to indicate a message, a missed call, a notification, and the like.
SIM卡接口195用于连接SIM卡。SIM卡可以通过插入SIM卡接口195,或从SIM卡接口195拔出,实现和电子设备100的接触和分离。电子设备100可以支持1个或N个SIM卡接口,N为大于1的正整数。SIM卡接口195可以支持Nano SIM卡,Micro SIM卡,SIM卡等。同一个SIM卡接口195可以同时插入多张卡。所述多张卡的类型可以相同,也可以不同。SIM卡接口195也可以兼容不同类型的SIM卡。SIM卡接口195也可以兼容外部存储卡。电子设备100通过SIM卡和网络交互,实现通话以及数据通信等功能。在一些实施例中,电子设备100采用嵌入式SIM(embedded-SIM,eSIM)卡,即:嵌入式SIM卡。eSIM卡可以嵌在电子设备100中,不能和电子设备100分离。The SIM card interface 195 is used to connect a SIM card. The SIM card can be contacted and separated from the electronic device 100 by inserting into the SIM card interface 195 or pulling out from the SIM card interface 195 . The electronic device 100 may support 1 or N SIM card interfaces, where N is a positive integer greater than 1. The SIM card interface 195 can support Nano SIM card, Micro SIM card, SIM card and so on. Multiple cards can be inserted into the same SIM card interface 195 at the same time. The types of the plurality of cards may be the same or different. The SIM card interface 195 can also be compatible with different types of SIM cards. The SIM card interface 195 is also compatible with external memory cards. The electronic device 100 interacts with the network through the SIM card to implement functions such as call and data communication. In some embodiments, the electronic device 100 adopts an embedded SIM (embedded-SIM, eSIM) card, that is, an embedded SIM card. The eSIM card can be embedded in the electronic device 100 and cannot be separated from the electronic device 100 .
应理解,本申请实施例中的电话卡包括但不限于SIM卡、eSIM卡、全球用户识别卡(universal subscriber identity module,USIM)、通用集成电话卡(universal integrated circuit card,UICC)等等。It should be understood that the telephone cards in the embodiments of the present application include but are not limited to SIM cards, eSIM cards, universal subscriber identity modules (USIM), universal integrated telephone cards (universal integrated circuit cards, UICC) and the like.
电子设备100的软件系统可以采用分层架构,事件驱动架构,微核架构,微服务架构,或云架构。本申请实施例以分层架构的Android系统为例,示例性说明电子设备100的软件结构。The software system of the electronic device 100 may adopt a layered architecture, an event-driven architecture, a microkernel architecture, a microservice architecture, or a cloud architecture. The embodiments of the present application take an Android system with a layered architecture as an example to exemplarily describe the software structure of the electronic device 100 .
图2是本申请实施例的电子设备100的软件结构框图。分层架构将软件分成若干个层,每一层都有清晰的角色和分工。层与层之间通过软件接口通信。在一些实施例中,将Android系统分为四层,从上至下分别为应用程序层,应用程序框架层,安卓运行时(Android runtime)和系统库,以及内核层。应用程序层可以包括一系列应用程序包。FIG. 2 is a block diagram of the software structure of the electronic device 100 according to the embodiment of the present application. The layered architecture divides the software into several layers, and each layer has a clear role and division of labor. Layers communicate with each other through software interfaces. In some embodiments, the Android system is divided into four layers, which are, from top to bottom, an application layer, an application framework layer, an Android runtime (Android runtime) and a system library, and a kernel layer. The application layer can include a series of application packages.
如图2所示,应用程序包可以包括相机,图库,日历,通话,地图,导航,WLAN,蓝牙,音乐,视频,短信息等应用程序。As shown in Figure 2, the application package can include applications such as camera, gallery, calendar, call, map, navigation, WLAN, Bluetooth, music, video, short message and so on.
应用程序框架层为应用程序层的应用程序提供应用编程接口(application programming interface,API)和编程框架。应用程序框架层包括一些预先定义的函数。The application framework layer provides an application programming interface (application programming interface, API) and a programming framework for applications in the application layer. The application framework layer includes some predefined functions.
如图2所示,应用程序框架层可以包括窗口管理器,内容提供器,视图系统,电话管理器,资源管理器,通知管理器等。As shown in Figure 2, the application framework layer may include window managers, content providers, view systems, telephony managers, resource managers, notification managers, and the like.
窗口管理器用于管理窗口程序。窗口管理器可以获取显示屏大小,判断是否有状态栏,锁定屏幕,截取屏幕等。A window manager is used to manage window programs. The window manager can get the size of the display screen, determine whether there is a status bar, lock the screen, take screenshots, etc.
内容提供器用来存放和获取数据,并使这些数据可以被应用程序访问。所述数据可以包括视频,图像,音频,拨打和接听的电话,浏览历史和书签,电话簿等。Content providers are used to store and retrieve data and make these data accessible to applications. The data may include video, images, audio, calls made and received, browsing history and bookmarks, phone book, etc.
视图系统包括可视控件,例如显示文字的控件,显示图片的控件等。视图系统可用于构建应用程序。显示界面可以由一个或多个视图组成的。例如,包括短信通知图标的显示界面,可以包括显示文字的视图以及显示图片的视图。The view system includes visual controls, such as controls for displaying text, controls for displaying pictures, and so on. View systems can be used to build applications. A display interface can consist of one or more views. For example, the display interface including the short message notification icon may include a view for displaying text and a view for displaying pictures.
电话管理器用于提供电子设备100的通信功能。例如通话状态的管理(包括接通,挂断等)。The phone manager is used to provide the communication function of the electronic device 100 . For example, the management of call status (including connecting, hanging up, etc.).
资源管理器为应用程序提供各种资源,比如本地化字符串,图标,图片,布局文件,视频文件等等。The resource manager provides various resources for the application, such as localization strings, icons, pictures, layout files, video files and so on.
通知管理器使应用程序可以在状态栏中显示通知信息,可以用于传达告知类型的消息,可以短暂停留后自动消失,无需用户交互。比如通知管理器被用于告知下载完成,消息提醒等。通知管理器还可以是以图表或者滚动条文本形式出现在系统顶部状态栏的通知,例如后台运行的应用程序的通知,还可以是以对话窗口形式出现在屏幕上的通知。例如在状态栏提示文本信息,发出提示音,电子设备振动,指示灯闪烁等。The notification manager enables applications to display notification information in the status bar, which can be used to convey notification-type messages, and can disappear automatically after a brief pause without user interaction. For example, the notification manager is used to notify download completion, message reminders, etc. The notification manager can also display notifications in the status bar at the top of the system in the form of graphs or scroll bar text, such as notifications of applications running in the background, and notifications on the screen in the form of dialog windows. For example, text information is prompted in the status bar, a prompt sound is issued, the electronic device vibrates, and the indicator light flashes.
Android Runtime包括核心库和虚拟机。Android runtime负责安卓系统的调度和管理。Android Runtime includes core libraries and a virtual machine. Android runtime is responsible for scheduling and management of the Android system.
核心库包含两部分:一部分是java语言需要调用的功能函数,另一部分是安卓的核心库。The core library consists of two parts: one is the function functions that the java language needs to call, and the other is the core library of Android.
应用程序层和应用程序框架层运行在虚拟机中。虚拟机将应用程序层和应用程序框架层的java文件执行为二进制文件。虚拟机用于执行对象生命周期的管理,堆栈管理,线程管理,安全和异常的管理,以及垃圾回收等功能。The application layer and the application framework layer run in virtual machines. The virtual machine executes the java files of the application layer and the application framework layer as binary files. The virtual machine is used to perform functions such as object lifecycle management, stack management, thread management, safety and exception management, and garbage collection.
系统库可以包括多个功能模块。例如:表面管理器(surface manager),媒体库(media libraries),三维图形处理库(例如:OpenGL ES),2D图形引擎(例如:SGL)等。A system library can include multiple functional modules. For example: surface manager (surface manager), media library (media library), 3D graphics processing library (eg: OpenGL ES), 2D graphics engine (eg: SGL), etc.
表面管理器用于对显示子系统进行管理,并且为多个应用程序提供了2D和3D图层的融合。The Surface Manager is used to manage the display subsystem and provides a fusion of 2D and 3D layers for multiple applications.
媒体库支持多种常用的音频,视频格式回放和录制,以及静态图像文件等。媒体库可以支持多种音视频编码格式,例如:MPEG4,H.264,MP3,AAC,AMR,JPG,PNG等。The media library supports playback and recording of a variety of commonly used audio and video formats, as well as still image files. The media library can support a variety of audio and video encoding formats, such as: MPEG4, H.264, MP3, AAC, AMR, JPG, PNG, etc.
三维图形处理库用于实现三维图形绘图,图像渲染,合成,和图层处理等。The 3D graphics processing library is used to implement 3D graphics drawing, image rendering, compositing, and layer processing.
2D图形引擎是2D绘图的绘图引擎。2D graphics engine is a drawing engine for 2D drawing.
内核层是硬件和软件之间的层。内核层至少包含显示驱动,摄像头驱动,音频驱动,传感器驱动。The kernel layer is the layer between hardware and software. The kernel layer contains at least display drivers, camera drivers, audio drivers, and sensor drivers.
应理解,本申请实施例中的技术方案可以用于Android、IOS、鸿蒙等系统中。It should be understood that the technical solutions in the embodiments of the present application can be used in systems such as Android, IOS, and Hongmeng.
图3是本申请实施例提供的一组图形用户界面(graphical user interface,GUI)。FIG. 3 is a set of graphical user interfaces (graphical user interface, GUI) provided by the embodiments of the present application.
参见图3中的(a)所示的GUI,该GUI为手机的桌面。其中,手机的桌面上包括多个应用程序的图标,该多个应用程序包括时钟、日历、图库、备忘录、文件管理、电子邮件、音乐和计算器等等。当手机检测到用户点击备忘录的图标的操作时,可以显示如图3中的(b)所示的GUI。Referring to the GUI shown in (a) of FIG. 3 , the GUI is the desktop of the mobile phone. Among them, the desktop of the mobile phone includes icons of multiple applications, the multiple applications include clock, calendar, gallery, memo, file management, e-mail, music, calculator and so on. When the mobile phone detects that the user clicks the icon of the memo, the GUI as shown in (b) of FIG. 3 may be displayed.
参见图3中的(b)所示的GUI,该GUI为备忘录应用的显示界面。该显示界面上显示有一条9月1日记录的笔记,该笔记的内容为“妈妈生日(9月28日)”。该显示界面上还包括创建新的笔记的控件301。当手机检测到用户点击控件301的操作时,手机可以显示如图3中的(c)所示的GUI。Referring to the GUI shown in (b) of FIG. 3 , the GUI is the display interface of the memo application. A note recorded on September 1 is displayed on the display interface, and the content of the note is "Mom's birthday (September 28)". The display interface also includes a control 301 for creating a new note. When the mobile phone detects that the user clicks on the control 301, the mobile phone can display a GUI as shown in (c) of FIG. 3 .
参见图3中的(c)所示的GUI,该GUI为备忘录应用的另一显示界面。用户可以在该界面上编辑笔记的标题以及正文内容。该显示界面上还包括清单、文本编辑、插入图片、录音转文字以及手写控件。Referring to the GUI shown in (c) of FIG. 3 , the GUI is another display interface of the memo application. Users can edit the title and body content of the note on this interface. The display interface also includes controls for list, text editing, inserting pictures, audio-to-text, and handwriting controls.
参见图3中的(d)所示,当手机检测到用户点击录音转文字控件302的操作时,手机可以向笔记本电脑发送请求信息,该请求信息用于请求笔记本电脑对手机获取的语音内容对应为文本内容进行编辑。响应于接收到该请求信息,笔记本电脑可以显示提醒框303,其中提醒框303中包括提示信息“检测到手机正在进行录音转文字,是否在笔记本电脑上进行文字编辑”。Referring to (d) in FIG. 3 , when the mobile phone detects that the user clicks the operation of the recording-to-text control 302, the mobile phone can send request information to the notebook computer, and the request information is used to request the notebook computer to correspond to the voice content acquired by the mobile phone. Edit for text content. In response to receiving the request information, the laptop computer can display a reminder box 303, wherein the reminder box 303 includes the prompt information "it is detected that the mobile phone is recording to text, whether to perform text editing on the laptop computer".
参见图3中的(e)所示,笔记本电脑响应于用户点击控件304的操作,可以启动App1。其中App1是可以进行文本编辑的应用程序。手机响应于用户点击控件302的操作,可以对环境中的语音内容进行检测,并将检测到的语音内容转化为文本内容。在手机将语音内容转化为文本内容后,手机还可以向笔记本电脑发送转化的文本内容。从而使得手机的备忘录应用的显示界面和笔记本电脑的App1显示界面上同时显示该文本内容“热度再变…社会心态”。Referring to (e) of FIG. 3 , the notebook computer can start App1 in response to the user's operation of clicking on the control 304 . Where App1 is an application that can do text editing. In response to the user's operation of clicking on the control 302, the mobile phone can detect the voice content in the environment, and convert the detected voice content into text content. After the mobile phone converts the voice content into text content, the mobile phone can also send the converted text content to the laptop. Thereby, the display interface of the memo application of the mobile phone and the display interface of App1 of the notebook computer are simultaneously displayed with the text content "the popularity changes again... social mentality".
一个实施例中,手机中可以包括麦克风和语音识别(automatic speech recognition,ASR)模块,其中,麦克风用于采集环境中的语音内容,ASR模块的作用是将接收到的语音内容转化为文本内容。In one embodiment, the mobile phone may include a microphone and a speech recognition (automatic speech recognition, ASR) module, wherein the microphone is used to collect speech content in the environment, and the role of the ASR module is to convert the received speech content into text content.
一个实施例中,笔记本电脑的App1显示界面上和手机的备忘录应用显示界面上的文本内容可以是实时同步的。示例性的,手机可以按照一定的时间间隔(例如,5秒)将语音内容转化为文本内容。在0-5秒中手机将采集到的语音内容转化为文本内容“热度再变与不变中体现民生诉求”,从而手机可以在备忘录的显示界面上显示该文本内容,同时手机可以将该文本内容发送给笔记本电脑,从而笔记本电脑可以在App1的显示界面上显示该文本内容;在5-10秒中手机将采集到的语音内容转化为文本内容“光志智库我们刚刚统计得到2020年全国两会前夕”,从而手机可以在备忘录的显示界面上显示该文本内容,同时手机可以将该文本内容发送给笔记本电脑,从而笔记本电脑可以在App1的显示界面上显示该文本内容。In one embodiment, the text content on the display interface of App1 of the notebook computer and the display interface of the memo application of the mobile phone may be synchronized in real time. Exemplarily, the mobile phone can convert the voice content into text content at a certain time interval (for example, 5 seconds). In 0-5 seconds, the mobile phone converts the collected voice content into text content, "reflecting the demands of people's livelihood while the popularity changes and remains unchanged", so that the mobile phone can display the text content on the display interface of the memo, and the mobile phone can display the text content at the same time. The content is sent to the laptop, so that the laptop can display the text content on the display interface of App1; in 5-10 seconds, the mobile phone converts the collected voice content into text content. "Guangzhi Think Tank, we have just counted the eve of the 2020 National Two Sessions. ”, so that the mobile phone can display the text content on the display interface of the memo, and at the same time, the mobile phone can send the text content to the notebook computer, so that the notebook computer can display the text content on the display interface of App1.
一个实施例中,笔记本电脑在开启App1后可以显示光标306,其中笔记本电脑在接收到手机发送的文本内容后,可以将光标306显示在文本内容的最后面。In one embodiment, the laptop computer can display the cursor 306 after opening App1, wherein after the laptop computer receives the text content sent by the mobile phone, the laptop computer can display the cursor 306 at the back of the text content.
参见图3中的(f)所示的GUI,手机可以继续检测语音内容并将该语音内容转化为 文本内容。手机可以将文本内容实时同步到笔记本电脑的App1中。同时,笔记本电脑响应于用户的编辑操作,可以对App1中的文本内容进行编辑。例如,用户可以在手机发送的文本内容中增加内容(在“热度”前增加“1、”,在“光志智库”前增加“2、”)。又例如,用户可以修改文本内容不准确的内容(例如,将“名义”修改为“民意”,将“突破时”修改为“top10”)。又例如,用户还可以对文本内容的格式进行修改(例如,笔记本电脑在“名声诉求”后检测到用户回车的操作)。Referring to the GUI shown in (f) of Fig. 3, the mobile phone can continue to detect the voice content and convert the voice content into text content. The mobile phone can synchronize the text content to the App1 of the laptop in real time. At the same time, the notebook computer can edit the text content in App1 in response to the user's editing operation. For example, users can add content to the text content sent by the mobile phone (add "1," before "Hotness", and "2," before "Guangzhi Think Tank"). For another example, the user can modify the inaccurate content of the text (for example, "name" is modified to "public opinion", and "breakthrough" is modified to "top10"). For another example, the user can also modify the format of the text content (for example, the laptop detects the user's carriage return operation after the "reputation appeal").
如图3中的(f),响应于笔记本电脑检测到用户点击保存控件307的操作时,笔记本电脑可以将编辑后的文本内容发送给手机,从而使得手机在备忘录的显示界面上显示笔记本电脑上编辑后的文本内容。As shown in (f) in FIG. 3 , in response to the notebook computer detecting that the user clicks the save control 307, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone displays the notebook computer on the display interface of the memo. Edited text content.
一个实施例中,手机在检测到环境中的语音内容后将该语音内容转化为文本内容,并向笔记本电脑发送该文本内容,而手机可以不显示文本内容。当手机从笔记本电脑接收到编辑后的文本内容后,手机可以在备忘录应用的显示界面上显示该编辑后的文本内容。In one embodiment, the mobile phone converts the voice content into text content after detecting the voice content in the environment, and sends the text content to the laptop computer, while the mobile phone may not display the text content. After the mobile phone receives the edited text content from the laptop computer, the mobile phone can display the edited text content on the display interface of the memo application.
一个实施例中,笔记本电脑在检测到用户将光标移动到需要进行编辑的位置后,可以对该位置附近的文本内容进行编辑。例如,当前光标306在“社会心态”后,当笔记本电脑检测到用户在“社会心态”后增加了符号“?”后,光标306可以移动至“社会心态?”后面。同时,在笔记本电脑接收到手机发送的其他文本内容后,可以继续将接收到的文本内容显示出来。In one embodiment, after detecting that the user moves the cursor to the position where editing is required, the notebook computer can edit the text content near the position. For example, when the current cursor 306 is behind "Social Mindset", when the laptop detects that the user has added a symbol "?" after "Social Mindset", the cursor 306 can move to the back of "Social Mindset?". At the same time, after the notebook computer receives other text content sent by the mobile phone, it can continue to display the received text content.
参见图3中的(g)所示,当笔记本电脑继续检测到用户对文本内容编辑操作时,可以对从手机接收到的文本内容进行编辑。示例性的,当笔记本电脑可以检测到用户将“正成功”修改为“郑成功”的操作;又例如,笔记本电脑可以检测到用户在“年度特征”后增加符号“。”的操作;又例如,笔记本电脑可以检测到用户在“医疗教育住房食品安全”中插入符号“、”的操作。又例如,笔记本电脑可以检测到用户在“社会心态?”后回车的操作。Referring to (g) in FIG. 3 , when the notebook computer continues to detect the user's editing operation on the text content, it can edit the text content received from the mobile phone. Exemplarily, when the notebook computer can detect the user's operation of changing "positive success" to "Zheng Chenggong"; for another example, the notebook computer can detect the user's operation of adding a symbol "." after "annual feature"; another example, The laptop can detect the user inserting the symbol "," in "Medical Education Housing Food Safety". As another example, a laptop can detect the user's press enter after "Social Mindset?".
如图3中的(g)所示,当笔记本电脑检测到用户点击保存控件307的操作时,笔记本电脑可以将编辑后的文本内容发送给手机,从而使得手机在备忘录的显示界面上显示笔记本电脑上编辑后的文本内容。As shown in (g) of FIG. 3 , when the notebook computer detects that the user clicks the save control 307, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone displays the notebook computer on the display interface of the memo Edited text content.
参见图3中的(h)所示的GUI,该GUI为备忘录应用的另一显示界面。当手机检测到用户点击停止录音的控件308的操作时,手机可以显示如图3中的(i)所示的GUI。Referring to the GUI shown in (h) of FIG. 3 , the GUI is another display interface of the memo application. When the mobile phone detects that the user clicks the control 308 to stop recording, the mobile phone can display a GUI as shown in (i) in FIG. 3 .
参见图3中的(i)所示的GUI,该GUI为备忘录应用的另一显示界面。用户可以继续在手机上对笔记本电脑编辑后的文本内容进行编辑(例如,用户可以在文本内容中插入图片)。当手机检测到用户点击控件309的操作时,手机可以对当前界面上显示的文本内容进行保存并显示如图3中的(j)所示的GUI。Referring to the GUI shown in (i) of FIG. 3 , the GUI is another display interface of the memo application. The user can continue to edit the text content edited by the laptop on the mobile phone (for example, the user can insert a picture into the text content). When the mobile phone detects that the user clicks on the control 309, the mobile phone can save the text content displayed on the current interface and display the GUI shown in (j) in FIG. 3 .
参见图3中的(j)所示的GUI,该GUI为备忘录应用的另一显示界面。响应于检测到用户点击控件309的操作,手机可以将该笔记的标题更新为“1、热点”。当手机检测到用户点击控件310的操作时,手机可以显示如图3中的(k)所示的GUI.Referring to the GUI shown in (j) of FIG. 3 , the GUI is another display interface of the memo application. In response to detecting that the user clicks on the control 309, the mobile phone can update the title of the note to "1, hotspot". When the mobile phone detects that the user clicks on the control 310, the mobile phone can display the GUI as shown in (k) in Figure 3.
参见图3中的(k)所示的GUI,该GUI为备忘录应用的另一显示界面。手机上可以显示当前备忘录应用中包括2条笔记,最新增加的笔记内容可以显示在最上方。Referring to the GUI shown in (k) of FIG. 3 , the GUI is another display interface of the memo application. The mobile phone can display 2 notes included in the current memo application, and the newly added note content can be displayed at the top.
一个实施例中,若手机开始进行语音转文字时,笔记本电脑没有检测到用户点击控件304,那么手机可以开始进行录音转文字,而笔记本电脑可以继续显示提示框301。在手 机进行录音转文字的过程中,若笔记本电脑检测到用户点击控件304的操作,那么笔记本电脑可以向手机发送响应,该响应用于指示笔记本电脑可以进行文本编辑。手机在接收到该响应后,可以将接收到该响应之前语音转文字的文本内容1都发送给笔记本电脑,笔记本电脑可以打开App1并显示文本内容1。之后如果手机继续检测到用户输入的语音内容,那么手机可以在继续向笔记本电脑发送用户输入的语音内容对应的文本内容2。笔记本电脑可以将手机发送的文本内容2追加在文本内容1后面。In one embodiment, if the laptop computer does not detect that the user clicks on the control 304 when the mobile phone starts to convert speech to text, the mobile phone can start to convert audio to text, and the laptop computer can continue to display the prompt box 301 . During the process of the mobile phone converting audio to text, if the notebook computer detects that the user clicks on the control 304, the notebook computer can send a response to the mobile phone, where the response is used to indicate that the notebook computer can perform text editing. After receiving the response, the mobile phone can send all the text content 1 that is voice-to-text before receiving the response to the laptop, and the laptop can open App1 and display the text content 1. Afterwards, if the mobile phone continues to detect the voice content input by the user, the mobile phone can continue to send the text content 2 corresponding to the voice content input by the user to the notebook computer. The notebook computer can append the text content 2 sent by the mobile phone to the back of the text content 1.
本申请实施例中,手机在进行录音转文字操作时,手机可以通知笔记本电脑正在进行录音转文字功能,从而可以使得笔记本电脑提示用户是否需要在笔记本电脑上进行文本内容的编辑。当用户选择在笔记本电脑上进行编辑时,笔记本电脑可以实时显示手机发送的文本内容,从而方便用户对文本内容进行编辑,有助于提升用户的体验。In the embodiment of the present application, when the mobile phone performs the recording-to-text operation, the mobile phone can notify the notebook computer that the recording-to-text function is being performed, so that the notebook computer can prompt the user whether to edit the text content on the notebook computer. When the user chooses to edit on the laptop computer, the laptop computer can display the text content sent by the mobile phone in real time, thereby facilitating the user to edit the text content and helping to improve the user experience.
图4是本申请实施例提供的另一组GUI。FIG. 4 is another set of GUIs provided by the embodiments of the present application.
参见图4中的(a)所示,当手机检测到用户点击录音转文字控件的操作时,手机可以显示如图4中的(b)所示的界面。此时手机可以向笔记本电脑发送请求信息,该请求信息用于请求笔记本电脑对手机获取的语音内容对应的文本内容进行编辑。Referring to (a) in FIG. 4 , when the mobile phone detects that the user clicks the recording-to-text control, the mobile phone may display the interface shown in (b) in FIG. 4 . In this case, the mobile phone may send request information to the notebook computer, where the request information is used to request the notebook computer to edit the text content corresponding to the voice content obtained by the mobile phone.
参见图4中的(b)所示,响应于接收到手机发送的请求信息,笔记本电脑可以启动App1。响应于手机接收到笔记本电脑对该请求信息的响应,手机可以显示提醒框401,其中,提醒框401中包括提示信息“检测到周围有笔记本电脑,是否将文本内容发送至笔记本电脑上编辑”。当手机检测到用户点击控件402的操作时,手机开始检测环境中的语音内容。Referring to (b) in FIG. 4 , in response to receiving the request information sent by the mobile phone, the notebook computer can start App1. In response to the mobile phone receiving the laptop computer's response to the request information, the mobile phone may display a reminder box 401, wherein the reminder box 401 includes prompt information "detecting the surrounding laptop computer, whether to send the text content to the laptop computer for editing". When the mobile phone detects that the user clicks the control 402, the mobile phone starts to detect the voice content in the environment.
参见图4中的(c)所示,手机在检测到环境中的语音内容后,可以开始进行录音转文字。手机可以采集语音内容并将该语音内容转化文本内容“热度再变…社会心态”,同时手机还可以向笔记本电脑发送转化的文本内容,从而使得笔记本电脑通过App1显示该文本内容。Referring to (c) in FIG. 4 , after detecting the voice content in the environment, the mobile phone can start to convert the recording into text. The mobile phone can collect the voice content and convert the voice content into text content to "change the popularity...social mentality", and the mobile phone can also send the converted text content to the laptop, so that the laptop can display the text content through App1.
一个实施例中,当笔记本电脑检测到用户在笔记本电脑上对该文本内容进行编辑的操作以及检测到用户点击保存的操作时,笔记本电脑可以将编辑后的文本内容发送给手机,从而使得手机在备忘录的显示界面上显示编辑后的文本内容。具体过程可以参考图3中的(f)至(g)的过程,为了简洁,在此不再赘述。In one embodiment, when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo. For specific processes, reference may be made to processes (f) to (g) in FIG. 3 , which are not repeated here for brevity.
本申请实施例中,手机在进行录音转文字操作时,如果确定周围有方便用户进行文本编辑的设备(例如,笔记本电脑),那么手机可以提示用户是否在笔记本电脑上进行文本编辑。当用户选择在笔记本电脑上进行编辑时,笔记本电脑可以实时显示手机发送的文本内容,从而方便用户对文本内容进行编辑,有助于提升用户的体验。In the embodiment of the present application, when the mobile phone performs the recording-to-text operation, if it is determined that there is a device (eg, a laptop) that is convenient for the user to edit text, the mobile phone can prompt the user whether to perform text editing on the laptop. When the user chooses to edit on the laptop computer, the laptop computer can display the text content sent by the mobile phone in real time, thereby facilitating the user to edit the text content and helping to improve the user experience.
图5是本申请实施例提供的另一组GUI。FIG. 5 is another set of GUIs provided by the embodiments of the present application.
参见图5中的(a)所示的GUI,该GUI为手机的桌面。其中,手机的桌面上包括多个应用程序的图标,该多个应用程序包括录音机应用等等。当手机检测到用户点击录音机应用的图标的操作时,可以显示如图5中的(b)所示的GUI。Referring to the GUI shown in (a) of FIG. 5 , the GUI is the desktop of the mobile phone. Wherein, the desktop of the mobile phone includes icons of multiple applications, and the multiple applications include a voice recorder application and the like. When the mobile phone detects that the user clicks on the icon of the audio recorder application, the GUI as shown in (b) of FIG. 5 can be displayed.
参见图5中的(b)所示的GUI,该GUI为录音机应用的显示界面。该显示界面上显示有一条10月3日记录的录音文件,该录音文件的命名为“20201003_100832”。该显示界面上还包括创建新的录音文件的控件501。Referring to the GUI shown in (b) of FIG. 5 , the GUI is the display interface of the recorder application. A recording file recorded on October 3 is displayed on the display interface, and the recording file is named "20201003_100832". The display interface also includes a control 501 for creating a new recording file.
参见图5中的(c)所示,当手机检测到用户点击控件501的操作时,手机可以向笔 记本电脑发送指示信息,该指示信息用于指示手机正在进行录音。响应于接收到该指示信息,笔记本电脑可以显示提醒框502,其中该提醒框502包括提示信息“检测到手机正在进行录音,是否在笔记本电脑上进行录音转文字?”。Referring to (c) in Figure 5, when the mobile phone detects that the user clicks on the control 501, the mobile phone can send instruction information to the notebook computer, where the instruction information is used to indicate that the mobile phone is recording. In response to receiving the instruction information, the laptop computer can display a reminder box 502, wherein the reminder box 502 includes a prompt message "It is detected that the mobile phone is recording, do you want to convert the recording to text on the laptop computer?".
参见图5中的(d)所示,响应于笔记本电脑检测到用户点击控件503的操作,笔记本电脑可以启动App1。响应于手机检测到用户点击控件501的操作,手机可以显示录音界面,其中,该录音界面上显示当前已经录制了25秒31,且手机预计还能录制10小时以上。Referring to (d) of FIG. 5 , in response to the notebook computer detecting that the user clicks on the control 503, the notebook computer can launch App1. In response to the mobile phone detecting that the user clicks on the control 501, the mobile phone can display a recording interface, wherein the recording interface shows that 25 seconds have been recorded and the mobile phone is expected to record more than 10 hours.
手机在采集语音内容的同时还可以将该语音内容发送给笔记本电脑,笔记本电脑可以在接收到语音内容后,将该语音内容转化为文本内容“热度再变…社会心态”,从而笔记本电脑可以在App1中显示该文本内容。When the mobile phone collects the voice content, it can also send the voice content to the laptop computer. After the laptop computer receives the voice content, it can convert the voice content into text content. The text content is displayed in App1.
一个实施例中,笔记本电脑中可以包括ASR模块,其中,ASR模块的作用是将接收到的语音内容转化为文本内容。In one embodiment, an ASR module may be included in the notebook computer, wherein the function of the ASR module is to convert the received voice content into text content.
参见图5中的(e)所示,笔记本电脑响应于用户的编辑操作,可以对App1中的文本内容进行编辑。例如,用户可以在手机发送的文本内容中增加内容(在“热度”前增加“1、”,在“光志智库”前增加“2、”)。又例如,用户可以修改文本内容不准确的内容(例如,将“名义”修改为“民意”,将“突破时”修改为“top10”)。又例如,用户还可以对文本内容的格式进行修改(例如,笔记本电脑在“名声诉求”后检测到用户回车的操作)。As shown in (e) of FIG. 5 , the notebook computer can edit the text content in App1 in response to the user's editing operation. For example, users can add content to the text content sent by the mobile phone (add "1," before "Hotness", and "2," before "Guangzhi Think Tank"). For another example, the user can modify the inaccurate content of the text (for example, "name" is modified to "public opinion", and "breakthrough" is modified to "top10"). For another example, the user can also modify the format of the text content (for example, the laptop detects the user's carriage return operation after the "reputation appeal").
一个实施例中,笔记本电脑检测到用户点击保存控件的操作时,可以向手机发送编辑后的文本内容,从而手机可以将编辑后的文本内容保存在某个应用层序(例如,备忘录)中。In one embodiment, when the notebook computer detects that the user clicks the save control, it can send the edited text content to the mobile phone, so that the mobile phone can save the edited text content in a certain application layer sequence (eg, memo).
本申请实施例中,当手机检测到用户开始录音时,可以向周围的笔记本电脑发送指示信息,从而使得笔记本电脑提示用户是否需要在笔记本电脑上进行录音转文字。这样可以方便用户在笔记本电脑上对手机采集的语音内容进行转文字内容,同时利用了笔记本电脑编辑的便捷性,有助于提升用户的体验。In the embodiment of the present application, when the mobile phone detects that the user starts recording, the mobile phone can send instruction information to the surrounding notebook computers, so that the notebook computer prompts the user whether the recording needs to be converted into text on the notebook computer. In this way, it is convenient for the user to convert the voice content collected by the mobile phone to the text content on the laptop computer, and at the same time, the convenience of editing by the laptop computer is utilized, which helps to improve the user experience.
图6是本申请实施例提供的另一组GUI。FIG. 6 is another set of GUIs provided by the embodiments of the present application.
参见图6中的(a)所示,手机显示来电显示界面,该来电显示界面显示来自于广东深圳的手机号码“XXX-XXXXXXXX”正在进行呼叫。响应于手机接收到该来电,手机向周围的笔记本电脑发送指示信息,该指示信息用于指示手机检测到用户的来电。响应于接收到该指示信息,笔记本电脑可以显示提醒框601,其中提醒框601中包括提示信息“手机接收到来电,是否在笔记本电脑上进行通话内容转文字?”。Referring to (a) of FIG. 6 , the mobile phone displays a caller ID interface, and the caller ID interface shows that the mobile phone number "XXX-XXXXXXXX" from Shenzhen, Guangdong is making a call. In response to the mobile phone receiving the incoming call, the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect the user's incoming call. In response to receiving the instruction information, the laptop computer can display a reminder box 601, wherein the reminder box 601 includes the prompt information "The mobile phone has received an incoming call, do you want to convert the content of the call to text on the laptop computer?".
参见图6中的(b)所示,响应于笔记本电脑检测到用户点击控件602的操作,笔记本电脑可以开启App1并向手机发送响应,该响应用于指示笔记本电脑可以作为文本内容编辑的设备。响应于手机检测到用户接受该来电的操作,手机可以对端设备建立通话连接。手机可以接收对端设备发送的另一用户的语音内容。响应于接收到另一用户的语音内容,手机可以将该语音内容发送给笔记本电脑。Referring to (b) in FIG. 6 , in response to the notebook computer detecting that the user clicks on the control 602, the notebook computer can open App1 and send a response to the mobile phone, which is used to indicate that the notebook computer can be used as a text content editing device. In response to the mobile phone detecting the operation of the user accepting the incoming call, the mobile phone can establish a call connection to the terminal device. The mobile phone can receive the voice content of another user sent by the peer device. In response to receiving another user's voice content, the cell phone can send the voice content to the laptop.
一个实施例中,手机也可以是在检测到用户接受该来电的操作时,向笔记本电脑发送指示信息,该指示信息用于指示手机正在进行通话,请求笔记本电脑对通话内容对应的文本内容进行编辑。笔记本电脑在接收到该指示信息后,可以提示用户“手机正在通话,是否在笔记本电脑上进行通话内容转文字?”。当笔记本电脑检测到用户确定使用笔记本电 脑进行通话内容转文字的操作时,笔记本电脑可以开启App1,并从手机接收另一用户的语音内容。从而笔记本电脑可以将该语音内容转化为文本内容。In one embodiment, the mobile phone may also send indication information to the notebook computer when detecting the operation of the user accepting the incoming call, the indication information is used to indicate that the mobile phone is on a call, and request the notebook computer to edit the text content corresponding to the call content. . After receiving the instruction information, the laptop can prompt the user "The mobile phone is on a call, do you want to convert the content of the call to text on the laptop?". When the notebook computer detects that the user determines to use the notebook computer to convert the call content to text, the notebook computer can open App1 and receive the voice content of another user from the mobile phone. Thus, the notebook computer can convert the voice content into text content.
一个实施例中,手机在接收到该响应后,可以将获取到的另一个用户的语音内容转化为文本内容,从而手机可以向笔记本电脑发送该文本内容。In one embodiment, after receiving the response, the mobile phone can convert the acquired voice content of another user into text content, so that the mobile phone can send the text content to the laptop.
参见图6中的(c)所示,响应于从手机接收到该语音内容,笔记本电脑可以将该语音内容转化为文字内容“嗨,明天上午有个会议…请按时参加”,从而通过App1显示该文本内容。Referring to (c) in FIG. 6 , in response to receiving the voice content from the mobile phone, the notebook computer can convert the voice content into text content "Hi, there is a meeting tomorrow morning...please attend on time", thereby displaying through App1 the text content.
一个实施例中,当笔记本电脑检测到用户在笔记本电脑上对该文本内容进行编辑的操作以及检测到用户点击保存的操作时,笔记本电脑可以将编辑后的文本内容发送给手机,从而使得手机在备忘录的显示界面上显示编辑后的文本内容。具体过程可以参考图3中的(f)至(g)的过程,为了简洁,在此不再赘述。In one embodiment, when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo. For specific processes, reference may be made to processes (f) to (g) in FIG. 3 , which are not repeated here for brevity.
本申请实施例中,手机在检测到来电时,如果确定周围有方便用户进行文本编辑的设备(例如,笔记本电脑),那么手机可以向笔记本电脑指示手机检测到来电。当用户选择在笔记本电脑上进行编辑时,笔记本电脑可以实时将从手机获得的语音内容转化为文字内容并显示给用户,从而方便用户对文本内容进行编辑,有助于提升用户的体验。In this embodiment of the present application, when the mobile phone detects an incoming call, if it is determined that there is a device (for example, a laptop computer) that is convenient for the user to edit text nearby, the mobile phone can indicate to the laptop computer that the mobile phone has detected an incoming call. When the user chooses to edit on the laptop, the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
图7是本申请实施例提供的另一组GUI。FIG. 7 is another set of GUIs provided by the embodiments of the present application.
参见图7中的(a)所示,手机显示某个社交应用的视频通话界面,该视频通话界面显示另一用户正在邀请用户进行视频通话。响应于手机接收到该视频通话,手机向周围的笔记本电脑发送指示信息,该指示信息用于指示手机检测到另一用户的视频通话请求。响应于接收到该指示信息,笔记本电脑可以显示提醒框701,其中提醒框701中包括提示信息“手机接收到视频通话请求,是否在笔记本电脑上进行通话内容转文字?”。Referring to (a) in FIG. 7 , the mobile phone displays a video call interface of a social application, and the video call interface shows that another user is inviting the user to make a video call. In response to the mobile phone receiving the video call, the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect another user's video call request. In response to receiving the instruction information, the laptop computer may display a prompt box 701, wherein the prompt box 701 includes prompt information "The mobile phone has received a video call request, whether to convert the call content to text on the laptop computer?".
参见图7中的(b)所示,响应于笔记本电脑检测到用户点击控件702的操作,笔记本电脑可以开启App1并向手机发送响应,该响应用于指示笔记本电脑可以作为文本内容编辑的设备。响应于手机检测到用户接受该视频通话请求的操作,手机可以对端设备建立视频通话连接。手机可以接收对端设备发送的另一用户的语音内容。响应于接收到另一用户的语音内容,手机可以将该语音内容发送给笔记本电脑。Referring to (b) of FIG. 7 , in response to the notebook computer detecting that the user clicks on the control 702, the notebook computer can open App1 and send a response to the mobile phone, which is used to indicate that the notebook computer can be used as a text content editing device. In response to the mobile phone detecting the user's operation of accepting the video call request, the mobile phone can establish a video call connection to the terminal device. The mobile phone can receive the voice content of another user sent by the peer device. In response to receiving another user's voice content, the cell phone can send the voice content to the laptop.
一个实施例中,手机在接收到该响应后,可以将获取到的另一个用户的语音内容转化为文本内容,从而手机可以向笔记本电脑发送该文本内容。In one embodiment, after receiving the response, the mobile phone can convert the acquired voice content of another user into text content, so that the mobile phone can send the text content to the laptop.
参见图7中的(c)所示,响应于从手机接收到该语音内容,笔记本电脑可以将该语音内容转化为文字内容“嗨,明天上午有个会议…请按时参加”,从而通过App1显示该文本内容。Referring to (c) of FIG. 7 , in response to receiving the voice content from the mobile phone, the notebook computer can convert the voice content into text content "Hi, there is a meeting tomorrow morning...please attend on time", thereby displaying through App1 the text content.
一个实施例中,当笔记本电脑检测到用户在笔记本电脑上对该文本内容进行编辑的操作以及检测到用户点击保存的操作时,笔记本电脑可以将编辑后的文本内容发送给手机,从而使得手机在备忘录的显示界面上显示编辑后的文本内容。具体过程可以参考图3中的(f)至(g)的过程,为了简洁,在此不再赘述。In one embodiment, when the notebook computer detects that the user edits the text content on the notebook computer and detects that the user clicks the save operation, the notebook computer can send the edited text content to the mobile phone, so that the mobile phone can The edited text content is displayed on the display interface of the memo. For specific processes, reference may be made to processes (f) to (g) in FIG. 3 , which are not repeated here for brevity.
本申请实施例中,手机在检测到视频通话时,如果确定周围有方便用户进行文本编辑的设备(例如,笔记本电脑),那么手机可以向笔记本电脑指示手机检测到来单。当用户选择在笔记本电脑上进行编辑时,笔记本电脑可以实时将从手机获得的语音内容转化为文字内容并显示给用户,从而方便用户对文本内容进行编辑,有助于提升用户的体验。In this embodiment of the present application, when the mobile phone detects a video call, if it is determined that there is a device (for example, a laptop computer) that is convenient for the user to edit text nearby, the mobile phone can indicate to the laptop computer that the mobile phone detects an incoming call. When the user chooses to edit on the laptop, the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
应理解,以上结合图6和图7介绍了手机在接收到来电以及视频通话请求时,手机指示周围的笔记本电脑,从而使得笔记本电脑提示用户是否在笔记本电脑上进行通话内容转文字。本申请实施例中,手机也可以在接收到另一用户的语音通话请求后,指示笔记本电脑,从而使得笔记本电脑提示用户是否在笔记本电脑上进行通话内容转文字。It should be understood that when the mobile phone receives an incoming call and a video call request, the mobile phone indicates the surrounding laptops, so that the laptop prompts the user whether to convert the call content to text on the laptop. In this embodiment of the present application, the mobile phone may also instruct the laptop computer after receiving a voice call request from another user, so that the laptop computer prompts the user whether to convert the call content to text on the laptop computer.
图8是本申请实施例提供的另一组GUI。FIG. 8 is another set of GUIs provided by the embodiments of the present application.
参见图8中的(a)所示,手机显示来电显示界面,该来电显示界面显示来自于广东深圳的手机号码“XXX-XXXXXXXX”正在进行呼叫。响应于手机接收到该来电,手机向周围的笔记本电脑发送指示信息,该指示信息用于指示手机检测到用户的来电。响应于接收到该指示信息,笔记本电脑可以显示提醒框801,其中提醒框801中包括提示信息“手机接收到来电,是否在笔记本电脑上进行通话内容转文字?”。Referring to (a) of FIG. 8 , the mobile phone displays a caller ID interface, and the caller ID interface displays that the mobile phone number "XXX-XXXXXXXX" from Shenzhen, Guangdong is making a call. In response to the mobile phone receiving the incoming call, the mobile phone sends indication information to the surrounding notebook computers, where the indication information is used to instruct the mobile phone to detect the user's incoming call. In response to receiving the instruction information, the laptop computer can display a reminder box 801, wherein the reminder box 801 includes the prompt information "The mobile phone has received an incoming call, do you want to convert the content of the call to text on the laptop computer?".
参见图8中的(b)所示,笔记本电脑在预设时长内检测到用户没有点击取消控件或者确定控件802的操作时,可以隐藏该提醒框801。响应于手机检测到用户接受该来电的操作,手机可以对端设备建立通话连接。Referring to (b) in FIG. 8 , when the notebook computer detects that the user does not click on the cancel control or the confirmation control 802 within a preset time period, the reminder box 801 can be hidden. In response to the mobile phone detecting the operation of the user accepting the incoming call, the mobile phone can establish a call connection to the terminal device.
参见图8中的(c)所示,用户在是否手机进行通话的过程中,检测到用户点击录音控件803的操作,可以向笔记本电脑发送指示信息,该指示信息用于指示手机正在进行通话,请求笔记本电脑对通话内容对应的文本内容进行编辑。响应于接收到该指示信息,笔记本电脑可以再一次显示提醒框801。当笔记本电脑检测到用户点击控件802的操作时,笔记本电脑可以向手机发送响应,该响应用于指示笔记本电脑可以对通话内容对应的文本内容进行编辑。响应于接收到该响应,手机可以向笔记本电脑发送该通话内容。Referring to (c) in FIG. 8 , in the process of whether the mobile phone is making a call, the user detects that the user clicks the operation of the recording control 803, and can send instruction information to the notebook computer, and the instruction information is used to indicate that the mobile phone is making a call, Request the laptop to edit the text content corresponding to the call content. In response to receiving the indication information, the notebook computer may display the reminder box 801 again. When the laptop computer detects that the user clicks on the control 802, the laptop computer can send a response to the mobile phone, where the response is used to instruct the laptop computer to edit the text content corresponding to the call content. In response to receiving the response, the cell phone can send the call content to the laptop.
参见图8中的(d)所示,响应于接收到手机发送的通话内容,笔记本电脑的ASR模块可以将该通话内容转化为文本内容“嗨,明天上午…参加”,并显示在App1中。Referring to (d) in FIG. 8 , in response to receiving the call content sent by the mobile phone, the ASR module of the notebook computer can convert the call content into text content “Hi, tomorrow morning… join” and display it in App1.
参见图8中的(e)所示,笔记本电脑检测到用户的编辑操作,可以对文本内容进行编辑,并在App1中显示编辑后的文本内容(“领导:嗨,明天上午…参见”)。Referring to (e) in Figure 8, the laptop detects the user's editing operation, can edit the text content, and displays the edited text content in App1 ("Leader: Hi, tomorrow morning... see").
本申请实施例中,手机在用户接听来电的过程中检测到用户点击录音的操作时,向笔记本电脑发送指示信息,从而指示笔记本电脑对通话内容对应的文本内容进行编辑。当用户选择在笔记本电脑上进行编辑时,笔记本电脑可以实时将从手机获得的语音内容转化为文字内容并显示给用户,从而方便用户对文本内容进行编辑,有助于提升用户的体验。In the embodiment of the present application, when the mobile phone detects that the user clicks the recording operation during the process of answering the call, the mobile phone sends instruction information to the notebook computer, thereby instructing the notebook computer to edit the text content corresponding to the call content. When the user chooses to edit on the laptop, the laptop can convert the voice content obtained from the mobile phone into text content in real time and display it to the user, so as to facilitate the user to edit the text content and help improve the user experience.
下面结合图9和图10介绍笔记本电脑上另外两种显示方式。The other two display modes on the notebook computer are described below with reference to FIG. 9 and FIG. 10 .
图9是本申请实施例提供的另一组GUI。FIG. 9 is another set of GUIs provided by the embodiments of the present application.
如图9中的(a)所示,手机可以显示备忘录中的录音转文字界面,手机检测用户输入的语音内容“热度再变与不变之间中体现民生诉求”,手机可以将该语音内容转化为文本内容,从而向笔记本电脑发送该文本内容。笔记本电脑在接收到该文本内容后,可以通过字幕的方式向用户显示该文本内容。示例性的,笔记本电脑可以在字幕显示区域901中显示该文本内容“热度再变与不变之间中体现民生诉求”。As shown in (a) in Figure 9, the mobile phone can display the recording-to-text interface in the memo, and the mobile phone detects the voice content input by the user "reflecting the people's livelihood demands between the change of popularity and the constant change", and the mobile phone can convert the voice content to the voice content. Convert to text content and send that text content to the laptop. After receiving the text content, the notebook computer can display the text content to the user by means of subtitles. Exemplarily, the notebook computer can display the text content "reflecting the demands of people's livelihood between the change of popularity and the constant change" in the subtitle display area 901 .
如图9中的(b)所示,当手机接着检测到用户输入的语音内容“光志智库我们刚刚统计得到”,手机可以将该语音内容转化为文本内容,从而向笔记本电脑发送该文本内容。笔记本电脑在接收到该文本内容后,可以通过字幕的方式向用户显示该文本内容。示例性的,笔记本电脑可以在字幕显示区域901中显示该文本内容“光志智库我们刚刚统计得到”。As shown in (b) of Figure 9, when the mobile phone detects the voice content "We have just obtained statistics from the Guangzhi Think Tank" input by the user, the mobile phone can convert the voice content into text content and send the text content to the laptop. After receiving the text content, the notebook computer can display the text content to the user by means of subtitles. Exemplarily, the notebook computer can display the text content "We have just obtained statistics on Guangzhi Think Tank" in the subtitle display area 901 .
图10是本申请实施例提供的另一组GUI。FIG. 10 is another set of GUIs provided by the embodiments of the present application.
如图10中的(a)所示,手机可以显示备忘录中的录音转文字界面,手机检测用户输入的语音内容“热度再变与不变之间中体现民生诉求”,手机可以将该语音内容转化为文本内容,从而向笔记本电脑发送该文本内容。笔记本电脑在接收到该文本内容后,可以通过字幕的方式向用户显示该文本内容,同时,笔记本电脑还可以打开App1并在App1中显示该文本内容。示例性的,笔记本电脑可以在字幕显示区域1001中显示该文本内容“热度再变与不变之间中体现民生诉求”同时在App1中显示该文本内容。用户可以在App1中对该文本内容进行编辑。As shown in (a) in Figure 10, the mobile phone can display the recording-to-text interface in the memo, and the mobile phone detects the voice content input by the user "reflecting the people's livelihood demands between the hotness and the constant change", and the mobile phone can convert the voice content to the voice content. Convert to text content and send that text content to the laptop. After receiving the text content, the notebook computer can display the text content to the user by means of subtitles, and at the same time, the notebook computer can also open App1 and display the text content in App1. Exemplarily, the laptop computer can display the text content "Reflecting the demands of people's livelihood between the change of popularity and the constant change" in the subtitle display area 1001, and simultaneously display the text content in App1. The user can edit the text content in App1.
如图10中的(b)所示,当手机接着检测到用户输入的语音内容“光志智库我们刚刚统计得到”,手机可以将该语音内容转化为文本内容,从而向笔记本电脑发送该文本内容。笔记本电脑在接收到该文本内容后,可以通过字幕的方式向用户显示该文本内容,同时,笔记本电脑还可以在App1中“热度再变与不变之间中体现民生诉求”后面追加显示该文本内容。示例性的,笔记本电脑可以在字幕显示区域1001中显示该文本内容“光志智库我们刚刚统计得到”,并且在App1中显示“热度再变与不变之间中体现民生诉求,光志智库我们刚刚统计得到”。As shown in (b) of Figure 10, when the mobile phone detects the voice content "We have just obtained statistics from the Guangzhi Think Tank" input by the user, the mobile phone can convert the voice content into text content and send the text content to the laptop. After the notebook computer receives the text content, it can display the text content to the user through subtitles. At the same time, the notebook computer can also display the text in App1 after "reflecting the demands of people's livelihood between the hotness and the constant change". content. Exemplarily, the notebook computer can display the text content "We have just obtained the statistics of Guangzhi Think Tank" in the subtitle display area 1001, and display in App1 "The demand for people's livelihood is reflected in the change of popularity and the constant change, we have just calculated the Guangzhi Think Tank. get".
下面集合图11介绍本申请实施例提供的系统架构。该系统架构中包括设备A和设备B,其中设备A可以是具有拾音功能的设备(例如,手机),设备B可以是具备高效编辑功能的设备(例如,笔记本电脑)。设备A包括拾音模块1110、ASR模块1120、收发控制模块1130和替换模块1140;设备B包括收发控制模块1150、显示单元1160和编辑控制模块1170。The following describes the system architecture provided by the embodiments of the present application in combination with FIG. 11 . The system architecture includes device A and device B, where device A may be a device with a pickup function (eg, a mobile phone), and device B may be a device with an efficient editing function (eg, a laptop). Device A includes a pickup module 1110 , an ASR module 1120 , a transceiver control module 1130 and a replacement module 1140 ; device B includes a transceiver control module 1150 , a display unit 1160 and an editing control module 1170 .
拾音模块1110用于获取到需要处理的语音内容。本申请实施例中对设备A获取语音内容的方式并不作具体限定,获取方式可以很多样。例如,可以是从设备A周边环境录制实时语音;或者,也可以是用户在手机上播放的音视频的音频;或者,还可以是设备A通过移动网络打电话时获取的音频(包括远端和近端);还可以是手机里面的音视频文件。拾音模块负责把相关的音频转换成特定的音频格式,比如某种采样率的脉冲编码调制(pulse code modulation,PCM)音频流,从而作为ASR模块作为输入。The sound pickup module 1110 is used to acquire the voice content to be processed. In the embodiments of the present application, the manner in which the device A acquires the voice content is not specifically limited, and the acquisition manner may be various. For example, it can be real-time voice recorded from the surrounding environment of device A; or, it can also be the audio of the audio and video played by the user on the mobile phone; or, it can also be the audio (including remote and Near-end); it can also be the audio and video files in the mobile phone. The pickup module is responsible for converting the relevant audio into a specific audio format, such as a pulse code modulation (PCM) audio stream of a certain sampling rate, which is used as an input to the ASR module.
ASR模块1120可以将语音内容转文本内容。通过向ASR模块1120输入特定的PCM音频流,经过训练的深度神经网络声学模型,得出概率最高的音素序列,再经过对应的语言模型,得出概率最高的文字系列,完成从语音内容到文本内容的转换。The ASR module 1120 may convert the speech content to textual content. By inputting a specific PCM audio stream to the ASR module 1120, the deep neural network acoustic model is trained to obtain the phoneme sequence with the highest probability, and then through the corresponding language model, the text series with the highest probability is obtained, completing the process from speech content to text. content conversion.
收发控制模块1130用于设备A和其他设备进行信息同步的控制。本申请实施例中,收发控制模块1130可以负责信息的透传,收发控制模块1130不涉及传输内容的逻辑处理。信息的发送传输可以通过局域网络,比如蓝牙,Wi-Fi或者其他传输协议(例如,internet)。设备A通过网络通信协议跟其他设备(例如,设备B)建立连接网络链路,然后发起启动传输。The transceiver control module 1130 is used to control information synchronization between device A and other devices. In this embodiment of the present application, the transceiver control module 1130 may be responsible for transparent transmission of information, and the transceiver control module 1130 is not involved in the logical processing of the transmission content. The transmission of information may be through a local area network, such as Bluetooth, Wi-Fi, or other transmission protocols (eg, the internet). Device A establishes a connection network link with other devices (eg, device B) through a network communication protocol, and then initiates a start transmission.
应理解,收发控制模块1150的功能描述可以参考上述收发控制模块1130,为了简洁,在此不再赘述。It should be understood that for the functional description of the transceiver control module 1150, reference may be made to the aforementioned transceiver control module 1130, which is not repeated here for brevity.
显示单元1160用于提示用户是否在设备B上启动文本编辑。收发控制模块1150收到收发控制模块1130发送的设备A获取音频内容(例如,语音内容或者音频文件)的指示,或者,收发控制模块1150收到收发控制模块1130发送的在设备B上进行文本编辑的指示后,可以通过显示单元1160以通知形式弹出提示。如果设备B检测到用户的确定通过设 备B进行文本编辑的操作,则可以通过收发控制模块1150向收发控制模块1130发送响应,该响应可以用于指示设备B可以作为文本编辑的设备。当设备A的收发控制模块1130接收到该响应后,设备A开始实时向设备B传输ASR结果的传输。The display unit 1160 is used to prompt the user whether to start text editing on the device B. The transceiver control module 1150 receives the instruction sent by the transceiver control module 1130 for device A to acquire audio content (eg, voice content or audio files), or, the transceiver control module 1150 receives the instruction sent by the transceiver control module 1130 to perform text editing on device B. After the instruction is displayed, a prompt can be popped up in the form of a notification through the display unit 1160 . If the device B detects that the user determines to perform text editing through the device B, it can send a response to the transceiver control module 1130 through the transceiver control module 1150, and the response can be used to indicate that the device B can be used as a text editing device. After the transceiver control module 1130 of device A receives the response, device A starts to transmit the ASR result to device B in real time.
一个实施例中,当设备A检测到获取音频内容(例如,语音内容或者音频文件)后,收发控制模块1130可以发送查询请求,该查询请求用于查询周围具备文本编辑功能的设备。In one embodiment, after the device A detects the acquisition of audio content (eg, voice content or audio files), the transceiver control module 1130 may send a query request, where the query request is used to query surrounding devices with text editing functions.
示例性的,设备A和设备B是同一账号下的设备,那么设备A可以保存有设备B的设备类型、设备名称、MAC地址等信息。当设备A检测到获取语音内容时,设备A可以根据设备B的MAC地址,向设备B发送BLE数据包。该BLE数据包可以包括PDU,该查询请求可以携带在PDU中的服务数据字段中,或者,也可以携带在PDU中的厂商特定数据字段中。例如,服务数据字段的负荷中可以包括多个比特位,其中,多个比特位中包括可扩展的比特位。设备A和设备B可以约定某个可扩展的比特位的内容。当某个可扩展的比特位为1时,设备B就可以获取设备A查询设备B是否具备文本编辑功能。Exemplarily, if device A and device B are devices under the same account, device A may store information such as device type, device name, and MAC address of device B. When device A detects the acquisition of voice content, device A can send BLE data packets to device B according to the MAC address of device B. The BLE data packet may include a PDU, and the query request may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU. For example, the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits. Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device B can obtain device A and query whether device B has the text editing function.
例如,如图5中的(c)所示,当手机检测到用户点击控件501(控件501为开始录音的控件)的操作时,手机可以发送BLE数据包,该BLE数据包中可以携带该查询请求。For example, as shown in (c) of FIG. 5 , when the mobile phone detects that the user clicks on the control 501 (the control 501 is the control for starting recording), the mobile phone can send a BLE data packet, and the BLE data packet can carry the query ask.
一个实施例中,当设备A检测到用户进行语音转文字的操作时,设备A可以根据设备B的MAC地址,向设备B发送BLE数据包。In one embodiment, when device A detects that the user performs an operation of converting speech to text, device A can send a BLE data packet to device B according to the MAC address of device B.
例如,如图3中的(d)所示,当手机检测到用户点击控件302(控件302为手机进行语音转文字的控件)时,手机可以发送BLE数据包,该BLE数据包中可以携带该查询请求。For example, as shown in (d) of FIG. 3 , when the mobile phone detects that the user clicks on the control 302 (the control 302 is a control for the mobile phone to perform voice-to-text conversion), the mobile phone can send a BLE data packet, and the BLE data packet can carry the query request.
设备B的收发控制模块可以调用查询文本编辑功能的接口(例如,content provider接口)向应用层的一个或者多个应用程序发送请求,该请求用于请求应用程序判断是否具备文本编辑功能;若某个应用程序具备文本编辑功能,那么该应用程序可以向数据同步模块发送响应,该响应用于指示app2登录了账号。从而设备B可以确定设备B安装且登录了app2。The transceiver control module of device B can call the interface for querying the text editing function (for example, the content provider interface) to send a request to one or more applications in the application layer, and the request is used to request the application to determine whether it has the text editing function; If an application program has a text editing function, the application program can send a response to the data synchronization module, and the response is used to indicate that app2 has logged into the account. Therefore, device B can determine that device B has installed and logged in app2.
在设备B确定具备文本编辑功能后,设备B可以向设备A发送响应,该响应可以携带在BLE数据包。该BLE数据包可以包括PDU,该查询请求可以携带在PDU中的服务数据字段中,或者,也可以携带在PDU中的厂商特定数据字段中。例如,服务数据字段的负荷中可以包括多个比特位,其中,多个比特位中包括可扩展的比特位。设备A和设备B可以约定某个可扩展的比特位的内容。当某个可扩展的比特位为1时,设备A就可以获取设备B具备文本剪辑功能。After device B determines that it has the text editing function, device B can send a response to device A, and the response can be carried in a BLE data packet. The BLE data packet may include a PDU, and the query request may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU. For example, the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits. Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device A can obtain the text clipping function of device B.
当设备A确定设备B具备文本编辑功能后,设备A就可以向设备B发送获取的音频内容对应的文本内容以及指示信息,该指示信息用于指示在设备B上对该文本内容进行编辑。After device A determines that device B has the text editing function, device A can send the acquired text content and instruction information corresponding to the audio content to device B, where the instruction information is used to instruct device B to edit the text content.
示例性的,设备A可以根据设备B的MAC地址,向设备B发送BLE数据包。该BLE数据包可以包括PDU,该文本内容和指示信息可以携带在PDU中的服务数据字段中,或者,也可以携带在PDU中的厂商特定数据字段中。例如,服务数据字段的负荷中可以包括多个比特位,其中,多个比特位中包括可扩展的比特位。设备A和设备B可以约定某个可扩展的比特位的内容。设备A可以采用GBK、ISO8859-1或者Unicode(例如,UTF-8、 UTF-16)等编码方式,对设备A的ASR模块输出的文本内容进行编码,并将编码后得到的信息携带在一个或者多个可扩展比特位上。设备A还可以将某个可扩展的比特位设置为1。设备B在接收到该BLE数据包后,可以通过解码获得该文本内容和指示信息,从而设备B可以根据该指示信息显示该文本内容。Exemplarily, device A may send a BLE data packet to device B according to the MAC address of device B. The BLE data packet may include a PDU, and the text content and indication information may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU. For example, the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits. Device A and Device B can agree on the content of a certain extensible bit. Device A can use encoding methods such as GBK, ISO8859-1 or Unicode (for example, UTF-8, UTF-16) to encode the text content output by the ASR module of device A, and carry the encoded information in an or on multiple scalable bits. Device A may also set an extensible bit to 1. After receiving the BLE data packet, the device B can obtain the text content and the indication information by decoding, so that the device B can display the text content according to the indication information.
或者,当设备A确定设备B具备文本编辑功能后,设备A就可以向设备B发送获取的音频内容对应的文本内容。Alternatively, after device A determines that device B has a text editing function, device A may send text content corresponding to the acquired audio content to device B.
示例性的,设备A可以根据设备B的MAC地址,向设备B发送BLE数据包。该BLE数据包可以包括PDU,该文本内容可以携带在PDU中的服务数据字段中,或者,也可以携带在PDU中的厂商特定数据字段中。例如,服务数据字段的负荷中可以包括多个比特位,其中,多个比特位中包括可扩展的比特位。设备A和设备B可以约定某个可扩展的比特位的内容。设备A可以采用GBK、ISO8859-1或者Unicode(例如,UTF-8、UTF-16)等编码方式,对设备A的ASR模块输出的文本内容进行编码,并将编码后得到的信息携带在一个或者多个可扩展比特位上。设备B在接收到该BLE数据包后,可以通过解码获得该文本内容,从而设备B可以显示该文本内容。Exemplarily, device A may send a BLE data packet to device B according to the MAC address of device B. The BLE data packet may include a PDU, and the text content may be carried in a service data field in the PDU, or may also be carried in a vendor-specific data field in the PDU. For example, the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits. Device A and Device B can agree on the content of a certain extensible bit. Device A can use encoding methods such as GBK, ISO8859-1 or Unicode (for example, UTF-8, UTF-16) to encode the text content output by the ASR module of device A, and carry the encoded information in an or on multiple scalable bits. After receiving the BLE data packet, device B can obtain the text content by decoding, so that device B can display the text content.
一个实施例中,设备A和设备B是同一账号下的设备,那么设备A可以保存有设备B的设备类型、设备名称、MAC地址以及是否具备文本编辑功能等信息。那么当设备A获取音频内容时,设备A可以向设备B发送该音频内容对应的文本内容和指示信息,该指示信息用于指示在设备B上对该文本内容进行编辑。设备B响应于接收到文本内容和该指示信息,可以通过显示单元1160提示用户是否在设备B上进行文本编辑。响应于检测到用户在设备B上进行文本编辑的操作,设备B可以打开某个可以用于文本编辑的应用程序,从而将从设备A获取的文本内容显示出来。In one embodiment, device A and device B are devices under the same account, then device A can store information such as device type, device name, MAC address, and whether it has a text editing function of device B. Then, when device A acquires the audio content, device A can send the text content and indication information corresponding to the audio content to device B, where the indication information is used to instruct device B to edit the text content. In response to receiving the text content and the indication information, the device B may prompt the user whether to perform text editing on the device B through the display unit 1160 . In response to detecting that the user performs text editing on the device B, the device B can open an application that can be used for text editing, so as to display the text content obtained from the device A.
或者,设备A可以向设备B发送该音频内容对应的文本内容。设备B响应于接收到文本内容,设备B可以在设备B上对该文本内容进行编辑,从而设备B可以通过显示单元1160提示用户是否在设备B上进行文本编辑。响应于检测到用户在设备B上进行文本编辑的操作,设备B可以打开某个可以用于文本编辑的应用程序,从而将从设备A获取的文本内容显示出来。Alternatively, device A may send the text content corresponding to the audio content to device B. In response to receiving the text content, the device B can edit the text content on the device B, so that the device B can prompt the user whether to perform text editing on the device B through the display unit 1160 . In response to detecting that the user performs text editing on the device B, the device B can open an application that can be used for text editing, so as to display the text content obtained from the device A.
本申请实施例中,显示单元1160也可以用于显示ASR模块输出的文本内容(包括中间结果和最终确定的结果);显示单元1160还可以用于显示用户在设备B上编辑后的文本内容。In this embodiment of the present application, the display unit 1160 can also be used to display the text content output by the ASR module (including the intermediate result and the final result); the display unit 1160 can also be used to display the text content edited by the user on the device B.
一个实施例中,设备B的收发控制模块1150收到收发控制模块1130发送的文本内容后,可以将该文本内容追加到之前显示的文本内容的后面。示例性的,以设备B为windows系统为例,设备B的收发控制模块1150在接收到文本内容后,可以调用QT接口,执行以下步骤:(1)对App1显示的文本内容进行全选;(2)将光标移动到文末;(3)插入设备A发送的新的文本内容;(4)保存当前App1中的所有文本内容。In one embodiment, after receiving the text content sent by the transceiver control module 1130, the transceiver control module 1150 of the device B can append the text content to the back of the previously displayed text content. Exemplarily, taking device B as a windows system as an example, after receiving the text content, the transceiver control module 1150 of device B can call the QT interface and perform the following steps: (1) select all the text content displayed by App1; ( 2) Move the cursor to the end of the text; (3) Insert the new text content sent by device A; (4) Save all the text content in the current App1.
编辑控制模块1170用于对接收信息的保存,编辑和显示控制。The editing control module 1170 is used for saving, editing and displaying control of the received information.
本申请实施例中,通过ASR模块1120发送给设备B的结果可以包括2种:In this embodiment of the present application, the results sent to device B through the ASR module 1120 may include two types:
第一种是中间结果,比如一句话没有说完之前,ASR模块1120确定的文本内容是没有最终确定的,可以作为一个中间结果。为了体现实时性,中间结果也是要显示出来的,但并没有作为最终的结果保存。The first type is an intermediate result. For example, before a sentence is finished, the text content determined by the ASR module 1120 is not finalized and can be used as an intermediate result. In order to reflect the real-time performance, the intermediate results are also displayed, but are not saved as the final results.
示例性的,设备A的ASR模块检测到用户每说出一个字就向设备B发送对应的字,从而可以实现设备A和设备B上同步显示文本内容。例如,用户说“我是XX”,当设备A检测到用户说出“我”时,设备A的ASR模块在确定对应的文字“我”后就可以向设备B发送该文本内容(“我”),从而设备B可以显示该文本内容“我”。当设备A检测到用户说出“是”时,设备A的ASR模块在确定对应的文字“是”后就可以向设备B发送该文本内容(“是”),从而设备B可以在文本内容“我”后追加显示文本内容“是”,直至显示出“我是XX”。Exemplarily, the ASR module of device A detects that every time the user speaks a word, it sends a corresponding word to device B, so that text content can be displayed on device A and device B synchronously. For example, the user says "I am XX", when device A detects that the user has said "I", the ASR module of device A can send the text content ("I" to device B after determining the corresponding text "I" ), so that device B can display the text content "I". When device A detects that the user has said "yes", the ASR module of device A can send the text content ("yes") to device B after determining the corresponding text "yes", so that device B can Append the text "Yes" after "I" until "I am XX" is displayed.
示例性的,设备A的ASR模块检测到用户每说出一个词组就向设备B发送对应的词组,从而可以实现设备A和设备B上同步显示文本内容。例如,用户说“任何艰难困苦都不能阻挡我们前进的步伐”,当设备A检测到用户说出“任何”时,设备A的ASR模块在确定对应的文字“任何”后就可以向设备B发送该文本内容(“任何”),从而设备B可以显示该文本内容“任何”。当设备A检测到用户说出“艰难”时,设备A的ASR模块在确定对应的文字“艰难”后就可以向设备B发送该文本内容(“艰难”),从而设备B可以在文本内容“任何”后追加显示文本内容“艰难”,直至显示出“任何艰难困苦都不能阻挡我们前进的步伐”。Exemplarily, the ASR module of device A detects that every time the user speaks a phrase, it sends a corresponding phrase to device B, so that text content can be displayed on device A and device B synchronously. For example, the user says "no hardship can stop us from moving forward", when device A detects that the user has said "any", the ASR module of device A can send the corresponding word "any" to device B after determining the corresponding word "any" The text content ("any"), so that device B can display the text content "any". When device A detects that the user says "difficult", the ASR module of device A can send the text content ("difficult") to device B after determining the corresponding word "difficult", so that device B can Append the text "difficulty" after any" until it shows "no hardship can stop us from moving forward".
一个实施例中,设备A显示的内容与设备B上显示的内容可以同步,设备A在判断用户说完一句话时可能会对之前ASR模块转化的文本内容进行纠正。示例性的,用户说“今天我想吃面条”,设备A的ASR模块检测到用户每说出一个字或者一个词组时就向设备B发送对应的文本内容,当设备A检测到用户说出“想”时,有可能ASR模块确定对应的文字为“向”,那么设备A会向设备B发送对应的文本内容(“向”),设备B可以以显示该文本内容“向”。而当设备A在判断用户说完这句话后,可以确定之前转化的文本内容“今天我向吃面条”中的“向”转化存在错误,则设备A可以自动将该“向”纠正为“想”,从而更新用户说出的文本内容(从“今天我向吃面条”更新为“今天我想吃面条”)。设备A可以将更新后的文本内容发送给设备B,从而设备B也可以将之前显示的文本内容“今天我向吃面条”中的“向”更新为“想”,从而显示更新后的文本内容“今天我想吃面条”。In one embodiment, the content displayed by device A can be synchronized with the content displayed on device B, and device A may correct the text content previously converted by the ASR module when judging that the user has finished speaking. Exemplarily, when the user says "I want to eat noodles today", the ASR module of device A detects that every time the user speaks a word or a phrase, it sends the corresponding text content to device B. When device A detects that the user says " When "thinking", it is possible that the ASR module determines that the corresponding text is "xiang", then device A will send the corresponding text content ("xiang") to device B, and device B can display the text content "xiang". After judging that the user has finished saying this sentence, device A can determine that there is an error in the conversion of "Xiang" in the previously converted text content "Today I'm eating noodles", then device A can automatically correct the "Xiang" to "Xiang". I want to", thereby updating the text content spoken by the user (from "Today I want to eat noodles" to "Today I want to eat noodles"). Device A can send the updated text content to device B, so that device B can also update "Xiang" in the previously displayed text content "Today I want to eat noodles" to "Want", thereby displaying the updated text content "Today I want to eat noodles".
第二种是一句话说完后,ASR模块1120确定的这句话的识别结果以后不会再变回,这样的结果会把之前的中间结果替换显示并作为保存结果。The second is that after a sentence is spoken, the recognition result of the sentence determined by the ASR module 1120 will not be changed back in the future, and such a result will replace the previous intermediate result and display it as the saved result.
示例性的,设备A的ASR模块可以将用户说出的每一个字或者每一个词组转化为文本内容,但是设备A在判断用户没有说完一句话之前,可以不向设备B发送文本内容。而是,当设备A判断用户说完一句话之后并将之前转化文本内容进行纠正后,向设备B发送该句话对应的文本内容。Exemplarily, the ASR module of device A can convert each word or phrase spoken by the user into text content, but device A may not send text content to device B before judging that the user has not finished a sentence. Instead, when device A determines that the user has finished speaking a sentence and corrects the previously converted text content, it sends the text content corresponding to the sentence to device B.
一个实施例中,设备A的ASR模块可以将用户说出的每一字或者每一个词组转化为文本内容后发送给设备B,设备B可以以字幕的方式显示设备A发送的文本内容,但是设备B可以不将该文本内容显示在app1中。当设备A判断用户说完一句话(并对之前转化的文本内容进行纠正)之后,设备A可以向设备B发送该句话对应的文本内容以及指示信息,该指示信息指示设备B在app1中显示该句话对应的文本内容。设备B在接收到该文本内容和指示信息后,可以在app1中显示该句话对应的文本内容。In one embodiment, the ASR module of device A can convert each word or phrase spoken by the user into text content and send it to device B. Device B can display the text content sent by device A in the form of subtitles, but the device B may not display the text content in app1. When device A judges that the user has finished speaking a sentence (and corrects the previously converted text content), device A can send the text content corresponding to the sentence and indication information to device B, and the indication information instructs device B to display in app1 The text content corresponding to the sentence. After receiving the text content and the instruction information, device B can display the text content corresponding to the sentence in app1.
如果设备A在实时同步ASR模块的输出结果的过程中,设备B检测到用户在设备B上对文本内容进行了编辑,编辑控制模块1170也会重新保存编辑后的结果并显示出来, 后续的ASR结果会在此基础上追加。If device A detects that the user has edited the text content on device B during the process of synchronizing the output result of the ASR module in real time, the editing control module 1170 will also re-save the edited result and display it, and the subsequent ASR The result is appended on top of that.
编辑控制模块1170还用于将用户编辑后的文本内容发送给收发控制模块1150,从而收发控制模块1150将编辑后的文本内容发送给收发控制模块1130。The editing control module 1170 is further configured to send the text content edited by the user to the sending and receiving control module 1150 , so that the sending and receiving control module 1150 sends the edited text content to the sending and receiving control module 1130 .
替换模块1140用于在接收到收发控制模块1130发送的编辑后的文本内容后,将该编辑后的文本内容替换原来显示的文本内容。The replacement module 1140 is configured to replace the original displayed text content with the edited text content after receiving the edited text content sent by the transceiver control module 1130 .
一个实施例中,当录音结束或者音频获取结束时,设备A可以向设备B指示录音结束或者音频获取结束。用户在设备B上编辑完成后,可以一次性把编辑结果同步到设备A,整个同步流程完成。In one embodiment, when the recording ends or the audio acquisition ends, the device A may indicate the end of the recording or the audio acquisition to the device B. After the user finishes editing on device B, the editing result can be synchronized to device A at one time, and the entire synchronization process is completed.
图12示出了本申请实施例的文本编辑的方法1200的示意性流程图。该方法1200可以由上述设备A和设备B实现,该方法1200包括:FIG. 12 shows a schematic flowchart of a text editing method 1200 according to an embodiment of the present application. The method 1200 can be implemented by the above-mentioned device A and device B, and the method 1200 includes:
S1201,设备A获取音频内容。S1201, device A acquires audio content.
示例性的,如图3中的(d)所示,当手机检测到用户点击控件302的操作时,手机开始采集环境中的语音内容。Exemplarily, as shown in (d) of FIG. 3 , when the mobile phone detects that the user clicks on the control 302 , the mobile phone starts to collect the voice content in the environment.
示例性的,如图5中的(c)所示,当手机检测到用户点击控件501的操作时,手机还是采集环境中的语音内容。Exemplarily, as shown in (c) of FIG. 5 , when the mobile phone detects an operation of the user clicking on the control 501, the mobile phone still collects the voice content in the environment.
示例性的,如图6中的(b)所示,当手机检测到用户接受该来电的操作时,手机可以获取另一用户的语音内容。Exemplarily, as shown in (b) of FIG. 6 , when the mobile phone detects the user's operation of accepting the incoming call, the mobile phone can acquire the voice content of another user.
示例性的,如图7中的(b)所示,当手机检测到用户接受该视频通话请求的操作时,手机可以获取另一用户的语音内容。Exemplarily, as shown in (b) of FIG. 7 , when the mobile phone detects that the user accepts the operation of the video call request, the mobile phone can acquire the voice content of another user.
S1202,设备A根据该音频内容,向设备B发送第一信息。S1202, device A sends first information to device B according to the audio content.
一个实施例中,该第一信息为该音频内容对应的文本内容。In one embodiment, the first information is text content corresponding to the audio content.
应理解,本申请实施例中,设备A在获取到语音内容后,可以先通过ASR模块将该语音内容转化为文本内容,从而向设备B发送该文本内容。设备A将该语音内容转化为文本内容的过程可以参考上述实施例中的描述,为了简洁,在此不再赘述。It should be understood that in this embodiment of the present application, after acquiring the voice content, device A may first convert the voice content into text content through the ASR module, so as to send the text content to device B. For the process of converting the voice content into text content by the device A, reference may be made to the descriptions in the foregoing embodiments, which are not repeated here for brevity.
一个实施例中,设备A在向设备B发送该语音内容之前,该方法还包括:In one embodiment, before device A sends the voice content to device B, the method further includes:
设备A向发送查询请求,该查询请求用于请求接收到该查询请求的设备判断是否具备文本编辑功能;Device A sends a query request to request the device that received the query request to determine whether it has a text editing function;
响应于接收到设备A发送的请求信息,设备B向设备A发送响应,该响应用于指示设备B具备文本编辑功能;In response to receiving the request information sent by device A, device B sends a response to device A, where the response is used to indicate that device B has a text editing function;
响应于接收到该响应,设备A向设备B发送该第一信息。In response to receiving the response, device A sends device B the first information.
本申请实施例中,设备A向周围设备广播发送该查询请求。或者,设备A可以保存有设备B的设备信息(例如,设备A和设备B为同一账号下的设备或者,设备A和设备B为同一家庭群组中不同账号下的设备)。In this embodiment of the present application, device A broadcasts the query request to surrounding devices. Alternatively, device A may store device information of device B (eg, device A and device B are devices under the same account, or device A and device B are devices under different accounts in the same family group).
应理解,设备A发送查询请求的过程可以参考上述实施例中的描述,为了简洁,在此不再赘述。It should be understood that, for the process of sending the query request by the device A, reference may be made to the descriptions in the foregoing embodiments, which are not repeated here for brevity.
一个实施例中,设备B响应于接收到该查询请求,可以提示用户是否在设备B上进行文本编辑;响应于用户确定在设备B上进行文本编辑的操作,设备B向设备A发送该响应。In one embodiment, in response to receiving the query request, device B may prompt the user whether to perform text editing on device B; in response to the user determining to perform text editing on device B, device B sends the response to device A.
示例性的,如图3中(d)所示,当笔记本电脑检测到手机发送的查询请求后,笔记 本电脑可以显示提示框303,该提示框303中包括提示信息“检测到手机正在进行录音转文字,是否在笔记本电脑上进行文本编辑”。响应于用户点击控件304的操作,笔记本电脑可以向手机发送该响应。Exemplarily, as shown in (d) in FIG. 3 , when the laptop computer detects the query request sent by the mobile phone, the laptop computer can display a prompt box 303, which includes the prompt message “It is detected that the mobile phone is recording and transferring. text, whether text editing is done on a laptop". In response to the user clicking on control 304, the laptop can send the response to the phone.
一个实施例中,设备A在接收到设备B发送的响应后,可以向设备A发送请求信息,该请求信息用于请求设备B对设备A输出的文本内容进行编辑;或者,该请求信息用于请求设备B对该音频内容对应的文本内容进行编辑。In one embodiment, after receiving the response sent by device B, device A may send request information to device A, where the request information is used to request device B to edit the text content output by device A; Request device B to edit the text content corresponding to the audio content.
示例性的,如图3中的(d)所示,当手机检测到用户点击控件302的操作时,手机可以向周围设备发送查询请求,该查询请求用于查询周围设备是否具备文本编辑功能;响应于接收到笔记本电脑发送的响应,手机可以确定笔记本电脑具备文本编辑功能。手机可以继续向笔记本电脑发送请求信息,该请求信息用于对手机输出的文本内容进行编辑。响应于从手机接收到该请求信息,笔记本电脑可以显示提示框303。响应于用户点击控件304的操作,笔记本电脑可以向手机发送响应,该响应用于指示笔记本电脑可以对文本内容进行编辑。响应于接收到该响应,手机可以向笔记本电脑发送文本内容。Exemplarily, as shown in (d) in FIG. 3 , when the mobile phone detects that the user clicks on the control 302, the mobile phone can send a query request to the surrounding devices, and the query request is used to query whether the surrounding devices have a text editing function; In response to receiving the response sent by the laptop, the mobile phone can determine that the laptop has text editing capabilities. The mobile phone can continue to send request information to the notebook computer, and the request information is used to edit the text content output by the mobile phone. In response to receiving the request information from the cell phone, the laptop computer may display a prompt box 303 . In response to the user clicking on the control 304, the notebook computer can send a response to the mobile phone, the response being used to indicate that the notebook computer can edit the text content. In response to receiving the response, the cell phone can send the textual content to the laptop.
示例性的,如图5中的(c)所示,手机检测到用户点击控件501的操作时,向周围设备发送查询请求;响应于接收到笔记本电脑发送的响应,手机可以确定笔记本电脑具备文本编辑功能。手机可以向笔记本电脑发送请求信息,该请求信息用于请求笔记本电脑对手机获取的音频内容对应的文本内容进行编辑。响应于接收到该请求信息,笔记本电脑可以显示提示框502,其中,提示框502中包括提示信息“检测到手机正在进行录音,是否在笔记本电脑上进行录音转文字”。响应于用户点击控件503的操作,笔记本电脑可以向手机发送响应,该响应用于指示笔记本电脑可以对录音对应的文本内容进行编辑。响应于接收到该响应,手机可以向笔记本电脑发送文本内容。Exemplarily, as shown in (c) of FIG. 5 , when the mobile phone detects the operation of the user clicking on the control 501, it sends a query request to the surrounding devices; in response to receiving the response sent by the notebook computer, the mobile phone can determine that the notebook computer has text Edit function. The mobile phone can send request information to the laptop computer, where the request information is used to request the laptop computer to edit the text content corresponding to the audio content acquired by the mobile phone. In response to receiving the request information, the laptop computer may display a prompt box 502, wherein the prompt box 502 includes prompt information "it is detected that the mobile phone is recording, whether to convert the recording to text on the laptop computer". In response to the user clicking the control 503, the notebook computer can send a response to the mobile phone, where the response is used to instruct the notebook computer to edit the text content corresponding to the recording. In response to receiving the response, the cell phone can send the textual content to the laptop.
还应理解,设备B可以通过BLE数据包向设备A发送该响应信息,具体的发送过程可以参考上述设备A向设备B发送BLE数据包的过程,为了简洁,在此不再赘述。It should also be understood that device B can send the response information to device A through a BLE data packet, and the specific sending process can refer to the above-mentioned process of device A sending a BLE data packet to device B, which is not repeated here for brevity.
示例性的,该请求信息可以携带在用户数据报协议(user datagram protocol,UDP)数据包中。UPD数据包中包括IP数据报的数据部分。IP数据报的数据部分可以包括可扩展的比特位。设备A和设备B可以约定某个可扩展比特位的内容。当某个可扩展的比特位为1时,设备B就可以获知设备A请求对文本内容进行编辑。Exemplarily, the request information may be carried in a user datagram protocol (user datagram protocol, UDP) data packet. The UPD packet includes the data portion of the IP datagram. The data portion of an IP datagram may include extensible bits. Device A and Device B can agree on the content of a certain extensible bit. When a certain extensible bit is 1, device B can know that device A requests to edit the text content.
该UDP数据包还可以携带设备A的IP地址和端口号(包括源端口号和目的端口号,其中,源端口号是指设备A发送数据时使用的端口号,目的端口号是指设备A接收数据所使用的端口),设备A的IP地址和端口号可以携带在IP数据报的数据部分的UDP首部。设备B响应于接收到该UDP数据包,可以和设备A建立传输控制协议(transmission control protocol,TCP)连接。The UDP data packet can also carry the IP address and port number of device A (including source port number and destination port number, where the source port number refers to the port number used by device A to send data, and the destination port number refers to the port number that device A receives The port used by the data), the IP address and port number of device A can be carried in the UDP header of the data part of the IP datagram. In response to receiving the UDP data packet, device B can establish a transmission control protocol (transmission control protocol, TCP) connection with device A.
应理解,当设备B和设备A建立了TCP连接后,设备B可以通过TCP连接向设备A发送该响应。It should be understood that after device B and device A establish a TCP connection, device B can send the response to device A through the TCP connection.
示例性的,若UDP数据包中携带设备A的IP地址以及目的端口号。设备B在可以通过IP地址以及目的端口号,和设备A建立TCP连接。此后设备A可以通过TCP连接向设备B发送设备A的ASR模块输入的文本内容。Exemplarily, if the UDP data packet carries the IP address and destination port number of device A. Device B can establish a TCP connection with device A through the IP address and destination port number. After that, device A can send the text content input by the ASR module of device A to device B through the TCP connection.
一个实施例中,该方法1200还包括:在设备A将语音内容转化为文字内容时,设备A显示该文本内容。In one embodiment, the method 1200 further includes: when the device A converts the voice content into text content, the device A displays the text content.
示例性的,如图3中的(e)所示,手机在检测到环境中的语音内容后,可以将该语音内容转化为文本内容,从而通过备忘录应用的显示界面显示该文本内容。Exemplarily, as shown in (e) of FIG. 3 , after detecting the voice content in the environment, the mobile phone can convert the voice content into text content, so as to display the text content through the display interface of the memo application.
S1203,设备B根据该第一信息,显示该音频内容对应的文本内容。S1203: Device B displays the text content corresponding to the audio content according to the first information.
示例性的,如图3中的(e)所示,笔记本电脑在接收到手机发送的文本内容后,可以通过App1显示该文本内容。Exemplarily, as shown in (e) of FIG. 3 , after receiving the text content sent by the mobile phone, the notebook computer can display the text content through App1.
S1204,检测到用户对该文本内容进行编辑的操作,设备B可以显示编辑后的文本内容。S1204, an operation of editing the text content by the user is detected, and the device B can display the edited text content.
示例性的,如图3中的(f)和(g)所示,笔记本电脑在检测到用户编辑该文本内容的操作后,笔记本电脑可以显示编辑后的文本内容。Exemplarily, as shown in (f) and (g) of FIG. 3 , after the notebook computer detects an operation of editing the text content by the user, the notebook computer may display the edited text content.
一个实施例中,该方法1200还包括:设备B向设备A发送编辑后的文本内容。In one embodiment, the method 1200 further includes: device B sending the edited text content to device A.
一个实施例中,设备B检测到用户的第一操作,向设备A发送编辑后的文本内容。In one embodiment, device B detects the user's first operation, and sends the edited text content to device A.
示例性的,如图3中的(f)所示,当笔记本电脑检测到用户点击控件307的操作时,笔记本电脑可以向手机发送编辑后的文本内容。Exemplarily, as shown in (f) of FIG. 3 , when the notebook computer detects that the user clicks on the control 307, the notebook computer can send the edited text content to the mobile phone.
示例性的,如图3中的(g)所示,当笔记本电脑检测到用户点击控件307的操作时,笔记本电脑可以向手机发送编辑后的文本内容。Exemplarily, as shown in (g) of FIG. 3 , when the notebook computer detects that the user clicks on the control 307, the notebook computer can send the edited text content to the mobile phone.
一个实施例中,设备A在接收到编辑后的文本内容后,可以使用编辑后的文本内容对之前显示的文本内容进行编辑。示例性的,如图3中的(e)和(f)所示,当笔记本电脑检测到用户点击保存控件307的操作时,笔记本电脑可以向手机发送编辑后的文本内容。手机可以使用用户编辑后的文本内容替换之前显示的文本内容。In one embodiment, after receiving the edited text content, device A may use the edited text content to edit the previously displayed text content. Exemplarily, as shown in (e) and (f) of FIG. 3 , when the notebook computer detects that the user clicks the save control 307, the notebook computer may send the edited text content to the mobile phone. The phone can replace the previously displayed text content with the text content edited by the user.
本申请实施例中,设备B在检测到用户对设备A发送的文本内容的编辑操作时,可以对该文本内容进行相应的编辑。示例性的,参见图3中的(g)所示,当笔记本电脑检测到用户对文本内容编辑操作时,可以对从手机接收到的文本内容进行编辑。示例性的,当笔记本电脑可以检测到用户将“正成功”修改为“郑成功”的操作;又例如,笔记本电脑可以检测到用户在“年度特征”后增加符号“。”的操作;又例如,笔记本电脑可以检测到用户在“医疗教育住房食品安全”中插入符号“、”的操作。又例如,笔记本电脑可以检测到用户在“社会心态?”后回车的操作。In this embodiment of the present application, when device B detects an editing operation by the user on the text content sent by device A, it can edit the text content accordingly. Exemplarily, as shown in (g) of FIG. 3 , when the notebook computer detects that the user has edited the text content, it can edit the text content received from the mobile phone. Exemplarily, when the notebook computer can detect the user's operation of changing "positive success" to "Zheng Chenggong"; for another example, the notebook computer can detect the user's operation of adding a symbol "." after "annual feature"; another example, The laptop can detect the user inserting the symbol "," in "Medical Education Housing Food Safety". As another example, a laptop can detect the user's press enter after "Social Mindset?".
设备B可以通过BLE数据包或者通过TCP连接向设备A发送编辑后的文本内容,该发送方式可以参考上述实施例中的描述,为了简洁,在此不再赘述。Device B can send the edited text content to device A through a BLE data packet or through a TCP connection. For the sending method, reference may be made to the description in the above embodiment, which is not repeated here for brevity.
一个实施例中,如果设备B检测到用户编辑了该文本内容的格式,那么设备B向设备A发送编辑后的文本内容时,还可以向设备A指示该文本内容的格式信息。示例性的,该文本内容的格式可以包括该文本内容中某两个字之间的换行(或者回车)操作;或者,某两个字之间存在空格。示例性的,如图3中的(g)所示,笔记本电脑在向手机发送编辑后的文本内容的同时,还可以向手机指示“求”和“2”之间存在换行,以及“?”和“3”之间存在换行。In one embodiment, if device B detects that the user has edited the format of the text content, when device B sends the edited text content to device A, it can also indicate the format information of the text content to device A. Exemplarily, the format of the text content may include a line feed (or carriage return) operation between certain two words in the text content; or, there is a space between certain two words. Exemplarily, as shown in (g) of FIG. 3 , while sending the edited text content to the mobile phone, the notebook computer can also indicate to the mobile phone that there is a line break between "Q" and "2", and "?" There is a newline between "3".
一个实施例中,设备B在检测到用户对文本内容的格式进行修改的操作后,可以向设备A发送编辑后的文本内容以及该文本内容的格式信息。设备B向设备A发送编辑后的文本内容的实现方式可以参考上述实施例中的描述,为了简洁,在此不在赘述。下面介绍设备B向设备A发送编辑后的文本内容的格式信息的一种实现方式。示例性的,该文本内容格式包括字体大小、字体颜色、字体倾斜、字体下划线、字体的背景色、本本内容中 某个文字后的回车操作。In one embodiment, after detecting the user's operation of modifying the format of the text content, device B may send the edited text content and format information of the text content to device A. The implementation manner of the device B sending the edited text content to the device A may refer to the description in the foregoing embodiment, which is not repeated here for brevity. The following describes an implementation manner in which device B sends the format information of the edited text content to device A. Exemplarily, the text content format includes font size, font color, font inclination, font underline, font background color, and a carriage return operation after a certain text in this content.
示例性的,设备A可以通过BLE数据包向设备B发送编辑后的文本内容的格式信息。该文本内容可以携带在PDU中的服务数据字段或者厂商特定数据字段中。例如,服务数据字段的负荷中可以包括多个比特位,其中,多个比特位中包括可扩展的比特位。对于编辑后的文本内容中的某一个字符(例如,文字、字符或者符号),设备A可以和设备B约定某些可扩展的比特位的内容。示例性的,当某个可扩展的比特位为000时,设备A就可以获知该字符不倾斜、无下划线且该字符后无回车操作。示例性的,当某个可扩展的比特位为100时,设备A就可以获知该字符倾斜、无下划线且该字符后无回车操作。示例性的,当某个可扩展的比特位为010时,设备A就可以获知该字符不倾斜、有下划线且该字符后无回车操作。示例性的,当某个可扩展的比特位为001时,设备A就可以获知该字符不倾斜、无下划线且该字符后有回车操作。Exemplarily, device A may send the format information of the edited text content to device B through a BLE data packet. This textual content may be carried in a service data field or a vendor specific data field in the PDU. For example, the payload of the service data field may include multiple bits, wherein the multiple bits include scalable bits. For a certain character (for example, text, character or symbol) in the edited text content, device A may agree with device B on the content of some extensible bits. Exemplarily, when a certain extensible bit is 000, the device A can know that the character is not oblique, there is no underscore, and there is no carriage return operation after the character. Exemplarily, when a certain extensible bit is 100, device A can learn that the character is oblique, has no underline, and there is no carriage return operation after the character. Exemplarily, when a certain extensible bit is 010, device A can know that the character is not oblique, has an underline, and there is no carriage return operation after the character. Exemplarily, when a certain extensible bit is 001, device A can know that the character is not slanted, has no underscore, and there is a carriage return operation after the character.
对于编辑后的文本内容中的某一个字符(例如,文字、字符或者符号),设备A可以和设备B约定某些可扩展的比特位的内容。示例性的,当某个可扩展的比特位为000时,设备A就可以获知该字符的字体颜色为黑色。示例性的,当某个可扩展的比特位为001时,设备A就可以获知该字符的字体颜色为灰色。示例性的,当某个可扩展的比特位为010时,设备A就可以获知该字符的颜色为蓝色。示例性的,当某个可扩展的比特位为100时,设备A就可以获知该字符的字体颜色为蓝色。For a certain character (for example, text, character or symbol) in the edited text content, device A may agree with device B on the content of some extensible bits. Exemplarily, when a certain expandable bit is 000, device A can learn that the font color of the character is black. Exemplarily, when a certain extensible bit is 001, device A can learn that the font color of the character is gray. Exemplarily, when a certain expandable bit is 010, device A can know that the color of the character is blue. Exemplarily, when a certain expandable bit is 100, device A can learn that the font color of the character is blue.
应理解,设备B向设备A指示编辑后的文本内容中的某个字符的字体背景色的过程也可以参考上述实施例中的描述。It should be understood that the process of device B indicating the font background color of a certain character in the edited text content to device A may also refer to the description in the above embodiment.
对于编辑后的文本内容中的某一个字符(例如,文字、字符或者符号),设备A可以和设备B约定某些可扩展的比特位的内容。示例性的,当某个可扩展的比特位为000时,设备A就可以获知该字符的字体大小为10号。示例性的,当某个可扩展的比特位为001时,设备A就可以获知该字符的字体大小为12号。示例性的,当某个可扩展的比特位为010时,设备A就可以获知该字符的字体大小为14号。示例性的,当某个可扩展的比特位为100时,设备A就可以获知该字符的字体大小为18号。For a certain character (for example, text, character or symbol) in the edited text content, device A may agree with device B on the content of some extensible bits. Exemplarily, when a certain expandable bit is 000, device A can learn that the font size of the character is 10. Exemplarily, when a certain extensible bit is 001, device A can learn that the font size of the character is 12. Exemplarily, when a certain extensible bit is 010, device A can learn that the font size of the character is 14. Exemplarily, when a certain expandable bit is 100, device A can learn that the font size of the character is 18.
应理解,本申请实施例中对文本内容的格式并不作具体限定,设备B在接收到该编辑后的文本内容的格式信息后,可以根据编辑后的文本内容以及编辑后的文本内容的格式信息,显示编辑后的文本内容。设备B上显示的文本内容与设备A上显示的文本内容是相对应的。It should be understood that the format of the text content is not specifically limited in the embodiment of the present application. After receiving the format information of the edited text content, the device B can to display the edited text content. The text content displayed on device B corresponds to the text content displayed on device A.
一个实施例中,该方法1200还包括:设备A使用从设备B接收到的编辑后的文本内容替换原来的文本内容;响应于从设备B接收到编辑后的文本内容,设备A显示第二界面,该第二界面上包括编辑后的文本内容。In one embodiment, the method 1200 further includes: device A replaces the original text content with the edited text content received from device B; in response to receiving the edited text content from device B, device A displays a second interface , the second interface includes the edited text content.
示例性的,如图3中的(f)和(g)所示,笔记本电脑响应于用户点击保存控件307的操作,向手机发送编辑后的文本内容。手机在接收到编辑后的文本内容后,可以使用编辑后的文本内容替换原来界面上显示的文本内容。Exemplarily, as shown in (f) and (g) of FIG. 3 , the notebook computer sends the edited text content to the mobile phone in response to the operation of the user clicking the save control 307 . After receiving the edited text content, the mobile phone can replace the text content displayed on the original interface with the edited text content.
一个实施例中,设备B向设备A发送编辑后的文本内容时,还可以向设备A发送该编辑后的文本内容的标识信息。In one embodiment, when device B sends the edited text content to device A, it may also send the identification information of the edited text content to device A.
应理解,S1204为可选地步骤,设备B也可以不向设备A发送编辑后的文本内容。而是将编辑后的文本内容保存在设备B本地。It should be understood that S1204 is an optional step, and device B may also not send the edited text content to device A. Instead, save the edited text content locally on device B.
图13示出了本申请实施例提供的另一系统架构。该系统架构中包括设备A和设备B,其中设备A可以是具有拾音功能的设备(例如,手机),设备B可以是具备高效编辑功能的设备(例如,笔记本电脑)。设备A包括拾音模块1310和收发控制模块1320;设备B包括接收收发控制模块1330、ASR模块1340、显示模块1350和编辑控制模块1360。与图11所示的系统架构不同的是,ASR模块可以包括在设备B中。设备A可以负责获取语音内容或者音频文件,设备A通过收发控制模块1320向设备B的收发控制模块1330发送获取的语音内容或者音频文件,从而由设备B的ASR模块1340将该语音内容或者音频文件转化为文本内容,从而设备B可以向用户显示转化后的文本内容。设备B还可以通过编辑控制模块1360在检测到用户对文本内容的编辑操作后,通过显示模块1340显示编辑后的文本内容。FIG. 13 shows another system architecture provided by an embodiment of the present application. The system architecture includes device A and device B, where device A may be a device with a pickup function (eg, a mobile phone), and device B may be a device with an efficient editing function (eg, a laptop). Device A includes a pickup module 1310 and a transceiver control module 1320 ; device B includes a receive transceiver control module 1330 , an ASR module 1340 , a display module 1350 and an editing control module 1360 . Different from the system architecture shown in FIG. 11 , the ASR module can be included in device B. Device A can be responsible for acquiring voice content or audio files, and device A sends the acquired voice content or audio files to the transceiver control module 1330 of device B through the transceiver control module 1320, so that the ASR module 1340 of device B sends the voice content or audio files to the device B. Converted to text content, so that device B can display the converted text content to the user. Device B can also display the edited text content through the display module 1340 after detecting an editing operation of the text content by the user through the editing control module 1360 .
图14示出了本申请实施例提供的装置1400的示意性框图。该装置1400可以设置于上述图12中的设备A中,该装置1400中包括:获取单元1410,用于获取音频内容;发送单元1420,用于向第二电子设备发送第一信息,该第一信息为该音频内容,或者,该第一信息为该音频内容对应的第一文本内容,以使得该第二电子设备根据该第一信息显示该第一文本内容并检测用户对该第一文本内容的编辑操作。FIG. 14 shows a schematic block diagram of an apparatus 1400 provided by an embodiment of the present application. The apparatus 1400 can be set in the above-mentioned device A in FIG. 12 , and the apparatus 1400 includes: an acquiring unit 1410 for acquiring audio content; a sending unit 1420 for transmitting first information to the second electronic device, the first The information is the audio content, or the first information is the first text content corresponding to the audio content, so that the second electronic device displays the first text content according to the first information and detects the user's first text content. edit operation.
图15示出了本申请实施例提供的装置1200的示意性框图。该装置1500可以设置于上述图12中的设备B中,该装置1500中包括:接收单元1510,用于接收第一电子设备发送的第一信息,该第一信息为该第一电子设备获取的音频内容,或者,该第一信息为该音频内容对应的第一文本内容;显示单元1520,用于根据该第一信息,显示该第一文本内容;检测单元1530,用于检测用户对该第一文本内容的编辑操作;显示单元1520,还用于显示第二文本容,该第二文本内容为对该第一文本内容编辑后的文本内容。FIG. 15 shows a schematic block diagram of an apparatus 1200 provided by an embodiment of the present application. The apparatus 1500 can be set in the above-mentioned device B in FIG. 12 , and the apparatus 1500 includes: a receiving unit 1510 for receiving first information sent by a first electronic device, where the first information is acquired by the first electronic device audio content, or the first information is the first text content corresponding to the audio content; the display unit 1520 is used to display the first text content according to the first information; the detection unit 1530 is used to detect the user's An editing operation of text content; the display unit 1520 is further configured to display the second text content, where the second text content is the edited text content of the first text content.
图16示出了本申请实施例提供的电子设备1600的示意性结构图。如图16所示,该电子设备包括:一个或多个处理器1610,一个或多个存储器1620,该一个或多个存储器1620存储有一个或多个计算机程序,该一个或多个计算机程序包括指令。当该指令被所述一个或多个处理器1610运行时,使得第一电子设备或者第二电子设备执行上述实施例中的技术方案。FIG. 16 shows a schematic structural diagram of an electronic device 1600 provided by an embodiment of the present application. As shown in FIG. 16, the electronic device includes: one or more processors 1610, one or more memories 1620, and the one or more memories 1620 stores one or more computer programs, and the one or more computer programs include instruction. When the instruction is executed by the one or more processors 1610, the first electronic device or the second electronic device is caused to execute the technical solutions in the foregoing embodiments.
本申请实施例提供了一种系统,包括设备A和设备B,该系统用于执行上述实施例中的技术方案。其实现原理和技术效果与上述方法相关实施例类似,此处不再赘述。An embodiment of the present application provides a system including a device A and a device B, and the system is used to implement the technical solutions in the foregoing embodiments. The implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
本申请实施例提供一种计算机程序产品,当所述计算机程序产品在设备A(或者,上述实施例中的手机)运行时,使得设备A执行上述实施例中的技术方案。其实现原理和技术效果与上述方法相关实施例类似,此处不再赘述。The embodiments of the present application provide a computer program product, which enables the device A to execute the technical solutions in the foregoing embodiments when the computer program product runs on the device A (or the mobile phone in the foregoing embodiment). The implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
本申请实施例提供一种计算机程序产品,当所述计算机程序产品在设备B(或者,上述实施例中的笔记本电脑)运行时,使得设备B执行上述实施例中的技术方案。其实现原理和技术效果与上述方法相关实施例类似,此处不再赘述。An embodiment of the present application provides a computer program product, which enables the device B to execute the technical solutions in the foregoing embodiments when the computer program product runs on the device B (or the notebook computer in the foregoing embodiment). The implementation principle and technical effect thereof are similar to the related embodiments of the above method, and are not repeated here.
本申请实施例提供一种可读存储介质,所述可读存储介质包含指令,当所述指令在设备A(或者,上述实施例中的手机)运行时,使得所述设备A执行上述实施例的技术方案。其实现原理和技术效果类似,此处不再赘述。An embodiment of the present application provides a readable storage medium, where the readable storage medium contains instructions, when the instructions are executed on device A (or, the mobile phone in the above embodiment), the device A is caused to execute the above embodiment technical solution. The implementation principle and technical effect thereof are similar, and are not repeated here.
本申请实施例提供一种可读存储介质,所述可读存储介质包含指令,当所述指令在设备B(或者,上述实施例中的笔记本电脑)运行时,使得所述设备B执行上述实施例的技 术方案。其实现原理和技术效果类似,此处不再赘述。An embodiment of the present application provides a readable storage medium, where the readable storage medium contains instructions, when the instructions are executed on a device B (or a notebook computer in the above embodiment), the device B is caused to perform the above implementation. example technical solution. The implementation principle and technical effect thereof are similar, and are not repeated here.
本申请实施例提供一种芯片,所述芯片用于执行指令,当所述芯片运行时,执行上述实施例中的技术方案。其实现原理和技术效果类似,此处不再赘述。An embodiment of the present application provides a chip, which is used for executing an instruction, and when the chip is running, executes the technical solutions in the foregoing embodiments. The implementation principle and technical effect thereof are similar, and are not repeated here.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、或者计算机软件和电子硬件的结合来实现。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art can realize that the units and algorithm steps of each example described in conjunction with the embodiments disclosed herein can be implemented in electronic hardware, or a combination of computer software and electronic hardware. Whether these functions are performed in hardware or software depends on the specific application and design constraints of the technical solution. Skilled artisans may implement the described functionality using different methods for each particular application, but such implementations should not be considered beyond the scope of this application.
所属领域的技术人员可以清楚地了解到,为描述的方便和简洁,上述描述的系统、装置和单元的具体工作过程,可以参考前述方法实施例中的对应过程,在此不再赘述。Those skilled in the art can clearly understand that, for the convenience and brevity of description, the specific working process of the above-described systems, devices and units may refer to the corresponding processes in the foregoing method embodiments, which will not be repeated here.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,所述单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus and method may be implemented in other manners. For example, the apparatus embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or Can be integrated into another system, or some features can be ignored, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of devices or units, and may be in electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。In addition, each functional unit in each embodiment of the present application may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
所述功能如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本申请各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(read-only memory,ROM)、随机存取存储器(random access memory,RAM)、磁碟或者光盘等各种可以存储程序代码的介质。The functions, if implemented in the form of software functional units and sold or used as independent products, may be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence, or the part that contributes to the prior art or the part of the technical solution. The computer software product is stored in a storage medium, including Several instructions are used to cause a computer device (which may be a personal computer, a server, or a network device, etc.) to execute all or part of the steps of the methods described in the various embodiments of the present application. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM), random access memory (RAM), magnetic disk or optical disk and other media that can store program codes .
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present application, but the protection scope of the present application is not limited to this. should be covered within the scope of protection of this application. Therefore, the protection scope of the present application should be subject to the protection scope of the claims.
Claims (29)
- 一种系统,其特征在于,所述系统包括第一电子设备和第二电子设备,其中,A system, characterized in that the system includes a first electronic device and a second electronic device, wherein,所述第一电子设备,用于获取音频内容;the first electronic device for acquiring audio content;所述第一电子设备,还用于向所述第二电子设备发送第一信息,所述第一信息为所述音频内容,或者,所述第一信息为所述音频内容对应的第一文本内容;The first electronic device is further configured to send first information to the second electronic device, where the first information is the audio content, or the first information is the first text corresponding to the audio content content;所述第二电子设备,用于根据所述第一信息,显示所述第一文本内容;the second electronic device, configured to display the first text content according to the first information;所述第二电子设备,还用于响应于用户对所述第一文本内容的编辑操作,显示第二文本容,所述第二文本内容为对所述第一文本内容编辑后的文本内容。The second electronic device is further configured to display second text content in response to an editing operation of the first text content by the user, where the second text content is the text content after editing the first text content.
- 根据权利要求1所述的系统,其特征在于,所述第二电子设备,还用于向所述第一电子设备发送所述第二文本内容。The system according to claim 1, wherein the second electronic device is further configured to send the second text content to the first electronic device.
- 根据权利要求2所述的系统,其特征在于,所述编辑操作包括对所述第一文本内容的格式修改操作,其中,The system according to claim 2, wherein the editing operation comprises a format modification operation of the first text content, wherein,所述第二电子设备,还用于向所述第一电子设备发送所述第二文本内容的格式信息。The second electronic device is further configured to send the format information of the second text content to the first electronic device.
- 根据权利要求3所述的系统,其特征在于,所述第二文本内容的格式信息包括所述第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及所述第二文本内容中的回车操作种的一种或者多种。The system according to claim 3, wherein the format information of the second text content includes font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
- 根据权利要求2至4中任一项所述的系统,其特征在于,所述第一电子设备,还用于:The system according to any one of claims 2 to 4, wherein the first electronic device is further configured to:在接收所述第二电子设备发送的所述第二文本内容之前,根据所述音频内容,显示所述第一文本内容;Before receiving the second text content sent by the second electronic device, display the first text content according to the audio content;在接收所述第二电子设备发送的所述第二文本内容之后,使用所述第二文本内容替换所述第一文本内容。After receiving the second text content sent by the second electronic device, the first text content is replaced with the second text content.
- 根据权利要求1至5中任一项所述的系统,其特征在于,所述第一电子设备,还用于:The system according to any one of claims 1 to 5, wherein the first electronic device is further configured to:发送查询请求,所述查询请求用于接收到所述查询请求的设备确定是否具有文本编辑功能;sending a query request, where the query request is used for the device receiving the query request to determine whether it has a text editing function;响应于接收到所述第二电子设备发送的响应,向所述第二电子设备发送所述第一信息,所述响应用于指示所述第二电子设备具备文本编辑功能。In response to receiving a response sent by the second electronic device, the first information is sent to the second electronic device, where the response is used to indicate that the second electronic device has a text editing function.
- 根据权利要求1至6中任一项所述的系统,其特征在于,所述第二电子设备具体用于:The system according to any one of claims 1 to 6, wherein the second electronic device is specifically used for:响应于接收到所述第一信息,提示用户是否在所述第二电子设备上进行文本编辑;In response to receiving the first information, prompting the user whether to perform text editing on the second electronic device;响应于用户确定在所述第二电子设备上进行文本编辑的操作,显示所述第一文本内容。The first text content is displayed in response to a user determining an operation to perform text editing on the second electronic device.
- 根据权利要求1至7中任一项所述的系统,其特征在于,所述第一信息为所述第一文本内容,所述第一电子设备具体用于:The system according to any one of claims 1 to 7, wherein the first information is the first text content, and the first electronic device is specifically used for:响应于获取到所述音频内容,将所述音频内容转化为所述第一文本内容;In response to acquiring the audio content, converting the audio content into the first text content;向所述第二电子设备发送所述第一文本内容。The first text content is sent to the second electronic device.
- 根据权利要求1至7中任一项所述的系统,其特征在于,所述第一信息为所述音 频内容,所述第二电子设备具体用于:The system according to any one of claims 1 to 7, wherein the first information is the audio content, and the second electronic device is specifically used for:响应于接收到所述音频内容,将所述音频内容转化为所述第一文本内容;In response to receiving the audio content, converting the audio content to the first textual content;显示所述第一文本内容。The first text content is displayed.
- 根据权利要求1至9中任一项所述的系统,其特征在于,所述第一电子设备的账号和所述第二电子设备的账号相关联。The system according to any one of claims 1 to 9, wherein the account of the first electronic device is associated with the account of the second electronic device.
- 一种文本编辑的方法,其特征在于,所述方法应用于第一电子设备,所述方法包括:A method for editing text, wherein the method is applied to a first electronic device, and the method includes:所述第一电子设备获取音频内容;the first electronic device acquires audio content;所述第一电子设备向第二电子设备发送第一信息,所述第一信息为所述音频内容,或者,所述第一信息为所述音频内容对应的第一文本内容,以使得所述第二电子设备根据所述第一信息显示所述第一文本内容并检测用户对所述第一文本内容的编辑操作。The first electronic device sends first information to the second electronic device, where the first information is the audio content, or the first information is the first text content corresponding to the audio content, so that the The second electronic device displays the first text content according to the first information and detects an editing operation of the first text content by the user.
- 根据权利要求11所述的方法,其特征在于,所述方法还包括:The method according to claim 11, wherein the method further comprises:所述第一电子设备接收所述第二电子设备发送的第二文本内容,所述第二文本内容为用户在所述第二电子设备上对所述第一文本内容进行编辑后的文本内容。The first electronic device receives second text content sent by the second electronic device, where the second text content is text content after the user has edited the first text content on the second electronic device.
- 根据权利要求12所述的方法,其特征在于,所述方法还包括:The method of claim 12, wherein the method further comprises:所述第一电子设备接收所述第二电子设备发送的所述第二文本内容的格式信息。The first electronic device receives the format information of the second text content sent by the second electronic device.
- 根据权利要求13所述的系统,其特征在于,所述第二文本内容的格式信息包括所述第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及所述第二文本内容中的回车操作种的一种或者多种。The system according to claim 13, wherein the format information of the second text content includes font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
- 根据权利要求12至14中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 12 to 14, wherein the method further comprises:在接收所述第二电子设备发送的所述第二文本内容之前,所述第一电子设备根据所述音频内容,显示所述第一文本内容;Before receiving the second text content sent by the second electronic device, the first electronic device displays the first text content according to the audio content;在接收所述第二电子设备发送的所述第二文本内容之后,所述第一电子设备使用所述第二文本内容替换所述第一文本内容。After receiving the second text content sent by the second electronic device, the first electronic device replaces the first text content with the second text content.
- 根据权利要求11至15中任一项所述的方法,其特征在于,所述方法包括:The method according to any one of claims 11 to 15, wherein the method comprises:在向所述第二电子设备发送所述第一信息之前,所述第一电子设备发送查询请求,所述查询请求用于接收到所述查询请求的设备确定是否具有文本编辑功能;Before sending the first information to the second electronic device, the first electronic device sends a query request, where the query request is used for the device receiving the query request to determine whether it has a text editing function;响应于接收到所述第二电子设备发送的响应,所述第一电子设备向所述第二电子设备发送所述第一信息,所述响应用于指示所述第二电子设备具备文本编辑功能。In response to receiving a response sent by the second electronic device, the first electronic device sends the first information to the second electronic device, where the response is used to indicate that the second electronic device has a text editing function .
- 根据权利要求11至16中任一项所述的方法,其特征在于,所述第一信息为所述第一文本内容,所述方法还包括:The method according to any one of claims 11 to 16, wherein the first information is the first text content, and the method further comprises:响应于获取到所述音频内容,所述第一电子设备将所述音频内容转化为所述第一文本内容;In response to acquiring the audio content, the first electronic device converts the audio content into the first text content;所述第一电子设备向所述第二电子设备发送所述第一文本内容。The first electronic device sends the first text content to the second electronic device.
- 根据权利要求11至17中任一项所述的方法,其特征在于,所述第一电子设备的账号和所述第二电子设备的账号相关联。The method according to any one of claims 11 to 17, wherein the account of the first electronic device is associated with the account of the second electronic device.
- 一种文本编辑的方法,其特征在于,所述方法应用于第二电子设备,所述方法包括:A method for editing text, wherein the method is applied to a second electronic device, and the method includes:所述第二电子设备接收第一电子设备发送的第一信息,所述第一信息为所述第一电子 设备获取的音频内容,或者,所述第一信息为所述音频内容对应的第一文本内容;The second electronic device receives first information sent by the first electronic device, where the first information is the audio content acquired by the first electronic device, or the first information is the first information corresponding to the audio content text content;所述第二电子设备根据所述第一信息,显示所述第一文本内容;The second electronic device displays the first text content according to the first information;响应于用户对所述第一文本内容的编辑操作,所述第二电子设备显示第二文本容,所述第二文本内容为对所述第一文本内容编辑后的文本内容。In response to a user's editing operation on the first text content, the second electronic device displays second text content, where the second text content is the text content after editing the first text content.
- 根据权利要求19所述的方法,其特征在于,所述方法还包括:The method of claim 19, wherein the method further comprises:所述第二电子设备向所述第一电子设备发送所述第二文本内容。The second electronic device sends the second text content to the first electronic device.
- 根据权利要求20所述的方法,其特征在于,所述编辑操作包括对所述第一文本内容的格式修改操作,所述方法还包括:The method according to claim 20, wherein the editing operation comprises a format modification operation of the first text content, and the method further comprises:所述第二电子设备向所述第一电子设备发送所述第二文本内容的格式信息。The second electronic device sends the format information of the second text content to the first electronic device.
- 根据权利要求21所述的方法,其特征在于,所述第二文本内容的格式信息包括所述第二文本内容的字体颜色、字体大小、字体背景色、字体倾斜、字体下划线以及所述第二文本内容中的回车操作种的一种或者多种。The method according to claim 21, wherein the format information of the second text content comprises font color, font size, font background color, font inclination, font underline and the second text content of the second text content. One or more types of carriage return operations in the text content.
- 根据权利要求19至22中任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 19 to 22, wherein the method further comprises:在接收所述第一电子设备发送的所述第一信息之前,所述第二电子设备接收所述第一电子设备发送的查询请求,所述查询请求所述第二电子设备确定是否具有文本编辑功能;Before receiving the first information sent by the first electronic device, the second electronic device receives a query request sent by the first electronic device, the query requesting the second electronic device to determine whether there is a text editor Function;所述第二电子设备向所述第一电子设备发送响应,所述响应用于指示所述第二电子设备具备文本编辑功能。The second electronic device sends a response to the first electronic device, where the response is used to indicate that the second electronic device has a text editing function.
- 根据权利要求19至23中任一项所述的方法,其特征在于,所述第二电子设备根据所述第一信息,显示所述第一文本内容,包括:The method according to any one of claims 19 to 23, wherein the second electronic device displays the first text content according to the first information, comprising:响应于接收到所述第一信息,所述第二电子设备提示用户是否在所述第二电子设备上进行文本编辑;In response to receiving the first information, the second electronic device prompts the user whether to perform text editing on the second electronic device;响应于用户确定在所述第二电子设备上进行文本编辑的操作,所述第二电子设备显示所述第一文本内容。The second electronic device displays the first text content in response to the user determining an operation to perform text editing on the second electronic device.
- 根据权利要求19至24中任一项所述的方法,其特征在于,所述第一信息为所述音频内容,所述第二电子设备显示所述第一文本内容之前,所述方法还包括:The method according to any one of claims 19 to 24, wherein the first information is the audio content, and before the second electronic device displays the first text content, the method further comprises: :响应于接收到所述音频内容,将所述音频内容转化为所述第一文本内容。In response to receiving the audio content, converting the audio content to the first textual content.
- 根据权利要求19至25中任一项所述的方法,其特征在于,所述第一电子设备的账号和所述第二电子设备的账号相关联。The method according to any one of claims 19 to 25, wherein the account of the first electronic device is associated with the account of the second electronic device.
- 一种电子设备,其特征在于,包括:An electronic device, comprising:一个或多个处理器;one or more processors;一个或多个存储器;one or more memories;所述一个或多个存储器存储有一个或者多个计算机程序,所述一个或者多个计算机程序包括指令,当所述指令被所述一个或多个处理器执行时,使得所述电子设备执行如权利要求11至18中任一项所述的方法。The one or more memories store one or more computer programs comprising instructions that, when executed by the one or more processors, cause the electronic device to perform as The method of any one of claims 11 to 18.
- 一种电子设备,其特征在于,包括:An electronic device, comprising:一个或多个处理器;one or more processors;一个或多个存储器;one or more memories;所述一个或多个存储器存储有一个或者多个计算机程序,所述一个或者多个计算机程序包括指令,当所述指令被所述一个或多个处理器执行时,使得所述电子设备执行如权利 要求19至26中任一项所述的方法。The one or more memories store one or more computer programs comprising instructions that, when executed by the one or more processors, cause the electronic device to perform as The method of any one of claims 19 to 26.
- 一种计算机可读存储介质,其特征在于,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如权利要求11至18中任一项所述的方法;或者,A computer-readable storage medium, characterized in that it includes computer instructions, which, when the computer instructions are executed on an electronic device, cause the electronic device to perform the method according to any one of claims 11 to 18; or ,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如权利要求19至26中任一项所述的方法。The computer instructions, when executed on an electronic device, cause the electronic device to perform the method of any of claims 19-26.
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011529621.0 | 2020-12-22 | ||
CN202011529621.0A CN114664306A (en) | 2020-12-22 | 2020-12-22 | Method, electronic equipment and system for editing text |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022135254A1 true WO2022135254A1 (en) | 2022-06-30 |
Family
ID=82025546
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2021/138655 WO2022135254A1 (en) | 2020-12-22 | 2021-12-16 | Text editing method, electronic device and system |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN114664306A (en) |
WO (1) | WO2022135254A1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN116737049B (en) * | 2022-11-22 | 2024-04-19 | 荣耀终端有限公司 | Audio playing method and terminal equipment |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030055655A1 (en) * | 1999-07-17 | 2003-03-20 | Suominen Edwin A. | Text processing system |
CN103959282A (en) * | 2011-09-28 | 2014-07-30 | 谷歌公司 | Selective feedback for text recognition systems |
CN105159870A (en) * | 2015-06-26 | 2015-12-16 | 徐信 | Processing system for precisely completing continuous natural speech textualization and method for precisely completing continuous natural speech textualization |
US20160125881A1 (en) * | 2014-06-30 | 2016-05-05 | Nuance Communications, Inc. | Mobile Device for Speech Input and Text Delivery |
CN109599115A (en) * | 2018-12-24 | 2019-04-09 | 苏州思必驰信息科技有限公司 | Minutes method and apparatus for audio collecting device and user terminal |
CN109782997A (en) * | 2019-01-25 | 2019-05-21 | 腾讯科技(深圳)有限公司 | A kind of data processing method, device and storage medium |
-
2020
- 2020-12-22 CN CN202011529621.0A patent/CN114664306A/en active Pending
-
2021
- 2021-12-16 WO PCT/CN2021/138655 patent/WO2022135254A1/en active Application Filing
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20030055655A1 (en) * | 1999-07-17 | 2003-03-20 | Suominen Edwin A. | Text processing system |
CN103959282A (en) * | 2011-09-28 | 2014-07-30 | 谷歌公司 | Selective feedback for text recognition systems |
US20160125881A1 (en) * | 2014-06-30 | 2016-05-05 | Nuance Communications, Inc. | Mobile Device for Speech Input and Text Delivery |
CN105159870A (en) * | 2015-06-26 | 2015-12-16 | 徐信 | Processing system for precisely completing continuous natural speech textualization and method for precisely completing continuous natural speech textualization |
CN109599115A (en) * | 2018-12-24 | 2019-04-09 | 苏州思必驰信息科技有限公司 | Minutes method and apparatus for audio collecting device and user terminal |
CN109782997A (en) * | 2019-01-25 | 2019-05-21 | 腾讯科技(深圳)有限公司 | A kind of data processing method, device and storage medium |
Also Published As
Publication number | Publication date |
---|---|
CN114664306A (en) | 2022-06-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020253719A1 (en) | Screen recording method and electronic device | |
WO2021000807A1 (en) | Processing method and apparatus for waiting scenario in application | |
WO2020078337A1 (en) | Translation method and electronic device | |
CN109286725B (en) | Translation method and terminal | |
KR102558615B1 (en) | Method and electronic device for presenting a video on an electronic device when there is an incoming call | |
WO2022037407A1 (en) | Method and system for replying to message, and electronic device | |
CN112119641B (en) | Method and device for realizing automatic translation through multiple TWS (time and frequency) earphones connected in forwarding mode | |
EP3826280B1 (en) | Method for generating speech control command, and terminal | |
WO2020224447A1 (en) | Method and system for adding smart home device to contacts | |
WO2022042770A1 (en) | Method for controlling communication service state, terminal device and readable storage medium | |
EP4293997A1 (en) | Display method, electronic device, and system | |
WO2022135157A1 (en) | Page display method and apparatus, and electronic device and readable storage medium | |
CN114115770A (en) | Display control method and related device | |
CN114827581A (en) | Synchronization delay measuring method, content synchronization method, terminal device, and storage medium | |
WO2022206763A1 (en) | Display method, electronic device, and system | |
WO2022161006A1 (en) | Photograph synthesis method and apparatus, and electronic device and readable storage medium | |
WO2022143258A1 (en) | Voice interaction processing method and related apparatus | |
WO2022135254A1 (en) | Text editing method, electronic device and system | |
WO2022032979A1 (en) | Method for invoking capabilities of other devices, electronic device, and system | |
WO2021052388A1 (en) | Video communication method and video communication apparatus | |
WO2022135273A1 (en) | Method for invoking capabilities of other devices, electronic device, and system | |
WO2022007757A1 (en) | Cross-device voiceprint registration method, electronic device and storage medium | |
WO2022062902A1 (en) | File transfer method and electronic device | |
WO2022052767A1 (en) | Method for controlling device, electronic device, and system | |
WO2022206769A1 (en) | Method for combining content, electronic device, and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 21909250 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 21909250 Country of ref document: EP Kind code of ref document: A1 |