CN112820296B - Data transmission method and electronic equipment - Google Patents

Data transmission method and electronic equipment Download PDF

Info

Publication number
CN112820296B
CN112820296B CN202110011895.9A CN202110011895A CN112820296B CN 112820296 B CN112820296 B CN 112820296B CN 202110011895 A CN202110011895 A CN 202110011895A CN 112820296 B CN112820296 B CN 112820296B
Authority
CN
China
Prior art keywords
audio data
segment
sending
awakening
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110011895.9A
Other languages
Chinese (zh)
Other versions
CN112820296A (en
Inventor
陈孝良
李晓
冯大航
常乐
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing SoundAI Technology Co Ltd
Original Assignee
Beijing SoundAI Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing SoundAI Technology Co Ltd filed Critical Beijing SoundAI Technology Co Ltd
Priority to CN202110011895.9A priority Critical patent/CN112820296B/en
Publication of CN112820296A publication Critical patent/CN112820296A/en
Application granted granted Critical
Publication of CN112820296B publication Critical patent/CN112820296B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/30Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/104Peer-to-peer [P2P] networks
    • H04L67/1074Peer-to-peer [P2P] networks for supporting data block transmission mechanisms
    • H04L67/1078Resource delivery mechanisms
    • H04L67/108Resource delivery mechanisms characterised by resources being split in blocks or fragments
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D30/00Reducing energy consumption in communication networks
    • Y02D30/70Reducing energy consumption in communication networks in wireless communication networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention relates to the technical field of communication, and provides a data transmission method and electronic equipment, which are used for solving the problem that data transmission delay is easily larger in a data uploading process. The method comprises the following steps: sending a first audio data segment in the audio data to be identified to a server; and sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on the sending consumed time, wherein the sending consumed time is the sending duration of the first audio data segment, and the awakening audio data is used for awakening the electronic equipment. In the data transmission method of the embodiment of the application, the first audio data segment in the audio data to be identified is sent to the server, and after the first audio data segment is sent, the second audio data segment in the audio data to be identified or the first awakening segment in the awakening audio data is sent to the server according to the sending time, that is, the sending of the data segment is realized, so that the sending quantity of the instantaneous data can be reduced, and the data transmission delay is reduced.

Description

Data transmission method and electronic equipment
Technical Field
The present invention relates to the field of communications technologies, and in particular, to a data transmission method and an electronic device.
Background
Along with the continuous development of intelligent technology, various intelligent products come into operation, and the functions of the intelligent products are more and more powerful, so that great convenience is brought to the life and work of users. For example, a user may perform voice interaction through the intelligent electronic device, first wake up the electronic device through a wake-up voice input by the user, then the electronic device may receive a control voice input by the user (i.e., a voice to be recognized), and the electronic device may upload collected voice data input by the user (including the wake-up voice and the control voice) to the server.
However, in the uploading process, the collected voice data is firstly stored, and the collected voice data is uploaded at the same time in the preset time, so that the amount of instantly sent data is large, and the data transmission delay is easily large.
Disclosure of Invention
The embodiment of the invention provides a data transmission method and electronic equipment, and aims to solve the problem that data transmission delay is easy to be large in the existing data uploading process.
In order to solve the technical problem, the invention is realized as follows:
in a first aspect, an embodiment of the present invention provides a data transmission method, which is applied to an electronic device, and the method includes:
sending a first audio data segment in the audio data to be identified to a server;
and sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on sending time consumption, wherein the sending time consumption is the sending duration of the first audio data segment, and the awakening audio data is used for awakening the electronic equipment.
In a second aspect, an embodiment of the present invention further provides an electronic device, including:
the first sending module is used for sending a first audio data segment in the audio data to be identified to the server;
and the second sending module is used for sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on sending consumed time, wherein the sending consumed time is the sending duration of the first audio data segment, and the awakening audio data is used for awakening the electronic equipment.
In a third aspect, an embodiment of the present invention further provides an electronic device, including: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps in the data transmission method as described above when executing the computer program.
In a fourth aspect, the embodiment of the present invention further provides a readable storage medium, on which a computer program is stored, where the computer program, when executed by a processor, implements the steps in the data transmission method as described above.
In the data transmission method of the embodiment of the application, the first audio data segment in the audio data to be identified is sent to the server, and after the data transmission is finished, the second audio data segment in the audio data to be identified or the first awakening segment in the awakening audio data is sent to the server according to the transmission time, that is, the data is sent in the segment segments, so that the size of the sending amount of the instant data can be reduced, and the data transmission delay can be reduced.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments of the present invention will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to these drawings without inventive exercise.
Fig. 1 is a flowchart of a data transmission method according to an embodiment of the present invention;
fig. 2 is one of the schematic diagrams of data transmission in the data transmission method provided by the embodiment of the present invention;
fig. 3 is one of the schematic diagrams of data transmission in the data transmission method provided by the embodiment of the present invention;
fig. 4 is one of the schematic diagrams of data transmission in the data transmission method provided by the embodiment of the present invention;
fig. 5 is a second flowchart of a data transmission method according to an embodiment of the present invention;
fig. 6 is a schematic block diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, fig. 1 is a flowchart of a data transmission method provided by an embodiment of the present invention, where the method is applicable to an electronic device, and as shown in fig. 1, the method includes the following steps:
step 101: and sending a first audio data segment in the audio data to be identified to the server.
The audio data to be recognized is audio data to be voice-recognized, and may be control audio data, and the voice recognition technology is a technology in which audio (e.g., voice) is a research object, and a machine automatically recognizes and understands a human-dictated voice through voice signal processing and pattern recognition, and the voice recognition technology is a technology in which a machine converts a voice into a corresponding text or command through a recognition and understanding process. For example, a user inputs control audio data of "play music a", the control audio data is sent to a server, the server responds to the control audio data and sends the source data of the music a to the electronic device, and the electronic device can play music after receiving the source data of the music a.
The audio data to be identified may comprise a plurality of audio data segments, and a first audio data segment thereof may be uploaded to the server first. It should be noted that the first audio data segment is a segment of the audio data segments that have not been sent in the audio data to be identified. As an example, the first audio data segment is a segment with the earliest input time to the electronic device among audio data segments that have not been transmitted in the audio data to be recognized.
Step 102: and sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on the sending consumed time, wherein the sending consumed time is the sending duration of the first audio data segment, and the awakening audio data is used for awakening the electronic equipment.
It should be noted that the wake-up audio data is input into the electronic device before the audio data to be recognized, and may include a plurality of wake-up data segments for waking up the electronic device, and after waking up the electronic device, the audio data to be recognized input into the electronic device by a user may be received. For example, the wake-up voice data with the preset wake-up word (e.g., the wake-up voice data including the keyword "XX") is input to wake up the electronic device, and after the wake-up, the electronic device may output a prompt message to prompt the user that the electronic device is woken up, e.g., output a "while" prompt message. After the electronic equipment is awakened, the user inputs voice data to be recognized in the electronic equipment, and the electronic equipment receives the voice data to be recognized input by the user.
After the first audio data segment is sent, a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data can be sent to the server according to the sending time consumption. Namely, the data can be sent in segments, and the size of the instantaneous data sending quantity can be reduced. It should be noted that the second audio data segment is an audio data segment that is not sent in the audio data to be identified, and the first wakeup segment is a segment in a data segment that is not sent in the wakeup audio data. As an example, the first wake-up segment is a segment of the unsent data segments of the wake-up audio data in which an input time of the electronic device is earliest.
In the data transmission method of the embodiment of the application, the first audio data segment in the audio data to be identified is sent to the server, and after the data transmission is finished, the second audio data segment in the audio data to be identified or the first awakening segment in the awakening audio data is sent to the server according to the transmission time, that is, the data is sent in the segment segments, so that the size of the sending amount of the instant data can be reduced, and the data transmission delay can be reduced.
In one embodiment, based on the time consumed for transmission, transmitting a second audio data segment in the audio data to be identified or a first wake segment in the wake-up audio data to the server includes:
under the condition that the sending time is longer than a first preset time, sending a second audio data segment to the server, wherein the second audio data segment is adjacent to the first audio data segment, and the input time of the second audio data segment in the electronic equipment is behind the input time of the first audio data segment; alternatively, the first and second electrodes may be,
and sending the first awakening segment to the server under the condition that the sending time consumption is less than or equal to a first preset time length.
That is, if the transmission time is longer than the first preset time, the segment in the audio data to be identified is still transmitted, and if the transmission time is shorter than or equal to the first preset time, it indicates that the first audio data segment in the audio data to be identified is transmitted faster in the transmission process, and then the segment in the wakeup audio data can be transmitted. As one example, the first preset duration may be 0.1 milliseconds.
That is, in this embodiment, by comparing the sending time with the first preset time, it is determined whether to send the second audio data segment in the audio data to be identified or to wake up the first wake-up segment in the audio data according to the comparison result, that is, considering the network status, the sending sequence of the segments in the audio data to be identified and the wake-up audio data can be adjusted, the network load is reduced, the amount of data sent instantly is reduced, the sending delay is reduced, and the data transmission performance is improved.
In one embodiment, after sending the second audio data segment in the audio data to be identified or the first wake-up segment in the wake-up audio data to the server based on the time consumed for sending, the method further includes: and updating the transmission time by using the transmission time length of the second audio data segment or the transmission time length of the first awakening segment.
That is, if the second audio data segment in the audio data to be identified is sent to the server, the sending time consumption is updated by using the sending time length of the second audio data segment, that is, the updated sending time consumption is the sending time length of the second audio data segment. If the first wake-up segment in the wake-up audio data is sent to the server, the sending time duration of the first wake-up segment is used for updating the sending time duration, namely the updated sending time duration is the sending time duration of the first wake-up segment. And subsequently, based on the updated transmission time, determining whether to transmit the segment in the audio data to be identified or wake up the segment in the audio data. Therefore, the sending time consumption can be updated in time, so that the accuracy of subsequently sending the segments in the audio data to be identified or awakening the segments in the awakening audio data is improved.
In one embodiment, after updating the transmission elapsed time by using the transmission duration of the second audio data segment or the transmission duration of the first wakeup segment, the method may further include: updating second audio data by using the segment with the earliest input time of the electronic equipment in the unsent data segments in the audio data to be identified or updating a first awakening segment by using the segment with the earliest input time of the electronic equipment in the unsent data segments in the awakening audio data; and returning to the step of sending the second audio data segment in the audio data to be identified or the first awakening segment in the awakening audio data to the server based on the sending time consumption until the audio data to be identified and the awakening audio data are sent completely.
For the case that the second audio data segment in the audio data to be identified is sent to the server based on the sending elapsed time, since the second audio data segment in the audio data to be identified is sent once before the sending elapsed time is updated by using the sending duration of the second audio data segment, that is, the unsent data segment in the audio data to be identified is updated compared with the unsent data segment in the audio data to be identified before the second audio data segment in the audio data to be identified is sent to the server, that is, the second audio data segment becomes the sent data segment, the next time if the segment in the audio data to be identified is sent, that is, the segment with the earliest input time in the sent data segment in the audio data to be identified is sent, in this way, the second audio data can be updated by using the segment with the earliest input time in the electronic device in the unsent data segments in the audio data to be identified, and sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on the sending time consumption. Similarly, for the case of sending the first wake-up segment in the wake-up audio data to the server based on the sending elapsed time, since the first wake-up segment in the wake-up audio data that has been sent once before the sending elapsed time is updated by using the sending duration of the first wake-up segment, that is, the unsent data segment in the wake-up audio data is updated compared with the unsent data segment in the wake-up audio data that has been sent before the first wake-up segment in the wake-up audio data is sent to the server, that is, the first wake-up segment becomes the sent data segment, and the next time the segment in the wake-up audio data that is sent with the earliest input time in the sent data segment is sent, the first wake-up segment can be updated by using the segment with the earliest input time in the unsent data segments in the wake-up audio data, and sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on the sending time consumption.
And through the above cyclic sending process, completing the sending of the audio data to be identified and the awakening audio data until the sending of the audio data to be identified and the awakening audio data is completed. The next round of transmission of audio data is similar to the transmission process described above. By the data transmission method, the number of the instantly transmitted data can be reduced, the transmission sequence of the awakening audio data and the audio data to be identified is integrated, the network load is reduced, the network delay is reduced, and the like.
In one embodiment, before sending the first audio data segment in the audio data to be identified to the server, the method further includes: receiving awakening audio data input by a user; and carrying out segmentation processing on the awakening audio data to obtain a plurality of awakening data segments, wherein the plurality of awakening data segments comprise a first awakening segment.
That is, after receiving the wakeup audio data, in order to avoid the problem that the transmission delay is long easily caused by instantly uploading the whole wakeup audio data, the wakeup audio data can be processed in a segmented manner to obtain a plurality of wakeup data segments, and then the wakeup data segments are sent in a segmented manner according to the segmented sending process of the embodiment of the application, so that the network load is reduced, and the transmission delay is reduced.
In one embodiment, after receiving the wake-up audio data input by the user, the method further comprises:
under the condition that the awakening audio data include the preset awakening words, converting the electronic equipment into an awakening state;
receiving audio data to be identified input by a user under the condition that the electronic equipment is in an awakening state;
and carrying out segmentation processing on the audio data to be identified to obtain a plurality of audio data fragments, wherein the plurality of audio data fragments comprise a first audio data fragment and a second audio data fragment.
That is, the electronic device receives input audio data to be recognized in an awake state, then performs segmentation processing on the audio data to be recognized to obtain a plurality of audio data fragments, and then performs segmentation transmission according to the segmentation transmission process of the embodiment of the application, so that the network load is reduced, and the transmission delay is reduced.
It should be noted that the awake state may be understood as an operating state, and if the electronic device is switched to the awake state, the electronic device is switched to the sleep state if the audio data to be recognized, which is input by the user, is not received within a certain effective time after the electronic device enters the awake state.
The following describes the procedure of the data transmission method in an embodiment, taking the audio as voice and the first predetermined time duration as 0.1ms as an example.
Firstly, the received multi-channel awakening voice data is subjected to noise reduction processing to obtain single-channel awakening voice data.
Then, inputting the wake-up voice data of the single channel into a wake-up engine of the electronic device, if the wake-up voice data meets a wake-up condition (for example, includes a preset wake-up word), waking up to obtain the voice wake-up data at this time, segmenting the voice wake-up data to obtain n wake-up data segments, where n is an integer greater than 1, the length of the voice wake-up data is n, and the n wake-up data segments may be sequentially arranged from morning to evening according to the input time of the input electronic device, and the format is as follows:
kw_1 kw_2 kw_3 kw_n-2 kw_n-1 kw_n
secondly, when the wake-up engine detects voice wake-up, the electronic device is woken up, and in a wake-up state, the electronic device can receive voice recognition data to be recognized, i.e. voice data to be recognized, which is input by a user, and can be segmented to obtain m voice data segments, where m is an integer greater than 1, the length of the voice data to be recognized is m, and the m voice data segments can be sequentially arranged according to the input time of the input electronic device from morning to evening, and the format is as follows:
asr_1 asr_2 asr_3 asr_m-2 asr_m-1 asr_m
first, a voice data fragment asr _1 in the voice data to be recognized is transmitted to the server, as shown in fig. 2. And determining whether to send the segment in the wakeup voice data next time or the segment in the voice data to be recognized according to the sent time-consuming duration t (i.e. the sending time-consuming), and if t is less than 0.1ms or t is 0.1ms, sending the wakeup data segment kw _1 in the wakeup voice data next time, as shown in fig. 3, and updating the time-consuming duration to the sending duration of kw _ 1. If t >0.1ms, the segment asr _2 in the voice data to be recognized is transmitted next time as shown in FIG. 4, and the elapsed time period is updated to the transmission time period of asr _ 2. Then, the next segment sending process is similar to the above process, for example, after the segment asr _2 is sent, the time-consuming duration is updated to the sending duration of asr _2, and then it is determined whether the time-consuming duration is greater than 0.1ms, if t is less than 0.1ms or t is 0.1ms, the segment kw _1 in the wakeup voice data is sent next time, and the time-consuming duration is updated to the sending duration of kw _ 1; if t >0.1ms, the segment asr _3 in the voice data to be recognized is transmitted next time, and the elapsed time period is updated to the transmission time period of asr _ 3. Or, for example, after the segment kw _1 is sent, the time-consuming duration is updated to the sending duration of kw _1, and then it is determined whether the time-consuming duration is greater than 0.1ms, if t is less than 0.1ms or t is 0.1ms, the segment kw _2 in the wakeup voice data is sent next time, and the time-consuming duration is updated to the sending duration of kw _ 1; if t >0.1ms, the segment asr _2 in the voice data to be recognized is transmitted next time, and the elapsed time period is updated to the transmission time period of asr _ 2. Through the above loop transmission process, all the data segments in the audio data to be recognized and the wake-up voice data are transmitted, namely asr _1, asr _2, asr _3, …, asr _ m-2, asr _ m-1 and asr _ m, and kw _1, kw _2, kw _3, …, kw _ n-2, kw _ n-1 and kw _ n are transmitted.
That is, as shown in fig. 5, an embodiment of the present application provides a data transmission method, which may include:
step 501: sending a voice data segment asr _ i (i is 1 initially, i is less than or equal to m) in the voice data to be recognized to the server;
step 502: judging whether the sending time t is more than 0.1 ms;
if the sending time t is longer than 0.1ms, execute step 503: sending an unsent voice data segment asr _ i +1 to the server;
after sending the audio data piece asr _ i +1, the following steps are performed:
step 504: updating the sending time t by using the sending time length of the voice data segment asr _ i + 1;
step 505: judging whether the voice data to be recognized comprises unsent data segments or not;
if the voice data to be recognized includes the unsent data segment, step 506 is executed: i is increased by 1. And returns to step 502.
If the audio data to be identified does not include the unsent data segment (i.e. i equals to m), step 507 is executed: judging whether the awakening voice data comprises unsent data segments or not;
if the wake-up voice data includes unsent data segments, step 508 is executed: and judging and sequentially sending unsent data in the awakening voice data to the server. The transmission flow then ends. And if the awakening voice data does not comprise the unsent data segments, the transmission process is ended.
If it is determined through step 502 that the transmission time t is less than or equal to 0.1ms, step 509 is executed: sending unsent wake-up data segments kw _ j (j is 1 initially, and j is less than or equal to n) to the server;
after sending the wakeup data segment kw _ j, the following steps are performed:
step 510: updating the sending time t by using the sending time length of the awakening data segment kw _ j;
step 511: judging whether the awakening voice data comprises unsent data segments or not;
if the wakeup voice data includes the unsent data segment, step 512 is executed: j is increased by 1. And returns to step 502.
If the wake-up voice data does not include the unsent data segment (i.e. j is equal to n), step 513 is executed: judging whether the voice data to be recognized comprises unsent data segments or not;
if it is determined in step 513 that the to-be-recognized voice data includes unsent data segments, step 514 is executed: and judging and sequentially sending unsent data in the voice data to be recognized to the server. The transmission flow then ends. And if the voice data to be recognized does not comprise the unsent data segments, the transmission process is ended.
In the output transmission process of the embodiment of the application, the sending sequence of the voice data to be recognized and the awakening voice data is adjusted according to the network condition, so that the instantaneous data sending quantity is reduced, and the network load is reduced.
Referring to fig. 6, fig. 6 is a schematic block diagram of an electronic device 600 according to an embodiment of the present invention, and as shown in fig. 1, the electronic device 600 includes:
a first sending module 601, configured to send a first audio data segment in the audio data to be identified to a server;
a second sending module 602, configured to send, to the server, a second audio data segment in the audio data to be identified or a first wake-up segment in the wake-up audio data based on sending consumed time, where the sending consumed time is a sending duration of the first audio data segment, and the wake-up audio data is used to wake up the electronic device.
In one embodiment, based on the time consumed for transmission, transmitting a second audio data segment in the audio data to be identified or a first wake segment in the wake-up audio data to the server includes:
under the condition that the sending time is longer than a first preset time, sending a second audio data segment to the server, wherein the second audio data segment is adjacent to the first audio data segment, and the input time of the second audio data segment in the electronic equipment is behind the input time of the first audio data segment; alternatively, the first and second electrodes may be,
and sending the first awakening segment to the server under the condition that the sending time consumption is less than or equal to a first preset time length.
In one embodiment, the electronic device further comprises:
and the first updating module is used for updating the transmission time by utilizing the transmission time length of the second audio data segment or the transmission time length of the first awakening segment.
In one embodiment, the electronic device further comprises:
the first receiving module is used for receiving awakening audio data input by a user;
the first segmentation module is used for performing segmentation processing on the awakening audio data to obtain a plurality of awakening data segments, wherein the plurality of awakening data segments comprise a first awakening segment.
In one embodiment, the electronic device further comprises:
the conversion module is used for converting the electronic equipment into an awakening state under the condition that the awakening audio data is detected to include a preset awakening word;
the second receiving module is used for receiving audio data to be identified, which is input by a user under the condition that the electronic equipment is in an awakening state;
and the second segmentation module is used for performing segmentation processing on the audio data to be identified to obtain a plurality of audio data segments, wherein the plurality of audio data segments comprise a first audio data segment and a second audio data segment.
The electronic device 400 can implement the processes implemented by the method in the foregoing method embodiments, and details are not repeated here to avoid repetition.
In an embodiment, an embodiment of the present invention further provides an electronic device, including a processor, a memory, and a computer program stored in the memory and capable of running on the processor, where the computer program, when executed by the processor, implements each process in the data transmission method embodiment, and can achieve the same technical effect, and details are not repeated here to avoid repetition.
The embodiment of the present invention further provides a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the computer program implements the processes of the data transmission method embodiment, and can achieve the same technical effects, and in order to avoid repetition, the computer program is not described herein again. The computer-readable storage medium may be a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk.
It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other like elements in a process, method, article, or apparatus that comprises the element.
Through the above description of the embodiments, those skilled in the art will clearly understand that the method of the above embodiments can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware, but in many cases, the former is a better implementation manner. Based on such understanding, the technical solutions of the present invention may be embodied in the form of a software product, which is stored in a storage medium (such as ROM/RAM, magnetic disk, optical disk) and includes instructions for enabling an electronic device (such as a mobile phone, a computer, a server, an air conditioner, or a network device) to execute the method according to the embodiments of the present invention.
While the present invention has been described with reference to the embodiments shown in the drawings, the present invention is not limited to the embodiments, which are illustrative and not restrictive, and it will be apparent to those skilled in the art that various changes and modifications can be made therein without departing from the spirit and scope of the invention as defined in the appended claims.

Claims (10)

1. A data transmission method, applied to an electronic device, the method comprising:
sending a first audio data segment in the audio data to be identified to a server;
sending a second audio data segment in the audio data to be identified or a first awakening segment in the awakening audio data to the server based on sending time consumption, wherein the sending time consumption is the sending duration of the first audio data segment, and the awakening audio data is used for awakening the electronic equipment;
wherein, the sending a second audio data segment in the audio data to be identified or a first wake-up segment in the wake-up audio data to the server based on the sending elapsed time includes:
when the sending time is longer than a first preset time, sending the second audio data segment to the server, wherein the second audio data segment is adjacent to the first audio data segment, and the input time of the second audio data segment in the electronic equipment is after the input time of the first audio data segment; alternatively, the first and second electrodes may be,
and sending the first awakening segment to the server under the condition that the sending consumed time is less than or equal to a first preset time length.
2. The method of claim 1, wherein after sending the second piece of audio data in the audio data to be identified or the first wake-up piece in the wake-up audio data to the server based on the sending elapsed time, further comprising:
and updating the transmission time by using the transmission time of the second audio data segment or the transmission time of the first awakening segment.
3. The method of any of claims 1-2, wherein prior to sending the first audio data segment of the audio data to be identified to the server, further comprising:
receiving the awakening audio data input by a user;
and carrying out segmentation processing on the awakening audio data to obtain a plurality of awakening data segments, wherein the plurality of awakening data segments comprise the first awakening segment.
4. The method of claim 3, wherein after receiving the wake-up audio data input by the user, further comprising:
under the condition that the awakening audio data comprises a preset awakening word, converting the electronic equipment into an awakening state;
receiving the audio data to be identified input by a user under the condition that the electronic equipment is in an awakening state;
and performing segmentation processing on the audio data to be identified to obtain a plurality of audio data fragments, wherein the plurality of audio data fragments comprise the first audio data fragment and the second audio data fragment.
5. An electronic device, characterized in that the electronic device comprises:
the first sending module is used for sending a first audio data segment in the audio data to be identified to the server;
a second sending module, configured to send a second audio data segment in the audio data to be identified or a first wake-up segment in the wake-up audio data to the server based on sending time consumption, where the sending time consumption is a sending duration of the first audio data segment, and the wake-up audio data is used to wake up the electronic device;
the sending, based on the sending elapsed time, a second audio data segment in the audio data to be identified or a first wake-up segment in the wake-up audio data to the server includes:
when the sending time is longer than a first preset time, sending the second audio data segment to the server, wherein the second audio data segment is adjacent to the first audio data segment, and the input time of the second audio data segment in the electronic equipment is after the input time of the first audio data segment; alternatively, the first and second electrodes may be,
and sending the first awakening segment to the server under the condition that the sending consumed time is less than or equal to a first preset time length.
6. The electronic device of claim 5, wherein the electronic device further comprises:
and the first updating module is used for updating the transmission time by utilizing the transmission time of the second audio data segment or the transmission time of the first awakening segment.
7. The electronic device of any of claims 5-6, wherein the electronic device further comprises:
the first receiving module is used for receiving the awakening audio data input by a user;
and the first segmentation module is used for performing segmentation processing on the awakening audio data to obtain a plurality of awakening data segments, wherein the plurality of awakening data segments comprise the first awakening segment.
8. The electronic device of claim 7, wherein the electronic device further comprises:
the conversion module is used for converting the electronic equipment into an awakening state under the condition that the awakening audio data comprises a preset awakening word;
the second receiving module is used for receiving the audio data to be identified, which is input by a user under the condition that the electronic equipment is in an awakening state;
and the second segmentation module is used for performing segmentation processing on the audio data to be identified to obtain a plurality of audio data segments, wherein the plurality of audio data segments comprise the first audio data segment and the second audio data segment.
9. An electronic device, comprising: memory, processor and computer program stored on the memory and executable on the processor, which when executed by the processor implements the steps in the data transmission method according to any one of claims 1 to 4.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the data transmission method according to any one of claims 1 to 4.
CN202110011895.9A 2021-01-06 2021-01-06 Data transmission method and electronic equipment Active CN112820296B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110011895.9A CN112820296B (en) 2021-01-06 2021-01-06 Data transmission method and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110011895.9A CN112820296B (en) 2021-01-06 2021-01-06 Data transmission method and electronic equipment

Publications (2)

Publication Number Publication Date
CN112820296A CN112820296A (en) 2021-05-18
CN112820296B true CN112820296B (en) 2022-05-20

Family

ID=75857699

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110011895.9A Active CN112820296B (en) 2021-01-06 2021-01-06 Data transmission method and electronic equipment

Country Status (1)

Country Link
CN (1) CN112820296B (en)

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109817220A (en) * 2017-11-17 2019-05-28 阿里巴巴集团控股有限公司 Audio recognition method, apparatus and system
CN110097876A (en) * 2018-01-30 2019-08-06 阿里巴巴集团控股有限公司 Voice wakes up processing method and is waken up equipment
CN109147779A (en) * 2018-08-14 2019-01-04 苏州思必驰信息科技有限公司 Voice data processing method and device
CN109378000B (en) * 2018-12-19 2022-06-07 科大讯飞股份有限公司 Voice wake-up method, device, system, equipment, server and storage medium
CN109727602A (en) * 2018-12-29 2019-05-07 苏州思必驰信息科技有限公司 A kind of method for recognizing sound-groove and device of mobile device terminal
CN110060685B (en) * 2019-04-15 2021-05-28 百度在线网络技术(北京)有限公司 Voice wake-up method and device
CN111464644B (en) * 2020-04-01 2023-04-28 北京声智科技有限公司 Data transmission method and electronic equipment
CN112382285B (en) * 2020-11-03 2023-08-15 北京百度网讯科技有限公司 Voice control method, voice control device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN112820296A (en) 2021-05-18

Similar Documents

Publication Publication Date Title
CN109378000B (en) Voice wake-up method, device, system, equipment, server and storage medium
CN111223497B (en) Nearby wake-up method and device for terminal, computing equipment and storage medium
CN107704275B (en) Intelligent device awakening method and device, server and intelligent device
CN109584876B (en) Voice data processing method and device and voice air conditioner
CN111161714B (en) Voice information processing method, electronic equipment and storage medium
CN110111789B (en) Voice interaction method and device, computing equipment and computer readable medium
CN111179944B (en) Voice awakening and age detection method and device and computer readable storage medium
CN112820296B (en) Data transmission method and electronic equipment
CN113327610A (en) Voice wake-up method, device and equipment
CN111899737A (en) Audio data processing method, device, server and storage medium
CN106954191A (en) Broadcast transmission method, device and terminal device
CN112306560B (en) Method and apparatus for waking up an electronic device
CN111464644B (en) Data transmission method and electronic equipment
CN111081251B (en) Voice wake-up method and device
CN109493863A (en) A kind of intelligence awakening method and device
CN111508481A (en) Training method and device of voice awakening model, electronic equipment and storage medium
CN110784885A (en) Power consumption adjusting method and device of communication network and computer readable storage medium
CN110956964B (en) Method, apparatus, storage medium and terminal for providing voice service
CN111566727A (en) Multi-stage response in full duplex voice conversations
CN113160818A (en) Method, device and equipment for using intelligent sound box and storage medium
CN112687293B (en) Intelligent agent training method and system based on machine learning and data mining
CN113038438B (en) Data transmission method and device of gateway, gateway and storage medium
CN109785836B (en) Interaction method and device
CN113870851A (en) Voice interaction method and device, electronic equipment and storage medium
CN115410565A (en) Prompt information output method, system, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant